Models

The Perfect10 Pro - Enterprise Platform supports a comprehensive range of AI models from leading providers such as OpenAI, Google DeepMind, and Anthropic.
These models power diverse functionalities, from natural language understanding and reasoning to multimodal processing and external tool invocation.

Model Categories

1. Generative Large Language Models (LLMs)

Models designed for natural conversation, analytical reasoning, content generation, and code development.
These models serve as the foundation for most AI agents within the platform.

2. Retrieval-Augmented Generation (RAG)

Combines large language model capabilities with external data retrieval to provide responses grounded in enterprise-specific or domain-specific knowledge.

3. Function Calling / Tool Invocation

Models capable of executing structured tasks, such as retrieving data through APIs, triggering workflows, or integrating with external systems.

4. Multimodal Models

Models that process multiple data types—including text, images, and structured documents—to support tasks such as visual analysis, report extraction, and contextual understanding.

OpenAI Models

Recognized for advanced reasoning, precision, and versatility across creative and technical applications.

GPT-4.1

Provider: OpenAI
Description: Flagship model for advanced instruction following, complex reasoning, and large-context comprehension.
Ideal For: Analytical workflows, technical documentation, and complex chat orchestration.
Strengths: High precision, extensive context length.

GPT-4.1 Mini

Description: A balanced model delivering performance comparable to GPT-4o at lower latency and cost.
Ideal For: Cost-efficient agents requiring accurate and responsive output.

GPT-4.1 Nano

Description: Fastest and most lightweight model in the GPT-4.1 family.
Ideal For: High-frequency, low-latency applications.

GPT-4o ("Omni")

Description: Multimodal model capable of processing both text and images, producing fast and context-rich responses.
Ideal For: Conversational and visual analysis tasks.
Strengths: Low latency, basic visual understanding.

GPT-4o Mini

Description: Compact multimodal model offering over 60% cost reduction compared to GPT-3.5 Turbo.
Ideal For: Lightweight, interactive agents.

GPT-5 Mini

Description: Streamlined variant of GPT-5, optimized for lightweight reasoning and balanced accuracy.
Ideal For: Task automation and real-time interaction.

GPT-5 Nano

Description: Smallest and fastest variant, optimized for ultra-low latency environments such as developer tools.
Ideal For: Continuous, high-throughput agent operations.

Google DeepMind Models (Gemini)

Offers scalable, multimodal performance for analytical, scientific, and interactive applications.

Gemini 2.5 Pro

Provider: Google DeepMind
Description: Flagship reasoning and analysis model, supporting complex problem solving across domains including science and mathematics.
Features: Advanced multimodal comprehension (text, image, and video).
Ideal For: Research, data science, and analytical automation.

Gemini 2.5 Flash

Description: High-performance model optimized for rapid reasoning, code generation, and scientific computation.
Ideal For: Low-latency technical workloads.

Gemini 2.5 Flash Lite

Description: Lightweight version focused on cost efficiency and fast response times.
Ideal For: High-volume applications and scalable deployment.

Gemini 2.0 Flash

Description: Offers significantly reduced response latency while maintaining accuracy comparable to larger models.
Ideal For: Interactive and dynamic chat experiences.

Gemini 2.0 Flash Lite

Description: Economic variant designed for minimal latency and token efficiency.
Ideal For: Budget-conscious use cases.

Gemini 1.5 Pro

Description: Multimodal foundation model supporting text, image, and structured data inputs.
Ideal For: Visual and data-intensive analytical workflows.

Gemini 1.5 Flash

Description: Reliable multimodal model for visual understanding, summarization, and creative generation.
Ideal For: Visual inspection and document interpretation.

Gemini 1.5 Flash 8B

Description: Optimized for small prompt workloads such as chat, transcription, and translation.
Ideal For: Rapid, conversational use cases.

Anthropic Models (Claude)

Focused on ethical reasoning, safety, and contextually aware responses.

Claude 3 Haiku

Provider: Anthropic
Description: Compact model optimized for responsiveness and general-purpose reasoning.
Ideal For: Real-time interactions and customer-facing applications.

Claude 3.5 Haiku

Description: Enhanced version offering improved accuracy, reasoning capability, and function calling support.
Ideal For: Safe, conversational, and compliant enterprise environments.
Strengths: Fast, natural, and contextually appropriate dialogue.

Model Comparison

Model	Provider	Primary Use Case	Strengths	Limitations
GPT-4.1	OpenAI	Reasoning, development	High accuracy, long context	High cost, slower speed
GPT-4o	OpenAI	Multimodal conversation	Fast, image understanding	Limited tool access
GPT-4o Mini	OpenAI	Lightweight multimodal tasks	Affordable, responsive	Reduced capability
GPT-5 Nano	OpenAI	Ultra-low latency	Fastest response times	Limited reasoning depth
Gemini 2.5 Pro	Google	Analytical reasoning	Multimodal, efficient	Restricted access
Gemini 2.5 Flash	Google	Fast reasoning workloads	Low latency, high throughput	Shorter context window
Claude 3.5 Haiku	Anthropic	Safe conversation handling	Ethical, natural dialogue	Limited extensibility

Model Selection Guidelines

Use GPT-4.1 for advanced reasoning, complex workflows, or knowledge-intensive agents.
Use GPT-4o or Gemini Flash for fast, low-latency conversational tasks.
Use Claude 3.5 Haiku for safe, compliant, and human-like dialogue.
Use Gemini 2.5 Pro for data-driven analysis and large-scale scientific reasoning.
Use GPT-5 Nano for rapid response automation or developer tooling.

Technical Notes

Each agent within Perfect10 can be configured to use a specific model independently.
Model selection can be defined manually or programmatically (via JSON configuration).
Multi-model workflows are supported, enabling chained processes such as retrieval → generation → execution.
Multimodal input capabilities vary based on provider and version availability.

Model Categories​

1. Generative Large Language Models (LLMs)​

2. Retrieval-Augmented Generation (RAG)​

3. Function Calling / Tool Invocation​

4. Multimodal Models​

OpenAI Models​

GPT-4.1​

GPT-4.1 Mini​

GPT-4.1 Nano​

GPT-4o ("Omni")​

GPT-4o Mini​

GPT-5 Mini​

GPT-5 Nano​

Google DeepMind Models (Gemini)​

Gemini 2.5 Pro​

Gemini 2.5 Flash​

Gemini 2.5 Flash Lite​

Gemini 2.0 Flash​

Gemini 2.0 Flash Lite​

Gemini 1.5 Pro​

Gemini 1.5 Flash​

Gemini 1.5 Flash 8B​

Anthropic Models (Claude)​

Claude 3 Haiku​

Claude 3.5 Haiku​

Model Comparison​

Model Selection Guidelines​

Technical Notes​