Model comparison
Kwoat gives you access to 10+ AI models. Here's how to choose — or just use Auto mode.
Auto mode (recommended)
Auto mode routes your prompt to the best model automatically based on task type, complexity, and your plan. For most users this is the best choice.
Model tiers
Standard tier — fastest, most affordable
Best for: everyday chat, simple questions, drafting, quick tasks.
| Model | Provider | Context | Vision | Best for |
|---|---|---|---|---|
| Gemini 2.5 Flash | 1M tokens | ✓ | Fast general tasks, web search, large context | |
| GPT-4o Mini | OpenAI | 128K tokens | ✓ | General chat, quick answers |
| Mistral Small 4 | Mistral AI | 262K tokens | ✓ | Fast, vision-capable, European data |
| Llama 3.3 70B | Meta | 66K tokens | ✗ | Open source, coding, reasoning — no vision |
Advanced tier — more capable, moderate cost
Best for: complex analysis, long documents, nuanced writing, detailed code.
| Model | Provider | Context | Vision | Best for |
|---|---|---|---|---|
| GPT-5.4 | OpenAI | 1M tokens | ✓ | Complex reasoning, vision, coding |
| Gemini 3.1 Pro | 1M tokens | ✓ + audio | Very long documents, audio analysis, 1M context | |
| Grok 4.3 | xAI | 2M tokens | ✓ | Largest context window — 2M tokens for long research |
Premium tier — most powerful
Best for: the hardest problems, research, complex multi-step reasoning.
| Model | Provider | Context | Vision | Best for |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Anthropic | 1M tokens | ✓ | Writing, analysis, coding, nuanced tasks |
| Claude Opus 4.7 | Anthropic | 200K tokens | ✓ | Hardest reasoning problems, deep research · 10/day Pro, 50/day Pro Max |
Specialised models
| Model | Type | Cost | Notes |
|---|---|---|---|
| Nano Banana 2 | Image generation | $0.15/image | Higher quality, contextual understanding |
| FLUX.2 Klein | Image generation | $0.04/image | Fast, cost-efficient |
| Kling 2.6 Pro | Video generation | $1.50 (5s) · $2.50 (10s) | Cinematic video with audio |
Which model should I use?
For writing
Claude Sonnet 4.6 consistently produces the most natural, nuanced writing. For quick drafts, Gemini 2.5 Flash is fast and excellent value.
For coding
GPT-5.4 and Claude Sonnet 4.6 are both excellent.Llama 3.3 70B is a strong option for standard coding tasks (note: no vision support). For the hardest architecture or debugging problems, Claude Opus 4.7 is the strongest choice.
For research and analysis
Claude Opus 4.7 or GPT-5.4 for deep research. Enable web search to get current information from any model. For very long source documents, Grok 4.3 (2M context) handles the largest inputs.
For long documents
Grok 4.3 has the largest context window at 2M tokens.Gemini 2.5 Flash and Gemini 3.1 Pro also support 1M tokens each.
For image analysis
All models except Llama 3.3 70B support vision. Gemini 3.1 Pro additionally supports audio input. For generating images, use Nano Banana 2 (quality) or FLUX.2 Klein (speed).
Plan access
| Plan | Standard | Advanced | Premium | Image | Video | Missions |
|---|---|---|---|---|---|---|
| Free | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ |
| Pay As You Go | ✓ | ✓ | ✗ | ✓ wallet | ✓ wallet | ✗ |
| Pro | ✓ | ✓ | ✓ | ✓ 20/mo | ✓ wallet | ✗ |
| Pro Max | ✓ | ✓ | ✓ | ✓ 100/mo | ✓ 5/mo incl. | ✓ 10/mo |