What is the one feature only Consilium has?

Multi-AI debate. Consilium is the only CLI in this comparison that runs the same prompt through multiple frontier models simultaneously, has them cross-examine each other across rounds, and produces a synthesized consensus answer with voting math. Every other tool routes to a single model at a time.

Does Consilium replace Claude Code?

It overlaps: hooks, sub-agents, sandbox, plan mode, and the slash command surface mirror Claude Code closely and are largely portable. The difference is that Consilium can call Claude, GPT, Gemini, Grok, Llama, and DeepSeek from the same session and have them debate, while Claude Code is Anthropic-only.

How does pricing compare?

Consilium is BYOK with zero markup - you pay each provider directly at their list rate. Cursor CLI bundles its own subscription. Claude Code uses your Anthropic API key (also BYOK, but single-vendor). Gemini CLI is BYOK Google. Grok Build is BYOK xAI. Consilium also offers a Groq-backed free tier when no key is configured for the requested provider.

Which CLI has the most lifecycle hooks?

Claude Code and Consilium tie at seven lifecycle events each (SessionStart, SessionEnd, UserPromptSubmit, PreToolUse, PostToolUse, PermissionRequest, Stop). Cursor CLI ships four. Gemini CLI and Grok Build do not expose a lifecycle hook system at the time of writing.

Do all of these CLIs support MCP?

Yes - Claude Code, Cursor CLI, Gemini CLI, Grok Build, and Consilium all support the Model Context Protocol. Consilium adds an in-CLI marketplace (`/mcp search`, `/mcp install`) backed by the same registry Anthropic publishes.

Is this comparison kept up to date?

Yes - the matrix is regenerated against the published changelogs and docs of each tool. Feature gaps move quickly in this category; check the source repos linked at the bottom for the latest state.

Back to CLI

CLI Comparison

Multi-AI debate is the one capability only Consilium ships. The CLI runs the same prompt through multiple frontier models, has them cross-examine each other across rounds, and produces a synthesized consensus answer with formal voting math. Every other CLI in this matrix routes to a single model at a time. The table below scores roughly 40 capabilities against Claude Code, Gemini CLI, Grok Build, and Cursor CLI.

The Consilium moat (one line)

Every other CLI in this list is essentially one model with a workbench. Consilium is a workbench that runs many models against each other and converges on an answer. If your use case is high-stakes (architecture, security, compliance) or contested (correctness-critical, multi-perspective), the debate primitive is the difference. If your use case is fast single-shot completion, any of these tools is fine and Consilium will still let you run in single-model mode.

Feature-by-feature parity matrix

Feature	Consilium	Claude Code	Gemini CLI	Grok Build	Cursor CLI
Multi-AI debate (council/jury/red-team)	Yes	No	No	No	No
Cross-model arbitration / consensus voting	Yes (Condorcet + Borda)	No	No	No	No
Models per session	1-5 (any vendor)	1 (Anthropic)	1 (Google)	1 (xAI)	1 (configurable)
BYOK at zero markup	Yes	Yes (Anthropic only)	Yes (Google only)	Yes (xAI only)	Subscription
Free tier fallback (no key required)	Yes (Groq)	No	Limited	No	Subscription
Plan mode (read-only planning, then exec)	Yes	Yes	Partial	Partial	Yes
Lifecycle hooks	7 events	7 events	No	No	4 events
HTTP webhook hooks (not just shell)	Yes	No (shell only)	No	No	No
User-defined sub-agents (YAML frontmatter)	Yes	Yes	No	No	Yes
Per-sub-agent tool allowlist	Yes	Yes	No	No	Partial
OS-level sandbox (macOS Seatbelt)	Yes	Yes	No	No	Yes
OS-level sandbox (Linux bwrap)	Yes	Yes	No	No	Partial
Windows sandbox fallback (worktree+ACL)	Yes	Partial	No	No	Partial
Workspace trust prompt (always/session)	Yes	Yes	No	No	Yes
Git worktree isolation per session	Yes	No	No	No	Yes
MCP client	Yes	Yes	Yes	Yes	Yes
MCP marketplace (search/install in CLI)	Yes	Partial	No	No	Partial
Slash commands in chat	50+	30+	10+	10+	20+
Voice dictation (Whisper)	Yes	No	No	No	Partial
Image generation (DALL-E / Imagen)	Yes	No	Yes (Imagen)	No	No
Image input (vision)	Yes	Yes	Yes	Yes	Yes
Web search grounding	Yes	Yes	Yes	Yes (live)	Yes
Headless JSON output	Yes (--output json)	Yes	Yes	Partial	Yes
Headless stream-JSON output	Yes	Yes	Partial	No	Partial
Long-lived CI tokens	Yes	Yes	Yes	Yes	Yes
Session persistence (~/.<cli>/sessions/*.json)	Yes	Yes	Yes	Partial	Yes
Cross-session search	Yes (/search)	Partial	No	No	Partial
Autonomy: /loop recurring task	Yes	No	No	No	No
Autonomy: /goal long-running objective	Yes	No	No	No	No
Autonomy: /schedule cron-style	Yes	No	No	No	No
Output: cursorrules / claude-md preset	Yes	Partial	No	No	Yes (.cursorrules)
Red-team adversarial mode (built in)	Yes	No	No	No	No
Benchmark runner (MMLU/HumanEval/TruthfulQA)	Yes	No	No	No	No
Probabilistic / market consensus mode	Yes	No	No	No	No
Mandatory dissent (jury mode)	Yes	No	No	No	No
Cost estimator before run (/estimate)	Yes	No	No	No	Partial
Per-model usage dashboard (consilium stats)	Yes	Partial	Partial	No	Yes (dashboard)
VS Code companion extension	Yes	Yes	Partial	No	Yes (native)
TypeScript SDK	Yes (@myconsilium/sdk)	Yes	Yes	Yes	No
Python SDK	Yes (consilium on PyPI)	Yes	Yes	Yes	No

When to pick each tool

Pick Consilium when:you want multiple models to debate a high-stakes decision; you care about formal consensus (Condorcet/Borda) rather than a single model's opinion; you want jury, red-team, or market modes that no single-model CLI ships; or you want one tool that subsumes Claude, GPT, Gemini, Grok, Llama, and DeepSeek without vendor lock-in.

Pick Claude Code when: you are all-in on Anthropic and want the most Anthropic-native surface (deepest hook coverage, model-tuned prompt cache, official Anthropic support).

Pick Gemini CLI when: you live inside Google Cloud, you want Imagen image generation in the loop, or you need first-party Vertex integration.

Pick Grok Build when: you want xAI's live X/Twitter web grounding and the fastest Grok-tuned coding loop.

Pick Cursor CLI when: you already use Cursor's editor, want the IDE and CLI to share state, and prefer a subscription pricing model.

Source links

Consilium CLI source: github.com/skadri1601/consilium-cli

Comparison data is regenerated against the published changelogs of each tool; gaps move quickly in this category. See related pages for deep dives: Hooks, Sub-Agents, Sandbox.