Learn how to use ModelCouncil to make better decisions with AI
Getting Started
ModelCouncil is a multi-model AI decision tool that helps executives and founders make better decisions by querying Claude, GPT, Gemini, and Grok simultaneously against your project documents.
Quick Start
1Create a project — Give it a name that describes your decision context
2Upload documents — Add PDFs, Word docs, or text files with relevant information
3Ask a question — Query the AI models about your documents
4Review the Decision Board — See the synthesized recommendation, risks, and insights
Projects & Documents
Projects are decision contexts — containers for documents and queries related to a specific decision or topic. Each project has its own set of documents that provide context to the AI models.
Supported File Types
PDF Documents
Reports, contracts, research papers
Word Documents
.docx files with formatted content
Text & Markdown
.txt and .md files
Excel Spreadsheets
.xlsx and .xls files
Limits
10 MB maximum per file
20 files maximum per project
200,000 tokens maximum context per project
Making Queries
When to use the Query tab: Quick questions, one-off analysis, exploring ideas without tracking history.
The Query tab lets you ask questions against your uploaded documents. Multiple AI models analyze your question and documents, then their responses are synthesized into a Decision Board.
Model Selection
Choose 2-3 models for each query. Different models have different strengths:
Claude (Anthropic)
Strong reasoning, careful analysis
GPT (OpenAI)
Versatile, great at synthesis
Gemini (Google)
Good with data, structured output
Grok (xAI)
Direct, contrarian perspectives
Decision Helper
When you ask a decision-type question (e.g., "Should we...", "Which option..."), the Decision Helper appears to help you provide structured context:
Options being considered
Constraints (budget, timeline, resources)
Success criteria
Decision timeframe
Follow-up Queries
After receiving a Decision Board, you can ask follow-up questions to dig deeper. Follow-ups include context from the previous query and recommendation.
Tip: Queries made in the Query tab aren't automatically connected to threads, but you can organize them later. See the Decision Threads section for details.
Web search (opt-in)
Each model answers from its training knowledge plus the documents and context you've added to the project. You can also opt into a single Google-grounded web search before the council runs — the search results are added as a shared evidence block that every model reasons over, so all members of the council are working from the same sources.
How to turn it on
Check “Allow models to search the web” in the query input before you send a question. We run one Google search before the council reads your question, and every model — whether you're on a standard query or in Diamond mode — reasons over the same evidence. Adds roughly $0.003 to the query cost.
What the models still can't do
One limitation to plan around:
URLs in your question aren't fetched
If you paste a link like https://example.com/article in your question, the models see the URL as text only. They will not load the page. Worse: a model may pattern-match the domain and make up what it thinks is at that URL based on training data. Treat any URL-summary that wasn't backed by an uploaded document or the search-evidence block as unverified.
What to do instead
For an article or page: copy the relevant text and either paste it directly into your question, or save it as a .txt /.md /.pdf and upload it to the project. The council will then actually reason over the content.
For current data that changes: drop a fresh export (CSV, spreadsheet, screenshot) into the project before each query. Stale uploads stay stale until you replace them.
For time-sensitive decisions: state today's date explicitly in your question and tell the models what they may not know — e.g. "As of May 2026, our main competitor just launched X. Given that…". The models will then frame their answer around the facts you've given them rather than guessing.
When you do paste a URL: add a short note like "I have not given you the page contents. Please don't summarize the URL — just acknowledge the source if relevant." This keeps a model from inventing details.
Understanding the Decision Board
The Decision Board synthesizes all model responses into a structured output that helps you make informed decisions.
Recommendation
The suggested direction with key supporting reasons
Risks
What could go wrong — the top 3 risks to consider
Validate Before Committing
Things to verify before making a final decision
Consensus
Where all models agree — high-confidence points
Disagreements
Where models differ — areas needing more thought
Rare Finds (Gems)
Unique insights from individual models worth saving
Model Response Tabs
Click on model tabs to see the full reasoning from each AI. This helps you understand where different perspectives come from.
Exporting Results
Export your Decision Board to PDF, Excel, Word, or PowerPoint for sharing with stakeholders or archiving.
Decision Threads
When to use the Threads tab: Multi-step decisions, tracking decisions over time, organized workflows where context matters.
Decision Threads group related queries into organized workflows. Unlike the Query tab, threads use intelligent context that stays manageable no matter how many queries you make.
Query Tab vs Threads Tab
Query Tab
Threads Tab
Quick, standalone questions
Organized decision workflows
Full document context every time
Intelligent context (summary + relevant)
Good for exploration
Good for tracking multi-step decisions
Context grows with each follow-up
Context stays stable (compressed history)
How Intelligent Context Works
In a thread, the system automatically manages context to keep it efficient:
Document Summary — An always-on overview of your project documents
Relevant Chunks — Semantic search finds document sections relevant to your current question
Thread History — Compressed summaries of previous decisions in this thread
Cross-Thread Context — Related decisions from other threads in the same project
"Query 50 = Query 1"
Context size stays stable regardless of how many queries you make in a thread. This means consistent performance and costs.
Organizing Queries into Threads
Queries made outside the Threads tab (in the Query tab) aren't automatically connected to threads. You can organize these standalone queries into threads at any time.
1Click Organize in the query history sidebar
2Select the queries you want to group (checkboxes appear on unorganized queries)
3Click Add to Thread and choose an existing thread or create a new one
Note: When you organize queries into a thread, the system automatically generates compressed summaries and embeddings for each query. This enables intelligent context and cross-thread search for future queries.
Branching
From any decision in a thread, you can branch to explore an alternative direction. The new thread knows where it branched from, preserving your decision tree.
Diamond Mode
Premium feature: Models review each other's responses before the final synthesis, resulting in deeper analysis.
How It Works
Round 1Initial Responses — Each model answers your question independently
Round 2Cross-Review — Each model reviews the other models' responses, identifying points to adopt, reject, or add
FinalDiamond Synthesis — A final synthesis that includes what changed, confidence levels, and unresolved disagreements
When to Use Diamond Mode
High-stakes decisions where you want maximum confidence
Complex questions where models might miss each other's insights
When you want to see how models critique each other
Cost: Diamond mode costs approximately 3x a standard query (7 API calls instead of 4).
If a Diamond query gets interrupted
Diamond queries take longer than standard ones — typically 30 seconds to a few minutes — so it's normal to occasionally lose your connection partway through (closed tab, dropped Wi-Fi, etc.). Two ways to recover:
Resume button. Open your query history (Query tab → History). Any Diamond query that's mid-flight shows a blue Resume badge — click it to pick up from where it stopped.
Automatic recovery. Within 12 hours, our background system finishes any abandoned Diamond queries and emails you a link to the Decision Board. No action needed on your end.
See Release Notes for the full set of reliability improvements shipped May 2026.
Integrations (MCP)
ModelCouncil is available as a connector in any MCP-compatible client — Claude Desktop, Claude.ai (web), Cursor, ChatGPT, and others. Setup is self-serve: paste a URL into your client's connector settings, sign in once, and the council shows up as a tool you can call without leaving the chat or editor you're already in.
Same projects, same credits. Queries through MCP charge your normal balance — no separate subscription, no surcharge. Every MCP call has a $10 per-query hard cap so a runaway prompt can't drain your account.
When MCP makes sense
Second opinion mid-conversation. You're working through a hard call in Claude.ai. Get GPT's and Gemini's take on what Claude just told you — in the same thread.
Council-as-a-tool while you code. From Cursor, ask the council about an architecture choice or risky migration without switching context.
Cross-check what one model said. Surface where the models disagree before you commit to a path.
Available tools
Your AI client (Claude.ai, ChatGPT, Cursor, etc.) calls these tools on your behalf. You don't invoke them by name — you ask in plain English and the client picks the right tool.
list_projects — enumerates your projects so the AI can pick the right one. Always called first.
create_project — creates a new project when no existing one fits the topic (find-or-create flow). New projects start empty; for document-grounded answers, upload files in the web app or pass context inline via additional_context.
query_council — asks the council (synchronous, single response, ~30-60s)
query_diamond_mode — cross-model deliberation (async — kicks off in the background, status surfaced in the dashboard or emailed when complete)
get_query_status — poll a Diamond query for completion, or re-fetch the full Decision Board for any past query
How to ask the council from a chat
Your AI client decides when to call ModelCouncil based on what you ask. The trigger phrases below work reliably across Claude.ai, ChatGPT, and Cursor. None require you to memorize tool names — but mentioning "ModelCouncil" or "the council" explicitly is the most reliable nudge.
Basic — second opinion
“Use ModelCouncil to get a second opinion on what you just told me.”
The AI passes the current chat as additional_context, picks (or creates) a fitting project, and runs query_council.
Targeted — specific question
“Ask ModelCouncil whether I should ship the new pricing this quarter or wait until Q4. Include our last 3 calls of context.”
Explicit framing — you control what gets passed as context and what question the council answers.
Deep deliberation — Diamond mode
“Use ModelCouncil's Diamond mode to deeply deliberate on our go-to-market for the enterprise tier.”
Triggers query_diamond_mode — models cross-review each others' responses before synthesis. Async (~3-5 min). You'll get a “Watch progress” deep link immediately + an email when it completes.
Skip a model
“Ask ModelCouncil but skip Claude — I want GPT and Gemini's view since I'm already in Claude.”
Overrides the per-connector default for this call. If you find yourself doing this every time, set it as a connector default in Settings → API Keys & MCP → Connected Apps.
Reuse an existing project
“Use ModelCouncil and run this against my Q3 GTM strategy project.”
Names guide the find-or-create logic. The AI calls list_projects, sees the match, and runs the query against that project's documents and threading history.
Brand-new project
“I'm thinking through a Series A. Start a new ModelCouncil project for it and have the council weigh in on the timing.”
Triggers create_project + immediate query_council. New projects start empty — upload supporting documents at the “Open in ModelCouncil” link in the response.
Power-user nudges
Say “ModelCouncil” or “the council” explicitly. Removes ambiguity — without it, Claude/ChatGPT may just answer in their own voice instead of dispatching the tool call.
If Claude builds an artifact instead of calling the tool (we've seen this with stale connector state), say: “Use the ModelCouncil MCP tools directly — call list_projects then query_council. Do not build an artifact.”
For long-running Diamond queries, click the “Watch progress” link from the tool response — your dashboard renders the R1 / R2 / synthesis phases live, and you'll get an email when it finishes even if you close the tab.
Per-connector defaults
If you're asking the council from inside Claude.ai, getting another Claude answer back is redundant. Open Settings → API Keys & MCP → Connected Apps and edit preferences for each connector — for instance, set Claude.ai to skip the Anthropic model by default. The preference applies to every query from that connector unless the AI explicitly overrides it.
What MCP is NOT (yet)
Not an automatic import of your Claude.ai thread. If you want a specific chunk of your conversation included as context, paste it into the additional_context parameter when you invoke query_council.
Not a live workspace sync. Documents you upload to a ModelCouncil project are available; documents you have open in your editor are not (unless your editor exposes them as MCP tools itself).
Not available in the Anthropic or OpenAI app directories yet — verification is in progress. Until then, setup is self-serve (paste a URL).
Cost expectations through MCP
Same per-query economics as the web dashboard. Through MCP you have a $10 hard cap per call (no interactive UI to confirm higher-cost queries):
Standard queries: typically $0.04–$0.20 each depending on tier and context size
Diamond Mode: typically $0.20–$1.00 per query, up to ~$2 on very large 200K-token projects (~3× the gateway calls of a standard query)
Token usage and cost are reported in the tool response — the AI client surfaces it in the chat reply
Tip: getting the full response back
Claude.ai (and ChatGPT) often summarize the council's response in their own voice rather than quoting the full Decision Board verbatim. This is the AI client's choice — the MCP tool itself returns the complete output (recommendation, key reasons, risks, validate-before-committing, consensus, disagreements, rare finds, plus what each model said). Four ways to see everything:
Click the "View on ModelCouncil →" link. Every council response includes a deep-link at the top that opens this specific Decision Board in your dashboard. Even when Claude summarizes the rest, the link survives — one click and you're looking at the full untruncated board.
Ask explicitly. Type something like "Show me the full Decision Board verbatim" or "Quote the entire tool response — don't summarize." Claude will paste the raw output.
Open your dashboard. Every MCP query mirrors into ModelCouncil with the full content untouched. Visit your projects and the latest query is at the top — this is the canonical record.
Use the query_id. The tool response includes a query_id — ask Claude to call get_query_status with that id to re-fetch the full board.
For step-by-step client setup (config snippets for Claude Desktop, Claude.ai web, Cursor, ChatGPT, and Claude Code CLI) see the dedicated MCP Setup Guide →
Billing & Credits
ModelCouncil uses a credit-based system so you only pay for what you use. Credits cover the cost of AI API calls with a small markup.
Purchasing Credits
$25
~50-100 queries
$50
~100-200 queries
$100
~200-400 queries
5% bonus
Credit purchases require an active trial or Pro subscription. If your subscription has lapsed, reactivate the Pro plan first — the Add Funds card on the Usage page will direct you there.
Platform Subscription
$99/month platform fee provides access to all features. Credits for API usage are purchased separately.
Credit Expiration & Grace Period
Credits roll over month to month while you have active platform access (during your trial or while subscribed). Your balance has two pots: purchased credits (what you bought) and bonus credits (the $10/month retention grant, up to $40 saved). Bonus credits drain first on every query, then purchased credits. If your subscription lapses — cancellation, payment failure, or trial expiry without conversion — both pots enter the same 60-day grace period.
Reactivate within 60 days and your full balance is restored — credits are never deducted during the grace period, just paused.
After 60 days the balance is forfeited and cannot be recovered. You can always start fresh by resubscribing — every paid month adds $10 in bonus credits to your balance, up to $40 saved.
We'll email you reminders at 14 days, 3 days, and the day of forfeiture, plus a banner on your dashboard showing days remaining whenever the clock is running.
Grandfathered customers: users who held credits before our 2026-05-04 policy change keep their original credits-never-expire terms. If this applies to you, no action is needed — your balance is exempt from expiration regardless of subscription status, and any future purchases inherit the same protection.
Viewing Usage
Visit Usage & Billing in the sidebar to see your credit balance, usage history, and purchase more credits. The Transactions tab lists every credit event including purchases, query consumption, bonuses, refunds, and expirations.
Release Notes
Recent improvements to ModelCouncil. Newest first.
Web search, end to end
May 24, 2026
You can now ask the council questions that need current information without leaving the app. Check “Allow models to search the web” in the query input and we run one Google search before any model reads your question. Every member of the council — Claude, GPT, Gemini, Grok — reasons over the same set of sources, so disagreements come from how they interpret the evidence, not from who happened to find which page.
Works on standard queries and Diamond mode from the web app, plus through the MCP integration in Claude.ai, ChatGPT, Cursor, and Claude Code.
Adds about $0.003 per query, folded into the same cost cap and credit balance as the rest of the query.
Failed searches don't bill — if Google grounding returns nothing, the question still runs and you're not charged for the search.
Pasted URLs in your question still aren't fetched — for those, check the limitation note in Web search (opt-in) above.
Diamond mode reliability
May 12, 2026
Newer AI models like GPT-5.5 and Claude Opus 4.8 produce higher-quality responses but can take longer to generate them — sometimes up to a minute or more per call. Diamond mode makes seven model calls per query (three rounds of cross-model deliberation plus a synthesis step), and the original implementation ran all of them in a single web request. As model response times grew, some Diamond queries — especially complex ones using the Pro tier — were running out of time partway through and returning incomplete results.
We rebuilt how Diamond works:
Each phase runs independently with its own time budget. A slow first-round response no longer threatens the rest of the deliberation.
Per-call timeouts are 4× longer for the Pro tier (240 seconds, up from 60), giving slow Pro models the headroom they need.
Abandoned queries auto-recover. If you close your tab partway through a Diamond query, our system finishes it in the background and emails you a link to the result, usually within 12 hours.
Resume button in your query history if a query ever gets stuck — pick it up from where it left off.
Clearer progress indicator showing exactly where you are in the deliberation pipeline (Round 1 → Round 2 → Synthesis) without the counter resetting between rounds.
Diamond queries should now complete reliably even with the slowest Pro-tier models, and you won't lose work to network glitches or browser closes.