Back to Docs

Vertical Templates

6 pre-configured deliberation templates optimized for specific domains. Each template defines the mode, rubric weights, system prompts, and evaluation criteria.

Overview

TemplateModeModelsRounds
Code Review
Red Team
32
Research Synthesis
Council
33
Risk Assessment
Jury
53
Healthcare Diagnostics
Council
33
Legal Review (Dialectical)
Blind
23
Finance Risk Assessment
Jury
33
Code Review
Red Team

Three models independently review code, then adversarially attack each other's findings. Attackers probe for security vulnerabilities, logical flaws, edge cases, and robustness issues. Defenders respond to each attack. A judge evaluates the validity of attacks and strength of defenses.

Default Models

3

Max Rounds

2

Mode

Red Team

Evaluation Rubric

DimensionWeightEvaluates
Security30%Vulnerabilities, injection risks, auth flaws, data exposure
Correctness25%Logic errors, off-by-one, null handling, race conditions
Performance20%Time complexity, memory usage, unnecessary allocations
Maintainability15%Code clarity, naming, structure, testability
Style10%Consistency, formatting, idiomatic patterns

Attack Categories

SECURITY_VULN
LOGICAL_FLAW
EDGE_CASE
ROBUSTNESS_TEST

Example Prompt

Review this authentication middleware for security vulnerabilities and suggest improvements

Output

Vulnerability report with severity ratings, defender's rebuttals, judge's final assessment, prioritized action items

Research Synthesis
Council

Models explore different perspectives on complex research topics, challenge each other's sources and interpretations, and converge on well-supported conclusions. All claims must include citations. Uncertainties are explicitly flagged rather than glossed over.

Default Models

3

Max Rounds

3

Mode

Council

Evaluation Rubric

DimensionWeightEvaluates
Accuracy30%Factual correctness of claims and interpretations
Evidence Quality25%Strength and relevance of cited sources
Completeness20%Coverage of relevant perspectives and findings
Bias Awareness15%Recognition of limitations and potential biases
Citation Quality10%Proper attribution and source reliability

Example Prompt

Synthesize current research on transformer architecture efficiency improvements published in 2024-2025

Output

Comprehensive synthesis with inline citations, flagged uncertainties, confidence scores, areas of consensus and disagreement

Risk Assessment
Jury

Five models evaluate risks with mandatory dissent reporting. Every assessment must explicitly identify both majority and minority positions. Models rate likelihood and impact for each risk, propose concrete mitigations, and map to compliance frameworks.

Default Models

5

Max Rounds

3

Mode

Jury

Evaluation Rubric

DimensionWeightEvaluates
Risk Identification25%Completeness of risk catalog, no blind spots
Likelihood Assessment20%Accuracy of probability estimates
Impact Analysis20%Severity scoring and cascading effects
Mitigation Quality20%Feasibility and effectiveness of proposed controls
Compliance15%Regulatory alignment and framework mapping

Example Prompt

Assess risks of migrating our production database from PostgreSQL to a multi-region CockroachDB setup

Output

Risk matrix with likelihood/impact ratings, mitigation strategies per risk, mandatory minority opinions, compliance mapping

Healthcare Diagnostics
Council

Safety-critical deliberation with mandatory citations and dissent. Models provide differential diagnoses with evidence strength ratings. Red flags are automatically highlighted. Every claim must be backed by medical literature or clinical guidelines.

Default Models

3

Max Rounds

3

Mode

Council

Evaluation Rubric

DimensionWeightEvaluates
Evidence Quality30%Strength of clinical evidence, guideline adherence
Diagnostic Accuracy25%Correctness of differential diagnosis
Safety Considerations20%Red flag identification, contraindication awareness
Completeness15%Coverage of differential, no missed diagnoses
Actionability10%Clear next steps, testable hypotheses

Example Prompt

Evaluate differential diagnosis for patient presenting with acute chest pain, elevated troponin, and normal ECG

Output

Ranked differential diagnosis with evidence strength, safety flags, dissenting opinions, recommended workup

Finance Risk Assessment
Jury

Quantitative financial risk analysis with mandatory dissent. Models evaluate market, credit, operational, and liquidity risks using standard metrics (VaR, CVaR, Sharpe ratio). Stress testing scenarios are required. Results mapped to regulatory frameworks (Basel III, SOX, Dodd-Frank, MiFID II).

Default Models

3

Max Rounds

3

Mode

Jury

Evaluation Rubric

DimensionWeightEvaluates
Quantitative Rigor30%VaR, CVaR, Sharpe ratio, statistical validity
Regulatory Alignment25%Basel III, SOX, Dodd-Frank, MiFID II compliance
Risk Coverage20%Market, credit, operational, liquidity risk completeness
Scenario Analysis15%Stress testing, tail risk, Monte Carlo quality
Actionability10%Hedging strategies, portfolio adjustments, timelines

Example Prompt

Evaluate the risk profile of this investment portfolio under current market conditions with stress scenarios

Output

Risk assessment with quantitative metrics, stress test results, regulatory mapping, hedging strategies, mandatory dissent

Programmatic Usage

Templates can be loaded programmatically via the template registry. Each template returns a configuration object with mode, rubric, system prompts, max rounds, and default models.

from consilium.templates import get_template, TEMPLATES

# List all templates
print(TEMPLATES.keys())
# → ["code_review", "research_synthesis", "risk_assessment",
#    "healthcare", "legal", "finance"]

# Load a template
template = get_template("code_review")
# Returns: {
#   topic: str,
#   mode: "redteam",
#   rubric: { security: 0.30, correctness: 0.25, ... },
#   system_prompts: { attacker: "...", defender: "..." },
#   max_rounds: 2,
#   default_models: 3
# }