Model Family Catalog¶

Overview¶

This page is the human-readable rendering of contracts/model_family_catalog.json.

Use it to answer three distinct questions without weakening the public meaning of the support matrix:

What is supported as a public lane?
What families are implemented in code but not publicly supported?
What families or capabilities should be added next?

Support Tier vs Coverage State¶

Term	Meaning	Source of truth
`support tier`	Public support/assurance posture for a declared lane. Values stay aligned with `support_matrix.json`.	`contracts/support_matrix.json`
`coverage state`	Repo implementation maturity outside the public support matrix, such as `profile_first_class`, `profile_shared_alias`, `auto_or_loader_only`, `loader_only`, or backlog states.	`contracts/model_family_catalog.json`

The support matrix remains strict. The model family catalog is broader by design and records code-level visibility, usage-only checkpoints, and recommended additions. Access-gated vendor checkpoints are intentionally kept out of declared support lanes and included preset inventory.

Declared Support¶

Family	State	Representative models	Notes
GPT-2 causal LM	`published_basis`	`openai-community/gpt2`	Public lane derived from `gpt2-causal-hf`.
BERT / RoBERTa MLM	`published_basis`	`bert-base-uncased`, `roberta-base`	Public lane derived from `bert-mlm-hf`.
Mistral 7B causal LM	`supported_experimental`	`mistralai/Mistral-7B-v0.1`	Pilot preset and calibration config are included.
Ministral 3 causal LM (text-only eval)	`supported_experimental`	`mistralai/Ministral-3-8B-Instruct-2512-BF16`, `mistralai/Ministral-3-14B-Instruct-2512-BF16`	Text-only pilot presets and calibration configs are included for both 8B and 14B checkpoints.
Qwen2 7B causal LM	`supported_experimental`	`Qwen/Qwen2-7B`	Pilot preset and calibration config are included.
Qwen2.5 7B causal LM	`supported_experimental`	`Qwen/Qwen2.5-7B`	Pilot preset and calibration config are included.
Qwen2.5 14B causal LM	`supported_experimental`	`Qwen/Qwen2.5-14B`	Pilot preset and calibration config are included.
Qwen3 causal LM	`supported_experimental`	`Qwen/Qwen3-8B`	Pilot preset and calibration config are included.
DeepSeek-R1-Distill-Qwen causal LM	`supported_experimental`	`deepseek-ai/DeepSeek-R1-Distill-Qwen-7B`	Pilot preset and calibration config are included.
Phi-4 causal LM (text-only eval)	`supported_experimental`	`microsoft/Phi-4-reasoning-plus`	Text-only pilot preset and calibration config are included. Current HF runtime validation closes cleanly when the lane opts into `trust_remote_code`.
Gemma 4 E2B causal LM (text-only eval)	`supported_experimental`	`google/gemma-4-E2B-it`	Text-only pilot preset and calibration config are included. Image-text evaluation uses the explicit `hf_multimodal` + `vision_text` path.
TinyLlama 1.1B causal LM	`supported_experimental`	`TinyLlama/TinyLlama-1.1B-Chat-v1.0`	Ungated Llama-family pilot lane with included preset and calibration config.
OLMo 2 causal LM	`supported_experimental`	`allenai/OLMo-2-1124-7B`, `allenai/OLMo-2-1124-13B-Instruct`	Pilot presets and calibration configs are included for both 7B and 13B scale points.
Qwen3.5 causal LM	`supported_experimental`	`Qwen/Qwen3.5-9B`	Pilot preset and calibration config are included.
Seq2Seq / local pairs	`community_experimental`	`t5-small`, `facebook/bart-base`	Generic seq2seq lane without a published-basis claim.

Implemented Coverage¶

Family	Coverage state	Representative models	Notes
Mixtral	`profile_first_class`	`mistralai/Mixtral-8x7B-v0.1`	Profile and loader code recognize the family directly.
Llama	`profile_first_class`	`openlm-research/open_llama_7b`, `TinyLlama/TinyLlama-1.1B-Chat-v1.0`	Generic Llama-family profile handling is first-class. TinyLlama provides the ungated declared support lane, while access-gated vendor checkpoints remain omitted.
Qwen family aliases (Qwen1.5/Qwen2.5/Qwen3 naming)	`profile_first_class`	`Qwen/Qwen2.5-14B`, `Qwen/Qwen3.5-9B`	Shared qwen-family heuristics cover aliases beyond the declared Qwen2, Qwen2.5 14B, Qwen3, and Qwen3.5 lanes, including usage-only Qwen2.5 checkpoints.
Yi	`profile_first_class`	`01-ai/Yi-34B`	Treated as a RoPE decoder family in profile logic.
Phi family	`profile_first_class`	`microsoft/Phi-3-mini-4k-instruct`, `microsoft/Phi-4-reasoning-plus`	Dedicated phi-family selectors exist. Phi-4 has a declared text-only lane, while multimodal Phi-4 remains backlog-only.
Gemma family	`profile_first_class`	`google/gemma-3-4b-it`, `google/gemma-4-E2B-it`	Gemma 3/4 selectors and loaders are first-class. Gemma 4 E2B has a declared text-only lane, image-text evaluation uses `hf_multimodal` + `vision_text`, and audio remains deferred.
OPT / GPT-NeoX / GPT-J	`profile_shared_alias`	`facebook/opt-1.3b`, `EleutherAI/gpt-neox-20b`	Available through shared GPT-style paths.
GPT-OSS	`profile_first_class`	`openai/gpt-oss-20b`	Dedicated profile selectors and HF causal decoder spec now cover the open-weight checkpoint directly.
Falcon	`auto_or_loader_only`	`tiiuae/falcon-7b`	Visible through adapter-auto heuristics only.
GLM	`auto_or_loader_only`	`THUDM/glm-4-9b-chat`	Visible through adapter-auto heuristics only.
DeepSeek	`profile_first_class`	`deepseek-ai/DeepSeek-R1-Distill-Qwen-7B`	DeepSeek distill checkpoints share the qwen-family route. Oversized FP8 checkpoint-specific repo hooks and included configs are omitted because they do not fit the supported hardware/runtime path.
Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA)	`auto_or_loader_only`	`distilbert-base-uncased`, `microsoft/deberta-v3-base`	Loader/auto support exceeds the public BERT / RoBERTa lane.
Broader seq2seq families (mBART/PEGASUS/Marian)	`auto_or_loader_only`	`facebook/mbart-large-50`, `Helsinki-NLP/opus-mt-en-de`	Loader support is broader than the generic seq2seq public lane.

Usage Only¶

Family	State	Representative models	Notes
Qwen2.5 32B	`usage_only`	`Qwen/Qwen2.5-32B`	Used in evidence-pack suites and validation defaults outside the declared Qwen2.5 14B support lane.
Qwen1.5 72B	`usage_only`	`Qwen/Qwen1.5-72B`	Used concretely in evidence-pack suites.
Yi 34B	`usage_only`	`01-ai/Yi-34B`	Used in workshop and full evidence-pack suites.
Mixtral 8x7B	`usage_only`	`mistralai/Mixtral-8x7B-v0.1`	Used in evidence-pack flows without a public support lane.

<=14B Text Candidate Inventory¶

This section summarizes the contract-tracked <=14B text and MLM candidates that sit outside, adjacent to, or have recently graduated into declared support.

It is a catalog view, not a run ledger. Exact criterion-by-criterion status and decision codes live under promotion_candidates_text_le_14b in contracts/model_family_catalog.json.

Family	Representative model	Promotion status	Catalog location	Notes
Qwen2.5 7B causal LM	`Qwen/Qwen2.5-7B`	`promote_now`	`supported_experimental`	This candidate already graduated into declared support and remains listed here because the contract tracks the reviewed candidate set.
OpenLLaMA 7B causal LM	`openlm-research/open_llama_7b`	`blocked_missing_artifacts`	`implemented_coverage`	Recognition and targeted tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
Phi-3 Mini 4K Instruct causal LM	`microsoft/Phi-3-mini-4k-instruct`	`explicitly_out_of_scope`	`implemented_coverage`	The current declared Phi support surface remains the shipped Phi-4 text-only lane.
Gemma 3 4B IT	`google/gemma-3-4b-it`	`explicitly_out_of_scope`	`implemented_coverage`	The broader Gemma family remains multimodal/audio-capable, so this text-only candidate is not part of the tracked promotion set.
OPT 1.3B causal LM	`facebook/opt-1.3b`	`blocked_missing_artifacts`	`implemented_coverage`	Shared GPT-style recognition exists, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
Falcon 7B causal LM	`tiiuae/falcon-7b`	`blocked_missing_artifacts`	`implemented_coverage`	Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
GLM 4 9B Chat	`THUDM/glm-4-9b-chat`	`blocked_missing_artifacts`	`implemented_coverage`	Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA)	`distilbert-base-uncased`	`blocked_missing_artifacts`	`implemented_coverage`	Loader and adapter tests exist for DistilBERT and DeBERTa, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
mBART large 50 seq2seq	`facebook/mbart-large-50`	`explicitly_out_of_scope`	`implemented_coverage`	Generic seq2seq and community lanes remain outside this tracked candidate set.

The machine-readable criterion-by-criterion ledger for this candidate set lives under promotion_candidates_text_le_14b in contracts/model_family_catalog.json.

Recommended Additions¶

Priority	Family	Planned support mode	Representative models	Notes
`P2`	Audio-text evaluation pipeline	`phase2_audio_eval`	`google/gemma-4-E2B-it`, `google/gemma-4-E4B-it`	Image-text evaluation is included. Audio-capable evaluation for the smaller Gemma 4 checkpoints remains deferred.

Promotion Criteria¶

A family only moves into support_matrix.json after all of the following are present:

explicit adapter/profile recognition
an included preset
an included calibration config
targeted tests
CLI smoke evidence
approved calibration/evaluation evidence