Model Family Catalog¶
Overview¶
This page is the human-readable rendering of
contracts/model_family_catalog.json.
Use it to answer three distinct questions without weakening the public meaning of the support matrix:
- What is supported as a public lane?
- What families are implemented in code but not publicly supported?
- What families or capabilities should be added next?
Support Tier vs Coverage State¶
| Term | Meaning | Source of truth |
|---|---|---|
support tier |
Public support/assurance posture for a declared lane. Values stay aligned with support_matrix.json. |
contracts/support_matrix.json |
coverage state |
Repo implementation maturity outside the public support matrix, such as profile_first_class, profile_shared_alias, auto_or_loader_only, loader_only, or backlog states. |
contracts/model_family_catalog.json |
The support matrix remains strict. The model family catalog is broader by design and records code-level visibility, usage-only checkpoints, and recommended additions. Access-gated vendor checkpoints are intentionally kept out of declared support lanes and included preset inventory.
Declared Support¶
| Family | State | Representative models | Notes |
|---|---|---|---|
| GPT-2 causal LM | published_basis |
openai-community/gpt2 |
Public lane derived from gpt2-causal-hf. |
| BERT / RoBERTa MLM | published_basis |
bert-base-uncased, roberta-base |
Public lane derived from bert-mlm-hf. |
| Mistral 7B causal LM | supported_experimental |
mistralai/Mistral-7B-v0.1 |
Pilot preset and calibration config are included. |
| Ministral 3 causal LM (text-only eval) | supported_experimental |
mistralai/Ministral-3-8B-Instruct-2512-BF16, mistralai/Ministral-3-14B-Instruct-2512-BF16 |
Text-only pilot presets and calibration configs are included for both 8B and 14B checkpoints. |
| Qwen2 7B causal LM | supported_experimental |
Qwen/Qwen2-7B |
Pilot preset and calibration config are included. |
| Qwen2.5 7B causal LM | supported_experimental |
Qwen/Qwen2.5-7B |
Pilot preset and calibration config are included. |
| Qwen2.5 14B causal LM | supported_experimental |
Qwen/Qwen2.5-14B |
Pilot preset and calibration config are included. |
| Qwen3 causal LM | supported_experimental |
Qwen/Qwen3-8B |
Pilot preset and calibration config are included. |
| DeepSeek-R1-Distill-Qwen causal LM | supported_experimental |
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B |
Pilot preset and calibration config are included. |
| Phi-4 causal LM (text-only eval) | supported_experimental |
microsoft/Phi-4-reasoning-plus |
Text-only pilot preset and calibration config are included. Current HF runtime validation closes cleanly when the lane opts into trust_remote_code. |
| Gemma 4 E2B causal LM (text-only eval) | supported_experimental |
google/gemma-4-E2B-it |
Text-only pilot preset and calibration config are included. Image-text evaluation uses the explicit hf_multimodal + vision_text path. |
| TinyLlama 1.1B causal LM | supported_experimental |
TinyLlama/TinyLlama-1.1B-Chat-v1.0 |
Ungated Llama-family pilot lane with included preset and calibration config. |
| OLMo 2 causal LM | supported_experimental |
allenai/OLMo-2-1124-7B, allenai/OLMo-2-1124-13B-Instruct |
Pilot presets and calibration configs are included for both 7B and 13B scale points. |
| Qwen3.5 causal LM | supported_experimental |
Qwen/Qwen3.5-9B |
Pilot preset and calibration config are included. |
| Seq2Seq / local pairs | community_experimental |
t5-small, facebook/bart-base |
Generic seq2seq lane without a published-basis claim. |
Implemented Coverage¶
| Family | Coverage state | Representative models | Notes |
|---|---|---|---|
| Mixtral | profile_first_class |
mistralai/Mixtral-8x7B-v0.1 |
Profile and loader code recognize the family directly. |
| Llama | profile_first_class |
openlm-research/open_llama_7b, TinyLlama/TinyLlama-1.1B-Chat-v1.0 |
Generic Llama-family profile handling is first-class. TinyLlama provides the ungated declared support lane, while access-gated vendor checkpoints remain omitted. |
| Qwen family aliases (Qwen1.5/Qwen2.5/Qwen3 naming) | profile_first_class |
Qwen/Qwen2.5-14B, Qwen/Qwen3.5-9B |
Shared qwen-family heuristics cover aliases beyond the declared Qwen2, Qwen2.5 14B, Qwen3, and Qwen3.5 lanes, including usage-only Qwen2.5 checkpoints. |
| Yi | profile_first_class |
01-ai/Yi-34B |
Treated as a RoPE decoder family in profile logic. |
| Phi family | profile_first_class |
microsoft/Phi-3-mini-4k-instruct, microsoft/Phi-4-reasoning-plus |
Dedicated phi-family selectors exist. Phi-4 has a declared text-only lane, while multimodal Phi-4 remains backlog-only. |
| Gemma family | profile_first_class |
google/gemma-3-4b-it, google/gemma-4-E2B-it |
Gemma 3/4 selectors and loaders are first-class. Gemma 4 E2B has a declared text-only lane, image-text evaluation uses hf_multimodal + vision_text, and audio remains deferred. |
| OPT / GPT-NeoX / GPT-J | profile_shared_alias |
facebook/opt-1.3b, EleutherAI/gpt-neox-20b |
Available through shared GPT-style paths. |
| GPT-OSS | profile_first_class |
openai/gpt-oss-20b |
Dedicated profile selectors and HF causal decoder spec now cover the open-weight checkpoint directly. |
| Falcon | auto_or_loader_only |
tiiuae/falcon-7b |
Visible through adapter-auto heuristics only. |
| GLM | auto_or_loader_only |
THUDM/glm-4-9b-chat |
Visible through adapter-auto heuristics only. |
| DeepSeek | profile_first_class |
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B |
DeepSeek distill checkpoints share the qwen-family route. Oversized FP8 checkpoint-specific repo hooks and included configs are omitted because they do not fit the supported hardware/runtime path. |
| Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA) | auto_or_loader_only |
distilbert-base-uncased, microsoft/deberta-v3-base |
Loader/auto support exceeds the public BERT / RoBERTa lane. |
| Broader seq2seq families (mBART/PEGASUS/Marian) | auto_or_loader_only |
facebook/mbart-large-50, Helsinki-NLP/opus-mt-en-de |
Loader support is broader than the generic seq2seq public lane. |
Usage Only¶
| Family | State | Representative models | Notes |
|---|---|---|---|
| Qwen2.5 32B | usage_only |
Qwen/Qwen2.5-32B |
Used in evidence-pack suites and validation defaults outside the declared Qwen2.5 14B support lane. |
| Qwen1.5 72B | usage_only |
Qwen/Qwen1.5-72B |
Used concretely in evidence-pack suites. |
| Yi 34B | usage_only |
01-ai/Yi-34B |
Used in workshop and full evidence-pack suites. |
| Mixtral 8x7B | usage_only |
mistralai/Mixtral-8x7B-v0.1 |
Used in evidence-pack flows without a public support lane. |
<=14B Text Candidate Inventory¶
This section summarizes the contract-tracked <=14B text and MLM candidates
that sit outside, adjacent to, or have recently graduated into declared
support.
It is a catalog view, not a run ledger. Exact criterion-by-criterion status and
decision codes live under promotion_candidates_text_le_14b in
contracts/model_family_catalog.json.
| Family | Representative model | Promotion status | Catalog location | Notes |
|---|---|---|---|---|
| Qwen2.5 7B causal LM | Qwen/Qwen2.5-7B |
promote_now |
supported_experimental |
This candidate already graduated into declared support and remains listed here because the contract tracks the reviewed candidate set. |
| OpenLLaMA 7B causal LM | openlm-research/open_llama_7b |
blocked_missing_artifacts |
implemented_coverage |
Recognition and targeted tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Phi-3 Mini 4K Instruct causal LM | microsoft/Phi-3-mini-4k-instruct |
explicitly_out_of_scope |
implemented_coverage |
The current declared Phi support surface remains the shipped Phi-4 text-only lane. |
| Gemma 3 4B IT | google/gemma-3-4b-it |
explicitly_out_of_scope |
implemented_coverage |
The broader Gemma family remains multimodal/audio-capable, so this text-only candidate is not part of the tracked promotion set. |
| OPT 1.3B causal LM | facebook/opt-1.3b |
blocked_missing_artifacts |
implemented_coverage |
Shared GPT-style recognition exists, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Falcon 7B causal LM | tiiuae/falcon-7b |
blocked_missing_artifacts |
implemented_coverage |
Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| GLM 4 9B Chat | THUDM/glm-4-9b-chat |
blocked_missing_artifacts |
implemented_coverage |
Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA) | distilbert-base-uncased |
blocked_missing_artifacts |
implemented_coverage |
Loader and adapter tests exist for DistilBERT and DeBERTa, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| mBART large 50 seq2seq | facebook/mbart-large-50 |
explicitly_out_of_scope |
implemented_coverage |
Generic seq2seq and community lanes remain outside this tracked candidate set. |
The machine-readable criterion-by-criterion ledger for this candidate set lives
under promotion_candidates_text_le_14b in
contracts/model_family_catalog.json.
Recommended Additions¶
| Priority | Family | Planned support mode | Representative models | Notes |
|---|---|---|---|---|
P2 |
Audio-text evaluation pipeline | phase2_audio_eval |
google/gemma-4-E2B-it, google/gemma-4-E4B-it |
Image-text evaluation is included. Audio-capable evaluation for the smaller Gemma 4 checkpoints remains deferred. |
Promotion Criteria¶
A family only moves into support_matrix.json after all of the following are
present:
- explicit adapter/profile recognition
- an included preset
- an included calibration config
- targeted tests
- CLI smoke evidence
- approved calibration/evaluation evidence