TNG: DeepSeek R1T2 Chimera
DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20 % faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60 k tokens in standard use (tested to ~130 k) and maintains consistent <think> token behaviour, making it suitable for long-context analysis, dialogue and other open-ended generation tasks.
Pricing per 1M Tokens
| Input (Prompt) | $0.30 |
| Output (Completion) | $1.10 |
| Cache Read | $0.15 |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 164K |
| Max Output Tokens | 164K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | DeepSeek |
| Instruct Type | N/A |
| Top Provider Context | 164K |
| Top Provider Max Output | 164K |
| Moderated | No |
Compare this model
See how TNG: DeepSeek R1T2 Chimera stacks up against other models.
Last updated: March 23, 2026
First tracked: March 23, 2026