Cogito V2 Preview Llama 109B

deepcogito/cogito-v2-preview-llama-109b-moe

An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B-16E. Cogito v2 can answer directly or engage an extended “thinking” phase, with alignment guided by Iterated Distillation & Amplification (IDA). It targets coding, STEM, instruction following, and general helpfulness, with stronger multilingual, tool-calling, and reasoning performance than size-equivalent baselines. The model supports long-context use (up to 10M tokens) and standard Transformers workflows. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs(opens in new tab)

Modalities

Context

131K

Knowledge Cutoff

Aug 31, 2024

Cogito V2 Preview Llama 109B

deepcogito/cogito-v2-preview-llama-109b-moe

An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B-16E. Cogito v2 can answer directly or engage an extended “thinking” phase, with alignment guided by Iterated Distillation & Amplification (IDA). It targets coding, STEM, instruction following, and general helpfulness, with stronger multilingual, tool-calling, and reasoning performance than size-equivalent baselines. The model supports long-context use (up to 10M tokens) and standard Transformers workflows. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs(opens in new tab)

Modalities

Context

131K

Knowledge Cutoff

Aug 31, 2024

Cogito V2 Preview Llama 109B

deepcogito/cogito-v2-preview-llama-109b-moe

Cogito V2 Preview Llama 109B

deepcogito/cogito-v2-preview-llama-109b-moe

Recent activity on Cogito V2 Preview Llama 109B

Total usage per day on OpenRouter

Not enough data to display yet.

Recent activity on Cogito V2 Preview Llama 109B

Total usage per day on OpenRouter

Not enough data to display yet.