1.0 GB · GGUF Q4_K_M quantization
Ultra-lightweight. Runs on any laptop — perfect for edge devices and quick prototyping.
Loading downloads...
2.5 GB · GGUF Q4_K_M quantization
Balanced everyday performance — ideal for chatbots, summarization, and content tasks.
Loading downloads...
4.7 GB · GGUF Q4_K_M quantization
Strong reasoning for complex tasks — code, analysis, and multi-step problem solving.
Loading downloads...
5.3 GB · GGUF Q4_K_M quantization
High-performance for demanding workflows — research, RAG pipelines, agent systems.
Loading downloads...
7.0 GB · GGUF Q4_K_M quantization
Maximum quality for professional use — detailed reports, creative writing, advanced code.
Loading downloads...
18.6 GB · GGUF Q4_K_M quantization
Flagship model — frontier-level performance for the most demanding enterprise applications.
Loading downloads...