Paper

GLM-5: From Vibe Coding to Agentic Engineering

2026.02.11

·Web·by 네루

#LLM#Agent#AI#DeepSeek#Open Source

Key Points

1GLM-5 is a new large language model scaling to 744B parameters and 28.5T pre-training tokens, integrating DeepSeek Sparse Attention and a novel `slime` RL infrastructure for improved efficiency.
2Designed for complex systems engineering and long-horizon agentic tasks, GLM-5 demonstrates significant performance improvements across a wide range of academic benchmarks.
3It achieves best-in-class performance among open-source models in reasoning, coding, and agentic tasks, closing the gap with frontier models, and is available open-source as well as via APIs.

4,432.12, showcasing strong long-term planning and resource management, nearing Claude Opus 4.5 (

Paper

2026.02.11

·Web·by 네루

#LLM#Agent#AI#DeepSeek#Open Source

1GLM-5 is a new large language model scaling to 744B parameters and 28.5T pre-training tokens, integrating DeepSeek Sparse Attention and a novel `slime` RL infrastructure for improved efficiency.
2Designed for complex systems engineering and long-horizon agentic tasks, GLM-5 demonstrates significant performance improvements across a wide range of academic benchmarks.
3It achieves best-in-class performance among open-source models in reasoning, coding, and agentic tasks, closing the gap with frontier models, and is available open-source as well as via APIs.

4,432.12, showcasing strong long-term planning and resource management, nearing Claude Opus 4.5 (