LGAI-EXAONE/K-EXAONE-236B-A23B · Hugging Face
Key Points
- K-EXAONE is a 236-billion-parameter Mixture-of-Experts multilingual language model developed by LG AI Research, with 23 billion parameters active during inference.
- It features a 256K context window using a hybrid attention scheme and improves inference throughput by approximately 1.5x with Multi-Token Prediction.
- The model demonstrates strong performance across diverse benchmarks, excelling in reasoning, agentic capabilities, general knowledge, multilingual understanding across six languages, and long-context processing.
K-EXAONE is a large-scale, multilingual language model developed by LG AI Research, featuring a Mixture-of-Experts (MoE) architecture. The model has a total of 236 billion parameters, with 23 billion parameters active during inference. It demonstrates strong capabilities in reasoning, agentic functions, general knowledge, multilingual understanding, and long-context processing.
The core methodology of K-EXAONE is built upon several key technical features:
- Architecture & Efficiency:
- Mixture-of-Experts (MoE): The model utilizes a fine-grained MoE design with 128 experts, where 8 experts are activated per token, and 1 expert is shared across all tokens. The MoE intermediate size is 2,048.
- Multi-Token Prediction (MTP): This mechanism is integrated to enable self-speculative decoding, which significantly boosts inference throughput by approximately 1.5 times. MTP involves predicting multiple future tokens simultaneously, allowing for faster generation by verifying these predictions.
- Hybrid Attention Scheme: K-EXAONE natively supports an extensive 256K context window. To manage memory efficiently, it employs a 3:1 hybrid attention scheme combining sliding window attention and global attention, a pattern applied 12 times across the network.
- Sliding Window Attention: Uses a 128-token sliding window to limit attention to local context and minimize memory usage, with 64 query heads, 8 key-value heads, and a head dimension of 128.
- Global Attention: Interleaved with sliding window attention to capture long-range dependencies, with the same 64 query heads, 8 key-value heads, and head dimension of 128.
- Positional Encoding: Notably, K-EXAONE uses no explicit positional embedding (NoPE), rather than Rotary Positional Embedding (RoPE).
- Model Configuration: The model has 48 main layers plus 1 MTP layer. The hidden dimension is 6,144. The vocabulary size is 153,600, refined with SuperBPE, which improves token efficiency by roughly 30%. The knowledge cutoff for the model's training data is December 2024.
- Multilingual Support: K-EXAONE is designed to cover 6 languages: Korean, English, Spanish, German, Japanese, and Vietnamese, reflecting its robust multilingual understanding capabilities.
- Agentic Capabilities: The model demonstrates advanced tool-use and search capabilities, facilitated by multi-agent strategies, allowing it to interact with external tools and information sources effectively.
- Safety & Ethics: K-EXAONE is aligned with universal human values and incorporates specific Korean cultural and historical contexts to address regional sensitivities, aiming for high reliability across diverse risk categories.
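The fine-grained MoE routing described above can be sketched as follows. This is a toy illustration using the card's numbers (128 routed experts, 8 active per token, hidden dimension 6,144); the router weights and normalization details are hypothetical, and the shared expert that processes every token is omitted.

```python
import numpy as np

NUM_EXPERTS = 128   # routed experts (from the model card)
TOP_K = 8           # experts activated per token
HIDDEN = 6144       # hidden dimension

rng = np.random.default_rng(0)
router_w = rng.standard_normal((HIDDEN, NUM_EXPERTS)) / np.sqrt(HIDDEN)

def route(hidden_states: np.ndarray) -> np.ndarray:
    """Toy top-k router: nonzero gate weights for exactly TOP_K experts."""
    logits = hidden_states @ router_w
    top = np.argsort(logits, axis=-1)[:, -TOP_K:]          # top-8 expert ids
    gates = np.full_like(logits, -np.inf)                  # mask the rest out
    np.put_along_axis(gates, top,
                      np.take_along_axis(logits, top, axis=-1), axis=-1)
    gates = np.exp(gates - gates.max(axis=-1, keepdims=True))
    return gates / gates.sum(axis=-1, keepdims=True)       # softmax over top-8

tokens = rng.standard_normal((4, HIDDEN))
g = route(tokens)
assert ((g > 0).sum(axis=-1) == TOP_K).all()   # exactly 8 experts per token
```

In a real MoE layer, each selected expert (intermediate size 2,048 here) processes the token and the outputs are combined with these gate weights.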
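The 3:1 hybrid attention layout can be made concrete with a small sketch: three sliding-window layers followed by one global layer, repeated 12 times over the 48 layers, with a 128-token window on the sliding layers. The mask function below is an illustrative simplification, not the model's implementation.

```python
SLIDING, GLOBAL = "sliding", "global"
WINDOW = 128     # sliding-window size (from the card)
N_LAYERS = 48

# 3:1 hybrid scheme: three sliding-window layers, then one global layer,
# repeated 12 times across the 48 transformer layers
layer_types = ([SLIDING] * 3 + [GLOBAL]) * 12
assert len(layer_types) == N_LAYERS and layer_types.count(GLOBAL) == 12

def attends(layer: str, q: int, k: int) -> bool:
    """Causal attention mask entry for query position q, key position k."""
    if k > q:                  # causal: never attend to the future
        return False
    if layer == GLOBAL:
        return True            # global layers see the whole prefix
    return q - k < WINDOW      # sliding layers see only the last 128 tokens

# a query at position 500 cannot reach key 300 in a sliding layer,
# but can in a global layer
assert not attends(SLIDING, 500, 300)
assert attends(GLOBAL, 500, 300)
```

The sliding layers keep the KV cache bounded to the window, while the periodic global layers preserve long-range information flow across the 256K context.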
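The MTP-based self-speculative decoding loop can likewise be sketched. Here both the "main model" and the "MTP head" are toy greedy functions over digits (the real ones are neural networks), and verification happens token by token rather than in one batched pass; only the accept/correct logic mirrors the technique.

```python
def main_model(seq):                 # stand-in for the full model (greedy)
    return (seq[-1] + 1) % 10

def mtp_draft(seq, k=2):             # stand-in MTP head; 2nd draft is off
    first = (seq[-1] + 1) % 10
    return [first, (first + 2) % 10][:k]

def speculative_step(seq, k=2):
    """Draft k tokens cheaply, verify with the main model, and keep the
    longest matching prefix plus one corrected (or bonus) token."""
    accepted = []
    for d in mtp_draft(seq, k):
        target = main_model(seq + accepted)  # verification (batched in a
        if d == target:                      # real implementation)
            accepted.append(d)               # draft accepted for free
        else:
            accepted.append(target)          # first mismatch: keep the
            break                            # corrected token and stop
    else:
        accepted.append(main_model(seq + accepted))  # all matched: bonus
    return accepted

assert speculative_step([3]) == [4, 5]   # one step emits two tokens
```

Because each verified step can emit several tokens for roughly one main-model pass, throughput improves, which is the ~1.5x speedup the card attributes to MTP.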
Evaluation Results:
Evaluations show K-EXAONE's competitive performance across various benchmarks, often outperforming its predecessor, EXAONE 4.0, and competing well against other large models like GPT-OSS, Qwen3-Thinking, and DeepSeek-V3.2.
- World Knowledge: Achieves 83.8 on MMLU-Pro, 79.1 on GPQA-Diamond.
- Math: Scores 76.3 on IMO-AnswerBench, 92.8 on AIME 2025, and 86.8 on HMMT Nov 2025.
- Coding / Agentic: Shows 25.9 on LiveCodeBench Pro 25Q2 (Medium) and 80.7 on LiveCodeBench v6, indicating strong coding and agentic performance. It achieved 49.4 on SWE-Bench Verified and 29.0 on Terminal-Bench 2.0.
- Agentic Tool Use: Excels on the τ²-Bench datasets (Retail: 78.6, Airline: 60.4, Telecom: 73.5) and scores 31.4 on BrowseComp.
- Instruction Following: Scores 67.3 on IFBench and 89.7 on IFEval.
- Long Context Understanding: Achieves 53.5 on AA-LCR and 52.3 on OpenAI-MRCR.
- Korean Benchmarks: Demonstrates strong performance on Korean-specific evaluations (KMMLU-Pro: 67.3, KoBALT: 61.8, CLIcK: 83.9, HRM8K: 90.9, Ko-LongBench: 86.8).
- Multilinguality: Scores 85.7 on MMMLU and 90.5 on WMT24++.
- Safety: Achieves 89.9 on Wild-Jailbreak and 96.1 on KGC-Safety.
Usage and Deployment:
K-EXAONE supports two primary modes:
- Reasoning Mode: Activated by a flag set when applying the chat template. This mode prioritizes accuracy and is recommended for tasks requiring precise results. The prompt template includes a special token with which the model marks its reasoning process.
- Non-Reasoning Mode: Activated by leaving that flag unset. This mode prioritizes lower latency and suits applications where speed matters more than peak accuracy.
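Selecting a mode might look like the following. This is a hypothetical sketch only: the summary does not name the actual flag, so a boolean kwarg (assumed here as `enable_thinking`) stands in for whatever the chat template expects.

```python
def chat_template_kwargs(messages, reasoning: bool) -> dict:
    """Assemble kwargs for tokenizer.apply_chat_template under the
    assumed flag name (not confirmed by this card summary)."""
    return {
        "conversation": messages,
        "add_generation_prompt": True,
        "enable_thinking": reasoning,   # assumed flag name
    }

req = chat_template_kwargs(
    [{"role": "user", "content": "Compute 17 * 23 exactly."}],
    reasoning=True,   # accuracy-critical task -> reasoning mode
)
assert req["enable_thinking"] is True
```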
The model also supports agentic tool-use, compatible with both OpenAI and HuggingFace tool calling specifications.
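An OpenAI-style tool definition and the dispatch step on the application side can be sketched as below; the `get_weather` tool and its stubbed result are hypothetical, and only the schema shape follows the OpenAI tool-calling convention.

```python
import json

# OpenAI-style function-tool schema, as passed alongside the conversation
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

def dispatch(tool_call: dict) -> str:
    """Execute a tool call emitted by the model; return a JSON result
    that would be fed back as a tool message."""
    name = tool_call["function"]["name"]
    args = json.loads(tool_call["function"]["arguments"])
    if name == "get_weather":
        return json.dumps({"city": args["city"], "temp_c": 21})  # stubbed
    raise ValueError(f"unknown tool: {name}")

# a tool call in the shape the model might emit it
call = {"function": {"name": "get_weather",
                     "arguments": json.dumps({"city": "Seoul"})}}
result = dispatch(call)
assert json.loads(result)["city"] == "Seoul"
```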
For deployment, K-EXAONE requires specific forks of libraries such as Transformers, vLLM, and SGLang that include the EXAONE-MoE implementation. It can be served with vLLM or SGLang, and in practice supports the full 256K context length on configurations such as 4 H200 GPUs with tensor parallelism. Speculative decoding with the MTP weights is supported in these frameworks (e.g., --speculative_config '{"method": "mtp", "num_speculative_tokens": 2}' for vLLM, or the EAGLE method for SGLang). TensorRT-LLM support is also being prepared.
Limitations:
The model may occasionally generate inappropriate, biased, or factually incorrect responses due to its statistical nature and potential problematic content in training data. Generated text does not reflect the views of LG AI Research.
License: The model is licensed under the K-EXAONE AI Model License Agreement.