naver-hyperclovax/HyperCLOVAX-SEED-Think-32B · Hugging Face
Key Points
- HyperCLOVA X SEED 32B Think is a 32-billion-parameter vision-language model with a unified Transformer backbone and a reasoning-centric training recipe.
- It supports multimodal understanding up to 128K tokens, processing text, image, and video inputs within a shared embedding space and offering an optional "thinking mode" for deep reasoning.
- Designed for practical reasoning and agentic capabilities, and particularly strong in Korean, the model requires significant GPU resources for deployment via its OmniServe inference system.
HyperCLOVA X SEED 32B Think is an advanced multimodal large language model, succeeding the SEED Think 14B series, designed for enhanced reasoning with particular strength in Korean-language contexts.
Architecture and Core Methodology: The model employs a unified vision-language Transformer backbone, classifying it as a dense model with 32 billion parameters. A core methodological aspect is its ability to process diverse input modalities—text tokens and visual patches (from images or video frames)—within a shared embedding space. This unified representation facilitates deep, integrated multimodal understanding. It supports an extensive context length of up to 128,000 tokens, enabling comprehensive processing of long textual and visual sequences.
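The shared-embedding idea described above can be illustrated with a toy sketch: text tokens and flattened visual patches are each mapped to the same hidden size and concatenated into one sequence for the Transformer backbone. All dimensions below are made up for illustration and do not reflect the model's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
hidden = 64  # toy hidden size; the real model's is much larger

# Text side: an embedding table lookup for token ids.
vocab_embed = rng.normal(size=(1000, hidden))
text_ids = np.array([5, 42, 7])
text_emb = vocab_embed[text_ids]              # shape (3, hidden)

# Vision side: a linear projection of flattened image patches
# into the same hidden dimension as the text embeddings.
patch_proj = rng.normal(size=(768, hidden))
patches = rng.normal(size=(4, 768))           # 4 flattened patches
patch_emb = patches @ patch_proj              # shape (4, hidden)

# One interleavable sequence in the shared embedding space,
# ready to be consumed by a unified Transformer backbone.
sequence = np.concatenate([text_emb, patch_emb], axis=0)  # shape (7, hidden)
```

Because both modalities live in the same space, the backbone attends across text and visual positions uniformly, which is what enables the integrated multimodal understanding described above.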
Key Capabilities and Reasoning Mode: HyperCLOVA X SEED 32B Think accepts text, image, and video as inputs and produces text as output. A distinctive feature is its optional "thinking mode," which enables deep and controllable reasoning akin to chain-of-thought (CoT) prompting. When activated, the model generates intermediate reasoning steps enclosed in dedicated tags in its output, providing transparency and better control over complex problem-solving. This mode is activated by setting chat_template_kwargs.thinking to True, and the length of the reasoning process can be capped via thinking_token_budget. The model also natively supports multi-turn conversations and is proficient in image and video understanding.
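A minimal sketch of wiring up thinking mode, assuming the parameter names described above (thinking, thinking_token_budget) are passed through chat_template_kwargs to a Hugging Face-style apply_chat_template call. The helper function and default budget here are illustrative, not part of the model's documented API.

```python
def build_generation_request(messages, thinking=True, thinking_token_budget=2048):
    """Assemble kwargs for a tokenizer.apply_chat_template(...) call.

    `thinking` toggles the optional reasoning mode; `thinking_token_budget`
    caps the length of the intermediate reasoning (names follow the model
    card's description; defaults here are assumptions).
    """
    return {
        "conversation": messages,
        "add_generation_prompt": True,
        "chat_template_kwargs": {
            "thinking": thinking,
            "thinking_token_budget": thinking_token_budget,
        },
    }

request = build_generation_request(
    [{"role": "user", "content": "Prove that the square root of 2 is irrational."}],
    thinking=True,
    thinking_token_budget=1024,
)
```

With thinking=False the same template would produce a direct answer without the intermediate reasoning trace, which is useful when latency matters more than transparency.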
Performance and Application Focus: Building upon its predecessor, the 32B model specifically strengthens Korean-centric reasoning and agentic capabilities, aiming to improve practical reasoning quality and reliability in real-world applications. It has been evaluated on a diverse set of benchmarks, including Korean text-based general knowledge (KoBalt, CLIcK, HAERAE Bench 1.0), vision understanding (ChartQA, TextVQA, K-MMBench, K-DTCBench), and agentic tasks (Tau^2-Airline, Tau^2-Retail, Tau^2-Telecom).
Inference and Deployment: The model is made available through OmniServe, a production-ready multimodal inference system with an OpenAI-compatible API. Deployment of the 32B model necessitates significant hardware resources, specifically requiring a total of 3x NVIDIA A100 80GB GPUs (1x ~8GB for the Vision Encoder and 2x ~60GB for the 32B LLM).
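Since OmniServe exposes an OpenAI-compatible API, a multimodal request can be built as a standard chat-completions payload. The endpoint URL, model name, and image URL below are placeholders, not documented values; the payload shape follows the common OpenAI chat-completions convention for mixed text-and-image content.

```python
import json

def chat_completion_payload(prompt, image_url=None,
                            model="HyperCLOVAX-SEED-Think-32B"):
    """Build an OpenAI-style chat-completions request body.

    Text and (optionally) an image are combined into one user message,
    mirroring the multimodal input support described above.
    """
    content = [{"type": "text", "text": prompt}]
    if image_url is not None:
        content.append({"type": "image_url", "image_url": {"url": image_url}})
    return {"model": model, "messages": [{"role": "user", "content": content}]}

payload = chat_completion_payload(
    "Describe this chart.",
    image_url="https://example.com/chart.png",  # placeholder URL
)
body = json.dumps(payload)  # POST this to the OmniServe /v1/chat/completions route
```

In practice this body would be sent with an HTTP client (or the official openai Python SDK pointed at the OmniServe base URL); the sketch stops at payload construction to stay self-contained.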