naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B · Hugging Face
Key Points
- HyperCLOVAX-SEED-Vision-Instruct-3B is NAVER's new lightweight, multimodal model designed for efficient visual understanding and text generation, specifically optimized for the Korean language.
- It features a LLaVA-based architecture, combining a 3.2B-parameter LLM with a 0.43B-parameter SigLIP vision encoder, trained using SFT and RLHF with an automated validation system.
- The model achieves competitive performance, outperforming similarly sized open-source models on Korean benchmarks, and represents Korea's first open-source vision-language model.
HyperCLOVAX-SEED-Vision-Instruct-3B is a lightweight, multimodal model developed by NAVER, capable of understanding both text and images, and generating text responses. It is built upon a proprietary backbone and fine-tuned through post-training, prioritizing computational efficiency and a Pareto-optimal balance specifically for the Korean language.
Model Architecture: The model employs a LLaVA-based Vision-Language Model architecture. Its components include:
- LLM Module: A Transformer-based dense model with 3.2 billion parameters.
- Vision Encoder: A SigLIP-based architecture that processes images with an input resolution of 378x378 pixels per grid.
- Vision-Language Connector: A C-Abstractor-based architecture featuring the AnyRes mechanism, which supports up to 1.29 million total pixels across 9 grids.
The total parameter count is 3.2 billion (LLM module) plus 0.43 billion (vision module). The model accepts Text + Image + Video inputs and produces Text output, with a 16k context length and a knowledge cutoff of August 2024.
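The grid arithmetic behind the 1.29-million-pixel figure follows directly from the numbers quoted above (378x378 pixels per grid, up to 9 grids); a quick sketch to verify:

```python
# Verify the AnyRes pixel budget quoted above:
# up to 9 grids, each at 378x378 input resolution.
GRID_SIDE = 378      # input resolution per grid (pixels)
MAX_GRIDS = 9        # maximum grids under AnyRes

pixels_per_grid = GRID_SIDE * GRID_SIDE          # 142,884
total_pixels = pixels_per_grid * MAX_GRIDS       # 1,285,956 ~= 1.29 million

print(f"{pixels_per_grid=}, {total_pixels=}")
```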
Training Methodology:
The training process involved both text and vision components, with a focus on overcoming data quality and cost challenges.
- Text Training: To secure high-quality data for post-training without relying heavily on manual annotation, an automated validation system powered by HyperCLOVA X was utilized. This system improved data quality and streamlined the training process, leading to enhanced performance in tasks with definitive answers, such as mathematics and coding. The model was developed by starting from HyperCLOVAX-SEED-Text-Base-3B and applying both Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF), specifically using an online reinforcement learning algorithm called GRPO.
- Vision Training: Vision understanding capabilities, including image-based Question Answering (VQA) and chart/diagram interpretation, were integrated into the model architecture without compromising the existing performance of the HyperCLOVA X LLM. A key focus for this 3B model was optimizing the efficiency of video input tokens, carefully adjusting the number of tokens extracted per frame to enable efficient video understanding with minimal tokens. Additionally, during the RLHF phase, vision-specific V-RLHF data was incorporated to enhance the model's learning on visual tasks, mirroring the approach used for text. The model supports OCR-free processing.
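The card does not detail GRPO further; its widely published core idea is to replace a learned value baseline with group-relative reward normalization over several responses sampled for the same prompt. The sketch below illustrates only that normalization step, with made-up reward values:

```python
from statistics import mean, stdev

def group_relative_advantages(rewards):
    """GRPO-style advantages: normalize each sampled response's reward
    by the mean and standard deviation of its group (same prompt)."""
    mu = mean(rewards)
    sigma = stdev(rewards) or 1.0  # guard against a zero-variance group
    return [(r - mu) / sigma for r in rewards]

# Hypothetical rewards for 4 responses sampled from one prompt.
advs = group_relative_advantages([0.2, 0.8, 0.5, 0.9])
print(advs)  # responses above the group mean get positive advantage
```

These advantages then weight the policy-gradient update in place of a critic's value estimate, which is what makes the method cheap enough for online RLHF at this scale.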
Performance and Benchmarks:
The model demonstrates competitive performance, particularly excelling in Korean-language inputs and outperforming similarly sized open-source models in relevant benchmarks.
- Text Benchmarks: While HyperCLOVAX-SEED-Vision-Instruct-3B shows slightly lower scores on KMMLU, HAE-RAE, CLiCK, and KoBEST than its text-base counterpart (HyperCLOVAX-SEED-Text-Base-3B), it still performs competitively with, or surpasses, other instruct models such as Qwen2.5-3B-instruct and gemma-3-4b-it on certain Korean benchmarks (e.g., HAE-RAE).
- Vision Benchmarks: The model uses 1,856 tokens and 108 frames for video input. It achieves an overall score of 59.54 across 9 benchmarks (4 image, 5 video), spanning Korean-specific datasets such as KoNet (Ko), Korean VisIT-Bench (Ko), VideoMME (Ko), NAVER-TV-CLIP (Ko), and VideoChatGPT (Ko), as well as English benchmarks such as PerceptionTest (En), ActivityNet-QA (En), MMBench-Val (En), and TextVQA-Val (En). It shows strong performance on Korean VisIT-Bench (79.2) and KoNet (81.8), and generally performs comparably to or better than other 3B-4B models such as Qwen-2.5-VL-3B (when token count is constrained), Gemma-3-4B, InternVL2-2B, InternVL2-4B, and InternVL2-8B across various vision tasks. For optimal image understanding, supplying additional information such as Optical Character Recognition (OCR) results and entity recognition (Lens) is recommended.
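The per-frame token budget implied by the video figures quoted above (1,856 tokens across 108 frames) works out to roughly 17 tokens per frame, which is the efficiency lever the training section describes:

```python
# Per-frame token budget implied by the benchmark configuration above:
# 1,856 video tokens spread across 108 sampled frames.
VIDEO_TOKENS = 1856
FRAMES = 108

tokens_per_frame = VIDEO_TOKENS / FRAMES
print(f"{tokens_per_frame:.2f} tokens/frame")
```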
Deployment: The model supports vLLM engine integration for faster inference, with specific instructions provided for setting up the API server and performing offline inference.
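The exact setup instructions are not reproduced here. As an illustration only: vLLM's API server exposes an OpenAI-compatible chat-completions endpoint, so a multimodal request for this model might be shaped as below. The endpoint URL, port, image URL, and `max_tokens` value are placeholder assumptions, not taken from the card:

```python
import json

# Hypothetical request body for a vLLM OpenAI-compatible server, e.g. one
# started with: vllm serve naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B
payload = {
    "model": "naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B",
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in Korean."},
                # Placeholder image URL for illustration only.
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            ],
        }
    ],
    "max_tokens": 256,
}

body = json.dumps(payload)
# POST `body` to http://localhost:8000/v1/chat/completions with any HTTP client.
print(body[:60])
```

For offline (non-server) inference, vLLM's Python API (`from vllm import LLM`) can load the same model name directly; consult the model card's own instructions for the supported flags.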