BAAI/bge-code-v1 · Hugging Face
Key Points
- BGE-Code-v1 is an LLM-based code embedding model designed for comprehensive retrieval across code, text, and multilingual contexts, supporting natural language queries in English and Chinese, plus 20 programming languages.
- The model demonstrates superior code retrieval, robust text retrieval comparable to specialized text embedding models, and extensive multilingual capabilities including English, Chinese, Japanese, and French.
- BGE-Code-v1 achieves state-of-the-art performance on both the CoIR and CodeRAG benchmarks, showcasing its effectiveness in various code and natural language retrieval tasks.
BAAI/bge-code-v1 is a state-of-the-art, LLM-based code embedding model developed by FlagEmbedding, designed for versatile retrieval tasks including code retrieval, text retrieval, and multilingual retrieval. The model is built upon a Qwen2 architecture, as suggested by its associated tags.
Core Capabilities:
- Superior Code Retrieval Performance: It excels at retrieving code snippets given natural language queries, supporting both English and Chinese queries across 20 programming languages.
- Robust Text Retrieval Capabilities: Despite its specialization in code, it maintains strong general text retrieval performance, comparable to dedicated text embedding models of similar scales.
- Extensive Multilingual Support: The model demonstrates proficiency in multilingual retrieval, with strong performance observed in languages such as English, Chinese, Japanese, and French.
Core Methodology:
The BGE-Code-v1 model leverages an instruction-tuned large language model architecture to generate embeddings. The key technical aspects include:
- Instruction Tuning: To enhance retrieval performance and task specificity, the model is fine-tuned to accept an explicit `instruction` (task description) alongside the query. This instruction provides context about the retrieval task, guiding the model to generate more semantically aligned embeddings; in the standard query format, the instruction is prepended to the query text. For documents (the corpus), typically no instruction is provided, or a generic one is implicitly used during training. This approach helps the model understand the relationship between queries and documents in various retrieval scenarios.
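As an illustrative sketch of instruction-conditioned queries: the helper name `format_query` and the exact template string below are assumptions, not the model card's verbatim format; the point is simply that the task description is prepended to the raw query before embedding.

```python
def format_query(task_description: str, query: str) -> str:
    # Hypothetical template: the model card defines the exact string.
    # The key idea is to prepend the retrieval instruction to the query
    # so the encoder produces task-aligned embeddings.
    return f"<instruct>{task_description}\n<query>{query}"

query = format_query(
    "Given a natural language question, retrieve relevant code snippets.",
    "How do I reverse a linked list in Python?",
)
```

Documents, by contrast, would be encoded without any instruction prefix, matching the asymmetric query/corpus treatment described above.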
- Embedding Generation (Pooling Strategy): The model processes input texts (queries or documents) through its transformer layers and derives a fixed-size embedding vector for each sequence as follows:
- Last Token Pooling: The embedding for a sequence is extracted from the hidden state of the last meaningful token (often the `[EOS]` token, or the last non-padding token). This is implemented by a `last_token_pool(last_hidden_states, attention_mask)` function. Given `last_hidden_states` of shape (batch size, sequence length, hidden dimension) and `attention_mask` of shape (batch size, sequence length):
- If left-padding is used (i.e., padding tokens are at the beginning), the hidden state at the final position, `last_hidden_states[:, -1]`, is used for every sequence.
- Otherwise (right-padding), the method computes each example's actual sequence length from the attention mask and selects the hidden state of the corresponding last non-padding token.
- L2 Normalization: The extracted embedding vectors are then L2-normalized to unit norm, e = h / ||h||₂.
This normalization is crucial for cosine similarity calculations, as it ensures that the similarity depends solely on the angle between vectors, not their magnitude.
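The pooling logic above can be sketched as follows. This is an illustrative NumPy version of the behavior described (the model's actual implementation operates on PyTorch tensors); the function name mirrors the `last_token_pool` mentioned above.

```python
import numpy as np

def last_token_pool(last_hidden_states, attention_mask):
    """Select the hidden state of each sequence's last non-padding token.

    last_hidden_states: (batch, seq_len, hidden_dim)
    attention_mask:     (batch, seq_len), 1 for real tokens, 0 for padding
    """
    batch_size = last_hidden_states.shape[0]
    # Left padding: the final position holds a real token for every sequence.
    if attention_mask[:, -1].sum() == batch_size:
        return last_hidden_states[:, -1]
    # Right padding: index each sequence at its last real-token position.
    sequence_lengths = attention_mask.sum(axis=1) - 1
    return last_hidden_states[np.arange(batch_size), sequence_lengths]

# Toy batch: sequence 0 has 2 real tokens (right-padded), sequence 1 has 3.
hidden = np.arange(12, dtype=float).reshape(2, 3, 2)
mask = np.array([[1, 1, 0], [1, 1, 1]])
pooled = last_token_pool(hidden, mask)  # shape (2, 2)
```

The left-padding shortcut works because with left padding every sequence ends at the same (final) position, so a single slice suffices.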
- Similarity Calculation: For retrieval, cosine similarity measures the semantic relatedness between a query embedding q and a document embedding d.
Given that the embeddings are already L2-normalized, the cosine similarity simplifies to the dot product: score = q · d.
In the provided examples, the scores are sometimes scaled by 100, which is a common practice for displaying similarity scores.
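A minimal sketch of this scoring step, assuming raw (unnormalized) embedding vectors as input: after L2 normalization, a single matrix product yields all query–document cosine similarities, optionally scaled by 100 for display.

```python
import numpy as np

def l2_normalize(x):
    # Divide each row vector by its L2 norm so dot products equal cosines.
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

queries = l2_normalize(np.array([[1.0, 2.0, 3.0]]))
docs = l2_normalize(np.array([[1.0, 2.0, 3.0],
                              [3.0, -1.0, 0.5]]))

# (num_queries, num_docs) similarity matrix; x100 scaling for readability.
scores = (queries @ docs.T) * 100
```

Because both sides are unit-norm, no separate normalization is needed inside the scoring loop, which keeps large-scale retrieval to a single matrix multiplication.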
Performance and Evaluation:
BGE-Code-v1 achieves state-of-the-art results on prominent code retrieval benchmarks:
- CoIR (Code-oriented Information Retrieval): The model achieves an average score of 81.77, outperforming previous models like CodeXEmbed-2B (75.65), CodeXEmbed-7B (78.20), Voyage-Code-002 (56.26), and Voyage-Code-003 (78.53). It demonstrates particularly strong performance on tasks like Apps (98.08), CSN-CCR (98.30), and CodeFeedBack-MT (94.38).
- CodeRAG (Code Retrieval Augmented Generation): BGE-Code-v1 obtains an average score of 72.8, surpassing SFR (67.0), Jina-v2-code (65.4), CodeXEmbed-2B (64.6), and Voyage-Code-002 (63.7). Notable results include 100.0 on HumanEval, 99.2 on MBPP, 40.9 on DS-1000, 36.1 on ODEX, 93.1 on RepoEval, and 67.4 on SWE-bench-Lite.
The model demonstrates significant advancements in code and multilingual retrieval by combining an LLM foundation with instruction-tuning and specific pooling strategies.