GitHub - workdd/LLM_Foreign_Block: Code implementation that blocks foreign-language token generation in LLMs
workdd
2025.03.22
#LLM #TokenGeneration #LogitProcessing #LanguageModel

Key Points

  • This repository introduces a method to prevent Large Language Models (LLMs) from generating specific foreign-language tokens by adjusting logit values during inference.
  • The approach identifies tokens corresponding to languages such as Chinese, Japanese, and Russian based on their Unicode ranges, then sets their generation probabilities to negative infinity.
  • Implementations are provided for Transformers and vLLM, with a noted performance impact: the first token generation (TTFT) slows down significantly, so a warm-up phase is beneficial.

This repository presents a method and implementation to prevent Large Language Models (LLMs) from generating tokens corresponding to specific foreign languages by adjusting their logit values during inference. The core objective is to restrict LLM output to desired languages, demonstrated here by excluding Chinese, Japanese, and Russian.

The methodology involves three main steps:

  1. Tokenization: The input text is tokenized using the model's tokenizer to obtain token IDs.
  2. Foreign Token Identification: Specific Unicode ranges are defined for the target foreign languages. For instance:
    • Chinese (CJK Unified Ideographs and extensions): [0x4E00, 0x9FFF], [0x3400, 0x4DBF], [0x20000, 0x2A6DF], [0xF900, 0xFAFF].
    • Japanese (Hiragana, Katakana, extensions): [0x3040, 0x309F], [0x30A0, 0x30FF], [0x31F0, 0x31FF].
    • Russian (Cyrillic): [0x0400, 0x04FF], [0x0500, 0x052F].
Tokens whose corresponding Unicode characters fall within these defined ranges are identified as foreign.
  3. Logit Processing: For each identified foreign token, its raw prediction score (logit) is set to negative infinity ($-\infty$).
In the context of a softmax function, which converts logits $l_i$ into probabilities $p_i$ via the formula
$$p_i = \frac{e^{l_i}}{\sum_{j=1}^{N} e^{l_j}},$$
setting $l_k = -\infty$ for a foreign token $k$ makes $e^{l_k} = e^{-\infty} = 0$, so the probability $p_k = 0$ and the model can never select or generate that foreign token. This logit manipulation is typically implemented as a custom LogitsProcessor within the LLM inference pipeline.
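The three steps above can be sketched in a few lines of PyTorch. The Unicode ranges match the repository's table; the function names (`contains_foreign`, `build_foreign_mask`, `block_foreign`) are illustrative, not the repository's actual API:

```python
import torch

# Unicode ranges to block (from the ranges listed above)
FOREIGN_RANGES = [
    (0x4E00, 0x9FFF), (0x3400, 0x4DBF), (0x20000, 0x2A6DF), (0xF900, 0xFAFF),  # Chinese
    (0x3040, 0x309F), (0x30A0, 0x30FF), (0x31F0, 0x31FF),                      # Japanese
    (0x0400, 0x04FF), (0x0500, 0x052F),                                        # Russian (Cyrillic)
]

def contains_foreign(text: str) -> bool:
    """True if any character of the token string falls in a blocked range."""
    return any(lo <= ord(ch) <= hi for ch in text for lo, hi in FOREIGN_RANGES)

def build_foreign_mask(vocab: dict, vocab_size: int) -> torch.Tensor:
    """Boolean mask over the vocabulary: True marks token IDs to block.
    Real byte-level BPE tokenizers need each token decoded to text first."""
    mask = torch.zeros(vocab_size, dtype=torch.bool)
    for token, token_id in vocab.items():
        if contains_foreign(token):
            mask[token_id] = True
    return mask

def block_foreign(logits: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Set blocked logits to -inf; softmax then assigns them probability 0."""
    logits = logits.clone()
    logits[..., mask] = float("-inf")
    return logits
```

After `block_foreign`, `torch.softmax` assigns exactly zero probability to every masked token, so neither greedy decoding nor sampling can ever emit one.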

The implementation includes:

  • blocker_torch.py: A Torch Tensor-based implementation for identifying and blocking foreign tokens based on the defined Unicode ranges.
  • transformers_logit_processed.py: Demonstrates the application of this LogitsProcessor within the Hugging Face Transformers inference framework.
  • vllm_logit_processed.py: Shows integration with the vLLM serving framework for optimized inference.
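A minimal processor in the shape Hugging Face expects could look like the following sketch. `ForeignTokenBlocker` is a hypothetical name; in practice `transformers_logit_processed.py` would subclass `transformers.LogitsProcessor` and pass it to `model.generate` via a `LogitsProcessorList`, while the plain callable below merely mirrors that `__call__(input_ids, scores)` signature so it runs without loading a model:

```python
import torch

class ForeignTokenBlocker:
    """Mirrors transformers.LogitsProcessor.__call__(input_ids, scores).
    In real use, subclass transformers.LogitsProcessor instead."""

    def __init__(self, foreign_token_ids):
        # Precomputed once; scanning the vocab for these IDs is the
        # one-time cost behind the initial TTFT spike noted below.
        self.foreign_token_ids = torch.tensor(foreign_token_ids, dtype=torch.long)

    def __call__(self, input_ids: torch.Tensor, scores: torch.Tensor) -> torch.Tensor:
        # scores has shape (batch, vocab_size); block the foreign columns.
        scores[:, self.foreign_token_ids] = float("-inf")
        return scores

# Hypothetical wiring into generate():
# from transformers import LogitsProcessorList
# outputs = model.generate(
#     **inputs,
#     logits_processor=LogitsProcessorList([ForeignTokenBlocker(foreign_ids)]),
# )
```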

Performance analysis indicates that the LogitsProcessor introduces a noticeable one-time overhead. For a Qwen2.5-7B-Instruct model, the Time To First Token (TTFT) increases significantly on the *initial* generate call (e.g., from ~100 ms to ~1500 ms). This is attributed to the one-time scan over the entire vocabulary to build the foreign-token mask. Subsequent token generations and later generate calls do not incur this overhead and maintain high tokens per second (TPS). The repository therefore recommends a warm-up phase when deploying this solution to absorb the initial TTFT latency.
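One way to realize that warm-up, assuming the expensive part is the one-time mask construction (class and function names are hypothetical, not the repository's), is to build the mask lazily and trigger it once at startup with dummy logits:

```python
import torch

class LazyMaskBlocker:
    """Builds the foreign-token mask on the first call and caches it, so
    only the warm-up call pays the full-vocabulary scan behind the TTFT spike."""

    def __init__(self, build_mask):
        self._build_mask = build_mask  # zero-arg callable returning a bool tensor
        self._mask = None

    def __call__(self, input_ids: torch.Tensor, scores: torch.Tensor) -> torch.Tensor:
        if self._mask is None:               # one-time cost
            self._mask = self._build_mask()
        scores[:, self._mask] = float("-inf")
        return scores

def warm_up(processor, vocab_size: int) -> None:
    """Run the processor once on dummy logits at deployment time,
    so the first real request sees normal TTFT."""
    processor(torch.zeros(1, 1, dtype=torch.long),
              torch.zeros(1, vocab_size))
```

After `warm_up`, the mask is cached and every real request reuses it, matching the observation that only the first generate call pays the cost.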

Experimental results demonstrate the effectiveness of the approach. When prompted with requests to generate Chinese text, an LLM with the LogitProcessor enabled instead produces responses in Korean or explains its inability to generate Chinese. Conversely, without the LogitProcessor, the LLM readily generates Chinese text.