GitHub - Marker-Inc-Korea/COT_steering: This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capability without additional training by leveraging Test-Time Scaling techniques.
Key Points
1. This paper introduces CoT Steering, an enhanced method for Chain-of-Thought reasoning without explicit prompting, which faithfully re-implements prior work and extends it with "steering tokens."
2. CoT Steering subtly guides the model's latent reasoning trajectory by injecting these tokens at the beginning of the assistant's response within standard chat templates, thereby narrowing the search space.
3. Evaluation on the Korean CSAT demonstrated significant performance gains, boosting a 32B-parameter model's score from 67 to 84 and showcasing improved reasoning capability and efficiency without additional training.
This paper introduces CoT Steering, a novel test-time scaling technique designed to enhance the latent reasoning capabilities of large language models (LLMs) without requiring additional training. It addresses the limitations of existing "Chain-of-Thought (CoT) reasoning without prompting" implementations by providing a faithful re-implementation of the method and extending it with "steering tokens."
The core methodology is predicated on the understanding that CoT reasoning is fundamentally a search problem. LLMs explore a solution space to find an optimal reasoning path. However, inherent biases from pretraining or fine-tuning often restrict this search to suboptimal regions. CoT Steering aims to recover richer reasoning trajectories by balancing search diversity with structured control.
The method operates by:
- CoT without Prompting as a Foundation: The paper re-implements the concept of CoT without prompting, which expands the model's search space by branching on the top-k tokens at decoding time. This implicitly explores latent reasoning paths without explicit verbalized prompts, in contrast to traditional CoT prompting. The authors emphasize correcting previous deviations related to decoding processes, aggregation mechanisms, and search semantics.
- Introducing Steering Tokens: The key innovation is the integration of "steering tokens." These tokens serve as a mechanism to explicitly condition the model's reasoning trajectory, thereby narrowing the search space in a controlled and deliberate manner. The purpose is to guide the model towards more reliable and structured CoT paths.
- Token-Level Steering via Chat Templates: The steering is applied at the token level, leveraging the autoregressive nature of LLMs to inject constraints directly into the decoding process. This is achieved by combining standard chat templates (e.g., alternating user/assistant turns) with the steering tokens. Specifically, these tokens are injected at the very beginning of the assistant's response. This subtly constrains the model's generation, allowing it to produce outputs as if they were its own responses, yet aligned with the intended reasoning trajectory. For unconstrained CoT decoding, the `STEERING_TOKEN` can simply be set to an empty string. While steering could in principle be applied in the latent space or via potential functions, the authors found no significant performance difference and opted for token-level steering for its greater flexibility and computational efficiency.
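The two mechanisms above can be sketched in a few lines of Python. This is an illustrative sketch only: the `<|user|>`/`<|assistant|>` template markers, the function names, and the toy logit handling are assumptions for demonstration, not the repository's actual API or the target model's real chat template.

```python
# Sketch of token-level CoT Steering and top-k first-token branching.
# All names and template markers here are illustrative assumptions.

def build_steered_prompt(user_message: str, steering_token: str = "") -> str:
    """Inject a steering token at the start of the assistant turn.

    The steering text is placed immediately after the assistant header,
    so the model continues it as if it were its own response. With
    steering_token = "" this reduces to unconstrained CoT decoding.
    """
    return (
        "<|user|>\n" + user_message + "\n"
        "<|assistant|>\n" + steering_token
    )


def top_k_first_tokens(next_token_logits: list[float], k: int) -> list[int]:
    """Rank candidate first tokens for CoT-without-prompting branching.

    Instead of committing to the greedy argmax, the decoder branches on
    the k highest-scoring first tokens and continues each branch,
    surfacing latent reasoning paths that greedy decoding would miss.
    """
    ranked = sorted(
        range(len(next_token_logits)),
        key=lambda i: next_token_logits[i],
        reverse=True,
    )
    return ranked[:k]
```

For example, `build_steered_prompt("Solve 12 * 7.", "Let me reason step by step.")` seeds the assistant turn with the steering text, after which ordinary autoregressive decoding takes over; no model weights or architecture are touched.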
This approach offers several advantages: it is prompt-agnostic, compatible with standard chat interfaces, compact, modular, and provides flexible control over the model's generation space without architectural modifications.
The effectiveness of CoT Steering was evaluated on the Korean language section of the 2025 Korean CSAT using the FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview model. Applying CoT Steering improved the model's score from a baseline of 67 to 84, demonstrating the potential of test-time reasoning modulation for high-stakes tasks. Notably, this gain was achieved with a 32B-parameter model competing effectively against much larger comparison baselines (ranging from 100B to 685B parameters), highlighting the efficiency and scalability of the method.