GitHub - hyunwoongko/solar-vs-glm-vs-phi: Solar vs GLM vs Phi
Key Points
- This paper refutes the claim that Solar-Open-100B is derived from GLM-4.5-Air, arguing that the high cosine similarity of Layernorm parameters, the basis of the original claim, is an unreliable indicator of model derivation.
- The author demonstrates that Layernorm parameters' initialization near 1.0 and low variance naturally lead to high cosine similarities even between unrelated models, a phenomenon corroborated by a controlled GPT-2 toy experiment.
- Furthermore, analyses using centered cosine similarity, Pearson correlation, and various absolute and relative difference metrics failed to show consistent evidence that Solar is uniquely closer to GLM than to other models such as Phi.
This paper critically examines the claim that Solar-Open-100B is derived from GLM-4.5-Air, which was based on observed cosine similarity between Layernorm parameters. The author argues that Layernorm parameter cosine similarity is not a reliable indicator of model derivation, providing a detailed analysis using multiple metrics and a controlled toy experiment.
The paper first refutes the original claim regarding Layernorm cosine similarity *within* a single model. The original assertion was that within the same model, Layernorm parameters from different layers exhibit low cosine similarity. This paper demonstrates the opposite: for Solar, GLM, and Phi models, Layernorm parameters across different layers (e.g., layers 10, 20, 30) exhibit very high cosine similarity, typically around 0.99. The discrepancy with the original claim is attributed to a specific comparison used in the original analysis: comparing input_layernorm of layer 0 with Layernorm parameters of later layers. The paper posits that input_layernorm in layer 0 is unique as it directly processes raw input embeddings, potentially absorbing low-level statistics like input scale, variance, or token frequency bias, making its parameter distribution distinct from those of later layers which process normalized hidden states. This distinction can lead to lower cosine similarity when compared to other layers. However, when comparing post_attention_layernorm from layer 0 (which processes normalized inputs) with post_attention_layernorm from later layers, similarly high cosine similarities (above 0.92) are observed, reinforcing that Layernorm parameters within the same model typically maintain high similarity across layers, especially if they are not the initial input_layernorm.
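The layer-0 effect described above can be illustrated with a minimal NumPy sketch. This is a synthetic simulation, not the repo's actual measurement: the hidden size, means, and standard deviations are assumed stand-ins for real Layernorm gain distributions, chosen only to show why a vector with distinct statistics (the layer-0 analog) yields lower cosine similarity against later layers than those layers do against each other.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4096  # hypothetical hidden size (illustrative assumption)

def cosine(x, y):
    """Plain cosine similarity between two 1-D vectors."""
    return float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))

# Later-layer Layernorm gains: tightly clustered around 1.0,
# mimicking the distributions the paper reports for layers 10/20/30.
ln10 = rng.normal(1.0, 0.05, d)
ln20 = rng.normal(1.0, 0.05, d)

# Layer-0 input_layernorm analog: shifted mean and larger spread,
# standing in for the distinct statistics of raw input embeddings.
ln0 = rng.normal(0.5, 0.3, d)

print(cosine(ln10, ln20))  # high, close to 1 (as reported across later layers)
print(cosine(ln0, ln10))   # noticeably lower (layer-0 is the outlier)
```

The point of the sketch is purely geometric: two vectors clustered near the same positive mean are nearly parallel, while shifting the mean and spread of one vector breaks that alignment, regardless of any training relationship.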
Next, the paper addresses the observation that Layernorm parameters of the *same layer across different models* (Solar vs. GLM, GLM vs. Phi) show high cosine similarity (above 0.9). While the original claim used this as evidence for derivation, this paper argues it is a false positive. The explanation offered is that Layernorm (or RMSNorm) weights are commonly initialized to 1.0 (e.g., `torch.ones`) and tend to maintain low variance and remain largely positive throughout training. This initial alignment of parameter vectors (pointing roughly along the all-ones vector $\mathbf{1}$) is largely preserved, resulting in high cosine similarity even if models are trained independently on different data. To support this, a toy experiment was conducted using four small GPT-2 models trained on simple arithmetic progressions. Two models had Layernorm parameters initialized to 1.0, and two had them initialized randomly. After training, the models with 1.0 initialization showed Layernorm cosine similarities between different models of approximately 0.999, while randomly initialized models showed near-zero cosine similarities. This experiment suggests that Layernorm weight cosine similarity strongly reflects the common initialization prior rather than shared training data or derivation.
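The initialization-prior argument can be reproduced without training any model at all. The following NumPy sketch (an illustrative assumption, not the repo's toy-experiment code; the dimension and noise scale are made up) samples two independent "trained" gain vectors that both started at a `torch.ones`-style init and drifted slightly, versus two vectors from a zero-mean random init:

```python
import numpy as np

rng = np.random.default_rng(1)
d = 4096  # hypothetical hidden size (illustrative assumption)

def cosine(x, y):
    """Plain cosine similarity between two 1-D vectors."""
    return float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))

# Two independently "trained" gain vectors sharing only the 1.0 init
# prior: each drifted a little, but both still point near all-ones.
a = 1.0 + rng.normal(0.0, 0.02, d)
b = 1.0 + rng.normal(0.0, 0.02, d)

# Two vectors from a zero-mean random init, as in the toy experiment's
# randomly initialized GPT-2 pair.
ra = rng.normal(0.0, 0.02, d)
rb = rng.normal(0.0, 0.02, d)

print(cosine(a, b))    # very high: the shared 1.0 prior dominates
print(cosine(ra, rb))  # near zero: no common direction to share
```

This matches the qualitative outcome the paper reports (≈0.999 for ones-initialized pairs, near zero for random pairs) using nothing but the shared offset, which is exactly the paper's point: the similarity measures the prior, not the training.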
To provide a more robust analysis, the paper introduces and applies several alternative metrics beyond simple cosine similarity:
- Centered Cosine Similarity: This metric, defined as $\frac{(x - \bar{x}) \cdot (y - \bar{y})}{\lVert x - \bar{x} \rVert \, \lVert y - \bar{y} \rVert}$ (with the mean subtracted element-wise), calculates cosine similarity after subtracting the mean from each vector. This removes the influence of common offsets (like all elements being near 1.0) and focuses on the similarity of relative patterns. When applied to Layernorm parameters across different models (Solar vs. GLM vs. Phi), centered cosine similarity values mostly dropped to near zero, indicating that the high conventional cosine similarities were primarily due to the common distribution characteristics (e.g., mean and scale) of Layernorm weights rather than identical relative patterns. However, within the same model, different layers still exhibited moderately high centered cosine similarity (though lower than 0.99), suggesting some degree of pattern similarity even after mean removal.
- Pearson Correlation Coefficient: Similar to centered cosine similarity, this metric also showed low values between Layernorm parameters of different models and higher values within the same model, reinforcing the centered cosine findings.
- Mean Absolute Difference (Mean L1 Distance): Defined as $\frac{1}{n}\sum_{i=1}^{n} \lvert x_i - y_i \rvert$, this metric measures the average element-wise absolute difference. For Layernorm parameters, the mean absolute difference between Phi and GLM was often smaller than between Solar and GLM, challenging the idea that Solar is more closely related to GLM.
- p99 Absolute Difference: This metric compares the 99th percentile of the element-wise absolute differences, $\mathrm{p99}(\lvert x_i - y_i \rvert)$. It is sensitive to differences in the "tail" of the weight distributions. Results showed that the p99 difference between Phi and GLM was frequently smaller than between Solar and GLM, again suggesting no consistently closer relationship between Solar and GLM.
- Relative L2 Distance: Defined approximately as $\frac{\lVert x - y \rVert_2}{\lVert y \rVert_2}$, this normalizes the L2 difference by the L2 norm of a reference vector, making comparisons fairer when absolute scales vary. For Layernorm parameters, the relative L2 distance indicated that Solar and GLM were often relatively far apart (e.g., 1.96 for a specific layer), while even Solar and Phi showed a meaningful distance (0.12). Crucially, for `k_proj` and `v_proj` parameters (large matrices with identical shapes), the relative L2 distance was significant between all model pairs, indicating distinct scales, in contrast to their small absolute differences.
- CV Difference (Coefficient of Variation Difference): Defined as $\lvert \mathrm{CV}(x) - \mathrm{CV}(y) \rvert$ with $\mathrm{CV}(v) = \sigma_v / \mu_v$, this measures the difference in relative dispersion. For all compared Layernorm parameters across Solar, GLM, and Phi, the CV difference was almost zero, suggesting that despite differences in absolute values or patterns, the statistical distribution shape (mean-normalized standard deviation) of Layernorm weights is highly consistent across these models.
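The metrics in the list above are all a few lines each. The following is an illustrative NumPy re-implementation applied to synthetic gain vectors, not the repository's actual code; the vector distributions and size are assumptions chosen to mirror two unrelated models whose Layernorm weights both cluster near 1.0:

```python
import numpy as np

def centered_cosine(x, y):
    """Cosine similarity after mean removal; equals the Pearson correlation."""
    xc, yc = x - x.mean(), y - y.mean()
    return float(xc @ yc / (np.linalg.norm(xc) * np.linalg.norm(yc)))

def mean_abs_diff(x, y):
    """Average element-wise absolute difference (mean L1 distance)."""
    return float(np.mean(np.abs(x - y)))

def p99_abs_diff(x, y):
    """99th percentile of the element-wise absolute differences."""
    return float(np.percentile(np.abs(x - y), 99))

def relative_l2(x, y):
    """L2 difference normalized by the L2 norm of the reference vector y."""
    return float(np.linalg.norm(x - y) / np.linalg.norm(y))

def cv_diff(x, y):
    """Difference in coefficient of variation (std / mean)."""
    cv = lambda v: v.std() / v.mean()
    return float(abs(cv(x) - cv(y)))

# Synthetic stand-ins for the Layernorm gains of two unrelated models:
# both cluster near 1.0, so plain cosine is high even though the
# pattern-sensitive metrics stay near zero.
rng = np.random.default_rng(2)
x = 1.0 + rng.normal(0.0, 0.05, 4096)
y = 1.0 + rng.normal(0.0, 0.05, 4096)

plain_cos = float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))
print(plain_cos)              # high despite independent sampling
print(centered_cosine(x, y))  # near zero: no shared relative pattern
print(cv_diff(x, y))          # near zero: same distribution shape
```

On these synthetic vectors the sketch reproduces the paper's qualitative pattern: plain cosine alone looks like strong evidence of relatedness, while the centered and dispersion-aware metrics reveal there is no shared structure beyond the common offset.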
In conclusion, the paper asserts that relying solely on Layernorm weight cosine similarity to infer model derivation is misleading. The high cosine similarities observed are likely artifacts of common initialization strategies and the inherent properties of Layernorm parameters (low variance, positive bias). Analysis using centered cosine similarity, Pearson correlation, mean absolute difference, p99 absolute difference, and relative L2 distance consistently failed to provide clear, uniform evidence that Solar-Open-100B is uniquely or consistently more similar to GLM-4.5-Air than to Phi-3.5-MoE-instruct, or vice versa, especially in a way that would suggest a derivation relationship. This suggests that the models' parameter spaces are distinct enough that a direct derivative relationship cannot be established through these parameter comparisons.