[Paper Review] Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment

2025.06.29 · Web · by Anonymous
#LLM · #Fine-tuning · #AI Ethics · #Decision Making · #Human Alignment

Key Points

  1. This research examines how large language models (LLMs) handle exceptions in decision-making, finding that LLMs tend to adhere strictly to policies, diverging from flexible human judgment.
  2. Comparing ethical-framework prompting, chain-of-thought prompting, and supervised fine-tuning (SFT), the study demonstrates that SFT on human explanations significantly improves LLM alignment with human decision-making.
  3. This SFT method allows LLMs to learn the underlying reasons for decisions, enabling them to generalize to new scenarios and offering valuable insights for developing more reliable AI systems.

This paper investigates how Large Language Models (LLMs) handle exception processing within complex decision-making scenarios, specifically evaluating the alignment of AI judgment with human judgment. The core problem addressed is the observed divergence between LLM and human responses when confronted with exceptions, highlighting a potential reliability issue for AI in real-world applications.

The research methodology centers on comparing three distinct approaches to guide LLM behavior in handling exceptions:

  1. Ethical Framework Prompting: This approach involves prompting LLMs to generate responses by explicitly leveraging moral decision-making principles such as Deontology, Consequentialism, or Virtue Ethics. The goal is to steer the LLM's judgment towards ethically grounded decisions. Technically, this entails designing prompts that integrate these philosophical frameworks, instructing the LLM to consider or articulate its decisions based on these defined ethical guidelines.
  2. Chain-of-Thought (CoT) Prompting: This method encourages the LLM to articulate explicit reasoning steps before arriving at a final decision. By inducing a step-by-step thinking process, the aim is to foster improved judgment and enhance the transparency of the decision-making rationale. From a technical standpoint, CoT prompting involves appending specific instructions to the input prompt, such as "Let's think step by step," compelling the model to output intermediate thought processes that lead to its conclusion.
  3. Supervised Fine-Tuning (SFT): This technique focuses on enhancing LLM performance by fine-tuning the model using a dataset that incorporates not only human decisions but, crucially, human explanations for those decisions. The objective is to teach the model the underlying rationale and "how" decisions are made, rather than merely "what" the decision is. Technically, this SFT process involves training the LLM on data where each instance includes the scenario, the human's decision, and the accompanying human justification or reasoning. This allows the model to learn the nuances of human judgment and generalize based on the provided explanations.
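To make the three approaches concrete, here is a minimal sketch of how each might be constructed. All names (`SCENARIO`, the `build_*` helpers, the prompt wording, and the SFT example format) are illustrative assumptions, not taken from the paper:

```python
# Hypothetical sketch of the three approaches compared in the review.
# The scenario text, helper names, and data format are illustrative.

SCENARIO = (
    "Store policy: no refunds without a receipt. "
    "A customer requests a refund; the item is clearly defective, "
    "but the receipt was lost."
)

def build_ethical_prompt(scenario: str, framework: str) -> str:
    """Ethical-framework prompting: ask the model to decide
    under an explicit moral theory (e.g. Deontology)."""
    return (
        f"Decide the following case using {framework} reasoning.\n"
        f"Case: {scenario}\n"
        "Answer Yes or No, and justify the decision under that framework."
    )

def build_cot_prompt(scenario: str) -> str:
    """Chain-of-thought prompting: elicit intermediate reasoning
    steps before the final answer."""
    return (
        f"Case: {scenario}\n"
        "Let's think step by step before giving a final Yes/No answer."
    )

def build_sft_example(scenario: str, decision: str, explanation: str) -> dict:
    """SFT instance: the training target includes the human
    explanation, not just the Yes/No decision label."""
    return {
        "prompt": f"Case: {scenario}\nShould an exception be granted?",
        "completion": f"{decision}. Reasoning: {explanation}",
    }

example = build_sft_example(
    SCENARIO,
    decision="Yes",
    explanation="The policy's intent is to prevent fraud; a visibly "
                "defective item satisfies that intent without a receipt.",
)
```

The key design difference is visible in `build_sft_example`: because the completion carries the human's reasoning, fine-tuning on such pairs trains the model on *why* an exception is granted, which is what the paper credits for generalization to unseen scenarios.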

For experimental design, researchers generated a diverse set of exception scenarios varying in "exception strength" (or level) and policy regulations, all presented within realistic business contexts for both human participants and LLMs. The evaluation process involved comparing the performance of the LLMs across these three approaches against human judgment using several metrics:

  • Baseline Refusal Rate Measurement: This involved quantifying the LLM's inherent tendency to "refuse" or deviate from the general policy in exception cases, comparing it directly with human refusal rates to understand baseline differences in judgment.
  • Ethical Framework Impact Assessment: The study measured the refusal rate of LLMs when guided by ethical frameworks, assessing whether these principles significantly altered their decision-making behavior.
  • CoT Prompting vs. SFT Evaluation: The efficacy of CoT prompting and SFT in bridging the gap between LLM and human decision-making was evaluated. A key finding was the significant improvement observed with SFT, especially when human explanations were integrated into the fine-tuning data.
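The baseline refusal-rate comparison above can be sketched as follows. The toy data and the `alignment_gap` metric are illustrative assumptions for demonstration, not the paper's actual numbers:

```python
# Hypothetical sketch of the baseline refusal-rate measurement.
# Decision labels and example data are illustrative, not from the paper.

def refusal_rate(decisions: list[str]) -> float:
    """Fraction of exception cases where the general policy is
    enforced, i.e. the exception request is refused."""
    refusals = sum(1 for d in decisions if d == "refuse")
    return refusals / len(decisions)

# Toy data mirroring the reported pattern: the LLM refuses
# exceptions more often than humans on the same scenarios.
llm_decisions   = ["refuse", "refuse", "refuse", "grant"]
human_decisions = ["grant", "refuse", "grant", "grant"]

llm_rate = refusal_rate(llm_decisions)      # 0.75
human_rate = refusal_rate(human_decisions)  # 0.25
alignment_gap = abs(llm_rate - human_rate)  # 0.5
```

A smaller `alignment_gap` after an intervention (ethical-framework prompting, CoT, or SFT) would indicate that the model's exception handling has moved closer to human judgment, which is the comparison the three evaluation metrics above operationalize.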

The principal findings revealed that LLMs generally tend to adhere strictly to policies, which often results in a lack of flexibility compared to human decision-making. While ethical-framework prompting did not yield significant improvements, supervised fine-tuning proved effective in aligning LLMs more closely with human judgment.

This alignment was attributed to the model's ability to learn the rationale behind decisions, moving beyond simple binary classifications ("Yes/No") to understand the underlying reasons. A crucial discovery was that fine-tuning with human explanations, rather than just decision labels, empowered the model to generalize effectively to new and unseen scenarios.

This research provides valuable insights into modeling human thought processes within AI systems and suggests concrete methods for developing more reliable AI decision-making capabilities in real-world environments. Future research directions include exploring real-world applicability and analyzing AI responses in iterative conversational contexts.