Paper

ES_Trading_Professional_Analysis | Kaggle

2026.02.01

·Service·by 이호민

#LLM#Finance#Trading#AI#Benchmark

Key Points

1This paper introduces a benchmark to evaluate large language models' (LLMs) ability to generate professional, risk-aware one-day-ahead long/short trading signals for E-mini S&P 500 futures using OHLCV data and chart images.
2The evaluation revealed that while all LLMs outperformed a buy-and-hold strategy in volatility and maximum drawdown, none surpassed it in total return or CAGR, indicating strong risk control but limited monetization of signals.
3The study concludes that LLMs exhibit genuine directional skill, with their return underperformance primarily stemming from conservative exposure management (structural limitation) rather than a lack of informational alpha, suggesting potential for improved performance with better signal scaling.

(0, 1]

Paper

2026.02.01

·Service·by 이호민

#LLM#Finance#Trading#AI#Benchmark

1This paper introduces a benchmark to evaluate large language models' (LLMs) ability to generate professional, risk-aware one-day-ahead long/short trading signals for E-mini S&P 500 futures using OHLCV data and chart images.
2The evaluation revealed that while all LLMs outperformed a buy-and-hold strategy in volatility and maximum drawdown, none surpassed it in total return or CAGR, indicating strong risk control but limited monetization of signals.
3The study concludes that LLMs exhibit genuine directional skill, with their return underperformance primarily stemming from conservative exposure management (structural limitation) rather than a lack of informational alpha, suggesting potential for improved performance with better signal scaling.

(0, 1]