Introducing GPT-5.3-Codex-Spark

2026.02.13
· Service · by 권준호
#AI #Codex #LLM #OpenAI #Real-time coding

Key Points

  1. OpenAI introduces GPT-5.3-Codex-Spark, an ultra-fast model optimized for real-time, interactive coding within the Codex app, designed to deliver over 1,000 tokens per second for near-instant responses.
  2. This model runs on Cerebras' Wafer Scale Engine 3, a purpose-built AI accelerator that significantly reduces end-to-end latency and improves responsiveness, complementing existing GPU infrastructure.
  3. Available as a research preview for ChatGPT Pro users, Codex-Spark is the first in a family of ultra-fast models aimed at blending real-time collaboration with longer-horizon agentic capabilities for software development.

OpenAI has introduced GPT-5.3-Codex-Spark, an ultra-fast real-time coding model that is a smaller, speed-optimized version of GPT-5.3-Codex. It is the first milestone in OpenAI's partnership with Cerebras and is designed to deliver over 1,000 tokens per second while maintaining high capability on practical coding tasks.

The core methodology for achieving this near-instantaneous performance combines model-level optimizations with significant infrastructure and hardware advancements. GPT-5.3-Codex-Spark is tuned specifically for speed and interactive work, which gives it a lightweight default working style: it makes minimal, targeted edits and does not run tests automatically unless explicitly asked.

Technically, the model's speed is realized through several synergistic components:

  1. Hardware Acceleration: Codex-Spark runs on the Cerebras Wafer Scale Engine 3 (WSE-3), a purpose-built AI accelerator designed for high-speed inference. This collaboration establishes a low-latency serving tier within OpenAI's production stack, complementing the general-purpose, cost-effective capabilities of GPUs. The WSE-3 allows for efficient, parallel processing optimized for the specific architectural needs of Codex-Spark to minimize inference latency.
  2. End-to-End Latency Improvements: Beyond model and hardware specifics, OpenAI implemented comprehensive optimizations across the entire request-response pipeline. This included streamlining response streaming between client and server, rewriting critical components of the inference stack, and reworking session initialization to accelerate the appearance of the first visible token.
  3. Network and API Optimizations: A significant improvement involved the introduction of a persistent WebSocket connection, which drastically reduces connection overhead for subsequent requests. Concurrently, targeted optimizations within the Responses API were implemented. These combined efforts resulted in an 80% reduction in overhead per client/server roundtrip, a 30% reduction in per-token overhead, and a 50% reduction in time-to-first-token. The WebSocket path, enabled by default for Codex-Spark, is slated for broader deployment across all models.
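The reported reductions can be illustrated with a toy latency model. The baseline numbers below are hypothetical, chosen only for illustration; just the percentage reductions (80% round-trip overhead, 30% per-token overhead, 50% time-to-first-token) come from the announcement:

```python
def request_time(roundtrip_overhead_ms, per_token_overhead_ms, ttft_ms, num_tokens):
    """Toy end-to-end latency model: connection overhead + time to
    first token + per-token overhead for each streamed token."""
    return roundtrip_overhead_ms + ttft_ms + per_token_overhead_ms * num_tokens

# Hypothetical baseline values (illustrative only, not OpenAI's measurements).
baseline = request_time(roundtrip_overhead_ms=200, per_token_overhead_ms=1.0,
                        ttft_ms=400, num_tokens=500)

# Apply the announced reductions: 80% less round-trip overhead (persistent
# WebSocket), 30% less per-token overhead, 50% lower time-to-first-token.
optimized = request_time(roundtrip_overhead_ms=200 * 0.2,
                         per_token_overhead_ms=1.0 * 0.7,
                         ttft_ms=400 * 0.5, num_tokens=500)

print(baseline, optimized)  # 1100.0 vs 590.0 ms in this toy example
```

Even with these made-up baselines, the sketch shows why the persistent connection matters most for short, frequent interactive requests, where fixed round-trip overhead dominates total latency.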

In terms of capability, despite being a small model optimized for fast inference, Codex-Spark demonstrates strong performance on agentic software engineering benchmarks such as SWE-Bench Pro and Terminal-Bench 2.0, completing tasks in a fraction of the time taken by its larger counterpart, GPT-5.3-Codex. Duration estimates for these benchmarks account for output generation time (output tokens ÷ sampling speed), prefill time (prefill tokens ÷ prefill speed), total tool execution time, and total network overhead.
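The duration estimate described above can be written as a small helper. The function and parameter names are my own; the token counts and speeds in the example are hypothetical:

```python
def estimate_duration(output_tokens, sampling_speed,
                      prefill_tokens, prefill_speed,
                      tool_time_s, network_overhead_s):
    """Estimated wall-clock benchmark duration in seconds:
    output generation + prefill + tool execution + network overhead."""
    return (output_tokens / sampling_speed
            + prefill_tokens / prefill_speed
            + tool_time_s + network_overhead_s)

# Hypothetical task: 10,000 output tokens, 50,000 prefill tokens at
# 20,000 tok/s, 30 s of tool execution, 5 s of network overhead.
fast = estimate_duration(10_000, 1_000, 50_000, 20_000, 30, 5)  # ~1,000 tok/s sampling
slow = estimate_duration(10_000, 100, 50_000, 20_000, 30, 5)    # ~100 tok/s sampling

print(fast, slow)  # 47.5 vs 137.5 seconds
```

Note how, in this sketch, a 10× faster sampling speed shrinks only the generation term; tool execution and network overhead become the dominant costs once sampling is fast enough.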

Codex-Spark features a 128k context window and is text-only at launch. It is rolling out as a research preview for ChatGPT Pro users via the Codex app, CLI, and VS Code extension, with separate rate limits due to its specialized hardware requirements. A limited API release is also available for design partners.

OpenAI's long-term vision for Codex includes blending the real-time, interactive capabilities of Codex-Spark with the long-running, autonomous task execution strengths of larger models, allowing for flexible sub-agent delegation or parallel task distribution. Safety evaluations, including cyber-relevant training, have been conducted, with the model deemed not to exceed Preparedness Framework thresholds for high capability in cybersecurity or biology.
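The envisioned blend of real-time and longer-horizon models could look roughly like the following dispatcher sketch. The model names match those in the article, but the routing logic, thresholds, and `Task` structure are entirely hypothetical, not an actual OpenAI API:

```python
from dataclasses import dataclass

@dataclass
class Task:
    description: str
    interactive: bool         # does the user expect near-instant feedback?
    estimated_minutes: float  # rough horizon of the task

def pick_model(task: Task) -> str:
    """Hypothetical dispatcher: route short, interactive work to the
    fast model and long-running autonomous work to the larger model."""
    if task.interactive and task.estimated_minutes < 5:
        return "gpt-5.3-codex-spark"
    return "gpt-5.3-codex"

print(pick_model(Task("rename a variable", True, 0.5)))   # gpt-5.3-codex-spark
print(pick_model(Task("migrate test suite", False, 90)))  # gpt-5.3-codex
```

In a real system the larger model might itself call `pick_model`-style logic to delegate quick sub-edits to Spark in parallel, which is the sub-agent pattern the article alludes to.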