Titans + MIRAS: Helping AI have long-term memory
Key Points
- Google introduces Titans, an architecture featuring a novel deep neural network long-term memory, and MIRAS, a unifying theoretical framework, designed to allow AI models to process massive contexts and adapt in real time.
- Titans updates its core memory by selectively incorporating "surprising" or novel information based on an internal gradient signal, ensuring efficient retention of critical data while actively running.
- The MIRAS framework reinterprets sequence models as associative memory, enabling the exploration of robust, non-Euclidean objectives, collectively leading to models with superior accuracy, lower perplexity, and long-context recall exceeding 2 million tokens.
The Titans architecture and MIRAS framework are introduced to enable AI models to process massive contexts and update core memory during active operation, addressing the limitations of traditional Transformers (whose attention cost grows quadratically with sequence length) and RNNs/SSMs (which compress history into a fixed-size state) in scaling to extreme sequence lengths.
Titans Architecture:
Titans introduces a novel neural long-term memory module, replacing fixed-size memory representations found in conventional RNNs with a deep neural network, specifically a Multi-Layer Perceptron (MLP). This MLP-based memory provides significantly enhanced expressive power to summarize large volumes of information without losing crucial context. The model actively learns to recognize and retain important relationships and conceptual themes across the entire input sequence.
A core mechanism in Titans is the "surprise metric," which drives selective memory updates. The metric quantifies how much a new input diverges from what the model's current memory state predicts. Technically, surprise is the magnitude of the internal error signal, interpreted as a gradient: if a new input deviates sharply from the memory's expectation, the gradient is large, signaling high surprise. This indicates that the information is novel, important, or anomalous, and should be prioritized for storage in the long-term memory module. Conversely, a small gradient (low surprise) means the input is already well predicted and need not be explicitly memorized in the permanent state. Titans thus updates its long-term memory only with expectation-breaking information, keeping memorization efficient.
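A minimal sketch of this gating idea, using a toy two-layer MLP memory in NumPy (the dimensions, ReLU nonlinearity, threshold, and learning rate are all illustrative assumptions, not the paper's configuration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy MLP memory mapping keys to values (dimensions are illustrative).
d_k, d_h, d_v = 8, 16, 8
W1 = rng.normal(0.0, 0.1, (d_h, d_k))
W2 = rng.normal(0.0, 0.1, (d_v, d_h))

def loss_surprise_grads(k, v):
    """Forward pass, squared-error loss, and its gradients.

    The gradient magnitude serves as the 'surprise' signal: it is large
    when the stored associations predict the new (key, value) pair poorly.
    """
    h = np.maximum(W1 @ k, 0.0)            # hidden activations (ReLU)
    err = W2 @ h - v                       # prediction residual
    loss = 0.5 * float(err @ err)
    gW2 = np.outer(err, h)                 # dloss/dW2
    dh = (W2.T @ err) * (h > 0)            # backprop through the ReLU
    gW1 = np.outer(dh, k)                  # dloss/dW1
    surprise = np.sqrt((gW1**2).sum() + (gW2**2).sum())
    return loss, surprise, gW1, gW2

# Selective update: memorize only when the gradient signal is large.
threshold, lr = 0.05, 0.1                  # illustrative values
k, v = rng.normal(size=d_k), rng.normal(size=d_v)
loss_before, surprise, gW1, gW2 = loss_surprise_grads(k, v)
if surprise > threshold:                   # novel/anomalous -> store it
    W1 -= lr * gW1
    W2 -= lr * gW2
```

A fresh random pair is poorly predicted, so its surprise exceeds the threshold and the gradient step writes it into the memory weights; replaying the same pair afterward yields a lower loss.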
Titans refines this selective update with two elements:
- Momentum: The update combines the instantaneous "momentary surprise" from the current input with "past surprise" carried over from the recent context flow. This ensures that tokens following a surprising event are still captured, even if they are not surprising on their own.
- Forgetting (Weight Decay): An adaptive weight decay mechanism acts as a retention gate to manage finite memory capacity. This regularization discards information that is no longer relevant, analogous to a forgetting process.
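The two refinements above can be sketched as a single update rule, with a momentum buffer blending past and momentary surprise and a decay factor acting as the retention gate (the symbols and coefficient values here are illustrative assumptions, not the paper's):

```python
import numpy as np

def memory_update(M, S, grad, eta=0.9, theta=0.1, alpha=0.01):
    """One Titans-style memory update (illustrative coefficients).

    S blends 'past surprise' (the running buffer) with 'momentary
    surprise' (the current gradient), so tokens that follow a surprising
    event still get memorized even when their own gradient is small.
    The (1 - alpha) factor is the forgetting/weight-decay retention gate.
    """
    S = eta * S - theta * grad        # past surprise + momentary surprise
    M = (1.0 - alpha) * M + S         # decay old memory, then write
    return M, S
```

With a zero gradient, M decays geometrically toward zero, which is the forgetting behavior; with eta > 0, a single large gradient keeps influencing several subsequent updates.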
MIRAS Framework:
MIRAS provides a unified theoretical framework for sequence modeling, positing that all effective sequence models are fundamentally complex associative memory modules. It reinterprets diverse architectures as different methods for efficiently combining new information with existing memories while retaining essential concepts. MIRAS defines a sequence model through four interdependent design choices:
- Memory Architecture: The structural representation of stored information (e.g., vector, matrix, or deep MLP as in Titans).
- Attentional Bias: The internal learning objective that the model optimizes to prioritize information for storage or recall.
- Retention Gate: A memory regularizer that balances the integration of new learning against the retention of past knowledge. Forgetting mechanisms are reinterpreted as specific forms of regularization.
- Memory Algorithm: The optimization algorithm (e.g., gradient-based optimizers) used to update the memory state.
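As a rough sketch, the four design axes can be expressed as pluggable components of a single update step (all four concrete choices below are simple stand-ins for illustration, not the paper's exact instantiations):

```python
import numpy as np

def mse_bias_grad(pred, v):
    """Attentional bias: gradient of the standard 0.5*||pred - v||^2."""
    return pred - v

def decay_retention(M, alpha=0.01):
    """Retention gate: weight decay as a simple memory regularizer."""
    return (1.0 - alpha) * M

def miras_step(M, k, v, bias_grad=mse_bias_grad,
               retention=decay_retention, lr=0.1):
    """One MIRAS-style update with the four design choices made explicit.

    Memory architecture: a matrix M storing key -> value associations.
    Attentional bias:    which error signal to minimize (bias_grad).
    Retention gate:      how past knowledge is regularized (retention).
    Memory algorithm:    plain gradient descent (the lr step).
    """
    M = retention(M)                        # forget a little
    err = bias_grad(M @ k, v)               # how wrong is the recall?
    return M - lr * np.outer(err, k)        # gradient step on the bias
```

Swapping `bias_grad` for a robust (e.g. Huber-style) gradient, or `retention` for a non-Euclidean regularizer, yields different points in the MIRAS design space without changing the update loop.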
Crucially, MIRAS transcends the common reliance on Mean Squared Error (MSE) or dot-product similarity for bias and retention in existing models. It offers a generative framework to explore a richer design space, enabling the development of novel architectures with non-Euclidean objectives and regularization.
Examples of MIRAS variants that leverage this extended design space include:
- YAAD: Employs a Huber loss for its attentional bias, making it less sensitive to outliers or major errors in the input data by using a gentler penalty function than MSE, thus increasing robustness to messy data. The Huber loss is defined as:

  $$L_\delta(e) = \begin{cases} \tfrac{1}{2}e^2 & \text{if } |e| \le \delta, \\ \delta\left(|e| - \tfrac{1}{2}\delta\right) & \text{otherwise,} \end{cases}$$

  where $e$ is the error and $\delta$ is a threshold.
- MONETA: Investigates the use of more complex and strict mathematical penalties, such as generalized norms, for both attentional bias and retention. This explores whether these disciplined rules can lead to more powerful and stable long-term memory systems.
- MEMORA: Achieves memory stability by enforcing its memory to behave as a strict probability map. This constraint ensures that memory state updates are controlled and balanced, guaranteeing a stable integration process for new information.
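To make the YAAD bullet concrete, here is the piecewise Huber penalty next to the squared error it replaces (the choice delta = 1.0 is arbitrary, for illustration):

```python
import numpy as np

def huber(e, delta=1.0):
    """Huber loss: quadratic for |e| <= delta, linear beyond it."""
    a = np.abs(e)
    return np.where(a <= delta, 0.5 * e**2, delta * (a - 0.5 * delta))

errors = np.array([0.5, 10.0])       # a small error and an outlier
mse = 0.5 * errors**2                # -> [0.125, 50.0]
hub = huber(errors)                  # -> [0.125, 9.5]
```

On the small error the two penalties agree, but the outlier's penalty grows linearly (9.5) rather than quadratically (50.0), which is exactly the robustness property described above.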
Experiments and Results:
Titans and MIRAS variants (YAAD, MONETA, MEMORA) were rigorously compared against state-of-the-art architectures like Transformer++, Mamba-2, and Gated DeltaNet across language modeling (C4, WikiText), zero-shot reasoning (HellaSwag, PIQA), genomic modeling, and time-series forecasting.
Key findings include:
- Higher Accuracy and Lower Perplexity: The proposed models consistently demonstrated improved accuracy and lower perplexity (a measure of model surprise) compared to baselines.
- Crucial Memory Depth: Ablation studies confirmed that the depth of the neural memory module is crucial: deeper memory MLPs achieve lower perplexity and scale better as sequence length increases.
- Outperformance in Long Contexts: Titans significantly outperformed baselines, including larger models like GPT-4, on the BABILong benchmark, a task requiring reasoning over extremely long documents. It demonstrated the capability to scale effectively to context window sizes exceeding 2 million tokens.
- Efficiency: Despite enhanced capabilities, these models maintain efficient, parallelizable training and fast linear inference speeds.
In conclusion, Titans and MIRAS represent a significant advancement in sequence modeling by introducing deep neural networks as dynamic memory modules that learn and update online. MIRAS provides a powerful theoretical unification, linking online optimization, associative memory, and architectural design, moving beyond standard Euclidean paradigms to enable a new generation of efficient and expressive long-context AI models.