Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning | NVIDIA Technical Blog
Key Points
- Nemotron 3 Super is an open, 120B-total/12B-active-parameter hybrid Mamba-Transformer Mixture-of-Experts (MoE) model designed to tackle the "thinking tax" and "context explosion" in agentic AI systems.
- It introduces architectural innovations including a hybrid Mamba-Transformer backbone for a 1M-token context, Latent MoE for efficient expert utilization, multi-token prediction for faster generation, and native NVFP4 pretraining for optimized performance.
- The model achieves leading accuracy on agentic benchmarks like PinchBench and is fully open-sourced with weights, datasets, and recipes to enable easy customization, optimization, and deployment.
Nemotron 3 Super is an open, hybrid Mamba-Transformer Mixture-of-Experts (MoE) model designed for agentic AI systems, addressing limitations such as "context explosion" and "thinking tax." It is a 120B total parameter model with 12B active parameters, featuring a native 1M-token context window and aiming for maximum compute efficiency and accuracy for complex multi-agent applications.
The core methodology of Nemotron 3 Super incorporates several architectural innovations:
- Hybrid Mamba-Transformer Backbone: The model interleaves Mamba-2 layers with Transformer attention layers. Mamba-2 layers handle the majority of sequence processing in linear time with respect to sequence length, keeping the memory footprint low enough to make the 1M-token context window practical. Transformer attention layers are interleaved at key depths to preserve precise associative recall, which is crucial for retrieving specific facts within long contexts and is an area where pure state space models (SSMs) struggle. This hybrid approach delivers higher throughput and 4x better memory and compute efficiency.
- Latent MoE: Unlike standard MoE architectures, where tokens are routed directly from the full hidden dimension, Latent MoE projects token embeddings into a compressed, low-rank latent space *before* routing decisions. Expert computation then occurs in this smaller dimension, and results are projected back to the full model dimension. Because tokens are compressed before they reach the experts, the model can consult 4x as many expert specialists for the same inference cost, enabling finer-grained specialization (e.g., distinct experts for Python syntax vs. SQL logic).
- Multi-Token Prediction (MTP): Instead of predicting one token at a time, Super is trained with MTP, where specialized prediction heads forecast multiple future tokens simultaneously from each position. This design uses a shared-weight approach across all MTP heads, minimizing parameter overhead and improving training stability. During training, MTP forces the model to internalize longer-range structure and logical dependencies, leading to stronger reasoning. At inference, it provides built-in speculative decoding, offering draft predictions that can be verified in parallel, resulting in up to 3x wall-clock speedups for long sequence generation.
- Native NVFP4 Pretraining: Most quantized models are compressed after full-precision training. Nemotron 3 Super, however, runs the majority of floating-point multiply-accumulate operations during pretraining directly in NVFP4, NVIDIA's 4-bit floating-point format optimized for Blackwell. This native reduced-precision training allows the model to learn accuracy within 4-bit arithmetic constraints from the first gradient update, significantly cutting memory requirements and speeding up inference by 4x on NVIDIA B200 compared to FP8 on NVIDIA H100, while maintaining accuracy and ensuring mathematical stability.
The model's training pipeline involves three sequential phases:
- Pretraining: Conducted on 25 trillion tokens (including 10 trillion unique curated tokens with additional compute focused on reasoning and coding) using native NVFP4.
- Supervised Fine-tuning (SFT): The model is fine-tuned on approximately 7 million SFT samples, drawn from a broader 40 million post-training corpus, covering reasoning, instruction following, coding, safety, and multi-step agent tasks. This establishes a behavioral foundation.
- Multi-environment Reinforcement Learning (RL): Post-training is performed using reinforcement learning across 21 diverse environment configurations in NVIDIA NeMo Gym and NeMo RL, involving over 1.2 million environment rollouts. This trajectory-based RL aligns the model with real agentic behavior: sequences of actions are evaluated against verifiable outcomes, reducing reasoning drift and improving the handling of structured operations common in agentic pipelines.
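The trajectory-based, verifiable-reward pattern behind the RL phase can be sketched as a rollout loop. The environment, policy, and reward function here are toy stand-ins, not the NeMo Gym or NeMo RL APIs:

```python
# Minimal sketch of trajectory-based RL with verifiable rewards: the agent
# takes a sequence of actions, and reward comes from an outcome checker
# rather than a learned preference model.
import random

random.seed(1)

def environment_task():
    # Toy verifiable task: an arithmetic prompt with a known answer.
    a, b = random.randint(1, 9), random.randint(1, 9)
    return {"prompt": f"{a}+{b}", "answer": a + b}

def policy(prompt):
    # Toy multi-step "agent": parse, compute, emit an action trajectory.
    a, b = map(int, prompt.split("+"))
    return [("parse", (a, b)), ("compute", a + b), ("answer", a + b)]

def rollout():
    task = environment_task()
    trajectory = policy(task["prompt"])
    final = trajectory[-1][1]
    reward = 1.0 if final == task["answer"] else 0.0  # verifiable outcome
    return trajectory, reward

rewards = [rollout()[1] for _ in range(100)]
```

Because the reward checks the whole trajectory's outcome rather than scoring individual tokens, the training signal matches how agents are actually judged in deployment.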
Nemotron 3 Super achieves leading accuracy on agentic benchmarks, scoring 85.6% on PinchBench. The model is fully open, providing weights on Hugging Face and NVIDIA NIM, complete training and evaluation recipes, deployment cookbooks (vLLM, SGLang, TensorRT LLM), fine-tuning cookbooks (LoRA/SFT, GRPO/DAPO), and open datasets (pretraining corpora, post-training datasets, RL tasks/environments). This open ecosystem aims to facilitate customization, optimization, and deployment by developers.