gpt-oss

2025.08.10
Web · by Anonymous
#LLM #OpenAI #Agent #Quantization #Ollama

Key Points

  • OpenAI has launched gpt-oss, a series of open-weight models (20B and 120B parameters) in partnership with Ollama, designed for powerful reasoning, agentic tasks, and versatile developer use cases.
  • These models feature agentic capabilities like function calling and web browsing, full chain-of-thought access, configurable reasoning effort, and are fine-tunable under a permissive Apache 2.0 license.
  • Utilizing MXFP4 quantization for their mixture-of-experts weights, the 20B model requires as little as 16GB memory, while the 120B model is optimized to fit on a single 80GB GPU.

This document details the OpenAI gpt-oss open-weight models, a collaboration with Ollama, designed for powerful reasoning, agentic tasks, and versatile developer use cases. The offering includes two primary models: a 20B parameter model and a 120B parameter model, both optimized for local deployment via Ollama.

The models are characterized by a 128K token context window and are primarily designed for text-based inputs. The 20B parameter model occupies approximately 14GB of memory, while the 120B parameter model requires around 65GB.

Key features of the gpt-oss models include:

  • Agentic Capabilities: Native support for function calling, web browsing (with optional built-in web search via Ollama), Python tool calls, and generation of structured outputs.
  • Full Chain-of-Thought: Provides complete access to the model's internal reasoning processes, enhancing debuggability and output trustworthiness.
  • Configurable Reasoning Effort: Allows users to adjust the computational effort for reasoning (low, medium, high) to balance performance with latency requirements.
  • Fine-tunability: Models are designed to be customizable through parameter fine-tuning for specific use cases.
  • Permissive Licensing: Distributed under the Apache 2.0 license, enabling unrestricted experimentation, customization, and commercial deployment without copyleft or patent restrictions.
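To make the configurable reasoning effort concrete, here is a minimal sketch of a request body for Ollama's local /api/chat endpoint. The convention of selecting effort through a "Reasoning: low|medium|high" system message follows OpenAI's harmony prompt format; whether your Ollama version exposes a dedicated setting instead is worth checking in the current documentation, so treat this payload shape as an assumption.

```python
import json

def build_chat_request(prompt: str, effort: str = "medium") -> dict:
    """Build a JSON body for POST http://localhost:11434/api/chat.

    The "Reasoning: <effort>" system message is an assumption based on
    OpenAI's harmony prompt format, not a guaranteed Ollama feature.
    """
    if effort not in ("low", "medium", "high"):
        raise ValueError(f"unknown reasoning effort: {effort}")
    return {
        "model": "gpt-oss:20b",
        "messages": [
            # Higher effort trades latency for a deeper chain of thought.
            {"role": "system", "content": f"Reasoning: {effort}"},
            {"role": "user", "content": prompt},
        ],
        "stream": False,
    }

body = build_chat_request("Why is the sky blue?", effort="high")
print(json.dumps(body, indent=2))
```

Posting this body to a locally running Ollama instance would return the model's reply along with its chain of thought, which the application can surface or hide as needed.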

A core methodological innovation for memory footprint reduction is the quantization of model weights. Specifically, the gpt-oss models undergo post-training quantization of their Mixture-of-Experts (MoE) weights to the MXFP4 format. This process quantizes the weights to 4.25 bits per parameter. Given that MoE weights constitute over 90% of the total parameter count, this quantization significantly reduces memory requirements. The 20B parameter model, after quantization, can operate on systems with as little as 16GB of memory, while the 120B parameter model is optimized to fit on a single 80GB GPU. Ollama provides native support for the MXFP4 format, eliminating the need for additional quantizations or conversions. New kernels have been developed within Ollama's engine to support MXFP4, with benchmarking performed in collaboration with OpenAI to ensure equivalent quality to their reference implementations.
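The 4.25-bits-per-parameter figure follows from the MX format itself: MXFP4 stores 4-bit values in blocks of 32 that share one 8-bit scale, i.e. 4 + 8/32 = 4.25 bits. A back-of-envelope estimate of the resulting footprint can be sketched as below; the 90% MoE share and 16-bit precision for the remaining weights are assumptions for illustration, not official figures (the larger model's MoE share is higher, which is why its real footprint lands nearer 65GB than this floor of 90% suggests).

```python
def mxfp4_footprint_gb(total_params: float, moe_share: float = 0.90) -> float:
    """Rough memory estimate for an MXFP4-quantized MoE model.

    Assumes `moe_share` of the parameters are MoE weights at 4.25 bits
    each (4-bit values plus one shared 8-bit scale per 32-element block:
    4 + 8/32 = 4.25) and the rest stay in 16-bit precision.
    """
    moe_bits = total_params * moe_share * 4.25        # quantized expert weights
    rest_bits = total_params * (1 - moe_share) * 16   # unquantized remainder
    return (moe_bits + rest_bits) / 8 / 1e9           # bits -> decimal GB

print(round(mxfp4_footprint_gb(20e9), 1))  # ~13.6, near the quoted ~14GB
```

Without quantization, the same 20B parameters at 16 bits would need roughly 40GB, so MXFP4 is what brings the model within reach of 16GB machines.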

The 20B parameter variant, gpt-oss:20b, is tailored for lower-latency, local, or specialized applications. The 120B parameter model, gpt-oss:120b, is designed for more demanding tasks requiring larger capacity. Both models can be accessed and run through the Ollama platform using commands like ollama run gpt-oss:20b or ollama run gpt-oss:120b.