Chain-of-Zoom
Paper


2025.06.08
· Web · by Anonymous
#Super-Resolution #AI #VLM #RLHF #Autoregression

Key Points

  1. Chain-of-Zoom (CoZ) is a novel, model-agnostic framework that enables extreme super-resolution (e.g., 256x) by autoregressively chaining a standard SR backbone, effectively decomposing the problem into tractable sub-problems.
  2. To overcome diminishing visual cues at high magnifications, CoZ augments each zoom step with multi-scale-aware text prompts generated by a Vision-Language Model (VLM).
  3. The prompt-extraction VLM is further fine-tuned using Group Relative Policy Optimization (GRPO) with a critic VLM and specific penalties to align the generated text guidance with human preferences.

Chain-of-Zoom (CoZ) is a novel, model-agnostic framework designed to overcome the scalability limitations of single-image super-resolution (SISR) models, enabling them to achieve extreme magnifications (e.g., beyond 256x) far exceeding their original training scale factors. Conventional SISR models, when pushed beyond their trained magnification (e.g., 4x), tend to produce blurry images and artifacts due to the diminishing visual cues at higher resolutions.

CoZ addresses this by factorizing the extreme super-resolution task into an autoregressive chain of intermediate scale-states. Instead of performing a single large upscale, CoZ repeatedly re-uses a pre-trained backbone SR model in an iterative prompt-and-upscale cycle. This decomposes the intractable conditional probability P(HR | LR_extreme) into a series of tractable sub-problems:
P(HR_N | LR_0) = P(HR_N | HR_{N-1}) × P(HR_{N-1} | HR_{N-2}) × ⋯ × P(HR_1 | LR_0)
Each step in this chain involves taking the current high-resolution output as the low-resolution input for the next step, gradually building up to the desired extreme magnification.
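The prompt-and-upscale cycle can be sketched as a simple loop. This is an illustrative skeleton only: `vlm_prompt` and `sr_step` are hypothetical placeholders standing in for the VLM prompt extractor and the pre-trained 4x SR backbone, neither of which is specified in code here; images are modeled as plain dicts tracking the cumulative scale.

```python
# Hypothetical sketch of the Chain-of-Zoom loop. `vlm_prompt` and
# `sr_step` are stand-ins for the VLM captioner and the pre-trained
# 4x SR backbone; an "image" here is just a dict tracking scale.

def vlm_prompt(image):
    """Placeholder for the multi-scale-aware VLM prompt extractor."""
    return f"description of the scene at {image['scale']}x magnification"

def sr_step(image, prompt, factor=4):
    """Placeholder for one pass of the pre-trained SR backbone,
    conditioned on the current image and its text prompt."""
    return {"scale": image["scale"] * factor, "prompt": prompt}

def chain_of_zoom(lr_image, steps=4):
    """Factorize one extreme upscale into `steps` tractable sub-problems:
    each iteration realizes one factor P(HR_n | HR_{n-1}, prompt)."""
    x = lr_image
    for _ in range(steps):
        prompt = vlm_prompt(x)   # describe the current scale-state
        x = sr_step(x, prompt)   # each output becomes the next input
    return x

out = chain_of_zoom({"scale": 1, "prompt": None}, steps=4)
print(out["scale"])  # 4 chained 4x steps -> 256x total magnification
```

Chaining four 4x steps this way reaches 256x without ever asking the backbone to exceed the scale factor it was trained for.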

A critical component of CoZ is the integration of multi-scale-aware text prompts. As the magnification increases and visual information becomes sparser, these semantic prompts, generated by a Vision-Language Model (VLM), provide crucial contextual guidance to the SR backbone. For each iterative upscale step, a VLM analyzes the current image (which acts as the low-resolution input for that step) and generates a descriptive text prompt. This prompt, along with the image, is then fed to the SR backbone, guiding it to produce a more semantically consistent and visually sharp higher-resolution output.

To ensure the generated text prompts are concise, relevant, and aligned with human preferences, the prompt-extraction VLM itself is fine-tuned through a novel Reinforcement Learning from Human Feedback (RLHF) pipeline built on Group Relative Policy Optimization (GRPO). In this setup, a critic VLM provides the reward signal, scoring the semantic quality and accuracy of the generated prompts. Additionally, specific penalty terms are introduced during GRPO training:

  1. Phrase-exclusion reward: This encourages the VLM to avoid generating predefined undesirable or inaccurate phrases.
  2. Repetition penalty: This penalizes redundant or repetitive words and phrases, promoting conciseness.
Through this GRPO-based fine-tuning, the prompt-extraction VLM learns to produce high-quality, actionable text guidance that prevents hallucinations and preserves semantic fidelity even at very high magnifications.
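The reward shaping described above can be illustrated with a minimal sketch. The banned-phrase list, weights, and critic score below are assumptions for illustration, not values from the paper; the group-relative normalization reflects how GRPO scores each sampled prompt against its sampling group rather than against a learned value baseline.

```python
# Illustrative GRPO-style reward shaping for the prompt-extraction VLM.
# The critic score, banned phrases, and weights are hypothetical.

def phrase_exclusion_reward(prompt, banned):
    """1.0 if no predefined undesirable phrase appears, else 0.0."""
    text = prompt.lower()
    return 0.0 if any(p in text for p in banned) else 1.0

def repetition_penalty(prompt):
    """Fraction of tokens that are duplicates, penalizing redundancy."""
    words = prompt.lower().split()
    return (len(words) - len(set(words))) / max(len(words), 1)

def total_reward(prompt, critic_score, banned, w_excl=0.5, w_rep=0.5):
    """Combine the critic VLM's score with both shaping terms
    (the weights here are assumed, not taken from the paper)."""
    return (critic_score
            + w_excl * phrase_exclusion_reward(prompt, banned)
            - w_rep * repetition_penalty(prompt))

def group_relative_advantages(rewards):
    """GRPO normalizes each sampled prompt's reward within its group."""
    mu = sum(rewards) / len(rewards)
    sd = (sum((r - mu) ** 2 for r in rewards) / len(rewards)) ** 0.5
    if sd == 0.0:
        return [0.0 for _ in rewards]
    return [(r - mu) / sd for r in rewards]

banned = ["blurry image", "low quality"]   # assumed exclusion list
print(total_reward("a sharp close-up of feathers", 0.8, banned))
```

In training, a group of candidate prompts would be sampled per image, scored with `total_reward`, and updated in proportion to their group-relative advantages, pushing the VLM toward concise, non-repetitive, critic-preferred descriptions.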

Experimental results demonstrate that a standard 4x diffusion-based SR model, when integrated into the CoZ framework, can effectively achieve magnifications beyond 256x, yielding images with superior perceptual quality and fidelity compared to direct one-step SR or nearest-neighbor interpolation. The effectiveness of the GRPO fine-tuning is validated by the convergence of the phrase-exclusion reward and repetition penalty to their desired values, along with a gradual increase in the critic reward, indicating improved prompt quality. Human preference studies (Mean-Opinion-Score tests) further confirm that GRPO-aligned VLM prompts lead to more human-preferred image and text generations.