• Controllable Generation in Diffusion and Flow-Based Models
Published:
Controllable generation is no longer a collection of isolated tricks built around text prompts or guidance scales. In modern diffusion and flow-based models, control can be imposed by reshaping the sampling dynamics, injecting external conditions, optimizing intermediate states, adapting concept carriers, or training native multimodal interfaces. This article develops a unified dynamical view of guided and controlled generation: from classifier guidance and classifier-free guidance to inversion-based editing, measurement guidance, ControlNet-style structural control, DreamBooth-style personalization, training-free identity injection, and instruction-following image editing. Across these methods, the central question remains the same: how can we steer a generative trajectory toward human-specified intent while preserving realism, diversity, editability, and consistency?
