AI / 大模型架构
扩散模型流程
正向加噪、反向去噪、时间步和噪声预测组成的扩散模型结构图。
完整 Prompt
A diffusion model figure showing both the forward (noising) and reverse (denoising) processes.
Top row — Forward process (left to right):
- A clean image x_0 progressively becomes Gaussian noise x_T over T=1000 steps.
- Show 5 representative timesteps as small image thumbnails: x_0 (clean), x_250, x_500, x_750, x_T (pure noise).
- Arrows between each pair labeled with the noise schedule q(x_t | x_{t-1}).
Bottom row — Reverse process (right to left):
- The model denoises x_T back toward x_0.
- At each step, a U-Net (epsilon_theta) predicts the noise from x_t and timestep t.
- Show the U-Net as a centered icon with conditioning inputs (timestep embedding, text prompt embedding).
Right side: a callout box explaining the loss "L = E[||epsilon - epsilon_theta(x_t, t)||^2]" rendered as cleaner LaTeX-style annotation.
Style: clean publication style, white background, minimal palette (navy, coral accent, gray), suitable for NeurIPS / ICML.立即试用此 Prompt适用场景
For generative-AI papers introducing diffusion variants (DDPM, DDIM, score-based, latent diffusion).
变体
Latent diffusion (Stable Diffusion style)
Same flow but operate in latent space. Add a VAE encoder mapping image -> latent at the start and a VAE decoder mapping denoised latent -> image at the end. The diffusion process happens entirely in the latent domain.
使用建议
- Always show both directions. Single-arrow diagrams make the process unclear.
- Use 5 thumbnails for the timestep grid; more than 5 becomes cramped.
- Annotate the loss explicitly — readers expect to see the training objective.
常见问题
How do I add classifier-free guidance?
Add a side branch where the U-Net is called twice (with and without conditioning) and outputs are linearly combined with a guidance scale w.
