Best EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generation alternatives.
Live source-backed alternatives to EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generation for Image generation. Alternatives are selected from the same task category and update whenever the best-of index rebuilds.
EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generation
Text-to-image diffusion models power everyday creative tasks, but they still reproduce the demographic biases in their training data. On common prompts such as ``a photo of a nurse,'' ``a photo of a CEO'', they skew their outputs toward one gender, driven by the statistics of training data rather than anything in the text. Existing debiasing methods show promise in narrow settings but require retraining, batch-level control, or prompt-specific tuning, limiting their scalability. We propose \emph{EquiSteer}, a training-free method that works per sample by steering cross-attention (CA) activations at inference time. For each target attribute, EquiSteer precomputes steering vectors from contrastive prompts. Then at generation time, a prompt-aware gate leaves attribute-specific prompts untouched, while for neutral ones it clears existing attribute signals from the CA activations and injects a target attribute. Across SD-1.5, SD-2.1, SDXL, and SANA, EquiSteer reduces the average parity gap by up to $87\%$, with minimal effect on image quality and text-image alignment. Code is available at \href{https://github.com/Atmyre/EquiSteer}{https://github.com/Atmyre/EquiSteer}.% cs.CV Text-to-image diffusion models power everyday creative tasks, but they still reproduce the demographic biases in their training data. On common prompts such as ``a photo of a nurse,'' ``a photo of a CEO'', they skew their outputs toward one gender, driven by the statistics of training data rather than anything in the text. Existing debiasing methods show promise in narrow settings but require retraining, batch-level control, or prompt-specific tuning, limiting their scalability. We propose \emph{EquiSteer}, a training-free method that works per sample by steering cross-attention (CA) activations at inference time. For each target attribute, EquiSteer precomputes steering vectors from contrastive prompts. Then at generation time, a prompt-aware gate leaves attribute-specific prompts untouched, while for neutral ones it clears existing attribute signals from the CA activations and injects a target attribute. Across SD-1.5, SD-2.1, SDXL, and SANA, EquiSteer reduces the average parity gap by up to $87\%$, with minimal effect on image quality and text-image alignment. Code is available at \href{https://github.com/Atmyre/EquiSteer}{https://github.com/Atmyre/EquiSteer}.% Research signal collected from arXiv metadata; Gemini enrichment can add a clearer summary. cs.CV
Replicate Official Models
Matched image generation, text-to-image, text to image; 3 source links; official model catalog signal; access model: Paid API
fal Model APIs
Matched image generation, text-to-image, text to image; 3 source links; official model catalog signal; access model: Paid API
| # | Alternative | Kind | Access | Fit | Why it appears | Source |
|---|---|---|---|---|---|---|
| 01 | Replicate Official Models | service | Paid API | RDR89 | Matched image generation, text-to-image, text to image; 3 source links; official model catalog signal; access model: Paid API | replicate.com |
| 02 | fal Model APIs | service | Paid API | RDR88 | Matched image generation, text-to-image, text to image; 3 source links; official model catalog signal; access model: Paid API | fal.ai |
| 03 | Runware Model API | service | Paid API | RDR87 | Matched image generation, text-to-image, text to image; 3 source links; official model catalog signal; access model: Paid API | runware.ai |
| 04 | DiffusionBench: On Holistic Evaluation of Diffusion Transformers | paper | Research-only | RDR78 | Matched image generation, text-to-image, text to image; 1 source link; access model: Research-only | arxiv.org |
| 05 | artokun/comfyui-mcp | repo | Open source | RDR78 | Matched image generation, text-to-image, text to image; 1 source link; access model: Open source; freshly updated | github.com |
| 06 | aqm857886159/Nomi | repo | Open source | RDR77 | Matched image generation, text-to-image, text to image; 1 source link; access model: Open source; freshly updated | github.com |
| 07 | Intermediate Text Representation Guided Text-to-Image Generation for Enhancing One-and-Only Alignment | paper | Research-only | RDR77 | Matched image generation, text-to-image, text to image; 1 source link; access model: Research-only; freshly updated | arxiv.org |
Track EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generation alternatives
Get private alerts when source-backed image generation alternatives, access signals, or comparison evidence change.
API and bulk access