Image Generation Models
Our directory lists 33 AI and NLP models for image generation, a machine-learning task. Browse the full list below, or explore models by provider.
Updated June 2026
- Wu Dao 2.0 | Image captioning, Chat, Image generation, Text-to-image, Language modeling/generation, Question answering, Visual question answering | Beijing Academy of Artificial Intelligence / BAAI
- GPT Image 2 | Image generation | OpenAI
- MAI-Image-2.5 | Image-to-image, Image generation | Microsoft
- Gemini 2.5 Flash Image (Nano Banana) | Image generation, Text-to-image, Image-to-image | Google
- gpt-image-1 | Image generation, Text-to-image | OpenAI
- GPT-4o (Mar 2025) | Chat, Image generation, Audio generation, Vision-language generation, Table tasks, Language modeling/generation, Question answering, Speech recognition (ASR), Speech-to-text | OpenAI
- GPT-4o (Jan 2025) | Chat, Image generation, Audio generation, Vision-language generation, Table tasks, Language modeling/generation, Question answering, Speech recognition (ASR), Speech-to-text | OpenAI
- GPT-4o (Nov 2024) | Chat, Image generation, Audio generation, Vision-language generation, Table tasks, Language modeling/generation, Question answering, Speech recognition (ASR), Speech-to-text | OpenAI
- GPT-4o (Aug 2024) | Chat, Image generation, Audio generation, Vision-language generation, Table tasks, Language modeling/generation, Question answering, Speech recognition (ASR), Speech-to-text | OpenAI
- GPT-4o | Chat, Image generation, Audio generation, Vision-language generation, Table tasks, Language modeling/generation, Question answering, Speech recognition (ASR), Speech-to-text | OpenAI
- GPT-4 Turbo (Apr 2024) | Chat, Language modeling/generation, Image generation, Speech synthesis, Table tasks, Visual question answering, Image captioning | OpenAI
- GPT-4 Turbo (Nov 2023) | Chat, Language modeling/generation, Image generation, Speech synthesis, Table tasks, Visual question answering, Image captioning | OpenAI
- ERNIE 4.0 | Chat, Language modeling/generation, Video generation, Image generation | Baidu
- Amazon Titan | Semantic search, Image generation, Language modeling/generation, Code generation, Chat, Text-to-image, Translation | Amazon
- Firefly | Image generation, Text-to-image | Adobe
- Projected GAN | Image generation | Heidelberg University
- Gemini 3 Pro Image (Nano Banana Pro) | Image generation | Google DeepMind
- Qwen Image | Image generation, Text-to-image, Image-to-image | Alibaba
- Imagen 4 | Image generation, Text-to-image | Google
- Infinity | Image generation, Text-to-image | ByteDance
- SeedEdit | Image generation | ByteDance
- Stable Diffusion 3 | Image generation, Text-to-image | Stability AI
- CTM (CIFAR-10) | Image generation, Text-to-image | Stanford University, Sony
- DiT-XL/2 + CADS | Image generation | ETH Zurich, Disney Research
- DALL·E 3 | Image generation, Text-to-image | OpenAI
- Stable Diffusion XL (SDXL) | Image generation, Text-to-image | Stability AI
- ImageBind | Image classification, Speech recognition (ASR), Image generation, Language modeling/generation | Meta AI
- DiT-XL/2 | Image generation | New York University (NYU), University of California (UC) Berkeley
- DDPM-IP (CelebA) | Image generation, Text-to-image | Utrecht University
- DiT-XL/2 + Discriminator Guidance | Image generation, Text-to-image | Korea Advanced Institute of Science and Technology (KAIST), NAVER
- Discriminator Guidance | Image generation | Korea Advanced Institute of Science and Technology (KAIST), NAVER
- AltCLIP_M9 | Language modeling/generation, Chat, Visual question answering, Image generation | Beijing Academy of Artificial Intelligence / BAAI
- eDiff-I | Image generation, Text-to-image | NVIDIA