Google DeepMind Models
There are 40 AI and NLP models from Google DeepMind in our directory. Browse the full list below, or explore models by task.
Models on this page come from Google DeepMind, the research lab behind Gemini, Gemma, and AlphaFold. We catalog 40 models from Google DeepMind.
Official siteUpdated June 2026
- Double DQNAtariGoogle DeepMind
- Gemini Flash 3.1 TTSAudio generationGoogle DeepMind
- Gemini 3.0 Flash-liteLanguage modeling/generationGoogle DeepMind
- Gemini 3.1 ProLanguage modeling/generationGoogle DeepMind
- Veo 2Video generation,Text-to-video,Image-to-videoGoogle DeepMind
- Gemini 3 ProLanguage modeling/generationGoogle DeepMind
- Veo 3.1Image-to-video,Video generation,Text-to-video,Audio generationGoogle DeepMind
- Gemini Robotics-ER 1.5Instruction interpretation,Robotic manipulation,Image captioning,Object detection,Search,Language modeling/generation,Question answering,Speech recognition (ASR)Google DeepMind
- Gemini 2.5 Pro (Jun 2025)Language modeling/generation,Question answering,Code generation,Quantitative reasoning,Visual question answering,Translation,Image captioning,Video description,Speech recognition (ASR)Google DeepMind
- Veo 3Video generation,Image-to-video,Text-to-videoGoogle DeepMind
- Gemini 2.5 Pro (May 2025)Language modeling/generation,Question answering,Code generation,Quantitative reasoning,Visual question answering,Translation,Image captioning,Video description,Speech recognition (ASR)Google DeepMind
- Gemini 2.5 Pro (Mar 2025)Language modeling/generation,Question answering,Code generation,Quantitative reasoning,Visual question answering,Translation,Image captioning,Video description,Speech recognition (ASR)Google DeepMind
- Gemini 2.0 ProCode generation,Language modeling/generation,Question answering,Visual question answering,Speech recognition (ASR),Video descriptionGoogle DeepMind
- Gemini-Exp-1114Language modelingGoogle DeepMind
- OpenVLARobotic manipulationStanford University,University of California (UC) Berkeley,Toyota Research Institute,Google DeepMind,Massachusetts Institute of Technology (MIT),Physical Intelligence
- AlphaFold 3Protein folding prediction,Antibody property prediction,Protein-ligand contact prediction,RNA structure prediction,Protein interaction predictionGoogle DeepMind,Isomorphic Labs
- AlphaGeometryGeometry,Mathematical reasoningGoogle DeepMind,New York University (NYU)
- FunSearchCode generationGoogle DeepMind
- GNoME for crystal discoveryCrystal discoveryGoogle DeepMind
- SIMA 2Google DeepMind
- Gemini 3 Pro Image (Nano Banana Pro)Image generationGoogle DeepMind
- Gemini 2.5 Deep ThinkLanguage modeling/generation,Mathematical reasoning,Code generation,Visual question answering,Question answering,Visual puzzles,Video description,Speech recognition (ASR),Speech-to-textGoogle,Google DeepMind
- Gemini EmbeddingSemantic embeddingGoogle DeepMind
- FGNWeather forecastingGoogle DeepMind
- AlphaProteoProtein generation,ProteinsGoogle DeepMind
- Table Tennis AgentSportsGoogle DeepMind
- GenCastWeather forecastingGoogle DeepMind
- Gemini 1.5 ProLanguage modeling,Visual question answeringGoogle DeepMind
- Gemini Nano-2Chat,Image captioning,Speech recognition (ASR)Google DeepMind
- Gemini Nano-1Chat,Image captioning,Speech recognition (ASR)Google DeepMind
- Gemini 1.0 UltraLanguage modeling,Visual question answering,Chat,TranslationGoogle DeepMind
- Gemini 1.0 ProLanguage modeling,Visual question answering,Chat,TranslationGoogle DeepMind
- GraphCastWeather forecastingGoogle DeepMind
- RT-TrajectoryRobotic manipulationGoogle DeepMind,University of California San Diego,Stanford University
- PaLI-3Visual question answering,Character recognition (OCR),Image captioningGoogle DeepMind,Google Research,Google Cloud
- RT-2-XRobotic manipulationGoogle DeepMind
- AlphaMissenseProtein pathogenicity prediction,Protein folding prediction,ProteinsGoogle DeepMind
- RT-2Robotic manipulationGoogle DeepMind
- Agile Soccer RobotAnimal (human/non-human) imitation,SportsGoogle DeepMind
- SigLIP 400MImage classification,Image embeddingGoogle DeepMind
What tasks do Google DeepMind models cover?
Language modeling/generation (9)Visual question answering (9)Speech recognition (ASR) (8)Image captioning (7)Question answering (6)Code generation (6)Robotic manipulation (5)Translation (5)Video description (5)Language modeling (4)Chat (4)Video generation (3)Text-to-video (3)Image-to-video (3)Quantitative reasoning (3)Weather forecasting (3)Audio generation (2)Protein folding prediction (2)