Gemini Robotics-ER 1.5

Google DeepMind | Instruction interpretation | Robotic manipulation | Image captioning | Object detection | Search | Language modeling/generation | Question answering | Speech recognition (ASR)

Gemini Robotics-ER 1.5 is an instruction interpretation model published by Google DeepMind in 2025.

About Gemini Robotics-ER 1.5

Our most capable vision-language model (VLM) reasons about the physical world, natively calls digital tools, and creates detailed, multi-step plans to complete a mission. This model now achieves state-of-the-art performance across spatial understanding...

Details

Provider: Google DeepMind
Task: Instruction interpretation, Robotic manipulation, Image captioning, Object detection, Search, Language modeling/generation, Question answering, Speech recognition (ASR)
Released: 2025-09-25
Open weights: No