Gemini, introduced by the Gemini Team at Google DeepMind (2023), was designed from the start to be multimodal, processing text, images, audio and video in one model. It ships in sizes (Ultra, Pro, Nano) spanning data-centre reasoning to on-device use, and reported state-of-the-art results across a broad benchmark suite on release.
Gemini is one of the four leading models MultipleChat queries and compares in parallel.