What is Gemini AI? Know everything about Google’s new AI Model

The most powerful AI model from Google has just been launched. Here’s a sneak peek into its features.

 
Google Gemini AI | Image: Shutterstock

What is Gemini AI?

Gemini AI is a recently released model from Google that understands not only text input but also images, video, and audio. As a multimodal model, Gemini is described by Google as capable of handling complex tasks in math, physics, and other areas. It can also understand and generate high-quality code in various programming languages.

It is now available through integrations with Google Bard and the Google Pixel 8 Pro, and Google plans to fold it into its other services over time.

Demis Hassabis, CEO and co-founder of Google DeepMind, said, "Gemini is the result of large-scale collaborative efforts by teams across Google, including our colleagues at Google Research." He added, "It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information, including text, code, audio, image, and video."

Who made Gemini AI?

Gemini was created by Google, a subsidiary of Alphabet, and released as the company's most advanced AI model. Google DeepMind, the company's AI research lab, played a major role in its development.

Different versions of Google's Gemini AI

Google has described Gemini as a flexible AI model that can run on everything from Google’s data centers to mobile devices. To scale across that range, Google released Gemini in three sizes: Gemini Nano, Gemini Pro, and Gemini Ultra.

Gemini Nano: 

Gemini Nano is the smallest size, designed to run on smartphones, starting with the Google Pixel 8 Pro. It is built for on-device tasks that need efficient AI processing without a connection to external servers.

Gemini Pro:

Gemini Pro runs in Google's data centers. It is designed to power the latest version of Google’s AI chatbot, Google Bard, offering fast response times and the ability to work through complex problems.

Gemini Ultra:

Gemini Ultra, the most capable model, is not yet available for widespread use. According to Google, it exceeds "current state-of-the-art results on 30 of the 32 widely-used academic benchmarks used in large language model (LLM) research and development."

Gemini Ultra is designed to solve highly complex tasks and will be released after the first phase of testing is finished.

How to access Google's Gemini AI:

Gemini is currently available in Google products in the Nano and Pro sizes: Nano on the Google Pixel 8 Pro and Pro in the Google Bard chatbot. Google plans to integrate Gemini over time into Search, Ads, Chrome, and other Google services.

Developers and enterprise customers will be able to access Gemini Pro via the Gemini API in Google AI Studio and Google Cloud Vertex AI starting on December 13. Android developers can access Gemini Nano through AICore, which will be available on an early preview basis.
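
For developers curious what a call to the Gemini API might look like, here is a minimal Python sketch that builds a text-only request without sending it. The endpoint URL, the `gemini-pro` model name, and the request body layout are assumptions based on the API's public REST interface at launch and may have changed; the helper name `build_gemini_request` is hypothetical, and `YOUR_API_KEY` is a placeholder for a key created in Google AI Studio.

```python
import json
import urllib.request

# Assumed base URL for the Gemini API's REST interface; verify it against
# the current API reference before relying on it.
API_BASE = "https://generativelanguage.googleapis.com/v1beta"


def build_gemini_request(prompt, api_key, model="gemini-pro"):
    """Build (but do not send) a text-only generateContent request.

    Hypothetical helper for illustration; not part of any Google SDK.
    """
    url = f"{API_BASE}/models/{model}:generateContent?key={api_key}"
    # Assumed body layout: a list of contents, each holding a list of parts.
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_gemini_request(
        "Explain multimodal AI in one sentence.", "YOUR_API_KEY"
    )
    print(req.get_method(), req.full_url)
    # Sending the request needs a real key and network access:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

The sketch stops short of sending anything, so it can be run and inspected without an API key; uncommenting the last lines would make the actual call.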

How is Google's Gemini AI different from other AI models, like GPT-4?

Google describes Gemini as its most advanced AI model to date, though that claim can only be judged once Gemini Ultra is released. Compared with models currently in use, Gemini stands out for being natively multimodal, whereas models such as GPT-4 rely on plugins and integrations for multimodal capabilities.

Published On: 7 December 2023 at 10:40 IST