Context: Google has launched its Gemini app in India, offering support for nine Indian languages and English, providing users with an AI-powered tool for answering queries via typing, voice, or image uploads.
Google Gemini
- About: It is a new multimodal AI model capable of comprehending and processing various formats simultaneously, such as text, code, audio, image, and video.
- Built: Gemini was built from the ground up to be multimodal, which means it can operate across and combine different types of information, including text, audio, image, code, and video.
- It can recognise images, speak in real-time, and even solve physics with remarkable ingenuity.
- Gemini 1.0 comprises 3 Models: Ultra, Pro, and Nano.
oGemini Ultra: It is Google’s most powerful LLM ever and is aimed at enterprise applications that will run it for “highly complex tasks.
oGemini Pro: It is the most general-purposed of the three and has already been plugged into Bard for prompts that require advanced reasoning, planning, and understanding.
oGemini Nano: It described as the most efficient model for on-device tasks, has been baked into the Pixel 8 Pro to process tasks like information summarisation and Smart Reply.
- Tensor Processing Units: Gemini 1.0 on its AI-optimised infrastructure using its in-house designed Tensor Processing Units (TPUs) v4 and v5e.
- Hindi Languages Available: Hindi, Bengali, Gujarati, Kannada, Malayalam, Marathi, Tamil, Telugu, and Urdu.
- Significance: It will help people access information and complete tasks in their preferred language.