GEMINI (Syllabus: GS Paper 3 – Sci and Tech)

News-CRUX-10 19th June 2024

Download PDF (English)

Context: Google has launched its Gemini app in India, offering support for nine Indian languages and English, providing users with an AI-powered tool for answering queries via typing, voice, or image uploads.

Google Gemini

About: It is a new multimodal AI model capable of comprehending and processing various formats simultaneously, such as text, code, audio, image, and video.
Built: Gemini was built from the ground up to be multimodal, which means it can operate across and combine different types of information, including text, audio, image, code, and video.
It can recognise images, speak in real-time, and even solve physics with remarkable ingenuity.
Gemini 1.0 comprises 3 Models: Ultra, Pro, and Nano.

oGemini Ultra: It is Google’s most powerful LLM ever and is aimed at enterprise applications that will run it for “highly complex tasks.

oGemini Pro: It is the most general-purposed of the three and has already been plugged into Bard for prompts that require advanced reasoning, planning, and understanding.

oGemini Nano: It described as the most efficient model for on-device tasks, has been baked into the Pixel 8 Pro to process tasks like information summarisation and Smart Reply.

Tensor Processing Units: Gemini 1.0 on its AI-optimised infrastructure using its in-house designed Tensor Processing Units (TPUs) v4 and v5e.
Hindi Languages Available: Hindi, Bengali, Gujarati, Kannada, Malayalam, Marathi, Tamil, Telugu, and Urdu.
Significance: It will help people access information and complete tasks in their preferred language.