Context: Recently, Google introduced Project Gemini, an artificial intelligence (AI) model designed to exhibit human-like behavior.
Google Gemini
- About: Gemini is an AI model that cannot be accessed directly. Rather, it serves as a foundation on which Google and, ultimately, other developers can build products.
- Multimodal design: Gemini was built from the ground up to be multimodal, which means it can operate across and combine different types of information, including text, audio, image, code, and video.
- It can recognise images, speak in real time, and even solve physics problems with remarkable ingenuity.
- Gemini 1.0 comprises three models: Ultra, Pro, and Nano.
  - Gemini Ultra: Google’s most powerful LLM to date, aimed at enterprise applications that will run it for “highly complex tasks.”
  - Gemini Pro: The most general-purpose of the three; it has already been plugged into Bard for prompts that require advanced reasoning, planning, and understanding.
  - Gemini Nano: Described as the most efficient model for on-device tasks, it has been built into the Pixel 8 Pro to handle tasks such as information summarisation and Smart Reply.
- Tensor Processing Units: Google trained Gemini 1.0 on its AI-optimised infrastructure using its in-house designed Tensor Processing Units (TPUs) v4 and v5e.
Is Gemini better than ChatGPT 4?
- Gemini seems to be more flexible than GPT-4 at the moment. Its ability to work with video and to run on devices without an Internet connection also gives it an edge.
- Gemini is currently free to use, whereas ChatGPT 4 is available only to paid users.