Google Launches Gemini AI Model
- By Paul Mah
- December 07, 2023
Google this week announced the launch of Gemini, an AI model that it says was designed from scratch to be multimodal and can work with text, code, audio, image, and video.
Gemini is available in three sizes: Ultra, Pro, and Nano. Gemini Pro and Nano are available, while Gemini Ultra is still being tested and will only roll out to developers and enterprise customers early next year.
Gemini is trained using Google’s in-house designed Tensor Processing Units (TPUs) v4 and v5e, Google says Gemini Pro outperforms GPT-3.5, though there is no mention of GPT-4.
Gemini
With the ability to better understand nuanced information and answer questions relating to complicated topics, Google says Gemini is especially good at explaining reasoning in math and physics.
Indeed, Gemini can understand, explain, and generate high-quality code in the world’s most popular programming languages, like Python, Java, C++, and Go. A specialized version of Gemini excels at solving competitive programming problems and solves twice as many problems as the original AlphaCode.
“Its sophisticated multimodal reasoning capabilities can help make sense of complex written and visual information. Its remarkable ability to extract insights from hundreds of thousands of documents through reading, filtering, and understanding information will help deliver new breakthroughs at digital speeds in many fields from science to finance.”
Gemini is now rolling out across a range of products and platforms. Bard has also been upgraded to a fine-tuned version of Gemini Pro for more advanced reasoning, planning, and understanding.
The upgraded Bard will be available in English in over 170 countries and territories, with plans to expand to different modalities and support new languages and locations soon.
Gemini Nano is also finding its way to Pixel 8 Pro, which is touted as the first smartphone engineered to run it. It will power new features like Summarize in the Recorder app and will roll out in Smart Reply in Gboard starting with WhatsApp.
“This is a significant milestone in the development of AI, and the start of a new era for us at Google as we continue to rapidly innovate and responsibly advance the capabilities of our models,” wrote Sundar Pichai, CEO of Google and Alphabet and Demis Hassabis, CEO and co-founder of Google Deepmind in a blog post.
“We’ve made great progress on Gemini so far and we’re working hard to further extend its capabilities for future versions, including advances in planning and memory, and increasing the context window for processing even more information to give better responses.”
Image credit: Google
Paul Mah
Paul Mah is the editor of DSAITrends, where he report on the latest developments in data science and AI. A former system administrator, programmer, and IT lecturer, he enjoys writing both code and prose.