They sound like the names of the latest smartphones: Gemini Nano, Gemini Pro and Gemini Ultra. In reality they are three versions of Google's new artificial intelligence, which seemed to have been postponed to 2024 and which is instead available today, in its Pro version, in 170 countries. Italy is not among them. In Europe, says Sissy Hsiao, vice president at Google and general manager of Google Assistant and Bard, Gemini will arrive "very soon".
Sundar Pichai, CEO of Google and Alphabet, who a few months ago compared the impact of generative AI on human life to that of the "discovery of fire and electricity", launched Gemini by stating that "the transition we are currently experiencing with artificial intelligence will be the most profound of our lifetime, much greater than the transition to mobile devices or the web that preceded it."
"We're just scratching the surface of what will be possible," Pichai said.
"Gemini is also our most flexible model created to date," said Demis Hassabis, CEO of Google DeepMind, the team born from the merger of Google Brain and DeepMind (the AI laboratory acquired by Google in 2014), which develops the Mountain View company's most advanced artificial intelligence. "Gemini can run efficiently on everything from data centers to mobile devices," added Hassabis.
We believe in making AI helpful for everyone. That's why we're launching Gemini, our most capable model that's inspired by the way people understand and interact with the world. #Gemini pic.twitter.com/gNG9ha9xMO
- Google (@Google) December 6, 2023
This, then, is why there are three "versions" of the same AI: Gemini Ultra is the most powerful model, designed for complicated tasks; Gemini Pro is the most scalable model, suitable for a range of tasks; Gemini Nano is the model designed to run AI directly on mobile devices. The first to receive it will be the Google Pixel 8 Pro, and in particular its "Recorder" app, which already today, thanks to AI, does an extraordinary job of transcribing content.
We imagine that generative AI on Pixels ("and later on other Android devices," says Google) will also lend a significant hand in putting transcribed notes in order, for example by proposing summaries or bullet points.
Gemini will certainly not take the place of Bard, Google's free chatbot, similar to ChatGPT, which answers users' questions in natural language and, when required, with a good dose of "creativity". Gemini will in fact be Bard's "engine". The Pro and Ultra versions of the new AI model, trained on a huge amount of data [Google has not specified the number of parameters, ed.], will allow Bard to solve increasingly complex problems and provide increasingly accurate answers to its users.
Before making it available to the public, Google subjected Gemini Pro to a series of industry benchmarks [that is, commonly accepted reference measures used to evaluate the performance or characteristics of a system, ed.]. In six of the eight benchmarks Gemini Pro surpassed GPT-3.5. These include the MMLU (Massive Multitask Language Understanding) benchmark, one of the leading standards for measuring large AI models, and the GSM8K benchmark, which measures elementary school-level mathematical reasoning.
Furthermore, with a score of 90%, Gemini Ultra is, again according to what Google reports, the first model to outperform human experts on MMLU (Massive Multitask Language Understanding), which uses a combination of 57 subjects such as mathematics, physics, history, law, medicine and ethics to test both world knowledge and problem-solving skills.
On paper, Google's new AI looks promising. Eli Collins, vice president of Google DeepMind, explained that Gemini is "natively multimodal."
"So far the standard approach to creating multimodal models [AI capable of interacting with different input and output modalities, from text to images] was to develop separate components and then put them together," Collins said. "These models are very efficient at a specific operation, such as describing an image, but they struggle when dealing with difficult concepts or complicated reasoning. Gemini, on the other hand, was trained from the beginning on different types of data such as text, images, audio and so on. In this way Gemini can grasp the nuances of certain information contained in images or audio, for example, and can reason about mathematics or physics problems."
Collins's words were followed by a practical demonstration during the meeting organized by Google to reveal Gemini. In a pre-recorded video, Sam Cheung, an interaction designer at Google, shows how the new AI is capable not only of "reading", analyzing and solving a mathematics problem written on a sheet of paper, but also of checking the user's answers to that problem and explaining to the person where they went wrong and why.
Just two weeks ago, news of the progress made by OpenAI, the company that created ChatGPT, in solving elementary mathematics problems had even raised fears for the future of humanity. There has been talk of a Q* project, whose existence was confirmed by CEO Sam Altman, which would bring OpenAI's AI closer to AGI, the artificial general intelligence that could one day match human cognitive abilities.
"I don't know the details of OpenAI's work," Collins replied to those who asked him whether Gemini's performance also suggests progress towards AGI, "so I can't say anything about it. However, I can say that with Gemini, progress has been made in multimodal reasoning and in reasoning about mathematics and physics."
"With Gemini we have also made enormous progress in terms of factuality," Collins said, referring to the AI's ability to base its responses on concrete facts and objective reality, so as to avoid the "hallucinations" typical of generative artificial intelligence, i.e. the tendency to produce plausible and coherent answers with invented content.
"Gemini is our best model from this point of view," added Collins, "but the issue of possible errors is an AI problem that is still unsolved. This is why on Bard we have an integrated tool that allows you to verify the information generated [this is the "G" Google symbol, which gives access to a traditional web search on the topic put to Bard, ed.]."
Of all the companies involved in the AI race, Google is the one that needs to be most careful, since Bard's wrong answers, despite the "Experimental" label and the warning "may show inaccurate information", could undermine the credibility of a company that has made accurate search a business worth 162 billion dollars a year. That is how much money comes from advertising on Google, equal to approximately 58% of the company's total revenues.
In Mountain View, the California city that hosts Google's headquarters, there is talk of a new era. Or rather a "Gemini era", as CEO Sundar Pichai put it when presenting the new AI, not surprisingly branded "Gemini 1.0". For Google it is the first version of a technology that could have a huge impact not only in the scientific field, but also in everyday life.
We saw it with our own eyes, during the meeting organized by Google to present Gemini. A very advanced version of the AI, not available to the public, "watched" through a video recording the various actions performed by a person and commented on them in real time, with voice and text, dispensing information and jokes.