Google has just announced (but not released) Gemini 1.5, an update to its flagship language model — the model used in the chatbot once known as Bard, but synergistically renamed Gemini a week ago.
The big claim with this release is "a breakthrough in long-context understanding across modalities." It's also meant to be a step-up in terms of efficiency, having been built on an architecture type known as "Mixture-of-Experts (MoE)," meaning performance supposedly akin to Gemini 1.0, but relying on fewer electricity-hungry GPUs cranking away to achieve it.
That first big claim about multi-modal "long-context" understanding is as jargony as it sounds, but Google Deepmind co-founder posted a demo on X meant to show what this means in practice.
Tweet may have been deleted
Smartly utilizing a large piece of public-domain text that won't rankle any copyright sticklers — in this case a 402-page transcript of the NASA mission that landed on the moon — the LLM is able to narrow its focus to what the user needs ("context") in spite of the prompt being absolutely gigantic ("long"), so apparently that's what "long-context" means.
In the demo, Gemini 1.5 is able to pick out three amusing moments from the novel-length piece of text. It's also able to spot the event in the transcript that matches a picture of a lunar boot print — the part where, y'know, Neil Armstrong walks on the moon — which elucidates what "multimodal" is intended to mean in this context: an image recognition model working hand-in-hand with the LLM.
SEE ALSO: Samsung Galaxy S24 series comes with Google's Gemini AI modelThis upgrade is part of an ongoing effort to keep Google in the AI conversation after OpenAI ate everyone's AI lunch in 2022 by releasing ChatGPT. Late last year, Google began seriously hyping changes to come with Bard and the model powering it, which remains an also-ran large language model, more known for being shoehorned into popular Google and Android products than for being used like ChatGPT to solve day-to-day problems and blow minds at cocktail parties. In particular, a December 2023 research paper touted a version of Gemini that had exceeded the performance of OpenAI's GPT-4 model in certain cases, and become the first LLM to get a passing score on a specific AI test of "Multitask Language Understanding " or MLU.
Among other assertions about Gemini 1.5, Google says the new model can crunch large datasets with impressive accuracy, and — in a somewhat more eyebrow raising claim — perform well at reasoningacross all sorts of data types. Reasoning is the most famous weakness among most LLMs.
According to CEO Sundar Pichai, Google is releasing Gemini 1.5 to a limited group. "We’re excited to offer a limited preview of this experimental feature to developers and enterprise customers," Pichai wrote in Google's blog post.
The broader base of Gemini users will be the ultimate judges of Google's performance claims when they're actually permitted to try out Gemini 1.5 as part of an officially-released product. Google's most powerful model, Gemini Ultra was released a week ago, so it may be a while, and it's probably safe to assume that Gemini 1.5, will one day be part of Google's new premium — in other words "paid" — package of Workspace products called Google One AI Premium.
Copyright © 2023 Powered by
Google announces Gemini 1.5, a flashy upgrade to its flagship AI model-书香门户网
sitemap
文章
58119
浏览
49
获赞
6459
Meghan and Harry reveal their newborn son's name
The Duke and Duchess of Sussex have announced their newborn son's name: Archie Harrison Mountbatten-No one noticed, but Uber and Lyft stopped accepting new NYC drivers
New wannabe Lyft and Uber drivers are out of luck in New York City.Both ride-hailing apps stopped taChris Evans may 'cut ties' with Tom Brady over Trump
Chris Evans, Avenger and overall number one Chris, may have to snap his finger and turn his Tom BradAm I the only one who masturbates to podcasts? A look into audio porn.
May is National Masturbation Month, and we're celebrating with Feeling Yourself, a series exploringMia Farrow's Twitter account is joyfully bizarre
Stream of consciousness writing made for great 20th century fiction, and now it's coming for 21st ce'Old Town Road' yeehaws its way to No. 1 on the Billboard Hot 100
Lil Nas X's viral hit "Old Town Road" has reached No. 1 on the Billboard Hot 100. The TikTok mainstaPeople are sharing videos and pictures of their college professors being ridiculous
Imagine you're in high school, goofing around and chatting with your friends when your teacher breakThe dyed Chicago River doubles as a perfect green screen in this hilarious video
The Chicago River was dyed green for St. Patrick's day. And for the memes. Every year, Tom Rowan dyeApple gives students and teachers free AirPods with purchase of Mac or iPad
AirPods are cool. Free AirPods are even cooler. Apple is giving away a free pair of AirPods for studThe CIA has launched its Instagram with an 'eye spy' picture
Welcome to the 'gram, CIA fam!Ok, I'm sorry for that rhyme. But sometimes you've gotta roll out theTesla cars can now diagnose themselves and pre
Tesla has quietly launched several neat features over the past couple of days, making it easier forNYC subway will soon accept Google Pay at select turnstiles
New York City straphangers, rejoice!The city's subway and bus systems are adding new contactless payACLU warns that 'no replies' on Twitter could violate the constitution
Trump was basically Obama's reply guy throughout the 2010s, so it's only fitting that he won't be ab'Old Town Road' yeehaws its way to No. 1 on the Billboard Hot 100
Lil Nas X's viral hit "Old Town Road" has reached No. 1 on the Billboard Hot 100. The TikTok mainstaDesktop vs. Laptop Gaming with the RTX 2070
Since the launch of Nvidia's RTX GPUs we've been intrigued about the performance difference between