Google is taking on OpenAI’s GPT-4o model with Project Astra: Watch to find out how
Summary
In a demonstration shown during the company’s annual developers conference, the Google I/O, Astra could be seen responding almost immediately to queries based on what it saw through the phone’s camera.
If you thought OpenAI’s Gpt-4o model was fascinating, think again.
Tech giant Google, on Tuesday (May 14), unveiled a new prototype from its DeepMind lab, called Project Astra that can answer real-time queries across video, audio and text.
In a demonstration shown during the company’s annual developers conference, the Google I/O, Astra could be seen responding almost immediately to queries based on what it saw through the phone’s camera.
It was able to write an alliteration about crayons, identify programming code functions, suggest improvements to electrical circuit diagrams, give a name to a rock duo consisting of a golden retriever and a stuffed tiger, and identify King’s Cross neighbourhood in London just by looking through a window. It was also able to locate glasses for the user when asked if it had seen them.
Watch here if you haven’t yet.
Project Astra is a prototype from @GoogleDeepMind exploring how a universal AI agent can be truly helpful in everyday life. Watch our prototype in action in two parts, each captured in a single take, in real time ↓ #GoogleIO pic.twitter.com/uMEjIJpsjO
— Google (@Google) May 14, 2024
When Google revealed Gemini in December last year, it received a lot of backlash for faking the capabilities of the AI model in the demo video, where Gemini seemed to instantly recognise objects it was shown. So this time, Google added a disclaimer saying the Astra demo was filmed in two parts, in single takes.
The competition between Google’s Gemini and OpenAI’s ChatGPT seems to be getting close with both models looking to potentially change how we interact with our devices. Google will bring Project Astra — or whatever it will be called officially — to the Gemini app on Pixel smartphones later this year and Apple is said to be closing a deal with OpenAI to power the AI features on its upcoming devices, including the iPhone 16 series. It is possible that the updates Siri could be based on the GPT-4o model.
OpenAI has stated that in the future, improvements will allow for more natural, real-time voice conversation and the ability to converse with ChatGPT via real-time video. Currently, it understands and discusses the images it sees. For example, solving a math problem or understanding your expressions or giving you details about your food.
Dog meets GPT-4o pic.twitter.com/5C0hlYq5ws
— OpenAI (@OpenAI) May 13, 2024
Google smart glasses are likely on table
Google is probably aiming at two birds with one stone with a probability of smart glasses to accompany Project Astra.
The video demonstration during the morning keynote included a pair of glasses that appeared to have a camera and some kind of visual interface.
During interviews with Google DeepMind Chief Executive Officer Demis Hassabis and Google co-founder Sergey Brin, the executives confirmed that the company is experimenting with the idea of making glasses for Project Astra.
“Obviously, it works amazingly on the phones,” said Hassabis, who oversees AI research at Google. “But the whole Valley’s debating this — there probably needs to be other form factors as well, when these systems are fully developed. It seems to me like Glass is an obvious one.”
Brin called Project Astra a “killer app” for an AI-powered glasses effort — adding that Google was 10 years too early to the game. “It’s funny because it’s, like, the perfect hardware,” he said, while stopping short of confirming that Google was actively working on glasses.
“Hands-free is the idea,” Brin said. “A lot of things you want commentary on: You’re cooking or doing some sport, or you want this thing to help you. It’s awkward to do it with your hands also holding your phone.”
(With inputs from Bloomberg)
Also Read: Google I/O 2024: New generative media tools Veo and Imagen 3 revealed
Elon Musk forms several ‘X Holdings’ companies to fund potential Twitter buyout
3 Mins Read
Thursday’s filing dispelled some doubts, though Musk still has work to do. He and his advisers will spend the coming days vetting potential investors for the equity portion of his offer, according to people familiar with the matter