
Project Astra Is Google’s ‘Multimodal’ Answer to the New ChatGPT

By News Room | May 14, 2024 | 4 Min Read

Pulkit Agrawal, an assistant professor at MIT who works on AI and robotics, says Google’s and OpenAI’s latest demos are impressive and show how rapidly multimodal AI models have advanced. OpenAI launched GPT-4V, a system capable of parsing images, in September 2023. Agrawal was impressed that Gemini is able to make sense of live video, for example by correctly interpreting changes made to a diagram on a whiteboard in real time. OpenAI’s new version of ChatGPT appears capable of the same.

Agrawal says the assistants demoed by Google and OpenAI could provide new training data for the companies as users interact with the models in the real world. “But they have to be useful,” he adds. “The big question is what will people use them for—it’s not very clear.”

Google says Project Astra will be made available through a new interface called Gemini Live later this year. Hassabis said that the company is still testing several prototype smart glasses and has yet to make a decision on whether to launch any of them.

Astra’s capabilities might provide Google a chance to reboot a version of its ill-fated Glass smart glasses, although efforts to build hardware suited to generative AI have stumbled so far. Despite OpenAI and Google’s impressive demos, multimodal models cannot fully understand the physical world and objects within it, placing limitations on what they will be able to do.

“Being able to build a mental model of the physical world around you is absolutely essential to building more humanlike intelligence,” says Brenden Lake, an associate professor at New York University who uses AI to explore human intelligence.

Lake notes that today’s best AI models are still very language-centric because the bulk of their learning comes from text slurped from books and the web. This is fundamentally different from how language is learned by humans, who pick it up while interacting with the physical world. “It’s backwards compared to child development,” he says of the process of creating multimodal models.

Hassabis believes that imbuing AI models with a deeper understanding of the physical world will be key to further progress in AI, and to making systems like Project Astra more robust. Other frontiers of AI, including Google DeepMind’s work on game-playing AI programs, could help, he says. Hassabis and others hope such work could be revolutionary for robotics, an area that Google is also investing in.

“A multimodal universal agent assistant is on the sort of track to artificial general intelligence,” Hassabis said in reference to a hoped-for but largely undefined future point where machines can do anything and everything that a human mind can. “This is not AGI or anything, but it’s the beginning of something.”

Updated 5-14-2024, 4:15 pm EDT: This article has been updated to clarify the full name of Google’s project.


