Google's Project Astra: AI Assists in Everyday Life with Visual Memory

May 14, 2024 at 5:30:25 PM

TL;DR Google's Project Astra, revealed at I/O 2024, is an AI app that identifies objects and answers questions about what its camera sees. It can remember items it has seen, even after they leave the frame. The AI can name the parts of an object, come up with alliterations, and suggest ways to speed up a system. It speaks with a wide range of human-like vocal intonations. There's no launch date, but it could be available on phones or glasses.

Google's Project Astra is an AI application that uses your phone's camera to identify objects that make sound, find misplaced items, and more. It was showcased at Google's I/O 2024 conference. The project stems from Google's ambition to develop universal AI agents that assist in everyday life.

Project Astra appears to be an app with a viewfinder as its main interface. In a demonstration video, the Gemini AI could identify a speaker and explain its parts, create an alliteration about crayons, identify and explain parts of code, and even remember where it had seen a pair of glasses that were no longer in the frame.

The app also demonstrated the ability to provide suggestions to improve a system's speed and generate creative ideas, such as a band name for a plush tiger toy and a golden retriever.

Project Astra processes visual data in real time and remembers what it has seen. It achieves this by continuously encoding video frames, combining the video and speech input into a timeline of events, and caching this information for efficient recall.
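To make the idea of a cached event timeline concrete, here is a minimal Python sketch. It is not Google's implementation: the names `Event`, `EventTimeline`, and `build_timeline` are hypothetical, plain text descriptions stand in for encoded video frames, and a simple keyword search stands in for whatever recall mechanism Astra actually uses.

```python
import heapq
from collections import deque
from dataclasses import dataclass
from typing import Deque, Iterable, Optional


@dataclass
class Event:
    timestamp: float   # seconds since the session started
    source: str        # "video" or "speech"
    description: str   # made-up stand-in for an encoded frame or a transcript


class EventTimeline:
    """Bounded, time-ordered cache of video and speech events (illustrative)."""

    def __init__(self, max_events: int = 1000) -> None:
        # A deque with maxlen silently drops the oldest event when full.
        self._events: Deque[Event] = deque(maxlen=max_events)

    def add(self, event: Event) -> None:
        # Assumes events are added in non-decreasing timestamp order.
        self._events.append(event)

    def last_seen(self, keyword: str) -> Optional[Event]:
        # Walk backwards so the most recent matching video event wins.
        needle = keyword.lower()
        for event in reversed(self._events):
            if event.source == "video" and needle in event.description.lower():
                return event
        return None


def build_timeline(video: Iterable[Event], speech: Iterable[Event],
                   max_events: int = 1000) -> EventTimeline:
    """Merge two time-sorted streams into a single timeline."""
    timeline = EventTimeline(max_events)
    for event in heapq.merge(video, speech, key=lambda e: e.timestamp):
        timeline.add(event)
    return timeline


# Hypothetical session data, loosely modeled on the glasses example.
video = [
    Event(0.0, "video", "desk with a speaker"),
    Event(2.0, "video", "glasses next to a red apple"),
    Event(4.0, "video", "whiteboard with a diagram"),
]
speech = [Event(5.0, "speech", "Do you remember where you saw my glasses?")]

timeline = build_timeline(video, speech)
hit = timeline.last_seen("glasses")
print(hit.description if hit else "not seen")
```

The bounded deque reflects one plausible trade-off: recall stays cheap because only recent events are kept, at the cost of forgetting anything older than the cache size.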

Google has also been expanding the vocal expressiveness of its AI, giving the agents a wider range of intonations. This is similar to the human-like responses of Google's Duplex voice assistant technology.

While Project Astra is still in its early stages, Google DeepMind CEO Demis Hassabis suggests that these assistants could be available through your phone or glasses.

Some of these capabilities are expected to come to Google products, like the Gemini app, later this year.
