Google DeepMind enables robots to perform novel tasks

Beyond UPS Ball Google show performer

29.07.2023 - 08:27

Reading now: 616

economictimes.indiatimes.com:

Google has demonstrated its first vision-language-action (VLA) model for robot control that showed improved generalisation capabilities and semantic and visual understanding beyond the robotic data it was exposed to. This includes interpreting new commands and responding to user commands by performing rudimentary reasoning, such as reasoning about object categories or high-level descriptions.

The Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model that learns from both web and robotics data, and translates this knowledge into generalised instructions for robotic control, according to Google DeepMind. A traditional robot can pick up a ball and stumble when picking up a cube.

RT-2's flexible approach enables a robot to train on picking up a ball and can figure out how to adjust its extremities to pick up a cube or another toy it's never seen before. «We also show that incorporating chain-of-thought reasoning allows RT-2 to perform multi-stage semantic reasoning, like deciding which object could be used as an improvised hammer (a rock), or which type of drink is best for a tired person (an energy drink),» said the DeepMind team.

The latest model builds upon Robotic Transformer 1 (RT-1) that was trained on multi-task demonstrations. The team performed a series of qualitative and quantitative experiments on RT-2 models, on over 6,000 robotic trials.

«Across all categories, we observed increased generalisation performance (more than 3x improvement) compared to previous baselines,» the team said. The RT-2 model shows that vision-language models (VLMs) can be transformed into powerful vision-language-action (VLA) models, which can directly control a robot by combining VLM pre-training with robotic data.

. Read more on economictimes.indiatimes.com

All news from economictimes.indiatimes.com

About this in other media

Google Doodle pays tribute to Bollywood icon Sridevi on 60th birth anniversary livemint.com /1 year ago

Google Doodle celebrates Thai Mother's Day 2023 with a special illustration livemint.com /1 year ago

Google doodle celebrates 2023 FIFA Women's World Cup Quarter Finals livemint.com /1 year ago

The website fvbb.com is an aggregator of news from open sources. The source is indicated at the beginning and at the end of the announcement. You can send a complaint on the news if you find it unreliable.

Google DeepMind enables robots to perform novel tasks

Related News

Eco-anxiety can help urge climate action, but excessive of it may hinder the progress

Police clears Cardi B of criminal battery charge after rapper hurled microphone at fan

SBFC Finance IPO: GMP jumps after strong subscription status. Should you apply as bidding ends on Monday?

Mrunal Thakur to receive Diversity Cinema award at Indian Film Festival of Melbourne

Vedanta ends week with 11% loss on promoter paring stake. How to trade next week?

BTC hodlers outperformed crypto funds by 69% in H1 2023: 21e6 Capital AG

‘Rocky Aur Rani Kii Prem Kahaani’ box office report: Alia Bhatt-Ranveer Singh's film may cross ₹100 crore-mark this week

Power Stocks: Value Buys or Value Traps?

RBI monetary policy meeting: Date, schedule, time, and expectations from Governor Shaktikanta Das

Beware of phishing scam! IRCTC issues urgent warning against fake mobile app targeting users

Q1 results today: Bank of Baroda to Punjab & Sind Bank — 35 companies to declare Q1 results 2023

Garena Free Fire Max redeem codes for Aug 05, 2023: Get weapons, diamonds, more

Amazon Great Freedom Festival Sale: Up to 60% off on TVs from Samsung, Sony, Redmi, LG and more

Big Tech rebounds and preps for transformative AI investments

Mark Margolis dead: Renowned actor from 'Breaking Bad' and 'Better Call Saul,' Passes Away at 83

Apple shares fall: Nasdaq major is no more a 3 trillion dollar company

App Billing Policy: HC dismisses 14 of 16 petitions against Google

Box Office: Barbie surpasses $900 million worldwide as ‘Teenage Mutant Ninja Turtles’ reboot and ‘Meg 2’ debut. See details

Judge allows key US antitrust Google search claims to go to trial

A Learjet pilot thought he was cleared to take off. He wasn't. Luckily, JetBlue pilots saw him

Facebook owner Meta carries out threat to block news in Canada. Google plans to do the same