Google DeepMind Enables Robots To Perform Novel Tasks

Bijay Pokharel, July 29, 2023 0 2 min read

Google has demonstrated its first vision-language-action (VLA) model for robot control that showed improved generalization capabilities and semantic and visual understanding beyond the robotic data it was exposed to.

This includes interpreting new commands and responding to user commands by performing rudimentary reasoning, such as reasoning about object categories or high-level descriptions.

The Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model that learns from both web and robotics data, and translates this knowledge into generalized instructions for robotic control, according to Google DeepMind.

A traditional robot can pick up a ball and stumble when picking up a cube.

RT-2’s flexible approach enables a robot to train on picking up a ball and can figure out how to adjust its extremities to pick up a cube or another toy it’s never seen before.

“We also show that incorporating chain-of-thought reasoning allows RT-2 to perform multi-stage semantic reasoning, like deciding which object could be used as an improvised hammer (a rock), or which type of drink is best for a tired person (an energy drink),” said the DeepMind team.

The latest model builds upon Robotic Transformer 1 (RT-1) which was trained on multi-task demonstrations.

The team performed a series of qualitative and quantitative experiments on RT-2 models, on over 6,000 robotic trials.

“Across all categories, we observed increased generalization performance (more than 3x improvement) compared to previous baselines,” the team said.

The RT-2 model shows that vision-language models (VLMs) can be transformed into powerful vision-language-action (VLA) models, which can directly control a robot by combining VLM pre-training with robotic data.

READ

WhatsApp Introduces Exciting Upgrades for Calls

“RT-2 is not only a simple and effective modification over existing VLM models, but also shows the promise of building a general-purpose physical robot that can reason, problem solve, and interpret information for performing a diverse range of tasks in the real world,” said Google DeepMind.

IT NEWS & UPDATES

iOS Users Can Now Include Audio When Sharing Screens Using Google Meet

IT NEWS & UPDATES

Meta May Soon Launch Horizon Worlds Mobile App

Bijay Pokharel

Bijay Pokharel is the creator and owner of Abijita.com. He is a freelance technology writer focusing on all things pertaining to Cyber Security. The topics he writes about include malware, vulnerabilities, exploits, internet defense, women's safety and privacy, as well as research and innovation in information security. He is a tech enthusiast, keen learner, rational and cool person in his professional activities and challenges.

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

Recent Posts

Nigeria Arrests 792 in Major Scam Operation

Texas Tech University Health Sciences Center Hit by Cyberattack, Data of 1.4 Million Patients Exposed

Serbian Police Accused of Hacking Activists’ Phones Using Cellebrite Tools and Spyware

Instagram Now Lets You Schedule DMs: Here’s How It Works

Threads Reaches 100 Million Daily Active Users

What Is the Secret Santa Scam and How Can You Avoid It?

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

SIGN UP FOR NEWSLETTERS

Please confirm your email address.

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

Google DeepMind Enables Robots To Perform Novel Tasks

Bijay Pokharel

Related posts

LG To Close Mobile Phone Business Worldwide

Tesla’s Income Drops 24% To $2.7 Billion Amid EV Price Cuts

SpaceX Confirms Starlink ‘Network Outage’

Uber Launches Its Advertising Division, Will Show Video Ads During Rides

NASA To Start Training Artemis II Crew For Moon Mission In June

NASA-SpaceX Crew-6 Docks Safely At ISS After Hour-Long Delay

Leave a Reply Cancel reply

Recent Posts

Nigeria Arrests 792 in Major Scam Operation

Texas Tech University Health Sciences Center Hit by Cyberattack, Data of 1.4 Million Patients Exposed

Serbian Police Accused of Hacking Activists’ Phones Using Cellebrite Tools and Spyware

Instagram Now Lets You Schedule DMs: Here’s How It Works

Threads Reaches 100 Million Daily Active Users

What Is the Secret Santa Scam and How Can You Avoid It?

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

SIGN UP FOR NEWSLETTERS

Please confirm your email address.