Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from...

Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots

that Learn from Humansin Creating Brain-like Intelligence.

Course: Robots Learning from Humans

Min-Joon Kim

Intelligent Data Systems Lab.

School of Computer Science and Engineering

Seoul National University

September 18th, 2015

Contents Introduction

Physics-Based Model Dynamic Bayesian Network Model

Imitation Process BABIL Imitation Learning Algorithm Planning via Inference

Experiments: Learning Stable Full-Body Humanoid Motion via Imitation

Conclusion

Discussion

Introduction: Brain-Like Intelligence

Brain-like intelligence, our “goal”

From previous chapters…what is brain-like intelligence?

Two major obstacles Lack of mechanisms for rapid learning

https://youtu.be/l0N6mIpoN3M?t=37s Lack of the ability to handle uncertainty

What about people?

Growing evidence that the brain may rely on Bayesian principles for perception and action

Humans can learn new skills by simply watching other humans

But what about robots?

Obvious differences in structure, etc.

Example: Honda ASIMO The question: How much time and code for the robot to

kick a ball? We must keep in mind how “short” the action time is

In order for a robot to “watch and learn” Functional units for segmentation Recognition of human actions Algorithm for constructing an imitative motor plan

If a robot can learn from watching a “teacher” Intuitive Easier due to kinematic similarities Can enable robots to perform noble behaviors

a.k.a learning

But we must be wary… Similar but different.

Not exactly A = B Must be careful in handling uncertainty

Proposed Method

Bayesian framework for imitation-based learning in humanoid robots

Learning a predictive model of the robots dynamics

Taking into account uncertainty and noise + map-ping

Physics-Based Modeling

One can approximate a humanoid robot as a set of articulated rigid bodies A robot with N joints between N+1 rigid bodies

Each joint possibly with multiple degrees of free-dom Expressed in vector form as a six dimensional motion

vector

Physics-Based Modeling Spatial acceleration of rigid body i:

Vector of all joint angles:

Forward Kinematics Computing the velocities and accelerations of all rigid

bodies:

Next, consider inertia and forces to model and constrain dynamics The spatial inertia (I*) must be known or estimated

Forces denoted in spatial notation:

Combined Newton-Euler equation of motion for rigid body i:

Net external force must be known or estimated

Compute the force transmitted from parent:

Apply above to computing the joint forces starting at leaf node to the root: Extract force components through the joint’s DOFs

We have formed the basis for solving the “inverse dynamics” problem: Given desired kinematics, compute the necessary joint

torques

But! Problems! Relative simplicity makes real world problems difficult to

The large number of quantities that we MUST know or be accurately estimated is difficult to ob-tain

The formulation assumes that all external forcesare known.

Can we know, exactly, the … Ground reaction force? Frictional forces? Gravity?

Are all the bodies in a robot completely rigid?

Bayesian Approaches to Uncertainty

Bayesian networks provide a sound theoretical ap-proach to incorporating prior, yet uncertain informa-tion What we just “calculated” before!

Dynamic Bayesian Network Model of the Imita-tion Learning Process

Two sources of information Demonstrative Explorative

Selecting a set of actions based on probabilistic constraints: Matching Egocentric

Dynamic Bayesian Network Model of the Imita-tion Learning Process

Sources of uncertainty

Observing and imitating tasks is inherently difficult

Inter-trial variance of a human performing a skill

The need to predict future states of the agent (robot) given potential control values

The Generative Imitation Approach

Goal is to infer the posterior distributions over Random Variable At

Posterior distribution = the conditional probability that is assigned after the relevant evidence is taken into account

The Generative Imitation Approach

BABIL Imitation Learning Algorithm

Behavior Acquisition via Bayesian Inference and Learning

Planning via Inference

Given a set of evidence, pick actions which have high posterior likelihood = maximum a posteriori (MAP)

But MAP is NP-hard!

= maximum marginal posterior (MMP)

Learning Stable Full-Body Humanoid Motion via Imitation

Log Likelihood of Dynamics Config.

Dynamic Balance Duration over Imitation Trials

Learning Stable Full-Body Humanoid Motion via Imitation

Conclusion

A probabilistic framework that allows a humanoid robot to learn from a human teacher through imita-tion

A general approach to “programming” a complex robot without error-prone physics models

Can handle uncertainty via Bayesian models A more “brain-like” intelligence

Discussion

Do humans act/learn by probabilistic models?

Are we that “mechanical”?

Discussion

Do humans act/learn by probabilistic models?

Are we that “mechanical”?

Can self-consciousness be represented in proba-bilistic models?

Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from...

Documents

The humanoid robots

Imitation of Human Motion on a Humanoid Robot using Non ...his.anthropomatik.kit.edu/pdf_humanoids/Do2008.pdf · Imitation of Human Motion on a Humanoid Robot using Non-Linear Optimization

Humanoid Robots Human-Like Machines

Humanoid Robots

On Human Motion Imitation by Humanoid Robot - Accueil · On Human Motion Imitation by Humanoid Robot Wael Suleiman , Eiichi Yoshida †, Fumio Kanehiro , Jean-Paul Laumond and Andre

Humanoid robots - stability analysis and robustness

Dynamic Imitation in a Humanoid Robot through

Humanoid Robots: A New Kind of Tool - DTICHumanoid, robotics, autonomous, embodied, social, attention, imitation. IEEE Intelligent Systems Introduction While scientific research usually

A Differential Steering System for Humanoid Robots

Loving AI: Humanoid Robots as Agents of - … · Loving AI: Humanoid Robots as Agents of ... 1 The Loving AI Project 2 ... A humanoid robot provides an unparalleled technology platform

Why Humanoid Robots?*

Humanoid Robots Motivation Humanoid Projects RoboCup Humanoid League Robots Alpha RoboSapien Kondo Personal Robots

Humanoid Robot With Imitation Ability - cdn.intechweb.orgcdn.intechweb.org/pdfs/6246.pdf · 15 Humanoid Robot With Imitation Ability WEN-JUNE WANG and LI-PO CHOU National Central

Motion Planning for Legged and Humanoid Robots

Dancing Humanoid RobotsDancing Humanoid Robots Recognition and Generation of Primitive Motions for Dance Imitation Shinichiro Nakaoka, CVL Motion Group (Collaborated with Humanoid

Planning Heavy Lifts for Humanoid Robots* - …€¦ · Planning Heavy Lifts for Humanoid Robots* ... cause of the high dimensionality of humanoid robot systems ... used in this project

Humanoid Robot With Imitation Ability - InTech - Opencdn.intechopen.com/pdfs/...Humanoid_robot_with_imitation_ability.pdf · Humanoid Robot With Imitation Ability 275 The controller

Survey of Humanoid Robots - Department of Computer Sciencejacky/Publications/pdf/baltes03:_survey_hum… · Outline Humanoid robots – Approaches to humanoid robot design – Survey

HUMANOID ROBOTS USED FOR SURVEILLANCE

Humanoid Robots - Portland State University