A Compositional Object-Based Approach to Learning Physical...

A Compositional Object-Based Approach to Learning Physical DynamicsMichael Chang1,2,4, Tomer Ullman1,3, Antonio Torralba1,4, Joshua B. Tenenbaum1,3

1 2 3 4

Project Webpage: http://mbchang.github.io/npe Code: http://github.com/mbchang/dynamics

2 Steps1 Vision

3 Neural Physics Engine (NPE)

Predict the velocity of each object in turn, given the pairwise interactions with its neighborhood context objects.

NPE applied on object 3

Scenario Model Architecture

Baselines

NP applied on object 3 LSTM applied on object 3

pairwise encoder

decodersum pairwise effects

Multi-layer encoder

Pairwise layer Multi-layer decoder

Multi-layer LSTM

No-Pairwise (NP) Baseline: no pairwise factorizationLSTM Baseline: no composition of independent effects

4a Generalization: different numbers of objects

4b Generalization: different scene configurations

Ground Truth NPE NP LSTM

3 Balls

4 Balls

5 Balls

6 Balls

7 Balls

8 Balls

Ground Truth NPE NP LSTM

“O” World

“L” World

“U” World

“I” World

“O”, “L: worlds withoutinternal obstacles“U”, “I:worlds with internal obstacles

3, 4, 5: worlds with fewer objects

6, 7, 8: worlds with more objects

NPE does not overlap with internal obstacles, while the NP and LSTM do. This shows the NPE is invariant to position and scene configuration, while NP and LSTM memorize the training wall configuration.

a) Prediction Task: Cosine similarity (top) and relative magnitude (middle) of prediction vs ground truth over 50 steps of simulation. Log MSE on velocity (bottom) over the course of training

b) Generalization Task: Same metrics as prediction, but train on 3, 4, 5; test on 6, 7, 8.

c) Inference Task: NPE gets about 90% accuracy in prediction and generalization setting.

d) Neighborhood: constrains the search space of context objects

5 Contributions

Object-based representations Context Selection Mechanism

Factorization Compositionality

Factorize larger objects into smaller building blocks

Compute prediction as a composition of pairwise interactions

Ingredients useful for generalizationVision: In order to rapidly learn new tasks and flexibly adapt to changes in inputs and goals, an agent needs a prior on physics that allows it to naturally generalize reasoning to novel scenes.

Beliefs Desires

Actions

planning

World state

Agentstate

perception

physics

Approach: Endow the agent with the prior as a learned physics simulator.

This work: We present the Neural Physics Engine, a framework for learning simulators of intuitive physics that naturally generalizes across variable object count and different scene configurations.

Different numbers of objects

Different scene configurations

Prior experience Novel scenes

What are the primitives, means of combination, and means of abstraction that underlie the training and testing distributions, such that generalization comes for free?

By combining the expressivenessof physics engines and the adaptability of neural networks in a compositional architecture that naturally supports generalization in fundamental aspects of physical reasoning, the Neural Physics Engine is an important step towards lifting an agent’s ability to think at a level of abstraction where the concept of physics is primitive.

Contributions• Presented a framework for learning a physics simulator• Proposed ingredients useful for generalization• Combined the strengths of symbolic and neural

models in an object-based neural network• Demonstrated an instantiation of the NPE framework

for prediction, generalization, and inference tasks in worlds of balls and obstacles

Ingredients useful for generalization• Object-based representations• Context-selection mechanism• Factorization• Compositionality

Combining the advantages of symbolic and neural models

Neural Physics Engine (NPE)

• Generic notions of objects and their interactions• Trained to model specific object properties and dynamics of different worlds • Transfers knowledge across different object counts and scene configurations

Symbolic Physics Engines

• Expressive• Knowledge encoded in structure• Difficult to adapt to scenarios

outside description language

Neural Networks

• Adaptive• Knowledge emerges through training• Requires retraining for new scenarios

How to incorporate inductive biases that are not only strong enough to generalize across scenes, but also flexible enough to learn from and adapt to different inputs?

https://goo.gl/BWYuOF

Simulation videos

A Compositional Object-Based Approach to Learning Physical...

Documents

Influencing Categorical Choices Through Physical Object ... · PDF fileInfluencing Categorical Choices Through Physical Object Interaction ... Introduction Research into ... (pencil,

Quantitative Measurement of Virtual vs. Physical Object Embodiment through Kinesthetic ... · 2019-04-22 · Quantitative Measurement of Virtual vs. Physical Object Embodiment through

MHSAA Physical Form - Cloud Object Storage | Store

A Compositional Object-Based Approach to Learning Physical …phys.csail.mit.edu/papers/11.pdf · A Compositional Object-Based Approach to Learning Physical Dynamics Michael Chang*,

MODELING PHYSICAL OBJECTS WITH OBJECT-ORIENTED … · 2021. 7. 6. · Chapter 2. Building Software Models of Physical Objects. To describe a physical object in our everyday world,

Compositional Learning for HumanObject Interactionopenaccess.thecvf.com/.../Keizo_Kato_Compositional_Learning_of_ECCV... · Compositional Learning for Human Object Interaction 3 graph

Modelica - A Unified Object-Oriented Language for Physical

ModelicaTM - A Unified Object-Oriented Language for Physical

LNCS 8691 - A Graph Theoretic Approach for Object Shape ... · A Graph Theoretic Approach for Object Shape Representation in Compositional Hierarchies Using a Hybrid Generative-Descriptive

Object-Oriented Modeling of Complex Physical Systems Using

Verified Artificial Intelligence and Autonomysseshia/talks/Seshia...Deep Learning-Based Object Detection [Dreossi, Donze, Seshia, “Compositional Falsification of Cyber-Physical Systems

Compositional phase stability of strongly correlated electron materials within DFT U · 2017-01-27 · COMPOSITIONAL PHASE STABILITY OF STRONGLY ... PHYSICAL REVIEW B 95, 045141 (2017)

Dynamical and Compositional Assessment of Near … Dynamical and Compositional Assessment of Near-Earth Object Mission Targets Using an H-plot analysis, we identify 234 currently known

Remote Batch Invocation for Compositional Object Services

Compositional Captioning - University of Chicagottic.uchicago.edu/~klivescu/MLSLP2016/hendricks_MLSLP2016.pdfCompositional Captioning: Describing Novel Object Categories without Paired

Remote Batch Invocation for Compositional Object Servicespeople.cs.vt.edu/~tilevich/papers/batches.pdf · Remote Batch Invocation for Compositional Object Services Ali Ibrahim2, Yang

MANUFACTURING PROCESSES 1WEC. Cutting Cutting is the separation of a physical object, or a portion of a physical object, into two portions, through

"The book as physical object" by Keith Smith

Combining Physical Simulators and Object-Based Networks

Modeling Physical Systems with Modern Object Oriented Perl