Theory Theory: How We Come to Understand Our World (csc.ucdavis.edu/~chaos/papers/theorytheory.pdf)

Theory Theory—

How We Come to Understand Our World

James P. Crutchfield

www.santafe.edu/~chaos

3 May 2002

Abstract: We are confronted, as never before, with an explosion in the size of databases: from data-mining the web to dynamical brain imaging, the various genome projects, and monitoring the millions of shipping containers that enter our ports. These datasets are so vast that there is no conceivable way humans can directly examine all of the data. What has been an intuitive and very human discovery process, hypothesizing structures and building predictive theories about the systems that produce them, must now be automated. But do we understand the notions of structure and pattern well enough that we can teach machines to discover them? How do we build theories that capture patterns in useful and predictive ways? I will announce some very recent results on automated pattern discovery and theory building for cellular automata. The lessons hint at what a future “artificial” science might look like.

Joint work with Carl McTague.

Data Explosion:

• Neurophysiology: multiple neuron recordings (> 100 neurons @ 1kHz)

• Web Data-Mining: 10-100 GB/day, multi-terabyte databases

• Astrophysical data: Hubble, EUV, ...

• Neuroimaging: MEG, 100-200 SQUIDs @ 1 kHz

• Geophysics: Earthquake monitoring with 1000s of sensors of different kinds

• BioInformatics: Genome projects, microarray sequencing, ...

• Searchable Video Databases

• Very large-scale simulations: weather, hydrodynamics, reaction kinetics, ...

• ...

Need machines to help.

Current approach is Pattern Recognition:

• Match data against existing palette of templates

• Fitting polynomials, assuming data are IID and distributions are Gaussian, taking Fourier/Laplace Transforms, ...

Begs the Question:

Where did that palette come from in the first place?

How is this done now?

Baconian Scientific Algorithm:

Select Domain

Hypothesize

Refine

Hypothesis generation: Guessing within a discipline’s domain.

Must we always “guess” or intuit?

Problem: Hypothesis wrong? Little hint of where to go next.

The Inverse Problem: How to go from Data to Theory?

History of the Inverse Problem

Recently in nonlinear physics,

Attractor reconstruction:

• Packard et al (1980): “Geometry from a Time Series”

• Takens (1981), ...

“Theory From Time Series”:

• JPC/McNamara (1987): Equations of Motion from a Data Series

• Farmer/Sidorovich (1987): Nonlinear prediction

• Casdagli (1989), ...

Problem with this, too, alas.

The Underlying Problem: Intrinsic Representation

Reformulating the Question:

Does data contain information about an appropriate representation?

Solution:

Tackle head on: Define structure, pattern, regularity, ...

Computational Mechanics:

Accounts for computational structure.

Extends Statistical Mechanics: more than “statistics”.

Analogy:

Physics “accounts” for energy flow and transduction.

Computational Mechanics accounts for

1. Amount of historical information stored in process;

2. The architecture supporting that storage; and

3. How stored information is transformed into future states.

More directly:

1. Physics has measures of disorder: temperature, thermodynamic entropy.

2. Computational mechanics measures degrees of pattern: structural complexity.

Causal Architecture: A Review of Computational Mechanics

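Processes: $\overleftrightarrow{S} = \ldots S_{-1} S_0 S_1 \ldots = \overleftarrow{S}\,\overrightarrow{S}$.

Causal States $\mathcal{S}$: Sets of $\overleftarrow{S}$ that are equally predictive about $\overrightarrow{S}$:

$$\overleftarrow{s} \sim \overleftarrow{s}' \ \text{such that} \ P(\overrightarrow{S} \mid \overleftarrow{S} = \overleftarrow{s}) = P(\overrightarrow{S} \mid \overleftarrow{S} = \overleftarrow{s}') \qquad (1)$$

ε-Machine: $M = \{\mathcal{S}, \mathcal{T}\}$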

• Theorem: M is the unique, optimal predictor of minimal size.

• Theorem: M is a minimal sufficient statistic for a process.

• The intrinsic representation of a process.

Stored information is Statistical Complexity: $C_\mu = H[P(\mathcal{S})]$.

JPC & K Young, Physical Review Letters 63 (1989) 105–108.

CR Shalizi & JPC, J. Statistical Physics 104 (2001) 817–879.
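The statistical complexity is the Shannon entropy of the distribution over causal states. As a minimal sketch, assuming the state-to-state transition probabilities of an ε-machine are already known, the stationary distribution and its entropy can be computed as below. The function names and the two-state "Golden Mean" example are illustrative assumptions and do not come from the talk.

```python
from math import log2

def stationary(T, iters=1000):
    """Approximate the stationary distribution of a row-stochastic matrix T
    by repeated multiplication (assumes the iteration converges)."""
    n = len(T)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
    return pi

def statistical_complexity(T):
    """C_mu = H[P(S)] in bits, where P(S) is the stationary state distribution."""
    return -sum(p * log2(p) for p in stationary(T) if p > 0)

# Hypothetical example: state A stays in A or moves to B with probability 1/2;
# state B always returns to A. Stationary distribution is (2/3, 1/3).
golden_mean = [[0.5, 0.5],
               [1.0, 0.0]]
print(round(statistical_complexity(golden_mean), 3))  # about 0.918 bits
```

Plain power iteration keeps the sketch dependency-free; a linear solve would be more robust for chains where the iteration converges slowly.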

Cellular Automata: Artificial Physics’s

[Figure: CA update schematic. Labels: lattice of N sites; global states $S_t$ and $S_{t+1}$; local state $s_i$ with its neighborhood (radius = 1); look-up table $\phi$; LUT output bit string. Site axis from 0 to 49.]

Elementary Cellular Automaton 18

ECA 18 Rule Table:

$\eta_t$                        000  001  010  011  100  101  110  111
$s_{t+1} = \phi(\eta_t)$         0    1    0    0    1    0    0    0
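The rule table follows directly from the rule number: neighborhood $\eta$, read as a 3-bit binary number, selects that bit of the number 18. The sketch below shows this and one synchronous update of a lattice. The function names and the periodic boundary condition are assumptions made for illustration; the slides do not specify a boundary condition.

```python
def rule_table(rule):
    """Map each neighborhood string 'abc' to its output bit for an ECA rule number."""
    return {format(n, "03b"): (rule >> n) & 1 for n in range(8)}

def step(rule, row):
    """Apply the rule once to every site of `row` (periodic boundaries assumed)."""
    n = len(row)
    return [(rule >> (4 * row[i - 1] + 2 * row[i] + row[(i + 1) % n])) & 1
            for i in range(n)]

print(rule_table(18))
print(step(18, [0, 0, 0, 1, 0, 0, 0]))  # a single 1 splits into two: [0, 0, 1, 0, 1, 0, 0]
```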

Cellular Automata Computational Mechanics

$\Omega \xrightarrow{\ \Phi\ } \Phi(\Omega)$

Space of configurations $\mathcal{A}^*$: Represented by finite-state automata.

CA LUT: A finite-state transducer $T_\Phi$ that reads in sets $\Omega_t$ and outputs $\Omega_{t+1}$.

Language evolution: Implemented by the Finite-Machine Evolution (FME) Operator $\Phi$:

(i) $\Phi(\Omega)$,

(ii) Drop Transducer Inputs,

(iii) Strip Transient States,

(iv) Convert NFA → DFA, and

(v) Minimize.

Computational Complexity, starting with n-state language:

• $\Phi(\Omega)$ and Drop Inputs: $O(n)$.

• Identify Transient States: $O(n^2)$.

• NFA → DFA: $O(2^n)$.

• Minimize: $O(n \log_2 n)$.
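The first steps of the FME operator can be sketched in a few dozen lines. This is a simplified illustration under stated assumptions, not the authors' implementation: a process language is encoded as a dict `state -> {symbol: next_state}` in which every state is both a start and an accepting state, composite states `((a, b), q)` remember the last two input symbols, and instead of stripping transients and minimizing, the image language is compared with a target directly by an on-the-fly subset construction (which is where the exponential NFA → DFA cost would appear). All function names are hypothetical.

```python
from itertools import product

def eca_phi(rule):
    """Lookup table phi: 3-site neighborhood -> next site value."""
    return {(a, b, c): (rule >> (4 * a + 2 * b + c)) & 1
            for a, b, c in product((0, 1), repeat=3)}

def as_nfa(lang):
    """View a process-language machine as (start states, transition map)."""
    delta = {}
    for q, edges in lang.items():
        for x, r in edges.items():
            delta.setdefault((q, x), set()).add(r)
    return frozenset(lang), delta

def image(rule, lang):
    """Compose lang with the CA transducer, then drop the transducer inputs."""
    phi = eca_phi(rule)
    start = {((a, b), q2)
             for edges in lang.values()
             for a, q1 in edges.items()
             for b, q2 in lang[q1].items()}
    delta, todo, seen = {}, list(start), set(start)
    while todo:
        state = todo.pop()
        (a, b), q = state
        for c, q2 in lang[q].items():
            nxt = ((b, c), q2)
            # The edge is labeled by the output symbol only (inputs dropped).
            delta.setdefault((state, phi[(a, b, c)]), set()).add(nxt)
            if nxt not in seen:
                seen.add(nxt)
                todo.append(nxt)
    return frozenset(start), delta

def same_language(m1, m2, alphabet=(0, 1)):
    """Compare two all-states-accepting NFAs via paired subset construction."""
    (s1, d1), (s2, d2) = m1, m2
    todo, seen = [(s1, s2)], {(s1, s2)}
    while todo:
        p, q = todo.pop()
        if bool(p) != bool(q):   # a word accepted by exactly one machine
            return False
        if not p:
            continue
        for x in alphabet:
            pair = (frozenset(r for s in p for r in d1.get((s, x), ())),
                    frozenset(r for s in q for r in d2.get((s, x), ())))
            if pair not in seen:
                seen.add(pair)
                todo.append(pair)
    return True

def maps_onto(rule, source, target):
    """True if the CA image of `source` is exactly the language `target`."""
    return same_language(image(rule, source), as_nfa(target))

# (0 Sigma)*: state A must read 0, state B reads anything.
zero_sigma = {"A": {0: "B"}, "B": {0: "A", 1: "A"}}
print(maps_onto(18, zero_sigma, zero_sigma))  # True: invariant under ECA 18
```

The same check applies to the two ECA 22 languages (000Σ)∗ and (1110 + 0000)∗ that appear later in the talk: encoding each as a dict and calling `maps_onto(22, ...)` in both directions tests whether they map onto one another.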

Wolfram (1984): Iterate all configurations $\Omega_0 = \mathcal{A}^*$:

$$\Omega_t = \Phi^t \Omega_0 . \qquad (2)$$

Regular Language Complexity: $|\Omega_t|$

t    ECA 18      ECA 22
1    5           15
2    47          280
3    143         4506
4    ≥ 20,000    ≥ 20,000

A Structural View

Domain: Spacetime shift-invariant pattern

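1. $\Lambda = \Phi^p(\Lambda)$, for some $p > 0$.

2. $\Lambda = \sigma^m(\Lambda)$, for some $m > 0$.

Particle:

1. $\Lambda^i \alpha \Lambda^j$, $\alpha \in \mathcal{A}^*$.

2. $\alpha = \Phi^p(\alpha)$, for some $p > 0$.

Particle Interactions:

1. $\alpha + \beta \to \gamma$.

2. $\alpha + \nu \to \emptyset$.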

3. ....

Example: ECA 18

[Figure: ECA 18 spacetime diagram; Site axis from 0 to 99.]

Regularity: Regions of (0A)∗ ... A domain?

Iteration of ECA 18’s Domain

[Figure: Hypothesized Domain → ECA 18 Transducer → Composition (states labeled ([a,b], q), with [a,b] the last two input symbols and q the domain state) → Drop Inputs → Strip Transients → Minimize. QED]

Recognizing Structure: Domain Filtering

Factor out that data explained by structures one knows.
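One simple way to factor out a known domain is to run its recognizer across a configuration and mark the sites it cannot explain. The sketch below does this for a process language encoded as a dict `state -> {symbol: next_state}`; the encoding, the function name, and the resynchronization rule (restart from every state that can read the offending symbol) are illustrative assumptions, not the filter used to produce the figures in the talk.

```python
def domain_filter(lang, row):
    """Return a list with 1 at each site where `row` breaks the domain, else 0."""
    states = set(lang)          # the domain's phase is unknown at the left edge
    flags = []
    for x in row:
        nxt = {lang[q][x] for q in states if x in lang[q]}
        if nxt:
            flags.append(0)     # site is consistent with the domain
        else:
            flags.append(1)     # domain broken here: mark, then resynchronize
            nxt = {lang[q][x] for q in lang if x in lang[q]}
        states = nxt
    return flags

# (0 Sigma)*: state A must read 0, state B reads anything.
zero_sigma = {"A": {0: "B"}, "B": {0: "A", 1: "A"}}
row = [0, 1, 0, 0, 0, 1, 0, 1, 1, 0, 1, 0]
print(domain_filter(zero_sigma, row))  # only the second of the adjacent 1s is flagged
```

Everything flagged 0 is "explained" by the domain; what remains are the candidate particles.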

ECA 18: Domain Filtered

[Figure: domain-filtered ECA 18 spacetime diagram; Site axis from 0 to 99.]

ECA 18’s Particle Catalog

Domain         Process Language
$\Lambda^0$    $(0\Sigma)^*$

Particle       Wedges
$\alpha$       Λ002n+10Λ0

Interactions
$\alpha + \alpha \to \emptyset$

The Enigma: ECA 22

ECA 22 Lookup Table

$\eta_t$                        000  001  010  011  100  101  110  111
$s_{t+1} = \phi(\eta_t)$         0    1    1    0    1    0    0    0

[Figure: ECA 22 spacetime diagram; Site axis from 0 to 99.]

History: highly disordered (high entropy) but ...

• Moore: “... nonlocal structures ...”

• Grassberger (1986): “... long-range, complex correlations ...”; power-law decays.

Iterate of ECA 22’s Candidate Domain

[Figure: Candidate Domain → Transducer → Composition (states labeled ([a,b], q), q = 1, ..., 4) → Drop Inputs → Strip Transients & Minimize.]

ECA 22 Domain Proof?

But ≠ Candidate!

Second Iterate of ECA 22’s Candidate Domain

[Figure: Candidate Iterate → Transducer → Composition (states labeled ([a,b], q)) → Drop Inputs → Strip Transients → Minimize.]

ECA 22 Domain Proof Completed

QED!

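Theorem: $\{\Lambda^0_0, \Lambda^0_1\}$ is a spacetime domain for ECA 22.

Proof: (i) $\Lambda^0_1 = \Phi(\Lambda^0_0)$

(ii) $\Lambda^0_0 = \Phi(\Lambda^0_1)$

in other words

(iii) $\Lambda^0_0 = \Phi^2(\Lambda^0_0)$

(iv) $\Lambda^0_1 = \Phi^2(\Lambda^0_1)$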

This is the first structure discovered for ECA 22. It captures much of ECA 22’s behavior.

The Enigma? ECA 22

[Figure: ECA 22 spacetime diagram; Site axis from 0 to 99; second axis tick at 286.]

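ECA 22’s Particle Catalog

Domains        Process Language
$\Lambda^0$    $\Lambda^0_0 = (000\Sigma)^*$
               $\Lambda^0_1 = (1110 + 0000)^*$
$\Lambda^1$    $0^*$
$\Lambda^2$    $(01)^*$
$\Lambda^3$    $(0011)^*$

Particles      Wedges
$\alpha$       $\Lambda^0_0$ A 001 A $\Lambda^0_0$
               $\Lambda^0_1$ A′ 1111 B′ $\Lambda^0_1$
$\beta$        $\Lambda^0_0$ A 01 A Λ
               $\Lambda^0_1$ A′ 110 B′ $\Lambda^0_1$
               A′ 001 B′ $\Lambda^0_1$

Interactions
$\alpha + \alpha \to \beta$
$\beta + \beta \to \emptyset$
$\alpha + \beta \to \Lambda^{\alpha-\beta\ \mathrm{gas}}$
$\beta + \alpha \to \Lambda^{\alpha-\beta\ \mathrm{gas}}$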

ECA 22: Pattern Discovery

Theorem: $\{\Lambda^0_0, \Lambda^0_1\}$ is the only (positive entropy) domain within the complexity horizon (up to and including 6-state candidates).

Proof: Automated proof methods using Haskell, a functional programming language.

How was this done?

Candidates for Domains: Process Languages

Finite-state process language: represented by (recurrent portion of) ε-machine.

AKA finite-state machine with one strongly connected component; all states are start and final states.

n-DCL Library: The domain candidates with n states.

How many are there?

States n    n-DCL Library Size
4           1,388
5           35,186
6           1,132,613

Automated Search

1. For all $\Lambda \in$ n-DCL, exclude by counterexample:

(a) Generate long $s \in \Lambda$.

(b) Iterate this single configuration $\Phi^t(s)$.

(c) Test: If $\Phi^t(s) \notin \Lambda$, then $\Lambda$ is expanding.

2. For all nonexpanding n-DCL, exclude by entropy rate:

$$h_\mu(\Phi^t(\Lambda)) < h_\mu(\Lambda) \Rightarrow \text{Candidate contracts.} \qquad (3)$$

3. For all nonexpanding, noncontracting n-DCL, attempt to directly prove invariance theorem: $\Lambda = \Phi^p \Lambda$?

Λ that pass tests are domains.
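Step 1 of the search above, exclusion by counterexample, is cheap to sketch. The version below assumes a candidate is encoded as a dict `state -> {symbol: next_state}` with every state a valid start, draws one word from it by a random walk, and iterates the CA on that word (which shrinks by two sites per step because no boundary condition is imposed). The function names, word length, step count, and seed are illustrative choices, not values from the talk; steps 2 and 3 are not covered.

```python
import random

def accepts(lang, word):
    """True if some path through the machine reads `word` (any start state)."""
    states = set(lang)
    for x in word:
        states = {lang[q][x] for q in states if x in lang[q]}
        if not states:
            return False
    return True

def random_word(lang, length, rng):
    """Generate a word of the language by a random walk on the machine."""
    q = rng.choice(sorted(lang))
    word = []
    for _ in range(length):
        x = rng.choice(sorted(lang[q]))
        word.append(x)
        q = lang[q][x]
    return word

def ca_step(rule, word):
    """One ECA step on a finite word; the result is two sites shorter."""
    return [(rule >> (4 * a + 2 * b + c)) & 1
            for a, b, c in zip(word, word[1:], word[2:])]

def is_expanding(rule, lang, length=200, steps=20, seed=0):
    """Look for a counterexample: an iterate of s in Lambda that leaves Lambda."""
    rng = random.Random(seed)
    s = random_word(lang, length, rng)
    for _ in range(steps):
        s = ca_step(rule, s)
        if not accepts(lang, s):
            return True      # counterexample found: candidate is excluded
    return False             # no counterexample found (not a proof of invariance)

zero_sigma = {"A": {0: "B"}, "B": {0: "A", 1: "A"}}   # (0 Sigma)*
zero_one = {"A": {0: "B"}, "B": {1: "A"}}             # (01)*
print(is_expanding(18, zero_sigma), is_expanding(18, zero_one))  # False True
```

A `False` result only means that this one sample produced no counterexample, which is why the surviving candidates still go on to the entropy-rate test and the direct invariance proof.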

ECA 22: Table of a Million Theorems

States   n-DCL        ECA 22 Nonexpanding n-DCL
n        Size         Number   Contracting                                      Domains
1        3            2        Σ∗                                               $\Lambda^1 = 0^*$
2        7            2        (0Σ)∗                                            $\Lambda^2 = (01)^*$
3        78           0
4        1,388        2                                                         $\Lambda^0_0$, $\Lambda^3 = (0011)^*$
5        35,186       5        ((11)(11 + 01)∗(10) + 00)∗
                               ((101)(00+ 1) + (1(00+ 1) + 0))∗
                               ([(101)+(1(00 + 1) + 00)] + [1(00+ 1) + 0])∗
                               ([1+(01)(00 + 1)] + [1+(00) + 0])∗
                               ([(1+01)+(1+00+ 00)] + [1+00+ 0])∗
6        1,132,613    268      267                                              $\Lambda^0_1$

Several months ago: Estimated compute time ∼ 5 Beowulf years!

When finally proved, proof took ∼ 1 week: speed = 7000 tph!

ECA 22’s Last Candidate

[Figure: Transducer and Candidate machines.]

Iterate of the Last Candidate: 161 states and 318 transitions

Theorem: Not a domain.

Proof: Nonexpanding + $h(\Phi(\Lambda)) < h(\Lambda)$ ⇒ Candidate contracts.

The Near Term: All Elementary CAs

Push of a Button:

1. CA Pattern Discoverer will automatically build structural theories for all 256 ECAs.

2. Result: a website of about 1000 pages, with several dozen

pages per ECA.

Beware of Prophets Expounding the CA Gospel!

The Future: Artificial Science

Implications:

1. Artificial Particle Physics.

2. Automated Pattern Discovery.

3. Theorists unemployed?
