
Simple Classification using Binary Data

Deanna Needell, Mathematics, UCLA

Thanks:
• NSF CAREER DMS #1348721
• NSF BIGDATA DMS #1740325
• MSRI NSF DMS #1440140
• Alfred P. Sloan Fdn

Joint work with

Rayan Saab (UCSD) and Tina Woolf (CGU)

So much data…


How can we handle all this data?

Option 1: Build bigger computing systems

• We need the resources
• Fundamental limitations
• Wasteful (resources, energy, cost, …)


3 MB of internet data transfer = boiling one cup of water (https://www.katescomment.com/energy-of-downloads/)


Option 2 : Design more efficient data analysis methods

Compressed sensing

• Need to solve a highly underdetermined linear system

• A has a null-space!

1-bit compressed sensing

• Store only the first bit – the SIGN of each measurement


• Problem: sign measurements are scale-invariant, so the norm of x is lost

• Remedy: Use “dithers” to estimate the norm of x (sketched below)
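To make this concrete, here is a minimal numpy sketch (not from the slides) of plain versus dithered 1-bit measurements; the sizes and the uniform dither distribution are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

n, m = 50, 200                       # signal dimension, number of measurements (illustrative)
x = rng.standard_normal(n)           # unknown signal
A = rng.standard_normal((m, n))      # random measurement matrix

# Plain 1-bit measurements: only the sign survives, so all information
# about the norm of x is lost (sign(A @ (2 * x)) equals sign(A @ x)).
q_plain = np.sign(A @ x)

# Dithered 1-bit measurements: adding a known random dither before taking
# the sign breaks the scale invariance, so the norm of x can also be estimated.
tau = rng.uniform(-3.0, 3.0, size=m)   # assumed dither distribution, for illustration only
q_dithered = np.sign(A @ x + tau)
```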

Moral:

• One-bit (binary) data is as efficient as it gets

• It may still contain enough “information” about the signal to perform inference tasks

Problem: classification

Problem: reality

(Tang et al. 2016)

Background: SVM


Background: Deep Learning

Our goals

Design a classification scheme that:

• Uses binary data

• Is simple and efficient

• Uses layers in an interpretable way

Notation

• X : n × p training data matrix (data in columns)

• A : m × n (random) matrix

• Q = sign(AX) : m × p binary training data (see the sketch after this list)

• G : # of classes

• b : p training labels (values in 1, …, G)

• L : # of layers in our design
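A minimal numpy sketch of this setup; the dimensions n, p, m and the number of classes G below are illustrative values, not taken from the slides.

```python
import numpy as np

rng = np.random.default_rng(1)

n, p, m, G = 20, 300, 100, 3          # illustrative sizes (assumptions)
X = rng.standard_normal((n, p))       # training data, one point per column
b = rng.integers(1, G + 1, size=p)    # training labels in {1, ..., G}

A = rng.standard_normal((m, n))       # random matrix; each row is a hyperplane normal
Q = np.sign(A @ X)                    # m x p binary (sign) training data
```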

Main idea

• Each row of A corresponds to a hyperplane

• Each binary measurement Q_ij indicates on which side of the ith hyperplane the data point X_j lies

• If we gather enough of this info for all the training data, we can use it to predict the class label for a new test point x

Single layer

• All the hyperplane information in Q is enough

For a new point x, simply compare its sign pattern with those of the training points and choose the label it matches most often (see the sketch below)
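One plausible reading of this rule as code, a sketch only (the slides do not spell out the exact matching rule): score each class by how many signs its training points share with the test point, then pick the best-scoring class.

```python
import numpy as np

def classify_single_layer(x, A, Q, b, G):
    """Sketch of a single-layer rule: score each class by how often the
    test point's signs agree with those of the class's training points."""
    q_x = np.sign(A @ x)                            # sign pattern of the test point (length m)
    agreements = (Q == q_x[:, None]).sum(axis=0)    # per training point: # of matching signs
    scores = np.array([agreements[b == g].sum() for g in range(1, G + 1)])
    return int(np.argmax(scores)) + 1               # predicted label in {1, ..., G}
```

Ties are broken toward the smaller class index here; the slides do not specify tie-breaking.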


Multiple layers

• What about the configuration in the figure?

For ANY hyperplane, there are both red and blue points on at least one side of it


Multiple layers

• Now PAIRS of hyperplanes do the trick


For a new point x, simply compare its sign pattern for hyperplane PAIRS with those of the training points and choose the label it matches most often

Multiple layers

What about higher dimensions?

• We continue this strategy to build layers, the lth layer corresponding to l-tuples of hyperplanes

• For simplicity (and computation), we consider m l-tuples at each layer, selected randomly from all possible tuples (see the sketch after this list)

• For a new test point x, we use the sign patterns across all layers for classification
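A sketch of how the random l-tuples and their joint sign patterns might be formed; the helper names draw_tuples and tuple_sign_patterns are hypothetical, and sampling without replacement within a tuple is an assumption.

```python
import numpy as np

def draw_tuples(m, num_hyperplanes, l, rng):
    """Randomly select m l-tuples of hyperplane indices (no repeats within a tuple)."""
    return [rng.choice(num_hyperplanes, size=l, replace=False) for _ in range(m)]

def tuple_sign_patterns(Q, tuples):
    """For each l-tuple, stack the corresponding sign rows of Q, so every
    training point gets an l-long sign pattern per tuple.  Returns (l x p) arrays."""
    return [Q[idx, :] for idx in tuples]

# Example for layer l = 2 (pairs), reusing A and Q from the earlier sketch:
# pairs = draw_tuples(m=100, num_hyperplanes=Q.shape[0], l=2, rng=np.random.default_rng(2))
# patterns = tuple_sign_patterns(Q, pairs)
```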

Using all layers

How to integrate the info from all layers?

• l : layer

• i : index from 1 to m

• the ith l-tuple : the set of l indices indicating which hyperplanes are selected

• t : a possible sign pattern of l ±1s

• g : a class index (from 1 to G)

• P_{g|t} : the # of training points from the gth class having sign pattern t from the hyperplanes in the ith l-tuple

From these counts we form, for each l-tuple, a value r(l, i, t, g) that combines:

• the fraction of training points in class g out of all points with pattern t (see the sketch below)

• a factor that gives more weight to group g when its size is much different from that of other groups with the same sign pattern
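A hedged sketch of the fraction just described; only the class fraction is computed, since the exact weighting factor is not reproduced in this deck, and the function name class_fraction is hypothetical.

```python
import numpy as np

def class_fraction(Q_tuple, b, t, g):
    """P_{g|t} divided by the total number of training points with pattern t,
    for a single l-tuple.

    Q_tuple : (l x p) signs of the training points on this l-tuple
    b       : length-p labels in {1, ..., G}
    t       : length-l vector of +/-1 (the sign pattern of interest)
    g       : class index
    """
    has_pattern = np.all(Q_tuple == np.asarray(t)[:, None], axis=0)  # points matching t
    total = has_pattern.sum()
    if total == 0:
        return 0.0                                   # no training point has this pattern
    P_g_t = np.logical_and(has_pattern, b == g).sum()
    return P_g_t / total
```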

Our method: training

• “For each layer, pick the l-tuples and then compute all values of r(l, i, t, g)”
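A minimal training sketch under the same assumptions: the plain class fraction stands in for r(l, i, t, g), so the exact values may differ from the paper's.

```python
import numpy as np

def train(Q, b, G, L, m_tuples, rng):
    """For each layer l = 1..L, draw m_tuples random l-tuples of hyperplanes and
    tabulate, for every observed sign pattern t and class g, the class fraction
    standing in for r(l, i, t, g).  Returns a dict keyed by (layer, tuple index)."""
    num_hyperplanes = Q.shape[0]
    model = {}
    for l in range(1, L + 1):
        tuples = [rng.choice(num_hyperplanes, size=l, replace=False) for _ in range(m_tuples)]
        for i, idx in enumerate(tuples):
            patterns = Q[idx, :]                       # l x p signs for this tuple
            table = {}
            for j in range(patterns.shape[1]):
                t = tuple(patterns[:, j])
                counts = table.setdefault(t, np.zeros(G))
                counts[b[j] - 1] += 1                  # per-class counts P_{g|t}
            # convert counts to fractions; these play the role of r at test time
            model[(l, i)] = (idx, {t: c / c.sum() for t, c in table.items()})
    return model
```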

Our method: testing

• “For a new point x with sign pattern t*, compute the sum of all r(l, i, t*, g) for each class g and then assign the label g which has the largest sum.”
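A matching test-time sketch: sum the stored values over all layers and tuples for the test point's observed patterns, then return the class with the largest total (again, using the class-fraction stand-in for r).

```python
import numpy as np

def classify(x, A, model, G):
    """Sum the stored r-values over all layers/tuples for the test point's
    sign patterns t*, then return the class with the largest total (1-based labels)."""
    q_x = np.sign(A @ x)
    scores = np.zeros(G)
    for (layer, i), (idx, table) in model.items():
        t_star = tuple(q_x[idx])
        scores += table.get(t_star, np.zeros(G))       # unseen patterns contribute nothing
    return int(np.argmax(scores)) + 1
```

Together with train above, a test point is classified via classify(x, A, train(Q, b, G, L, m_tuples, rng), G).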

Results

[Classification performance figures for L = 1, 4, and 5]

Theory


• A_{g|t} : the angle of class g with sign pattern t for the ith l-tuple in layer l


• As m → ∞, P(correct label) → 1


Take-away

• Simple classification from binary data
• Efficient storage of the data
• Efficient and simple algorithm
• Theoretical analysis possible
• Already competes with state of the art

Future work:
• Dithers to allow for more complicated geometries?
• Theoretical analysis of the discrete case?

• deanna@math.ucla.edu

• math.ucla.edu/~deanna

• “Simple Classification using Binary Data” – Needell, Saab, Woolf
