25
Machine Learning Introduction

Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Embed Size (px)

Citation preview

Page 1: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Machine Learning

Introduction

Page 2: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Class Info

• Office Hours– Monday: 11:30 – 1:00– Wednesday: 10:00 – 1:00– Thursday: 11:30 – 1:00

• Course Text– Tom Mitchell: Machine Learning– Course notes

• Prerequisites– CS1, CS2

Page 3: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Important

• Come to class. Pay attention. Ask questions.– There are no stupid questions!!

• Come to my office hours• Start the homework assignments early• Homework in this class requires “thinking time”• Read the textbook and notes

– The textbook can be difficult to read: very technical

Page 4: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Important

• The course material is difficult• Material for every class requires complete

understanding of the material from all the previous classes– Come to my office hours!!

• First two-three classes will cover important mathematical background for the class– You will be tested on this material

Page 5: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Why Machine Learning• Sorting algorithms

– Can you write a program?

• Facial recognition– Can you write a program?– How do people do it? (Can we simulate this process?!)

– Instead of writing a program by hand, we collect lots of examples that specify the correct output for a given input

– A machine learning algorithm then takes these examples and produces a program that does the job

Page 6: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Find Waldo (2)

Page 7: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Examples of Machine Learning Applications

• Predicting discrete labels (classification)– Email spam filtering

• Predicting real numbers– Predicting future stock prices or exchange rates– Recommendation systems (e.g. Netflix competition, for movies)

• Recognizing patterns:– Facial identities or facial expressions– Handwritten or spoken words

• Recognizing anomalies: monitoring– Unusual sequences of credit card transactions– Unusual patterns of sensor readings in a nuclear power plant, or

in a hospital intensive care ward (epidemiology)

Page 8: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Three Niches for Machine Learning

• Data mining : using historical data to improve decisions– medical records medical knowledge

• Software applications we can't program by hand – autonomous driving – speech recognition

• Self customizing programs – Newsreader that learns user interests

Page 9: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Typical Datamining TaskData mining : using historical data to improve decisions

medical records medical knowledge

• Given:– 9714 patient records, each describing a pregnancy and birth– Each patient record contains 215 features

• Learn to predict:– Classes of future patients at high risk for Emergency Cesarean Section

Page 10: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Datamining ResultData mining : using historical data to improve decisions

medical records medical knowledge

One of 18 learned rules:– If

• No previous vaginal delivery, and• Abnormal 2nd Trimester Ultrasound, and• Malpresentation at admission

– Then Probability of Emergency C-Section is 0.6

Over training data: 26/41 = .63, Over test data: 12/20 = .60

Page 11: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Credit Risk Analysis

• Rules learned from synthesized data:– If

• Other-Delinquent-Accounts > 2, and• Number-Delinquent-Billing-Cycles > 1

– Then Profitable-Customer? = No [Deny Credit Card application]

– If • Other-Delinquent-Accounts = 0, and• (Income > $30k) OR (Years-of-Credit > 3)

– Then Profitable-Customer? = Yes [Accept Credit Card application]

Page 12: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Other Prediction Problems

Customer purchase behavior:

Page 13: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Other Prediction Problems

Customer retention

Page 14: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Other Prediction Problems

Process optimization

Page 15: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Displaying the structure of a set of documents using a deep neural network (clustering vs classification)

Page 16: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

R. Mooney, Univ. of Texas at Austin

Why Study Machine Learning?

• Engineering Better Computing Systems

– Develop systems that are too difficult/expensive to construct manually because they require specific detailed skills or knowledge tuned to a specific task (knowledge engineering bottleneck).

– Develop systems that can automatically adapt and customize themselves to individual users.

• Personalized news or mail filter• Personalized tutoring

– Discover new knowledge from large databases (data mining).• Market basket analysis (e.g. diapers and beer)• Medical text mining (e.g. migraines to calcium channel blockers to

magnesium)

Page 17: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

R. Mooney, Univ. of Texas at Austin

Why Study Machine Learning?• Cognitive Science

– Computational studies of learning may help us understand learning in humans and other biological organisms.

• Hebbian neural learning– “Neurons that fire together, wire together.”

• Human’s relative difficulty of learning disjunctive concepts vs. conjunctive ones.

• Power law of practice

log(# training trials)

log(

perf

. tim

e)

Page 18: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

R. Mooney, Univ. of Texas at Austin

Why Study Machine Learning?

• The Time is Ripe – Many basic effective and efficient algorithms

available.– Large amounts of on-line data available.– Large amounts of computational resources

available.

Page 19: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Machine Learning and Statistics

• A lot of work in machine learning can be seen as a rediscovery of things that were known in statistics– Statistics: interpretation– ML: prediction

• Machine learning often refers to tasks associated with artificial intelligence (AI)– Recognition– Diagnosis,– Planning– Robot control

• Goals can be autonomous machine performance, or enabling humans to learn from data (data mining)

Page 20: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

What is a Learning Problem?

• Learning = Improving with experience at some task– Improve over task T– with respect to performance measure P– based on experience E

• E.g., Learn to play checkers– T: Play checkers– P: % of games won in world tournament– E: opportunity to play against self

• What experience?• What exactly should be learned?• How shall it be represented?• What specific algorithm to learn it?

Page 21: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Type of Training Experience

• Direct or indirect?• Teacher or not?

Problem: is training experience representative of performance goal?

Page 22: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Defining the Learning Task

Task T, Performance metric P, Experience E

T: Recognizing hand-written wordsP: Percentage of words correctly classifiedE: Database of human-labeled images of handwritten words

T: Driving on four-lane highways using vision sensorsP: Average distance traveled before a human-judged errorE: A sequence of images and steering commands recorded while observing a human driver.

T: Categorize email messages as spam or legitimate.P: Percentage of email messages correctly classified.E: Database of emails, some with human-given labels

Page 23: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

R. Mooney, Univ. of Texas at Austin

Designing a Learning System• Choose the training experience• Choose exactly what is too be learned, i.e. the target

function.• Choose how to represent the target function.• Choose a learning algorithm to infer the target function

from the experience.

Environment/Experience

Learner

Knowledge

PerformanceElement

Page 24: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Types of learning task• Supervised learning

– Learn to predict output when given an input vector• Who provides the correct answer?

• Reinforcement learning (will not be covered in this course)

– Learn action to maximize payoff• Not much information in a payoff signal• Payoff is often delayed

• Unsupervised learning– Create an internal representation of the input e.g. form

clusters; extract features• How do we know if a representation is good?

– This is the new frontier of machine learning because most big datasets do not come with labels.

Page 25: Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:

Some Issues in Machine Learning: Textbook, Ch. 1

• What algorithms can approximate functions well (and when)?• How does number of training examples influence accuracy?• How does complexity of hypothesis representation impact it?• How does noisy data influence accuracy?• What are the theoretical limits of learnability?• How can prior knowledge of learner help?• What clues can we get from biological learning systems?• How can systems alter their own representations?