Graphical Multi-Task Learning


Dan Sheldon, Cornell University

NIPS SISO Workshop 12/12/2008

Multi-Task Learning (MTL)
• Separate but related learning tasks --- solve them jointly to achieve better performance
• E.g., in a document collection, learn classifiers to predict category, relevance to query 1, query 2, etc.

• Neural nets [Caruana 1997]: shared hidden layers
• Generative models / hierarchical Bayes: shared hyper-parameters

Task Relationships
• Most previous work: a pool of related tasks
• This work: leverage known structural information
  • Graph structure on tasks
  • Discriminative setting
  • Regularized kernel methods

Motivating Application
• Predict presence/absence of Tree Swallow (a migratory bird) at locations in NY
• Observations:
  • xi – date, time, location, habitat, etc.
  • yi – saw a Tree Swallow?
• Significant change throughout the year
• How to model?

[Figure: percent positive observations by month]

Separate Tasks?
• Split training examples by month and train 12 separate models
• OK if lots of training data
[Diagram: independent tasks Jan, Feb, Mar, …, Dec]

Single Task?
• Use all training examples to learn a single classifier
• Include date as a feature to learn about month-to-month heterogeneity
[Diagram: one pooled task over Jan, Feb, Mar, …, Dec]

Symmetric MTL?
[Diagram: symmetrically related tasks Jan, Feb, Mar, …, Dec]
• Ignores known problem structure: January is very weakly related to July

Graphical MTL
• Use a priori knowledge about the structure of task relationships, in the form of a graph
[Diagram: tasks Jan, Feb, Mar, …, Dec connected month-to-month]

Marketing in Social Network
[Diagram: social network with users Alice and Bob]
• Symmetric task relationships? Prefer to leverage the network structure, which is known a priori!

Idea
• Use regularization to penalize differences between tasks that are directly connected
• Penalize by the squared difference || f_t - f_{t-1} ||^2
[Diagram: functions f_1, f_2, f_3, …, f_12 connected in a chain]
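The penalty can be sketched numerically. This is an illustrative example, not code from the talk; the per-month weight matrix F and its dimensions are made up:

```python
import numpy as np

# Hypothetical setup: 12 per-month linear models, one weight vector each.
rng = np.random.default_rng(0)
F = rng.normal(size=(12, 5))  # F[t] = weight vector f_t for month t

# Graph penalty: sum of squared differences over the edges of a cycle
# (Jan-Feb, Feb-Mar, ..., Dec-Jan), directly encoding the month graph.
penalty = sum(np.sum((F[t] - F[(t + 1) % 12]) ** 2) for t in range(12))
print(penalty)
```

Adding the same constant vector to every f_t leaves this penalty unchanged, which is the translation invariance the talk points out for the multi-task term.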

Illustration
• Regularized learning: trade off empirical risk vs. complexity
• Penalize squared distance from the origin

Illustration
• Graphical MTL: trade off empirical risk vs. task differences
• Penalize the sum of squared edge lengths [Evgeniou, Micchelli and Pontil JMLR 2006]

Illustration
• Also add edges to the origin, giving a task-specific regularization term alongside the multi-task term:

  min_f  Empirical Risk  +  λ1 Σ_t || f_t ||^2            (task-specific regularization)
                         +  λ2 Σ_(s,t)∈E || f_s - f_t ||^2  (multi-task regularization)

• Note: the multi-task term alone is translation invariant --- shifting every f_t by the same amount does not change it --- so the edges to the origin are needed to pin the solution down.

Related Work
• Multi-task learning: lots!
  • Caruana 1997; Baxter 2000; Ben-David and Schuller 2003; Ando and Zhang 2004
• Multi-task kernels: Evgeniou, Micchelli and Pontil 2006
  • General framework
  • Focus on the linear, symmetric case (all experiments)
  • Propose graph regularization, nonlinear kernels
• Task networks: Kato, Kashima, Sugiyama and Asai 2007
  • Second-order cone programming

This Work
• Build on Evgeniou, Micchelli and Pontil
• Main contribution: practical development of graphical multi-task kernels, focused on the nonlinear case
  • Task-specific regularization
  • New treatment of nonlinear kernels
  • Application

Technical Insights

Key technical insight: the multi-task problem reduces to a single-task problem by learning one function f(x, t) and modifying the kernel to a product form:

  K((x, s), (x', t)) = K_task(s, t) · K_base(x, x')

i.e., the multi-task kernel is the product of a task kernel and a base kernel.
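A minimal sketch of the product construction (the helper names are mine; the RBF choice of base kernel matches the talk's experiments):

```python
import numpy as np

def rbf(x, xp, gamma=1.0):
    # Base kernel on raw inputs (RBF, as in the talk's experiments).
    return np.exp(-gamma * np.sum((x - xp) ** 2))

def multitask_kernel(x, s, xp, t, K_task, gamma=1.0):
    # Product form: similarity of tasks s and t times similarity of inputs.
    return K_task[s, t] * rbf(x, xp, gamma)
```

With K_task equal to the identity the tasks decouple entirely (separate models); with an all-ones K_task every task is pooled into one. The graph construction interpolates between these extremes.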

Technical Insights

Multi-task kernel:  K((x, s), (x', t)) = K_task(s, t) · K_base(x, x')

Construct the task kernel K_task from the graph Laplacian L, e.g. K_task = (L + αI)^-1, so that the induced norm matches the graph penalty.
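One concrete realization for the 12-month cycle is sketched below; the (L + αI)^-1 form is my reading of the Evgeniou-Micchelli-Pontil construction rather than something spelled out on the slide:

```python
import numpy as np

T, alpha = 12, 2 ** -8  # 12 monthly tasks; alpha value taken from the talk
A = np.zeros((T, T))
for t in range(T):
    A[t, (t + 1) % T] = A[(t + 1) % T, t] = 1.0  # cycle graph on months
L = np.diag(A.sum(axis=1)) - A                   # graph Laplacian
K_task = np.linalg.inv(L + alpha * np.eye(T))    # assumed task-kernel form
```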

Proof Sketch
1. Define the task-specific function as the function with the task ID supplied: f_t(x) = f(x, t).
2. Claim: task-specific functions are comparable via inner products <f_s, f_t>. (Relies on the product kernel.)
3. Claim: the squared norm ||f||^2 is a weighted sum of inner products between task-specific functions: ||f||^2 = Σ_(s,t) W_st <f_s, f_t>.
4. The graph Laplacian gives the desired weights: choosing W = L + αI yields Σ_(s,t)∈E || f_s - f_t ||^2 + α Σ_t || f_t ||^2.
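Step 4 can be checked numerically: for the cycle graph, the quadratic form x^T (L + αI) x equals the edge penalty plus the origin penalty. A small self-check with illustrative values:

```python
import numpy as np

T, alpha = 12, 0.5
A = np.zeros((T, T))
for t in range(T):
    A[t, (t + 1) % T] = A[(t + 1) % T, t] = 1.0
L = np.diag(A.sum(axis=1)) - A  # Laplacian of the 12-cycle

rng = np.random.default_rng(1)
x = rng.normal(size=T)  # stand-in for per-task function values

quad = x @ (L + alpha * np.eye(T)) @ x
edge_penalty = sum((x[t] - x[(t + 1) % T]) ** 2 for t in range(T))
assert np.isclose(quad, edge_penalty + alpha * np.sum(x ** 2))
```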

One more thing…
• Normalize the task kernel to have unit diagonal
• Reason:
  • Preserve the scaling of K when choosing α
  • All entries in [0, 1]
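This is the usual cosine-style rescaling of a kernel matrix; a sketch (the helper name is mine):

```python
import numpy as np

def normalize_unit_diagonal(K):
    # Rescale a PSD kernel matrix so every diagonal entry is exactly 1:
    # K'[s, t] = K[s, t] / sqrt(K[s, s] * K[t, t]).
    d = np.sqrt(np.diag(K))
    return K / np.outer(d, d)
```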

Results
• Bird prediction task: > 5% improvement (AUC)
• Details:
  • SVM with RBF kernels
  • G = cycle
  • Grid search for C and γ
  • α = 2^-8 (robust to many choices)

[Figure: AUC for Pooled, Separate, and Multitask models]

[Figure: sensitivity to C and γ --- Pooled vs. α = 2^-10 vs. α = 2^-6]
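Putting the pieces together, here is an end-to-end sketch on synthetic data. Kernel ridge regression stands in for the talk's SVM to keep the example dependency-free, and all data, dimensions, and hyperparameter values besides α = 2^-8 are invented:

```python
import numpy as np

rng = np.random.default_rng(0)
n, T, gamma, alpha, lam = 120, 12, 0.5, 2 ** -8, 1e-2
X = rng.normal(size=(n, 4))        # stand-in features (date, habitat, ...)
task = rng.integers(0, T, size=n)  # month index of each observation
y = np.sign(X[:, 0] + 0.3 * np.cos(2 * np.pi * task / T))  # toy labels

# Task kernel from the cycle-graph Laplacian, normalized to unit diagonal.
A = np.zeros((T, T))
for t in range(T):
    A[t, (t + 1) % T] = A[(t + 1) % T, t] = 1.0
Lap = np.diag(A.sum(axis=1)) - A
K_task = np.linalg.inv(Lap + alpha * np.eye(T))
d = np.sqrt(np.diag(K_task))
K_task /= np.outer(d, d)

# Base RBF kernel on inputs, then the product multi-task Gram matrix.
sq = np.sum(X ** 2, axis=1)
K_base = np.exp(-gamma * (sq[:, None] + sq[None, :] - 2 * X @ X.T))
G = K_task[np.ix_(task, task)] * K_base

# Kernel ridge fit (in place of the talk's SVM) and training accuracy.
c = np.linalg.solve(G + lam * np.eye(n), y)
acc = np.mean(np.sign(G @ c) == y)
print(acc)
```

Swapping the Gram matrix is the only change relative to single-task training, which is exactly the reduction the talk's kernel construction buys.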

Extensions
• Learn edge weights: detect periods of stability vs. change
• Applications:
  • Social networks
  • Bird problem: spatial regions, many species
• Faster training using graph structure

[Figure: percent positive observations by month]

Thanks!
