Jiwei Li, NLP Researcher
By Pragya Arora & Piyush Ghai
Introduction
● Graduated from Stanford University in 2017
● Advised by Prof. Dan Jurafsky
● Worked closely with Prof. Eduard Hovy from CMU and Prof. Alan Ritter from OSU
● Affiliated with the Natural Language Processing Group at Stanford University
Research Interests
● Jiwei’s research interests focus on computational semantics, language generation, and deep learning. His recent work explores the feasibility of developing a framework and methodology for computing the informational and processing complexity of NLP applications and tasks.
● His PhD thesis was titled “Teaching Machines to Converse”.
● Has over 1,200 citations on Google Scholar.¹
● Has over 38 scholarly publications.¹
¹ Source: Google Scholar
Teaching Machines to Converse
● Jiwei’s primary research focus and thesis work centered on conversational models for machines.
● Some of his publications in this domain are:
○ Deep Reinforcement Learning for Dialogue Generation [2016], J Li, W Monroe, A Ritter, M Galley, J Gao, D Jurafsky
○ A Persona-Based Neural Conversation Model [2016], J Li, M Galley, C Brockett, GP Spithourakis, J Gao, B Dolan
○ Adversarial Learning for Neural Dialogue Generation [2017], J Li, W Monroe, T Shi, S Jean, A Ritter, D Jurafsky
Adversarial Learning for Neural Dialogue Generation
Co-Authors
● Will Monroe, PhD Student @Stanford
● Tianlin Shi, PhD Student @Stanford
● Sebastien Jean, PhD Student @NYU Courant
● Alan Ritter, Assistant Professor, Dept of CSE, Ohio State University
● Dan Jurafsky, Professor, Depts of Linguistics and Computer Science, Stanford University
Goal
“To train and produce sequences that are indistinguishable from human-generated dialogue utterances.”
This paper trended on social media as well...
Adversarial Models
It’s a min-max game between a Generator and a Discriminator.
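For reference, this is the standard GAN min-max objective (Goodfellow et al., 2014) that the paper adapts to dialogue; in this setting x is a human-generated dialogue and the generator G produces a response given the dialogue history, while the discriminator D outputs the probability that a dialogue is human-generated:

\min_G \max_D \; \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big] + \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]

Because sampling discrete words is non-differentiable, the generator cannot be trained by backpropagating through D directly; instead it is optimized with policy gradient, treating the discriminator’s score as a reward.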
Model Used
● Earlier, the REINFORCE algorithm was used, which has its own drawbacks:
○ The expectation of the reward is approximated by only one sample, and the reward associated with that sample is used for all tokens in the sequence.
● Vanilla REINFORCE assigns the same negative weight to every token in [I, don’t, know], even though [I] matched the human utterance; see the gradient estimator sketched below.
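As a reminder of why this happens (standard REINFORCE notation; the symbols Q, b, and \pi_\theta are assumptions here, not taken from the slides): for input x and sampled response y, with discriminator score Q(\{x,y\}) and baseline b, the policy-gradient estimate from a single sample is

\nabla_\theta J \;\approx\; \big(Q(\{x,y\}) - b\big)\,\nabla_\theta \log \pi_\theta(y \mid x) \;=\; \big(Q(\{x,y\}) - b\big)\sum_{t}\nabla_\theta \log \pi_\theta(y_t \mid x, y_{1:t-1})

The single scalar (Q - b) multiplies every token’s log-probability gradient, so [I], [don’t], and [know] all receive the same weight.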
REGS - Reward for Every Generation Step
● They reward sequences generated at intermediate steps as well, not just the finished utterance.
● They train the discriminator to assign rewards to partially decoded sequences.
● They also use Teacher Forcing, where human responses are fed to the generator with a positive reward. This helps when the generator gets stuck in a poor minimum and would otherwise not know which updates to make; a toy sketch of both updates follows.
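To make the contrast with vanilla REINFORCE concrete, here is a minimal, hypothetical numpy sketch (not the authors’ implementation; the one-layer toy generator and all names are assumptions). Each token’s gradient is scaled by its own per-step reward, and the teacher-forcing pass replays the human response with reward +1:

import numpy as np

rng = np.random.default_rng(0)
VOCAB, DIM = 50, 16
theta = rng.normal(scale=0.1, size=(DIM, VOCAB))  # toy one-layer "generator"

def log_probs(state):
    # log-softmax over the vocabulary for one decoder state
    logits = state @ theta
    logits = logits - logits.max()
    return logits - np.log(np.exp(logits).sum())

def regs_update(states, tokens, rewards, lr=0.01):
    # Policy-gradient ascent where EACH token is scaled by its OWN reward
    # (the per-step rewards REGS obtains from the discriminator), instead
    # of one sequence-level reward shared by all tokens as in REINFORCE.
    global theta
    for s, tok, r in zip(states, tokens, rewards):
        grad_logits = -np.exp(log_probs(s))  # d log p(tok)/d logits = one_hot - softmax
        grad_logits[tok] += 1.0
        theta = theta + lr * r * np.outer(s, grad_logits)

# Machine-generated sample: the discriminator scores each partial sequence.
states  = [rng.normal(size=DIM) for _ in range(3)]  # stand-ins for decoder states
tokens  = [4, 7, 9]                  # e.g. [I, don't, know]
rewards = [0.8, -0.3, -0.5]          # per-step rewards: [I] still gets credit
regs_update(states, tokens, rewards)

# Teacher forcing: replay the human response with a fixed positive reward
# (real decoder states would come from re-decoding; reused here for brevity).
regs_update(states, [4, 12, 30], [1.0, 1.0, 1.0])

The design point is that the reward signal becomes dense: every decoding step gets feedback, and the teacher-forcing pass keeps the generator anchored to human data when its own samples earn uniformly low rewards.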
Results