AACL 2022 Tutorial: Recent Advances in Pre-trained Language Models: Why Do They Work and How to Use Them

Introduction

Pre-trained language models (PLMs) are language models that are pre-trained on large-scaled corpora in a self-supervised fashion. These PLMs have fundamentally changed the natural language processing community in the past few years. In this tutorial, we aim to provide a broad and comprehensive introduction from two perspectives: why those PLMs work, and how to use them in NLP tasks. The first part of the tutorial shows some insightful analysis on PLMs that partially explain their exceptional downstream performance. The second part first focuses on how contrastive learning can be applied on PLMs to improve the representations extracted by the PLMs, and then illustrates how one can apply those PLMs to downstream tasks under different circumstances. These circumstances include fine-tuning PLMs when under data scarcity, and using PLMs with parameter efficiency. We believe that attendees of different backgrounds would find this tutorial informative and useful.

Instructors

Cheng-Han Chiang

Cheng-Han Chiang is a PhD student at National Taiwan University (NTU) in Taipei, Taiwan. His main research interest is natural language processing, especially self-supervised learning. He studies why pre-trained language models, such as BERT, works so well, and what should we do to make them work properly and safely. He has publications in analysis of pre-trained langauge models in EMNLP and AAAI. He has also given several lectures in pre-trained language models.

Yung-Sung Chuang

Yung-Sung Chuang is a PhD student in Electrical Engineering and Computer Science at MIT CSAIL, where he works with Dr. James Glass. His research focuses on learning representations for natural language which helps downstream tasks such as natural language understanding, natural language generation, question answering. He has published several paper in this direction in EMNLP, ACL, NeurIPS

Hung-yi Lee

Hung-yi Lee is an associate professor of the Department of Electrical Engineering of National Taiwan University, with a joint appointment at the Department of Computer Science & Information Engineering of the university. His research focuses on deep learning, speech processing, and natural language processing. He owns a YouTube channel teaching deep learning (in Mandarin) with more than 8M views and 100k subscribers. He gave tutorials at ICASSP 2018, APSIPA 2018, ISCSLP 2018, INTERSPEECH 2019 , SIPS 2019, INTERSPEECH 2020, ICASSP 2021, ACL 2021.

Tutorial Slides and Video

The video of the tutorial will be released after the tutorial. Stay tuned!

AACL 2022 Tutorial

Nov. 20, 2022