Ling 8800 — Seminar in Computational Linguistics (Spring ’23)

Spring ’23, TTh 3:55–5:15, Oxley Hall 122
Instructor: Michael White

Controllability and Explainability in Neural Language Generation

Large, pretrained neural language models have been receiving just a wee bit of hype of late. For example, the New York Times has recently reported that despite the tech slump, OpenAI, the lab that created ChatGPT, may soon complete a deal valuing it at $29 billion (give or take a few bucks), based on the promise of this type of AI to “reinvent everything.”

At the same time, such models have been criticized as dangerous and on the wrong path. For example, Emily Bender, the ACL’s next president, told her local TV station that “OpenAI’s guard rails are not that strong” and that model is prone to producing lots of racist and sexist language. Moreover, in an interview with Ezra Klein, cognitive scientist Gary Marcus noted that ChatGPT is the “king of pastiche,” i.e., it really just does “a kind of glorified cut and paste.” He then goes on to explain why he considers its development to be on the wrong path: “They’re just trying to autocomplete our sentences. And that’s not the depth that we need to actually get to what people call A.G.I., or artificial general intelligence.”

How should we think of these conflicting views?

Description

This course will dig into the technical details of these models, emphasizing useful tasks such as response generation in dialogue systems, question answering, summarization, simplification, data augmentation and explanation generation. The advent of large, pretrained language models has clearly yielded amazing progress on these tasks, though the black box nature of these models make them hard to understand and control, posing ethical and safety obstacles for deploying them in real world settings. As such, we will focus on readings that examine ways of controlling these models and making them reliably generate explanations, using techniques such as decomposition, chain-of-thought and data-augmented training.

Students are encouraged to pursue related research interests for their term project. Topics will be finalized based on the interests of the participants.

Expectations

Students will be expected to actively participate in the discussion and research carried out in the seminar. As detailed below, students will be required to facilitate discussions and post questions on the readings in advance, as well as locate relevant background/tutorial materials. Additionally, students taking the course for 3 credits will be required to carry out a class project on a topic related to the seminar; alternatively, for students already working on a related topic, integrating their focus into the seminar will be an option.

Prerequisites

Ling 5802 or equivalent, or permission of the instructor.

Carmen

We’ll use Carmen to schedule discussion facilitators and post advance questions on the readings, as well as links to background/tutorial materials. We’ll also use it for submitting project documents.

Requirements

Class participation (25%)

We are aiming for a dynamic discussion of papers, not death by powerpoint. Thus, we plan on taking a page from Eric Fosler-Lussier’s playbook and requiring everyone (this includes you!) to post at least one question to the discussion list on Carmen by 8 p.m. the day before each reading will be discussed. Participants should also feel free to share their (initial) thoughts and views of the papers in their posts. In particular, questions of the type “What did they mean by X” or “Why did they do X instead of Y” are encouraged. Remember that most of the papers are targeted to people who are already expert in the area, so you shouldn’t expect to alway understand everything. Airing such questions can help everyone gain a better understanding of the paper — even those who thought they understood it!

Facilitating discussions (25%)

Each meeting where we discuss a paper will have a discussion facilitator. For the main readings, the facilitator should look over the posted questions and choose a subset for discussion. In class, the facilitator should start the session with a brief, five to ten minute summary of the paper, including the highlights and lowlights. Following the opening summary, the facilitator is responsible for managing the discussion, and ensuring that as many viewpoints are heard as possible. Finally, the facilitator is also tasked with keeping track of the potentially most useful background papers for the reading.

Students will be required to facilitate at least one session during the course. If the discussion does not take up the entire class period, the remaining time may be used to (informally) discuss class projects.

We will also dedicate various sessions to reading background papers that we expect to be useful to better understand the main papers of interest. The sessions and papers dedicated to background readings will be determined collaboratively.

Term project (50%)

As noted above, students taking the course for 3 credits will be required to carry out a term project, either alone or in a team setting. A project sketch will be required to be presented informally in class for brainstorming by the fourth week; followed by a project proposal by the eighth week; followed by a presentation during the last week of class; and (finally) a final report by the day the final exam would be held (if there were one).

For students taking the course for 1 credit, no project will be required, with the other requirements scaled accordingly.

Topics

The topics and readings we expect to cover are listed below; these will be refined as the course progresses.

Pretrained Language Models

  • Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe. 2022. Training language models to follow instructions with human feedback. arXiv:2203.02155. Also: ChatGPT blog post.
  • Amelia Glaese, Nat McAleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-Gillingham, Jonathan Uesato, Po-Sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving. 2022. Improving alignment of dialogue agents via targeted human judgements. arXiv:2209.14375. Also: DeepMind Sparrow blog post.
  • William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith. 2021. Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand? Transactions of the Association for Computational Linguistics (2021) 9: 1047–1060.

Ethical and Safety Concerns

  • Emily Dinan, Gavin Abercrombie, A. Bergman, Shannon Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser. 2022. SafetyKit: First Aid for Measuring Safety in Open-domain Conversational Systems. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4113–4133, Dublin, Ireland. Association for Computational Linguistics.
  • Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, Zac Kenton, Sasha Brown, Will Hawkins, Tom Stepleton, Courtney Biles, Abeba Birhane, Julia Haas, Laura Rimell, Lisa Anne Hendricks, William Isaac, Sean Legassick, Geoffrey Irving, Iason Gabriel. 2021. Ethical and social risks of harm from Language Models. arXiv:2112.04359.
  • Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova DasSarma, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Jared Kaplan. 2021. A General Language Assistant as a Laboratory for Alignment. arXiv:2112.00861.
  • Deep Ganguli, Amanda Askell, Nicholas Schiefer, Thomas I. Liao, Kamilė Lukošiūtė, Anna Chen, Anna Goldie, Azalia Mirhoseini, Catherine Olsson, Danny Hernandez, Dawn Drain, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jackson Kernion, Jamie Kerr, Jared Mueller, Joshua Landau, Kamal Ndousse, Karina Nguyen, Liane Lovitt, Michael Sellitto, Nelson Elhage, Noemi Mercado, Nova DasSarma, Oliver Rausch, Robert Lasenby, Robin Larson, Sam Ringer, Sandipan Kundu, Saurav Kadavath, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Christopher Olah, Jack Clark, Samuel R. Bowman, Jared Kaplan. 2023. The Capacity for Moral Self-Correction in Large Language Models. arXiv:2302.07459.
  • Hyunwoo Kim, Youngjae Yu, Liwei Jiang, Ximing Lu, Daniel Khashabi, Gunhee Kim, Yejin Choi, Maarten Sap. 2022. ProsocialDialog: A Prosocial Backbone for Conversational Agents. In Proc. EMNLP-22.

Controllability

Explainability

Background

Policy on Academic Misconduct

It is the responsibility of the Committee on Academic Misconduct to investigate or establish procedures for the investigation of all reported cases of student academic misconduct. The term “academic misconduct” includes all forms of student academic misconduct wherever committed; illustrated by, but not limited to, cases of plagiarism and dishonest practices in connection with examinations. Instructors shall report all instances of alleged academic misconduct to the committee (Faculty Rule 3335-5-487). For additional information, see the Code of Student Conduct.

Students with Disabilities

The University strives to make all learning experiences as accessible as possible. In light of the current pandemic, students seeking to request COVID-related accommodations may do so through the university’s request process, managed by Student Life Disability Services. If you anticipate or experience academic barriers based on your disability (including mental health, chronic or temporary medical conditions), please let me know immediately so that we can privately discuss options. To establish reasonable accommodations, I may request that you register with Student Life Disability Services. After registration, make arrangements with me as soon as possible to discuss your accommodations so that they may be implemented in a timely fashion. SLDS contact information: slds@osu.edu; 614-292-3307; slds.osu.edu; 098 Baker Hall, 113 W. 12th Avenue.

Religious Accommodations

Our inclusive environment allows for religious expression. Students requesting accommodations based on faith, religious or a spiritual belief system in regard to examinations, other academic requirements or absences, are required to provide the instructor with written notice of specific dates for which the student requests alternative accommodations at the earliest possible date. For more information about religious accommodations at Ohio State, visit odi.osu.edu/religious-accommodations.

Weather or Other Short-Term Closing

Should in-person classes be canceled, we will meet virtually via CarmenZoom during our regularly scheduled time.

Disclaimer

This syllabus is subject to change. All important changes will be made in writing (email), with ample time for adjustment.