Papers

(see my Google Scholar page or my my Semantic Scholar page for a more complete list)

Ashley Lewis and Michael White. 2023. Mitigating Harms of LLMs via Knowledge Distillation for a Virtual Museum Tour Guide. In Proc. of the SIGDIAL-INLG-23 Workshop on Taming LLMs.

Pulkit Arya, Madeleine Bloomquist, Subhankar Chakraborty, Andrew Perrault, William Schuler, Eric Fosler-Lussier, and Michael White. 2023. Bootstrapping a Conversational Guide for Colonoscopy Prep. In Proc. SIGDIAL-23.

Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su, Huan Sun. 2023. Text-to-SQL Error Correction with Language Models of Code. In Proc. ACL-23.

Lingbo Mo, Ashley Lewis, Huan Sun and Michael White. 2022. INSPIRED: A Large-Scale Dataset and Simulation Framework for Exploring Interactive Learning in Knowledge-Based Question Answering. In Proceedings of the Interactive Learning for Natural Language Processing Workshop at NeurlPS 2022.

Maicher, K., Stiff, A., Scholl, M., White, M., Fosler-Lussier, E., Schuler, W., Serai, P., Sunder, V., Forrestal, H., Mendella, L., Adib, M., Bratton, C., Lee, K., Danforth, D.R. 2022. Artificial Intelligence in Virtual Standardized Patients: Combining Natural Language Understanding and Rule Based Dialogue Management to Improve Conversational Fidelity. Medical Teacher, DOI: https://doi.org/10.1080/0142159X.2022.2130216.

Symon J. Stevens-Guille, Aleksandre Maskharashvili, Xintong Li and Michael White. 2022. Generating Discourse Connectives with Pre-trained Language Models: Conditioning on Discourse Relations Helps Reconstruct the PDTB. In Proceedings of SIGDIAL-22.

Adam Stiff, Michael White, Eric Fosler-Lussier, Lifeng Jin, Evan Jaffe, and Douglas Danforth. 2022. A randomized prospective study of a hybrid rule- and data-driven virtual patient. Natural Language Engineering, 1–42.

Lingbo Mo, Ashley Lewis, Huan Sun and Michael White. 2022. Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction. In Findings of ACL-22. (arxiv)

Soumya Batra, Shashank Jain, Peyman Heidari, Ankit Arun, Catharine Youngs, Xintong Li, Pinar Donmez, Shawn Mei, Shiun-Zu Kuo, Vikas Bhardwaj, Anuj Kumar and Michael White. 2021. Building Adaptive Acceptability Classifiers for Neural NLG. In Proc. EMNLP-21.

Aleksandre Maskharashvili, Symon Stevens-Guille, Xintong Li and Michael White. 2021. Neural Methodius Revisited: Do Discourse Relations Help with Pre-Trained Models Too? In Proc. INLG-21.

Xintong Li, Symon Stevens-Guille, Aleksandre Maskharashvili and Michael White. 2021. Self-Training for Compositional Neural NLG in Task-Oriented Dialogue. In Proc. INLG-21.

Shreyan Bakshi, Soumya Batra, Peyman Heidari, Ankit Arun, Shashank Jain and Michael White. 2021. Structure-to-Text Generation with Self-Training, Acceptability Classifiers and Context-Conditioning for the GEM Shared Task. In Proc. of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021) at ACL.

Peyman Heidari, Arash Einolghozati, Shashank Jain, Soumya Batra, Lee Callender, Ankit Arun, Shawn Mei, Sonal Gupta, Pinar Donmez, Vikas Bhardwaj, Anuj Kumar and Michael White. 2021. Getting to Production with Few-shot Natural Language Generation Models. In Proc. SIGDIAL-21. (bib)

Xintong Li, Aleksandre Maskharashvili, Symon Jory Stevens-Guille and Michael White. 2020. Leveraging Large Pretrained Models for WebNLG 2020. In Proc. WebNLG Workshop 2020.

Symon Stevens-Guille, Aleksandre Maskharashvili, Amy Isard, Xintong Li and Michael White. 2020. Neural NLG for Methodius: From RST Meaning Representations to Texts. In Proc. INLG-20.

Xintong Li and Michael White. 2020. Self-Training for Compositional Neural NLG. West Coast NLP 2020 presentation. (poster) (video)

Ankit Arun, Soumya Batra, Vikas Bhardwaj, Ashwini Challa, Pinar Donmez, Peyman Heidari, Hakan Inan, Shashank Jain, Anuj Kumar, Shawn Mei, Karthik Mohan and Michael White. 2020. Best Practices for Data-Efficient Modeling in NLG: How to Train Production-Ready Neural Models with Less Data. In Proc. COLING-2020. (outstanding paper)

Douglas R. Danforth, Adam Stiff, Kellen R. Maicher, Marisa Scholl, Michael White, Eric Fosler-Lussier and William Schuler. 2020. Artificial Intelligence in Virtual Standardized Patients: Combining Natural Language Understanding and Rule Based Dialogue Management to Improve Conversational Fidelity. In Proc. of the 20th International Meeting on Simulation in Healthcare (IMSH 2020).

Kartikeya Upasani, David King, Jinfeng Rao, Anusha Balakrishnan and Michael White. 2019. The OSU/Facebook Realizer for SRST 2019: Seq2Seq Inflection and Serialized Tree2Tree Linearization. In Proc. of the 2nd Workshop on Multilingual Surface Realisation (MSR 2019). (bib)

Jinfeng Rao, Kartikeya Upasani, Anusha Balakrishnan, Michael White, Anuj Kumar and Rajen Subba. 2019. A Tree-to-Sequence Model for Neural NLG in Task-Oriented Dialog. In Proc. of INLG-19.

Anusha Balakrishnan, Jinfeng Rao, Kartikeya Upasani, Michael White and Rajen Subba. 2019. Constrained Decoding for Neural NLG from Compositional Representations in Task- Oriented Dialogue. In Proc. of ACL-19. (arxiv) (bib)

Kellen R. Maicher, Laura Zimmerman, Bruce Wilcox, Beth Liston, Holly Cronau, Allison Macerollo, Lifeng Jin, Evan Jaffe, Michael White, Eric Fosler-Lussier, William Schuler, David Way, Douglas R. Danforth. 2019. Using Virtual Standardized Patients to Accurately Assess Information Gathering Skills in Medical Students. Medical Teacher, DOI: 10.1080/0142159X.2019.1616683.

Michael White. 2019. Evaluation Order Effects in Dynamic Continuized CCG: From Negative Polarity Items to Balanced Punctuation. In Proc. of the Society for Computation in Linguistics. (bib) (poster)

Reid Fu and Michael White. 2018. LSTM Hypertagging. In Proc. INLG 2018. (bib) (slides)

David L. King and Michael White. 2018. The OSU Realizer for SRST ’18: Neural Sequence-to-Sequence Inflection and Incremental Locality-Based Linearization. In Proc. of the Workshop on Multilingual Surface Realization at ACL-18. (bib)

Lifeng Jin, David King, Amad Hussein, Michael White and Douglas Danforth. 2018. Using Paraphrasing and Memory-Augmented Models to Combat Data Sparsity in Question Interpretation with a Virtual Patient Dialogue System. In Proc. of the 13th Workshop on Innovative Use of NLP for Building Educational Applications at NAACL HLT 2018. (bib)

Ajda Gokcen, Ethan Hill and Michael White. 2018. Madly Ambiguous: A Game for Learning about Structural Ambiguity and Why It’s Hard for Computers. In Proc. of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. (bib) (poster)

Michael White, Simon Charlow, Jordan Needle and Dylan Bumford. 2017. Parsing with Dynamic Continuized CCG. In Proc. of the 13th International Workshop on Tree-Adjoining Grammar and Related Formalisms (TAG+13). (bib) (slides)

Michael White, Manjuan Duan and David L. King. 2017. A Simple Method for Clarifying Sentences with Coordination Ambiguities. In Proc. of the Explainable Computational Intelligence Workshop at INLG-17. (bib)

Lifeng Jin, Michael White, Evan Jaffe, Laura Zimmerman and Douglas Danforth. 2017. Combining CNNs and Pattern Matching for Question Interpretation in a Virtual Patient Dialogue System. In Proc. of the 12th Workshop on Innovative Use of NLP for Building Educational Applications at EMNLP 2017. (bib)

Taylor Mahler, Willy Cheung, Micha Elsner, David King, Marie-Catherine de Marneffe, Cory Shain, Symon Stevens-Guille and Michael White. 2017. Breaking NLP: Using Morphosyntax, Semantics, Pragmatics and World Knowledge to Fool Sentiment Analysis Systems. In Proc. of the Workshop on Building Linguistically Generalizable NLP Systems at EMNLP-17. (bib)

Annie Louis, Michael Roth, Bonnie Webber, Michael White and Luke Zettlemoyer, eds. 2016. Proceedings of the Workshop on Uphill Battles in Language Processing: Scaling Early Achievements to Robust Methods.

David L. King and Michael White. 2016. Enhancing PTB Universal Dependencies for Grammar-Based Surface Realization. In Proc. INLG-16. (bib)

Rajakrishnan Rajkumar, Marten van Schijndel, Michael White and William Schuler. 2016. Investigating Locality Effects and Surprisal in Written English Syntactic Choice Phenomena. Cognition 155:204–232. (manuscript)

Manjuan Duan, Ethan Hill and Michael White. 2016. Generating Disambiguating Paraphrases for Structurally Ambiguous Sentences. In Proc. of the Tenth Linguistic Annotation Workshop at ACL 2016 (LAW-X). (data) (slides) (bib)

Ajda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White and Douglas Danforth. 2016. A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System. In Proc. of the 10th edition of the Language Resources and Evaluation Conference (LREC 2016). (poster) (bib)

Michael White and David M. Howcroft. 2015. Inducing Clause-Combining Rules: A Case Study with the SPaRKy Restaurant Corpus. In Proc. of the 15th European Workshop on Natural Language Generation. (bib)

Evan Jaffe, Michael White, William Schuler, Eric Fosler-Lussier, Alex Rosenfeld and Douglas Danforth. 2015. Interpreting Questions with a Log-Linear Ranking Model in a Virtual Patient Dialogue System. In Proc. of the 10th Workshop on Innovative Use of NLP for Building Educational Applications at NAACL HLT 2015. (bib)

Rajakrishnan Rajkumar and Michael White. 2014. Better Surface Realization through Psycholinguistics. Language and Linguistics Compass, 8(10):428–448.

Michael White, Rajakrishnan Rajkumar, Kiwako Ito and Shari R. Speer. 2014. Eye tracking for the online evaluation of prosody in speech synthesis. In Natural Language Generation in Interactive Systems, Amanda Stent and Srinivas Bangalore, eds., Cambridge University Press, Chapter 12, pages 281–301.

Michael White. 2014. Towards Surface Realization with CCGs Induced from Dependencies. In Proc. INLG-14. (bib) (poster)

Manjuan Duan and Michael White. 2014. That’s Not What I Meant! Using Parsers to Avoid Structural Ambiguities in Generated Text. In Proc. ACL-14. (bib)

David Howcroft, Crystal Nakatsu and Michael White. 2013. Enhancing the Expression of Contrast in the SPaRKy Restaurant Corpus. In Proc. ENLG-13. (bib) (data)

Kapil Thadani, Scott Martin and Michael White. 2012. A Joint Phrasal and Dependency Model for Paraphrase Alignment. In Proc. of COLING 2012. (bib) (poster)

Dennis N. Mehay and Michael White. 2012. Shallow and Deep Paraphrasing for Improved Machine Translation Parameter Optimization. In Proc. of the AMTA 2012 Workshop on Monolingual Machine Translation (MONOMT 2012).

Michael White and Rajakrishnan Rajkumar. 2012. Minimal Dependency Length in Realization Ranking. In Proc. EMNLP-12. (bib) (data)

Michael White. 2012. Shared Task Proposal: Syntactic Paraphrase Ranking. In Proc. of the 7th International Conference on Natural Language Generation (INLG-12). (bib)

Michael White. 2011. Glue Rules for Robust Chart Realization. In Proc. of the 13th European Workshop on Natural Language Generation. (poster)

Anja Belz, Michael White, Dominic Espinosa, Eric Kow, Deirdre Hogan and Amanda Stent. 2011. The First Surface Realisation Shared Task: Overview and Evaluation Results. In Proc. of the 13th European Workshop on Natural Language Generation.

Rajakrishnan Rajkumar, Dominic Espinosa and Michael White. 2011. The OSU System for Surface Realization at Generation Challenges 2011. In Proc. of the 13th European Workshop on Natural Language Generation. (poster)

Rajakrishnan Rajkumar and Michael White. 2011. Linguistically Motivated Complementizer Choice in Surface Realization. In Proc. of the EMNLP-11 Workshop on Using Corpora in NLG. (bib)

Scott Martin and Michael White. 2011. Creating Disjunctive Logical Forms from Aligned Sentences for Grammar-Based Paraphrase Generation. In Proc. of the ACL-11 Workshop on Monolingual Text-to-Text Generation. (bib)

Dominic Espinosa, Rajakrishnan Rajkumar, Michael White and Shoshana Berleant. 2010. Further Meta-Evaluation of Broad Coverage Surface Realization. In Proc. EMNLP-10. (bib) (data)

Rajakrishnan Rajkumar, Michael White, Shari R. Speer and Kiwako Ito. 2010. Evaluating Prosody in Synthetic Speech with Online (Eye-Tracking) and Offline (Rating) Methods. In Proc. 7th Speech Synthesis Workshop.

Dominic Espinosa, Michael White, Eric Fosler-Lussier and Chris Brew. 2010. Machine Learning for Text Selection with Expressive Unit-Selection Voices. In Proc. Interspeech-10.

Rajakrishnan Rajkumar and Michael White. 2010. Designing Agreement Features for Realization Ranking. In Proc. of COLING-10. (poster) (bib)

Crystal Nakatsu and Michael White. 2010. Generating with Discourse Combinatory Categorial Grammar. Linguistic Issues in Language Technology, 4(1):1–62.

Michael White, Robert A. J. Clark and Johanna D. Moore. 2010. Generating tailored, comparative descriptions with contextually appropriate intonation. Computational Linguistics, 36(2):159–201. (link to stimuli)

Michael White, Rajakrishnan Rajkumar, Kiwako Ito and Shari Speer. 2009. Eye Tracking for the Online Evaluation of Prosody in Speech Synthesis: Not So Fast! In Proc. of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-09).

Michael White and Rajakrishnan Rajkumar. 2009. Perceptron Reranking for CCG Realization. In Proc. of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2009). (bib)

Rajakrishnan Rajkumar, Michael White and Dominic Espinosa. 2009. Exploiting Named Entity Classes in CCG Surface Realization. In Proc. of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2009). (bib) (poster)

Scott Martin, Rajakrishnan Rajkumar and Michael White. 2009. Grammar Engineering for CCG using Ant and XSLT. In Proc. of the NAACL HLT 2009 Workshop on Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009). (bib) (poster)

Michael White and Rajakrishnan Rajkumar. 2008. A More Precise Analysis of Punctuation for Broad-Coverage Surface Realization with CCG. In Proc. of the Workshop on Grammar Engineering Across Frameworks (GEAF08). (bib)

Dominic Espinosa, Michael White and Dennis Mehay. 2008. Hypertagging: Supertagging for Surface Realization with CCG. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-08: HLT). (bib)

Stephen A. Boxwell and Michael White. 2008. Projecting Propbank Roles onto the CCGbank. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC-08).

Robert Dale and Michael White, editors. 2007. Report from the Workshop on Shared Tasks and Comparative Evaluation in Natural Language Generation.

Vasile Rus, Arthur C. Graesser, Amanda Stent, Marilyn Walker and Michael White. 2007. Text-to-Text Generation. In Report from the Workshop on Shared Tasks and Comparative Evaluation in Natural Language Generation.

Michael White, Rajakrishnan Rajkumar and Scott Martin. 2007. Towards Broad Coverage Surface Realization with CCG. In Proc. of the 2007 Workshop on Using Corpora for NLG: Language Generation and Machine Translation (UCNLG+MT).

Mary Ellen Foster and Michael White. 2007.  Avoiding Repetition in Generated Text. In Proc. of the 11th European Workshop on Natural Language Generation. (bib)

Robert Dale and Michael White, editors. 2007. Position Papers of the Workshop on Shared Tasks and Comparative Evaluation in Natural Language Generation.

Michael White. 2006. CCG Chart Realization from Disjunctive Inputs. In Proc. of the 4th International Conference on Natural Language Generation (INLG-06). (bib)

Crystal Nakatsu and Michael White. 2006. Learning to Say It Well: Reranking Realizations by Predicted Synthesis Quality. In Proc. COLING-ACL-06. (bib)

Michael White. 2006. Efficient Realization of Coordinate Structures in Combinatory Categorial Grammar. Research on Language and Computation, 4(1):39–75. (prefinal version)

Mary Ellen Foster and Michael White. 2005. Assessing the Impact of Adaptive Generation in the COMIC Multimodal Dialogue System. In Proc. of the IJCAI-05 Workshop on Knowledge and Reasoning in Practical Dialogue Systems.

Carsten Brockmann, Amy Isard, Jon Oberlander, and Michael White. 2005. Modelling alignment for affective dialogue. In Proc. of the UM-05 Workshop on Adapting the Interaction Style to Affective Factors.

Michael White, Mary Ellen Foster, Jon Oberlander, and Ash Brown. 2005. Using Facial Feedback to Enhance Turn-Taking in a Multimodal Dialogue System. In Proc. of the HCI International 2005 Thematic Session on Universal Access in Human-Computer Interaction.

Michael White. 2005. Designing an Extensible API for Integrating Language Modeling and Realization. In Proc. ACL-05 Workshop on Software.

Mary Ellen Foster, Michael White, Andrea Setzer, and Roberta Catizone. 2005. Multimodal generation in the COMIC dialogue system. ACL 2005 Demo Session. (Poster [A0 PDF])

Mary Ellen Foster and Michael White. 2004. Techniques for Text Planning with XSLT. In Proc. of the 4th NLPXML Workshop.

Michael White. 2004. Reining in CCG Chart Realization. In Proc. of the 3rd International Conference on Natural Language Generation (INLG-04).

Rachel Baker, Robert A. J. Clark, and Michael White. 2004. Synthesising Contextually Appropriate Intonation in Limited Domains. In Proc. of the 5th ISCA Speech Synthesis Workshop.

Johanna Moore, Mary Ellen Foster, Oliver Lemon, and Michael White. 2004. Generating Tailored, Comparative Descriptions in Spoken Dialogue. In Proc. of the 17th International FLAIRS Conference.

Michael White and Jason Baldridge. 2003. Adapting Chart Realization to CCG. In Proc. of the 9th European Workshop on Natural Language Generation.