====== Dialog ======

===== Overviews =====
Best overviews: [[https://arxiv.org/pdf/1711.01731.pdf|Chen et al 2017 - A Survey on Dialogue Systems: Recent Advances and New Frontiers]] and [[https://arxiv.org/pdf/2004.13637.pdf|Roller et al 2020 - Recipes for building an open-domain chatbot]].

  * Introductions
    * [[https://web.stanford.edu/~jurafsky/slp3/24.pdf|S&LP - Ch 24]]
  * General
    * **[[https://arxiv.org/pdf/1711.01731.pdf|Chen et al 2017 - A Survey on Dialogue Systems: Recent Advances and New Frontiers]]**
    * [[https://arxiv.org/pdf/1809.08267.pdf|Gao et al 2018 - Neural Approaches to Conversational AI]] (95 pages)
    * [[https://arxiv.org/pdf/2006.12442.pdf|Roller et al 2020 - Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions]]
    * [[https://www.sciencedirect.com/science/article/pii/S1319157821003001|Singh 2021 - A Survey on Near-human Conversational Agents]]
    * [[https://arxiv.org/pdf/2201.06657.pdf|Caldarini et al 2022 - A Literature Survey of Recent Advances in Chatbots]]
    * [[https://arxiv.org/pdf/2212.09252.pdf|Shaier et al 2022 - Mind the Knowledge Gap: A Survey of Knowledge-enhanced Dialogue Systems]]
    * [[https://arxiv.org/pdf/2402.18013|Yi et al 2024 - A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems]]
  * Courses
    * **[[https://docs.google.com/spreadsheets/d/1nSKcnM5r9x82BdyPgn-obN1sRUlLC7zZ082a0132Igk/edit?fbclid=IwAR1tX8eztqZ4Kn4NNPUvkkGKCurikWtv-wcSnRyDRg4M544dQtSXTTb3vfM#gid=1523499517|Zhou Yu's Conversational AI course at Columbia]]**
  * Datasets
    * [[https://arxiv.org/pdf/1512.05742.pdf|Serban et al 2017 - A Survey of Available Corpora for Building Data-Driven Dialogue Systems]]
    * [[https://arxiv.org/pdf/2205.06907.pdf|Sundar & Heck 2022 - Multimodal Conversational AI: A Survey of Datasets and Approaches]]
  * Evaluation
    * [[https://arxiv.org/pdf/1905.04071.pdf|Deriu et al 2019 - Survey on Evaluation Methods for Dialogue Systems]]
    * [[https://ieeexplore.ieee.org/document/7785371|Fan et al 2020 - A Survey of Dialogue System Evaluation]]
    * [[https://arxiv.org/pdf/2106.03706.pdf|Yeh et al 2021 - A Comprehensive Assessment of Dialog Evaluation Metrics]]
  * Other
    * [[https://ww.sentic.net/empathetic-dialogue-systems.pdf|Ma et al 2020 - A Survey on Empathetic Dialogue Systems]]
    * [[https://arxiv.org/pdf/2205.06907.pdf|Sundar & Heck 2022 - Multimodal Conversational AI: A Survey of Datasets and Approaches]]
    * [[https://arxiv.org/pdf/2305.15933.pdf|Rahmani et al 2023 - A Survey on Asking Clarification Questions Datasets in Conversational Systems]]

===== Key Papers =====
  * [[https://www.aclweb.org/anthology/D11-1054.pdf|Ritter et al 2011 - Data-Driven Response Generation in Social Media]]
  * [[https://arxiv.org/pdf/1506.05869.pdf|Vinyals & Le 2015 - A Neural Conversational Model]]
  * [[https://arxiv.org/pdf/1606.01541.pdf|Li et al 2016 - Deep Reinforcement Learning for Dialogue Generation]]
  * [[https://arxiv.org/pdf/1901.08149.pdf|Wolf et al 2019 - TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents]]
  * [[https://arxiv.org/pdf/1812.07617.pdf|Li et al 2018 - Towards Deep Conversational Recommendations]] Introduced the [[https://redialdata.github.io/website/|ReDial dataset]]
  * Meena: [[https://arxiv.org/pdf/2001.09977.pdf|Adiwardana et al 2020 - Towards a Human-like Open-Domain Chatbot]]
  * SOTA (2020, better than Meena) neural dialog generation: [[https://arxiv.org/pdf/2004.13637.pdf|Roller et al 2020 - Recipes for building an open-domain chatbot]]. [[https://parl.ai/projects/recipes/|Code and models]]
  * [[https://arxiv.org/abs/2009.11423|Andreas et al 2020 - Task-Oriented Dialogue as Dataflow Synthesis]]
  * [[https://arxiv.org/pdf/2205.09073.pdf|Dai et al 2022 - Dialog Inpainting: Turning Documents into Dialogs]]

===== Chatbots =====
See also [[Chatbots]].
  * [[https://arxiv.org/abs/2008.12579|Madotto et al 2020 - The Adapter-Bot: All-In-One Controllable Conversational Model]]

=== Out-of-Scope Detection ===
  * [[https://arxiv.org/pdf/2301.06544.pdf|Qian et al 2023 - Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants]]

===== Open-Domain Goal-Oriented Dialog =====
  * [[https://arxiv.org/pdf/2002.02878.pdf|Prabhumoye et al 2020 - I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents]]

===== Open-Domain Dialog Systems with Roles =====
  * **[[https://aclanthology.org/2022.naacl-main.155.pdf|Bae et al 2022 - Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models]]**

===== Knowledge-Grounded Conversational Agents =====

==== Conversational Question Answering ====

===== Prompting and In-Context Learning =====

===== Debating Systems =====
  * [[https://www.nature.com/articles/s41586-021-03215-w|Slonim et al 2021 - An Autonomous Debating System]]

===== Multimodal Dialog Agents =====
See [[nlp:vision_and_language#Multimodal Dialog Agents|Vision and Language - Multimodal Dialog Agents]].

===== Dialog Policy =====
For a nice overview, see the related work section of [[https://arxiv.org/pdf/2407.05674|Joshi et al 2024]].
  * [[https://arxiv.org/pdf/2407.05674|Joshi et al 2024 - LLM-Based Open-Domain Integrated Task and Knowledge Assistants with Programmable Policies]] (Has a great overview)

===== Dialog Acts =====
A dialog act, also known as a speech act, is the "action" an utterance performs, such as requesting information or confirming.

==== Dialog Act Tagging ====
Dialog acts are related to RST (Rhetorical Structure Theory), except that a dialog act is assumed to express a relation between adjacent utterances. See [[https://web.stanford.edu/~jurafsky/ws97/manual.august1.html|here]] for more information.
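To make the idea of a dialog act concrete, the following is a minimal Python sketch of a toy rule-based tagger. It is only an illustration, not the method of any paper listed here: the three-label inventory, the ''CONFIRM_WORDS'' list, and the ''tag_utterance'' helper are all assumptions made for this example. Real annotation schemes such as the Switchboard or MIDAS tag sets are much larger, and real taggers are usually learned from annotated data.

```python
from dataclasses import dataclass
from enum import Enum


class DialogAct(Enum):
    # Hypothetical, tiny label inventory chosen for illustration only.
    REQUEST_INFO = "request_information"
    CONFIRM = "confirm"
    STATEMENT = "statement"


@dataclass
class TaggedUtterance:
    speaker: str
    text: str
    act: DialogAct


# Assumed cue words for confirmations; not taken from any annotation manual.
CONFIRM_WORDS = {"yes", "yeah", "right", "correct", "ok", "okay", "sure"}


def tag_utterance(speaker: str, text: str) -> TaggedUtterance:
    """Assign one dialog act to an utterance using simple surface cues."""
    tokens = text.lower().strip(" .!?").split()
    if text.strip().endswith("?"):
        # A trailing question mark is treated as a request for information.
        act = DialogAct.REQUEST_INFO
    elif tokens and tokens[0].strip(",.!") in CONFIRM_WORDS:
        # An utterance that opens with a confirmation cue word.
        act = DialogAct.CONFIRM
    else:
        # Everything else falls back to a plain statement.
        act = DialogAct.STATEMENT
    return TaggedUtterance(speaker, text, act)


dialog = [
    ("A", "What time does the train leave?"),
    ("B", "It leaves at nine."),
    ("A", "Okay, thanks."),
]
tagged = [tag_utterance(speaker, text) for speaker, text in dialog]
for utt in tagged:
    print(f"{utt.speaker}: {utt.text}  ->  {utt.act.value}")
```

Each utterance receives exactly one label here, which is a simplification; some annotation schemes allow several acts per utterance or condition the label on the previous turn.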
  * [[https://web.stanford.edu/~jurafsky/ws97/manual.august1.html|Switchboard Dialog Act Draft 1997]]
  * [[https://groups.inf.ed.ac.uk/ami/corpus/Guidelines/dialogue_acts_manual_1.0.pdf|2005 - Guidelines for Dialogue Act and Addressee Annotation Version 1.0]] Guidelines for the [[https://groups.inf.ed.ac.uk/ami/corpus/index.shtml|AMI dataset]]
  * [[https://arxiv.org/pdf/1908.10023.pdf|Yu & Yu 2019 - MIDAS, A Dialog Act Annotation Scheme for Open-Domain Human-Machine Spoken Conversations]] There used to be a corpus and an open-source system available, but they have been taken down. Jurik uses this tagger in his system.

=== Taxonomies of Dialog Acts ===
  * [[https://arxiv.org/pdf/1907.03020.pdf|Paul et al 2019 - Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues]]

==== Predicting Dialog Acts ====
Predicting the next dialog act. Papers:
  * [[https://www.aclweb.org/anthology/P19-2027.pdf|Tanaka et al 2019 - Dialogue-Act Prediction of Future Responses based on Conversation History]]

===== Dialog Systems that Learn / Teachable Agents =====
See also [[nlp:semantic_parsing#Learning from Feedback]].
  * [[https://arxiv.org/pdf/1607.00061.pdf|Melamed & Niraula 2016 - Towards A Virtual Assistant That Can Be Taught New Tasks In Any Domain By Its End-Users]] Helpa paper
  * [[https://arxiv.org/pdf/1612.04936.pdf|Li et al 2016 - Learning through dialogue interactions by asking questions]]
  * [[https://aclanthology.org/D18-2025.pdf|Labutov et al 2018 - LIA: A Natural Language Programmable Personal Assistant]]
  * [[https://aclanthology.org/2020.acl-demos.25.pdf|Li et al 2020 - Interactive Task Learning from GUI-Grounded Natural Language Instructions and Demonstrations]]
  * [[https://deepblue.lib.umich.edu/bitstream/handle/2027.42/168026/mininger_1.pdf?sequence=1|Mininger 2021 - Expanding Task Diversity in Explanation-Based Interactive Task Learning]]

===== Conversational Search =====
  * Overviews
    * [[https://dl.acm.org/doi/10.1145/3534965|Keyvan et al 2023 - How to Approach Ambiguous Queries in Conversational Search: A Survey of Techniques, Approaches, Tools, and Challenges]]
    * [[https://arxiv.org/pdf/2305.15933.pdf|Rahmani et al 2023 - A Survey on Asking Clarification Questions Datasets in Conversational Systems]]
  * Papers
    * [[https://aclanthology.org/2020.findings-emnlp.354.pdf|2020 - Making Information Seeking Easier: An Improved Pipeline for Conversational Search]]
    * [[https://dl.acm.org/doi/pdf/10.1145/3397271.3401206|Dalton et al 2020 - CAsT-19: A Dataset for Conversational Information Seeking]]
  * People
    * [[https://dblp.org/pid/c/JamesPCallan.html|Jamie Callan]]

===== Datasets =====
See also [[https://github.com/sebastianruder/NLP-progress/blob/master/english/dialogue.md|NLP progress - Dialog]] and [[https://arxiv.org/pdf/1512.05742.pdf|Serban et al 2017 - A Survey of Available Corpora for Building Data-Driven Dialogue Systems]].
  * [[https://redialdata.github.io/website/|ReDial dataset]] - Annotated dataset of dialogues in which users recommend movies to each other. Paper: [[https://arxiv.org/pdf/1812.07617.pdf|Li et al 2018 - Towards Deep Conversational Recommendations]]
  * [[https://github.com/budzianowski/multiwoz|MultiWOZ]] - Also a data-to-text generation dataset. Paper v1.0: [[https://arxiv.org/pdf/1807.06517.pdf|Ramadan et al 2018 - Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing]]. Paper v2.0: [[https://arxiv.org/pdf/1810.00278.pdf|Budzianowski et al 2018 - MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling]]
  * [[https://parl.ai/projects/wizard_of_wikipedia/|Wizard of Wikipedia]] - Paper: [[https://arxiv.org/pdf/1811.01241.pdf|Dinan et al 2018 - Wizard of Wikipedia: Knowledge-Powered Conversational Agents]]
  * **[[https://github.com/jfainberg/self_dialogue_corpus|Self-dialogue Corpus]] - Paper: [[https://arxiv.org/pdf/1809.06641.pdf|Fainberg et al 2018 - Talking to myself: self-dialogues as data for conversational agents]]**
  * [[https://github.com/google-research-datasets/dstc8-schema-guided-dialogue|Schema-Guided Dialogue Dataset]] - Paper: [[https://arxiv.org/pdf/1909.05855.pdf|Rastogi et al 2019 - Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset]]
  * [[https://github.com/alexa/Topical-Chat|Topical-Chat]] - Paper: [[https://m.media-amazon.com/images/G/01/amazon.jobs/3079_Paper._CB1565131710_.pdf|Gopalakrishnan et al 2019 - Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations]]
  * [[https://github.com/PolyAI-LDN/conversational-datasets|Repository of Conversational Datasets]] - Paper: [[https://arxiv.org/pdf/1904.06472.pdf|Henderson et al 2019 - A Repository of Conversational Datasets]]

==== Multi-Party Datasets ====
  * [[https://arxiv.org/pdf/2305.08502.pdf|Apel et al 2023 - MeeQA: Natural Questions in Meeting Transcripts]] See the datasets they used for more multi-party datasets.
==== Dialog Act Tagging ====
  * [[https://web.stanford.edu/~jurafsky/ws97/CL-dialog.pdf|Stolcke et al 2000 - Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech]]
  * [[https://www.aclweb.org/anthology/W10-4356.pdf|Boyer et al 2010 - Dialogue Act Modeling in a Complex Task-Oriented Domain]]
  * [[https://arxiv.org/abs/1908.10023|Yu & Yu 2019 - MIDAS: A Dialog Act Annotation Scheme for Open Domain Human Machine Spoken Conversations]]

===== Dialog Agent Safety =====
Safety: making sure the dialog agent doesn't use swear words or offensive language, give bad advice, etc., which can happen when the agent has been trained on a corpus of dialogs.
  * [[https://arxiv.org/pdf/1908.06083.pdf|Dinan et al 2019 - Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack]]
  * [[https://arxiv.org/pdf/2010.07079.pdf|Xu et al 2020 - Recipes for Safety in Open-domain Chatbots]]

=== Software ===
  * [[https://www.perspectiveapi.com/|Perspective API]]

===== Evaluation =====
See also [[Chatbots#Evaluation|Chatbots - Evaluation]].
  * [[https://aclanthology.org/P97-1035.pdf|Walker et al 1997 - PARADISE: A Framework for Evaluating Spoken Dialogue Agents]] (task-oriented dialog, I believe)

===== Ideas for future work =====
  * Use dialog to collect training data that improves the NER and EL on the fly: as examples get corrected, add them in. There should be prior work on this.
===== Software =====
  * Chatbots
    * Rule-based
      * [[https://freehal.github.io/|FreeHAL]] [[https://en.wikipedia.org/wiki/FreeHAL|wikipedia]] Rule-based, self-learning chatbot
    * Data-driven
  * Task-Oriented Dialog
  * Miscellaneous
    * [[https://github.com/hwchase17/langchain|LangChain]] Framework for building applications with prompting (chaining prompts, etc)

===== Workshops and Conferences =====
  * SIGDIAL: [[https://www.sigdial.org/files/workshops/conference21/proceedings.html|2020]] [[https://sigdial.org/sites/default/files/workshops/conference22/Proceedings/index.html|2021]]
  * IWSDS (International Workshop on Spoken Dialog Systems): [[https://sites.google.com/view/iwsds2023/home|2023]] [[https://www.colips.org/conferences/iwsds2021/wp/|2021]] [[https://dblp.org/db/conf/iwsds/iwsds2017.html|2017]] [[https://dblp.org/db/conf/iwsds/index.html|Proceedings up to 2020 on dblp]] This is a nice workshop.

===== People =====
  * [[https://scholar.google.com/citations?user=pfqzHqUAAAAJ&hl=en|Emily Dinan]]
  * [[https://scholar.google.com/citations?user=Gp-Q8DsAAAAJ&hl=en|Maxine Eskenazi]]
  * [[https://scholar.google.it/citations?user=JBnyLicAAAAJ&hl=en|Andrea Madotto]]
  * [[https://scholar.google.com/citations?user=evVAmhQAAAAJ&hl=en|David Traum]]
  * [[https://scholar.google.com/citations?user=TMw2-csAAAAJ&hl=en|Marilyn Walker]]
  * [[https://scholar.google.com/citations?user=lMkTx0EAAAAJ&hl=en|Jason Weston]]
  * [[https://scholar.google.com/citations?user=jee2Dy0AAAAJ&hl=en|Zhou Yu]]

===== Related Pages =====
  * [[Autonomous Language Agents]]
  * [[Chatbots]]
  * [[Discourse Analysis]]
  * [[robotics:Embodied AI]]
  * [[nlp:vision_and_language#Multimodal Dialog Agents]]
  * [[Task Oriented Dialog]]
  * [[Jeff's List of Interesting Papers]]