====== Dialog ======

===== Overviews =====
Best overviews: [[https://arxiv.org/pdf/1711.01731.pdf|Chen et al 2017 - A Survey on Dialogue Systems: Recent Advances and New Frontiers]] and [[https://arxiv.org/pdf/2004.13637.pdf|Roller et al 2020 - Recipes for building an open-domain chatbot]].
  * Introductions
    * [[https://web.stanford.edu/~jurafsky/slp3/24.pdf|S&LP - Ch 24]]
  * General
    * **[[https://arxiv.org/pdf/1711.01731.pdf|Chen et al 2017 - A Survey on Dialogue Systems: Recent Advances and New Frontiers]]**
    * [[https://arxiv.org/pdf/1809.08267.pdf|Gao et al 2018 - Neural Approaches to Conversational AI]] (95 pages)
    * [[https://arxiv.org/pdf/2006.12442.pdf|Roller et al 2020 - Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions]]
    * [[https://www.sciencedirect.com/science/article/pii/S1319157821003001|Singh 2021 - A Survey on Near-human Conversational Agents]]
    * [[https://arxiv.org/pdf/2201.06657.pdf|Caldarini et al 2022 - A Literature Survey of Recent Advances in Chatbots]]
    * [[https://arxiv.org/pdf/2212.09252.pdf|Shaier et al 2022 - Mind the Knowledge Gap: A Survey of Knowledge-enhanced Dialogue Systems]]
    * [[https://arxiv.org/pdf/2402.18013|Yi et al 2024 - A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems]]
  * Courses
    * **[[https://docs.google.com/spreadsheets/d/1nSKcnM5r9x82BdyPgn-obN1sRUlLC7zZ082a0132Igk/edit?fbclid=IwAR1tX8eztqZ4Kn4NNPUvkkGKCurikWtv-wcSnRyDRg4M544dQtSXTTb3vfM#gid=1523499517|Zhou Yu's Conversational AI course at Columbia]]**
  * Datasets
    * [[https://arxiv.org/pdf/1512.05742.pdf|Serban et al 2017 - A Survey of Available Corpora for Building Data-Driven Dialogue Systems]]
    * [[https://arxiv.org/pdf/2205.06907.pdf|Sundar & Heck 2022 - Multimodal Conversational AI: A Survey of Datasets and Approaches]]
  * Evaluation
    * [[https://arxiv.org/pdf/1905.04071.pdf|Deriu et al 2019 - Survey on Evaluation Methods for Dialogue Systems]]
    * [[https://ieeexplore.ieee.org/document/7785371|Fan et al 2020 - A Survey of Dialogue System Evaluation]]
    * [[https://arxiv.org/pdf/2106.03706.pdf|Yeh et al 2021 - A Comprehensive Assessment of Dialog Evaluation Metrics]]
  * Other
    * [[https://ww.sentic.net/empathetic-dialogue-systems.pdf|Ma et al 2020 - A Survey on Empathetic Dialogue Systems]]
    * [[https://arxiv.org/pdf/2205.06907.pdf|Sundar & Heck 2022 - Multimodal Conversational AI: A Survey of Datasets and Approaches]]
    * [[https://arxiv.org/pdf/2305.15933.pdf|Rahmani et al 2023 - A Survey on Asking Clarification Questions Datasets in Conversational Systems]]


===== Key Papers =====
  * [[https://www.aclweb.org/anthology/D11-1054.pdf|Ritter et al 2011 - Data-Driven Response Generation in Social Media]]
  * [[https://arxiv.org/pdf/1506.05869.pdf|Vinyals & Le 2015 - A Neural Conversational Model]]
  * [[https://arxiv.org/pdf/1606.01541.pdf|Li et al 2016 - Deep Reinforcement Learning for Dialogue Generation]]
  * [[https://arxiv.org/pdf/1901.08149.pdf|Wolf et al 2019 - TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents]]
  * [[https://arxiv.org/pdf/1812.07617.pdf|Li et al 2018 - Towards Deep Conversational Recommendations]] Introduced the [[https://redialdata.github.io/website/|ReDial dataset]]
  * Meena: [[https://arxiv.org/pdf/2001.09977.pdf|Adiwardana et al 2020 - Towards a Human-like Open-Domain Chatbot]]
  * SOTA (2020, better than Meena) neural dialog generation: [[https://arxiv.org/pdf/2004.13637.pdf|Roller et al 2020 - Recipes for building an open-domain chatbot]]. [[https://parl.ai/projects/recipes/|Code and models]]
  * [[https://arxiv.org/abs/2009.11423|Andreas et al 2020 - Task-Oriented Dialogue as Dataflow Synthesis]]
  * [[https://arxiv.org/pdf/2205.09073.pdf|Dai et al 2022 - Dialog Inpainting: Turning Documents into Dialogs]]

===== Chatbots =====
See also [[Chatbots]].

  * [[https://arxiv.org/abs/2008.12579|Madotto et al 2020 - The Adapter-Bot: All-In-One Controllable Conversational Model]]

=== Out-of-Scope Detection ===
  * [[https://arxiv.org/pdf/2301.06544.pdf|Qian et al 2023 - Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants]]

===== Open-Domain Goal-Oriented Dialog =====
  * [[https://arxiv.org/pdf/2002.02878.pdf|Prabhumoye et al 2020 - I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents]]

===== Open-Domain Dialog Systems with Roles =====
  * **[[https://aclanthology.org/2022.naacl-main.155.pdf|Bae et al 2022 - Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models]]**


===== Knowledge-Grounded Conversational Agents =====

==== Conversational Question Answering ====

===== Prompting and In-Context Learning =====


===== Debating Systems =====
  * [[https://www.nature.com/articles/s41586-021-03215-w|Slonim et al 2021 - An Autonomous Debating System]]

===== Multimodal Dialog Agents =====
See [[nlp:vision_and_language#Multimodal Dialog Agents|Vision and Language - Multimodal Dialog Agents]].


===== Dialog Policy =====
For a nice overview, see the related work section of [[https://arxiv.org/pdf/2407.05674|Joshi et al 2024]].

  * [[https://arxiv.org/pdf/2407.05674|Joshi et al 2024 - LLM-Based Open-Domain Integrated Task and Knowledge Assistants with Programmable Policies]] (Has a great overview)

===== Dialog Acts =====
A dialog act, also known as a speech act, is the "action" an utterance performs, such as requesting information or confirming.
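
As a rough illustration (not taken from any of the papers on this page), a dialog can be viewed as a sequence of utterances, each labeled with the act it performs. The label names and the keyword rules below are hypothetical and far simpler than real annotation schemes such as the Switchboard or MIDAS tag sets.

```python
from dataclasses import dataclass

# Hypothetical toy label set; real annotation schemes are much richer.
REQUEST_INFO = "request_info"
CONFIRM = "confirm"
INFORM = "inform"


@dataclass
class Utterance:
    speaker: str
    text: str


def tag_dialog_act(utt: Utterance) -> str:
    """Assign a toy dialog act from simple surface cues (illustration only)."""
    text = utt.text.strip().lower()
    if text.endswith("?"):
        return REQUEST_INFO
    if text.startswith(("yes", "right", "correct", "ok")):
        return CONFIRM
    return INFORM


dialog = [
    Utterance("user", "What time does the museum open?"),
    Utterance("agent", "It opens at 9am."),
    Utterance("user", "Ok, thanks."),
]
acts = [tag_dialog_act(u) for u in dialog]
```

Real taggers are trained on annotated corpora rather than hand-written rules; the sketch only shows the input/output shape of the task.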
==== Dialog Act Tagging ====
Dialog acts are related to Rhetorical Structure Theory (RST), except that a dialog act is assumed to express a relation between adjacent utterances. See [[https://web.stanford.edu/~jurafsky/ws97/manual.august1.html|here]] for more information.
  * [[https://web.stanford.edu/~jurafsky/ws97/manual.august1.html|Switchboard Dialog Act Draft 1997]]
  * [[https://groups.inf.ed.ac.uk/ami/corpus/Guidelines/dialogue_acts_manual_1.0.pdf|2005 - Guidelines for Dialogue Act and Addressee Annotation Version 1.0]] Guidelines for the [[https://groups.inf.ed.ac.uk/ami/corpus/index.shtml|AMI dataset]]
  * [[https://arxiv.org/pdf/1908.10023.pdf|Yu & Yu 2019 - MIDAS: A Dialog Act Annotation Scheme for Open Domain Human Machine Spoken Conversations]] There used to be a corpus and an open-source system available, but they have been taken down. Jurik uses this tagger in his system.

=== Taxonomies of Dialog Acts ===
  * [[https://arxiv.org/pdf/1907.03020.pdf|Paul et al 2019 - Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues]]

==== Predicting Dialog Acts ====
Predicting the next dialog act.
Papers:
  * [[https://www.aclweb.org/anthology/P19-2027.pdf|Tanaka et al 2019 - Dialogue-Act Prediction of Future Responses based on Conversation History]]
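
To make the task concrete, here is a minimal count-based baseline that predicts the next dialog act from the previous one. It is only a sketch under simple assumptions (a bigram model over hypothetical act labels and toy data), not the method of Tanaka et al 2019, which conditions on the conversation history.

```python
from collections import Counter, defaultdict


def train_bigram(act_sequences):
    """Count how often each dialog act follows another."""
    counts = defaultdict(Counter)
    for seq in act_sequences:
        for prev, nxt in zip(seq, seq[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts, prev_act):
    """Return the most frequent follower of prev_act, or None if unseen."""
    if prev_act not in counts:
        return None
    return counts[prev_act].most_common(1)[0][0]


# Hypothetical toy data: each list is the act sequence of one dialog.
training = [
    ["request_info", "inform", "confirm"],
    ["request_info", "inform", "request_info", "inform"],
    ["inform", "confirm"],
]
model = train_bigram(training)
```

With this toy data, ''predict_next(model, "request_info")'' returns ''"inform"'', and an act never seen as a predecessor yields ''None''.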


===== Dialog Systems that Learn / Teachable Agents =====
See also [[nlp:semantic_parsing#Learning from Feedback]].
  * [[https://arxiv.org/pdf/1607.00061.pdf|Melamed & Niraula 2016 - Towards A Virtual Assistant That Can Be Taught New Tasks In Any Domain By Its End-Users]] Helpa paper
  * [[https://arxiv.org/pdf/1612.04936.pdf|Li et al 2016 - Learning through dialogue interactions by asking questions]]
  * [[https://aclanthology.org/D18-2025.pdf|Labutov et al 2018 - LIA: A Natural Language Programmable Personal Assistant]]
  * [[https://aclanthology.org/2020.acl-demos.25.pdf|Li et al 2020 - Interactive Task Learning from GUI-Grounded Natural Language Instructions and Demonstrations]]
  * [[https://deepblue.lib.umich.edu/bitstream/handle/2027.42/168026/mininger_1.pdf?sequence=1|Mininger 2021 - Expanding Task Diversity in Explanation-Based Interactive Task Learning]]

===== Conversational Search =====
  * Overviews
    * [[https://dl.acm.org/doi/10.1145/3534965|Keyvan et al 2023 - How to Approach Ambiguous Queries in Conversational Search: A Survey of Techniques, Approaches, Tools, and Challenges]]
    * [[https://arxiv.org/pdf/2305.15933.pdf|Rahmani et al 2023 - A Survey on Asking Clarification Questions Datasets in Conversational Systems]]
  * Papers
    * [[https://aclanthology.org/2020.findings-emnlp.354.pdf|Kumar & Callan 2020 - Making Information Seeking Easier: An Improved Pipeline for Conversational Search]]
    * [[https://dl.acm.org/doi/pdf/10.1145/3397271.3401206|Dalton et al 2020 - CAsT-19: A Dataset for Conversational Information Seeking]]
  * People
    * [[https://dblp.org/pid/c/JamesPCallan.html|Jamie Callan]]

===== Datasets =====
See also [[https://github.com/sebastianruder/NLP-progress/blob/master/english/dialogue.md|NLP progress - Dialog]] and [[https://arxiv.org/pdf/1512.05742.pdf|Serban et al 2017 - A Survey of Available Corpora for Building Data-Driven Dialogue Systems]].
  * [[https://redialdata.github.io/website/|ReDial dataset]] - annotated dataset of dialogues, where users recommend movies to each other. Paper: [[https://arxiv.org/pdf/1812.07617.pdf|Li et al 2018 - Towards Deep Conversational Recommendations]]
  * [[https://github.com/budzianowski/multiwoz|MultiWOZ]] - Also a data-to-text generation dataset. Paper v1.0: [[https://arxiv.org/pdf/1807.06517.pdf|Ramadan et al 2018 - Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing]]. Paper v2.0: [[https://arxiv.org/pdf/1810.00278.pdf|Budzianowski et al 2018 - MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling]]
  * [[https://parl.ai/projects/wizard_of_wikipedia/|Wizard of Wikipedia]] - Paper: [[https://arxiv.org/pdf/1811.01241.pdf|Dinan et al 2018 - Wizard of Wikipedia: Knowledge-Powered Conversational Agents]]
  * **[[https://github.com/jfainberg/self_dialogue_corpus|Self-dialogue Corpus]] - Paper: [[https://arxiv.org/pdf/1809.06641.pdf|Fainberg et al 2018 - Talking to myself: self-dialogues as data for conversational agents]]**
  * [[https://github.com/google-research-datasets/dstc8-schema-guided-dialogue|Schema-Guided Dialogue Dataset]] - Paper: [[https://arxiv.org/pdf/1909.05855.pdf|Rastogi et al 2019 - Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset]]
  * [[https://github.com/alexa/Topical-Chat|Topical-Chat]] - Paper: [[https://m.media-amazon.com/images/G/01/amazon.jobs/3079_Paper._CB1565131710_.pdf|Gopalakrishnan et al 2019 - Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations]]
  * [[https://github.com/PolyAI-LDN/conversational-datasets|Repository of Conversational Datasets]] - Paper: [[https://arxiv.org/pdf/1904.06472.pdf|Henderson et al 2019 - A Repository of Conversational Datasets]]

==== Multi-Party Datasets ====
  * [[https://arxiv.org/pdf/2305.08502.pdf|Apel et al 2023 - MeeQA: Natural Questions in Meeting Transcripts]] See the datasets they used for more multi-party datasets.

==== Dialog Act Tagging ====
  * [[https://web.stanford.edu/~jurafsky/ws97/CL-dialog.pdf|Stolcke et al 2000 - Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech]]
  * [[https://www.aclweb.org/anthology/W10-4356.pdf|Boyer et al 2010 - Dialogue Act Modeling in a Complex Task-Oriented Domain]]
  * [[https://arxiv.org/abs/1908.10023|Yu & Yu 2019 - MIDAS: A Dialog Act Annotation Scheme for Open Domain Human Machine Spoken Conversations]]

===== Dialog Agent Safety =====
Safety: making sure the dialog agent doesn't swear, use offensive language, give bad advice, etc. Any of these can happen if the agent has been trained on a corpus of dialogs.

  * [[https://arxiv.org/pdf/1908.06083.pdf|Dinan et al 2019 - Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack]]
  * [[https://arxiv.org/pdf/2010.07079.pdf|Xu et al 2020 - Recipes for Safety in Open-domain Chatbots]]

=== Software ===
  * [[https://www.perspectiveapi.com/|Perspective API]]

===== Evaluation =====
See also [[Chatbots#Evaluation|Chatbots - Evaluation]].
  * [[https://aclanthology.org/P97-1035.pdf|Walker et al 1997 - PARADISE: A Framework for Evaluating Spoken Dialogue Agents]] (task-oriented dialog, I believe)

===== Ideas for future work =====
  * Use dialog to collect training data that improves NER and entity linking (EL) on the fly: as examples get corrected, add them to the training set. There should be prior work on this.

===== Software =====
  * Chatbots
    * Rule-based
      * [[https://freehal.github.io/|FreeHAL]] ([[https://en.wikipedia.org/wiki/FreeHAL|Wikipedia]]) Rule-based, self-learning chatbot
    * Data-driven
  * Task-Oriented Dialog
  * Miscellaneous
    * [[https://github.com/hwchase17/langchain|LangChain]] Framework for building applications with prompting (chaining prompts, etc)

===== Workshops and Conferences =====
  * SIGDIAL: [[https://www.sigdial.org/files/workshops/conference21/proceedings.html|2020]] [[https://sigdial.org/sites/default/files/workshops/conference22/Proceedings/index.html|2021]]
  * IWSDS (International Workshop on Spoken Dialog Systems): [[https://sites.google.com/view/iwsds2023/home|2023]] [[https://www.colips.org/conferences/iwsds2021/wp/|2021]] [[https://dblp.org/db/conf/iwsds/iwsds2017.html|2017]] [[https://dblp.org/db/conf/iwsds/index.html|Proceedings up to 2020 on dblp]] This is a nice workshop.

===== People =====
  * [[https://scholar.google.com/citations?user=pfqzHqUAAAAJ&hl=en|Emily Dinan]]
  * [[https://scholar.google.com/citations?user=Gp-Q8DsAAAAJ&hl=en|Maxine Eskenazi]]
  * [[https://scholar.google.it/citations?user=JBnyLicAAAAJ&hl=en|Andrea Madotto]]
  * [[https://scholar.google.com/citations?user=evVAmhQAAAAJ&hl=en|David Traum]]
  * [[https://scholar.google.com/citations?user=TMw2-csAAAAJ&hl=en|Marilyn Walker]]
  * [[https://scholar.google.com/citations?user=lMkTx0EAAAAJ&hl=en|Jason Weston]]
  * [[https://scholar.google.com/citations?user=jee2Dy0AAAAJ&hl=en|Zhou Yu]]

===== Related Pages =====
  * [[Autonomous Language Agents]]
  * [[Chatbots]]
  * [[Discourse Analysis]]
  * [[robotics:Embodied AI]]
  * [[nlp:vision_and_language#Multimodal Dialog Agents]]
  * [[Task Oriented Dialog]]
  * [[Jeff's List of Interesting Papers]]