User Tools

Site Tools


nlp:chatbots

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
nlp:chatbots [2023/07/24 22:41] – [Evaluation] jmflanignlp:chatbots [2025/03/07 11:21] (current) – [LLM-Based Chatbots] jmflanig
Line 1: Line 1:
 ====== Chatbots ====== ====== Chatbots ======
 +
 +===== Overviews =====
 +  * [[https://arxiv.org/pdf/2406.16937|Dam et al 2024 - A Complete Survey on LLM-based AI Chatbots]]
 +
 +===== LLM-Based Chatbots =====
 +  * **System descriptions**
 +    * [[https://arxiv.org/pdf/2201.08239|LaMDA: Language Models for Dialog Applications]] According to [[https://arxiv.org/pdf/2406.16937|Dam 2024]], this is the basis for Google's Bard and Gemini
 +    * [[https://arxiv.org/pdf/2204.05862|Bai et al 2022 - Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback]] Anthropic's chatbot
  
 ===== Evaluation ===== ===== Evaluation =====
-  * Overviews+  * **Overviews**
     * [[https://aclanthology.org/2020.sigdial-1.29.pdf|Finch & Choi 2020 - Towards Unified Dialogue System Evaluation: A Comprehensive Analysis of Current Evaluation Protocols]]     * [[https://aclanthology.org/2020.sigdial-1.29.pdf|Finch & Choi 2020 - Towards Unified Dialogue System Evaluation: A Comprehensive Analysis of Current Evaluation Protocols]]
-  * Human Eval+  * **Human Eval**
     * [[https://aclanthology.org/N19-1170.pdf|2019 - What makes a good conversation? How controllable attributes affect human judgments]]     * [[https://aclanthology.org/N19-1170.pdf|2019 - What makes a good conversation? How controllable attributes affect human judgments]]
     * [[https://arxiv.org/pdf/1909.03087.pdf|Li et al 2019 - ACUTE-EVAL: Improved dialogue evaluation with optimized questions and multi-turn comparisons]]     * [[https://arxiv.org/pdf/1909.03087.pdf|Li et al 2019 - ACUTE-EVAL: Improved dialogue evaluation with optimized questions and multi-turn comparisons]]
 +    * [[https://aclanthology.org/2020.emnlp-main.326.pdf|Deriu et al 2020 - Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems]]
   * [[https://aclanthology.org/2021.acl-long.441.pdf|Zhang et al 2021 - DynaEval: Unifying Turn and Dialogue Level Evaluation]] Has some good references   * [[https://aclanthology.org/2021.acl-long.441.pdf|Zhang et al 2021 - DynaEval: Unifying Turn and Dialogue Level Evaluation]] Has some good references
  
nlp/chatbots.1690238511.txt.gz · Last modified: 2023/07/24 22:41 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki