Differences

This shows you the differences between two versions of the page.

--- nlp:chatbots [2023/07/24 22:41] – [Evaluation] jmflanig
+++ nlp:chatbots [2025/03/07 11:21] (current) – [LLM-Based Chatbots] jmflanig
@@ Line 1: / Line 1: @@
 ====== Chatbots ======
+===== Overviews =====
+  * [[https://arxiv.org/pdf/2406.16937|Dam et al 2024 - A Complete Survey on LLM-based AI Chatbots]]
+===== LLM-Based Chatbots =====
+  * **System descriptions**
+    * [[https://arxiv.org/pdf/2201.08239|LaMDA: Language Models for Dialog Applications]] According to [[https://arxiv.org/pdf/2406.16937|Dam 2024]], this is the basis for Google's Bard and Gemini
+    * [[https://arxiv.org/pdf/2204.05862|Bai et al 2022 - Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback]] Anthropic's chatbot
 ===== Evaluation =====
-  * Overviews
+  * **Overviews**
     * [[https://aclanthology.org/2020.sigdial-1.29.pdf|Finch & Choi 2020 - Towards Unified Dialogue System Evaluation: A Comprehensive Analysis of Current Evaluation Protocols]]
-  * Human Eval
+  * **Human Eval**
     * [[https://aclanthology.org/N19-1170.pdf|2019 - What makes a good conversation? How controllable attributes affect human judgments]]
     * [[https://arxiv.org/pdf/1909.03087.pdf|Li et al 2019 - ACUTE-EVAL: Improved dialogue evaluation with optimized questions and multi-turn comparisons]]
+    * [[https://aclanthology.org/2020.emnlp-main.326.pdf|Deriu et al 2020 - Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems]]
   * [[https://aclanthology.org/2021.acl-long.441.pdf|Zhang et al 2021 - DynaEval: Unifying Turn and Dialogue Level Evaluation]] Has some good references