ml:trustworthy_ai

Differences

This shows you the differences between two versions of the page.

ml:trustworthy_ai [2023/10/12 18:15] jmflanig | ml:trustworthy_ai [2025/05/30 17:10] (current) – [Overviews] jmflanig
Line 1:
 ====== Trustworthy AI ======
 +
 +===== Overviews =====
 +  * [[https://arxiv.org/pdf/2107.06641|Liu et al 2021 - Trustworthy AI: A Computational Perspective]]
 +  * [[https://arxiv.org/pdf/2110.01167|Li et al 2021 - Trustworthy AI: From Principles to Practices]]
 +  * [[https://dl.acm.org/doi/10.1145/3491209|Kaur et al 2022 - Trustworthy Artificial Intelligence: A Review]]
 +  * [[https://arxiv.org/pdf/2306.00380|Wu et al 2023 - Survey of Trustworthy AI: A Meta Decision of AI]]
 +  * **LLMs**
 +    * [[https://arxiv.org/abs/2308.05374|Liu et al 2023 - Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment]]
  
 ===== Papers =====
   * [[https://arxiv.org/pdf/2010.07487.pdf|Jacovi et al 2020 - Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI]]
 +
 +===== LLM Trustworthiness =====
 +{{media:llm_trustworthy_pillars.png}}\\
 +Figure from [[https://trustllm.ai/|here]].
  
 ===== Related Pages =====
   * [[nlp:Explainability]]
   * [[nlp:Hallucination and Factivity]]
 +  * [[Mechanistic Interpretability]]
  
ml/trustworthy_ai.1697134541.txt.gz · Last modified: 2023/10/12 18:15 by jmflanig
