Table of Contents

Autonomous Language Agents

Autonomous Language Agents

LLM agents, etc.

Overviews

See the related work of Chen 2023 for a nice overview.
Wang et al 2023 - A Survey on Large Language Model based Autonomous Agents
- LLM Agent Survey (github) - from the above survey, continuously updated
Li et al 2025 - A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning
- LLM Agent Survey (github) - from the above survey, continuously updated
Xi et al 2023 - The Rise and Potential of Large Language Model Based Agents: A Survey
Wang et al 2024 - Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Masterman et al 2024 - The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Architectures
- Sumers et al 2024 - Cognitive Architectures for Language Agents
Memory Architectures
- Rasmussen et al 2025 - Zep: A Temporal Knowledge Graph Architecture for Agent Memory
Multi-Agents
- Tran et al 2025 - Multi-Agent Collaboration Mechanisms: A Survey of LLMs
Applications
- GUI Agents
  - Wang et al 2024 - GUI Agents with Foundation Models: A Comprehensive Survey
  - Zhang et al 2024 - Large Language Model-Brained GUI Agents: A Survey
- Personal Agents
  - Li et al 2024 - Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

Papers

Key Method Papers
- Yao et al 2022 - ReAct: Synergizing Reasoning and Acting in Language Models - The basis of LangChain. See the Webshop experiments section 4 and appendix D.3.
  - Follow-up work: Jiao et al 2024 - Learning Planning-based Reasoning via Trajectories Collection and Process Reward Synthesizing
- Shinn et al 2023 - Reflexion: Language Agents with Verbal Reinforcement Learning
- Zhao et al 2023 - ExpeL: LLM Agents Are Experiential Learners
- Chen et al 2023 - FireAct: Toward Language Agent Fine-tuning Fine-tunes the LLM agent
- CodeAct: Wang et al 2024 - Executable Code Actions Elicit Better LLM Agents
AutoGPT: Yang et al 2023 - Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions github
Zhou et al 2023 - Agents: An Open-source Framework for Autonomous Language Agents
Chen et al 2023 - Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Wu et al 2025 - Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research
OpenAI 2025 - Deep Research
Yano et al 2025 - LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents
Pham et al 2025 - Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems
Tool Use and Agent Skills
- “Agent Skills are instructions, scripts, and resources that agents can discover and use to do things more accurately and efficiently” (from here)
- Claude - Agent Skills (I believe “agent skills” was introduced in Claude)
- Agent Skills (website)
- Li et al 2026 - SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
Software Engineering (SWE) Agents
Web Agents
Mobile UI Agents
- You et al 2024 - Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
OS Agents
- Wu et al 2024 - OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
- Xie et al 2024 - OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Multi-Agents

Overviews
- Tran et al 2025 - Multi-Agent Collaboration Mechanisms: A Survey of LLMs

People

Shunyu Yao website

Related Pages