80 papers compiled into a milestone node with 9 children across 3 layers. This is what the LLM writes and maintains — you never touch it.
Three converging trends crystallized agent self-evolution as a distinct research direction:
| Child | Definition | Papers |
|---|---|---|
| MECHANISM self-evolving-skill-libraries | Autonomous construction and accumulation of reusable executable skills | 7 |
| MECHANISM memory-evolution | Evolution of agent memory from architecture to content | 12 |
| MECHANISM experience-driven-policy-evolution | Policy evolution through training-time or test-time learning from trajectories | 5 |
| MECHANISM llm-guided-evolutionary-search | LLM agents as variation operators in evolutionary program search | 8 |
| MECHANISM multi-agent-co-evolution | Co-evolution of agent populations and coordination strategies | 8 |
| APPLICATION domain-applications | Domain-specific instantiation of agent self-evolution in science and medicine | 10 |
| CROSS-CUTTING agentic-evolution-theory | Theoretical foundations defining agent self-evolution as a scaling axis | 6 |
| CROSS-CUTTING agent-safety-adversarial-evolution | Safety challenges and adversarial dynamics for self-evolving agents | 5 |
| CROSS-CUTTING evolving-agent-surveys-benchmarks | Surveys, taxonomies, and evaluation infrastructure | 6 |
Children organize into three orthogonal layers:
Five children address distinct evolution targets: skill repertoire (self-evolving-skill-libraries), memory system (memory-evolution), decision policy (experience-driven-policy-evolution), programs/algorithms (llm-guided-evolutionary-search), agent populations (multi-agent-co-evolution).
domain-applications aggregates scientific and clinical instantiations that validate mechanism-level principles under real-world constraints — physics-grounded evaluation in science, safety-constrained evolution in medicine.
agentic-evolution-theory provides the conceptual vocabulary (evolution-time compute, clone-and-replace, epistemic routing). agent-safety-adversarial-evolution constrains how mechanisms can operate. evolving-agent-surveys-benchmarks provides evaluation infrastructure.
Mechanism children materialize in applications: skill repertoire → scientific tool synthesis (venusfactory2, skillfoundry); memory system → clinical case accumulation (theraagent, skingpt-x); programs/algorithms → surrogate discovery (aero-blueprint, lensagent).