All Publications (34+)

Complete publication list

2026

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Agent Infrastructure, Human-Agent Collaboration, Skill Marketplace
2026

Story2Proposal: A Scaffold for Structured Scientific Paper Writing

Multi-Agent Framework, Scientific Writing, Visual Contract
2026

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

LLM Reasoning, Adaptive Cognitive Modes, Chain of Mindset
2026

Controlled Self-Evolution for Algorithmic Code Optimization

Self-Evolution, CodeAgent, Genetic Algorithm, EffiBench
2026

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Self-Evolution, DeepResearch, Finite State Machine, Multi-hop QA
ACL 2026

CloneMem: Benchmarking Long-Term Memory for AI Clones

Agent Memory, AI Clone, Temporal Reasoning
ACL 2026

RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction

Agent Memory, Cross-session Dialog, Real-world Interaction
ACL 2026

MirrorQA: Benchmarking Multimodal LLMs on Mirror-Orientation Reasoning

Multimodal LLM, Mirror Reasoning, Benchmark
ACL 2026

Tiny Scales, Great Challenges: The Limits of Multimodal LLMs in Scale Recognition

Multimodal LLM, Scale Recognition, Benchmark
ACL 2026

SafetyMem: Adaptive Jailbreak Defense via Dual-Component Safety Memory

Jailbreak Defense, Safety Memory, LLM Safety
ACL 2026

LiveCANNBench: Benchmark SWE AI Coding for Ascend CANN

SWE, AI Coding, Benchmark, Ascend CANN
2025

🐙 Octopus: Agentic Multimodal Reasoning with Six-Capability Orchestration

Multimodal Reasoning, Agentic Framework, arXiv Preprint
NeurIPS 2025 Spotlight