About Me

I am a Research Scientist at BandAI, ByteDance, working on cutting-edge artificial intelligence technologies. I focus on advancing the capabilities of AI systems through innovative research in deep learning and autonomous agent development. I have been recognized through prestigious talent programs including Alibaba Star and Huawei Genius Youth Program.

My research interests primarily focus on Deep Research, Agentic Reinforcement Learning, Tool Use, and Self Evolution. I am particularly interested in developing intelligent systems that can autonomously learn, adapt, and evolve their capabilities to solve complex real-world problems.

I have published several papers in top-tier conferences, including AAAI, EMNLP, KDD, and contributed to open-source projects like TableBank, DocBank, LayoutLM, and TrOCR that have gained significant impact in the community. Notably, LayoutLM won the 2024 International Congress of Basic Science Frontier Science Award, and my research has received 2,288 citations on Google Scholar. Currently, I am working on next-generation AI systems that can perform autonomous research and continuously improve themselves.

🔬 My Research Interests

  • 🤖 Deep Research
  • 🎯 Agentic Reinforcement Learning
  • 🛠️ Tool Use
  • 🌱 Self Evolution

🔥 News

  • 2025.09: 🌱 Released SamplingEvolve: A test-time scaling framework that transforms model sampling from independent trajectories to experience-guided evolution, achieving 91.36% accuracy on GAIA dataset through continuous trajectory optimization.
  • 2025.08: 📊 Published ReportBench: The first systematic benchmark for evaluating Deep Research agents on academic survey tasks, featuring automated construction from arXiv papers and comprehensive fact-checking mechanisms.

📖 Education

  • 2019.09 - 2024.06, Ph.D. in Computer Science and Technology, Beihang University, Joint Ph.D. program with Microsoft Research Asia (MSRA)
  • 2015.09 - 2019.06, Bachelor in Computer Science and Technology, Beihang University

💼 Experience

  • 2025.04 - Present, Research Scientist, BandAI, ByteDance, Beijing
    • Research Focus: Deep Research, Agentic Reinforcement Learning, Tool Use, Self Evolution
    • Working on next-generation AI systems capable of autonomous research and self-improvement
    • Developing advanced agentic frameworks for complex problem solving
  • 2024.03 - 2025.04, Research Scientist, Seed, ByteDance, Beijing
    • Research Focus: User Preference Optimization, Instruction Following, Model Alignment
    • User Flywheel - Semantic Signal Optimization: Developed methods to extract user preference signals from conversation patterns where users modify prompts or add critiques when unsatisfied with initial responses, constructing RM training data to improve model alignment with user preferences
    • User Flywheel - Behavioral Signal Optimization: Built PointRM, a binary classification model to identify user copy behavior, using it to create preference pairs from equal-rated responses, achieving 3.5% improvement in human evaluation
    • Instruction Following Optimization: Designed verifiable atomic instructions across multiple languages and user needs, combined with nested logical relationships (AND, OR, NOT, IF) for instruction composition, achieving ~10 point improvement on public benchmarks and 10% advantage in human evaluation GSB
  • 2023.02 - 2024.02, Research Intern, Conversational AI, DAMO Academy, Alibaba, Beijing
    • Research Focus: Tool-Augmented LLMs, Human Alignment
    • Developed API-Bank benchmark and PRO (Preference Ranking Optimization) method
    • Contributed to evaluation standards for tool-calling capabilities in LLMs
  • 2018.07 - 2023.02, Research Intern, Natural Language Computing Group, Microsoft Research Asia (MSRA), Beijing
    • Mentor: Dr. Lei Cui
    • Research Focus: Document Intelligence, Multi-modal Pre-training Models
    • Long-term internship as part of joint Ph.D. program

🎖 Honors and Awards

  • 2024 International Congress of Basic Science Frontier Science Award (for LayoutLM) - Awarded at the International Congress of Basic Science founded by S.T. Yau, hosted by Beijing Municipal Government, Ministry of Science and Technology, China Association for Science and Technology, and International Congress of Chinese Mathematicians
  • 2024 Alibaba Star & Huawei Genius Youth Program

📝 Publications

LLMs Agents

Document AI