LLM R&D Lead
Leading the model R&D team of CodeBuddy / WorkBuddy. Large language model, code intelligence, AI agent.
LLM R&D Lead, CodeBuddy/WorkBuddy
Tencent, Shenzhen, China
Email: donghd66 AT gmail.com
Google Scholar
•
Github
I am the LLM R&D Lead of Tencent CodeBuddy/WorkBuddy, leading the model research and development team. My expertise is centered around Large Language Models, spanning pre-training, post-training, reinforcement learning, and LLM Agent. Currently, I am focused on leveraging the vast experience data generated by widely deployed Agent applications to enhance model capabilities.
Leading the model R&D team of CodeBuddy / WorkBuddy. Large language model, code intelligence, AI agent.
Model research and development of CodeBuddy. Large language model, code intelligence, RAG, code agent.
Code understanding and generation, pretrained language model, large language model.
Advisor: Prof. Xiangnan He
Chung-Yao Chao Talent Program in Applied Physics
Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective
ReCreate: Reasoning and Creating Domain Agents Driven by Experience
LEPO: Latent Reasoning Policy Optimization for Large Language Models
ACL 2026 Findings • Corresponding author
Scheduling Your LLM Reinforcement Learning with Reasoning Trees
AP2O: Correcting LLM-Generated Code Errors Type by Type Like Humans via Adaptive Progressive Preference Optimization
UniSVG: A Unified Dataset for Vector Graphic Understanding and Generation with Multimodal Large Language Models
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse
Survey of Code Search based on Deep Learning
ACM TOSEM • arXiv
Bias and Debias in Recommender System: A Survey and Future Directions
ACM TOIS • arXiv
AutoDebias: Learning to Debias for Recommendation