工作簡歷
2011.09 - 2015.06.18,南京航空航天大學,本科
毛航宇研究方向為強化學習、大模型智能體系統及芯片架構。在人工智能頂會和頂刊上發表論文 50 余篇,申請專利 10 余項,作為負責人和核心骨干承擔 10 余項國家自然科學基金項目、中國科學院引才項目、千萬級別企業內部項目、校企合作項目等,獲多項省部級及以上獎勵。長期擔任人工智能頂會的高級程序委員會委員、地方政府智庫專家;曾在多家高科技互聯網企業擔任研發團隊負責人,具備豐富的產業落地經驗,主導開發的大模型智能體系統實現超千萬的用戶規模及月活。
強化學習、智能體與多智能體系統、大模型、AI 芯片與系統
全部論文參考谷歌學術:??
https://scholar.google.com/citations?user=EtVHsgcAAAAJ
本人指導的學生一作/本人二作或共同一作(5 篇代表作):??
1. Guanting Dong*, Hangyu Mao*, Kai Ma, Licheng Bao, Yifei Chen, Zhongyuan??
Wang, Zhongxia Chen, Jiazhen Du, Huiyang Wang, Fuzheng Zhang, Guorui Zhou,??
Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. Agentic Reinforced Policy??
Optimization. ICLR 2026.
2. Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang??
Fan. Sequential Asynchronous Action Coordination in Multi-Agent Systems: A??
Stackelberg Decision Transformer Approach. ICML 2024 (CCF-A).
3. Yiqun Chen, Hangyu Mao, Jiaxin Mao, Shiguang Wu, Tianle Zhang, Bin Zhang, Wei??
Yang, and Hongxing Chang. PTDE: Personalized Training with Distilled Execution??
for Multi-Agent Reinforcement Learning. IJCAI 2024 (CCF-A).
4. Mingzhe Xing, Hangyu Mao, Shenglin Yin, Lichen Pang, Zhengchao Zhang, Zhen??
Xiao, and Jieyi Long. A Dual-Agent Scheduler for Distributed Deep Learning Jobs??
on Public Cloud via Reinforcement Learning. KDD 2023 (CCF-A).
5. Mingzhe Xing, Hangyu Mao, and Zhen Xiao. Fast and Fine-grained Autoscaler for??
Streaming Jobs with Reinforcement Learning. IJCAI 2022 (CCF-A)
人才隊伍