About Me

I am currently a Lecturer (a.k.a. Assistant Professor in the USA) with the CGCL/SCTS/BDTS Lab, College of Computer Science and Technology at Huazhong University of Science and Technology (HUST), Wuhan, China. Prior to that, I received my Ph.D. degree from Zhejiang University in December 2019, under the supervision of Prof. Jian Wu and Prof. Zhou Zhao. I also work closely with Prof. Yulei Sui from the University of Technology Sydney. My research focuses mainly on the synergy between Artificial Intelligence (AI) and Software Engineering (a.k.a. ASE), especially natural language processing, programming language analysis, data mining, and machine learning.

During my Ph.D. studies, I was fortunate to have three wonderful visiting experiences, hosted by three distinguished professors who gave me much support, and I was happy to meet many wonderful friends and collaborators along the way. I visited the Shenzhen Research Institute, Chinese University of Hong Kong, China (working with Prof. Zibin Zheng) in 2014; the University of Technology Sydney, Australia (working with Prof. Guandong Xu) in 2016; and the University of Illinois at Chicago, USA (working with Prof. Philip S. Yu) in 2018.

(I am looking for highly motivated graduate and undergraduate students to work with me. If you are interested, please send me an email.)

Research Highlights

NaturalCC is a sequence modeling toolkit that allows researchers and developers to train custom models for many software engineering tasks, such as code summarization, code generation, code retrieval, and code clone detection. Our vision is to bridge the gap between programming languages and natural languages through machine learning techniques. [FSE'22, ICSE'22, ASE'18, ASE'19, TSE'20, ACL'21, EMNLP'21, TOSEM'21]

Selected Publications

You See What I Want You to See: Poisoning Vulnerabilities in Neural Code Search
Yao Wan, Shijie Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Dezhong Yao, Hai Jin, and Lichao Sun
ESEC/FSE 2022. The 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering.
PDF CCF-A
NaturalCC: An Open-Source Toolkit for Code Intelligence
Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Yulei Sui, Hongyu Zhang, Kazuma Hashimoto, Hai Jin, Guandong Xu, Caiming Xiong, Philip S. Yu
ICSE 2022 Demo Track.
PDF arXiv Code Homepage CCF-A
What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code
Yao Wan, Wei Zhao, Hongyu Zhang, Yulei Sui, Guandong Xu, and Hai Jin
ICSE 2022. The 44th ACM/IEEE International Conference on Software Engineering
PDF CCF-A
Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization
Juncai Guo, Jin Liu, Yao Wan, Li Li, Pingyi Zhou
ACL 2022. The 60th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
Compilable Neural Code Generation with Compiler Feedback
Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu
Findings of ACL 2022. The 60th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
Disentangled Code Representation Learning for Multiple Programming Languages
Jingfeng Zhang, Haiwen Hong, Yin Zhang, Yao Wan, Ye Liu, Yulei Sui
Findings of ACL 2021. The 59th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
Multi-Modal Attention Network Learning for Semantic Source Code Retrieval
Yao Wan, Jingdong Shu, Yulei Sui, Guandong Xu, Zhou Zhao, Jian Wu, Philip S. Yu
ASE 2019. The 34th ACM/IEEE International Conference on Automated Software Engineering
PDF Code CCF-A
Improving Automatic Source Code Summarization via Deep Reinforcement Learning
Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu
ASE 2018. The 33rd ACM/IEEE International Conference on Automated Software Engineering
PDF Code CCF-A

Professional Services

  Conference PC/Reviewer
  • ACL: 2022, 2021; EMNLP: 2021; AAAI: 2022, 2021; IJCAI: 2021; SIGKDD: 2022; WSDM: 2022; COLING: 2020; NLPCC: 2020; BESC: 2021, 2020
  Journal Reviewer
  • TSE: 2021; TKDE: 2021; WWWJ: 2017-2021; TRel: 2020