Copyright notice
The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Preprint

SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang, Yasheng Wang, Fei Mi, Pingyi Zhou, Yao Wan, Xiao Liu, Li Li, Hao Wu, Jin Liu, Xin Jiang
arXiv 2021.
PDF arXiv

2024

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu
ACM Computing Surveys 2024.
PDF arXiv
HonestLLM: Toward an Honest and Helpful Large Language Model
Chujie Gao, Siyuan Wu, Yue Huang, Dongping Chen, Qihui Zhang, Zhengyan Fu, Yao Wan*, Lichao Sun, Xiangliang Zhang
NeurIPS 2024. The 38th Annual Conference on Neural Information Processing Systems
PDF arXiv CCF-A
Pandora's Box: Towards Building Universal Attackers against Real-World Large Vision-Language Models
Daizong Liu, Mingyu Yang, Xiaoye Qu, Pan Zhou, Xiang Fang, Keke Tang, Yao Wan, Lichao Sun
NeurIPS 2024. The 38th Annual Conference on Neural Information Processing Systems
PDF CCF-A
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan, Yao Wan, Zhangqian Bi, Zheng Wang, Hongyu Zhang, Pan Zhou, Lichao Sun
EMNLP 2024 (Findings). The 2024 Conference on Empirical Methods in Natural Language Processing
PDF arXiv CCF-B
Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates
Zhihong Sun, Yao Wan, Jia Li, Hongyu Zhang, Zhi Jin, Ge Li, Chen Lyu
ASE 2024. The 39th IEEE/ACM International Conference on Automated Software Engineering
PDF arXiv CCF-A
Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback
Zhangqian Bi, Yao Wan*, Zheng Wang, Hongyu Zhang, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin, Xuanhua Shi
ACL 2024 (Findings). The 62nd Annual Meeting of the Association for Computational Linguistics
PDF arXiv CCF-A
KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents
Yihe Wang, Jin Liu, Yao Wan, Yitong Li, Zifeng Liu, Weipeng Chen
ACL 2024 (Findings, Short Paper). The 62nd Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark
Dongping Chen, Ruoxi Chen, Shilin Zhang, Yinuo Liu, Yaochen Wang, Huichi Zhou, Qihui Zhang, Yao Wan*, Pan Zhou, Lichao Sun
ICML 2024 (Oral). The Forty-first International Conference on Machine Learning
PDF arXiv CCF-A
Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs
Zhihong Sun, Chen Lyu, Bolun Li, Yao Wan, Hongyu Zhang, Ge Li, Zhi Jin
COLING 2024. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation
PDF CCF-B
kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning
Wenting Zhao, Ye Liu, Yao Wan, Yibo Wang, Qingyang Wu, Zhongfen Deng, Jiangshu Du, Shuaiqi Liu, Yunlong Xu, Philip S. Yu
NAACL 2024. 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics
PDF CCF-B
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text
Wenting Zhao, Ye Liu, Tong Niu, Yao Wan, Philip S. Yu, Shafiq Joty, Yingbo Zhou, Semih Yavuz
NAACL 2024 (Findings). 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics
PDF CCF-B
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?
Qihui Zhang, Chujie Gao, Dongping Chen, Yue Huang, Yixin Huang, Zhenyang Sun, Shilin Zhang, Weiye Li, Zhengyan Fu, Yao Wan, Lichao Sun
NAACL 2024 (Findings). 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics
PDF CCF-B
Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation
Zhaoyang Chu, Yao Wan*, Qian Li, Yang Wu, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin
ISSTA 2024. The ACM SIGSOFT International Symposium on Software Testing and Analysis
PDF CCF-A
Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study
Yang Wu#, Yao Wan#*, Hongyu Zhang, Yulei Sui, Wucai Wei, Wei Zhao, Guandong Xu, Hai Jin
SIGMOD 2024. ACM Special Interest Group on Management of Data
PDF CCF-A
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
Yue Huang, Jiawen Shi, Yuan Li, Chenrui Fan, Siyuan Wu, Qihui Zhang, Yixin Liu, Pan Zhou, Yao Wan, Neil Zhenqiang Gong, Lichao Sun
ICLR 2024. The Twelfth International Conference on Learning Representations
PDF
IRCoCo: Immediate Rewards-Guided Deep Reinforcement Learning for Code Completion
Bolun Li, Zhihong Sun, Tao Huang, Hongyu Zhang, Yao Wan, Ge Li, Zhi Jin, Chen Lyu
FSE 2024. The ACM International Conference on the Foundations of Software Engineering
PDF CCF-A
NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries
Wei Zhao, Zhitao Hou, Siyuan Wu, Yan Gao, Haoyu Dong, Yao Wan*, Hongyu Zhang, Yulei Sui, Haidong Zhang
EACL 2024 (Findings). The 18th Annual Meeting of the European Chapter of the Association for Computational Linguistics
PDF

2023

SiMFy: A Simple Yet Effective Approach for Temporal Knowledge Graph Reasoning
Zhengtao Liu, Lei Tan, Mengfan Li, Yao Wan, Hai Jin, Xuanhua Shi
EMNLP 2023 (Findings). The 2023 Conference on Empirical Methods in Natural Language Processing
PDF CCF-B
Localize, Retrieve and Fuse: A Generalized Framework for Free-Form Question Answering over Tables
Wenting Zhao, Ye Liu, Yao Wan, Yibo Wang, Zhongfen Deng and Philip S. Yu
IJCNLP-AACL 2023 (Findings). The 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
PDF
Named Entity Recognition via Machine Reading Comprehension: A Multi-Task Learning Approach
Yibo Wang, Wenting Zhao, Yao Wan, Zhongfen Deng and Philip Yu
IJCNLP-AACL 2023 (Short Paper, Findings). The 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
PDF
FedGKD: Towards Heterogeneous Federated Learning via Global Knowledge Distillation
Dezhong Yao, Wanning Pan, Yutong Dai, Yao Wan, Xiaofeng Ding, Chen Yu, Hai Jin, Zheng Xu, and Lichao Sun
TC 2023. IEEE Transactions on Computers
PDF CCF-A
Collaborative Knowledge Graph Fusion by Exploiting the Open Corpus
Yue Wang, Yao Wan, Lu Bai, Lixin Cui, Zhuo Xu, Ming Li, Philip S. Yu, and Edwin R. Hancock
TKDE 2023. IEEE Transactions on Knowledge and Data Engineering
PDF CCF-A
Summarizing source code with Heterogeneous Syntax Graph and dual position
Juncai Guo, Jin Liu, Xiao Liu, Yao Wan, Li Li
IPM 2023. Information Processing & Management
PDF CCF-B
Diverse title generation for Stack Overflow posts with multiple-sampling-enhanced transformer
Fengji Zhang, Jin Liu, Yao Wan, Xiao Yu, Xiao Liu, Jacky Keung
JSS 2023. Journal of Systems and Software
PDF CCF-B
Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks
Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, Yuting Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang
TWEB 2023. ACM Transactions on the Web
PDF CCF-B
SOR-TC: Self-Attentive Octave ResNet with Temporal Consistency for Compressed Video Action Recognition
Junsan Zhang, Xiaomin Wang, Yao Wan, Leiquan Wang, Philip S. Yu, and Zehua Zhang
Neurocomputing 2023.
PDF CCF-C

2022

Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding
Jiahao Zhu, Daizong Liu, Pan Zhou, Xing Di, Yu Cheng, Song Yang, Wenzheng Xu, Zichuan Xu, Yao Wan, Lichao Sun and Zeyu Xiong
EMNLP 2022 (Findings). The 2022 Conference on Empirical Methods in Natural Language Processing
PDF CCF-B
You See What I Want You to See: Poisoning Vulnerabilities in Neural Code Search
Yao Wan, Shijie Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Dezhong Yao, Hai Jin, and Lichao Sun
ESEC/FSE 2022. The 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering.
PDF CCF-A
NaturalCC: An Open-Source Toolkit for Code Intelligence
Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Yulei Sui, Hongyu Zhang, Kazuma Hashimoto, Hai Jin, Guandong Xu, Caiming Xiong, Philip S. Yu
ICSE 2022 Demo Track.
PDF arXiv Code Homepage CCF-A
What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code
Yao Wan, Wei Zhao, Hongyu Zhang, Yulei Sui, Guandong Xu and Hai Jin
ICSE 2022. The 44th ACM/IEEE International Conference on Software Engineering, May 21–29, 2022, Pittsburgh, PA, USA.
PDF CCF-A
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu
NAACL 2022 (Findings). 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics
PDF CCF-C
Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection
Jianguo Zhang, Kazuma Hashimoto, Yao Wan, Zhiwei Liu, Ye Liu, Caiming Xiong, Philip Yu
ACL 2022 Workshop NLP ConvAI.
PDF arXiv
Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization
Juncai Guo, Jin Liu, Yao Wan, Li Li, Pingyi Zhou
ACL 2022. The 60th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
Compilable Neural Code Generation with Compiler Feedback
Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu
ACL 2022 (Findings). The 60th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
FedBERT: When Federated Learning Meets Pre-Training
Yuanyishu Tian, Yao Wan, Lingjuan Lyu, Dezhong Yao, Hai Jin, Lichao Sun
TIST 2022. ACM Transactions on Intelligent Systems and Technology
PDF
Cross-Language Binary-Source Code Matching with Intermediate Representations
Yi Gui, Yao Wan*, Hongyu Zhang, Huifang Huang, Yulei Sui, Guandong Xu, Zhiyuan Shao and Hai Jin
SANER 2022. 29th IEEE International Conference on Software Analysis, Evolution and Reengineering
PDF CCF-B
DANets: Deep Abstract Networks for Tabular Data Processing
Jintai Chen, KuanLun Liao, Yao Wan, Danny Chen, Jian Wu
AAAI 2022. The 36th AAAI Conference on Artificial Intelligence (AAAI)
PDF CCF-A

2021

XCode: Towards Cross-Language Code Representation with Large-Scale Pre-Training
Zehao Lin, Guodun Li, Jingfeng Zhang, Yue Deng, Xiangji Zeng, Yin Zhang, Yao Wan
TOSEM 2021. ACM Transactions on Software Engineering and Methodology
PDF CCF-A
Multi-Triage: A Multi-Task Learning Framework for Bug Triage
Thazin Win Win Aung, Yao Wan, Huan Huo and Yulei Sui
JSS 2021. Journal of Systems and Software
PDF CCF-B
Modeling Sequential Listening Behaviors with Attentive Temporal Point Process for Next and Next New Music Recommendation
Dongjing Wang, Xin Zhang, Yao Wan, Dongjin Yu, Guandong Xu, Shuiguang Deng
TMM 2021. IEEE Transactions on Multimedia
PDF CCF-B
Fix-Filter-Fix: Intuitively Connect Any Models for Effective Multi-task Bug Fixing
Haiwen Hong, Jingfeng Zhang, Yin Zhang, Yao Wan and Yulei Sui
EMNLP 2021. The 2021 Conference on Empirical Methods in Natural Language Processing
PDF CCF-B
Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots
Wenting Zhao, Ye Liu, Yao Wan and Philip Yu
EMNLP 2021 (Findings). The 2021 Conference on Empirical Methods in Natural Language Processing
PDF CCF-B
HETFORMER: Heterogeneous Transformer with Sparse Attention for Long-Text Extractive Summarization
Ye Liu, Jianguo Zhang, Yao Wan, Congying Xia, Lifang He and Philip Yu
EMNLP 2021 (Short Paper). The 2021 Conference on Empirical Methods in Natural Language Processing
PDF CCF-B
Disentangled Code Representation Learning for Multiple Programming Languages
Jingfeng Zhang, Haiwen Hong, Yin Zhang, Yao Wan, Ye Liu, Yulei Sui
ACL 2021 (Findings). The 59th Annual Meeting of the Association for Computational Linguistics
PDF CCF-A
Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
Ye Liu, Yao Wan, Jianguo Zhang, Wenting Zhao and Philip S. Yu
EACL 2021. The 16th Conference of the European Chapter of the Association for Computational Linguistics
PDF arXiv Top conference in NLP
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
Ye Liu, Yao Wan, Lifang He, Hao Peng, Philip S. Yu
AAAI 2021. The 35th AAAI Conference on Artificial Intelligence (AAAI)
PDF arXiv CCF-A

2020

Cross-Supervised Joint-Event-Extraction with Heterogeneous Information Networks
Yue Wang, Zhuo Xu, Lu Bai, Yao Wan, Lixin Cui, Qian Zhao, Edwin Hancock, Philip Yu
ICPR 2020. The 25th International Conference on Pattern Recognition (ICPR)
PDF CCF-C
A Dual Strategy for Slot-Value Prediction through Reading Comprehension on Multi-Domain Dialog State Tracking
Jianguo Zhang, Kazuma Hashimoto, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher and Caiming Xiong
STARSEM 2020. The 9th Joint Conference on Lexical and Computational Semantics
PDF arXiv Top conference in NLP
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher and Caiming Xiong
EMNLP 2020. The 2020 Conference on Empirical Methods in Natural Language Processing
PDF Slides Code CCF-B
FCCA: Hybrid Code Representation for Functional Clone Detection Using Attention Networks
Wei Hua, Yulei Sui, Yao Wan, Guangzhong Liu, Guandong Xu
TRel 2020. IEEE Transactions on Reliability
PDF Top journal in reliability
Reinforcement-Learning-Guided Source Code Summarization via Hierarchical Attention
Wenhua Wang, Yuqun Zhang, Yulei Sui, Yao Wan, Zhou Zhao, Jian Wu, Philip S. Yu, Guandong Xu
TSE 2020. IEEE Transactions on Software Engineering
PDF CCF-A
Multi-View Factorization Machines for Mobile App Recommendation based on Hierarchical Attention
Tingting Liang, Lei Zheng, Liang Chen, Yao Wan, Philip S. Yu, Jian Wu
KBS 2020. Knowledge-Based Systems
PDF CCF-C

2019

Multi-Modal Attention Network Learning for Semantic Source Code Retrieval
Yao Wan, Jingdong Shu, Yulei Sui, Guandong Xu, Zhou Zhao, Jian Wu, Philip S. Yu
ASE 2019. The 34th ACM/IEEE International Conference on Automated Software Engineering, November 11–15, 2019, San Diego, United States. ACM, New York, NY, USA.
PDF Code CCF-A
Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking
Yue Wang, Yao Wan, Chenwei Zhang, Lu Bai, Lixin Cui, Philip S. Yu
ICDM 2019. The 19th International Conference on Data Mining, November 8–11, 2019, Beijing, China.
PDF CCF-B
Multi-Modal Generative Adversarial Network for Short Product Title Generation in Mobile E-Commerce
Jian-Guo Zhang, Pengcheng Zou, Zhao Li, Yao Wan, Xiuming Pan, Yu Gong, Philip S. Yu
NAACL-HLT 2019. The 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, June 2-7, 2019, Minneapolis, USA.
PDF CCF-C

2018

Improving Automatic Source Code Summarization via Deep Reinforcement Learning
Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu
ASE 2018. The 33rd ACM/IEEE International Conference on Automated Software Engineering, September 3–7, 2018, Montpellier, France.
PDF Code CCF-A
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Yao Wan, Wenqiang Yan, Jianwei Gao, Zhou Zhao, Jian Wu, Philip S. Yu
IEEE BigData 2018. 2018 IEEE International Conference on Big Data, December 10–13, 2018, Seattle, WA, USA
PDF CCF-C
SCSMiner: Mining Social Coding Sites for Software Developer Recommendation with Relevance Propagation
Yao Wan, Liang Chen, Guandong Xu, Zhou Zhao, Jie Tang, Jian Wu
WWWJ. World Wide Web Journal
PDF CCF-B
Exploiting Cross-source Knowledge for Warming up Community Question Answering Services
Yao Wan, Guandong Xu, Liang Chen, Zhou Zhao, Jian Wu
Neurocomputing.
PDF CCF-C
Product Title Refinement via Multi-Modal Generative Adversarial Learning
Jian-Guo Zhang, Pengcheng Zou, Zhao Li, Yao Wan, Ye Liu, Xiuming Pan, Yu Gong, Philip S. Yu
NeurIPS Workshop on Visually Grounded Interaction and Language, December 7th, 2018, Montreal, Canada.
PDF

2017 and before

Exploiting Geographical Location for Team Formation in Social Coding Sites
Yuqiang Han, Yao Wan, Liang Chen, Guandong Xu, Jian Wu
PAKDD 2017. The 21st Pacific Asia Conference on Knowledge Discovery and Data Mining, Jeju, South Korea, May 23-26, 2017
PDF CCF-C
Incorporating Heterogeneous Information for Mashup Discovery with Consistent Regularization
Yao Wan, Liang Chen, Qi Yu, Tingting Liang, Jian Wu
PAKDD 2016. The 20th Pacific Asia Conference on Knowledge Discovery and Data Mining, Auckland, New Zealand, April 19-22, 2016
PDF CCF-C
Time-aware API Popularity Prediction via Heterogeneous Features
Yao Wan, Liang Chen, Jian Wu, Qi Yu
ICWS 2015. The 22nd International Conference on Web Services, Application Track, New York, USA, June 27 - July 2, 2015
PDF