• 01/2022: Two papers are accepted by IJCNN 2022 ~


Hi, I am Zheng Gao, an Applied Scientist at Amazon Alexa AI. I received my Ph.D. degree in Information Science and minor in Computer Science from Indiana University Bloomington, advised by Prof. Xiaozhong Liu in 2020. My research interests are primarily in the area of Graph Mining and Natural Language Processing (NLP). Particularly, I am applying deep learning techniques on the interdisciplinary field therein them to solve Community Detection, Information Retrieval and Recommendation related tasks.


  • Ph.D. in Information Science
    Minor in Computer Science
    Indiana University Bloomington, United States (2015 - 2020)
  • M.S. in Information Science
    University of Pittsburgh, United States (2013 - 2015)
  • B.M. in Information Management and System
    Shanghai International Studies University, China (2009 - 2013)


  • Applied Scientist II, Amazon Alexa AI (06/2020 - now)
  • Data Scientist Intern, Amazon Alexa AI (06/2019 - 09/2019)
  • NLP Research Intern, Alibaba DAMO Academy / AI Lab (02/2018 - 03/2019)


  • Xiyao Ma, Zheng Gao, Qian Hu, Mohamed AbdelHady. HCL: Hybrid contrastive learning for graph-based recommendation. International Joint Conference on Neural Networks (IJCNN), 2022. [PDF][Video][Slides][Poster]
  • Xiyao Ma, Qian Hu, Zheng Gao, Mohamed AbdelHady. Contrastive Co-training for Diversified Recommendation. International Joint Conference on Neural Networks (IJCNN), 2022. [PDF][Video][Slides]
  • Xiyao Ma*,Zheng Gao*, Qian Hu, Mohamed AbdelHady. Contrastive Knowledge Graph Attention Network For Request-based Recipe Recommendation. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. [PDF][Video][Slides][Poster]
  • Zheng Gao, Chun Guo, Shutian Ma, Xiaozhong Liu. Improving Community Detection Performance in Heterogeneous Music Network by Learning Edge-type Usefulness Distribution. International Conference on Information, 2022. [PDF][Video] [Slides]
  • Zheng Gao, Mohamed AbdelHady, Radhika Arava, Xibin Gao, Qian Hu, Wei Xiao, and Thahir Mohamed. X-SHOT: Learning to Rank Voice Applications via Cross-Locale Shard-based Co-Training. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021. [PDF] [Video] [Slides]
  • Zheng Gao, Mohamed AbdelHady, Radhika Arava, Xibin Gao, Qian Hu, Wei Xiao, and Thahir Mohamed. X-SHOT: Learning to Rank Voice Applications via Cross-Locale Shard-based Co-Training. Workshop on Machine Learning in Speech and Language Processing (MLSLP), 2021. [PDF]
  • Zheng Gao, Radhika Arava, Qian Hu, Xibin Gao, Thahir Mohamed, Wei Xiao, Mohamed AbdelHady. Paraphrase Label Alignment for Voice Application Retrieval in Spoken Language Understanding. Interspeech, 2021. [PDF] [Video] [Slides]
  • Xibin Gao, Radhika Arava, Qian Hu, Thahir Mohamed, Wei Xiao, Zheng Gao and Mohamed AbdelHady. Graphire: Novel Intent Discovery with Pretraining on Prior Knowledge using Contrastive Learning. KDD Workshop on Pretraining: Algorithms, Architectures, and Applications, 2021. [PDF]
  • Wei Xiao, Qian Hu, Thahir Mohamed, Zheng Gao, Xibin Gao, Radhika Arava, Mohamed AbdelHady. Two-stage Voice Application Recommender System for Unhandled Utterances in Intelligent Personal Assistant. KDD 2nd International Workshop: Industrial Recommendation Systems, 2021. [PDF]
  • Qian Hu, Thahir Mohamed, Wei Xiao, Zheng Gao, Xibin Gao, Radhika Arava, Xiyao Ma, Mohamed AbdelHady. Collaborative Data Relabeling for Robust and Diverse Voice Apps Recommendation in Intelligent Personal Assistants. EMNLP Third Workshop on NLP for Conversational AI, 2021. [PDF]
  • Zheng Gao, Hongsong Li, Zhuoren Jiang, Xiaozhong Liu. Detecting User Community in Sparse Domain via Cross-Graph Pairwise Learning. ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020. [PDF] [Video] [Slides]
  • Zheng Gao, Lujun Zhao, Heng Huang, Hongsong Li, Changlong Sun, Luo Si, Xiaozhong Liu. Behavior based Dynamic Summarization on Product Aspects via Reinforcement Neighbour Selection. European Conference on Artificial Intelligence (ECAI), 2020. [PDF] [Video] [Slides]
  • Zhuoren Jiang, Zheng Gao, Jinjiong Lan, Hongxia Yang, Yao Lu, Xiaozhong Liu. Task-Oriented Genetic Activation for Large-Scale Complex Heterogeneous Graph Embedding. The Web Conference (WWW), 2020. [PDF]
  • Zheng Gao, Chun Guo, Xiaozhong Liu. Efficient Personalized Community Detection via Genetic Evolution. The Genetic and Evolutionary Computation Conference (GECCO), 2019. [PDF]
  • Zheng Gao, Gang Fu, Chunping Ouyang, Satoshi Tsutsui, Xiaozhong Liu, Jeremy Yang, Christopher Gessner, Brian Foote, David Wild, Ying Ding, Qi Yu. edge2vec: Representation Learning Using Edge Semantics for Biomedical Knowledge Discovery. BMC Bioinformatics, 2019. (impact factor = 2.511). [PDF] [Code]
  • Yongzhen Wang, Xiaozhong Liu, Zheng Gao. Neural Related Work Summarization with a Joint Context-driven Attention Mechanism. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018. [PDF]
  • Zheng Gao, Lin Guo, Chi Ma, Xiao Ma, Kai Sun, Hang Xiang, Xiaoqiang Zhu, Hongsong Li, Xiaozhong Liu. AMAD: Adversarial Multiscale Anomaly Detection on High-Dimensional and Time-Evolving Categorical Data. Deep Learning Practice for High-Dimensional Sparse Data Workshop at ACM SIGKDD Conference on Knowledge Discovery and Data Mining (DLP-KDD), 2019. [PDF] [Video] [Slides]
  • Zizhe Gao, Zheng Gao, Heng Huang, Zhuoren Jiang, Yuliang Yan. An End-to-end Model of Predicting Diverse Ranking On Heterogeneous Feeds. eCOM Workshop at ACM SIGIR Conference on Research and Development in Information Retrieval (eCom-SIGIR), 2018. [PDF]
  • Zhuoren Jiang, Liangcai Gao, Ke Yuan, Zheng Gao, Zhi Tang, Xiaozhong Liu. Mathematics Content Understanding for Cyberlearning via Formula Evolution Map. ACM International Conference on Information and Knowledge Management (CIKM), 2018. [PDF] [Code]
  • Xiaozhong Liu, Xing Yu, Zheng Gao, Tian Xia, Johan Bollen. Comparing Community-based Information Adoption and Diffusion across Different Microblogging Sites. ACM Conference on Hypertext and Social Media, 2016. [PDF]
  • Zheng Gao, Vincent Malic, Shutian Ma, Patrick Shih. How to Make a Successful Movie: Factor Analysis from both Financial and Critical Perspectives. International Conference on Information, 2019. [PDF]
  • Yongzhen Wang, Yan Lin, Zheng Gao, Yan Chen. A Two-stage Iterative Approach to Improve Crowdsourcing-based Relevance Assessment. Arabian Journal for Science and Engineering, 2019. [PDF]
  • Zheng Gao, Xiaozhong Liu. Personalized Community Detection in Scholarly Network. International Conference on Information, 2017. [PDF]
  • Tian Xia, Xing Yu, Zheng Gao, Yijun Gu, Xiaozhong Liu. Internal/External Information Access and Information Diffusion in Social Media. International Conference on Information, 2017. [PDF]
  • Nan Li, Naren Suri, Zheng Gao, Tian Xia, Xiaozhong Liu, Katy Borner. Enter a Job, Get Course Recommendations. International Conference on Information, 2017. [PDF]
  • Zheng Gao, John Wolohan, Fast NLP-based Pattern Matching in Real Time Tweet Recommendation. Text REtrieval Conference (TREC), 2017. [PDF]
  • Chenwei Zhang, Zheng Gao, Xiaozhong Liu. How Others Affect Your Twitter #hashtag Adoption? Examination of Community-based and Context-based Information Diffusion in Twitter. International Conference on Information, 2015. [PDF]
  • Zheng Gao, Rui Bi. University of Pittsburgh at TREC 2014 Microblog Track. Text REtrieval Conference (TREC), 2014. [PDF]
  • Zheng Gao, Patrick C. Shih. Communities of Support: Social Support Discussion in a HIV Online Forum. International Symposium of Chinese CHI, 2019. [PDF] [Video]
  • Satoshi Tsutsui, Zheng Gao, Yuzhuo Wang, Guilin Meng, Ying Ding. A Case Study on Viziometrics: What’s the Role of Western Blots in Alzheimers Disease Literature. International Conference on Information, 2018. [PDF] [Code]


Conference Reviewer:

  • AAAI Conference on Artificial Intelligence (AAAI 2022)
  • IEEE International Conference on Multimedia and Expo (ICME 2022)
  • The Web Conference (WWW 2018, 2019, 2020)
  • ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018, 2022)
  • IEEE International Conference on Big Data (BigData 2020)
  • Joint Conference on Digital Libraries (JCDL 2021, 2022)
  • ACM SIGKDD Conference on Knowledge Discovery and Data Mining Workshops (DLP-KDD 2020,2021; IWKG-KDD 2020)
  • Annual Conference of the North American Chapter of the Association for Computational Linguistics Workshop (SDP-NAACL 2021)
  • International Conference on Information Systems (ICIS 2021)
  • China Conference on Information Retrieval (CCIR 2021))

Journal Reviewer:

  • The Social Science Journal (2022)
  • Journal of Informetrics (JOI 2021)
  • Computers in Industry (2021)
  • Journal of the Association for Information Science and Technology (JASIST 2018, 2019, 2021)
  • PeerJ Computer Science (2020)
  • PLoS ONE (2020, 2021)
  • BMC Bioinformatics (2019, 2020, 2022)
  • Social Network Analysis and Mining (SNAM 2018, 2019, 2020, 2021)
  • Medical Science Monitor (2019)
  • ACM Transactions on Computing for Healthcare (2020)

Administrative Service:

  • Chair of Doctoral Student Association (DSA) at Department of Information and Library Science, Indiana University Bloomington (2016 - 2018)


  • T’ung-li Yuan Memorial Fellowship, Indiana University Bloomington (2015 - 2018)
  • Clayton A. Shepherd Scholarship, Indiana University Bloomington (2018 - 2019)
  • IUB SICE Ph.D. Travel Award (2015 - 2019)
  • NetSci Student Travel Award (2017)
  • IUB GPSG Travel Award (2017)