curriculum vitae
General Information
Full Name | Zheng Gao |
Contact | woshigaozheng [at] gmail [dot] com |
Research Interests | Large Language Model, Natural Langauge Processing, Graph Mining |
Languages | English, Chinese |
Education
- 2015 - 2020
Ph.D. in Information Science
Indiana University Bloomington, United States
- Minor in Computer Science
- Advised by Prof. Xiaozhong Liu
- 2013 - 2015
M.S. in Information Science
University of Pittsburgh, United States
- 2009 - 2013
B.M. in Information Management and System
Shanghai International Studies University, China
Experience
- 02/2023 - now
Senior Algorithm Engineer
Ant Group
- Trained Ant Group self-innovated large language model (LLM) via supervised fine-tuning (SFT) and reinforcement learning (RL) techniques.
- Developed an LLM agent platform serving hundreds of real-world applications for enterprise-wide internal management.
- Implemented an LLM evaluation platform serving as the default evaluation framework for all Ant Group Artificial General Intelligence (AGI) teams.
- 06/2020 - 01/2023
Applied Scientist
Amazon Alexa AI
- Led a large-scale voice application ranking project by analyzing customer utterances to enhance Alexa's natural understanding capabilities.
- Contributed session-based features to Alexa core Natural Language Understanding (NLU) pipeline for customer utterance interpretation, including domain \& intent classification and slot detection.
- 06/2019 - 09/2019
Data Scientist Intern
Amazon Alexa AI
- Applied deep language models (i.e. Bert, ELMo) and advanced clustering techniques to extract influential text patterns from user requests, fully automating the interpretation of annotated datasets and replacing manual analysis.
- Developed an automated pipeline using Spark and Shell scripts to facilitate model training from diverse data sources, including Amazon S3 and Redshift.
- 02/2018 - 03/2019
NLP Research Intern
Alibaba DAMO Academy AI Lab
- Generated product review summary from user consecutive behaviors by leveraging dynamic matrix factorization, deep reinforcement learning and neural machine translation with attention techniques.
- Proposed an end-to-end pairwise ranking model with transfer learning techniques to detect communities in sparse knowledge graphs.
- Detected multilevel anomalies from high dimensional dynamic use logs via adversarial autoencoder and attention-based hierarchical representation learning.
Services
-
Conference Reviewer
- Annual Meeting of the Association for Computational Linguistics (ACL 2024)
- ACM International Conference on Web Search and Data Mining (WSDM 2023,2024,2025)
- AAAI Conference on Artificial Intelligence (AAAI 2022,2023,2024,2025)
- The Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)
- iConference (2023,2024)
- International Workshop on Deep Learning Practice for High-Dimensional Sparse Data (DLP-RecSys 2023; DLP-KDD 2020,2021)
- Workshop on Information Extraction from Scientific Publications (WIESP-AACL 2022)
- China Conference on Knowledge Graph and Semantic Computing (CCKS 2022)
- Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE 2022)
- IEEE International Conference on Multimedia and Expo (ICME 2022)
- The Web Conference (WWW 2019, 2020)
- ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018, 2022)
- IEEE International Conference on Big Data (BigData 2020, 2022)
- Joint Conference on Digital Libraries (JCDL 2021, 2022)
- International Workshop on Knowledge Graph (IWKG-KDD 2020)
- Workshop on Scholarly Document Processing (SDP-NAACL 2021, SDP-COLING 2022)
- International Conference on Information Systems (ICIS 2021)
- China Conference on Information Retrieval (CCIR 2021)
-
Journal Reviewer
- Data Intelligence (2022)
- The Social Science Journal (2022)
- Journal of Informetrics (JOI 2021)
- Computers in Industry (2021)
- Journal of the Association for Information Science and Technology (JASIST 2019, 2021)
- PeerJ Computer Science (2020)
- PLoS ONE (2020, 2021)
- BMC Bioinformatics (2019, 2020, 2022)
- Social Network Analysis and Mining (SNAM 2019, 2020, 2021)
- Medical Science Monitor (2019)
- ACM Transactions on Computing for Healthcare (2020)
-
Funding Reviewer
- Amazon Research Awards (ARA 2022)
-
Administrative Service
- Chair of Doctoral Student Association (DSA) at Department of Information and Library Science, Indiana University Bloomington (2016 - 2018)
Honors and Awards
- 2018 - 2019
- Clayton A. Shepherd Scholarship, Indiana University Bloomington
- 2015 - 2018
- T’ung-li Yuan Memorial Fellowship, Indiana University Bloomington