Curriculum Vitae

General Information

Full Name Zheng Gao
Contact woshigaozheng [at] gmail [dot] com
Research Interests Large Language Models, Natural Language Processing, Graph Mining
Languages English, Chinese

Education

  • 2015 - 2020
    Ph.D. in Information Science
    Indiana University Bloomington, United States
  • 2013 - 2015
    M.S. in Information Science
    University of Pittsburgh, United States
  • 2009 - 2013
    B.M. in Information Management and System
    Shanghai International Studies University, China

Experience

  • 02/2023 - present
    Senior Algorithm Engineer
    Ant Group
    • Trained Ant Group's in-house large language model (LLM) using supervised fine-tuning (SFT) and reinforcement learning (RL).
    • Developed an LLM agent platform serving hundreds of real-world applications for enterprise-wide internal management.
    • Implemented an LLM evaluation platform serving as the default evaluation framework for all Ant Group Artificial General Intelligence (AGI) teams.
  • 06/2020 - 01/2023
    Applied Scientist
    Amazon Alexa AI
    • Led a large-scale voice application ranking project that analyzed customer utterances to enhance Alexa's natural language understanding capabilities.
    • Contributed session-based features to the Alexa core Natural Language Understanding (NLU) pipeline for customer utterance interpretation, including domain and intent classification and slot detection.
  • 06/2019 - 09/2019
    Data Scientist Intern
    Amazon Alexa AI
    • Applied deep language models (e.g., BERT, ELMo) and advanced clustering techniques to extract influential text patterns from user requests, fully automating the interpretation of annotated datasets and replacing manual analysis.
    • Developed an automated pipeline using Spark and Shell scripts to facilitate model training from diverse data sources, including Amazon S3 and Redshift.
  • 02/2018 - 03/2019
    NLP Research Intern
    Alibaba DAMO Academy AI Lab
    • Generated product review summaries from users' consecutive behaviors by leveraging dynamic matrix factorization, deep reinforcement learning, and neural machine translation with attention.
    • Proposed an end-to-end pairwise ranking model with transfer learning techniques to detect communities in sparse knowledge graphs.
    • Detected multilevel anomalies in high-dimensional dynamic use logs via an adversarial autoencoder and attention-based hierarchical representation learning.

Services

  • Conference Reviewer
    • Annual Meeting of the Association for Computational Linguistics (ACL 2024)
    • ACM International Conference on Web Search and Data Mining (WSDM 2023, 2024, 2025)
    • AAAI Conference on Artificial Intelligence (AAAI 2022, 2023, 2024, 2025)
    • The Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)
    • iConference (2023, 2024)
    • International Workshop on Deep Learning Practice for High-Dimensional Sparse Data (DLP-RecSys 2023; DLP-KDD 2020, 2021)
    • Workshop on Information Extraction from Scientific Publications (WIESP-AACL 2022)
    • China Conference on Knowledge Graph and Semantic Computing (CCKS 2022)
    • Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE 2022)
    • IEEE International Conference on Multimedia and Expo (ICME 2022)
    • The Web Conference (WWW 2019, 2020)
    • ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018, 2022)
    • IEEE International Conference on Big Data (BigData 2020, 2022)
    • Joint Conference on Digital Libraries (JCDL 2021, 2022)
    • International Workshop on Knowledge Graph (IWKG-KDD 2020)
    • Workshop on Scholarly Document Processing (SDP-NAACL 2021, SDP-COLING 2022)
    • International Conference on Information Systems (ICIS 2021)
    • China Conference on Information Retrieval (CCIR 2021)
  • Journal Reviewer
    • Data Intelligence (2022)
    • The Social Science Journal (2022)
    • Journal of Informetrics (JOI 2021)
    • Computers in Industry (2021)
    • Journal of the Association for Information Science and Technology (JASIST 2019, 2021)
    • PeerJ Computer Science (2020)
    • PLoS ONE (2020, 2021)
    • BMC Bioinformatics (2019, 2020, 2022)
    • Social Network Analysis and Mining (SNAM 2019, 2020, 2021)
    • Medical Science Monitor (2019)
    • ACM Transactions on Computing for Healthcare (2020)
  • Funding Reviewer
    • Amazon Research Awards (ARA 2022)
  • Administrative Service
    • Chair of the Doctoral Student Association (DSA), Department of Information and Library Science, Indiana University Bloomington (2016 - 2018)

Honors and Awards

  • 2018 - 2019
    • Clayton A. Shepherd Scholarship, Indiana University Bloomington
  • 2015 - 2018
    • T’ung-li Yuan Memorial Fellowship, Indiana University Bloomington