publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2023

  1. ACL
    Why Does Zero-shot Cross-lingual Generation Fail? An Explaination and A Solution
    Tianjian Li, and Kenton Murray
    In Proceedings of the 2023 Annual Meeting of the Association for Computational Linguistics (ACL Findings), Jul 2023

2024

  1. preprint
    Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
    Tianjian Li, Haoran Xu, Weiting Tan, and 2 more authors
    Jul 2024
  2. preprint
    Benchmarking Language Model Creativity: A Case Study on Code Generation
    Yining Lu, Dixuan Wang, Tianjian Li, and 2 more authors
    Jul 2024
  3. preprint
    Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data
    Jingyu Zhang, Marc Marone, Tianjian Li, and 2 more authors
    Jul 2024
  4. ICLR
    Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
    Tianjian Li, Haoran Xu, Philipp Koehn, and 2 more authors
    In The Twelfth International Conference on Learning Representations (ICLR)
    (Spotlight - Top 5%), Jul 2024