Hayase, J., Liu, A., Choi, Y., Oh, S., & Smith, N. A. (2024). Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?
Chicago Style (17th ed.) Citation: Hayase, Jonathan, Alisa Liu, Yejin Choi, Sewoong Oh, and Noah A. Smith. Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? 2024.
MLA (9th ed.) Citation: Hayase, Jonathan, et al. Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? 2024.