Salvato in:
Dettagli Bibliografici
Autori principali: Chen, Meng, Arthur, Philip, Feng, Qianyu, Hoang, Cong Duy Vu, Hong, Yu-Heng, Moghaddam, Mahdi Kazemi, Nezami, Omid, Nguyen, Thien, Tangari, Gioacchino, Vu, Duy, Vu, Thanh, Johnson, Mark, Kenthapadi, Krishnaram, Dharmasiri, Don, Duong, Long, Li, Yuan-Fang
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2411.00005
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
Sommario:
  • Large language models (LLMs) have shown impressive performance in \emph{code} understanding and generation, making coding tasks a key focus for researchers due to their practical applications and value as a testbed for LLM evaluation. Data synthesis and filtering techniques have been widely adopted and shown to be highly effective in this context. In this paper, we present a focused survey and taxonomy of these techniques, emphasizing recent advancements. We highlight key challenges, explore future research directions, and offer practical guidance for new researchers entering the field.