Padula, A. G., & Soemers, D. J. N. J. (2024). Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards.
Chicago Style (17th ed.) CitationPadula, Alexander G., and Dennis J. N. J. Soemers. Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards. 2024.
MLA (9th ed.) CitationPadula, Alexander G., and Dennis J. N. J. Soemers. Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards. 2024.
Warning: These citations may not always be 100% accurate.