Vanlioglu, A. (2025). Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning.
Chicago Style (17th ed.) CitationVanlioglu, Abdullah. Entropy-guided Sequence Weighting for Efficient Exploration in RL-based LLM Fine-tuning. 2025.
MLA (9th ed.) CitationVanlioglu, Abdullah. Entropy-guided Sequence Weighting for Efficient Exploration in RL-based LLM Fine-tuning. 2025.
Warning: These citations may not always be 100% accurate.