Saved in:
| Main Authors: | Shi, Derek, Glatt, Ruben, Klymko, Christine, Mohole, Shubham, Choi, Hongjun, Kushwaha, Shashank, Sakla, Sam, da Silva, Felipe Leno |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.02561 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VERIRAG: A Post-Retrieval Auditing of Scientific Study Summaries
by: Mohole, Shubham, et al.
Published: (2025)
by: Mohole, Shubham, et al.
Published: (2025)
SIFOTL: A Principled, Statistically-Informed Fidelity-Optimization Method for Tabular Learning
by: Mohole, Shubham, et al.
Published: (2025)
by: Mohole, Shubham, et al.
Published: (2025)
VeriMinder: Mitigating Analytical Vulnerabilities in NL2SQL
by: Mohole, Shubham, et al.
Published: (2025)
by: Mohole, Shubham, et al.
Published: (2025)
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization
by: Choi, Hongjun, et al.
Published: (2024)
by: Choi, Hongjun, et al.
Published: (2024)
Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
by: Lin, Jiaye, et al.
Published: (2025)
by: Lin, Jiaye, et al.
Published: (2025)
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
by: Lee, Harrison, et al.
Published: (2023)
by: Lee, Harrison, et al.
Published: (2023)
Offline RLAIF: Piloting VLM Feedback for RL via SFO
by: Beck, Jacob
Published: (2025)
by: Beck, Jacob
Published: (2025)
RLAIF-SPA: Structured AI Feedback for Semantic-Prosodic Alignment in Speech Synthesis
by: Yang, Qing, et al.
Published: (2025)
by: Yang, Qing, et al.
Published: (2025)
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
by: Ahn, Daechul, et al.
Published: (2024)
by: Ahn, Daechul, et al.
Published: (2024)
Oracle modalities
by: Swan, Andrew W
Published: (2024)
by: Swan, Andrew W
Published: (2024)
Safe, Efficient, and Robust Reinforcement Learning for Ranking and Diffusion Models
by: Gupta, Shashank
Published: (2025)
by: Gupta, Shashank
Published: (2025)
Learning nuclear cross sections across the chart of nuclides with graph neural networks
by: Choi, Hongjun, et al.
Published: (2024)
by: Choi, Hongjun, et al.
Published: (2024)
Sparse Autoencoders as a Steering Basis for Phase Synchronization in Graph-Based CFD Surrogates
by: Hu, Yeping, et al.
Published: (2026)
by: Hu, Yeping, et al.
Published: (2026)
Improving Robustness In Sparse Autoencoders via Masked Regularization
by: Narayanaswamy, Vivek, et al.
Published: (2026)
by: Narayanaswamy, Vivek, et al.
Published: (2026)
Why Does RLAIF Work At All?
by: Young, Robin
Published: (2026)
by: Young, Robin
Published: (2026)
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
by: Yu, Tianyu, et al.
Published: (2024)
by: Yu, Tianyu, et al.
Published: (2024)
Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF
by: Hengle, Amey, et al.
Published: (2024)
by: Hengle, Amey, et al.
Published: (2024)
A esquerda, o Estado, a economia: considerações em torno à crise socioeconômica hodierna
by: Leno Francisco Danner
Published: (2014)
by: Leno Francisco Danner
Published: (2014)
HABERMAS: da globalização da economia à globalização da política
by: Leno Francisco Danner
Published: (2014)
by: Leno Francisco Danner
Published: (2014)
Political praxis, social analysis and western modernization: a theoretical-political route for critical social theory
by: Leno Francisco Danner
Published: (2020)
by: Leno Francisco Danner
Published: (2020)
Da concomitância entre direitos humanos e direito: sobre a base fundacional da democracia como um sistema público de direito com caráter antifascista
by: Leno Francisco Danner
Published: (2021)
by: Leno Francisco Danner
Published: (2021)
Em busca da terra sem males: violência, migração e resistência em Kaká Werá Jecupé e Eliane Potiguara
by: Leno Francisco Danner
Published: (2019)
by: Leno Francisco Danner
Published: (2019)
ESTEREOTIPO PSICO-SOCIO-CULTURAL DE LA MENOPAUSIA EN MUJERES RURALES.
by: D. Leno González
Published: (2006)
by: D. Leno González
Published: (2006)
ESFERA PÚBLICA E POLÍTICA RADICAL: APONTAMENTOS A PARTIR DE HABERMAS
by: Leno Francisco Danner
Published: (2015)
by: Leno Francisco Danner
Published: (2015)
ÍNDICE DE BARTHEL: ¿ADECUADO PARA PLANIFICAR NECESIDADES AL ALTA HOSPITALARIA?
by: D. Leno González
Published: (2008)
by: D. Leno González
Published: (2008)
Decolonialidade, lugar de fala e voz-práxis estético-literária: reflexões desde a literatura indígena brasileira
by: Leno Francisco Danner
Published: (2020)
by: Leno Francisco Danner
Published: (2020)
Pacificando o branco: uma história da modernidade contada pelos indígenas
by: Leno Francisco Danner
Published: (2022)
by: Leno Francisco Danner
Published: (2022)
Educação, resistência e politização: sobre o sentido da educação na literatura indígena brasileira contemporânea
by: Leno Francisco Danner
Published: (2020)
by: Leno Francisco Danner
Published: (2020)
América latina, o discurso filosófico-sociológico da modernidade, a ce-gueira histórico-sociológica das teorias da modernidade: notas programá-ticas para uma práxis decolonial latino-americana
by: Leno Francisco Danner
Published: (2018)
by: Leno Francisco Danner
Published: (2018)
Um mundo sem mediações: descolonização africana como teoria política da modernização periférica
by: Leno Francisco Danner
Published: (2022)
by: Leno Francisco Danner
Published: (2022)
Um xamã yanomami frente ao discurso filosófico-sociológico da modernidade
by: Leno Francisco Danner
Published: (2018)
by: Leno Francisco Danner
Published: (2018)
Estado, política e evolução social: uma tendência para este século XXI
by: Leno Francisco Danner
Published: (2017)
by: Leno Francisco Danner
Published: (2017)
IMPORTANCIA DE UN DIAGNÓSTICO PRECOZ Y CUIDADOS DE ENFERMERÍA EN DIABETES GESTACIONAL.
by: D. Leno González
Published: (2005)
by: D. Leno González
Published: (2005)
A evolução democrática entre institucionalização e espontaneidade. Pesos e medidas da política democrática contemporânea
by: Leno Francisco Danner
Published: (2015)
by: Leno Francisco Danner
Published: (2015)
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles
by: Tang, Zhiwei, et al.
Published: (2023)
by: Tang, Zhiwei, et al.
Published: (2023)
Voting with the Graph: Stable RLAIF via Topological Consistency Maximization
by: Liu, Boyin, et al.
Published: (2025)
by: Liu, Boyin, et al.
Published: (2025)
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
by: Dutta, Sujan, et al.
Published: (2024)
by: Dutta, Sujan, et al.
Published: (2024)
Effect of ethephon and indolebutyric acid on yellow mombin propagation via cutting
by: Mário Leno Martins Véras
Published: (2017)
by: Mário Leno Martins Véras
Published: (2017)
Mitigation of chilling injury in sweet potato roots subjected to low-temperature conditioning
by: Mário Leno Martins Véras
Published: (2022)
by: Mário Leno Martins Véras
Published: (2022)
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics
by: Hayes, Conor F., et al.
Published: (2025)
by: Hayes, Conor F., et al.
Published: (2025)
Similar Items
-
VERIRAG: A Post-Retrieval Auditing of Scientific Study Summaries
by: Mohole, Shubham, et al.
Published: (2025) -
SIFOTL: A Principled, Statistically-Informed Fidelity-Optimization Method for Tabular Learning
by: Mohole, Shubham, et al.
Published: (2025) -
VeriMinder: Mitigating Analytical Vulnerabilities in NL2SQL
by: Mohole, Shubham, et al.
Published: (2025) -
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization
by: Choi, Hongjun, et al.
Published: (2024) -
Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
by: Lin, Jiaye, et al.
Published: (2025)