:: Library Catalog

Bibliographic Details
Main Authors:	Kamao, Mina; Ono, Hayato; Yamashita, Ayumu; Amano, Kaoru; Sawayama, Masataka
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2503.13212

Similar Items

BrainCodec: Neural fMRI codec for the decoding of cognitive brain states
by: Nishimura, Yuto, et al.
Published: (2024)

Model Metamers Reveal Invariances in Graph Neural Networks
by: Xu, Wei, et al.
Published: (2025)

Prediction of Alpha Power Using Multiple Subjective Measures and Autonomic Responses
by: Yuting Xu, et al.
Published: (2025)

Stick to your Role! Stability of Personal Values Expressed in Large Language Models
by: Kovač, Grgur, et al.
Published: (2024)

Sampling Method for Generalized Graph Signals with Pre-selected Vertices via DC Optimization
by: Yamashita, Keitaro, et al.
Published: (2025)

Data-dependent Exploration for Online Reinforcement Learning from Human Feedback
by: Zhang, Zhen-Yu, et al.
Published: (2026)

Aligning Generative Speech Enhancement with Perceptual Feedback
by: Li, Haoyang, et al.
Published: (2025)

Scalable predictive processing framework for multitask caregiving robots
by: Idei, Hayato, et al.
Published: (2025)

Pure Exploration with Feedback Graphs
by: Russo, Alessio, et al.
Published: (2025)

Understanding Cross-Model Perceptual Invariances Through Ensemble Metamers
by: Boehm, Lukas, et al.
Published: (2025)

Pure Exploration under Mediators' Feedback
by: Poiani, Riccardo, et al.
Published: (2023)

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
by: Hong, Ilgee, et al.
Published: (2024)

Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback
by: Li, Gen, et al.
Published: (2025)

Adaptive Querying for Reward Learning from Human Feedback
by: Anand, Yashwanthi, et al.
Published: (2024)

Adaptive Scoring and Thresholding with Human Feedback for Robust Out-of-Distribution Detection
by: Yamada, Daisuke, et al.
Published: (2025)

Human-Flow Digital Twin for Predicting the Effects of Mobility Introduction on Visitor Circulation
by: Shima, Chiharu, et al.
Published: (2026)

VarteX: Enhancing Weather Forecast through Distributed Variable Representation
by: Ueyama, Ayumu, et al.
Published: (2024)

Guided Diffusion Sampling for Precipitation Forecast Interventions
by: Ueyama, Ayumu, et al.
Published: (2026)

Capture-Calibrate-Coach: A Graph-Based Framework for Knowledge Monitoring Estimation and Adaptive Feedback
by: Li, Gen, et al.
Published: (2026)

Pure Exploration Beyond Reward Feedback: The Role of Post-Action Context
by: Shahverdikondori, Mohammad, et al.
Published: (2025)

Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback
by: Li, Zitian, et al.
Published: (2026)

Human and AI Perceptual Differences in Image Classification Errors
by: Liu, Minghao, et al.
Published: (2023)

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints
by: Tian, Yunsheng, et al.
Published: (2024)

Reinforcement Learning from Human Feedback
by: Lambert, Nathan
Published: (2025)

Out-of-Distribution Learning with Human Feedback
by: Bai, Haoyue, et al.
Published: (2024)

Prompt Optimization with Human Feedback
by: Lin, Xiaoqiang, et al.
Published: (2024)

Complex System Exploration with Interactive Human Guidance
by: Morel, Bastien, et al.
Published: (2025)

Strategyproof Reinforcement Learning from Human Feedback
by: Buening, Thomas Kleine, et al.
Published: (2025)

Proximal Policy Optimization with Adaptive Exploration
by: Lixandru, Andrei
Published: (2024)

QAdaPrune: Adaptive Parameter Pruning For Training Variational Quantum Circuits
by: Kulshrestha, Ankit, et al.
Published: (2024)

Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing
by: Yang, Yifan, et al.
Published: (2025)

DiBA: Diagonal and Binary Matrix Approximation for Neural Network Weight Compression
by: Ono, Nobutaka
Published: (2026)

Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback
by: Yang, Yongjin, et al.
Published: (2025)

Proximal Point Nash Learning from Human Feedback
by: Tiapkin, Daniil, et al.
Published: (2025)

Off-Policy Evaluation from Logged Human Feedback
by: Bhargava, Aniruddha, et al.
Published: (2024)

Robust Reinforcement Learning from Corrupted Human Feedback
by: Bukharin, Alexander, et al.
Published: (2024)

ARISE: Adaptive Reinforcement Integrated with Swarm Exploration
by: M, Rajiv Chaitanya, et al.
Published: (2026)

Learning Representations for CSI Adaptive Quantization and Feedback
by: Rizzello, Valentina, et al.
Published: (2022)

Reassessing Evaluation Functions in Algorithmic Recourse: An Empirical Study from a Human-Centered Perspective
by: Tominaga, Tomu, et al.
Published: (2024)

On the attainment of the Wasserstein--Cramer--Rao lower bound
by: Nishimori, Hayato, et al.
Published: (2025)