:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Dumoulin, Vincent, Johnson, Daniel D., Castro, Pablo Samuel, Larochelle, Hugo, Dauphin, Yann
Format:	Preprint
Veröffentlicht:	2023
Schlagworte:	Machine Learning Artificial Intelligence Computation and Language
Online-Zugang:	https://arxiv.org/abs/2311.14115
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Capturing Individual Human Preferences with Reward Features
von: Barreto, André, et al.
Veröffentlicht: (2025)

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
von: Sokar, Ghada, et al.
Veröffentlicht: (2024)

BRIDGE: Predicting Human Task Completion Time From Model Performance
von: Liu, Fengyuan, et al.
Veröffentlicht: (2026)

An evolutionary perspective on modes of learning in Transformers
von: Ku, Alexander Y., et al.
Veröffentlicht: (2025)

Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants
von: Huang, Vincent, et al.
Veröffentlicht: (2025)

Many-Shot In-Context Learning
von: Agarwal, Rishabh, et al.
Veröffentlicht: (2024)

REFA: Reference Free Alignment for multi-preference optimization
von: Gupta, Taneesh, et al.
Veröffentlicht: (2024)

The Formalism-Implementation Gap in Reinforcement Learning Research
von: Castro, Pablo Samuel
Veröffentlicht: (2025)

In value-based deep reinforcement learning, a pruned network is a good network
von: Obando-Ceron, Johan, et al.
Veröffentlicht: (2024)

Robustmix: Improving Robustness by Regularizing the Frequency Bias of Deep Nets
von: Ngnawe, Jonas, et al.
Veröffentlicht: (2023)

Large language models can accurately predict searcher preferences
von: Thomas, Paul, et al.
Veröffentlicht: (2023)

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators
von: Dubois, Yann, et al.
Veröffentlicht: (2024)

ReviewerToo: Should AI Join The Program Committee? A Look At The Future of Peer Review
von: Sahu, Gaurav, et al.
Veröffentlicht: (2025)

How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?
von: Cui, Yingqian, et al.
Veröffentlicht: (2026)

AfriStereo: A Culturally Grounded Dataset for Evaluating Stereotypical Bias in Large Language Models
von: Beux, Yann Le, et al.
Veröffentlicht: (2025)

Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
von: Liu, Jiacheng, et al.
Veröffentlicht: (2023)

Soft Tokens, Hard Truths
von: Butt, Natasha, et al.
Veröffentlicht: (2025)

Low-rank finetuning for LLMs: A fairness perspective
von: Das, Saswat, et al.
Veröffentlicht: (2024)

Adaptive Test-Time Reasoning via Reward-Guided Dual-Phase Search
von: Cui, Yingqian, et al.
Veröffentlicht: (2025)

Eliciting Language Model Behaviors with Investigator Agents
von: Li, Xiang Lisa, et al.
Veröffentlicht: (2025)

Ethics2vec: aligning automatic agents and human preferences
von: Bontempi, Gianluca
Veröffentlicht: (2025)

Unlearning via Sparse Representations
von: Shah, Vedant, et al.
Veröffentlicht: (2023)

Quantization Dominates Rank Reduction for KV-Cache Compression
von: Salfati, Samuel
Veröffentlicht: (2026)

Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text
von: Albanese, Federico, et al.
Veröffentlicht: (2026)

The Elements of Differentiable Programming
von: Blondel, Mathieu, et al.
Veröffentlicht: (2024)

CALE: Continuous Arcade Learning Environment
von: Farebrother, Jesse, et al.
Veröffentlicht: (2024)

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
von: Sokar, Ghada, et al.
Veröffentlicht: (2025)

A Survey of State Representation Learning for Deep Reinforcement Learning
von: Echchahed, Ayoub, et al.
Veröffentlicht: (2025)

On the consistency of hyper-parameter selection in value-based deep reinforcement learning
von: Obando-Ceron, Johan, et al.
Veröffentlicht: (2024)

Stylometry recognizes human and LLM-generated texts in short samples
von: Przystalski, Karol, et al.
Veröffentlicht: (2025)

Language models are better than humans at next-token prediction
von: Shlegeris, Buck, et al.
Veröffentlicht: (2022)

Rethinking Layer Redundancy: Calibration Matters More Than Search in LLM Depth Pruning
von: Kim, Minkyu, et al.
Veröffentlicht: (2026)

Post-training makes large language models less human-like
von: Binz, Marcel, et al.
Veröffentlicht: (2026)

Language models show human-like content effects on reasoning tasks
von: Dasgupta, Ishita, et al.
Veröffentlicht: (2022)

The language of time: a language model perspective on time-series foundation models
von: Xie, Yi, et al.
Veröffentlicht: (2025)

Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification
von: Faye, Géraud, et al.
Veröffentlicht: (2024)

Leveraging Zero-Shot Prompting for Efficient Language Model Distillation
von: Vöge, Lukas, et al.
Veröffentlicht: (2024)

AgentRxiv: Towards Collaborative Autonomous Research
von: Schmidgall, Samuel, et al.
Veröffentlicht: (2025)

Robustly Improving LLM Fairness in Realistic Settings via Interpretability
von: Karvonen, Adam, et al.
Veröffentlicht: (2025)

The Geometry of Meaning: Perfect Spacetime Representations of Hierarchical Structures
von: Anabalon, Andres, et al.
Veröffentlicht: (2025)