Saved in:
| Main Authors: | Dognin, Pierre, Melnyk, Igor, Mroueh, Youssef, Padhi, Inkit, Rigotti, Mattia, Ross, Jarret, Schiff, Yair, Young, Richard A., Belgodere, Brian |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2012.11696 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
by: Belgodere, Brian, et al.
Published: (2023)
by: Belgodere, Brian, et al.
Published: (2023)
Risk Aware Benchmarking of Large Language Models
by: Nitsure, Apoorva, et al.
Published: (2023)
by: Nitsure, Apoorva, et al.
Published: (2023)
Distributional Preference Alignment of LLMs via Optimal Transport
by: Melnyk, Igor, et al.
Published: (2024)
by: Melnyk, Igor, et al.
Published: (2024)
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance
by: Navratil, Jiri, et al.
Published: (2025)
by: Navratil, Jiri, et al.
Published: (2025)
Revisiting Group Relative Policy Optimization: Insights into On-Policy and Off-Policy Training
by: Mroueh, Youssef, et al.
Published: (2025)
by: Mroueh, Youssef, et al.
Published: (2025)
CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery
by: Mroueh, Youssef, et al.
Published: (2026)
by: Mroueh, Youssef, et al.
Published: (2026)
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking
by: Rioux, Gabriel, et al.
Published: (2024)
by: Rioux, Gabriel, et al.
Published: (2024)
Value Alignment from Unstructured Text
by: Padhi, Inkit, et al.
Published: (2024)
by: Padhi, Inkit, et al.
Published: (2024)
Programming Refusal with Conditional Activation Steering
by: Lee, Bruce W., et al.
Published: (2024)
by: Lee, Bruce W., et al.
Published: (2024)
GP-MoLFormer: A Foundation Model For Molecular Generation
by: Ross, Jerret, et al.
Published: (2024)
by: Ross, Jerret, et al.
Published: (2024)
Reinforcement Learning with Verifiable Rewards: GRPO's Effective Loss, Dynamics, and Success Amplification
by: Mroueh, Youssef
Published: (2025)
by: Mroueh, Youssef
Published: (2025)
Information Theoretic Guarantees For Policy Alignment In Large Language Models
by: Mroueh, Youssef
Published: (2024)
by: Mroueh, Youssef
Published: (2024)
When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails
by: Nagireddy, Manish, et al.
Published: (2024)
by: Nagireddy, Manish, et al.
Published: (2024)
Contextual Moral Value Alignment Through Context-Based Aggregation
by: Dognin, Pierre, et al.
Published: (2024)
by: Dognin, Pierre, et al.
Published: (2024)
Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs
by: Gourabathina, Abinitha, et al.
Published: (2026)
by: Gourabathina, Abinitha, et al.
Published: (2026)
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
by: Kadhe, Swanand Ravindra, et al.
Published: (2024)
by: Kadhe, Swanand Ravindra, et al.
Published: (2024)
Guided Speculative Inference for Efficient Test-Time Alignment of LLMs
by: Geuter, Jonathan, et al.
Published: (2025)
by: Geuter, Jonathan, et al.
Published: (2025)
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
by: Wei, Dennis, et al.
Published: (2024)
by: Wei, Dennis, et al.
Published: (2024)
GIST: Gauge-Invariant Spectral Transformers for Scalable Graph Neural Operators
by: Rigotti, Mattia, et al.
Published: (2026)
by: Rigotti, Mattia, et al.
Published: (2026)
Eliciting Reasoning in Language Models with Cognitive Tools
by: Ebouky, Brown, et al.
Published: (2025)
by: Ebouky, Brown, et al.
Published: (2025)
Library Technology: Bibliography, 1950-68
by: Melnyk, Andrew
Published: (1969)
by: Melnyk, Andrew
Published: (1969)
Hétérocères nouveaux de l'Amérique du Sud
by: Dognin, Paul
Published: (1901)
by: Dognin, Paul
Published: (1901)
Heterocores nouveaux de l'Amerique du Sud
by: Dognin, Paul
Published: (1913)
by: Dognin, Paul
Published: (1913)
Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection
by: Xue, Yihao, et al.
Published: (2025)
by: Xue, Yihao, et al.
Published: (2025)
FuzzWiz -- Fuzzing Framework for Efficient Hardware Coverage
by: Gadde, Deepak Narayan, et al.
Published: (2024)
by: Gadde, Deepak Narayan, et al.
Published: (2024)
Tiburones en la olla --
by: Wrisley, Jarret
Published: (2008)
by: Wrisley, Jarret
Published: (2008)
Evaluation of medication adherence among Lebanese diabetic patients
by: Lara Mroueh
Published: (2018)
by: Lara Mroueh
Published: (2018)
GraphWiz: An Instruction-Following Language Model for Graph Problems
by: Chen, Nuo, et al.
Published: (2024)
by: Chen, Nuo, et al.
Published: (2024)
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
by: Aminian, Gholamali, et al.
Published: (2025)
by: Aminian, Gholamali, et al.
Published: (2025)
Evaluating Assistive Technologies on a Trade Fair: Methodological Overview and Lessons Learned
by: Baumeister, Annalies, et al.
Published: (2024)
by: Baumeister, Annalies, et al.
Published: (2024)
Outline-Guided Object Inpainting with Diffusion Models
by: Pobitzer, Markus, et al.
Published: (2024)
by: Pobitzer, Markus, et al.
Published: (2024)
Assistive Technologies in the Library
by: Mates, Barbara T.
Published: (2011)
by: Mates, Barbara T.
Published: (2011)
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
by: Hou, Yufang, et al.
Published: (2024)
by: Hou, Yufang, et al.
Published: (2024)
Integrative Review of Recruitment Literature: Conceptual Evolution, Technological Developments, and Scope for Future Research
by: Preeti Sharma, et al.
Published: (2026)
by: Preeti Sharma, et al.
Published: (2026)
El Conflicto del Campo. Matrices culturales e identificaciones políticas
by: Sebastián Rigotti
Published: (2014)
by: Sebastián Rigotti
Published: (2014)
Rh Potenziale Moiré come Operatore di Scattering e il confinamento topologico sulla varietà di Klein
by: Rigotti, Alex
Published: (2026)
by: Rigotti, Alex
Published: (2026)
Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
by: Zhang, Zhengxin, et al.
Published: (2024)
by: Zhang, Zhengxin, et al.
Published: (2024)
Best-of-N through the Smoothing Lens: KL Divergence and Regret Analysis
by: Aminian, Gholamali, et al.
Published: (2025)
by: Aminian, Gholamali, et al.
Published: (2025)
Escucha intermedial: auralidad desde una perspectiva retórica
by: Jarret Julián Woodside Woods
Published: (2019)
by: Jarret Julián Woodside Woods
Published: (2019)
Smooth Solutions of the Navier-Stokes Equation
by: Glimm, James, et al.
Published: (2025)
by: Glimm, James, et al.
Published: (2025)
Similar Items
-
Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
by: Belgodere, Brian, et al.
Published: (2023) -
Risk Aware Benchmarking of Large Language Models
by: Nitsure, Apoorva, et al.
Published: (2023) -
Distributional Preference Alignment of LLMs via Optimal Transport
by: Melnyk, Igor, et al.
Published: (2024) -
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance
by: Navratil, Jiri, et al.
Published: (2025) -
Revisiting Group Relative Policy Optimization: Insights into On-Policy and Off-Policy Training
by: Mroueh, Youssef, et al.
Published: (2025)