| Main Authors: | Belgodere, Brian, Dognin, Pierre, Ivankay, Adam, Melnyk, Igor, Mroueh, Youssef, Mojsilovic, Aleksandra, Navratil, Jiri, Nitsure, Apoorva, Padhi, Inkit, Rigotti, Mattia, Ross, Jerret, Schiff, Yair, Vedpathak, Radhika, Young, Richard A. |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2304.10819 |
Similar Items
Risk Aware Benchmarking of Large Language Models
by: Nitsure, Apoorva, et al.
Published: (2023)
Distributional Preference Alignment of LLMs via Optimal Transport
by: Melnyk, Igor, et al.
Published: (2024)
Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 Challenge
by: Dognin, Pierre, et al.
Published: (2020)
Revisiting Group Relative Policy Optimization: Insights into On-Policy and Off-Policy Training
by: Mroueh, Youssef, et al.
Published: (2025)
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking
by: Rioux, Gabriel, et al.
Published: (2024)
GP-MoLFormer: A Foundation Model For Molecular Generation
by: Ross, Jerret, et al.
Published: (2024)
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance
by: Navratil, Jiri, et al.
Published: (2025)
CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery
by: Mroueh, Youssef, et al.
Published: (2026)
Value Alignment from Unstructured Text
by: Padhi, Inkit, et al.
Published: (2024)
Reinforcement Learning with Verifiable Rewards: GRPO's Effective Loss, Dynamics, and Success Amplification
by: Mroueh, Youssef
Published: (2025)
Information Theoretic Guarantees For Policy Alignment In Large Language Models
by: Mroueh, Youssef
Published: (2024)
Programming Refusal with Conditional Activation Steering
by: Lee, Bruce W., et al.
Published: (2024)
When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails
by: Nagireddy, Manish, et al.
Published: (2024)
Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations
by: Achintalwar, Swapnaja, et al.
Published: (2024)
Contextual Moral Value Alignment Through Context-Based Aggregation
by: Dognin, Pierre, et al.
Published: (2024)
Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs
by: Gourabathina, Abinitha, et al.
Published: (2026)
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
by: Kadhe, Swanand Ravindra, et al.
Published: (2024)
The Narasimhan-Seshadri Theorem revisited
by: Nitsure, Nitin
Published: (2025)
Guided Speculative Inference for Efficient Test-Time Alignment of LLMs
by: Geuter, Jonathan, et al.
Published: (2025)
Trade, migration and welfare : the impact of social capital / Maurice Schiff
by: Schiff, Maurice
Published: (1999)
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
by: Wei, Dennis, et al.
Published: (2024)
GIST: Gauge-Invariant Spectral Transformers for Scalable Graph Neural Operators
by: Rigotti, Mattia, et al.
Published: (2026)
Eliciting Reasoning in Language Models with Cognitive Tools
by: Ebouky, Brown, et al.
Published: (2025)
Hétérocères nouveaux de l'Amérique du Sud
by: Dognin, Paul
Published: (1901)
Hétérocères nouveaux de l'Amérique du Sud
by: Dognin, Paul
Published: (1913)
Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection
by: Xue, Yihao, et al.
Published: (2025)
Regional integration and technology diffusion : the case of the North America Free Trade Agreement / Maurice Schiff, Yanling Wang
by: Schiff, Maurice
Published: (2003)
Evaluation of medication adherence among Lebanese diabetic patients
by: Lara Mroueh
Published: (2018)
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
by: Aminian, Gholamali, et al.
Published: (2025)
VHDL-Eval: A Framework for Evaluating Large Language Models in VHDL Code Generation
by: Vijayaraghavan, Prashanth, et al.
Published: (2024)
Optimization and Mechanistic Insights of Zinc Ascorbate Catalyst for Ring‐Opening Polymerization of Caprolactone Using RSM Methodology and DFT Calculations
by: Sonali S. Naik, et al.
Published: (2025)
Outline-Guided Object Inpainting with Diffusion Models
by: Pobitzer, Markus, et al.
Published: (2024)
Nerve function impairment and quality of life in patients with leprosy: a prospective, observational study
by: Apoorva Sharma, et al.
Published: (2024)
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
by: Hou, Yufang, et al.
Published: (2024)
El Conflicto del Campo. Matrices culturales e identificaciones políticas
by: Sebastián Rigotti
Published: (2014)
Rh Potenziale Moiré come Operatore di Scattering e il confinamento topologico sulla varietà di Klein
by: Rigotti, Alex
Published: (2026)
SYMDIREC: A Neuro-Symbolic Divide-Retrieve-Conquer Framework for Enhanced RTL Synthesis and Summarization
by: Vijayaraghavan, Prashanth, et al.
Published: (2026)
Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
by: Zhang, Zhengxin, et al.
Published: (2024)
Best-of-N through the Smoothing Lens: KL Divergence and Regret Analysis
by: Aminian, Gholamali, et al.
Published: (2025)
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
by: Huang, Yue, et al.
Published: (2025)