Saved in:
| Main Authors: | Resck, Lucas E., Raimundo, Marcos M., Poco, Jorge |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03098 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Empirical analysis of binding precedent efficiency in Brazilian Supreme Court via case classification
by: Tinarrage, Raphaël, et al.
Published: (2024)
by: Tinarrage, Raphaël, et al.
Published: (2024)
Emissions and Performance Trade-off Between Small and Large Language Models
by: Garg, Anandita, et al.
Published: (2025)
by: Garg, Anandita, et al.
Published: (2025)
Exploring Accuracy-Fairness Trade-off in Large Language Models
by: Zhang, Qingquan, et al.
Published: (2024)
by: Zhang, Qingquan, et al.
Published: (2024)
Exploring the Plausibility of Hate and Counter Speech Detectors with Explainable AI
by: Böck, Adrian Jaques, et al.
Published: (2024)
by: Böck, Adrian Jaques, et al.
Published: (2024)
Downstream Trade-offs of a Family of Text Watermarks
by: Ajith, Anirudh, et al.
Published: (2023)
by: Ajith, Anirudh, et al.
Published: (2023)
Distill n' Explain: explaining graph neural networks using simple surrogates
by: Pereira, Tamara, et al.
Published: (2023)
by: Pereira, Tamara, et al.
Published: (2023)
Text to Trust: Evaluating Fine-Tuning and LoRA Trade-offs in Language Models for Unfair Terms of Service Detection
by: Juttu, Noshitha Padma Pratyusha, et al.
Published: (2025)
by: Juttu, Noshitha Padma Pratyusha, et al.
Published: (2025)
Understanding the Quality-Diversity Trade-off in Diffusion Language Models
by: Buzzard, Zak
Published: (2025)
by: Buzzard, Zak
Published: (2025)
Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility
by: Palta, Shramay, et al.
Published: (2025)
by: Palta, Shramay, et al.
Published: (2025)
Reasoning Distillation and Structural Alignment for Improved Code Generation
by: Jalilifard, Amir, et al.
Published: (2025)
by: Jalilifard, Amir, et al.
Published: (2025)
Safeguarding Large Language Models in Real-time with Tunable Safety-Performance Trade-offs
by: Fonseca, Joao, et al.
Published: (2025)
by: Fonseca, Joao, et al.
Published: (2025)
Large Language Models in the Task of Automatic Validation of Text Classifier Predictions
by: Tsymbalov, Aleksandr, et al.
Published: (2025)
by: Tsymbalov, Aleksandr, et al.
Published: (2025)
How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness
by: Rathore, Darshita, et al.
Published: (2025)
by: Rathore, Darshita, et al.
Published: (2025)
Counterfactual Training: Teaching Models Plausible and Actionable Explanations
by: Altmeyer, Patrick, et al.
Published: (2026)
by: Altmeyer, Patrick, et al.
Published: (2026)
Fundamental Safety-Capability Trade-offs in Fine-tuning Large Language Models
by: Chen, Pin-Yu, et al.
Published: (2025)
by: Chen, Pin-Yu, et al.
Published: (2025)
Reframing Data Value for Large Language Models Through the Lens of Plausibility
by: Rammal, Mohamad Rida, et al.
Published: (2024)
by: Rammal, Mohamad Rida, et al.
Published: (2024)
Exploring Explanations Improves the Robustness of In-Context Learning
by: Honda, Ukyo, et al.
Published: (2025)
by: Honda, Ukyo, et al.
Published: (2025)
Navigating the Alignment-Calibration Trade-off: A Pareto-Superior Frontier via Model Merging
by: Hu, Tiancheng, et al.
Published: (2025)
by: Hu, Tiancheng, et al.
Published: (2025)
Structural Rationale Distillation via Reasoning Space Compression
by: Yang, Jialin, et al.
Published: (2026)
by: Yang, Jialin, et al.
Published: (2026)
Debiasing Text Safety Classifiers through a Fairness-Aware Ensemble
by: Sturman, Olivia, et al.
Published: (2024)
by: Sturman, Olivia, et al.
Published: (2024)
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts
by: Zhang, Yifan, et al.
Published: (2024)
by: Zhang, Yifan, et al.
Published: (2024)
Towards LLM-guided Causal Explainability for Black-box Text Classifiers
by: Bhattacharjee, Amrita, et al.
Published: (2023)
by: Bhattacharjee, Amrita, et al.
Published: (2023)
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
by: Nirmal, Ayushi, et al.
Published: (2024)
by: Nirmal, Ayushi, et al.
Published: (2024)
Forest vs Tree: The $(N, K)$ Trade-off in Reproducible ML Evaluation
by: Pandita, Deepak, et al.
Published: (2025)
by: Pandita, Deepak, et al.
Published: (2025)
Probabilistically Plausible Counterfactual Explanations with Normalizing Flows
by: Wielopolski, Patryk, et al.
Published: (2024)
by: Wielopolski, Patryk, et al.
Published: (2024)
Contrast-CAT: Contrasting Activations for Enhanced Interpretability in Transformer-based Text Classifiers
by: Han, Sungmin, et al.
Published: (2025)
by: Han, Sungmin, et al.
Published: (2025)
Rule2Text: Natural Language Explanation of Logical Rules in Knowledge Graphs
by: Shirvani-Mahdavi, Nasim, et al.
Published: (2025)
by: Shirvani-Mahdavi, Nasim, et al.
Published: (2025)
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
by: Xu, Tianyang, et al.
Published: (2024)
by: Xu, Tianyang, et al.
Published: (2024)
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction
by: Lee, Yooseop, et al.
Published: (2025)
by: Lee, Yooseop, et al.
Published: (2025)
Empirical Characterization of Rationale Stability Under Controlled Perturbations for Explainable Pattern Recognition
by: Sakib, Abu Noman Md, et al.
Published: (2026)
by: Sakib, Abu Noman Md, et al.
Published: (2026)
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models
by: Ventura, Mor, et al.
Published: (2023)
by: Ventura, Mor, et al.
Published: (2023)
Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)
by: Lee, Jaehyeok, et al.
Published: (2024)
CELL your Model: Contrastive Explanations for Large Language Models
by: Luss, Ronny, et al.
Published: (2024)
by: Luss, Ronny, et al.
Published: (2024)
Binary Classifier Optimization for Large Language Model Alignment
by: Jung, Seungjae, et al.
Published: (2024)
by: Jung, Seungjae, et al.
Published: (2024)
Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance
by: Goldman, Omer, et al.
Published: (2024)
by: Goldman, Omer, et al.
Published: (2024)
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning
by: Chen, Yanda, et al.
Published: (2024)
by: Chen, Yanda, et al.
Published: (2024)
Towards Unified Alignment Between Agents, Humans, and Environment
by: Yang, Zonghan, et al.
Published: (2024)
by: Yang, Zonghan, et al.
Published: (2024)
FaithLM: Towards Faithful Explanations for Large Language Models
by: Chuang, Yu-Neng, et al.
Published: (2024)
by: Chuang, Yu-Neng, et al.
Published: (2024)
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations
by: Matton, Katie, et al.
Published: (2025)
by: Matton, Katie, et al.
Published: (2025)
Time and Memory Trade-off of KV-Cache Compression in Tensor Transformer Decoding
by: Chen, Yifang, et al.
Published: (2025)
by: Chen, Yifang, et al.
Published: (2025)
Similar Items
-
Empirical analysis of binding precedent efficiency in Brazilian Supreme Court via case classification
by: Tinarrage, Raphaël, et al.
Published: (2024) -
Emissions and Performance Trade-off Between Small and Large Language Models
by: Garg, Anandita, et al.
Published: (2025) -
Exploring Accuracy-Fairness Trade-off in Large Language Models
by: Zhang, Qingquan, et al.
Published: (2024) -
Exploring the Plausibility of Hate and Counter Speech Detectors with Explainable AI
by: Böck, Adrian Jaques, et al.
Published: (2024) -
Downstream Trade-offs of a Family of Text Watermarks
by: Ajith, Anirudh, et al.
Published: (2023)