| Main Authors: | Song, Xiaoying, Anik, Anirban Saha, Blanco, Eduardo, Frias-Martinez, Vanessa, Hong, Lingzi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.01053 |
Similar Items
Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025)
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)
A Hybrid Framework for Subject Analysis: Integrating Embedding-Based Regression Models with Large Language Models
by: Liu, Jinyu, et al.
Published: (2025)
Outcome-Constrained Large Language Models for Countering Hate Speech
by: Hong, Lingzi, et al.
Published: (2024)
ClaimIQ at CheckThat! 2025: Comparing Prompted and Fine-Tuned Language Models for Verifying Numerical Claims
by: Anik, Anirban Saha, et al.
Published: (2025)
Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024)
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025)
Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
by: Su, Haoxiang, et al.
Published: (2026)
Improving the Fairness of Deep-Learning, Short-term Crime Prediction with Under-reporting-aware Models
by: Wu, Jiahui, et al.
Published: (2024)
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths
by: Nair, Inderjeet, et al.
Published: (2025)
Psittacines of Innovation? Assessing the True Novelty of AI Creations
by: Mukherjee, Anirban
Published: (2024)
MoCoRP: Modeling Consistent Relations between Persona and Response for Persona-based Dialogue
by: Lee, Kyungro, et al.
Published: (2025)
Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs
by: Saha, Swayamjit
Published: (2025)
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference
by: Zeng, Ziqian, et al.
Published: (2023)
Consistency of Responses and Continuations Generated by Large Language Models on Social Media
by: Xu, Wentao, et al.
Published: (2025)
Detoxification of Large Language Models through Output-layer Fusion with a Calibration Model
by: Tian, Yuanhe, et al.
Published: (2025)
Evaluating Large Language Models in Crisis Detection: A Real-World Benchmark from Psychological Support Hotlines
by: Deng, Guifeng, et al.
Published: (2025)
Psychological Assessments with Large Language Models: A Privacy-Focused and Cost-Effective Approach
by: Blanco-Cuaresma, Sergi
Published: (2024)
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions
by: Vaugrante, Laurène, et al.
Published: (2024)
RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models
by: Zhang, Fujun, et al.
Published: (2025)
Multi-Modal Sentiment Analysis with Dynamic Attention Fusion
by: Abdulhalim, Sadia, et al.
Published: (2025)
Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks
by: Song, Mooho, et al.
Published: (2025)
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)
CLLMs: Consistency Large Language Models
by: Kou, Siqi, et al.
Published: (2024)
LLM Essay Scoring Under Holistic and Analytic Rubrics: Prompt Effects and Bias
by: Kucia, Filip J., et al.
Published: (2026)
Error Taxonomy-Guided Prompt Optimization
by: Singh, Mayank, et al.
Published: (2026)
PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation
by: Divekar, Abhishek, et al.
Published: (2026)
Self-Consistency Boosts Calibration for Math Reasoning
by: Wang, Ante, et al.
Published: (2024)
An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant
by: Tomar, Mohit, et al.
Published: (2024)
Post-Training Language Models for Crosslingual Consistency
by: Liu, Tianyu, et al.
Published: (2026)
Calibrating Reasoning in Language Models with Internal Consistency
by: Xie, Zhihui, et al.
Published: (2024)
Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese
by: Xu, Yunqi, et al.
Published: (2024)
STED and Consistency Scoring: A Framework for Evaluating LLM Structured Output Reliability
by: Wang, Guanghui, et al.
Published: (2025)
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
by: Li, Yiwei, et al.
Published: (2025)
HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models
by: Jiang, Xinyan, et al.
Published: (2025)
Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models
by: McDaniel, Emma, et al.
Published: (2024)
Cross-Modal Consistency in Multimodal Large Language Models
by: Zhang, Xiang, et al.
Published: (2024)
Evaluating Consistency and Reasoning Capabilities of Large Language Models
by: Saxena, Yash, et al.
Published: (2024)
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
by: Zhao, Sihang, et al.
Published: (2024)