Saved in:
| Main Authors: | Sonoda, Ryosuke, Srinivasan, Ramya |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.16640 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Efficient Zero-Shot AI-Generated Image Detection
by: Sonoda, Ryosuke, et al.
Published: (2026)
by: Sonoda, Ryosuke, et al.
Published: (2026)
Fair and Interpretable Deepfake Detection in Videos
by: Yoshii, Akihito, et al.
Published: (2025)
by: Yoshii, Akihito, et al.
Published: (2025)
Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model
by: Wang, Minghan, et al.
Published: (2025)
by: Wang, Minghan, et al.
Published: (2025)
From Words to Proverbs: Evaluating LLMs Linguistic and Cultural Competence in Saudi Dialects with Absher
by: Al-Monef, Renad, et al.
Published: (2025)
by: Al-Monef, Renad, et al.
Published: (2025)
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
by: Liu, Chen Cecilia, et al.
Published: (2023)
by: Liu, Chen Cecilia, et al.
Published: (2023)
MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs
by: Kalhor, Ghazal, et al.
Published: (2026)
by: Kalhor, Ghazal, et al.
Published: (2026)
ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding
by: Azime, Israel Abebe, et al.
Published: (2024)
by: Azime, Israel Abebe, et al.
Published: (2024)
Proverbs or Pythian Oracles? Sentiments and Emotions in Greek Sayings
by: Korre, Katerina, et al.
Published: (2025)
by: Korre, Katerina, et al.
Published: (2025)
Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking
by: Magdy, Samar M., et al.
Published: (2025)
by: Magdy, Samar M., et al.
Published: (2025)
FFE-Hallu:Hallucinations in Fixed Figurative Expressions:Benchmark of Idioms and Proverbs in the Persian Language
by: Hosseini, Faezeh, et al.
Published: (2026)
by: Hosseini, Faezeh, et al.
Published: (2026)
A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
by: Lam, Long Hei Matthew, et al.
Published: (2024)
by: Lam, Long Hei Matthew, et al.
Published: (2024)
The Rarity Blind Spot: A Framework for Evaluating Statistical Reasoning in LLMs
by: Maekawa, Seiji, et al.
Published: (2025)
by: Maekawa, Seiji, et al.
Published: (2025)
Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification
by: Thatikonda, Ramya Keerthy, et al.
Published: (2024)
by: Thatikonda, Ramya Keerthy, et al.
Published: (2024)
Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom's Taxonomy
by: Kumar, Ramya, et al.
Published: (2025)
by: Kumar, Ramya, et al.
Published: (2025)
Self-Distillation as a Performance Recovery Mechanism for LLMs: Counteracting Compression and Catastrophic Forgetting
by: Liu, Chi, et al.
Published: (2026)
by: Liu, Chi, et al.
Published: (2026)
SLIM-LLMs: Modeling of Style-Sensory Language RelationshipsThrough Low-Dimensional Representations
by: Khalid, Osama, et al.
Published: (2025)
by: Khalid, Osama, et al.
Published: (2025)
The Statistical Signature of LLMs
by: Hadad, Ortal, et al.
Published: (2026)
by: Hadad, Ortal, et al.
Published: (2026)
Advancing NLP Security by Leveraging LLMs as Adversarial Engines
by: Srinivasan, Sudarshan, et al.
Published: (2024)
by: Srinivasan, Sudarshan, et al.
Published: (2024)
Exploring Database Normalization Effects on SQL Generation
by: Kohita, Ryosuke
Published: (2025)
by: Kohita, Ryosuke
Published: (2025)
Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
by: Zhang, Xiaoying, et al.
Published: (2024)
by: Zhang, Xiaoying, et al.
Published: (2024)
Compare without Despair: Reliable Preference Evaluation with Generation Separability
by: Ghosh, Sayan, et al.
Published: (2024)
by: Ghosh, Sayan, et al.
Published: (2024)
Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting
by: Srinivasan, Krishna Prasad Varadarajan, et al.
Published: (2024)
by: Srinivasan, Krishna Prasad Varadarajan, et al.
Published: (2024)
JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization
by: Guo, Xiaobo, et al.
Published: (2024)
by: Guo, Xiaobo, et al.
Published: (2024)
Do LLMs Overthink Basic Math Reasoning? Benchmarking the Accuracy-Efficiency Tradeoff in Language Models
by: Srivastava, Gaurav, et al.
Published: (2025)
by: Srivastava, Gaurav, et al.
Published: (2025)
Bayesian Statistical Modeling with Predictors from LLMs
by: Franke, Michael, et al.
Published: (2024)
by: Franke, Michael, et al.
Published: (2024)
TaxoBell: Gaussian Box Embeddings for Self-Supervised Taxonomy Expansion
by: Mishra, Sahil, et al.
Published: (2026)
by: Mishra, Sahil, et al.
Published: (2026)
FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs
by: Munnangi, Monica, et al.
Published: (2024)
by: Munnangi, Monica, et al.
Published: (2024)
Comprehensive Reassessment of Large-Scale Evaluation Outcomes in LLMs: A Multifaceted Statistical Approach
by: Sun, Kun, et al.
Published: (2024)
by: Sun, Kun, et al.
Published: (2024)
Using Large Language Models in Public Transit Systems, San Antonio as a case study
by: Jonnala, Ramya, et al.
Published: (2024)
by: Jonnala, Ramya, et al.
Published: (2024)
SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition
by: Wu, Mengsong, et al.
Published: (2025)
by: Wu, Mengsong, et al.
Published: (2025)
Cascaded Self-Evaluation Augmented Training for Lightweight Multimodal LLMs
by: Lv, Zheqi, et al.
Published: (2025)
by: Lv, Zheqi, et al.
Published: (2025)
DMDTEval: An Evaluation and Analysis of LLMs on Disambiguation in Multi-domain Translation
by: Man, Zhibo, et al.
Published: (2025)
by: Man, Zhibo, et al.
Published: (2025)
Exploring the Potential of LLMs as Personalized Assistants: Dataset, Evaluation, and Analysis
by: Mok, Jisoo, et al.
Published: (2025)
by: Mok, Jisoo, et al.
Published: (2025)
MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs
by: Liu, Zhiwei, et al.
Published: (2025)
by: Liu, Zhiwei, et al.
Published: (2025)
Relationship Detection on Tabular Data Using Statistical Analysis and Large Language Models
by: Koletsis, Panagiotis, et al.
Published: (2025)
by: Koletsis, Panagiotis, et al.
Published: (2025)
Assessing the Sensitivity and Alignment of FOL Closeness Metrics
by: Thatikonda, Ramya Keerthy, et al.
Published: (2025)
by: Thatikonda, Ramya Keerthy, et al.
Published: (2025)
Improving the Distributional Alignment of LLMs using Supervision
by: Kambhatla, Gauri, et al.
Published: (2025)
by: Kambhatla, Gauri, et al.
Published: (2025)
Augmenting Bias Detection in LLMs Using Topological Data Analysis
by: Varadarajan, Keshav, et al.
Published: (2025)
by: Varadarajan, Keshav, et al.
Published: (2025)
Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights
by: Chen, Yi, et al.
Published: (2026)
by: Chen, Yi, et al.
Published: (2026)
When LLMs Benchmark Themselves: Deconstructing Self-Bias in Automated Evaluation
by: Xu, Wenda, et al.
Published: (2025)
by: Xu, Wenda, et al.
Published: (2025)
Similar Items
-
Efficient Zero-Shot AI-Generated Image Detection
by: Sonoda, Ryosuke, et al.
Published: (2026) -
Fair and Interpretable Deepfake Detection in Videos
by: Yoshii, Akihito, et al.
Published: (2025) -
Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model
by: Wang, Minghan, et al.
Published: (2025) -
From Words to Proverbs: Evaluating LLMs Linguistic and Cultural Competence in Saudi Dialects with Absher
by: Al-Monef, Renad, et al.
Published: (2025) -
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
by: Liu, Chen Cecilia, et al.
Published: (2023)