:: Library Catalog

Bibliographic Details
Main Authors:	Sonoda, Ryosuke; Srinivasan, Ramya
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.16640

Similar Items

Efficient Zero-Shot AI-Generated Image Detection
by: Sonoda, Ryosuke, et al.
Published: (2026)

Fair and Interpretable Deepfake Detection in Videos
by: Yoshii, Akihito, et al.
Published: (2025)

Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model
by: Wang, Minghan, et al.
Published: (2025)

From Words to Proverbs: Evaluating LLMs Linguistic and Cultural Competence in Saudi Dialects with Absher
by: Al-Monef, Renad, et al.
Published: (2025)

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
by: Liu, Chen Cecilia, et al.
Published: (2023)

MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs
by: Kalhor, Ghazal, et al.
Published: (2026)

ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding
by: Azime, Israel Abebe, et al.
Published: (2024)

Proverbs or Pythian Oracles? Sentiments and Emotions in Greek Sayings
by: Korre, Katerina, et al.
Published: (2025)

Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking
by: Magdy, Samar M., et al.
Published: (2025)

FFE-Hallu: Hallucinations in Fixed Figurative Expressions: Benchmark of Idioms and Proverbs in the Persian Language
by: Hosseini, Faezeh, et al.
Published: (2026)

A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
by: Lam, Long Hei Matthew, et al.
Published: (2024)

The Rarity Blind Spot: A Framework for Evaluating Statistical Reasoning in LLMs
by: Maekawa, Seiji, et al.
Published: (2025)

Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification
by: Thatikonda, Ramya Keerthy, et al.
Published: (2024)

Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom's Taxonomy
by: Kumar, Ramya, et al.
Published: (2025)

Self-Distillation as a Performance Recovery Mechanism for LLMs: Counteracting Compression and Catastrophic Forgetting
by: Liu, Chi, et al.
Published: (2026)

SLIM-LLMs: Modeling of Style-Sensory Language Relationships Through Low-Dimensional Representations
by: Khalid, Osama, et al.
Published: (2025)

The Statistical Signature of LLMs
by: Hadad, Ortal, et al.
Published: (2026)

Advancing NLP Security by Leveraging LLMs as Adversarial Engines
by: Srinivasan, Sudarshan, et al.
Published: (2024)

Exploring Database Normalization Effects on SQL Generation
by: Kohita, Ryosuke
Published: (2025)

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
by: Zhang, Xiaoying, et al.
Published: (2024)

Compare without Despair: Reliable Preference Evaluation with Generation Separability
by: Ghosh, Sayan, et al.
Published: (2024)

Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting
by: Srinivasan, Krishna Prasad Varadarajan, et al.
Published: (2024)

JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization
by: Guo, Xiaobo, et al.
Published: (2024)

Do LLMs Overthink Basic Math Reasoning? Benchmarking the Accuracy-Efficiency Tradeoff in Language Models
by: Srivastava, Gaurav, et al.
Published: (2025)

Bayesian Statistical Modeling with Predictors from LLMs
by: Franke, Michael, et al.
Published: (2024)

TaxoBell: Gaussian Box Embeddings for Self-Supervised Taxonomy Expansion
by: Mishra, Sahil, et al.
Published: (2026)

FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs
by: Munnangi, Monica, et al.
Published: (2024)

Comprehensive Reassessment of Large-Scale Evaluation Outcomes in LLMs: A Multifaceted Statistical Approach
by: Sun, Kun, et al.
Published: (2024)

Using Large Language Models in Public Transit Systems, San Antonio as a case study
by: Jonnala, Ramya, et al.
Published: (2024)

SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition
by: Wu, Mengsong, et al.
Published: (2025)

Cascaded Self-Evaluation Augmented Training for Lightweight Multimodal LLMs
by: Lv, Zheqi, et al.
Published: (2025)

DMDTEval: An Evaluation and Analysis of LLMs on Disambiguation in Multi-domain Translation
by: Man, Zhibo, et al.
Published: (2025)

Exploring the Potential of LLMs as Personalized Assistants: Dataset, Evaluation, and Analysis
by: Mok, Jisoo, et al.
Published: (2025)

MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs
by: Liu, Zhiwei, et al.
Published: (2025)

Relationship Detection on Tabular Data Using Statistical Analysis and Large Language Models
by: Koletsis, Panagiotis, et al.
Published: (2025)

Assessing the Sensitivity and Alignment of FOL Closeness Metrics
by: Thatikonda, Ramya Keerthy, et al.
Published: (2025)

Improving the Distributional Alignment of LLMs using Supervision
by: Kambhatla, Gauri, et al.
Published: (2025)

Augmenting Bias Detection in LLMs Using Topological Data Analysis
by: Varadarajan, Keshav, et al.
Published: (2025)

Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights
by: Chen, Yi, et al.
Published: (2026)

When LLMs Benchmark Themselves: Deconstructing Self-Bias in Automated Evaluation
by: Xu, Wenda, et al.
Published: (2025)