:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Chytas, Sotirios Panagiotis, Singh, Vikas
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computation and Language Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2601.11575
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

FoGE: Fock Space inspired encoding for graph prompting
di: Chytas, Sotirios Panagiotis, et al.
Pubblicazione: (2025)

ReCo: Reminder Composition Mitigates Hallucinations in Vision-Language Models
di: Chytas, Sotirios Panagiotis, et al.
Pubblicazione: (2025)

Pooling Image Datasets With Multiple Covariate Shift and Imbalance
di: Chytas, Sotirios Panagiotis, et al.
Pubblicazione: (2024)

Evaluating the Efficacy of AI Techniques in Textual Anonymization: A Comparative Study
di: Asimopoulos, Dimitris, et al.
Pubblicazione: (2024)

Benchmarking Concept-Spilling Across Languages in LLMs
di: Badanin, Ilia, et al.
Pubblicazione: (2026)

Error Taxonomy-Guided Prompt Optimization
di: Singh, Mayank, et al.
Pubblicazione: (2026)

Systematic Evaluation of Long-Context LLMs on Financial Concepts
di: Gupta, Lavanya, et al.
Pubblicazione: (2024)

Concept-Based Interpretability for Toxicity Detection
di: Garg, Samarth, et al.
Pubblicazione: (2025)

Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection
di: Şenol, Ali, et al.
Pubblicazione: (2025)

CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
di: Xue, Yuyang, et al.
Pubblicazione: (2025)

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
di: Zhang, Zhen, et al.
Pubblicazione: (2025)

Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
di: Zhao, Haiyan, et al.
Pubblicazione: (2024)

Funny or Persuasive, but Not Both: Evaluating Fine-Grained Multi-Concept Control in LLMs
di: Labroo, Arya, et al.
Pubblicazione: (2026)

Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval
di: Nguyen, Hai-Long, et al.
Pubblicazione: (2024)

NEAT: Concept driven Neuron Attribution in LLMs
di: Kavuri, Vivek Hruday, et al.
Pubblicazione: (2025)

Can LLMs Capture Human Preferences?
di: Goli, Ali, et al.
Pubblicazione: (2023)

Are Today's LLMs Ready to Explain Well-Being Concepts?
di: Jiang, Bohan, et al.
Pubblicazione: (2025)

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
di: Gao, Lang, et al.
Pubblicazione: (2025)

LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals
di: Toker, Gilat, et al.
Pubblicazione: (2026)

The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans
di: Chlapanis, Odysseas S., et al.
Pubblicazione: (2026)

From Concrete to Abstract: A Multimodal Generative Approach to Abstract Concept Learning
di: Xie, Haodong, et al.
Pubblicazione: (2024)

SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window
di: Raunak, Vikas, et al.
Pubblicazione: (2023)

Benchmarking Advanced Text Anonymisation Methods: A Comparative Study on Novel and Traditional Approaches
di: Asimopoulos, Dimitris, et al.
Pubblicazione: (2024)

Solve the Loop: Attractor Models for Language and Reasoning
di: Fein-Ashley, Jacob, et al.
Pubblicazione: (2026)

PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLMs
di: Yadav, Ankit, et al.
Pubblicazione: (2024)

Position: Avoid Overstretching LLMs for every Enterprise Task
di: Singh, Kuldeep, et al.
Pubblicazione: (2026)

Prompting with Phonemes: Enhancing LLMs' Multilinguality for Non-Latin Script Languages
di: Nguyen, Hoang H, et al.
Pubblicazione: (2024)

On Instruction-Finetuning Neural Machine Translation Models
di: Raunak, Vikas, et al.
Pubblicazione: (2024)

A Toolbox, Not a Hammer -- Multi-TAG: Scaling Math Reasoning with Multi-Tool Aggregation
di: Yao, Bohan, et al.
Pubblicazione: (2025)

Reasoning about concepts with LLMs: Inconsistencies abound
di: Uceda-Sosa, Rosario, et al.
Pubblicazione: (2024)

Hallucination as Trajectory Commitment: Causal Evidence for Asymmetric Attractor Dynamics in Transformer Generation
di: Akarlar, G. Aytug
Pubblicazione: (2026)

Towards Reliable Evaluation of Behavior Steering Interventions in LLMs
di: Pres, Itamar, et al.
Pubblicazione: (2024)

Interpreting the Effects of Quantization on LLMs
di: Singh, Manpreet, et al.
Pubblicazione: (2025)

Estimation of Concept Explanations Should be Uncertainty Aware
di: Piratla, Vihari, et al.
Pubblicazione: (2023)

EtiCor++: Towards Understanding Etiquettical Bias in LLMs
di: Dwivedi, Ashutosh, et al.
Pubblicazione: (2025)

XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs
di: Chen, Zichen, et al.
Pubblicazione: (2023)

Addressing Bias in LLMs: Strategies and Application to Fair AI-based Recruitment
di: Peña, Alejandro, et al.
Pubblicazione: (2025)

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
di: Thakur, Aman Singh, et al.
Pubblicazione: (2024)

Benchmarking LLMs for Pairwise Causal Discovery in Biomedical and Multi-Domain Contexts
di: Anuyah, Sydney, et al.
Pubblicazione: (2026)

Futureproof Static Memory Planning
di: Lamprakos, Christos, et al.
Pubblicazione: (2025)