:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Zhang, Shenyu, Li, Yu, Wu, Rui, Huang, Xiutian, Chen, Yongrui, Xu, Wenhao, Qi, Guilin
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computation and Language
Accesso online:	https://arxiv.org/abs/2403.11509
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation
di: Li, Yu, et al.
Pubblicazione: (2024)

StressEval: Failure-Driven Dynamic Benchmarking for Knowledge-Intensive Reasoning in Large Language Models
di: Chen, Yongrui, et al.
Pubblicazione: (2026)

K-DeCore: Facilitating Knowledge Transfer in Continual Structured Knowledge Reasoning via Knowledge Decoupling
di: Chen, Yongrui, et al.
Pubblicazione: (2025)

Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems
di: Zhang, Yuxin, et al.
Pubblicazione: (2025)

DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping
di: Chen, Yongrui, et al.
Pubblicazione: (2023)

Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data
di: Min, Dehai, et al.
Pubblicazione: (2024)

Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning
di: Chen, Yongrui, et al.
Pubblicazione: (2025)

Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge
di: Chen, Yongrui, et al.
Pubblicazione: (2025)

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs
di: Hu, Nan, et al.
Pubblicazione: (2024)

C$^3$TG: Conflict-aware, Composite, and Collaborative Controlled Text Generation
di: Li, Yu, et al.
Pubblicazione: (2025)

After Retrieval, Before Generation: Enhancing the Trustworthiness of Large Language Models in Retrieval-Augmented Generation
di: Dai, Xinbang, et al.
Pubblicazione: (2025)

TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks
di: Jiang, Dongfu, et al.
Pubblicazione: (2023)

Table-r1: Self-supervised and Reinforcement Learning for Program-based Table Reasoning in Small Language Models
di: Jin, Rihui, et al.
Pubblicazione: (2025)

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
di: Song, Yifan, et al.
Pubblicazione: (2024)

Question Answering Over Spatio-Temporal Knowledge Graph
di: Dai, Xinbang, et al.
Pubblicazione: (2024)

ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization
di: Zhao, Xiutian, et al.
Pubblicazione: (2024)

Measuring the Inconsistency of Large Language Models in Preferential Ranking
di: Zhao, Xiutian, et al.
Pubblicazione: (2024)

An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making
di: Zhao, Xiutian, et al.
Pubblicazione: (2024)

MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing
di: Li, Jiaqi, et al.
Pubblicazione: (2024)

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
di: Jin, Rihui, et al.
Pubblicazione: (2024)

OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
di: Chen, Yongrui, et al.
Pubblicazione: (2025)

A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations
di: Lan, Tian, et al.
Pubblicazione: (2025)

Discovering and Causally Validating Emotion-Sensitive Neurons in Large Audio-Language Models
di: Zhao, Xiutian, et al.
Pubblicazione: (2026)

Embedding Ontologies via Incorporating Extensional and Intensional Knowledge
di: Wang, Keyu, et al.
Pubblicazione: (2024)

PRIMO: Progressive Induction for Multi-hop Open Rule Generation
di: Liu, Jianyu, et al.
Pubblicazione: (2024)

Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
di: Xiong, Weimin, et al.
Pubblicazione: (2024)

SyntaxShap: Syntax-aware Explainability Method for Text Generation
di: Amara, Kenza, et al.
Pubblicazione: (2024)

LM$^2$otifs : An Explainable Framework for Machine-Generated Texts Detection
di: Zheng, Xu, et al.
Pubblicazione: (2025)

StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis
di: Li, Siyuan, et al.
Pubblicazione: (2025)

SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation
di: Xu, Ziyao, et al.
Pubblicazione: (2024)

Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models
di: Zhao, Xiutian, et al.
Pubblicazione: (2026)

UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
di: Min, Dehai, et al.
Pubblicazione: (2024)

Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
di: Ji, Jiazhou, et al.
Pubblicazione: (2024)

Finding Culture-Sensitive Neurons in Vision-Language Models
di: Zhao, Xiutian, et al.
Pubblicazione: (2025)

Decision-Oriented Text Evaluation
di: Huang, Yu-Shiang, et al.
Pubblicazione: (2025)

CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering
di: Wu, Yike, et al.
Pubblicazione: (2024)

Challenges and Opportunities in Text Generation Explainability
di: Amara, Kenza, et al.
Pubblicazione: (2024)

Holistic Evaluation for Interleaved Text-and-Image Generation
di: Liu, Minqian, et al.
Pubblicazione: (2024)

Can Large Language Models Understand DL-Lite Ontologies? An Empirical Study
di: Wang, Keyu, et al.
Pubblicazione: (2024)

SQLStructEval: Structural Evaluation of LLM Text-to-SQL Generation
di: Zhou, Yixi, et al.
Pubblicazione: (2026)