| Main Authors: | Silva, Hugo, Mendes, Mateus, Oliveira, Hugo Gonçalo |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.17312 |
Similar Items
Unsupervised Flow Discovery from Task-oriented Dialogues
by: Ferreira, Patrícia, et al.
Published: (2024)
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
by: Zhu, Lianghui, et al.
Published: (2023)
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
by: Wu, Tianhao, et al.
Published: (2024)
Sabiá-2: A New Generation of Portuguese Large Language Models
by: Almeida, Thales Sales, et al.
Published: (2024)
Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization
by: Jin, Keyan, et al.
Published: (2025)
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
by: de Carvalho, Gonçalo Hora, et al.
Published: (2024)
Span-Level Machine Translation Meta-Evaluation
by: Perrella, Stefano, et al.
Published: (2026)
Can Large Language Models Generate Effective Datasets for Emotion Recognition in Conversations?
by: Kaplan, Burak Can, et al.
Published: (2025)
Continual Learning in Large Language Models: Methods, Challenges, and Opportunities
by: Chen, Hongyang, et al.
Published: (2026)
JudgeLRM: Large Reasoning Models as a Judge
by: Chen, Nuo, et al.
Published: (2025)
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
by: Medjad, Maya, et al.
Published: (2025)
Large Language Model for Patent Concept Generation
by: Ren, Runtao, et al.
Published: (2024)
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking
by: Niu, Tong, et al.
Published: (2024)
Using Large Language Models to Suggest Informative Prior Distributions in Bayesian Statistics
by: Riegler, Michael A., et al.
Published: (2025)
Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
by: Han, Kaiqiao, et al.
Published: (2024)
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach
by: Pernes, Diogo, et al.
Published: (2024)
Emotion Concepts and their Function in a Large Language Model
by: Sofroniew, Nicholas, et al.
Published: (2026)
Identifying Linear Relational Concepts in Large Language Models
by: Chanin, David, et al.
Published: (2023)
Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models
by: Kumar, Shachi H, et al.
Published: (2024)
ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models
by: Li, Haoxuan, et al.
Published: (2025)
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data
by: Bedemariam, Rewina, et al.
Published: (2025)
Evaluation of Large Language Models in Legal Applications: Challenges, Methods, and Future Directions
by: Hu, Yiran, et al.
Published: (2026)
Text-as-Signal: Quantitative Semantic Scoring with Embeddings, Logprobs, and Noise Reduction
by: Moreira, Hugo
Published: (2026)
Automated Concept Discovery for LLM-as-a-Judge Preference Analysis
by: Wedgwood, James, et al.
Published: (2026)
CoLLEGe: Concept Embedding Generation for Large Language Models
by: Teehan, Ryan, et al.
Published: (2024)
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
by: Cao, Yuji, et al.
Published: (2024)
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
by: Wu, Yanan, et al.
Published: (2024)
Legal Evalutions and Challenges of Large Language Models
by: Wang, Jiaqi, et al.
Published: (2024)
Challenges and Responses in the Practice of Large Language Models
by: Zhu, Hongyin
Published: (2024)
Uncovering Implicit Bias in Large Language Models with Concept Learning Dataset
by: Wang, Leroy Z.
Published: (2025)
Meta-Reasoning Improves Tool Use in Large Language Models
by: Alazraki, Lisa, et al.
Published: (2024)
Meta-aware Learning in text-to-SQL Large Language Model
by: Zhang, Wenda
Published: (2025)
Audio-Aware Large Language Models as Judges for Speaking Styles
by: Chiang, Cheng-Han, et al.
Published: (2025)
EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models
by: Mohammadi, Hadi, et al.
Published: (2025)
Cross-model Transferability among Large Language Models on the Platonic Representations of Concepts
by: Huang, Youcheng, et al.
Published: (2025)
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
by: Park, Kiho, et al.
Published: (2024)
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
by: Zhang, Bang, et al.
Published: (2025)
Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge
by: Cantini, Riccardo, et al.
Published: (2025)
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
by: Li, Wenjun, et al.
Published: (2025)
Navigating the Concept Space of Language Models
by: Marcílio-Jr, Wilson E., et al.
Published: (2026)