Guardado en:
| Autores principales: | Hassell, Jackson, Zhang, Dan, Kim, Hannah, Mitchell, Tom, Hruschka, Estevam |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2510.19897 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
por: Oketunji, Abiodun Finbarrs
Publicado: (2023)
por: Oketunji, Abiodun Finbarrs
Publicado: (2023)
Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
por: Cho, Hanjun, et al.
Publicado: (2026)
por: Cho, Hanjun, et al.
Publicado: (2026)
Towards Probabilistic Question Answering Over Tabular Data
por: Shen, Chen, et al.
Publicado: (2025)
por: Shen, Chen, et al.
Publicado: (2025)
Uncovering Biases with Reflective Large Language Models
por: Chang, Edward Y.
Publicado: (2024)
por: Chang, Edward Y.
Publicado: (2024)
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
por: Peters, Sydney, et al.
Publicado: (2025)
por: Peters, Sydney, et al.
Publicado: (2025)
Associative Recurrent Memory Transformer
por: Rodkin, Ivan, et al.
Publicado: (2024)
por: Rodkin, Ivan, et al.
Publicado: (2024)
A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs
por: Park, Chiwan, et al.
Publicado: (2025)
por: Park, Chiwan, et al.
Publicado: (2025)
SPRInG: Continual LLM Personalization via Selective Parametric Adaptation and Retrieval-Interpolated Generation
por: Kim, Seoyeon, et al.
Publicado: (2026)
por: Kim, Seoyeon, et al.
Publicado: (2026)
Self-Supervised Position Debiasing for Large Language Models
por: Liu, Zhongkun, et al.
Publicado: (2024)
por: Liu, Zhongkun, et al.
Publicado: (2024)
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
por: Lyu, Bohan, et al.
Publicado: (2024)
por: Lyu, Bohan, et al.
Publicado: (2024)
Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation
por: A, Snegha, et al.
Publicado: (2025)
por: A, Snegha, et al.
Publicado: (2025)
MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification
por: Sirbu, Iustin, et al.
Publicado: (2025)
por: Sirbu, Iustin, et al.
Publicado: (2025)
Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models
por: Kurtic, Eldar, et al.
Publicado: (2024)
por: Kurtic, Eldar, et al.
Publicado: (2024)
Graph Memory Transformer (GMT)
por: Zanarini, Nicola, et al.
Publicado: (2026)
por: Zanarini, Nicola, et al.
Publicado: (2026)
AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks
por: Wang, Fali, et al.
Publicado: (2025)
por: Wang, Fali, et al.
Publicado: (2025)
Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
por: Salehmohamed, Shoaib Sadiq, et al.
Publicado: (2026)
por: Salehmohamed, Shoaib Sadiq, et al.
Publicado: (2026)
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)
ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
por: Kadu, Ankush, et al.
Publicado: (2025)
por: Kadu, Ankush, et al.
Publicado: (2025)
Large Language Model (LLM) Bias Index -- LLMBI
por: Oketunji, Abiodun Finbarrs, et al.
Publicado: (2023)
por: Oketunji, Abiodun Finbarrs, et al.
Publicado: (2023)
Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource Languages
por: Bajpai, Ashutosh, et al.
Publicado: (2024)
por: Bajpai, Ashutosh, et al.
Publicado: (2024)
Uncovering Latent Human Wellbeing in Language Model Embeddings
por: Freire, Pedro, et al.
Publicado: (2024)
por: Freire, Pedro, et al.
Publicado: (2024)
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
por: Gadzhiev, Artem, et al.
Publicado: (2026)
por: Gadzhiev, Artem, et al.
Publicado: (2026)
Key-Value Means: Transformers with Expandable Block-Recurrent Compressed Memory
por: Goldstein, Daniel, et al.
Publicado: (2026)
por: Goldstein, Daniel, et al.
Publicado: (2026)
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
por: Saji, Alan, et al.
Publicado: (2025)
por: Saji, Alan, et al.
Publicado: (2025)
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
por: Quevedo, Ernesto, et al.
Publicado: (2024)
por: Quevedo, Ernesto, et al.
Publicado: (2024)
ProMedTS: A Self-Supervised, Prompt-Guided Multimodal Approach for Integrating Medical Text and Time Series
por: Niu, Shuai, et al.
Publicado: (2025)
por: Niu, Shuai, et al.
Publicado: (2025)
Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents
por: Kim, San, et al.
Publicado: (2024)
por: Kim, San, et al.
Publicado: (2024)
Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions
por: Yun, Taedong, et al.
Publicado: (2025)
por: Yun, Taedong, et al.
Publicado: (2025)
Skill Availability and Presentation Granularity in Large-Language-Model Agents: A Controlled SkillsBench Study
por: Xu, Xiaonan, et al.
Publicado: (2026)
por: Xu, Xiaonan, et al.
Publicado: (2026)
$\text{Memory}^3$: Language Modeling with Explicit Memory
por: Yang, Hongkang, et al.
Publicado: (2024)
por: Yang, Hongkang, et al.
Publicado: (2024)
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity
por: Pan, Xinghan
Publicado: (2025)
por: Pan, Xinghan
Publicado: (2025)
In-Context Fixation: When Demonstrated Labels Override Semantics in Few-Shot Classification
por: Liu, Ming
Publicado: (2026)
por: Liu, Ming
Publicado: (2026)
Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
por: Yıldırım, Alper, et al.
Publicado: (2025)
por: Yıldırım, Alper, et al.
Publicado: (2025)
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers
por: Lee, Myeonghwa, et al.
Publicado: (2024)
por: Lee, Myeonghwa, et al.
Publicado: (2024)
A Llama walks into the 'Bar': Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam
por: Fernandes, Rean, et al.
Publicado: (2025)
por: Fernandes, Rean, et al.
Publicado: (2025)
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
por: Wu, Zhengxuan, et al.
Publicado: (2025)
por: Wu, Zhengxuan, et al.
Publicado: (2025)
Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
por: Hong, Chunsan, et al.
Publicado: (2025)
por: Hong, Chunsan, et al.
Publicado: (2025)
Adaptive Focus Memory for Language Models
por: Cruz, Christopher
Publicado: (2025)
por: Cruz, Christopher
Publicado: (2025)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
por: Fadli, Samih
Publicado: (2025)
por: Fadli, Samih
Publicado: (2025)
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
por: Frydenlund, Arvid
Publicado: (2025)
por: Frydenlund, Arvid
Publicado: (2025)
Ejemplares similares
-
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
por: Oketunji, Abiodun Finbarrs
Publicado: (2023) -
Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
por: Cho, Hanjun, et al.
Publicado: (2026) -
Towards Probabilistic Question Answering Over Tabular Data
por: Shen, Chen, et al.
Publicado: (2025) -
Uncovering Biases with Reflective Large Language Models
por: Chang, Edward Y.
Publicado: (2024) -
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
por: Peters, Sydney, et al.
Publicado: (2025)