:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Hassell, Jackson, Zhang, Dan, Kim, Hannah, Mitchell, Tom, Hruschka, Estevam
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language Artificial Intelligence Machine Learning I.2.7
Acceso en línea:	https://arxiv.org/abs/2510.19897
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
por: Oketunji, Abiodun Finbarrs
Publicado: (2023)

Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
por: Cho, Hanjun, et al.
Publicado: (2026)

Towards Probabilistic Question Answering Over Tabular Data
por: Shen, Chen, et al.
Publicado: (2025)

Uncovering Biases with Reflective Large Language Models
por: Chang, Edward Y.
Publicado: (2024)

Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
por: Peters, Sydney, et al.
Publicado: (2025)

Associative Recurrent Memory Transformer
por: Rodkin, Ivan, et al.
Publicado: (2024)

A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs
por: Park, Chiwan, et al.
Publicado: (2025)

SPRInG: Continual LLM Personalization via Selective Parametric Adaptation and Retrieval-Interpolated Generation
por: Kim, Seoyeon, et al.
Publicado: (2026)

Self-Supervised Position Debiasing for Large Language Models
por: Liu, Zhongkun, et al.
Publicado: (2024)

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
por: Lyu, Bohan, et al.
Publicado: (2024)

Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation
por: A, Snegha, et al.
Publicado: (2025)

MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification
por: Sirbu, Iustin, et al.
Publicado: (2025)

Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models
por: Kurtic, Eldar, et al.
Publicado: (2024)

Graph Memory Transformer (GMT)
por: Zanarini, Nicola, et al.
Publicado: (2026)

AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks
por: Wang, Fali, et al.
Publicado: (2025)

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
por: Salehmohamed, Shoaib Sadiq, et al.
Publicado: (2026)

Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)

ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
por: Kadu, Ankush, et al.
Publicado: (2025)

Large Language Model (LLM) Bias Index -- LLMBI
por: Oketunji, Abiodun Finbarrs, et al.
Publicado: (2023)

Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource Languages
por: Bajpai, Ashutosh, et al.
Publicado: (2024)

Uncovering Latent Human Wellbeing in Language Model Embeddings
por: Freire, Pedro, et al.
Publicado: (2024)

Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
por: Gadzhiev, Artem, et al.
Publicado: (2026)

Key-Value Means: Transformers with Expandable Block-Recurrent Compressed Memory
por: Goldstein, Daniel, et al.
Publicado: (2026)

RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
por: Saji, Alan, et al.
Publicado: (2025)

Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
por: Quevedo, Ernesto, et al.
Publicado: (2024)

ProMedTS: A Self-Supervised, Prompt-Guided Multimodal Approach for Integrating Medical Text and Time Series
por: Niu, Shuai, et al.
Publicado: (2025)

Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents
por: Kim, San, et al.
Publicado: (2024)

Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions
por: Yun, Taedong, et al.
Publicado: (2025)

Skill Availability and Presentation Granularity in Large-Language-Model Agents: A Controlled SkillsBench Study
por: Xu, Xiaonan, et al.
Publicado: (2026)

$\text{Memory}^3$: Language Modeling with Explicit Memory
por: Yang, Hongkang, et al.
Publicado: (2024)

Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity
por: Pan, Xinghan
Publicado: (2025)

In-Context Fixation: When Demonstrated Labels Override Semantics in Few-Shot Classification
por: Liu, Ming
Publicado: (2026)

Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
por: Yıldırım, Alper, et al.
Publicado: (2025)

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers
por: Lee, Myeonghwa, et al.
Publicado: (2024)

A Llama walks into the 'Bar': Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam
por: Fernandes, Rean, et al.
Publicado: (2025)

AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
por: Wu, Zhengxuan, et al.
Publicado: (2025)

Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
por: Hong, Chunsan, et al.
Publicado: (2025)

Adaptive Focus Memory for Language Models
por: Cruz, Christopher
Publicado: (2025)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
por: Fadli, Samih
Publicado: (2025)

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
por: Frydenlund, Arvid
Publicado: (2025)