:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Zve, Evangelia, Bourgne, Gauvain, Icard, Benjamin, Ganascia, Jean-Gabriel
Formato:	Preprint
Publicado:	2026
Materias:	Computation and Language Artificial Intelligence
Acceso en línea:	https://arxiv.org/abs/2603.18358
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

From Outliers to Topics in Language Models: Anticipating Trends in News Corpora
por: Zve, Evangelia, et al.
Publicado: (2025)

Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language Models
por: Icard, Benjamin, et al.
Publicado: (2025)

Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
por: Icard, Benjamin, et al.
Publicado: (2026)

Reliable News or Propagandist News? A Neurosymbolic Model Using Genre, Topic, and Persuasion Techniques to Improve Robustness in Classification
por: Faye, Géraud, et al.
Publicado: (2026)

An Argumentative Explanation Framework for Generalized Reason Model with Inconsistent Precedents
por: Fungwacharakorn, Wachara, et al.
Publicado: (2025)

Identifying Narrative Patterns and Outliers in Holocaust Testimonies Using Topic Modeling
por: Ifergan, Maxim, et al.
Publicado: (2024)

An action language-based formalisation of an abstract argumentation framework
por: Munro, Yann, et al.
Publicado: (2024)

How Causal Abstraction Underpins Computational Explanation
por: Geiger, Atticus, et al.
Publicado: (2025)

HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with Structured Information for Check-Worthiness Estimation
por: Faye, Géraud, et al.
Publicado: (2024)

HYBRINFOX at CheckThat! 2024 -- Task 2: Enriching BERT Models with the Expert System VAGO for Subjectivity Detection
por: Casanova, Morgane, et al.
Publicado: (2024)

Bucketing the Good Apples: A Method for Diagnosing and Improving Causal Abstraction
por: Puyin, Li, et al.
Publicado: (2026)

When Numbers Tell Half the Story: Human-Metric Alignment in Topic Model Evaluation
por: Prouteau, Thibault, et al.
Publicado: (2026)

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
por: Huang, Jing, et al.
Publicado: (2025)

From Noise to Signal to Selbstzweck: Reframing Human Label Variation in the Era of Post-training in NLP
por: Xu, Shanshan, et al.
Publicado: (2025)

Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
por: Li, Raymond, et al.
Publicado: (2026)

Personalized Topic Selection Model for Topic-Grounded Dialogue
por: Fan, Shixuan, et al.
Publicado: (2024)

Outliers Dimensions that Disrupt Transformers Are Driven by Frequency
por: Puccetti, Giovanni, et al.
Publicado: (2022)

Outlier Dimensions Encode Task-Specific Knowledge
por: Rudman, William, et al.
Publicado: (2023)

Text-as-Signal: Quantitative Semantic Scoring with Embeddings, Logprobs, and Noise Reduction
por: Moreira, Hugo
Publicado: (2026)

Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
por: Suzgun, Mirac, et al.
Publicado: (2024)

Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification
por: Faye, Géraud, et al.
Publicado: (2024)

Is It a Free Lunch for Removing Outliers during Pretraining?
por: Liao, Baohao, et al.
Publicado: (2024)

Rethinking the Outlier Distribution in Large Language Models: An In-depth Study
por: Raman, Rahul, et al.
Publicado: (2025)

CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents
por: Kostelník, Martin, et al.
Publicado: (2026)

AlbNews: A Corpus of Headlines for Topic Modeling in Albanian
por: Çano, Erion, et al.
Publicado: (2024)

TopicENA: Enabling Epistemic Network Analysis at Scale through Automated Topic-Based Coding
por: Lu, Owen H. T., et al.
Publicado: (2026)

SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
por: Tong, Yao, et al.
Publicado: (2025)

Systematic Outliers in Large Language Models
por: An, Yongqi, et al.
Publicado: (2025)

Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
por: Jiang, Feng, et al.
Publicado: (2023)

TopicImpact: Improving Customer Feedback Analysis with Opinion Units for Topic Modeling and Star-Rating Prediction
por: Häglund, Emil, et al.
Publicado: (2025)

A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments
por: Wu, Zhengxuan, et al.
Publicado: (2024)

Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
por: Wu, Wei, et al.
Publicado: (2026)

TopicProphet: Prophesies on Temporal Topic Trends and Stocks
por: Kim, Olivia
Publicado: (2025)

Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning
por: Yuan, Zhihang, et al.
Publicado: (2026)

Signal in the Noise: Polysemantic Interference Transfers and Predicts Cross-Model Influence
por: Gong, Bofan, et al.
Publicado: (2025)

When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models
por: Kostelec, Juan Gabriel, et al.
Publicado: (2026)

BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization
por: Li, Ji-Fu, et al.
Publicado: (2026)

Topic-Conversation Relevance (TCR) Dataset and Benchmarks
por: Fan, Yaran, et al.
Publicado: (2024)

Topic Segmentation Using Generative Language Models
por: Mackenzie, Pierre, et al.
Publicado: (2025)

FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning
por: Luo, Haozheng, et al.
Publicado: (2026)