Guardado en:
| Autores principales: | Zve, Evangelia, Bourgne, Gauvain, Icard, Benjamin, Ganascia, Jean-Gabriel |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2603.18358 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
From Outliers to Topics in Language Models: Anticipating Trends in News Corpora
por: Zve, Evangelia, et al.
Publicado: (2025)
por: Zve, Evangelia, et al.
Publicado: (2025)
Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language Models
por: Icard, Benjamin, et al.
Publicado: (2025)
por: Icard, Benjamin, et al.
Publicado: (2025)
Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
por: Icard, Benjamin, et al.
Publicado: (2026)
por: Icard, Benjamin, et al.
Publicado: (2026)
Reliable News or Propagandist News? A Neurosymbolic Model Using Genre, Topic, and Persuasion Techniques to Improve Robustness in Classification
por: Faye, Géraud, et al.
Publicado: (2026)
por: Faye, Géraud, et al.
Publicado: (2026)
An Argumentative Explanation Framework for Generalized Reason Model with Inconsistent Precedents
por: Fungwacharakorn, Wachara, et al.
Publicado: (2025)
por: Fungwacharakorn, Wachara, et al.
Publicado: (2025)
Identifying Narrative Patterns and Outliers in Holocaust Testimonies Using Topic Modeling
por: Ifergan, Maxim, et al.
Publicado: (2024)
por: Ifergan, Maxim, et al.
Publicado: (2024)
An action language-based formalisation of an abstract argumentation framework
por: Munro, Yann, et al.
Publicado: (2024)
por: Munro, Yann, et al.
Publicado: (2024)
How Causal Abstraction Underpins Computational Explanation
por: Geiger, Atticus, et al.
Publicado: (2025)
por: Geiger, Atticus, et al.
Publicado: (2025)
HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with Structured Information for Check-Worthiness Estimation
por: Faye, Géraud, et al.
Publicado: (2024)
por: Faye, Géraud, et al.
Publicado: (2024)
HYBRINFOX at CheckThat! 2024 -- Task 2: Enriching BERT Models with the Expert System VAGO for Subjectivity Detection
por: Casanova, Morgane, et al.
Publicado: (2024)
por: Casanova, Morgane, et al.
Publicado: (2024)
Bucketing the Good Apples: A Method for Diagnosing and Improving Causal Abstraction
por: Puyin, Li, et al.
Publicado: (2026)
por: Puyin, Li, et al.
Publicado: (2026)
When Numbers Tell Half the Story: Human-Metric Alignment in Topic Model Evaluation
por: Prouteau, Thibault, et al.
Publicado: (2026)
por: Prouteau, Thibault, et al.
Publicado: (2026)
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
por: Huang, Jing, et al.
Publicado: (2025)
por: Huang, Jing, et al.
Publicado: (2025)
From Noise to Signal to Selbstzweck: Reframing Human Label Variation in the Era of Post-training in NLP
por: Xu, Shanshan, et al.
Publicado: (2025)
por: Xu, Shanshan, et al.
Publicado: (2025)
Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
por: Li, Raymond, et al.
Publicado: (2026)
por: Li, Raymond, et al.
Publicado: (2026)
Personalized Topic Selection Model for Topic-Grounded Dialogue
por: Fan, Shixuan, et al.
Publicado: (2024)
por: Fan, Shixuan, et al.
Publicado: (2024)
Outliers Dimensions that Disrupt Transformers Are Driven by Frequency
por: Puccetti, Giovanni, et al.
Publicado: (2022)
por: Puccetti, Giovanni, et al.
Publicado: (2022)
Outlier Dimensions Encode Task-Specific Knowledge
por: Rudman, William, et al.
Publicado: (2023)
por: Rudman, William, et al.
Publicado: (2023)
Text-as-Signal: Quantitative Semantic Scoring with Embeddings, Logprobs, and Noise Reduction
por: Moreira, Hugo
Publicado: (2026)
por: Moreira, Hugo
Publicado: (2026)
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
por: Suzgun, Mirac, et al.
Publicado: (2024)
por: Suzgun, Mirac, et al.
Publicado: (2024)
Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification
por: Faye, Géraud, et al.
Publicado: (2024)
por: Faye, Géraud, et al.
Publicado: (2024)
Is It a Free Lunch for Removing Outliers during Pretraining?
por: Liao, Baohao, et al.
Publicado: (2024)
por: Liao, Baohao, et al.
Publicado: (2024)
Rethinking the Outlier Distribution in Large Language Models: An In-depth Study
por: Raman, Rahul, et al.
Publicado: (2025)
por: Raman, Rahul, et al.
Publicado: (2025)
CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents
por: Kostelník, Martin, et al.
Publicado: (2026)
por: Kostelník, Martin, et al.
Publicado: (2026)
AlbNews: A Corpus of Headlines for Topic Modeling in Albanian
por: Çano, Erion, et al.
Publicado: (2024)
por: Çano, Erion, et al.
Publicado: (2024)
TopicENA: Enabling Epistemic Network Analysis at Scale through Automated Topic-Based Coding
por: Lu, Owen H. T., et al.
Publicado: (2026)
por: Lu, Owen H. T., et al.
Publicado: (2026)
SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
por: Tong, Yao, et al.
Publicado: (2025)
por: Tong, Yao, et al.
Publicado: (2025)
Systematic Outliers in Large Language Models
por: An, Yongqi, et al.
Publicado: (2025)
por: An, Yongqi, et al.
Publicado: (2025)
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
por: Jiang, Feng, et al.
Publicado: (2023)
por: Jiang, Feng, et al.
Publicado: (2023)
TopicImpact: Improving Customer Feedback Analysis with Opinion Units for Topic Modeling and Star-Rating Prediction
por: Häglund, Emil, et al.
Publicado: (2025)
por: Häglund, Emil, et al.
Publicado: (2025)
A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments
por: Wu, Zhengxuan, et al.
Publicado: (2024)
por: Wu, Zhengxuan, et al.
Publicado: (2024)
Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
por: Wu, Wei, et al.
Publicado: (2026)
por: Wu, Wei, et al.
Publicado: (2026)
TopicProphet: Prophesies on Temporal Topic Trends and Stocks
por: Kim, Olivia
Publicado: (2025)
por: Kim, Olivia
Publicado: (2025)
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning
por: Yuan, Zhihang, et al.
Publicado: (2026)
por: Yuan, Zhihang, et al.
Publicado: (2026)
Signal in the Noise: Polysemantic Interference Transfers and Predicts Cross-Model Influence
por: Gong, Bofan, et al.
Publicado: (2025)
por: Gong, Bofan, et al.
Publicado: (2025)
When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models
por: Kostelec, Juan Gabriel, et al.
Publicado: (2026)
por: Kostelec, Juan Gabriel, et al.
Publicado: (2026)
BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization
por: Li, Ji-Fu, et al.
Publicado: (2026)
por: Li, Ji-Fu, et al.
Publicado: (2026)
Topic-Conversation Relevance (TCR) Dataset and Benchmarks
por: Fan, Yaran, et al.
Publicado: (2024)
por: Fan, Yaran, et al.
Publicado: (2024)
Topic Segmentation Using Generative Language Models
por: Mackenzie, Pierre, et al.
Publicado: (2025)
por: Mackenzie, Pierre, et al.
Publicado: (2025)
FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning
por: Luo, Haozheng, et al.
Publicado: (2026)
por: Luo, Haozheng, et al.
Publicado: (2026)
Ejemplares similares
-
From Outliers to Topics in Language Models: Anticipating Trends in News Corpora
por: Zve, Evangelia, et al.
Publicado: (2025) -
Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language Models
por: Icard, Benjamin, et al.
Publicado: (2025) -
Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
por: Icard, Benjamin, et al.
Publicado: (2026) -
Reliable News or Propagandist News? A Neurosymbolic Model Using Genre, Topic, and Persuasion Techniques to Improve Robustness in Classification
por: Faye, Géraud, et al.
Publicado: (2026) -
An Argumentative Explanation Framework for Generalized Reason Model with Inconsistent Precedents
por: Fungwacharakorn, Wachara, et al.
Publicado: (2025)