:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Laborde, Stanislas, Cousseau, Martin, Yaacoub, Antoun, Prevost, Lionel
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computation and Language Artificial Intelligence Machine Learning 68P30 (Primary) 68T07, 68T50 (Secondary) I.2.6; I.5.1; I.2.7
Online-Zugang:	https://arxiv.org/abs/2505.07289
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer
von: Zhang, Tony, et al.
Veröffentlicht: (2025)

CogniLoad: A Synthetic Natural Language Reasoning Benchmark With Tunable Length, Intrinsic Difficulty, and Distractor Density
von: Kaiser, Daniel, et al.
Veröffentlicht: (2025)

Do Reasoning Models Enhance Embedding Models?
von: Chan, Wun Yu, et al.
Veröffentlicht: (2026)

Research on a hybrid LSTM-CNN-Attention model for text-based web content classification
von: Kuz, Mykola, et al.
Veröffentlicht: (2025)

Linguistic Collapse: Neural Collapse in (Large) Language Models
von: Wu, Robert, et al.
Veröffentlicht: (2024)

When Does Content-Based Routing Work? Representation Requirements for Selective Attention in Hybrid Sequence Models
von: Basu, Abhinaba
Veröffentlicht: (2026)

NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution
von: Breneur, Oleksandr Marchenko, et al.
Veröffentlicht: (2026)

Rethinking the Multilingual Reasoning Gap with Layer Swap
von: Lasbordes, Maxence, et al.
Veröffentlicht: (2026)

Harnessing non-adversarial robustness in large language models
von: Zhou, Qinghua, et al.
Veröffentlicht: (2026)

CLMN: Concept based Language Models via Neural Symbolic Reasoning
von: Yang, Yibo
Veröffentlicht: (2025)

Can Agentic AI Match the Performance of Human Data Scientists?
von: Luo, An, et al.
Veröffentlicht: (2025)

Approaches to Semantic Textual Similarity in Slovak Language: From Algorithms to Transformers
von: Radosky, Lukas, et al.
Veröffentlicht: (2026)

SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Models
von: Guo, Dongxin, et al.
Veröffentlicht: (2026)

Unpacking Hateful Memes: Presupposed Context and False Claims
von: Cai, Weibin, et al.
Veröffentlicht: (2025)

Mechanistic Analysis of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
von: Imanov, Olaf Yunus Laitinen
Veröffentlicht: (2026)

Latent Object Permanence: Topological Phase Transitions, Free-Energy Principles, and Renormalization Group Flows in Deep Transformer Manifolds
von: Alpay, Faruk, et al.
Veröffentlicht: (2026)

Inference acceleration for large language models using "stairs" assisted greedy generation
von: Grigaliūnas, Domas, et al.
Veröffentlicht: (2024)

Strategic Doctrine Language Models (sdLM): A Learning-System Framework for Doctrinal Consistency and Geopolitical Forecasting
von: Imanov, Olaf Yunus Laitinen, et al.
Veröffentlicht: (2026)

A Survey on Vision-Language-Action Models for Embodied AI
von: Ma, Yueen, et al.
Veröffentlicht: (2024)

How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
von: Borobia, Hector, et al.
Veröffentlicht: (2026)

AIPsy-Affect: A Keyword-Free Clinical Stimulus Battery for Mechanistic Interpretability of Emotion in Language Models
von: Keeman, Michael
Veröffentlicht: (2026)

The Concept Allocation Zone: Tracking How Concepts Form Across Transformer Depth
von: Henry, James
Veröffentlicht: (2026)

Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference
von: Mathew, Aby Mammen
Veröffentlicht: (2026)

Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching
von: Gadd, Stephen
Veröffentlicht: (2026)

Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
von: Pather, Kaviraj, et al.
Veröffentlicht: (2025)

Measuring Intent Comprehension in LLMs
von: Kunievsky, Nadav, et al.
Veröffentlicht: (2025)

ProactBench: Beyond What The User Asked For
von: Harfi, Sepehr, et al.
Veröffentlicht: (2026)

Causal Dimensionality of Transformer Representations: Measurement, Scaling, and Layer Structure
von: Sarkar, Nilesh, et al.
Veröffentlicht: (2026)

Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pretraining
von: Mitchell, Rupert, et al.
Veröffentlicht: (2025)

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
von: Luo, An, et al.
Veröffentlicht: (2025)

Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining
von: Cao, Deyu, et al.
Veröffentlicht: (2025)

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science
von: Luo, An, et al.
Veröffentlicht: (2026)

Rule Extraction in Machine Learning: Chat Incremental Pattern Constructor
von: Nwokocha, Caleb Princewill
Veröffentlicht: (2022)

Beyond Subtokens: A Rich Character Embedding for Low-resource and Morphologically Complex Languages
von: Schneider, Felix, et al.
Veröffentlicht: (2026)

Transactional Attention: Semantic Sponsorship for KV-Cache Retention
von: Basu, Abhinaba
Veröffentlicht: (2026)

$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy
von: Dang, Kieu, et al.
Veröffentlicht: (2025)

SAGE: Sign-Adaptive Gradient for Memory-Efficient LLM Optimization
von: Lee, Wooin, et al.
Veröffentlicht: (2026)

Application of deep learning approaches for medieval historical documents transcription
von: Voloshchuk, Maksym, et al.
Veröffentlicht: (2025)

RACAS: Controlling Diverse Robots With a Single Agentic System
von: Ashley, Dylan R., et al.
Veröffentlicht: (2026)

JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation
von: Huang, Yingbing, et al.
Veröffentlicht: (2025)