:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Atzberger, Daniel, Cech, Tim, Scheibel, Willy, Döllner, Jürgen, Behrisch, Michael, Schreck, Tobias
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2407.17876
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Standardness Clouds Meaning: A Position Regarding the Informed Usage of Standard Datasets
by: Cech, Tim, et al.
Published: (2024)

Athena: Smart order routing on centralized crypto exchanges using a unified order book
by: Robert Henker, et al.
Published: (2024)

FDive: Learning Relevance Models using Pattern-based Similarity Measures
by: Dennig, Frederik L., et al.
Published: (2019)

Improving Clustering on Occupational Text Data through Dimensionality Reduction
by: García, Iago Xabier Vázquez, et al.
Published: (2025)

MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian
by: Hendria, Willy Fitra
Published: (2023)

LCIP: Loss-Controlled Inverse Projection of High-Dimensional Image Data
by: Wang, Yu, et al.
Published: (2026)

Guided Perturbation Sensitivity (GPS): Detecting Adversarial Text via Embedding Stability and Word Importance
by: Tuck, Bryan E., et al.
Published: (2025)

Large Language Models Are Overparameterized Text Encoders
by: K, Thennal D, et al.
Published: (2024)

Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation
by: Bhardwaj, Dhrupad, et al.
Published: (2025)

An Improved Deep Learning Model for Word Embeddings Based Clustering for Large Text Datasets
by: Sutrakar, Vijay Kumar, et al.
Published: (2025)

Non-Monotonic Latency in Apple MPS Decoding: KV Cache Interactions and Execution Regimes
by: Hendria, Willy Fitra
Published: (2026)

Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation
by: Song, Zhuo-Yang, et al.
Published: (2025)

When Latent Geometry Is Not Enough: Draft-Conditioned Latent Refinement for Non-Autoregressive Text Generation
by: Zhang, De Shuai
Published: (2026)

On Debiasing Text Embeddings Through Context Injection
by: Uriot, Thomas
Published: (2024)

Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting
by: Shrestha, Gyanendra, et al.
Published: (2025)

Better Embeddings with Coupled Adam
by: Stollenwerk, Felix, et al.
Published: (2025)

Scaling Embedding Layers in Language Models
by: Yu, Da, et al.
Published: (2025)

To MRL or not to MRL: Text Embeddings are Robust to Truncation Without Matryoshka Learning, Except In Heavy Truncation Scenarios
by: Takeshita, Sotaro, et al.
Published: (2026)

Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification
by: Pecher, Branislav, et al.
Published: (2026)

SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
by: Jiao, Difan, et al.
Published: (2023)

Challenging Assumptions in Learning Generic Text Style Embeddings
by: Ostheimer, Phil, et al.
Published: (2025)

Luxical: High-Speed Lexical-Dense Text Embeddings
by: DatologyAI, et al.
Published: (2025)

Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text Synthesis
by: Lee, Jun-Min, et al.
Published: (2023)

The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models
by: Rizvi-Martel, Michael, et al.
Published: (2026)

Reconciling Kaplan and Chinchilla Scaling Laws
by: Pearce, Tim, et al.
Published: (2024)

An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models
by: Janakiraman, Anantharaman, et al.
Published: (2025)

AnglE-optimized Text Embeddings
by: Li, Xianming, et al.
Published: (2023)

Hakim: Farsi Text Embedding Model
by: Sarmadi, Mehran, et al.
Published: (2025)

Towards Latent Diffusion Suitable For Text
by: Midavaine, Nesta, et al.
Published: (2026)

GD-VAEs: Geometric Dynamic Variational Autoencoders for Learning Nonlinear Dynamics and Dimension Reductions
by: Lopez, Ryan, et al.
Published: (2022)

ShaRP: Shape-Regularized Multidimensional Projections
by: Machado, Alister, et al.
Published: (2023)

Controlling Large Language Model with Latent Actions
by: Jia, Chengxing, et al.
Published: (2025)

Latent Performance Profiling of Large Language Models
by: Chakraborty, Tanmoy, et al.
Published: (2026)

A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models
by: Herbold, Steffen, et al.
Published: (2026)

On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models
by: Varshavsky-Hassid, Miri, et al.
Published: (2024)

Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding
by: Xiao, Feng, et al.
Published: (2025)

Scaling Embeddings Outperforms Scaling Experts in Language Models
by: Liu, Hong, et al.
Published: (2026)

What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
by: Zhang, Liyi, et al.
Published: (2024)

RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Streams
by: Iturra-Bocaz, Gabriel, et al.
Published: (2025)

Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior
by: Huang, Zeyi, et al.
Published: (2026)