| Main Authors: | Atzberger, Daniel; Cech, Tim; Scheibel, Willy; Döllner, Jürgen; Behrisch, Michael; Schreck, Tobias |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.17876 |
Similar Items
Standardness Clouds Meaning: A Position Regarding the Informed Usage of Standard Datasets
by: Cech, Tim, et al.
Published: (2024)
Athena: Smart order routing on centralized crypto exchanges using a unified order book
by: Henker, Robert, et al.
Published: (2024)
FDive: Learning Relevance Models using Pattern-based Similarity Measures
by: Dennig, Frederik L., et al.
Published: (2019)
Improving Clustering on Occupational Text Data through Dimensionality Reduction
by: García, Iago Xabier Vázquez, et al.
Published: (2025)
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian
by: Hendria, Willy Fitra
Published: (2023)
LCIP: Loss-Controlled Inverse Projection of High-Dimensional Image Data
by: Wang, Yu, et al.
Published: (2026)
Guided Perturbation Sensitivity (GPS): Detecting Adversarial Text via Embedding Stability and Word Importance
by: Tuck, Bryan E., et al.
Published: (2025)
Large Language Models Are Overparameterized Text Encoders
by: K, Thennal D, et al.
Published: (2024)
Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation
by: Bhardwaj, Dhrupad, et al.
Published: (2025)
An Improved Deep Learning Model for Word Embeddings Based Clustering for Large Text Datasets
by: Sutrakar, Vijay Kumar, et al.
Published: (2025)
Non-Monotonic Latency in Apple MPS Decoding: KV Cache Interactions and Execution Regimes
by: Hendria, Willy Fitra
Published: (2026)
Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation
by: Song, Zhuo-Yang, et al.
Published: (2025)
When Latent Geometry Is Not Enough: Draft-Conditioned Latent Refinement for Non-Autoregressive Text Generation
by: Zhang, De Shuai
Published: (2026)
On Debiasing Text Embeddings Through Context Injection
by: Uriot, Thomas
Published: (2024)
Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting
by: Shrestha, Gyanendra, et al.
Published: (2025)
Better Embeddings with Coupled Adam
by: Stollenwerk, Felix, et al.
Published: (2025)
Scaling Embedding Layers in Language Models
by: Yu, Da, et al.
Published: (2025)
To MRL or not to MRL: Text Embeddings are Robust to Truncation Without Matryoshka Learning, Except In Heavy Truncation Scenarios
by: Takeshita, Sotaro, et al.
Published: (2026)
Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification
by: Pecher, Branislav, et al.
Published: (2026)
SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
by: Jiao, Difan, et al.
Published: (2023)
Challenging Assumptions in Learning Generic Text Style Embeddings
by: Ostheimer, Phil, et al.
Published: (2025)
Luxical: High-Speed Lexical-Dense Text Embeddings
by: DatologyAI, et al.
Published: (2025)
Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text Synthesis
by: Lee, Jun-Min, et al.
Published: (2023)
The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models
by: Rizvi-Martel, Michael, et al.
Published: (2026)
Reconciling Kaplan and Chinchilla Scaling Laws
by: Pearce, Tim, et al.
Published: (2024)
An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models
by: Janakiraman, Anantharaman, et al.
Published: (2025)
AnglE-optimized Text Embeddings
by: Li, Xianming, et al.
Published: (2023)
Hakim: Farsi Text Embedding Model
by: Sarmadi, Mehran, et al.
Published: (2025)
Towards Latent Diffusion Suitable For Text
by: Midavaine, Nesta, et al.
Published: (2026)
GD-VAEs: Geometric Dynamic Variational Autoencoders for Learning Nonlinear Dynamics and Dimension Reductions
by: Lopez, Ryan, et al.
Published: (2022)
ShaRP: Shape-Regularized Multidimensional Projections
by: Machado, Alister, et al.
Published: (2023)
Controlling Large Language Model with Latent Actions
by: Jia, Chengxing, et al.
Published: (2025)
Latent Performance Profiling of Large Language Models
by: Chakraborty, Tanmoy, et al.
Published: (2026)
A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models
by: Herbold, Steffen, et al.
Published: (2026)
On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models
by: Varshavsky-Hassid, Miri, et al.
Published: (2024)
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding
by: Xiao, Feng, et al.
Published: (2025)
Scaling Embeddings Outperforms Scaling Experts in Language Models
by: Liu, Hong, et al.
Published: (2026)
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
by: Zhang, Liyi, et al.
Published: (2024)
RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Streams
by: Iturra-Bocaz, Gabriel, et al.
Published: (2025)
Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior
by: Huang, Zeyi, et al.
Published: (2026)