| Main Authors: | Seputis, Dominykas, Li, Yongkang, Langerak, Karsten, Mihailov, Serghei |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.07700 |
Similar Items
ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
by: Chen, Jianlyu, et al.
Published: (2025)
Equity by Design: Fairness-Driven Recommendation in Heterogeneous Two-Sided Markets
by: Seputis, Dominykas, et al.
Published: (2026)
Interpretable Text Embeddings and Text Similarity Explanation: A Survey
by: Opitz, Juri, et al.
Published: (2025)
Improving Text Embeddings with Large Language Models
by: Wang, Liang, et al.
Published: (2023)
FinMTEB: Finance Massive Text Embedding Benchmark
by: Tang, Yixuan, et al.
Published: (2025)
JFinTEB: Japanese Financial Text Embedding Benchmark
by: Suzuki, Masahiro, et al.
Published: (2026)
Text Embeddings by Weakly-Supervised Contrastive Pre-training
by: Wang, Liang, et al.
Published: (2022)
Multilingual E5 Text Embeddings: A Technical Report
by: Wang, Liang, et al.
Published: (2024)
ChEmbed: Enhancing Chemical Literature Search Through Domain-Specific Text Embeddings
by: Kasmaee, Ali Shiraee, et al.
Published: (2025)
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
by: Merrick, Luke, et al.
Published: (2024)
A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
by: Nie, Zhijie, et al.
Published: (2024)
Multi-Modal Adapter for Vision-Language Models
by: Seputis, Dominykas, et al.
Published: (2024)
Enhancing Lexicon-Based Text Embeddings with Large Language Models
by: Lei, Yibin, et al.
Published: (2025)
MMTEB: Massive Multilingual Text Embedding Benchmark
by: Enevoldsen, Kenneth, et al.
Published: (2025)
Quantifying Positional Biases in Text Embedding Models
by: Lee, Reagan J., et al.
Published: (2024)
Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks
by: Babakhin, Yauhen, et al.
Published: (2025)
Applying Text Embedding Models for Efficient Analysis in Labeled Property Graphs
by: Podstawski, Michal
Published: (2025)
Pooling and Semantic Shift: The Fundamental Challenges in Long Text Embedding and Retrieval
by: Gao, Hang, et al.
Published: (2026)
Training Sparse Mixture Of Experts Text Embedding Models
by: Nussbaum, Zach, et al.
Published: (2025)
FaMTEB: Massive Text Embedding Benchmark in Persian Language
by: Zinvandi, Erfan, et al.
Published: (2025)
Bagging-Based Model Merging for Robust General Text Embeddings
by: Zhang, Hengran, et al.
Published: (2026)
When Text Embedding Meets Large Language Model: A Comprehensive Survey
by: Nie, Zhijie, et al.
Published: (2024)
LEAF: Knowledge Distillation of Text Embedding Models with Teacher-Aligned Representations
by: Vujanic, Robin, et al.
Published: (2025)
Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment
by: Fazili, Barah, et al.
Published: (2026)
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
by: Liao, Baohao, et al.
Published: (2023)
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
by: Zhou, Junjie, et al.
Published: (2024)
Do We Really Need Specialization? Evaluating Generalist Text Embeddings for Zero-Shot Recommendation and Search
by: Attimonelli, Matteo, et al.
Published: (2025)
Less is More: Adapting Text Embeddings for Low-Resource Languages with Small Scale Noisy Synthetic Data
by: Navasardyan, Zaruhi, et al.
Published: (2026)
Reproducing HotFlip for Corpus Poisoning Attacks in Dense Retrieval
by: Li, Yongkang, et al.
Published: (2025)
Spectral Tempering for Embedding Compression in Dense Passage Retrieval
by: Li, Yongkang, et al.
Published: (2026)
Position: Text Embeddings Should Capture Implicit Semantics, Not Just Surface Meaning
by: Sun, Yiqun, et al.
Published: (2025)
QuOTE: Question-Oriented Text Embeddings
by: Neeser, Andrew, et al.
Published: (2025)
E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
by: Liu, Qi, et al.
Published: (2025)
Rethinking Schema Linking: A Context-Aware Bidirectional Retrieval Approach for Text-to-SQL
by: Nahid, Md Mahadi Hasan, et al.
Published: (2025)
Don't Reinvent the Wheel: Efficient Instruction-Following Text Embedding based on Guided Space Transformation
by: Feng, Yingchaojie, et al.
Published: (2025)
Text Data Integration
by: Rahman, Md Ataur, et al.
Published: (2026)
Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval
by: Eichholtz, Arne, et al.
Published: (2026)
Text2Token: Unsupervised Text Representation Learning with Token Target Prediction
by: An, Ruize, et al.
Published: (2025)
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
by: Zhang, Xin, et al.
Published: (2024)
UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding
by: Fang, Yi, et al.
Published: (2024)