Saved in:
| Main Authors: | Li, Shiyu, Tang, Yang, Chen, Shizhe, Chen, Xi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.15710 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings
by: Li, Shiyu, et al.
Published: (2025)
by: Li, Shiyu, et al.
Published: (2025)
HiNS: Hierarchical Negative Sampling for More Comprehensive Memory Retrieval Embedding Model
by: Tian, Motong, et al.
Published: (2026)
by: Tian, Motong, et al.
Published: (2026)
Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
by: Tang, Yang, et al.
Published: (2025)
by: Tang, Yang, et al.
Published: (2025)
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
by: Li, Shiyu, et al.
Published: (2025)
by: Li, Shiyu, et al.
Published: (2025)
Negative Matters: Multi-Granularity Hard-Negative Synthesis and Anchor-Token-Aware Pooling for Enhanced Text Embeddings
by: Pan, Tengyu, et al.
Published: (2025)
by: Pan, Tengyu, et al.
Published: (2025)
FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents
by: Pham, Quang Hieu, et al.
Published: (2026)
by: Pham, Quang Hieu, et al.
Published: (2026)
Interleaving Reasoning for Better Text-to-Image Generation
by: Huang, Wenxuan, et al.
Published: (2025)
by: Huang, Wenxuan, et al.
Published: (2025)
Language Models that Think, Chat Better
by: Bhaskar, Adithya, et al.
Published: (2025)
by: Bhaskar, Adithya, et al.
Published: (2025)
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
by: Tang, Yixuan, et al.
Published: (2026)
by: Tang, Yixuan, et al.
Published: (2026)
Optimal Embedding Guided Negative Sample Generation for Knowledge Graph Link Prediction
by: Takamoto, Makoto, et al.
Published: (2025)
by: Takamoto, Makoto, et al.
Published: (2025)
A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization
by: Chu, KuanChao, et al.
Published: (2024)
by: Chu, KuanChao, et al.
Published: (2024)
Towards Better Text-to-Image Generation Alignment via Attention Modulation
by: Wu, Yihang, et al.
Published: (2024)
by: Wu, Yihang, et al.
Published: (2024)
More Samples or More Prompts? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering
by: Yao, Bingsheng, et al.
Published: (2023)
by: Yao, Bingsheng, et al.
Published: (2023)
FinMTEB: Finance Massive Text Embedding Benchmark
by: Tang, Yixuan, et al.
Published: (2025)
by: Tang, Yixuan, et al.
Published: (2025)
Let LLMs Speak Embedding Languages: Generative Text Embeddings via Iterative Contrastive Refinement
by: Tsai, Yu-Che, et al.
Published: (2025)
by: Tsai, Yu-Che, et al.
Published: (2025)
Hierarchical Attention Generates Better Proofs
by: Chen, Jianlong, et al.
Published: (2025)
by: Chen, Jianlong, et al.
Published: (2025)
No More Distractions: an Adaptive Up-Sampling Algorithm to Reduce Data Artifacts
by: Chen, Han
Published: (2024)
by: Chen, Han
Published: (2024)
Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning
by: Du, Yexing, et al.
Published: (2024)
by: Du, Yexing, et al.
Published: (2024)
Evolutionary Negative Module Pruning for Better LoRA Merging
by: Cao, Anda, et al.
Published: (2026)
by: Cao, Anda, et al.
Published: (2026)
Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
by: Qiu, Huming, et al.
Published: (2024)
by: Qiu, Huming, et al.
Published: (2024)
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
by: Chen, Yingquan, et al.
Published: (2025)
by: Chen, Yingquan, et al.
Published: (2025)
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
by: Solatorio, Aivin V.
Published: (2024)
by: Solatorio, Aivin V.
Published: (2024)
Ruri: Japanese General Text Embeddings
by: Tsukagoshi, Hayato, et al.
Published: (2024)
by: Tsukagoshi, Hayato, et al.
Published: (2024)
ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
by: Chen, Jianlyu, et al.
Published: (2025)
by: Chen, Jianlyu, et al.
Published: (2025)
Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)
by: Lv, Ang, et al.
Published: (2024)
Progressive Residual Warmup for Language Model Pretraining
by: Chen, Tianhao, et al.
Published: (2026)
by: Chen, Tianhao, et al.
Published: (2026)
Are Clinical T5 Models Better for Clinical Text?
by: Li, Yahan, et al.
Published: (2024)
by: Li, Yahan, et al.
Published: (2024)
Graph Contrastive Learning via Cluster-refined Negative Sampling for Semi-supervised Text Classification
by: Ai, Wei, et al.
Published: (2024)
by: Ai, Wei, et al.
Published: (2024)
Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation
by: Lopez, Ivan, et al.
Published: (2025)
by: Lopez, Ivan, et al.
Published: (2025)
Are Large Language Models More Honest in Their Probabilistic or Verbalized Confidence?
by: Ni, Shiyu, et al.
Published: (2024)
by: Ni, Shiyu, et al.
Published: (2024)
Self-Correction Makes LLMs Better Parsers
by: Zhang, Ziyan, et al.
Published: (2025)
by: Zhang, Ziyan, et al.
Published: (2025)
Semantic Adapter for Universal Text Embeddings: Diagnosing and Mitigating Negation Blindness to Enhance Universality
by: Cao, Hongliu
Published: (2025)
by: Cao, Hongliu
Published: (2025)
Predicting Through Generation: Why Generation Is Better for Prediction
by: Kowsher, Md, et al.
Published: (2025)
by: Kowsher, Md, et al.
Published: (2025)
Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
by: Deng, Ningyuan, et al.
Published: (2025)
by: Deng, Ningyuan, et al.
Published: (2025)
Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework
by: Tao, Zhen, et al.
Published: (2024)
by: Tao, Zhen, et al.
Published: (2024)
Learning from Negative Samples in Biomedical Generative Entity Linking
by: Kim, Chanhwi, et al.
Published: (2024)
by: Kim, Chanhwi, et al.
Published: (2024)
Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding
by: Feng, Xincan, et al.
Published: (2024)
by: Feng, Xincan, et al.
Published: (2024)
TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
Learning from Mistakes: Negative Reasoning Samples Enhance Out-of-Domain Generalization
by: Tian, Xueyun, et al.
Published: (2026)
by: Tian, Xueyun, et al.
Published: (2026)
Similar Items
-
Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings
by: Li, Shiyu, et al.
Published: (2025) -
HiNS: Hierarchical Negative Sampling for More Comprehensive Memory Retrieval Embedding Model
by: Tian, Motong, et al.
Published: (2026) -
Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
by: Tang, Yang, et al.
Published: (2025) -
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
by: Li, Shiyu, et al.
Published: (2025) -
Negative Matters: Multi-Granularity Hard-Negative Synthesis and Anchor-Token-Aware Pooling for Enhanced Text Embeddings
by: Pan, Tengyu, et al.
Published: (2025)