:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Shiyu, Tang, Yang, Chen, Shizhe, Chen, Xi
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.15710
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings
by: Li, Shiyu, et al.
Published: (2025)

HiNS: Hierarchical Negative Sampling for More Comprehensive Memory Retrieval Embedding Model
by: Tian, Motong, et al.
Published: (2026)

Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
by: Tang, Yang, et al.
Published: (2025)

ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
by: Li, Shiyu, et al.
Published: (2025)

Negative Matters: Multi-Granularity Hard-Negative Synthesis and Anchor-Token-Aware Pooling for Enhanced Text Embeddings
by: Pan, Tengyu, et al.
Published: (2025)

FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents
by: Pham, Quang Hieu, et al.
Published: (2026)

Interleaving Reasoning for Better Text-to-Image Generation
by: Huang, Wenxuan, et al.
Published: (2025)

Language Models that Think, Chat Better
by: Bhaskar, Adithya, et al.
Published: (2025)

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
by: Tang, Yixuan, et al.
Published: (2026)

Optimal Embedding Guided Negative Sample Generation for Knowledge Graph Link Prediction
by: Takamoto, Makoto, et al.
Published: (2025)

A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization
by: Chu, KuanChao, et al.
Published: (2024)

Towards Better Text-to-Image Generation Alignment via Attention Modulation
by: Wu, Yihang, et al.
Published: (2024)

More Samples or More Prompts? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering
by: Yao, Bingsheng, et al.
Published: (2023)

FinMTEB: Finance Massive Text Embedding Benchmark
by: Tang, Yixuan, et al.
Published: (2025)

Let LLMs Speak Embedding Languages: Generative Text Embeddings via Iterative Contrastive Refinement
by: Tsai, Yu-Che, et al.
Published: (2025)

Hierarchical Attention Generates Better Proofs
by: Chen, Jianlong, et al.
Published: (2025)

No More Distractions: an Adaptive Up-Sampling Algorithm to Reduce Data Artifacts
by: Chen, Han
Published: (2024)

Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning
by: Du, Yexing, et al.
Published: (2024)

Evolutionary Negative Module Pruning for Better LoRA Merging
by: Cao, Anda, et al.
Published: (2026)

Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
by: Qiu, Huming, et al.
Published: (2024)

A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
by: Chen, Yingquan, et al.
Published: (2025)

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
by: Solatorio, Aivin V.
Published: (2024)

Ruri: Japanese General Text Embeddings
by: Tsukagoshi, Hayato, et al.
Published: (2024)

ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
by: Chen, Jianlyu, et al.
Published: (2025)

Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion
by: Zhang, Yu, et al.
Published: (2025)

More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)

Progressive Residual Warmup for Language Model Pretraining
by: Chen, Tianhao, et al.
Published: (2026)

Are Clinical T5 Models Better for Clinical Text?
by: Li, Yahan, et al.
Published: (2024)

Graph Contrastive Learning via Cluster-refined Negative Sampling for Semi-supervised Text Classification
by: Ai, Wei, et al.
Published: (2024)

Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation
by: Lopez, Ivan, et al.
Published: (2025)

Are Large Language Models More Honest in Their Probabilistic or Verbalized Confidence?
by: Ni, Shiyu, et al.
Published: (2024)

Self-Correction Makes LLMs Better Parsers
by: Zhang, Ziyan, et al.
Published: (2025)

Semantic Adapter for Universal Text Embeddings: Diagnosing and Mitigating Negation Blindness to Enhance Universality
by: Cao, Hongliu
Published: (2025)

Predicting Through Generation: Why Generation Is Better for Prediction
by: Kowsher, Md, et al.
Published: (2025)

Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
by: Deng, Ningyuan, et al.
Published: (2025)

Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework
by: Tao, Zhen, et al.
Published: (2024)

Learning from Negative Samples in Biomedical Generative Entity Linking
by: Kim, Chanhwi, et al.
Published: (2024)

Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding
by: Feng, Xincan, et al.
Published: (2024)

TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
by: Cao, Yang, et al.
Published: (2025)

Learning from Mistakes: Negative Reasoning Samples Enhance Out-of-Domain Generalization
by: Tian, Xueyun, et al.
Published: (2026)