:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kim, Junkyum, Mahajan, Divya
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2504.08930
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EarthSight: A Distributed Framework for Low-Latency Satellite Intelligence
by: Erol, Ansel Kaplan, et al.
Published: (2025)

Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG
by: Zhang, Yicheng, et al.
Published: (2026)

4bit-Quantization in Vector-Embedding for RAG
by: Jeong, Taehee
Published: (2025)

LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG
by: Zheng, Yijia, et al.
Published: (2026)

Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding
by: Shim, Seongwoong, et al.
Published: (2025)

LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments
by: Huang, Jin, et al.
Published: (2025)

EdgeRAG: Online-Indexed RAG for Edge Devices
by: Seemakhupt, Korakit, et al.
Published: (2024)

MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing
by: Go, Seokjin, et al.
Published: (2025)

Reward-RAG: Enhancing RAG with Reward Driven Supervision
by: Nguyen, Thang, et al.
Published: (2024)

Sink-Token-Aware Pruning for Fine-Grained Video Understanding in Efficient Video LLMs
by: Kim, Kibum, et al.
Published: (2026)

HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
by: Sarmah, Bhaskarjit, et al.
Published: (2024)

Mitigating Bias in RAG: Controlling the Embedder
by: Kim, Taeyoun, et al.
Published: (2025)

PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning
by: Kachuee, Mohammad, et al.
Published: (2025)

Stop-RAG: Value-Based Retrieval Control for Iterative RAG
by: Park, Jaewan, et al.
Published: (2025)

HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement
by: Ge, Danying, et al.
Published: (2025)

Prompt Tuning for Natural Language to SQL with Embedding Fine-Tuning and RAG
by: Jang, Jisoo, et al.
Published: (2025)

NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning
by: Wang, Irene, et al.
Published: (2026)

Forecasting GPU Performance for Deep Learning Training and Inference
by: Lee, Seonho, et al.
Published: (2024)

MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG
by: Lim, Woosang, et al.
Published: (2025)

Don't Lag, RAG: Training-Free Adversarial Detection Using RAG
by: Kazoom, Roie, et al.
Published: (2025)

RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition
by: Cofala, Tim, et al.
Published: (2025)

HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving
by: Hu, Zhengding, et al.
Published: (2025)

RAGPulse: An Open-Source RAG Workload Trace to Optimize RAG Serving Systems
by: Wang, Zhengchao, et al.
Published: (2025)

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation
by: Liu, Dongqi, et al.
Published: (2026)

MimirRAG: A Multi-Agent RAG Framework for Financial Data Retrieval with Metadata Integration
by: Samuelsen, Magnus, et al.
Published: (2026)

Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
by: de Lima, Rafael Teixeira, et al.
Published: (2024)

iRAG: Advancing RAG for Videos with an Incremental Approach
by: Arefeen, Md Adnan, et al.
Published: (2024)

FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems
by: Fajardo, Val Andrei, et al.
Published: (2025)

RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
by: Balaguer, Angels, et al.
Published: (2024)

Fine-Tuning vs. RAG for Multi-Hop Question Answering with Novel Knowledge
by: Yang, Zhuoyi, et al.
Published: (2026)

DAPI: Domain Adaptive Toxicity Probe Vector Intervention for Fine-Grained Detoxification
by: Hyeonsu, Cho, et al.
Published: (2025)

SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)

METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation
by: Ray, Siddhant, et al.
Published: (2024)

SimulRAG: Simulator-based RAG for Grounding LLMs in Long-form Scientific QA
by: Xu, Haozhou, et al.
Published: (2025)

MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
by: Ha, Hyeonjeong, et al.
Published: (2025)

Characterizing Compute-Communication Overlap in GPU-Accelerated Distributed Deep Learning: Performance and Power Implications
by: Lee, Seonho, et al.
Published: (2025)

Enhancing classroom teaching with LLMs and RAG
by: Mullins, Elizabeth A, et al.
Published: (2024)

TimeGNN-Augmented Hybrid-Action MARL for Fine-Grained Task Partitioning and Energy-Aware Offloading in MEC
by: Ai, Wei, et al.
Published: (2026)

RAG with Differential Privacy
by: Grislain, Nicolas
Published: (2024)

LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
by: Kabir, Muhammad Rafsan, et al.
Published: (2025)