| Main Authors: | Kim, Junkyum; Mahajan, Divya |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.08930 |
Similar Items
EarthSight: A Distributed Framework for Low-Latency Satellite Intelligence
by: Erol, Ansel Kaplan, et al.
Published: (2025)
Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG
by: Zhang, Yicheng, et al.
Published: (2026)
4bit-Quantization in Vector-Embedding for RAG
by: Jeong, Taehee
Published: (2025)
LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG
by: Zheng, Yijia, et al.
Published: (2026)
Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding
by: Shim, Seongwoong, et al.
Published: (2025)
LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments
by: Huang, Jin, et al.
Published: (2025)
EdgeRAG: Online-Indexed RAG for Edge Devices
by: Seemakhupt, Korakit, et al.
Published: (2024)
MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing
by: Go, Seokjin, et al.
Published: (2025)
Reward-RAG: Enhancing RAG with Reward Driven Supervision
by: Nguyen, Thang, et al.
Published: (2024)
Sink-Token-Aware Pruning for Fine-Grained Video Understanding in Efficient Video LLMs
by: Kim, Kibum, et al.
Published: (2026)
HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
by: Sarmah, Bhaskarjit, et al.
Published: (2024)
Mitigating Bias in RAG: Controlling the Embedder
by: Kim, Taeyoun, et al.
Published: (2025)
PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning
by: Kachuee, Mohammad, et al.
Published: (2025)
Stop-RAG: Value-Based Retrieval Control for Iterative RAG
by: Park, Jaewan, et al.
Published: (2025)
HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement
by: Ge, Danying, et al.
Published: (2025)
Prompt Tuning for Natural Language to SQL with Embedding Fine-Tuning and RAG
by: Jang, Jisoo, et al.
Published: (2025)
NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning
by: Wang, Irene, et al.
Published: (2026)
Forecasting GPU Performance for Deep Learning Training and Inference
by: Lee, Seonho, et al.
Published: (2024)
MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG
by: Lim, Woosang, et al.
Published: (2025)
Don't Lag, RAG: Training-Free Adversarial Detection Using RAG
by: Kazoom, Roie, et al.
Published: (2025)
RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition
by: Cofala, Tim, et al.
Published: (2025)
HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving
by: Hu, Zhengding, et al.
Published: (2025)
RAGPulse: An Open-Source RAG Workload Trace to Optimize RAG Serving Systems
by: Wang, Zhengchao, et al.
Published: (2025)
Disco-RAG: Discourse-Aware Retrieval-Augmented Generation
by: Liu, Dongqi, et al.
Published: (2026)
MimirRAG: A Multi-Agent RAG Framework for Financial Data Retrieval with Metadata Integration
by: Samuelsen, Magnus, et al.
Published: (2026)
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
by: de Lima, Rafael Teixeira, et al.
Published: (2024)
iRAG: Advancing RAG for Videos with an Incremental Approach
by: Arefeen, Md Adnan, et al.
Published: (2024)
FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems
by: Fajardo, Val Andrei, et al.
Published: (2025)
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
by: Balaguer, Angels, et al.
Published: (2024)
Fine-Tuning vs. RAG for Multi-Hop Question Answering with Novel Knowledge
by: Yang, Zhuoyi, et al.
Published: (2026)
DAPI: Domain Adaptive Toxicity Probe Vector Intervention for Fine-Grained Detoxification
by: Hyeonsu, Cho, et al.
Published: (2025)
SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)
METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation
by: Ray, Siddhant, et al.
Published: (2024)
SimulRAG: Simulator-based RAG for Grounding LLMs in Long-form Scientific QA
by: Xu, Haozhou, et al.
Published: (2025)
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
by: Ha, Hyeonjeong, et al.
Published: (2025)
Characterizing Compute-Communication Overlap in GPU-Accelerated Distributed Deep Learning: Performance and Power Implications
by: Lee, Seonho, et al.
Published: (2025)
Enhancing classroom teaching with LLMs and RAG
by: Mullins, Elizabeth A, et al.
Published: (2024)
TimeGNN-Augmented Hybrid-Action MARL for Fine-Grained Task Partitioning and Energy-Aware Offloading in MEC
by: Ai, Wei, et al.
Published: (2026)
RAG with Differential Privacy
by: Grislain, Nicolas
Published: (2024)
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
by: Kabir, Muhammad Rafsan, et al.
Published: (2025)