| Main Author: | Mahendru, Sakshi |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.04490 |
Similar Items
Venn Diagram Prompting : Accelerating Comprehension with Scaffolding Effect
by: Mahendru, Sakshi, et al.
Published: (2024)
SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection
by: Mahendru, Sakshi, et al.
Published: (2024)
Lean Finder: Semantic Search for Mathlib That Understands User Intents
by: Lu, Jialin, et al.
Published: (2025)
IntentCoding: Amplifying User Intent in Code Generation
by: Fang, Zheng, et al.
Published: (2026)
Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
by: Saxena, Utkarsh, et al.
Published: (2024)
GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching
by: Regmi, Sajal, et al.
Published: (2024)
vCache: Verified Semantic Prompt Caching
by: Schroeder, Luis Gaspar, et al.
Published: (2025)
MeanCache: User-Centric Semantic Caching for LLM Web Services
by: Gill, Waris, et al.
Published: (2024)
SemBench: A Benchmark for Semantic Query Processing Engines
by: Lao, Jiale, et al.
Published: (2025)
IntentRec: Predicting User Session Intent with Hierarchical Multi-Task Learning
by: Oh, Sejoon, et al.
Published: (2024)
Semantics-Aware Caching for Concept Learning
by: Teyou, Louis Mozart Kamdem, et al.
Published: (2026)
Intent Recognition and Out-of-Scope Detection using LLMs in Multi-party Conversations
by: Castillo-López, Galo, et al.
Published: (2025)
KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction
by: Kim, Jang-Hyun, et al.
Published: (2025)
Intent-Aware Neural Query Reformulation for Behavior-Aligned Product Search
by: Yetukuri, Jayanth, et al.
Published: (2025)
CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference
by: Zhou, Enyu, et al.
Published: (2025)
A Population-to-individual Tuning Framework for Adapting Pretrained LM to On-device User Intent Prediction
by: Gong, Jiahui, et al.
Published: (2024)
VLN-Cache: Enabling Token Caching for VLN Models with Visual/Semantic Dynamics Awareness
by: Zheng, Zihao, et al.
Published: (2026)
A Natural Language Processing Framework for Hotel Recommendation Based on Users' Text Reviews
by: Aravani, Lavrentia, et al.
Published: (2024)
Hydro: Adaptive Query Processing of ML Queries
by: Kakkar, Gaurav Tarlok, et al.
Published: (2024)
Contextual Font Recommendations based on User Intent
by: Sharma, Sanat, et al.
Published: (2023)
From Shallow to Deep: Pinning Semantic Intent via Causal GRPO
by: Zhou, Shuyi, et al.
Published: (2026)
Augmenting Automation: Intent-Based User Instruction Classification with Machine Learning
by: Basyal, Lochan, et al.
Published: (2024)
How DDAIR you? Disambiguated Data Augmentation for Intent Recognition
by: Castillo-López, Galo, et al.
Published: (2026)
A General Framework for User-Guided Bayesian Optimization
by: Hvarfner, Carl, et al.
Published: (2023)
CacheFormer: High Attention-Based Segment Caching
by: Singh, Sushant, et al.
Published: (2025)
MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation
by: Noshad, Ali, et al.
Published: (2026)
Continuous Semantic Caching for Low-Cost LLM Serving
by: Atalar, Baran, et al.
Published: (2026)
Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
by: Zhou, Qianrui, et al.
Published: (2023)
Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs
by: Pons, Gerard, et al.
Published: (2024)
LiteCache: A Query Similarity-Driven, GPU-Centric KVCache Subsystem for Efficient LLM Inference
by: Yi, Jiawei, et al.
Published: (2025)
Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches
by: Fang, Shaoke, et al.
Published: (2026)
Randomization Boosts KV Caching, Learning Balances Query Load: A Joint Perspective
by: Wu, Fangzhou, et al.
Published: (2026)
Multi-Graph Co-Training for Capturing User Intent in Session-based Recommendation
by: Yang, Zhe, et al.
Published: (2024)
Contextual Multilingual Spellchecker for User Queries
by: Sharma, Sanat, et al.
Published: (2023)
Steered Generation via Gradient-Based Optimization on Sparse Query Features
by: Bhattacharyya, Sumanta, et al.
Published: (2026)
Frontend Diffusion: Exploring Intent-Based User Interfaces through Abstract-to-Detailed Task Transitions
by: Zhang, Qinshi, et al.
Published: (2024)
Category-Aware Semantic Caching for Heterogeneous LLM Workloads
by: Wang, Chen, et al.
Published: (2025)
Assembling the Mind's Mosaic: Towards EEG Semantic Intent Decoding
by: Li, Jiahe, et al.
Published: (2026)
Improving Sequential Query Recommendation with Immediate User Feedback
by: Parambath, Shameem A Puthiya, et al.
Published: (2022)
MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning
by: Zhang, Tao, et al.
Published: (2025)