:: Library Catalog

Bibliographic Details
Main Author:	Mahendru, Sakshi
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2406.04490

Similar Items

Venn Diagram Prompting: Accelerating Comprehension with Scaffolding Effect
by: Mahendru, Sakshi, et al.
Published: (2024)

SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection
by: Mahendru, Sakshi, et al.
Published: (2024)

Lean Finder: Semantic Search for Mathlib That Understands User Intents
by: Lu, Jialin, et al.
Published: (2025)

IntentCoding: Amplifying User Intent in Code Generation
by: Fang, Zheng, et al.
Published: (2026)

Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
by: Saxena, Utkarsh, et al.
Published: (2024)

GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching
by: Regmi, Sajal, et al.
Published: (2024)

vCache: Verified Semantic Prompt Caching
by: Schroeder, Luis Gaspar, et al.
Published: (2025)

MeanCache: User-Centric Semantic Caching for LLM Web Services
by: Gill, Waris, et al.
Published: (2024)

SemBench: A Benchmark for Semantic Query Processing Engines
by: Lao, Jiale, et al.
Published: (2025)

IntentRec: Predicting User Session Intent with Hierarchical Multi-Task Learning
by: Oh, Sejoon, et al.
Published: (2024)

Semantics-Aware Caching for Concept Learning
by: Teyou, Louis Mozart Kamdem, et al.
Published: (2026)

Intent Recognition and Out-of-Scope Detection using LLMs in Multi-party Conversations
by: Castillo-López, Galo, et al.
Published: (2025)

KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction
by: Kim, Jang-Hyun, et al.
Published: (2025)

Intent-Aware Neural Query Reformulation for Behavior-Aligned Product Search
by: Yetukuri, Jayanth, et al.
Published: (2025)

CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference
by: Zhou, Enyu, et al.
Published: (2025)

A Population-to-individual Tuning Framework for Adapting Pretrained LM to On-device User Intent Prediction
by: Gong, Jiahui, et al.
Published: (2024)

VLN-Cache: Enabling Token Caching for VLN Models with Visual/Semantic Dynamics Awareness
by: Zheng, Zihao, et al.
Published: (2026)

A Natural Language Processing Framework for Hotel Recommendation Based on Users' Text Reviews
by: Aravani, Lavrentia, et al.
Published: (2024)

Hydro: Adaptive Query Processing of ML Queries
by: Kakkar, Gaurav Tarlok, et al.
Published: (2024)

Contextual Font Recommendations based on User Intent
by: Sharma, Sanat, et al.
Published: (2023)

From Shallow to Deep: Pinning Semantic Intent via Causal GRPO
by: Zhou, Shuyi, et al.
Published: (2026)

Augmenting Automation: Intent-Based User Instruction Classification with Machine Learning
by: Basyal, Lochan, et al.
Published: (2024)

How DDAIR you? Disambiguated Data Augmentation for Intent Recognition
by: Castillo-López, Galo, et al.
Published: (2026)

A General Framework for User-Guided Bayesian Optimization
by: Hvarfner, Carl, et al.
Published: (2023)

CacheFormer: High Attention-Based Segment Caching
by: Singh, Sushant, et al.
Published: (2025)

MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation
by: Noshad, Ali, et al.
Published: (2026)

Continuous Semantic Caching for Low-Cost LLM Serving
by: Atalar, Baran, et al.
Published: (2026)

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
by: Zhou, Qianrui, et al.
Published: (2023)

Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs
by: Pons, Gerard, et al.
Published: (2024)

LiteCache: A Query Similarity-Driven, GPU-Centric KVCache Subsystem for Efficient LLM Inference
by: Yi, Jiawei, et al.
Published: (2025)

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches
by: Fang, Shaoke, et al.
Published: (2026)

Randomization Boosts KV Caching, Learning Balances Query Load: A Joint Perspective
by: Wu, Fangzhou, et al.
Published: (2026)

Multi-Graph Co-Training for Capturing User Intent in Session-based Recommendation
by: Yang, Zhe, et al.
Published: (2024)

Contextual Multilingual Spellchecker for User Queries
by: Sharma, Sanat, et al.
Published: (2023)

Steered Generation via Gradient-Based Optimization on Sparse Query Features
by: Bhattacharyya, Sumanta, et al.
Published: (2026)

Frontend Diffusion: Exploring Intent-Based User Interfaces through Abstract-to-Detailed Task Transitions
by: Zhang, Qinshi, et al.
Published: (2024)

Category-Aware Semantic Caching for Heterogeneous LLM Workloads
by: Wang, Chen, et al.
Published: (2025)

Assembling the Mind's Mosaic: Towards EEG Semantic Intent Decoding
by: Li, Jiahe, et al.
Published: (2026)

Improving Sequential Query Recommendation with Immediate User Feedback
by: Parambath, Shameem A Puthiya, et al.
Published: (2022)

MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning
by: Zhang, Tao, et al.
Published: (2025)