| Main Authors: | Jayakody, Shakya, Zhao, Youpeng, Wang, Jun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.02006 |
Similar Items
GhostServe: A Lightweight Checkpointing System in the Shadow for Fault-Tolerant LLM Serving
by: Jayakody, Shakya, et al.
Published: (2026)
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching
by: Zhao, Youpeng, et al.
Published: (2024)
Thinking in Groups: Permutation Tests Reveal Near-Out-of-Distribution
by: Jayawardana, Yasith, et al.
Published: (2024)
PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs
by: Ji, Shengwei, et al.
Published: (2024)
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
by: Zhao, Youpeng, et al.
Published: (2024)
GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design
by: You, Haoran, et al.
Published: (2021)
Laplacian-LoRA: Delaying Oversmoothing in Deep GCNs via Spectral Low-Rank Adaptation
by: Alisetti, Sai Vamsi
Published: (2026)
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
by: You, Haoran, et al.
Published: (2022)
GCN-TULHOR: Trajectory-User Linking Leveraging GCNs and Higher-Order Spatial Representations
by: Tran, Khoa, et al.
Published: (2025)
Graph Learning in 4D: a Quaternion-valued Laplacian to Enhance Spectral GCNs
by: Fiorini, Stefano, et al.
Published: (2023)
MC$^2$A: Enabling Algorithm-Hardware Co-Design for Efficient Markov Chain Monte Carlo Acceleration
by: Zhao, Shirui, et al.
Published: (2025)
Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning
by: Jayakody, Dilith, et al.
Published: (2023)
SentimentLens: Reconciling Sentiment and Ratings via Dual-Modality in the Hospitality Sector
by: Jayakody, Dineth, et al.
Published: (2026)
RsGCN: Subgraph-Based Rescaling Enhances Generalization of GCNs for Solving Traveling Salesman Problems
by: Huang, Junquan, et al.
Published: (2025)
Out-of-Core Dimensionality Reduction for Large Data via Out-of-Sample Extensions
by: Reichmann, Luca, et al.
Published: (2024)
Early-Bird GCNs: Graph-Network Co-Optimization Towards More Efficient GCN Training and Inference via Drawing Early-Bird Lottery Tickets
by: You, Haoran, et al.
Published: (2021)
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design
by: Liang, Yanbiao, et al.
Published: (2025)
Reducing Latency of LLM Search Agent via Speculation-based Algorithm-System Co-Design
by: Huang, Zixiao, et al.
Published: (2025)
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation
by: Tang, Huadong, et al.
Published: (2024)
CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems
by: Chen, Zhen, et al.
Published: (2024)
A Comparative Analysis of Machine Learning Models for DDoS Detection in IoT Networks
by: Shakya, Sushil, et al.
Published: (2024)
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design
by: Zhang, Hao, et al.
Published: (2025)
Scalable Generative Game Engine: Breaking the Resolution Wall via Hardware-Algorithm Co-Design
by: Zeng, Wei, et al.
Published: (2026)
MultiSense-Pneumo: A Multimodal Learning Framework for Pneumonia Screening in Resource-Constrained Settings
by: Jayakody, Dineth, et al.
Published: (2026)
COBRA: Algorithm-Architecture Co-optimized Binary Transformer Accelerator for Edge Inference
by: Qiao, Ye, et al.
Published: (2025)
Next Word Suggestion using Graph Neural Network
by: Magar, Abisha Thapa, et al.
Published: (2025)
RAGPulse: An Open-Source RAG Workload Trace to Optimize RAG Serving Systems
by: Wang, Zhengchao, et al.
Published: (2025)
Beyond Perfect Scores: Proof-by-Contradiction for Trustworthy Machine Learning
by: Wadduwage, Dushan N., et al.
Published: (2026)
FedCAP: Robust Federated Learning via Customized Aggregation and Personalization
by: Li, Youpeng, et al.
Published: (2024)
Towards Understanding the Role of Sharpness-Aware Minimization Algorithms for Out-of-Distribution Generalization
by: Schapiro, Samuel, et al.
Published: (2024)
ALISE: Accelerating Large Language Model Serving with Speculative Scheduling
by: Zhao, Youpeng, et al.
Published: (2024)
Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment
by: Ji, Yuhao, et al.
Published: (2024)
Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report
by: Wang, Maolin, et al.
Published: (2026)
Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation
by: Zhao, Chu, et al.
Published: (2025)
DiskGNN: Bridging I/O Efficiency and Model Accuracy for Out-of-Core GNN Training
by: Liu, Renjie, et al.
Published: (2024)
Accelerated Bayesian Optimal Experimental Design via Conditional Density Estimation and Informative Data
by: Huang, Miao, et al.
Published: (2025)
A Novel Zero-Touch, Zero-Trust, AI/ML Enablement Framework for IoT Network Security
by: Shakya, Sushil, et al.
Published: (2025)
Towards Privacy-Preserving Data-Driven Education: The Potential of Federated Learning
by: Khalil, Mohammad, et al.
Published: (2025)
Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction
by: Shakya, Anup, et al.
Published: (2024)
AuToMATo: An Out-Of-The-Box Persistence-Based Clustering Algorithm
by: Huber, Marius, et al.
Published: (2024)