Saved in:
| Main Authors: | Kowsher, Md, Khan, Abdul Rafae, Xu, Jia |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.09573 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM-Mixer: Multiscale Mixing in LLMs for Time Series Forecasting
by: Kowsher, Md, et al.
Published: (2024)
by: Kowsher, Md, et al.
Published: (2024)
ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
by: Xu, Bingxin, et al.
Published: (2025)
by: Xu, Bingxin, et al.
Published: (2025)
L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs
by: Kowsher, Md., et al.
Published: (2023)
by: Kowsher, Md., et al.
Published: (2023)
Monkey Jump : MoE-Style PEFT for Efficient Multi-Task Learning
by: Prottasha, Nusrat Jahan, et al.
Published: (2026)
by: Prottasha, Nusrat Jahan, et al.
Published: (2026)
LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning
by: Kowsher, Md, et al.
Published: (2026)
by: Kowsher, Md, et al.
Published: (2026)
Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM
by: Kowsher, Md., et al.
Published: (2024)
by: Kowsher, Md., et al.
Published: (2024)
TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting
by: Cao, Defu, et al.
Published: (2023)
by: Cao, Defu, et al.
Published: (2023)
Farsight: Fostering Responsible AI Awareness During AI Application Prototyping
by: Wang, Zijie J., et al.
Published: (2024)
by: Wang, Zijie J., et al.
Published: (2024)
Metadata Matters for Time Series: Informative Forecasting with Transformers
by: Dong, Jiaxiang, et al.
Published: (2024)
by: Dong, Jiaxiang, et al.
Published: (2024)
Learning Novel Transformer Architecture for Time-series Forecasting
by: Zhang, Juyuan, et al.
Published: (2025)
by: Zhang, Juyuan, et al.
Published: (2025)
TyphoFormer: Language-Augmented Transformer for Accurate Typhoon Track Forecasting
by: Li, Lincan, et al.
Published: (2025)
by: Li, Lincan, et al.
Published: (2025)
Sparse Transformer with Local and Seasonal Adaptation for Multivariate Time Series Forecasting
by: Zhang, Yifan, et al.
Published: (2023)
by: Zhang, Yifan, et al.
Published: (2023)
Strategic Fusion Optimizes Transformer Compression
by: Rahman, Md Shoaibur
Published: (2025)
by: Rahman, Md Shoaibur
Published: (2025)
DetoxLLM: A Framework for Detoxification with Explanations
by: Khondaker, Md Tawkat Islam, et al.
Published: (2024)
by: Khondaker, Md Tawkat Islam, et al.
Published: (2024)
Learning Extrapolative Sequence Transformations from Markov Chains
by: Hager, Sophia, et al.
Published: (2025)
by: Hager, Sophia, et al.
Published: (2025)
BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
by: Liu, Weiyang, et al.
Published: (2023)
by: Liu, Weiyang, et al.
Published: (2023)
Unsupervised Text Segmentation via Kernel Change-Point Detection on Sentence Embeddings
by: Jia, Mumin, et al.
Published: (2026)
by: Jia, Mumin, et al.
Published: (2026)
Consistent Kernel Change-Point Detection under m-Dependence for Text Segmentation
by: Diaz-Rodriguez, Jairo, et al.
Published: (2025)
by: Diaz-Rodriguez, Jairo, et al.
Published: (2025)
GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models
by: Liao, Ruotong, et al.
Published: (2023)
by: Liao, Ruotong, et al.
Published: (2023)
Transformer-Driven Triple Fusion Framework for Enhanced Multimodal Author Intent Classification in Low-Resource Bangla
by: Islam, Ariful, et al.
Published: (2025)
by: Islam, Ariful, et al.
Published: (2025)
Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
by: Gado, Elena Grazia, et al.
Published: (2024)
by: Gado, Elena Grazia, et al.
Published: (2024)
Advancing Exchange Rate Forecasting: Leveraging Machine Learning and AI for Enhanced Accuracy in Global Financial Markets
by: Rahat, Md. Yeasin, et al.
Published: (2025)
by: Rahat, Md. Yeasin, et al.
Published: (2025)
Automatic Differential Diagnosis using Transformer-Based Multi-Label Sequence Classification
by: Sadi, Abu Adnan, et al.
Published: (2024)
by: Sadi, Abu Adnan, et al.
Published: (2024)
Short Data, Long Context: Distilling Positional Knowledge in Transformers
by: Huber, Patrick, et al.
Published: (2026)
by: Huber, Patrick, et al.
Published: (2026)
Propulsion: Steering LLM with Tiny Fine-Tuning
by: Kowsher, Md, et al.
Published: (2024)
by: Kowsher, Md, et al.
Published: (2024)
Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings
by: Xu, Kan, et al.
Published: (2021)
by: Xu, Kan, et al.
Published: (2021)
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
by: Lou, Chao, et al.
Published: (2024)
by: Lou, Chao, et al.
Published: (2024)
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
by: Arefin, Md Rifat, et al.
Published: (2024)
by: Arefin, Md Rifat, et al.
Published: (2024)
Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
by: Choi, Euntae, et al.
Published: (2025)
by: Choi, Euntae, et al.
Published: (2025)
Mixture of Group Experts for Learning Invariant Representations
by: Kang, Lei, et al.
Published: (2025)
by: Kang, Lei, et al.
Published: (2025)
Provable Knowledge Acquisition and Extraction in One-Layer Transformers
by: Xu, Ruichen, et al.
Published: (2025)
by: Xu, Ruichen, et al.
Published: (2025)
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
by: Sakib, Md Nazmus, et al.
Published: (2024)
by: Sakib, Md Nazmus, et al.
Published: (2024)
Dynamics of Spontaneous Topic Changes in Next Token Prediction with Self-Attention
by: Jia, Mumin, et al.
Published: (2025)
by: Jia, Mumin, et al.
Published: (2025)
Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning
by: Deng, Jingcheng, et al.
Published: (2026)
by: Deng, Jingcheng, et al.
Published: (2026)
AWARE, Beyond Sentence Boundaries: A Contextual Transformer Framework for Identifying Cultural Capital in STEM Narratives
by: Khan, Khalid Mehtab, et al.
Published: (2025)
by: Khan, Khalid Mehtab, et al.
Published: (2025)
Do VLMs Truly "Read" Candlesticks? A Multi-Scale Benchmark for Visual Stock Price Forecasting
by: Hu, Kaiqi, et al.
Published: (2026)
by: Hu, Kaiqi, et al.
Published: (2026)
When Does Multimodality Lead to Better Time Series Forecasting?
by: Zhang, Xiyuan, et al.
Published: (2025)
by: Zhang, Xiyuan, et al.
Published: (2025)
Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective
by: Xu, Zhijian, et al.
Published: (2024)
by: Xu, Zhijian, et al.
Published: (2024)
From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers
by: Duan, Shaoxiong, et al.
Published: (2023)
by: Duan, Shaoxiong, et al.
Published: (2023)
Similar Items
-
LLM-Mixer: Multiscale Mixing in LLMs for Time Series Forecasting
by: Kowsher, Md, et al.
Published: (2024) -
ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
by: Xu, Bingxin, et al.
Published: (2025) -
L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs
by: Kowsher, Md., et al.
Published: (2023) -
Monkey Jump : MoE-Style PEFT for Efficient Multi-Task Learning
by: Prottasha, Nusrat Jahan, et al.
Published: (2026) -
LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning
by: Kowsher, Md, et al.
Published: (2026)