:: Library Catalog

Bibliographic Details
Main Authors:	Kowsher, Md; Khan, Abdul Rafae; Xu, Jia
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Computation and Language
Online Access:	https://arxiv.org/abs/2402.09573

Similar Items

LLM-Mixer: Multiscale Mixing in LLMs for Time Series Forecasting
by: Kowsher, Md, et al.
Published: (2024)

ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
by: Xu, Bingxin, et al.
Published: (2025)

L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs
by: Kowsher, Md., et al.
Published: (2023)

Monkey Jump: MoE-Style PEFT for Efficient Multi-Task Learning
by: Prottasha, Nusrat Jahan, et al.
Published: (2026)

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning
by: Kowsher, Md, et al.
Published: (2026)

Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM
by: Kowsher, Md., et al.
Published: (2024)

TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting
by: Cao, Defu, et al.
Published: (2023)

Farsight: Fostering Responsible AI Awareness During AI Application Prototyping
by: Wang, Zijie J., et al.
Published: (2024)

Metadata Matters for Time Series: Informative Forecasting with Transformers
by: Dong, Jiaxiang, et al.
Published: (2024)

Learning Novel Transformer Architecture for Time-series Forecasting
by: Zhang, Juyuan, et al.
Published: (2025)

TyphoFormer: Language-Augmented Transformer for Accurate Typhoon Track Forecasting
by: Li, Lincan, et al.
Published: (2025)

Sparse Transformer with Local and Seasonal Adaptation for Multivariate Time Series Forecasting
by: Zhang, Yifan, et al.
Published: (2023)

Strategic Fusion Optimizes Transformer Compression
by: Rahman, Md Shoaibur
Published: (2025)

DetoxLLM: A Framework for Detoxification with Explanations
by: Khondaker, Md Tawkat Islam, et al.
Published: (2024)

Learning Extrapolative Sequence Transformations from Markov Chains
by: Hager, Sophia, et al.
Published: (2025)

BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
by: Liu, Weiyang, et al.
Published: (2023)

Unsupervised Text Segmentation via Kernel Change-Point Detection on Sentence Embeddings
by: Jia, Mumin, et al.
Published: (2026)

Consistent Kernel Change-Point Detection under m-Dependence for Text Segmentation
by: Diaz-Rodriguez, Jairo, et al.
Published: (2025)

GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models
by: Liao, Ruotong, et al.
Published: (2023)

Transformer-Driven Triple Fusion Framework for Enhanced Multimodal Author Intent Classification in Low-Resource Bangla
by: Islam, Ariful, et al.
Published: (2025)

Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
by: Gado, Elena Grazia, et al.
Published: (2024)

Advancing Exchange Rate Forecasting: Leveraging Machine Learning and AI for Enhanced Accuracy in Global Financial Markets
by: Rahat, Md. Yeasin, et al.
Published: (2025)

Automatic Differential Diagnosis using Transformer-Based Multi-Label Sequence Classification
by: Sadi, Abu Adnan, et al.
Published: (2024)

Short Data, Long Context: Distilling Positional Knowledge in Transformers
by: Huber, Patrick, et al.
Published: (2026)

Propulsion: Steering LLM with Tiny Fine-Tuning
by: Kowsher, Md, et al.
Published: (2024)

Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings
by: Xu, Kan, et al.
Published: (2021)

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
by: Lou, Chao, et al.
Published: (2024)

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
by: Arefin, Md Rifat, et al.
Published: (2024)

Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
by: Choi, Euntae, et al.
Published: (2025)

Mixture of Group Experts for Learning Invariant Representations
by: Kang, Lei, et al.
Published: (2025)

Provable Knowledge Acquisition and Extraction in One-Layer Transformers
by: Xu, Ruichen, et al.
Published: (2025)

Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
by: Sakib, Md Nazmus, et al.
Published: (2024)

Dynamics of Spontaneous Topic Changes in Next Token Prediction with Self-Attention
by: Jia, Mumin, et al.
Published: (2025)

Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning
by: Deng, Jingcheng, et al.
Published: (2026)

AWARE, Beyond Sentence Boundaries: A Contextual Transformer Framework for Identifying Cultural Capital in STEM Narratives
by: Khan, Khalid Mehtab, et al.
Published: (2025)

Do VLMs Truly "Read" Candlesticks? A Multi-Scale Benchmark for Visual Stock Price Forecasting
by: Hu, Kaiqi, et al.
Published: (2026)

When Does Multimodality Lead to Better Time Series Forecasting?
by: Zhang, Xiyuan, et al.
Published: (2025)

Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective
by: Xu, Zhijian, et al.
Published: (2024)

From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers
by: Duan, Shaoxiong, et al.
Published: (2023)