| Main Authors: | Hou, Haowen, Huang, Zhiyi, Tan, Kaifeng, Lu, Rongchang, Yu, Fei Richard |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.21463 |
Similar Items
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models
by: Hou, Haowen, et al.
Published: (2024)
VisualRWKV-HD and UHD: Advancing High-Resolution Processing for Visual Language Models
by: Li, Zihang, et al.
Published: (2024)
EmbeddingRWKV: State-Centric Retrieval with Reusable States
by: Hou, Haowen, et al.
Published: (2026)
RWKV-7 "Goose" with Expressive Dynamic State Evolution
by: Peng, Bo, et al.
Published: (2025)
Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression
by: Hou, Haowen, et al.
Published: (2024)
ModRWKV: Transformer Multimodality in Linear Time
by: Kang, Jiale, et al.
Published: (2025)
RWKV-UI: UI Understanding with Enhanced Perception and Reasoning
by: Yang, Jiaxi, et al.
Published: (2025)
The Evolution of RWKV: Advancements in Efficient Language Modeling
by: Datta, Akul
Published: (2024)
A Survey of RWKV
by: Li, Zhiyuan, et al.
Published: (2024)
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression
by: Goldstein, Daniel, et al.
Published: (2024)
Triplet-Block Diffusion RWKV
by: Lin, Ke, et al.
Published: (2026)
Enhancing RWKV-based Language Models for Long-Sequence Text Generation
by: Pan, Xinghan
Published: (2025)
RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks
by: Hou, Haowen, et al.
Published: (2024)
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
by: Peng, Bo, et al.
Published: (2024)
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models
by: Fei, Zhengcong, et al.
Published: (2024)
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
by: Zhang, Junxuan, et al.
Published: (2024)
A Hybrid Framework for Natural Language Querying of IFC Models with Relational and Graph Representations
by: Lamsal, Rabindra, et al.
Published: (2026)
EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs
by: Cai, Zhengge, et al.
Published: (2025)
SproutBench: A Benchmark for Safe and Ethical Large Language Models for Youth
by: Xing, Wenpeng, et al.
Published: (2025)
Delta-WKV: A Novel Meta-in-Context Learner for MRI Super-Resolution
by: Lu, Rongchang, et al.
Published: (2025)
EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models
by: Zou, Tao, et al.
Published: (2025)
A Simple Linear Patch Revives Layer-Pruned Large Language Models
by: Chen, Xinrui, et al.
Published: (2025)
RRWKV: Capturing Long-range Dependencies in RWKV
by: Wang, Leilei
Published: (2023)
Scaling Laws for Linear Complexity Language Models
by: Shen, Xuyang, et al.
Published: (2024)
OpenGuardrails: A Configurable, Unified, and Scalable Guardrails Platform for Large Language Models
by: Wang, Thomas, et al.
Published: (2025)
Cross-attention for State-based model RWKV-7
by: Xiao, Liu, et al.
Published: (2025)
Experimentation in Content Moderation using RWKV
by: Yildirim, Umut, et al.
Published: (2024)
Probing Large Language Models in Reasoning and Translating Complex Linguistic Puzzles
by: Lin, Zheng-Lin, et al.
Published: (2025)
Lost in Diffusion: Uncovering Hallucination Patterns and Failure Modes in Diffusion Large Language Models
by: Guo, Zhengnan, et al.
Published: (2026)
RWKVTTS: Yet another TTS based on RWKV-7
by: yueyu, Lin, et al.
Published: (2025)
Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective
by: Qin, Zhen, et al.
Published: (2024)
State Tuning: State-based Test-Time Scaling on RWKV-7
by: Xiao, Liu, et al.
Published: (2025)
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
by: Yu, Le, et al.
Published: (2024)
Exploring Forgetting in Large Language Model Pre-Training
by: Liao, Chonghua, et al.
Published: (2024)
LLMs can be Dangerous Reasoners: Analyzing-based Jailbreak Attack on Large Language Models
by: Lin, Shi, et al.
Published: (2024)
A Systematic Analysis of Hybrid Linear Attention
by: Wang, Dustin, et al.
Published: (2025)
Self-Powered LLM Modality Expansion for Large Speech-Text Models
by: Yu, Tengfei, et al.
Published: (2024)
A Survey on Unlearning in Large Language Models
by: Qiu, Ruichen, et al.
Published: (2025)
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
by: Yu, Le, et al.
Published: (2023)
AceGPT, Localizing Large Language Models in Arabic
by: Huang, Huang, et al.
Published: (2023)