Saved in:
| Main Authors: | Wang, Chao, Cai, Yuqing, Duojie, Renzeng, Zhang, Jin, Liu, Yutong, Tashi, Nyima |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.09085 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Framework for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
TiSpell: A Semi-Masked Methodology for Tibetan Spelling Correction covering Multi-Level Error with Data Augmentation
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
Tibetan-TTS:Low-Resource Tibetan Speech Synthesis with Large Model Adaptation
by: He, Jiaxu, et al.
Published: (2026)
by: He, Jiaxu, et al.
Published: (2026)
TLUE: A Tibetan Language Understanding Evaluation Benchmark
by: Gao, Fan, et al.
Published: (2025)
by: Gao, Fan, et al.
Published: (2025)
TFD: A Comprehensive Structured Tibetan Foundation Dataset for Low-Resource Language Processing and Large-Scale Modeling
by: Huang, Cheng, et al.
Published: (2025)
by: Huang, Cheng, et al.
Published: (2025)
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
by: Huang, Cheng, et al.
Published: (2025)
by: Huang, Cheng, et al.
Published: (2025)
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
TIBSTC-CoT: A Multi-Domain Instruction Dataset for Chain-of-Thought Reasoning in Language Models
by: Gao, Fan, et al.
Published: (2025)
by: Gao, Fan, et al.
Published: (2025)
RetrieveAll: A Multilingual Named Entity Recognition Framework with Large Language Models
by: Zhang, Jin, et al.
Published: (2025)
by: Zhang, Jin, et al.
Published: (2025)
POTSA: A Cross-Lingual Speech Alignment Framework for Speech-to-Text Translation
by: Li, Xuanchen, et al.
Published: (2025)
by: Li, Xuanchen, et al.
Published: (2025)
When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs
by: Li, Linyu, et al.
Published: (2026)
by: Li, Linyu, et al.
Published: (2026)
Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
by: Rong, Yiming, et al.
Published: (2025)
by: Rong, Yiming, et al.
Published: (2025)
LaSR: Context-Aware Speech Recognition via Latent Reasoning
by: Liu, Heyang, et al.
Published: (2026)
by: Liu, Heyang, et al.
Published: (2026)
SASST: Leveraging Syntax-Aware Chunking and LLMs for Simultaneous Speech Translation
by: Yang, Zeyu, et al.
Published: (2025)
by: Yang, Zeyu, et al.
Published: (2025)
SSCFormer: Push the Limit of Chunk-wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution
by: Wang, Fangyuan, et al.
Published: (2022)
by: Wang, Fangyuan, et al.
Published: (2022)
Robust and Efficient Autoregressive Speech Synthesis with Dynamic Chunk-wise Prediction Policy
by: Li, Bohan, et al.
Published: (2025)
by: Li, Bohan, et al.
Published: (2025)
Streaming Speech-to-Confusion Network Speech Recognition
by: Filimonov, Denis, et al.
Published: (2023)
by: Filimonov, Denis, et al.
Published: (2023)
Dynamic Chunking and Selection for Reading Comprehension of Ultra-Long Context in Large Language Models
by: Sheng, Boheng, et al.
Published: (2025)
by: Sheng, Boheng, et al.
Published: (2025)
LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing
by: Li, Dongfang, et al.
Published: (2026)
by: Li, Dongfang, et al.
Published: (2026)
ChunkFT: Byte-Streamed Optimization for Memory-Efficient Full Fine-Tuning
by: Liu, Yongkang, et al.
Published: (2026)
by: Liu, Yongkang, et al.
Published: (2026)
Dynamic Chunking for Diffusion Language Models
by: Zhu, Yichen, et al.
Published: (2026)
by: Zhu, Yichen, et al.
Published: (2026)
U-Fold: Dynamic Intent-Aware Context Folding for User-Centric Agents
by: Su, Jin, et al.
Published: (2026)
by: Su, Jin, et al.
Published: (2026)
TopoChunker: Topology-Aware Agentic Document Chunking Framework
by: Liu, Xiaoyu
Published: (2026)
by: Liu, Xiaoyu
Published: (2026)
Efficient Streaming LLM for Speech Recognition
by: Jia, Junteng, et al.
Published: (2024)
by: Jia, Junteng, et al.
Published: (2024)
Efficient RAG with Intent-Aware Retrieval and Semantics-Preserving Chunking
by: Puspitasari, Fachrina Dewi, et al.
Published: (2026)
by: Puspitasari, Fachrina Dewi, et al.
Published: (2026)
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference
by: Tao, Wei, et al.
Published: (2025)
by: Tao, Wei, et al.
Published: (2025)
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
by: Günther, Michael, et al.
Published: (2024)
by: Günther, Michael, et al.
Published: (2024)
ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark
by: Wang, He, et al.
Published: (2025)
by: Wang, He, et al.
Published: (2025)
Grounding Language Model with Chunking-Free In-Context Retrieval
by: Qian, Hongjin, et al.
Published: (2024)
by: Qian, Hongjin, et al.
Published: (2024)
SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)
by: Zhang, Xuechen, et al.
Published: (2025)
Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition
by: Xia, Yinfeng, et al.
Published: (2026)
by: Xia, Yinfeng, et al.
Published: (2026)
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
by: Liu, Xiang, et al.
Published: (2025)
by: Liu, Xiang, et al.
Published: (2025)
Beyond Chunk-Local Extraction: Cross-Chunk Graph Augmentation for GraphRAG
by: Zhang, Jiaming, et al.
Published: (2026)
by: Zhang, Jiaming, et al.
Published: (2026)
Decoder-only Architecture for Streaming End-to-end Speech Recognition
by: Tsunoo, Emiru, et al.
Published: (2024)
by: Tsunoo, Emiru, et al.
Published: (2024)
Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern
by: Tang, Hongyin, et al.
Published: (2024)
by: Tang, Hongyin, et al.
Published: (2024)
Similar Items
-
TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Framework for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025) -
FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025) -
TiSpell: A Semi-Masked Methodology for Tibetan Spelling Correction covering Multi-Level Error with Data Augmentation
by: Liu, Yutong, et al.
Published: (2025) -
Tibetan-TTS:Low-Resource Tibetan Speech Synthesis with Large Model Adaptation
by: He, Jiaxu, et al.
Published: (2026) -
TLUE: A Tibetan Language Understanding Evaluation Benchmark
by: Gao, Fan, et al.
Published: (2025)