Saved in:
| Main Authors: | Jain, Ayush, Meenachi, N. M., Venkatraman, B. |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2003.13821 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025)
by: Reddy, K. Sahit, et al.
Published: (2025)
Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis
by: Zhai, Wei, et al.
Published: (2024)
by: Zhai, Wei, et al.
Published: (2024)
Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
by: Richter-Pechanski, Phillip, et al.
Published: (2024)
by: Richter-Pechanski, Phillip, et al.
Published: (2024)
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
by: Zhang, Ying, et al.
Published: (2024)
by: Zhang, Ying, et al.
Published: (2024)
BERT-like Pre-training for Symbolic Piano Music Classification Tasks
by: Chou, Yi-Hui, et al.
Published: (2021)
by: Chou, Yi-Hui, et al.
Published: (2021)
LakotaBERT: A Transformer-based Model for Low Resource Lakota Language
by: Parankusham, Kanishka, et al.
Published: (2025)
by: Parankusham, Kanishka, et al.
Published: (2025)
On Importance of Layer Pruning for Smaller BERT Models and Low Resource Languages
by: Shirke, Mayur, et al.
Published: (2025)
by: Shirke, Mayur, et al.
Published: (2025)
Parameter Estimation in Quantum Metrology Technique for Time Series Prediction
by: Sharma, Vaidik A, et al.
Published: (2024)
by: Sharma, Vaidik A, et al.
Published: (2024)
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
by: Joshi, Raviraj, et al.
Published: (2024)
by: Joshi, Raviraj, et al.
Published: (2024)
Pre-trained knowledge elevates large language models beyond traditional chemical reaction optimizers
by: MacKnight, Robert, et al.
Published: (2025)
by: MacKnight, Robert, et al.
Published: (2025)
polyGen: A Learning Framework for Atomic-level Polymer Structure Generation
by: Jain, Ayush, et al.
Published: (2025)
by: Jain, Ayush, et al.
Published: (2025)
Diffusion Domain Expansion: Learning to Coordinate Pre-trained Diffusion Models
by: Lifar, Egor, et al.
Published: (2026)
by: Lifar, Egor, et al.
Published: (2026)
Explainable AI in Big Data Fraud Detection
by: Jain, Ayush, et al.
Published: (2025)
by: Jain, Ayush, et al.
Published: (2025)
Software Vulnerability Prediction in Low-Resource Languages: An Empirical Study of CodeBERT and ChatGPT
by: Le, Triet H. M., et al.
Published: (2024)
by: Le, Triet H. M., et al.
Published: (2024)
Cross-Domain Pre-training with Language Models for Transferable Time Series Representations
by: Cheng, Mingyue, et al.
Published: (2024)
by: Cheng, Mingyue, et al.
Published: (2024)
Amortizing intractable inference in diffusion models for vision, language, and control
by: Venkatraman, Siddarth, et al.
Published: (2024)
by: Venkatraman, Siddarth, et al.
Published: (2024)
LOST: Low-rank and Sparse Pre-training for Large Language Models
by: Li, Jiaxi, et al.
Published: (2025)
by: Li, Jiaxi, et al.
Published: (2025)
Improving training time and GPU utilization in geo-distributed language model training
by: Palak, et al.
Published: (2024)
by: Palak, et al.
Published: (2024)
Unified Multi-Domain Graph Pre-training for Homogeneous and Heterogeneous Graphs via Domain-Specific Expert Encoding
by: Liang, Chundong, et al.
Published: (2026)
by: Liang, Chundong, et al.
Published: (2026)
Train on Validation (ToV): Fast data selection with applications to fine-tuning
by: Jain, Ayush, et al.
Published: (2025)
by: Jain, Ayush, et al.
Published: (2025)
Scaling laws for learning with real and surrogate data
by: Jain, Ayush, et al.
Published: (2024)
by: Jain, Ayush, et al.
Published: (2024)
Efficient Continual Pre-training of LLMs for Low-resource Languages
by: Nag, Arijit, et al.
Published: (2024)
by: Nag, Arijit, et al.
Published: (2024)
LatticeGraphNet: A two-scale graph neural operator for simulating lattice structures
by: Jain, Ayush, et al.
Published: (2024)
by: Jain, Ayush, et al.
Published: (2024)
Large Pre-trained time series models for cross-domain Time series analysis tasks
by: Kamarthi, Harshavardhan, et al.
Published: (2023)
by: Kamarthi, Harshavardhan, et al.
Published: (2023)
Lightweight reranking for language model generations
by: Jain, Siddhartha, et al.
Published: (2023)
by: Jain, Siddhartha, et al.
Published: (2023)
An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates
by: Soutif--Cormerais, Albin, et al.
Published: (2024)
by: Soutif--Cormerais, Albin, et al.
Published: (2024)
C-LoRA: Continual Low-Rank Adaptation for Pre-trained Models
by: Zhang, Xin, et al.
Published: (2025)
by: Zhang, Xin, et al.
Published: (2025)
Improving Low-Resource Knowledge Tracing Tasks by Supervised Pre-training and Importance Mechanism Fine-tuning
by: Zhang, Hengyuan, et al.
Published: (2024)
by: Zhang, Hengyuan, et al.
Published: (2024)
PreLoRA: Hybrid Pre-training of Vision Transformers with Full Training and Low-Rank Adapters
by: Thapa, Krishu K, et al.
Published: (2025)
by: Thapa, Krishu K, et al.
Published: (2025)
Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment
by: Anastasiou, Dimitrios, et al.
Published: (2025)
by: Anastasiou, Dimitrios, et al.
Published: (2025)
CEEBERT: Cross-Domain Inference in Early Exit BERT
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
SmilesT5: Domain-specific pretraining for molecular language models
by: Spence, Philip, et al.
Published: (2025)
by: Spence, Philip, et al.
Published: (2025)
Data filtering methods for training language models
by: Shevchenko, Egor, et al.
Published: (2026)
by: Shevchenko, Egor, et al.
Published: (2026)
Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023)
by: Hu, Edward J., et al.
Published: (2023)
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
by: Yang, Kailai, et al.
Published: (2025)
by: Yang, Kailai, et al.
Published: (2025)
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
by: Nielsen, Jacob, et al.
Published: (2025)
by: Nielsen, Jacob, et al.
Published: (2025)
Pre-training under infinite compute
by: Kim, Konwoo, et al.
Published: (2025)
by: Kim, Konwoo, et al.
Published: (2025)
Utilizing Strategic Pre-training to Reduce Overfitting: Baguan -- A Pre-trained Weather Forecasting Model
by: Niu, Peisong, et al.
Published: (2025)
by: Niu, Peisong, et al.
Published: (2025)
SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence
by: Aghaei, Ehsan, et al.
Published: (2025)
by: Aghaei, Ehsan, et al.
Published: (2025)
GP2F: Cross-Domain Graph Prompting with Adaptive Fusion of Pre-trained Graph Neural Networks
by: He, Dongxiao, et al.
Published: (2026)
by: He, Dongxiao, et al.
Published: (2026)
Similar Items
-
MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025) -
Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis
by: Zhai, Wei, et al.
Published: (2024) -
Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
by: Richter-Pechanski, Phillip, et al.
Published: (2024) -
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
by: Zhang, Ying, et al.
Published: (2024) -
BERT-like Pre-training for Symbolic Piano Music Classification Tasks
by: Chou, Yi-Hui, et al.
Published: (2021)