Saved in:
| Main Authors: | Arora, Abhishek, Dell, Melissa |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.00789 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Linking Representations with Multimodal Contrastive Learning
by: Arora, Abhishek, et al.
Published: (2023)
by: Arora, Abhishek, et al.
Published: (2023)
Contrastive Entity Coreference and Disambiguation for Historical Texts
by: Arora, Abhishek, et al.
Published: (2024)
by: Arora, Abhishek, et al.
Published: (2024)
Trainable Transformer in Transformer
by: Panigrahi, Abhishek, et al.
Published: (2023)
by: Panigrahi, Abhishek, et al.
Published: (2023)
Newswire: A Large-Scale Structured Database of a Century of Historical News
by: Silcock, Emily, et al.
Published: (2024)
by: Silcock, Emily, et al.
Published: (2024)
News Deja Vu: Connecting Past and Present with Semantic Search
by: Franklin, Brevin, et al.
Published: (2024)
by: Franklin, Brevin, et al.
Published: (2024)
EnsembleLink: Accurate Record Linkage Without Training Data
by: Dasanaike, Noah
Published: (2026)
by: Dasanaike, Noah
Published: (2026)
Selective Neuron Amplification in Transformer Language Models
by: Akhtar, Ryyan, et al.
Published: (2026)
by: Akhtar, Ryyan, et al.
Published: (2026)
Deep Learning for Economists
by: Dell, Melissa
Published: (2024)
by: Dell, Melissa
Published: (2024)
AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models
by: He, Yinghui, et al.
Published: (2025)
by: He, Yinghui, et al.
Published: (2025)
Leveraging Large Language Models for Generating Labeled Mineral Site Record Linkage Data
by: Pyo, Jiyoon, et al.
Published: (2024)
by: Pyo, Jiyoon, et al.
Published: (2024)
Outliers Dimensions that Disrupt Transformers Are Driven by Frequency
by: Puccetti, Giovanni, et al.
Published: (2022)
by: Puccetti, Giovanni, et al.
Published: (2022)
General Transform: A Unified Framework for Adaptive Transform to Enhance Representations
by: Budiutama, Gekko, et al.
Published: (2025)
by: Budiutama, Gekko, et al.
Published: (2025)
Examining the Mental Health Impact of Misinformation on Social Media Using a Hybrid Transformer-Based Approach
by: Arora, Sarvesh, et al.
Published: (2025)
by: Arora, Sarvesh, et al.
Published: (2025)
Markovian Transformers for Informative Language Modeling
by: Viteri, Scott, et al.
Published: (2024)
by: Viteri, Scott, et al.
Published: (2024)
Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision
by: Yoshida, Ryo, et al.
Published: (2024)
by: Yoshida, Ryo, et al.
Published: (2024)
A Family of Pretrained Transformer Language Models for Russian
by: Zmitrovich, Dmitry, et al.
Published: (2023)
by: Zmitrovich, Dmitry, et al.
Published: (2023)
Representing Rule-based Chatbots with Transformers
by: Friedman, Dan, et al.
Published: (2024)
by: Friedman, Dan, et al.
Published: (2024)
Will Large Language Models Transform Clinical Prediction?
by: Yildiz, Yusuf, et al.
Published: (2025)
by: Yildiz, Yusuf, et al.
Published: (2025)
Linearity of Relation Decoding in Transformer Language Models
by: Hernandez, Evan, et al.
Published: (2023)
by: Hernandez, Evan, et al.
Published: (2023)
Semformer: Transformer Language Models with Semantic Planning
by: Yin, Yongjing, et al.
Published: (2024)
by: Yin, Yongjing, et al.
Published: (2024)
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)
by: Zhao, Yida, et al.
Published: (2024)
Training Language Models to Reason Efficiently
by: Arora, Daman, et al.
Published: (2025)
by: Arora, Daman, et al.
Published: (2025)
ASCENDgpt: A Phenotype-Aware Transformer Model for Cardiovascular Risk Prediction from Electronic Health Records
by: Sainsbury, Chris, et al.
Published: (2025)
by: Sainsbury, Chris, et al.
Published: (2025)
Transformer-based Single-Cell Language Model: A Survey
by: Lan, Wei, et al.
Published: (2024)
by: Lan, Wei, et al.
Published: (2024)
A Primer on the Inner Workings of Transformer-based Language Models
by: Ferrando, Javier, et al.
Published: (2024)
by: Ferrando, Javier, et al.
Published: (2024)
Anatomical Heterogeneity in Transformer Language Models
by: Wietrzykowski, Tomasz
Published: (2026)
by: Wietrzykowski, Tomasz
Published: (2026)
Word Meanings in Transformer Language Models
by: Grindrod, Jumbly, et al.
Published: (2025)
by: Grindrod, Jumbly, et al.
Published: (2025)
Fast-and-Frugal Text-Graph Transformers are Effective Link Predictors
by: Coman, Andrei C., et al.
Published: (2024)
by: Coman, Andrei C., et al.
Published: (2024)
The Impact of Depth on Compositional Generalization in Transformer Language Models
by: Petty, Jackson, et al.
Published: (2023)
by: Petty, Jackson, et al.
Published: (2023)
Probing the Category of Verbal Aspect in Transformer Language Models
by: Katinskaia, Anisia, et al.
Published: (2024)
by: Katinskaia, Anisia, et al.
Published: (2024)
Sneaking Syntax into Transformer Language Models with Tree Regularization
by: Nandi, Ananjan, et al.
Published: (2024)
by: Nandi, Ananjan, et al.
Published: (2024)
Characterizing the Expressivity of Fixed-Precision Transformer Language Models
by: Li, Jiaoda, et al.
Published: (2025)
by: Li, Jiaoda, et al.
Published: (2025)
Can Transformers Learn $n$-gram Language Models?
by: Svete, Anej, et al.
Published: (2024)
by: Svete, Anej, et al.
Published: (2024)
UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
by: Qi, Liu, et al.
Published: (2024)
by: Qi, Liu, et al.
Published: (2024)
Jamba: A Hybrid Transformer-Mamba Language Model
by: Lieber, Opher, et al.
Published: (2024)
by: Lieber, Opher, et al.
Published: (2024)
Transformers Can Represent $n$-gram Language Models
by: Svete, Anej, et al.
Published: (2024)
by: Svete, Anej, et al.
Published: (2024)
Linking In-context Learning in Transformers to Human Episodic Memory
by: Ji-An, Li, et al.
Published: (2024)
by: Ji-An, Li, et al.
Published: (2024)
Sparser, Faster, Lighter Transformer Language Models
by: Cetin, Edoardo, et al.
Published: (2026)
by: Cetin, Edoardo, et al.
Published: (2026)
Intra-Layer Recurrence in Transformers for Language Modeling
by: Nguyen, Anthony, et al.
Published: (2025)
by: Nguyen, Anthony, et al.
Published: (2025)
ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model
by: Nguyen, Luan Thanh
Published: (2024)
by: Nguyen, Luan Thanh
Published: (2024)
Similar Items
-
Linking Representations with Multimodal Contrastive Learning
by: Arora, Abhishek, et al.
Published: (2023) -
Contrastive Entity Coreference and Disambiguation for Historical Texts
by: Arora, Abhishek, et al.
Published: (2024) -
Trainable Transformer in Transformer
by: Panigrahi, Abhishek, et al.
Published: (2023) -
Newswire: A Large-Scale Structured Database of a Century of Historical News
by: Silcock, Emily, et al.
Published: (2024) -
News Deja Vu: Connecting Past and Present with Semantic Search
by: Franklin, Brevin, et al.
Published: (2024)