Saved in:
| Main Authors: | Lupsa, Dana, Avram, Sanda-Maria, Lupsa, Radu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.15650 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Language Detection by Means of the Minkowski Norm: Identification Through Character Bigrams and Frequency Analysis
by: Pogăcean, Paul-Andrei, et al.
Published: (2025)
by: Pogăcean, Paul-Andrei, et al.
Published: (2025)
RELATE: A Modern Processing Platform for Romanian Language
by: Păiş, Vasile, et al.
Published: (2024)
by: Păiş, Vasile, et al.
Published: (2024)
Age determination of sediment core CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018)
by: Farcas, Sorina, et al.
Published: (2018)
Pollen profile CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018)
by: Farcas, Sorina, et al.
Published: (2018)
Lithology of sediment core CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018)
by: Farcas, Sorina, et al.
Published: (2018)
RoQLlama: A Lightweight Romanian Adapted Language Model
by: Dima, George-Andrei, et al.
Published: (2024)
by: Dima, George-Andrei, et al.
Published: (2024)
RoIt-XMASA: Multi-Domain Multilingual Sentiment Analysis Dataset for Romanian and Italian
by: Avram, Andrei-Marius, et al.
Published: (2026)
by: Avram, Andrei-Marius, et al.
Published: (2026)
Modelling Intertextuality with N-gram Embeddings
by: Xing, Yi
Published: (2025)
by: Xing, Yi
Published: (2025)
MoRoVoc: A Large Dataset for Geographical Variation Identification of the Spoken Romanian Language
by: Avram, Andrei-Marius, et al.
Published: (2025)
by: Avram, Andrei-Marius, et al.
Published: (2025)
PsihoRo: Depression and Anxiety Romanian Text Corpus
by: Ciobotaru, Alexandra, et al.
Published: (2026)
by: Ciobotaru, Alexandra, et al.
Published: (2026)
F5-TTS-RO: Extending F5-TTS to Romanian TTS via Lightweight Input Adaptation
by: Chivereanu, Radu-Gabriel, et al.
Published: (2025)
by: Chivereanu, Radu-Gabriel, et al.
Published: (2025)
Lngram: N-gram Conditional Memory in Latent Space
by: Zheng, Yunao, et al.
Published: (2026)
by: Zheng, Yunao, et al.
Published: (2026)
PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts
by: Rogoz, Ana-Cristina, et al.
Published: (2024)
by: Rogoz, Ana-Cristina, et al.
Published: (2024)
HistNERo: Historical Named Entity Recognition for the Romanian Language
by: Avram, Andrei-Marius, et al.
Published: (2024)
by: Avram, Andrei-Marius, et al.
Published: (2024)
Every Character Counts: From Vulnerability to Defense in Phishing Detection
by: Chiper, Maria, et al.
Published: (2025)
by: Chiper, Maria, et al.
Published: (2025)
Understanding Transformers via N-gram Statistics
by: Nguyen, Timothy
Published: (2024)
by: Nguyen, Timothy
Published: (2024)
RoDia: A New Dataset for Romanian Dialect Identification from Speech
by: Rotaru, Codrut, et al.
Published: (2023)
by: Rotaru, Codrut, et al.
Published: (2023)
Identifying Influential N-grams in Confidence Calibration via Regression Analysis
by: Ozaki, Shintaro, et al.
Published: (2026)
by: Ozaki, Shintaro, et al.
Published: (2026)
N-gram-like Language Models Predict Reading Time Best
by: Michaelov, James A., et al.
Published: (2026)
by: Michaelov, James A., et al.
Published: (2026)
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index
by: Xu, Hao, et al.
Published: (2025)
by: Xu, Hao, et al.
Published: (2025)
RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword Generation
by: Avram, Andrei-Marius, et al.
Published: (2024)
by: Avram, Andrei-Marius, et al.
Published: (2024)
Faster Transformer Decoding: N-gram Masked Self-Attention
by: Chelba, Ciprian, et al.
Published: (2020)
by: Chelba, Ciprian, et al.
Published: (2020)
N-gram Prediction and Word Difference Representations for Language Modeling
by: Heo, DongNyeong, et al.
Published: (2024)
by: Heo, DongNyeong, et al.
Published: (2024)
From N-grams to Pre-trained Multilingual Models For Language Identification
by: Sindane, Thapelo, et al.
Published: (2024)
by: Sindane, Thapelo, et al.
Published: (2024)
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
by: Masala, Mihai, et al.
Published: (2024)
by: Masala, Mihai, et al.
Published: (2024)
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus
by: Poesina, Eduard, et al.
Published: (2024)
by: Poesina, Eduard, et al.
Published: (2024)
Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling
by: Chen, Yilong, et al.
Published: (2026)
by: Chen, Yilong, et al.
Published: (2026)
Exploring Large Language Models for Translating Romanian Computational Problems into English
by: Dumitran, Adrian Marius, et al.
Published: (2025)
by: Dumitran, Adrian Marius, et al.
Published: (2025)
Evaluating Large Language Models for Diacritic Restoration in Romanian Texts: A Comparative Study
by: Nadas, Mihai, et al.
Published: (2025)
by: Nadas, Mihai, et al.
Published: (2025)
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
by: Liu, Jiacheng, et al.
Published: (2024)
by: Liu, Jiacheng, et al.
Published: (2024)
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
by: Ou, Jie, et al.
Published: (2024)
by: Ou, Jie, et al.
Published: (2024)
The Role of $n$-gram Smoothing in the Age of Neural Networks
by: Malagutti, Luca, et al.
Published: (2024)
by: Malagutti, Luca, et al.
Published: (2024)
Can Transformers Learn $n$-gram Language Models?
by: Svete, Anej, et al.
Published: (2024)
by: Svete, Anej, et al.
Published: (2024)
Transformers Can Represent $n$-gram Language Models
by: Svete, Anej, et al.
Published: (2024)
by: Svete, Anej, et al.
Published: (2024)
LLMic: Romanian Foundation Language Model
by: Bădoiu, Vlad-Andrei, et al.
Published: (2025)
by: Bădoiu, Vlad-Andrei, et al.
Published: (2025)
A Large-Scale Benchmark for Evaluating Large Language Models on Medical Question Answering in Romanian
by: Rogoz, Ana-Cristina, et al.
Published: (2025)
by: Rogoz, Ana-Cristina, et al.
Published: (2025)
RoMath: A Mathematical Reasoning Benchmark in Romanian
by: Cosma, Adrian, et al.
Published: (2024)
by: Cosma, Adrian, et al.
Published: (2024)
An Interpretable N-gram Perplexity Threat Model for Large Language Model Jailbreaks
by: Boreiko, Valentyn, et al.
Published: (2024)
by: Boreiko, Valentyn, et al.
Published: (2024)
Neural Grammatical Error Correction for Romanian
by: Cotet, Teodor-Mihai, et al.
Published: (2026)
by: Cotet, Teodor-Mihai, et al.
Published: (2026)
Contrastive Analysis of Linguistic Representations in Large Language Model Outputs through Structured Synthetic Data Generation and Abstracted N-gram Associations
by: Desimone, S. A., et al.
Published: (2026)
by: Desimone, S. A., et al.
Published: (2026)
Similar Items
-
Language Detection by Means of the Minkowski Norm: Identification Through Character Bigrams and Frequency Analysis
by: Pogăcean, Paul-Andrei, et al.
Published: (2025) -
RELATE: A Modern Processing Platform for Romanian Language
by: Păiş, Vasile, et al.
Published: (2024) -
Age determination of sediment core CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018) -
Pollen profile CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018) -
Lithology of sediment core CAPATANA, Capatana, Romania
by: Farcas, Sorina, et al.
Published: (2018)