Saved in:
| Main Author: | Perreaux, Nicolas |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.12230 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
by: Bourne, Jonathan
Published: (2025)
by: Bourne, Jonathan
Published: (2025)
A maturity model for catalogues of semantic artefacts
by: Corcho, Oscar, et al.
Published: (2023)
by: Corcho, Oscar, et al.
Published: (2023)
Unveiling Temporal Trends in 19th Century Literature: An Information Retrieval Approach
by: Datta, Suchana, et al.
Published: (2025)
by: Datta, Suchana, et al.
Published: (2025)
Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata
by: Ahlin, Axel, et al.
Published: (2024)
by: Ahlin, Axel, et al.
Published: (2024)
Citation proximus: the role of social and semantic ties in citing behaviour
by: Kozlowski, Diego, et al.
Published: (2025)
by: Kozlowski, Diego, et al.
Published: (2025)
Estimating the prevalence of LLM-assisted text in scholarly writing
by: Gray, Andrew
Published: (2025)
by: Gray, Andrew
Published: (2025)
HERITRACE in action: the ParaText project as a case study for semantic data management in Classical Philology
by: Filograsso, Francesca, et al.
Published: (2025)
by: Filograsso, Francesca, et al.
Published: (2025)
Identifying and extracting Data Access Statements from full-text academic articles
by: Pride, David, et al.
Published: (2025)
by: Pride, David, et al.
Published: (2025)
Have LLM-associated terms increased in article full texts in all fields?
by: Thelwall, Mike, et al.
Published: (2026)
by: Thelwall, Mike, et al.
Published: (2026)
Predicting citation impact of research papers using GPT and other text embeddings
by: Vital Jr., Adilson, et al.
Published: (2024)
by: Vital Jr., Adilson, et al.
Published: (2024)
Automatically detecting scientific political science texts from a large general document index
by: Smirnova, Nina
Published: (2024)
by: Smirnova, Nina
Published: (2024)
Beyond coauthorship: semantic structure and phantom collaborators in transportation research, 1967--2025
by: Choi, Seongjin
Published: (2026)
by: Choi, Seongjin
Published: (2026)
Which topics are best represented by science maps? An analysis of clustering effectiveness for citation and text similarity networks
by: Bascur, Juan Pablo, et al.
Published: (2024)
by: Bascur, Juan Pablo, et al.
Published: (2024)
Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction
by: Manrique-Gómez, Laura, et al.
Published: (2024)
by: Manrique-Gómez, Laura, et al.
Published: (2024)
The changing role of cited papers over time: An analysis of highly cited papers based on a large full-text dataset
by: Lin, Gege, et al.
Published: (2025)
by: Lin, Gege, et al.
Published: (2025)
How much are LLMs changing the language of academic papers after ChatGPT? A multi-database and full text analysis
by: Kousha, Kayvan, et al.
Published: (2025)
by: Kousha, Kayvan, et al.
Published: (2025)
Unsupervised extraction of local and global keywords from a single text
by: Aleksanyan, Lida, et al.
Published: (2023)
by: Aleksanyan, Lida, et al.
Published: (2023)
Requirements for a cooperative information infrastructure for the digital preservation of scholarly blogs
by: Ochsner, Catharina, et al.
Published: (2026)
by: Ochsner, Catharina, et al.
Published: (2026)
Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish
by: Cohen, Kevin, et al.
Published: (2025)
by: Cohen, Kevin, et al.
Published: (2025)
Archives, archival bond, and digital representation: A case study with the International Image Interoperability Framework
by: Critelli, Martin
Published: (2026)
by: Critelli, Martin
Published: (2026)
KwicKwocKwac, a tool for rapidly generating concordances and marking up a literary text
by: Barzaghi, Sebastian, et al.
Published: (2024)
by: Barzaghi, Sebastian, et al.
Published: (2024)
Diamond open access and open infrastructures have shaped the Canadian scholarly journal landscape since the start of the digital era
by: van Bellen, Simon, et al.
Published: (2024)
by: van Bellen, Simon, et al.
Published: (2024)
Designing large language model prompts to extract scores from messy text: A shared dataset and challenge
by: Thelwall, Mike
Published: (2026)
by: Thelwall, Mike
Published: (2026)
Are there stars in Bluesky after the return of Donald Trump to the White House?
by: Arroyo-Machado, Wenceslao, et al.
Published: (2025)
by: Arroyo-Machado, Wenceslao, et al.
Published: (2025)
Are there stars in Bluesky? A comparative exploratory analysis of altmetric mentions between X and Bluesky
by: Arroyo-Machado, Wenceslao, et al.
Published: (2024)
by: Arroyo-Machado, Wenceslao, et al.
Published: (2024)
Time, bits, and nickel: Managing digital and analog continuity
by: Momméja, Julie
Published: (2024)
by: Momméja, Julie
Published: (2024)
Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke
by: Wu, Yu, et al.
Published: (2026)
by: Wu, Yu, et al.
Published: (2026)
Towards Machine-actionable FAIR Digital Objects with a Typing Model that Enables Operations
by: Inckmann, Maximilian, et al.
Published: (2025)
by: Inckmann, Maximilian, et al.
Published: (2025)
A digital perspective on the role of a stemma in material-philological transmission studies
by: Kapitan, Katarzyna Anna
Published: (2025)
by: Kapitan, Katarzyna Anna
Published: (2025)
Automated Generation of Research Workflows from Academic Papers: A Full-text Mining Framework
by: Zhang, Heng, et al.
Published: (2025)
by: Zhang, Heng, et al.
Published: (2025)
Understanding Archives: Towards New Research Interfaces Relying on the Semantic Annotation of Documents
by: Gutehrlé, Nicolas, et al.
Published: (2024)
by: Gutehrlé, Nicolas, et al.
Published: (2024)
GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books
by: Daly, Liza, et al.
Published: (2025)
by: Daly, Liza, et al.
Published: (2025)
A Comparative Analysis of Modeling Approaches for the Association of FAIR Digital Objects Operations
by: Blumenröhr, Nicolas, et al.
Published: (2025)
by: Blumenröhr, Nicolas, et al.
Published: (2025)
PRITES: An integrative framework for investigating and assessing web-scraped HTTP-response datasets for research applications
by: Huang, Cynthia A., et al.
Published: (2025)
by: Huang, Cynthia A., et al.
Published: (2025)
Decoding Knowledge Claims: The Evaluation of Scientific Publication Contributions through Semantic Analysis
by: D'Aniello, Luca, et al.
Published: (2024)
by: D'Aniello, Luca, et al.
Published: (2024)
Assessing the impact of Open Research Information Infrastructures using NLP driven full-text Scientometrics: A case study of the LXCat open-access platform
by: Pandya, Kalp, et al.
Published: (2026)
by: Pandya, Kalp, et al.
Published: (2026)
2024 Brick & Click: An Academic Library Conference (24th, Maryville, Missouri, November 1, 2024)
by: Frank Baudino, Editor, et al.
Published: (2024)
by: Frank Baudino, Editor, et al.
Published: (2024)
One file to share them all: Using the COMBINE Archive and the OMEX format to share all information about a modeling project
by: Bergmann, Frank T., et al.
Published: (2014)
by: Bergmann, Frank T., et al.
Published: (2014)
Building a Media Ecosystem Observatory from Scratch: Infrastructure, Methodology, and Insights
by: Pehlivan, Zeynep, et al.
Published: (2025)
by: Pehlivan, Zeynep, et al.
Published: (2025)
Libraries, Digital Libraries, and Data: Forty years, Four Challenges
by: Borgman, Christine L.
Published: (2025)
by: Borgman, Christine L.
Published: (2025)
Similar Items
-
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
by: Bourne, Jonathan
Published: (2025) -
A maturity model for catalogues of semantic artefacts
by: Corcho, Oscar, et al.
Published: (2023) -
Unveiling Temporal Trends in 19th Century Literature: An Information Retrieval Approach
by: Datta, Suchana, et al.
Published: (2025) -
Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata
by: Ahlin, Axel, et al.
Published: (2024) -
Citation proximus: the role of social and semantic ties in citing behaviour
by: Kozlowski, Diego, et al.
Published: (2025)