:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Waldetoft, Hannes, Torgander, Jakob, Magnusson, Måns
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2505.04643
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Formalising Anti-Discrimination Law in Automated Decision Systems
by: Sargeant, Holli, et al.
Published: (2024)

posteriordb: Testing, Benchmarking and Developing Bayesian Inference Algorithms
by: Magnusson, Måns, et al.
Published: (2024)

Posterior Sampling of Probabilistic Word Embeddings
by: Yrjänäinen, Väinö, et al.
Published: (2025)

AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
by: Shingi, Geet, et al.
Published: (2021)

Classification is a RAG problem: A case study on hate speech detection
by: Willats, Richard, et al.
Published: (2025)

Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media
by: Kikkisetti, Dhanush, et al.
Published: (2024)

Density estimation with LLMs: a geometric investigation of in-context learning trajectories
by: Liu, Toni J. B., et al.
Published: (2024)

Enriching language models with graph-based context information to better understand textual data
by: Roethel, Albert, et al.
Published: (2023)

DataDecide: How to Predict Best Pretraining Data with Small Experiments
by: Magnusson, Ian, et al.
Published: (2025)

Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?
by: Weber, Manuel, et al.
Published: (2025)

Cross-Modal Safety Alignment: Is textual unlearning all you need?
by: Chakraborty, Trishna, et al.
Published: (2024)

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
by: Na, Clara, et al.
Published: (2024)

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
by: Heineman, David, et al.
Published: (2025)

A density estimation perspective on learning from pairwise human preferences
by: Dumoulin, Vincent, et al.
Published: (2023)

Linear probes rely on textual evidence: Results from leakage mitigation studies in language models
by: Boxo, Gerard, et al.
Published: (2025)

Class flipping for uplift modeling and Heterogeneous Treatment Effect estimation on imbalanced RCT data
by: Rudaś, Krzysztof, et al.
Published: (2024)

Ultra-imbalanced classification guided by statistical information
by: Jin, Yin, et al.
Published: (2024)

Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators
by: González-González, Jaime, et al.
Published: (2024)

Exploring Bias and Prediction Metrics to Characterise the Fairness of Machine Learning for Equity-Centered Public Health Decision-Making: A Narrative Review
by: Raza, Shaina, et al.
Published: (2024)

What's In My Big Data?
by: Elazar, Yanai, et al.
Published: (2023)

Bridging the gap in online hate speech detection: a comparative analysis of BERT and traditional models for homophobic content identification on X/Twitter
by: McGiff, Josh, et al.
Published: (2024)

Gradient boundaries through confidence intervals for forced alignment estimates using model ensembles
by: Kelley, Matthew C.
Published: (2025)

Computational Measurement of Political Positions: A Review of Text-Based Ideal Point Estimation Algorithms
by: Parschan, Patrick, et al.
Published: (2025)

Hallucinations are inevitable but can be made statistically negligible
by: Suzuki, Atsushi, et al.
Published: (2025)

Proposal and study of statistical features for string similarity computation and classification
by: Rodrigues, E. O., et al.
Published: (2026)

Artificial Intelligence for Public Health Surveillance in Africa: Applications and Opportunities
by: Tshimula, Jean Marie, et al.
Published: (2024)

Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)

Automatic Extraction of Disease Risk Factors from Medical Publications
by: Rubchinsky, Maxim, et al.
Published: (2024)

Fluid Language Model Benchmarking
by: Hofmann, Valentin, et al.
Published: (2025)

Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction
by: Opiełka, Gustaw, et al.
Published: (2025)

Causality $\neq$ Invariance: Function and Concept Vectors in LLMs
by: Opiełka, Gustaw, et al.
Published: (2026)

Large language model as user daily behavior data generator: balancing population diversity and individual personality
by: Li, Haoxin, et al.
Published: (2025)

Analysing Public Transport User Sentiment on Low Resource Multilingual Data
by: Myoya, Rozina L., et al.
Published: (2024)

PopBERT. Detecting populism and its host ideologies in the German Bundestag
by: Erhard, L., et al.
Published: (2023)

A statistical theory of overfitting for imbalanced classification
by: Lyu, Jingyang, et al.
Published: (2025)

Exploring Public Attention in the Circular Economy through Topic Modelling with Twin Hyperparameter Optimisation
by: Song, Junhao, et al.
Published: (2024)

MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
by: Fieback, Laura, et al.
Published: (2024)

Predicting Evoked Emotions in Conversations
by: Altarawneh, Enas, et al.
Published: (2023)

Predicting Emergent Capabilities by Finetuning
by: Snell, Charlie, et al.
Published: (2024)

Joint Training for Selective Prediction
by: Li, Zhaohui, et al.
Published: (2024)