Saved in:
| Main Authors: | Waldetoft, Hannes, Torgander, Jakob, Magnusson, Måns |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.04643 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Formalising Anti-Discrimination Law in Automated Decision Systems
by: Sargeant, Holli, et al.
Published: (2024)
by: Sargeant, Holli, et al.
Published: (2024)
posteriordb: Testing, Benchmarking and Developing Bayesian Inference Algorithms
by: Magnusson, Måns, et al.
Published: (2024)
by: Magnusson, Måns, et al.
Published: (2024)
Posterior Sampling of Probabilistic Word Embeddings
by: Yrjänäinen, Väinö, et al.
Published: (2025)
by: Yrjänäinen, Väinö, et al.
Published: (2025)
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
by: Shingi, Geet, et al.
Published: (2021)
by: Shingi, Geet, et al.
Published: (2021)
Classification is a RAG problem: A case study on hate speech detection
by: Willats, Richard, et al.
Published: (2025)
by: Willats, Richard, et al.
Published: (2025)
Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media
by: Kikkisetti, Dhanush, et al.
Published: (2024)
by: Kikkisetti, Dhanush, et al.
Published: (2024)
Density estimation with LLMs: a geometric investigation of in-context learning trajectories
by: Liu, Toni J. B., et al.
Published: (2024)
by: Liu, Toni J. B., et al.
Published: (2024)
Enriching language models with graph-based context information to better understand textual data
by: Roethel, Albert, et al.
Published: (2023)
by: Roethel, Albert, et al.
Published: (2023)
DataDecide: How to Predict Best Pretraining Data with Small Experiments
by: Magnusson, Ian, et al.
Published: (2025)
by: Magnusson, Ian, et al.
Published: (2025)
Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?
by: Weber, Manuel, et al.
Published: (2025)
by: Weber, Manuel, et al.
Published: (2025)
Cross-Modal Safety Alignment: Is textual unlearning all you need?
by: Chakraborty, Trishna, et al.
Published: (2024)
by: Chakraborty, Trishna, et al.
Published: (2024)
Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
by: Na, Clara, et al.
Published: (2024)
by: Na, Clara, et al.
Published: (2024)
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
by: Heineman, David, et al.
Published: (2025)
by: Heineman, David, et al.
Published: (2025)
A density estimation perspective on learning from pairwise human preferences
by: Dumoulin, Vincent, et al.
Published: (2023)
by: Dumoulin, Vincent, et al.
Published: (2023)
Linear probes rely on textual evidence: Results from leakage mitigation studies in language models
by: Boxo, Gerard, et al.
Published: (2025)
by: Boxo, Gerard, et al.
Published: (2025)
Class flipping for uplift modeling and Heterogeneous Treatment Effect estimation on imbalanced RCT data
by: Rudaś, Krzysztof, et al.
Published: (2024)
by: Rudaś, Krzysztof, et al.
Published: (2024)
Ultra-imbalanced classification guided by statistical information
by: Jin, Yin, et al.
Published: (2024)
by: Jin, Yin, et al.
Published: (2024)
Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators
by: González-González, Jaime, et al.
Published: (2024)
by: González-González, Jaime, et al.
Published: (2024)
Exploring Bias and Prediction Metrics to Characterise the Fairness of Machine Learning for Equity-Centered Public Health Decision-Making: A Narrative Review
by: Raza, Shaina, et al.
Published: (2024)
by: Raza, Shaina, et al.
Published: (2024)
What's In My Big Data?
by: Elazar, Yanai, et al.
Published: (2023)
by: Elazar, Yanai, et al.
Published: (2023)
Bridging the gap in online hate speech detection: a comparative analysis of BERT and traditional models for homophobic content identification on X/Twitter
by: McGiff, Josh, et al.
Published: (2024)
by: McGiff, Josh, et al.
Published: (2024)
Gradient boundaries through confidence intervals for forced alignment estimates using model ensembles
by: Kelley, Matthew C.
Published: (2025)
by: Kelley, Matthew C.
Published: (2025)
Computational Measurement of Political Positions: A Review of Text-Based Ideal Point Estimation Algorithms
by: Parschan, Patrick, et al.
Published: (2025)
by: Parschan, Patrick, et al.
Published: (2025)
Hallucinations are inevitable but can be made statistically negligible
by: Suzuki, Atsushi, et al.
Published: (2025)
by: Suzuki, Atsushi, et al.
Published: (2025)
Proposal and study of statistical features for string similarity computation and classification
by: Rodrigues, E. O., et al.
Published: (2026)
by: Rodrigues, E. O., et al.
Published: (2026)
Artificial Intelligence for Public Health Surveillance in Africa: Applications and Opportunities
by: Tshimula, Jean Marie, et al.
Published: (2024)
by: Tshimula, Jean Marie, et al.
Published: (2024)
Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)
Automatic Extraction of Disease Risk Factors from Medical Publications
by: Rubchinsky, Maxim, et al.
Published: (2024)
by: Rubchinsky, Maxim, et al.
Published: (2024)
Fluid Language Model Benchmarking
by: Hofmann, Valentin, et al.
Published: (2025)
by: Hofmann, Valentin, et al.
Published: (2025)
Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction
by: Opiełka, Gustaw, et al.
Published: (2025)
by: Opiełka, Gustaw, et al.
Published: (2025)
Causality $\neq$ Invariance: Function and Concept Vectors in LLMs
by: Opiełka, Gustaw, et al.
Published: (2026)
by: Opiełka, Gustaw, et al.
Published: (2026)
Large language model as user daily behavior data generator: balancing population diversity and individual personality
by: Li, Haoxin, et al.
Published: (2025)
by: Li, Haoxin, et al.
Published: (2025)
Analysing Public Transport User Sentiment on Low Resource Multilingual Data
by: Myoya, Rozina L., et al.
Published: (2024)
by: Myoya, Rozina L., et al.
Published: (2024)
PopBERT. Detecting populism and its host ideologies in the German Bundestag
by: Erhard, L., et al.
Published: (2023)
by: Erhard, L., et al.
Published: (2023)
A statistical theory of overfitting for imbalanced classification
by: Lyu, Jingyang, et al.
Published: (2025)
by: Lyu, Jingyang, et al.
Published: (2025)
Exploring Public Attention in the Circular Economy through Topic Modelling with Twin Hyperparameter Optimisation
by: Song, Junhao, et al.
Published: (2024)
by: Song, Junhao, et al.
Published: (2024)
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
by: Fieback, Laura, et al.
Published: (2024)
by: Fieback, Laura, et al.
Published: (2024)
Predicting Evoked Emotions in Conversations
by: Altarawneh, Enas, et al.
Published: (2023)
by: Altarawneh, Enas, et al.
Published: (2023)
Predicting Emergent Capabilities by Finetuning
by: Snell, Charlie, et al.
Published: (2024)
by: Snell, Charlie, et al.
Published: (2024)
Joint Training for Selective Prediction
by: Li, Zhaohui, et al.
Published: (2024)
by: Li, Zhaohui, et al.
Published: (2024)
Similar Items
-
Formalising Anti-Discrimination Law in Automated Decision Systems
by: Sargeant, Holli, et al.
Published: (2024) -
posteriordb: Testing, Benchmarking and Developing Bayesian Inference Algorithms
by: Magnusson, Måns, et al.
Published: (2024) -
Posterior Sampling of Probabilistic Word Embeddings
by: Yrjänäinen, Väinö, et al.
Published: (2025) -
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
by: Shingi, Geet, et al.
Published: (2021) -
Classification is a RAG problem: A case study on hate speech detection
by: Willats, Richard, et al.
Published: (2025)