Saved in:
| Main Authors: | Borah, Abhilekh, Ghosh, Shubhra, Joshi, Kedar, Guru, Aditya Kumar, Ghosh, Kripabandhu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01132 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ObfusQAte: A Proposed Framework to Evaluate LLM Robustness on Obfuscated Factual Question Answering
by: Ghosh, Shubhra, et al.
Published: (2025)
by: Ghosh, Shubhra, et al.
Published: (2025)
SMITE: Enhancing Fairness in LLMs through Optimal In-Context Example Selection via Dynamic Validation
by: Chhikara, Garima, et al.
Published: (2025)
by: Chhikara, Garima, et al.
Published: (2025)
DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture
by: Maji, Arijit, et al.
Published: (2025)
by: Maji, Arijit, et al.
Published: (2025)
Code-Mixer Ya Nahi: Novel Approaches to Measuring Multilingual LLMs' Code-Mixing Capabilities
by: Gupta, Ayushman, et al.
Published: (2024)
by: Gupta, Ayushman, et al.
Published: (2024)
Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization
by: Deroy, Aniket, et al.
Published: (2024)
by: Deroy, Aniket, et al.
Published: (2024)
Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs
by: Sridhar, Srivarshinee, et al.
Published: (2025)
by: Sridhar, Srivarshinee, et al.
Published: (2025)
SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations
by: Khanna, Danush, et al.
Published: (2025)
by: Khanna, Danush, et al.
Published: (2025)
Don't Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation
by: Moon, Jiwon, et al.
Published: (2025)
by: Moon, Jiwon, et al.
Published: (2025)
ReGal: A First Look at PPO-based Legal AI for Judgment Prediction and Summarization in India
by: Nigam, Shubham Kumar, et al.
Published: (2025)
by: Nigam, Shubham Kumar, et al.
Published: (2025)
Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentences
by: Gupta, Ayushman, et al.
Published: (2024)
by: Gupta, Ayushman, et al.
Published: (2024)
CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ)
by: Borah, Abhilekh, et al.
Published: (2025)
by: Borah, Abhilekh, et al.
Published: (2025)
Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
by: Roy, Amartya, et al.
Published: (2024)
by: Roy, Amartya, et al.
Published: (2024)
Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification
by: Chhikara, Garima, et al.
Published: (2024)
by: Chhikara, Garima, et al.
Published: (2024)
QuickSilver -- Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization
by: Khanna, Danush, et al.
Published: (2025)
by: Khanna, Danush, et al.
Published: (2025)
MARRO: Multi-headed Attention for Rhetorical Role Labeling in Legal Documents
by: Bambroo, Purbid, et al.
Published: (2025)
by: Bambroo, Purbid, et al.
Published: (2025)
Calibrate, Don't Curate: Label-Efficient Estimation from Noisy LLM Judges
by: Li, Yanran
Published: (2026)
by: Li, Yanran
Published: (2026)
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain
by: Du, Yanrui, et al.
Published: (2023)
by: Du, Yanrui, et al.
Published: (2023)
LaMSUM: Amplifying Voices Against Harassment through LLM Guided Extractive Summarization of User Incident Reports
by: Chhikara, Garima, et al.
Published: (2024)
by: Chhikara, Garima, et al.
Published: (2024)
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
by: Hans, Abhimanyu, et al.
Published: (2024)
by: Hans, Abhimanyu, et al.
Published: (2024)
Causal Reasoning Favors Encoders: On The Limits of Decoder-Only Models
by: Roy, Amartya, et al.
Published: (2025)
by: Roy, Amartya, et al.
Published: (2025)
Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej
by: Nigam, Shubham Kumar, et al.
Published: (2025)
by: Nigam, Shubham Kumar, et al.
Published: (2025)
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
by: Nigam, Shubham Kumar, et al.
Published: (2024)
by: Nigam, Shubham Kumar, et al.
Published: (2024)
Don't Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning
by: Qin, Yuehan, et al.
Published: (2025)
by: Qin, Yuehan, et al.
Published: (2025)
Don't Pay Attention
by: Hammoud, Mohammad, et al.
Published: (2025)
by: Hammoud, Mohammad, et al.
Published: (2025)
Don't Touch My Diacritics
by: Gorman, Kyle, et al.
Published: (2024)
by: Gorman, Kyle, et al.
Published: (2024)
Your Students Don't Use LLMs Like You Wish They Did
by: Kobler, Sebastian, et al.
Published: (2026)
by: Kobler, Sebastian, et al.
Published: (2026)
LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification
by: Nigam, Shubham Kumar, et al.
Published: (2025)
by: Nigam, Shubham Kumar, et al.
Published: (2025)
Show, Don't Tell: Uncovering Implicit Character Portrayal using LLMs
by: Jaipersaud, Brandon, et al.
Published: (2024)
by: Jaipersaud, Brandon, et al.
Published: (2024)
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models
by: Faria, Gonçalo, et al.
Published: (2025)
by: Faria, Gonçalo, et al.
Published: (2025)
If You Don't Understand It, Don't Use It: Eliminating Trojans with Filters Between Layers
by: Hernandez, Adriano
Published: (2024)
by: Hernandez, Adriano
Published: (2024)
NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis
by: Nigam, Shubham Kumar, et al.
Published: (2024)
by: Nigam, Shubham Kumar, et al.
Published: (2024)
TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context
by: Nigam, Shubham Kumar, et al.
Published: (2025)
by: Nigam, Shubham Kumar, et al.
Published: (2025)
Why Don't You Know? Evaluating the Impact of Uncertainty Sources on Uncertainty Quantification in LLMs
by: Goloburda, Maiya, et al.
Published: (2026)
by: Goloburda, Maiya, et al.
Published: (2026)
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval
by: Sinha, Aarush
Published: (2025)
by: Sinha, Aarush
Published: (2025)
Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering
by: Chowdhury, Arijit Ghosh, et al.
Published: (2023)
by: Chowdhury, Arijit Ghosh, et al.
Published: (2023)
Machine Generated Product Advertisements: Benchmarking LLMs Against Human Performance
by: Ghosh, Sanjukta
Published: (2024)
by: Ghosh, Sanjukta
Published: (2024)
Don't Think of the White Bear: Ironic Negation in Transformer Models Under Cognitive Load
by: Mann, Logan, et al.
Published: (2025)
by: Mann, Logan, et al.
Published: (2025)
Don't Throw Away Your Pretrained Model
by: Feng, Shangbin, et al.
Published: (2025)
by: Feng, Shangbin, et al.
Published: (2025)
Don't Say No: Jailbreaking LLM by Suppressing Refusal
by: Zhou, Yukai, et al.
Published: (2024)
by: Zhou, Yukai, et al.
Published: (2024)
Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation
by: Parikh, Aditya, et al.
Published: (2026)
by: Parikh, Aditya, et al.
Published: (2026)
Similar Items
-
ObfusQAte: A Proposed Framework to Evaluate LLM Robustness on Obfuscated Factual Question Answering
by: Ghosh, Shubhra, et al.
Published: (2025) -
SMITE: Enhancing Fairness in LLMs through Optimal In-Context Example Selection via Dynamic Validation
by: Chhikara, Garima, et al.
Published: (2025) -
DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture
by: Maji, Arijit, et al.
Published: (2025) -
Code-Mixer Ya Nahi: Novel Approaches to Measuring Multilingual LLMs' Code-Mixing Capabilities
by: Gupta, Ayushman, et al.
Published: (2024) -
Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization
by: Deroy, Aniket, et al.
Published: (2024)