:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Singhal, Rishi, Kim, Jung-Eun
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.10566
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
by: Qu, Leigang, et al.
Published: (2025)

Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
by: Lou, Siyu, et al.
Published: (2024)

We Should Separate Memorization from Copyright
by: Haviv, Adi, et al.
Published: (2026)

Frozen Transformers in Language Models Are Effective Visual Encoder Layers
by: Pang, Ziqi, et al.
Published: (2023)

AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers
by: Achtibat, Reduan, et al.
Published: (2024)

Explainable AI: Context-Aware Layer-Wise Integrated Gradients for Explaining Transformer Models
by: Mersha, Melkamu Abay, et al.
Published: (2026)

RESQUE: Quantifying Estimator to Task and Distribution Shift for Sustainable Model Reusability
by: Sangarya, Vishwesh, et al.
Published: (2024)

A General and Efficient Training for Transformer via Token Expansion
by: Huang, Wenxuan, et al.
Published: (2024)

Accelerating Vision Transformers with Adaptive Patch Sizes
by: Choudhury, Rohan, et al.
Published: (2025)

Memorization In Stable Diffusion Is Unexpectedly Driven by CLIP Embeddings
by: Kim, Bumjun, et al.
Published: (2026)

Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
by: Park, Dongmin, et al.
Published: (2024)

Small Vision-Language Models: A Survey on Compact Architectures and Techniques
by: Patnaik, Nitesh, et al.
Published: (2025)

Aggregate Representation Measure for Predictive Model Reusability
by: Sangarya, Vishwesh, et al.
Published: (2024)

Estimating Environmental Cost Throughout Model's Adaptive Life Cycle
by: Sangarya, Vishwesh, et al.
Published: (2024)

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
by: Teo, Rachel S. Y., et al.
Published: (2025)

Tracing Representation Progression: Analyzing and Enhancing Layer-Wise Similarity
by: Jiang, Jiachen, et al.
Published: (2024)

Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation
by: Eger, Steffen, et al.
Published: (2025)

Transformers without Normalization
by: Zhu, Jiachen, et al.
Published: (2025)

LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
by: Lee, Suhyeon, et al.
Published: (2023)

Stronger Normalization-Free Transformers
by: Chen, Mingzhi, et al.
Published: (2025)

Weighted Multi-Prompt Learning with Description-free Large Language Model Distillation
by: Lee, Sua, et al.
Published: (2025)

Captured by Captions: On Memorization and its Mitigation in CLIP Models
by: Wang, Wenhao, et al.
Published: (2025)

Universal Approximation of Visual Autoregressive Transformers
by: Chen, Yifang, et al.
Published: (2025)

Transformers are Stateless Differentiable Neural Computers
by: Tang, Bo, et al.
Published: (2026)

On Memorization in Diffusion Models
by: Gu, Xiangming, et al.
Published: (2023)

Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
by: Jang, Yunseok, et al.
Published: (2025)

Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN
by: Zhang, Junpeng, et al.
Published: (2025)

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
by: Sung, Yi-Lin, et al.
Published: (2023)

Energy-Based Transformers are Scalable Learners and Thinkers
by: Gladstone, Alexi, et al.
Published: (2025)

SeqPE: Transformer with Sequential Position Encoding
by: Li, Huayang, et al.
Published: (2025)

State Space Model for New-Generation Network Alternative to Transformers: A Survey
by: Wang, Xiao, et al.
Published: (2024)

Representation Magnitude has a Liability to Privacy Vulnerability
by: Fang, Xingli, et al.
Published: (2024)

Impact of Noisy Supervision in Foundation Model Learning
by: Chen, Hao, et al.
Published: (2024)

A Primal-Dual Framework for Transformers and Neural Networks
by: Nguyen, Tan M., et al.
Published: (2024)

IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
by: Lee, Soeun, et al.
Published: (2024)

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
by: Kim, Taewhan, et al.
Published: (2024)

BiCLIP: Domain Canonicalization via Structured Geometric Transformation
by: Mantini, Pranav, et al.
Published: (2026)

SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
by: Kim, Si-Woo, et al.
Published: (2025)

GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values
by: Javadi, Farnoosh, et al.
Published: (2023)

REALEDIT: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations
by: Sushko, Peter, et al.
Published: (2025)