Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Du, Zhekai, Li, Xinyao, Li, Fengling, Lu, Ke, Zhu, Lei, Li, Jingjing
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2403.02899
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917604630724608
author	Du, Zhekai Li, Xinyao Li, Fengling Lu, Ke Zhu, Lei Li, Jingjing
author_facet	Du, Zhekai Li, Xinyao Li, Fengling Lu, Ke Zhu, Lei Li, Jingjing
contents	Conventional Unsupervised Domain Adaptation (UDA) strives to minimize distribution discrepancy between domains, which neglects to harness rich semantics from data and struggles to handle complex domain shifts. A promising technique is to leverage the knowledge of large-scale pre-trained vision-language models for more guided adaptation. Despite some endeavors, current methods often learn textual prompts to embed domain semantics for source and target domains separately and perform classification within each domain, limiting cross-domain knowledge transfer. Moreover, prompting only the language branch lacks flexibility to adapt both modalities dynamically. To bridge this gap, we propose Domain-Agnostic Mutual Prompting (DAMP) to exploit domain-invariant semantics by mutually aligning visual and textual embeddings. Specifically, the image contextual information is utilized to prompt the language branch in a domain-agnostic and instance-conditioned way. Meanwhile, visual prompts are imposed based on the domain-agnostic textual prompt to elicit domain-invariant visual embeddings. These two branches of prompts are learned mutually with a cross-attention module and regularized with a semantic-consistency loss and an instance-discrimination contrastive loss. Experiments on three UDA benchmarks demonstrate the superiority of DAMP over state-of-the-art approaches.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_02899
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation Du, Zhekai Li, Xinyao Li, Fengling Lu, Ke Zhu, Lei Li, Jingjing Artificial Intelligence Conventional Unsupervised Domain Adaptation (UDA) strives to minimize distribution discrepancy between domains, which neglects to harness rich semantics from data and struggles to handle complex domain shifts. A promising technique is to leverage the knowledge of large-scale pre-trained vision-language models for more guided adaptation. Despite some endeavors, current methods often learn textual prompts to embed domain semantics for source and target domains separately and perform classification within each domain, limiting cross-domain knowledge transfer. Moreover, prompting only the language branch lacks flexibility to adapt both modalities dynamically. To bridge this gap, we propose Domain-Agnostic Mutual Prompting (DAMP) to exploit domain-invariant semantics by mutually aligning visual and textual embeddings. Specifically, the image contextual information is utilized to prompt the language branch in a domain-agnostic and instance-conditioned way. Meanwhile, visual prompts are imposed based on the domain-agnostic textual prompt to elicit domain-invariant visual embeddings. These two branches of prompts are learned mutually with a cross-attention module and regularized with a semantic-consistency loss and an instance-discrimination contrastive loss. Experiments on three UDA benchmarks demonstrate the superiority of DAMP over state-of-the-art approaches.
title	Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
topic	Artificial Intelligence
url	https://arxiv.org/abs/2403.02899

Similar Items