Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sun, Li, Wu, Jiefeng, Chen, Feng, Liu, Ruizhe, Yang, Yanchao
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2508.18802
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916958770823168
author	Sun, Li Wu, Jiefeng Chen, Feng Liu, Ruizhe Yang, Yanchao
author_facet	Sun, Li Wu, Jiefeng Chen, Feng Liu, Ruizhe Yang, Yanchao
contents	Effective policy learning for robotic manipulation requires scene representations that selectively capture task-relevant environmental features. Current approaches typically employ task-agnostic representation extraction, failing to emulate the dynamic perceptual adaptation observed in human cognition. We present HyperTASR, a hypernetwork-driven framework that modulates scene representations based on both task objectives and the execution phase. Our architecture dynamically generates representation transformation parameters conditioned on task specifications and progression state, enabling representations to evolve contextually throughout task execution. This approach maintains architectural compatibility with existing policy learning frameworks while fundamentally reconfiguring how visual features are processed. Unlike methods that simply concatenate or fuse task embeddings with task-agnostic representations, HyperTASR establishes computational separation between task-contextual and state-dependent processing paths, enhancing learning efficiency and representational quality. Comprehensive evaluations in both simulation and real-world environments demonstrate substantial performance improvements across different representation paradigms. Through ablation studies and attention visualization, we confirm that our approach selectively prioritizes task-relevant scene information, closely mirroring human adaptive perception during manipulation tasks. The project website is at https://lisunphil.github.io/HyperTASR_projectpage/.
format	Preprint
id	arxiv_https___arxiv_org_abs_2508_18802
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	HyperTASR: Hypernetwork-Driven Task-Aware Scene Representations for Robust Manipulation Sun, Li Wu, Jiefeng Chen, Feng Liu, Ruizhe Yang, Yanchao Robotics Effective policy learning for robotic manipulation requires scene representations that selectively capture task-relevant environmental features. Current approaches typically employ task-agnostic representation extraction, failing to emulate the dynamic perceptual adaptation observed in human cognition. We present HyperTASR, a hypernetwork-driven framework that modulates scene representations based on both task objectives and the execution phase. Our architecture dynamically generates representation transformation parameters conditioned on task specifications and progression state, enabling representations to evolve contextually throughout task execution. This approach maintains architectural compatibility with existing policy learning frameworks while fundamentally reconfiguring how visual features are processed. Unlike methods that simply concatenate or fuse task embeddings with task-agnostic representations, HyperTASR establishes computational separation between task-contextual and state-dependent processing paths, enhancing learning efficiency and representational quality. Comprehensive evaluations in both simulation and real-world environments demonstrate substantial performance improvements across different representation paradigms. Through ablation studies and attention visualization, we confirm that our approach selectively prioritizes task-relevant scene information, closely mirroring human adaptive perception during manipulation tasks. The project website is at https://lisunphil.github.io/HyperTASR_projectpage/.
title	HyperTASR: Hypernetwork-Driven Task-Aware Scene Representations for Robust Manipulation
topic	Robotics
url	https://arxiv.org/abs/2508.18802

Similar Items