Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Du, Keqing, Yang, Xinyu, Chen, Hang
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition Multimedia
Online Access:	https://arxiv.org/abs/2311.12401
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909083024490496
author	Du, Keqing Yang, Xinyu Chen, Hang
author_facet	Du, Keqing Yang, Xinyu Chen, Hang
contents	Integrating deep learning and causal discovery has increased the interpretability of Temporal Action Segmentation (TAS) tasks. However, frame-level causal relationships exist many complicated noises outside the segment-level, making it infeasible to directly express macro action semantics. Thus, we propose Causal Abstraction Segmentation Refiner (CASR), which can refine TAS results from various models by enhancing video causality in marginalizing frame-level casual relationships. Specifically, we define the equivalent frame-level casual model and segment-level causal model, so that the causal adjacency matrix constructed from marginalized frame-level causal relationships has the ability to represent the segmnet-level causal relationships. CASR works out by reducing the difference in the causal adjacency matrix between we constructed and pre-segmentation results of backbone models. In addition, we propose a novel evaluation metric Causal Edit Distance (CED) to evaluate the causal interpretability. Extensive experimental results on mainstream datasets indicate that CASR significantly surpasses existing various methods in action segmentation performance, as well as in causal explainability and generalization.
format	Preprint
id	arxiv_https___arxiv_org_abs_2311_12401
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships Du, Keqing Yang, Xinyu Chen, Hang Computer Vision and Pattern Recognition Multimedia Integrating deep learning and causal discovery has increased the interpretability of Temporal Action Segmentation (TAS) tasks. However, frame-level causal relationships exist many complicated noises outside the segment-level, making it infeasible to directly express macro action semantics. Thus, we propose Causal Abstraction Segmentation Refiner (CASR), which can refine TAS results from various models by enhancing video causality in marginalizing frame-level casual relationships. Specifically, we define the equivalent frame-level casual model and segment-level causal model, so that the causal adjacency matrix constructed from marginalized frame-level causal relationships has the ability to represent the segmnet-level causal relationships. CASR works out by reducing the difference in the causal adjacency matrix between we constructed and pre-segmentation results of backbone models. In addition, we propose a novel evaluation metric Causal Edit Distance (CED) to evaluate the causal interpretability. Extensive experimental results on mainstream datasets indicate that CASR significantly surpasses existing various methods in action segmentation performance, as well as in causal explainability and generalization.
title	CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships
topic	Computer Vision and Pattern Recognition Multimedia
url	https://arxiv.org/abs/2311.12401

Similar Items