Saved in:
Bibliographic Details
Main Authors: Du, Keqing, Yang, Xinyu, Chen, Hang
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2311.12401
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909083024490496
author Du, Keqing
Yang, Xinyu
Chen, Hang
author_facet Du, Keqing
Yang, Xinyu
Chen, Hang
contents Integrating deep learning and causal discovery has increased the interpretability of Temporal Action Segmentation (TAS) tasks. However, frame-level causal relationships exist many complicated noises outside the segment-level, making it infeasible to directly express macro action semantics. Thus, we propose Causal Abstraction Segmentation Refiner (CASR), which can refine TAS results from various models by enhancing video causality in marginalizing frame-level casual relationships. Specifically, we define the equivalent frame-level casual model and segment-level causal model, so that the causal adjacency matrix constructed from marginalized frame-level causal relationships has the ability to represent the segmnet-level causal relationships. CASR works out by reducing the difference in the causal adjacency matrix between we constructed and pre-segmentation results of backbone models. In addition, we propose a novel evaluation metric Causal Edit Distance (CED) to evaluate the causal interpretability. Extensive experimental results on mainstream datasets indicate that CASR significantly surpasses existing various methods in action segmentation performance, as well as in causal explainability and generalization.
format Preprint
id arxiv_https___arxiv_org_abs_2311_12401
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships
Du, Keqing
Yang, Xinyu
Chen, Hang
Computer Vision and Pattern Recognition
Multimedia
Integrating deep learning and causal discovery has increased the interpretability of Temporal Action Segmentation (TAS) tasks. However, frame-level causal relationships exist many complicated noises outside the segment-level, making it infeasible to directly express macro action semantics. Thus, we propose Causal Abstraction Segmentation Refiner (CASR), which can refine TAS results from various models by enhancing video causality in marginalizing frame-level casual relationships. Specifically, we define the equivalent frame-level casual model and segment-level causal model, so that the causal adjacency matrix constructed from marginalized frame-level causal relationships has the ability to represent the segmnet-level causal relationships. CASR works out by reducing the difference in the causal adjacency matrix between we constructed and pre-segmentation results of backbone models. In addition, we propose a novel evaluation metric Causal Edit Distance (CED) to evaluate the causal interpretability. Extensive experimental results on mainstream datasets indicate that CASR significantly surpasses existing various methods in action segmentation performance, as well as in causal explainability and generalization.
title CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships
topic Computer Vision and Pattern Recognition
Multimedia
url https://arxiv.org/abs/2311.12401