Unsupervised Task Graph Generation from Instructional Video Transcripts

Abstract

This work explores the problem of generating task graphs of real-world activities. Unlike prior formulations, we consider a setting where text transcripts of instructional videos demonstrating a real-world activity (e.g., making coffee) are provided, and the goal is to identify the key steps relevant to the task as well as the dependency relationships between these key steps. We propose a novel task graph generation approach that combines the reasoning capabilities of instruction-tuned language models with clustering and ranking components to generate accurate task graphs in a completely unsupervised manner. We show that the proposed approach generates more accurate task graphs than a supervised learning approach on tasks from the ProceL and CrossTask datasets.
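
To make the notion of a task graph concrete, the following is a minimal illustrative sketch, not the authors' method: key steps are nodes, and each step lists the steps it depends on. The step names and the helper `is_valid_order` are hypothetical, chosen only to mirror the "making coffee" example; the sketch uses Python's standard `graphlib` module (Python 3.9+).

```python
from graphlib import TopologicalSorter

# Hypothetical key steps for "making coffee" (illustrative only).
# Each step maps to the set of steps that must be completed before it.
task_graph = {
    "boil water": set(),
    "grind beans": set(),
    "add grounds to filter": {"grind beans"},
    "pour water over grounds": {"boil water", "add grounds to filter"},
    "serve coffee": {"pour water over grounds"},
}


def is_valid_order(graph, order):
    """Return True if every step appears after all of its preconditions."""
    position = {step: i for i, step in enumerate(order)}
    return all(
        position[pre] < position[step]
        for step, pres in graph.items()
        for pre in pres
    )


# One execution order consistent with the dependency relationships.
order = list(TopologicalSorter(task_graph).static_order())
print(order)
```

Representing dependencies as precondition sets keeps the structure easy to check: any sequence of steps can be validated against the graph, which is one simple way a generated task graph could be compared with observed step orderings.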

Cite

Logeswaran, L., Sohn, S., Jang, Y., Lee, M., & Lee, H. (2023). Unsupervised Task Graph Generation from Instructional Video Transcripts. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 3392–3406). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.210
