G4_01241.mp4 (Tested)
Action Sequence: Understanding the order of steps in a task.
If this video follows the standard "G4" dataset conventions, the "long text" description (often used for video-to-text training) would likely look like this:
g4_01241.mp4: Often linked to the GTEA (Georgia Tech Egocentric Activities) dataset or similar egocentric (first-person) video collections.