data

VidSitu Annotation Files:

File Structure:

vidsitu_annotations
├── split_files
│   ├── vseg_split_testevrel_lb.json
│   ├── vseg_split_testsrl_lb.json
│   ├── vseg_split_testvb_lb.json
│   ├── vseg_split_train_lb.json
│   └── vseg_split_valid_lb.json
├── vinfo_files
│   ├── vinfo_train_lb.json
│   └── vinfo_valid_lb.json
└── vseg_ann_files
    ├── vsann_testevrel_noann_lb.json
    ├── vsann_testsrl_noann_lb.json
    ├── vsann_train_lb.json
    └── vsann_valid_lb.json

There are three directories:

split_files/:
- Containing Splits for Train, Validation, and Test Sets (three test sets).
- Each *.json file is a list of video segment ids of the form v_{video_id}_seg_{start_second}_{end_second}
- By construction, duration is 10 second for all videos

vinfo_files/:
- Containing Video Information for Train and Validation.
- Each *.json file is List[Dict]
- Keys for one instance:
  - vid_uid: unique identifier for a video
  - genres: Genres of the corresponding movie
  - duration: Entire duration of the video (not 10 seconds)
  - name: Video ID corresponding to YouTube
  - text: Text provided in Youtube Descriptions
  - start: Start time in the video
  - end: End time in the video
  - vid_seg_int: Identifier for the 10-second movie clip of the form v_{video_id}_seg_{start_second}_{end_second}
  - movie_name: Name of the corresponding movie
  - upload_year: Year it was uploaded to Youtube.
  - clip_name: Title of the Youtube Video
  - imdb_id: Imdb id for the movie
  - year_rel: Year when the movie was released.
- For Validation file, the following keys are additionally present:
  - vb_to_use: The verbs used to gather Semantic Roles and Event Relations.
  - vb_lst: A Dictionary with keys Ev1-Ev5 and values as a list of 10-verbs
  - vbid_lst: Same as above, but instead of verb names, uses the verb ids.
vseg_ann_files/: Containing Video Segment Annotations (SRL + Event Relations) for Train and Validation.
- Contains SRL and Event Relations information
- Each *.json file is a List[Dict[Dict]]
- Each Dict has the keys: ['Ev1', 'Ev2', 'Ev3', 'Ev4', 'Ev5']. The corresponding value is also a Dict with the following keys:
  - vid_uid: unique identifier for a video
  - vid_seg_int: Identifier for the 10-second movie clip of the form v_{video_id}_seg_{start_second}_{end_second}
  - EvRel: Event Relation with respect to Event 3. NOTE: Ev3 doesn't have EvRel key
  - Arg_List: A Dictionary with Argument names for the verb with their respective positions.
  - Args: The relevant Semantic Roles for the given Verb. This is a Dictionary with the same keys in Arg_List, but the values are strings representing the roles.
  - Verb: Verb name
  - VerbID: The Verb ID.

Please refer to https://github.com/TheShadow29/VidSitu/blob/main/data/DATA_PREP.md for instructions on setting up the dataset.

Name		Name	Last commit message	Last commit date
parent directory ..
DATA_PREP.md		DATA_PREP.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

README.md

VidSitu Annotation Files:

Files

data

Directory actions

More options

Directory actions

More options

Latest commit

History

data

Folders and files

parent directory

README.md

VidSitu Annotation Files: