sample_id
int32 0
199
| task_instruction
stringclasses 10
values | task_instance
dict | response
stringclasses 4
values |
---|---|---|---|
0 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}During which part of the video does the action 'person sitting on a couch' occur?",
"images_path": [
"RPY8D/RPY8D_0.jpeg",
"RPY8D/RPY8D_1.jpeg",
"RPY8D/RPY8D_2.jpeg",
"RPY8D/RPY8D_3.jpeg",
"RPY8D/RPY8D_4.jpeg",
"RPY8D/RPY8D_5.jpeg",
"RPY8D/RPY8D_6.jpeg",
"RPY8D/RPY8D_7.jpeg",
"RPY8D/RPY8D_8.jpeg",
"RPY8D/RPY8D_9.jpeg",
"RPY8D/RPY8D_10.jpeg",
"RPY8D/RPY8D_11.jpeg",
"RPY8D/RPY8D_12.jpeg",
"RPY8D/RPY8D_13.jpeg",
"RPY8D/RPY8D_14.jpeg",
"RPY8D/RPY8D_15.jpeg",
"RPY8D/RPY8D_16.jpeg",
"RPY8D/RPY8D_17.jpeg",
"RPY8D/RPY8D_18.jpeg",
"RPY8D/RPY8D_19.jpeg",
"RPY8D/RPY8D_20.jpeg",
"RPY8D/RPY8D_21.jpeg",
"RPY8D/RPY8D_22.jpeg",
"RPY8D/RPY8D_23.jpeg",
"RPY8D/RPY8D_24.jpeg",
"RPY8D/RPY8D_25.jpeg",
"RPY8D/RPY8D_26.jpeg",
"RPY8D/RPY8D_27.jpeg",
"RPY8D/RPY8D_28.jpeg",
"RPY8D/RPY8D_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"0-0.jpg"
]
} | Throughout the entire video. |
1 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}Can you identify when the action 'person still smiling' happens in the video?",
"images_path": [
"TS2DM/TS2DM_0.jpeg",
"TS2DM/TS2DM_1.jpeg",
"TS2DM/TS2DM_2.jpeg",
"TS2DM/TS2DM_3.jpeg",
"TS2DM/TS2DM_4.jpeg",
"TS2DM/TS2DM_5.jpeg",
"TS2DM/TS2DM_6.jpeg",
"TS2DM/TS2DM_7.jpeg",
"TS2DM/TS2DM_8.jpeg",
"TS2DM/TS2DM_9.jpeg",
"TS2DM/TS2DM_10.jpeg",
"TS2DM/TS2DM_11.jpeg",
"TS2DM/TS2DM_12.jpeg",
"TS2DM/TS2DM_13.jpeg",
"TS2DM/TS2DM_14.jpeg",
"TS2DM/TS2DM_15.jpeg",
"TS2DM/TS2DM_16.jpeg",
"TS2DM/TS2DM_17.jpeg",
"TS2DM/TS2DM_18.jpeg",
"TS2DM/TS2DM_19.jpeg",
"TS2DM/TS2DM_20.jpeg",
"TS2DM/TS2DM_21.jpeg",
"TS2DM/TS2DM_22.jpeg",
"TS2DM/TS2DM_23.jpeg",
"TS2DM/TS2DM_24.jpeg",
"TS2DM/TS2DM_25.jpeg",
"TS2DM/TS2DM_26.jpeg",
"TS2DM/TS2DM_27.jpeg",
"TS2DM/TS2DM_28.jpeg",
"TS2DM/TS2DM_29.jpeg",
"TS2DM/TS2DM_30.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"1-0.jpg"
]
} | Throughout the entire video. |
2 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}In the given video, when does the action 'a person is tidying up the cabinet' take place?",
"images_path": [
"ZNQVC/ZNQVC_0.jpeg",
"ZNQVC/ZNQVC_1.jpeg",
"ZNQVC/ZNQVC_2.jpeg",
"ZNQVC/ZNQVC_3.jpeg",
"ZNQVC/ZNQVC_4.jpeg",
"ZNQVC/ZNQVC_5.jpeg",
"ZNQVC/ZNQVC_6.jpeg",
"ZNQVC/ZNQVC_7.jpeg",
"ZNQVC/ZNQVC_8.jpeg",
"ZNQVC/ZNQVC_9.jpeg",
"ZNQVC/ZNQVC_10.jpeg",
"ZNQVC/ZNQVC_11.jpeg",
"ZNQVC/ZNQVC_12.jpeg",
"ZNQVC/ZNQVC_13.jpeg",
"ZNQVC/ZNQVC_14.jpeg",
"ZNQVC/ZNQVC_15.jpeg",
"ZNQVC/ZNQVC_16.jpeg",
"ZNQVC/ZNQVC_17.jpeg",
"ZNQVC/ZNQVC_18.jpeg",
"ZNQVC/ZNQVC_19.jpeg",
"ZNQVC/ZNQVC_20.jpeg",
"ZNQVC/ZNQVC_21.jpeg",
"ZNQVC/ZNQVC_22.jpeg",
"ZNQVC/ZNQVC_23.jpeg",
"ZNQVC/ZNQVC_24.jpeg",
"ZNQVC/ZNQVC_25.jpeg",
"ZNQVC/ZNQVC_26.jpeg",
"ZNQVC/ZNQVC_27.jpeg",
"ZNQVC/ZNQVC_28.jpeg",
"ZNQVC/ZNQVC_29.jpeg",
"ZNQVC/ZNQVC_30.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video."
],
"combined_1_images": [
"2-0.jpg"
]
} | Throughout the entire video. |
3 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}At what moment in the video does the action 'the person takes a lollipop from a box' occur?",
"images_path": [
"EK5K1/EK5K1_0.jpeg",
"EK5K1/EK5K1_1.jpeg",
"EK5K1/EK5K1_2.jpeg",
"EK5K1/EK5K1_3.jpeg",
"EK5K1/EK5K1_4.jpeg",
"EK5K1/EK5K1_5.jpeg",
"EK5K1/EK5K1_6.jpeg",
"EK5K1/EK5K1_7.jpeg",
"EK5K1/EK5K1_8.jpeg",
"EK5K1/EK5K1_9.jpeg",
"EK5K1/EK5K1_10.jpeg",
"EK5K1/EK5K1_11.jpeg",
"EK5K1/EK5K1_12.jpeg",
"EK5K1/EK5K1_13.jpeg",
"EK5K1/EK5K1_14.jpeg",
"EK5K1/EK5K1_15.jpeg",
"EK5K1/EK5K1_16.jpeg",
"EK5K1/EK5K1_17.jpeg",
"EK5K1/EK5K1_18.jpeg",
"EK5K1/EK5K1_19.jpeg",
"EK5K1/EK5K1_20.jpeg",
"EK5K1/EK5K1_21.jpeg",
"EK5K1/EK5K1_22.jpeg",
"EK5K1/EK5K1_23.jpeg",
"EK5K1/EK5K1_24.jpeg",
"EK5K1/EK5K1_25.jpeg",
"EK5K1/EK5K1_26.jpeg",
"EK5K1/EK5K1_27.jpeg",
"EK5K1/EK5K1_28.jpeg",
"EK5K1/EK5K1_29.jpeg",
"EK5K1/EK5K1_30.jpeg",
"EK5K1/EK5K1_31.jpeg",
"EK5K1/EK5K1_32.jpeg",
"EK5K1/EK5K1_33.jpeg",
"EK5K1/EK5K1_34.jpeg",
"EK5K1/EK5K1_35.jpeg",
"EK5K1/EK5K1_36.jpeg",
"EK5K1/EK5K1_37.jpeg",
"EK5K1/EK5K1_38.jpeg",
"EK5K1/EK5K1_39.jpeg",
"EK5K1/EK5K1_40.jpeg",
"EK5K1/EK5K1_41.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"3-0.jpg"
]
} | Throughout the entire video. |
4 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}When in the video sequence do we observe the action 'person fixes their hair in a mirror'?",
"images_path": [
"L2502/L2502_0.jpeg",
"L2502/L2502_1.jpeg",
"L2502/L2502_2.jpeg",
"L2502/L2502_3.jpeg",
"L2502/L2502_4.jpeg",
"L2502/L2502_5.jpeg",
"L2502/L2502_6.jpeg",
"L2502/L2502_7.jpeg",
"L2502/L2502_8.jpeg",
"L2502/L2502_9.jpeg",
"L2502/L2502_10.jpeg",
"L2502/L2502_11.jpeg",
"L2502/L2502_12.jpeg",
"L2502/L2502_13.jpeg",
"L2502/L2502_14.jpeg",
"L2502/L2502_15.jpeg",
"L2502/L2502_16.jpeg",
"L2502/L2502_17.jpeg",
"L2502/L2502_18.jpeg",
"L2502/L2502_19.jpeg",
"L2502/L2502_20.jpeg",
"L2502/L2502_21.jpeg",
"L2502/L2502_22.jpeg",
"L2502/L2502_23.jpeg",
"L2502/L2502_24.jpeg",
"L2502/L2502_25.jpeg",
"L2502/L2502_26.jpeg",
"L2502/L2502_27.jpeg",
"L2502/L2502_28.jpeg",
"L2502/L2502_29.jpeg",
"L2502/L2502_30.jpeg",
"L2502/L2502_31.jpeg",
"L2502/L2502_32.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"4-0.jpg"
]
} | Throughout the entire video. |
5 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}At what moment in the video does the action 'the person is also watching television' occur?",
"images_path": [
"02DPI/02DPI_0.jpeg",
"02DPI/02DPI_1.jpeg",
"02DPI/02DPI_2.jpeg",
"02DPI/02DPI_3.jpeg",
"02DPI/02DPI_4.jpeg",
"02DPI/02DPI_5.jpeg",
"02DPI/02DPI_6.jpeg",
"02DPI/02DPI_7.jpeg",
"02DPI/02DPI_8.jpeg",
"02DPI/02DPI_9.jpeg",
"02DPI/02DPI_10.jpeg",
"02DPI/02DPI_11.jpeg",
"02DPI/02DPI_12.jpeg",
"02DPI/02DPI_13.jpeg",
"02DPI/02DPI_14.jpeg",
"02DPI/02DPI_15.jpeg",
"02DPI/02DPI_16.jpeg",
"02DPI/02DPI_17.jpeg",
"02DPI/02DPI_18.jpeg",
"02DPI/02DPI_19.jpeg",
"02DPI/02DPI_20.jpeg",
"02DPI/02DPI_21.jpeg",
"02DPI/02DPI_22.jpeg",
"02DPI/02DPI_23.jpeg",
"02DPI/02DPI_24.jpeg",
"02DPI/02DPI_25.jpeg",
"02DPI/02DPI_26.jpeg",
"02DPI/02DPI_27.jpeg",
"02DPI/02DPI_28.jpeg",
"02DPI/02DPI_29.jpeg",
"02DPI/02DPI_30.jpeg",
"02DPI/02DPI_31.jpeg",
"02DPI/02DPI_32.jpeg",
"02DPI/02DPI_33.jpeg",
"02DPI/02DPI_34.jpeg",
"02DPI/02DPI_35.jpeg",
"02DPI/02DPI_36.jpeg",
"02DPI/02DPI_37.jpeg",
"02DPI/02DPI_38.jpeg",
"02DPI/02DPI_39.jpeg",
"02DPI/02DPI_40.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"5-0.jpg"
]
} | Throughout the entire video. |
6 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}When in the video sequence do we observe the action 'person reads a book'?",
"images_path": [
"TKAUR/TKAUR_0.jpeg",
"TKAUR/TKAUR_1.jpeg",
"TKAUR/TKAUR_2.jpeg",
"TKAUR/TKAUR_3.jpeg",
"TKAUR/TKAUR_4.jpeg",
"TKAUR/TKAUR_5.jpeg",
"TKAUR/TKAUR_6.jpeg",
"TKAUR/TKAUR_7.jpeg",
"TKAUR/TKAUR_8.jpeg",
"TKAUR/TKAUR_9.jpeg",
"TKAUR/TKAUR_10.jpeg",
"TKAUR/TKAUR_11.jpeg",
"TKAUR/TKAUR_12.jpeg",
"TKAUR/TKAUR_13.jpeg",
"TKAUR/TKAUR_14.jpeg",
"TKAUR/TKAUR_15.jpeg",
"TKAUR/TKAUR_16.jpeg",
"TKAUR/TKAUR_17.jpeg",
"TKAUR/TKAUR_18.jpeg",
"TKAUR/TKAUR_19.jpeg",
"TKAUR/TKAUR_20.jpeg",
"TKAUR/TKAUR_21.jpeg",
"TKAUR/TKAUR_22.jpeg",
"TKAUR/TKAUR_23.jpeg",
"TKAUR/TKAUR_24.jpeg",
"TKAUR/TKAUR_25.jpeg",
"TKAUR/TKAUR_26.jpeg",
"TKAUR/TKAUR_27.jpeg",
"TKAUR/TKAUR_28.jpeg",
"TKAUR/TKAUR_29.jpeg",
"TKAUR/TKAUR_30.jpeg",
"TKAUR/TKAUR_31.jpeg",
"TKAUR/TKAUR_32.jpeg",
"TKAUR/TKAUR_33.jpeg",
"TKAUR/TKAUR_34.jpeg",
"TKAUR/TKAUR_35.jpeg",
"TKAUR/TKAUR_36.jpeg",
"TKAUR/TKAUR_37.jpeg",
"TKAUR/TKAUR_38.jpeg",
"TKAUR/TKAUR_39.jpeg",
"TKAUR/TKAUR_40.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"6-0.jpg"
]
} | Throughout the entire video. |
7 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}Can you identify when the action 'person sits in a chair' happens in the video?",
"images_path": [
"NBMH9/NBMH9_0.jpeg",
"NBMH9/NBMH9_1.jpeg",
"NBMH9/NBMH9_2.jpeg",
"NBMH9/NBMH9_3.jpeg",
"NBMH9/NBMH9_4.jpeg",
"NBMH9/NBMH9_5.jpeg",
"NBMH9/NBMH9_6.jpeg",
"NBMH9/NBMH9_7.jpeg",
"NBMH9/NBMH9_8.jpeg",
"NBMH9/NBMH9_9.jpeg",
"NBMH9/NBMH9_10.jpeg",
"NBMH9/NBMH9_11.jpeg",
"NBMH9/NBMH9_12.jpeg",
"NBMH9/NBMH9_13.jpeg",
"NBMH9/NBMH9_14.jpeg",
"NBMH9/NBMH9_15.jpeg",
"NBMH9/NBMH9_16.jpeg",
"NBMH9/NBMH9_17.jpeg",
"NBMH9/NBMH9_18.jpeg",
"NBMH9/NBMH9_19.jpeg",
"NBMH9/NBMH9_20.jpeg",
"NBMH9/NBMH9_21.jpeg",
"NBMH9/NBMH9_22.jpeg",
"NBMH9/NBMH9_23.jpeg",
"NBMH9/NBMH9_24.jpeg",
"NBMH9/NBMH9_25.jpeg",
"NBMH9/NBMH9_26.jpeg",
"NBMH9/NBMH9_27.jpeg",
"NBMH9/NBMH9_28.jpeg",
"NBMH9/NBMH9_29.jpeg",
"NBMH9/NBMH9_30.jpeg",
"NBMH9/NBMH9_31.jpeg",
"NBMH9/NBMH9_32.jpeg",
"NBMH9/NBMH9_33.jpeg",
"NBMH9/NBMH9_34.jpeg",
"NBMH9/NBMH9_35.jpeg",
"NBMH9/NBMH9_36.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"7-0.jpg"
]
} | Throughout the entire video. |
8 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}At what moment in the video does the action 'the person picked up a phone to play with it' occur?",
"images_path": [
"J95U1/J95U1_0.jpeg",
"J95U1/J95U1_1.jpeg",
"J95U1/J95U1_2.jpeg",
"J95U1/J95U1_3.jpeg",
"J95U1/J95U1_4.jpeg",
"J95U1/J95U1_5.jpeg",
"J95U1/J95U1_6.jpeg",
"J95U1/J95U1_7.jpeg",
"J95U1/J95U1_8.jpeg",
"J95U1/J95U1_9.jpeg",
"J95U1/J95U1_10.jpeg",
"J95U1/J95U1_11.jpeg",
"J95U1/J95U1_12.jpeg",
"J95U1/J95U1_13.jpeg",
"J95U1/J95U1_14.jpeg",
"J95U1/J95U1_15.jpeg",
"J95U1/J95U1_16.jpeg",
"J95U1/J95U1_17.jpeg",
"J95U1/J95U1_18.jpeg",
"J95U1/J95U1_19.jpeg",
"J95U1/J95U1_20.jpeg",
"J95U1/J95U1_21.jpeg",
"J95U1/J95U1_22.jpeg",
"J95U1/J95U1_23.jpeg",
"J95U1/J95U1_24.jpeg",
"J95U1/J95U1_25.jpeg",
"J95U1/J95U1_26.jpeg",
"J95U1/J95U1_27.jpeg",
"J95U1/J95U1_28.jpeg",
"J95U1/J95U1_29.jpeg",
"J95U1/J95U1_30.jpeg",
"J95U1/J95U1_31.jpeg",
"J95U1/J95U1_32.jpeg",
"J95U1/J95U1_33.jpeg",
"J95U1/J95U1_34.jpeg",
"J95U1/J95U1_35.jpeg",
"J95U1/J95U1_36.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"8-0.jpg"
]
} | Throughout the entire video. |
9 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}In the given video, when does the action 'person looks out the window' take place?",
"images_path": [
"YAFX0/YAFX0_0.jpeg",
"YAFX0/YAFX0_1.jpeg",
"YAFX0/YAFX0_2.jpeg",
"YAFX0/YAFX0_3.jpeg",
"YAFX0/YAFX0_4.jpeg",
"YAFX0/YAFX0_5.jpeg",
"YAFX0/YAFX0_6.jpeg",
"YAFX0/YAFX0_7.jpeg",
"YAFX0/YAFX0_8.jpeg",
"YAFX0/YAFX0_9.jpeg",
"YAFX0/YAFX0_10.jpeg",
"YAFX0/YAFX0_11.jpeg",
"YAFX0/YAFX0_12.jpeg",
"YAFX0/YAFX0_13.jpeg",
"YAFX0/YAFX0_14.jpeg",
"YAFX0/YAFX0_15.jpeg",
"YAFX0/YAFX0_16.jpeg",
"YAFX0/YAFX0_17.jpeg",
"YAFX0/YAFX0_18.jpeg",
"YAFX0/YAFX0_19.jpeg",
"YAFX0/YAFX0_20.jpeg",
"YAFX0/YAFX0_21.jpeg",
"YAFX0/YAFX0_22.jpeg",
"YAFX0/YAFX0_23.jpeg",
"YAFX0/YAFX0_24.jpeg",
"YAFX0/YAFX0_25.jpeg",
"YAFX0/YAFX0_26.jpeg",
"YAFX0/YAFX0_27.jpeg",
"YAFX0/YAFX0_28.jpeg",
"YAFX0/YAFX0_29.jpeg",
"YAFX0/YAFX0_30.jpeg",
"YAFX0/YAFX0_31.jpeg",
"YAFX0/YAFX0_32.jpeg",
"YAFX0/YAFX0_33.jpeg",
"YAFX0/YAFX0_34.jpeg",
"YAFX0/YAFX0_35.jpeg",
"YAFX0/YAFX0_36.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"9-0.jpg"
]
} | Throughout the entire video. |
10 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}In the given video, when does the action 'person fixes their hair' take place?",
"images_path": [
"2UQKZ/2UQKZ_0.jpeg",
"2UQKZ/2UQKZ_1.jpeg",
"2UQKZ/2UQKZ_2.jpeg",
"2UQKZ/2UQKZ_3.jpeg",
"2UQKZ/2UQKZ_4.jpeg",
"2UQKZ/2UQKZ_5.jpeg",
"2UQKZ/2UQKZ_6.jpeg",
"2UQKZ/2UQKZ_7.jpeg",
"2UQKZ/2UQKZ_8.jpeg",
"2UQKZ/2UQKZ_9.jpeg",
"2UQKZ/2UQKZ_10.jpeg",
"2UQKZ/2UQKZ_11.jpeg",
"2UQKZ/2UQKZ_12.jpeg",
"2UQKZ/2UQKZ_13.jpeg",
"2UQKZ/2UQKZ_14.jpeg",
"2UQKZ/2UQKZ_15.jpeg",
"2UQKZ/2UQKZ_16.jpeg",
"2UQKZ/2UQKZ_17.jpeg",
"2UQKZ/2UQKZ_18.jpeg",
"2UQKZ/2UQKZ_19.jpeg",
"2UQKZ/2UQKZ_20.jpeg",
"2UQKZ/2UQKZ_21.jpeg",
"2UQKZ/2UQKZ_22.jpeg",
"2UQKZ/2UQKZ_23.jpeg",
"2UQKZ/2UQKZ_24.jpeg",
"2UQKZ/2UQKZ_25.jpeg",
"2UQKZ/2UQKZ_26.jpeg",
"2UQKZ/2UQKZ_27.jpeg",
"2UQKZ/2UQKZ_28.jpeg",
"2UQKZ/2UQKZ_29.jpeg",
"2UQKZ/2UQKZ_30.jpeg",
"2UQKZ/2UQKZ_31.jpeg",
"2UQKZ/2UQKZ_32.jpeg",
"2UQKZ/2UQKZ_33.jpeg",
"2UQKZ/2UQKZ_34.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"10-0.jpg"
]
} | Throughout the entire video. |
11 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}{image#46}{image#47}{image#48}{image#49}{image#50}{image#51}{image#52}{image#53}{image#54}{image#55}{image#56}{image#57}{image#58}During which part of the video does the action 'the person undresses (removes socks' occur?",
"images_path": [
"QMHK8/QMHK8_0.jpeg",
"QMHK8/QMHK8_1.jpeg",
"QMHK8/QMHK8_2.jpeg",
"QMHK8/QMHK8_3.jpeg",
"QMHK8/QMHK8_4.jpeg",
"QMHK8/QMHK8_5.jpeg",
"QMHK8/QMHK8_6.jpeg",
"QMHK8/QMHK8_7.jpeg",
"QMHK8/QMHK8_8.jpeg",
"QMHK8/QMHK8_9.jpeg",
"QMHK8/QMHK8_10.jpeg",
"QMHK8/QMHK8_11.jpeg",
"QMHK8/QMHK8_12.jpeg",
"QMHK8/QMHK8_13.jpeg",
"QMHK8/QMHK8_14.jpeg",
"QMHK8/QMHK8_15.jpeg",
"QMHK8/QMHK8_16.jpeg",
"QMHK8/QMHK8_17.jpeg",
"QMHK8/QMHK8_18.jpeg",
"QMHK8/QMHK8_19.jpeg",
"QMHK8/QMHK8_20.jpeg",
"QMHK8/QMHK8_21.jpeg",
"QMHK8/QMHK8_22.jpeg",
"QMHK8/QMHK8_23.jpeg",
"QMHK8/QMHK8_24.jpeg",
"QMHK8/QMHK8_25.jpeg",
"QMHK8/QMHK8_26.jpeg",
"QMHK8/QMHK8_27.jpeg",
"QMHK8/QMHK8_28.jpeg",
"QMHK8/QMHK8_29.jpeg",
"QMHK8/QMHK8_30.jpeg",
"QMHK8/QMHK8_31.jpeg",
"QMHK8/QMHK8_32.jpeg",
"QMHK8/QMHK8_33.jpeg",
"QMHK8/QMHK8_34.jpeg",
"QMHK8/QMHK8_35.jpeg",
"QMHK8/QMHK8_36.jpeg",
"QMHK8/QMHK8_37.jpeg",
"QMHK8/QMHK8_38.jpeg",
"QMHK8/QMHK8_39.jpeg",
"QMHK8/QMHK8_40.jpeg",
"QMHK8/QMHK8_41.jpeg",
"QMHK8/QMHK8_42.jpeg",
"QMHK8/QMHK8_43.jpeg",
"QMHK8/QMHK8_44.jpeg",
"QMHK8/QMHK8_45.jpeg",
"QMHK8/QMHK8_46.jpeg",
"QMHK8/QMHK8_47.jpeg",
"QMHK8/QMHK8_48.jpeg",
"QMHK8/QMHK8_49.jpeg",
"QMHK8/QMHK8_50.jpeg",
"QMHK8/QMHK8_51.jpeg",
"QMHK8/QMHK8_52.jpeg",
"QMHK8/QMHK8_53.jpeg",
"QMHK8/QMHK8_54.jpeg",
"QMHK8/QMHK8_55.jpeg",
"QMHK8/QMHK8_56.jpeg",
"QMHK8/QMHK8_57.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"11-0.jpg"
]
} | Throughout the entire video. |
12 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}Can you identify when the action 'person putting books' happens in the video?",
"images_path": [
"6IL0C/6IL0C_0.jpeg",
"6IL0C/6IL0C_1.jpeg",
"6IL0C/6IL0C_2.jpeg",
"6IL0C/6IL0C_3.jpeg",
"6IL0C/6IL0C_4.jpeg",
"6IL0C/6IL0C_5.jpeg",
"6IL0C/6IL0C_6.jpeg",
"6IL0C/6IL0C_7.jpeg",
"6IL0C/6IL0C_8.jpeg",
"6IL0C/6IL0C_9.jpeg",
"6IL0C/6IL0C_10.jpeg",
"6IL0C/6IL0C_11.jpeg",
"6IL0C/6IL0C_12.jpeg",
"6IL0C/6IL0C_13.jpeg",
"6IL0C/6IL0C_14.jpeg",
"6IL0C/6IL0C_15.jpeg",
"6IL0C/6IL0C_16.jpeg",
"6IL0C/6IL0C_17.jpeg",
"6IL0C/6IL0C_18.jpeg",
"6IL0C/6IL0C_19.jpeg",
"6IL0C/6IL0C_20.jpeg",
"6IL0C/6IL0C_21.jpeg",
"6IL0C/6IL0C_22.jpeg",
"6IL0C/6IL0C_23.jpeg",
"6IL0C/6IL0C_24.jpeg",
"6IL0C/6IL0C_25.jpeg",
"6IL0C/6IL0C_26.jpeg",
"6IL0C/6IL0C_27.jpeg",
"6IL0C/6IL0C_28.jpeg",
"6IL0C/6IL0C_29.jpeg",
"6IL0C/6IL0C_30.jpeg",
"6IL0C/6IL0C_31.jpeg",
"6IL0C/6IL0C_32.jpeg",
"6IL0C/6IL0C_33.jpeg",
"6IL0C/6IL0C_34.jpeg",
"6IL0C/6IL0C_35.jpeg",
"6IL0C/6IL0C_36.jpeg",
"6IL0C/6IL0C_37.jpeg",
"6IL0C/6IL0C_38.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"12-0.jpg"
]
} | Throughout the entire video. |
13 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}When in the video sequence do we observe the action 'person starts washing clothes'?",
"images_path": [
"G6K7T/G6K7T_0.jpeg",
"G6K7T/G6K7T_1.jpeg",
"G6K7T/G6K7T_2.jpeg",
"G6K7T/G6K7T_3.jpeg",
"G6K7T/G6K7T_4.jpeg",
"G6K7T/G6K7T_5.jpeg",
"G6K7T/G6K7T_6.jpeg",
"G6K7T/G6K7T_7.jpeg",
"G6K7T/G6K7T_8.jpeg",
"G6K7T/G6K7T_9.jpeg",
"G6K7T/G6K7T_10.jpeg",
"G6K7T/G6K7T_11.jpeg",
"G6K7T/G6K7T_12.jpeg",
"G6K7T/G6K7T_13.jpeg",
"G6K7T/G6K7T_14.jpeg",
"G6K7T/G6K7T_15.jpeg",
"G6K7T/G6K7T_16.jpeg",
"G6K7T/G6K7T_17.jpeg",
"G6K7T/G6K7T_18.jpeg",
"G6K7T/G6K7T_19.jpeg",
"G6K7T/G6K7T_20.jpeg",
"G6K7T/G6K7T_21.jpeg",
"G6K7T/G6K7T_22.jpeg",
"G6K7T/G6K7T_23.jpeg",
"G6K7T/G6K7T_24.jpeg",
"G6K7T/G6K7T_25.jpeg",
"G6K7T/G6K7T_26.jpeg",
"G6K7T/G6K7T_27.jpeg",
"G6K7T/G6K7T_28.jpeg",
"G6K7T/G6K7T_29.jpeg",
"G6K7T/G6K7T_30.jpeg",
"G6K7T/G6K7T_31.jpeg",
"G6K7T/G6K7T_32.jpeg",
"G6K7T/G6K7T_33.jpeg",
"G6K7T/G6K7T_34.jpeg",
"G6K7T/G6K7T_35.jpeg",
"G6K7T/G6K7T_36.jpeg",
"G6K7T/G6K7T_37.jpeg",
"G6K7T/G6K7T_38.jpeg",
"G6K7T/G6K7T_39.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"13-0.jpg"
]
} | Throughout the entire video. |
14 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}When in the video sequence do we observe the action 'person puts on shoes'?",
"images_path": [
"OWRDE/OWRDE_0.jpeg",
"OWRDE/OWRDE_1.jpeg",
"OWRDE/OWRDE_2.jpeg",
"OWRDE/OWRDE_3.jpeg",
"OWRDE/OWRDE_4.jpeg",
"OWRDE/OWRDE_5.jpeg",
"OWRDE/OWRDE_6.jpeg",
"OWRDE/OWRDE_7.jpeg",
"OWRDE/OWRDE_8.jpeg",
"OWRDE/OWRDE_9.jpeg",
"OWRDE/OWRDE_10.jpeg",
"OWRDE/OWRDE_11.jpeg",
"OWRDE/OWRDE_12.jpeg",
"OWRDE/OWRDE_13.jpeg",
"OWRDE/OWRDE_14.jpeg",
"OWRDE/OWRDE_15.jpeg",
"OWRDE/OWRDE_16.jpeg",
"OWRDE/OWRDE_17.jpeg",
"OWRDE/OWRDE_18.jpeg",
"OWRDE/OWRDE_19.jpeg",
"OWRDE/OWRDE_20.jpeg",
"OWRDE/OWRDE_21.jpeg",
"OWRDE/OWRDE_22.jpeg",
"OWRDE/OWRDE_23.jpeg",
"OWRDE/OWRDE_24.jpeg",
"OWRDE/OWRDE_25.jpeg",
"OWRDE/OWRDE_26.jpeg",
"OWRDE/OWRDE_27.jpeg",
"OWRDE/OWRDE_28.jpeg",
"OWRDE/OWRDE_29.jpeg",
"OWRDE/OWRDE_30.jpeg",
"OWRDE/OWRDE_31.jpeg",
"OWRDE/OWRDE_32.jpeg",
"OWRDE/OWRDE_33.jpeg",
"OWRDE/OWRDE_34.jpeg",
"OWRDE/OWRDE_35.jpeg",
"OWRDE/OWRDE_36.jpeg",
"OWRDE/OWRDE_37.jpeg",
"OWRDE/OWRDE_38.jpeg",
"OWRDE/OWRDE_39.jpeg",
"OWRDE/OWRDE_40.jpeg",
"OWRDE/OWRDE_41.jpeg",
"OWRDE/OWRDE_42.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"14-0.jpg"
]
} | Throughout the entire video. |
15 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}At what moment in the video does the action 'a person is walking in a room holding towel' occur?",
"images_path": [
"LQMXW/LQMXW_0.jpeg",
"LQMXW/LQMXW_1.jpeg",
"LQMXW/LQMXW_2.jpeg",
"LQMXW/LQMXW_3.jpeg",
"LQMXW/LQMXW_4.jpeg",
"LQMXW/LQMXW_5.jpeg",
"LQMXW/LQMXW_6.jpeg",
"LQMXW/LQMXW_7.jpeg",
"LQMXW/LQMXW_8.jpeg",
"LQMXW/LQMXW_9.jpeg",
"LQMXW/LQMXW_10.jpeg",
"LQMXW/LQMXW_11.jpeg",
"LQMXW/LQMXW_12.jpeg",
"LQMXW/LQMXW_13.jpeg",
"LQMXW/LQMXW_14.jpeg",
"LQMXW/LQMXW_15.jpeg",
"LQMXW/LQMXW_16.jpeg",
"LQMXW/LQMXW_17.jpeg",
"LQMXW/LQMXW_18.jpeg",
"LQMXW/LQMXW_19.jpeg",
"LQMXW/LQMXW_20.jpeg",
"LQMXW/LQMXW_21.jpeg",
"LQMXW/LQMXW_22.jpeg",
"LQMXW/LQMXW_23.jpeg",
"LQMXW/LQMXW_24.jpeg",
"LQMXW/LQMXW_25.jpeg",
"LQMXW/LQMXW_26.jpeg",
"LQMXW/LQMXW_27.jpeg",
"LQMXW/LQMXW_28.jpeg",
"LQMXW/LQMXW_29.jpeg",
"LQMXW/LQMXW_30.jpeg",
"LQMXW/LQMXW_31.jpeg",
"LQMXW/LQMXW_32.jpeg",
"LQMXW/LQMXW_33.jpeg",
"LQMXW/LQMXW_34.jpeg",
"LQMXW/LQMXW_35.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"15-0.jpg"
]
} | Throughout the entire video. |
16 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}{image#46}{image#47}{image#48}When in the video sequence do we observe the action 'a person is also dressing'?",
"images_path": [
"HXUI5/HXUI5_0.jpeg",
"HXUI5/HXUI5_1.jpeg",
"HXUI5/HXUI5_2.jpeg",
"HXUI5/HXUI5_3.jpeg",
"HXUI5/HXUI5_4.jpeg",
"HXUI5/HXUI5_5.jpeg",
"HXUI5/HXUI5_6.jpeg",
"HXUI5/HXUI5_7.jpeg",
"HXUI5/HXUI5_8.jpeg",
"HXUI5/HXUI5_9.jpeg",
"HXUI5/HXUI5_10.jpeg",
"HXUI5/HXUI5_11.jpeg",
"HXUI5/HXUI5_12.jpeg",
"HXUI5/HXUI5_13.jpeg",
"HXUI5/HXUI5_14.jpeg",
"HXUI5/HXUI5_15.jpeg",
"HXUI5/HXUI5_16.jpeg",
"HXUI5/HXUI5_17.jpeg",
"HXUI5/HXUI5_18.jpeg",
"HXUI5/HXUI5_19.jpeg",
"HXUI5/HXUI5_20.jpeg",
"HXUI5/HXUI5_21.jpeg",
"HXUI5/HXUI5_22.jpeg",
"HXUI5/HXUI5_23.jpeg",
"HXUI5/HXUI5_24.jpeg",
"HXUI5/HXUI5_25.jpeg",
"HXUI5/HXUI5_26.jpeg",
"HXUI5/HXUI5_27.jpeg",
"HXUI5/HXUI5_28.jpeg",
"HXUI5/HXUI5_29.jpeg",
"HXUI5/HXUI5_30.jpeg",
"HXUI5/HXUI5_31.jpeg",
"HXUI5/HXUI5_32.jpeg",
"HXUI5/HXUI5_33.jpeg",
"HXUI5/HXUI5_34.jpeg",
"HXUI5/HXUI5_35.jpeg",
"HXUI5/HXUI5_36.jpeg",
"HXUI5/HXUI5_37.jpeg",
"HXUI5/HXUI5_38.jpeg",
"HXUI5/HXUI5_39.jpeg",
"HXUI5/HXUI5_40.jpeg",
"HXUI5/HXUI5_41.jpeg",
"HXUI5/HXUI5_42.jpeg",
"HXUI5/HXUI5_43.jpeg",
"HXUI5/HXUI5_44.jpeg",
"HXUI5/HXUI5_45.jpeg",
"HXUI5/HXUI5_46.jpeg",
"HXUI5/HXUI5_47.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"16-0.jpg"
]
} | Throughout the entire video. |
17 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}At what moment in the video does the action 'person eats some food' occur?",
"images_path": [
"28BVI/28BVI_0.jpeg",
"28BVI/28BVI_1.jpeg",
"28BVI/28BVI_2.jpeg",
"28BVI/28BVI_3.jpeg",
"28BVI/28BVI_4.jpeg",
"28BVI/28BVI_5.jpeg",
"28BVI/28BVI_6.jpeg",
"28BVI/28BVI_7.jpeg",
"28BVI/28BVI_8.jpeg",
"28BVI/28BVI_9.jpeg",
"28BVI/28BVI_10.jpeg",
"28BVI/28BVI_11.jpeg",
"28BVI/28BVI_12.jpeg",
"28BVI/28BVI_13.jpeg",
"28BVI/28BVI_14.jpeg",
"28BVI/28BVI_15.jpeg",
"28BVI/28BVI_16.jpeg",
"28BVI/28BVI_17.jpeg",
"28BVI/28BVI_18.jpeg",
"28BVI/28BVI_19.jpeg",
"28BVI/28BVI_20.jpeg",
"28BVI/28BVI_21.jpeg",
"28BVI/28BVI_22.jpeg",
"28BVI/28BVI_23.jpeg",
"28BVI/28BVI_24.jpeg",
"28BVI/28BVI_25.jpeg",
"28BVI/28BVI_26.jpeg",
"28BVI/28BVI_27.jpeg",
"28BVI/28BVI_28.jpeg",
"28BVI/28BVI_29.jpeg",
"28BVI/28BVI_30.jpeg",
"28BVI/28BVI_31.jpeg",
"28BVI/28BVI_32.jpeg",
"28BVI/28BVI_33.jpeg",
"28BVI/28BVI_34.jpeg",
"28BVI/28BVI_35.jpeg",
"28BVI/28BVI_36.jpeg",
"28BVI/28BVI_37.jpeg",
"28BVI/28BVI_38.jpeg",
"28BVI/28BVI_39.jpeg",
"28BVI/28BVI_40.jpeg",
"28BVI/28BVI_41.jpeg",
"28BVI/28BVI_42.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"17-0.jpg"
]
} | Throughout the entire video. |
18 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}Can you identify when the action 'person open a bag' happens in the video?",
"images_path": [
"3W1GP/3W1GP_0.jpeg",
"3W1GP/3W1GP_1.jpeg",
"3W1GP/3W1GP_2.jpeg",
"3W1GP/3W1GP_3.jpeg",
"3W1GP/3W1GP_4.jpeg",
"3W1GP/3W1GP_5.jpeg",
"3W1GP/3W1GP_6.jpeg",
"3W1GP/3W1GP_7.jpeg",
"3W1GP/3W1GP_8.jpeg",
"3W1GP/3W1GP_9.jpeg",
"3W1GP/3W1GP_10.jpeg",
"3W1GP/3W1GP_11.jpeg",
"3W1GP/3W1GP_12.jpeg",
"3W1GP/3W1GP_13.jpeg",
"3W1GP/3W1GP_14.jpeg",
"3W1GP/3W1GP_15.jpeg",
"3W1GP/3W1GP_16.jpeg",
"3W1GP/3W1GP_17.jpeg",
"3W1GP/3W1GP_18.jpeg",
"3W1GP/3W1GP_19.jpeg",
"3W1GP/3W1GP_20.jpeg",
"3W1GP/3W1GP_21.jpeg",
"3W1GP/3W1GP_22.jpeg",
"3W1GP/3W1GP_23.jpeg",
"3W1GP/3W1GP_24.jpeg",
"3W1GP/3W1GP_25.jpeg",
"3W1GP/3W1GP_26.jpeg",
"3W1GP/3W1GP_27.jpeg",
"3W1GP/3W1GP_28.jpeg",
"3W1GP/3W1GP_29.jpeg",
"3W1GP/3W1GP_30.jpeg",
"3W1GP/3W1GP_31.jpeg",
"3W1GP/3W1GP_32.jpeg",
"3W1GP/3W1GP_33.jpeg",
"3W1GP/3W1GP_34.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"18-0.jpg"
]
} | Throughout the entire video. |
19 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}When in the video sequence do we observe the action 'one person undresses in front of a wardrobe'?",
"images_path": [
"BDY1V/BDY1V_0.jpeg",
"BDY1V/BDY1V_1.jpeg",
"BDY1V/BDY1V_2.jpeg",
"BDY1V/BDY1V_3.jpeg",
"BDY1V/BDY1V_4.jpeg",
"BDY1V/BDY1V_5.jpeg",
"BDY1V/BDY1V_6.jpeg",
"BDY1V/BDY1V_7.jpeg",
"BDY1V/BDY1V_8.jpeg",
"BDY1V/BDY1V_9.jpeg",
"BDY1V/BDY1V_10.jpeg",
"BDY1V/BDY1V_11.jpeg",
"BDY1V/BDY1V_12.jpeg",
"BDY1V/BDY1V_13.jpeg",
"BDY1V/BDY1V_14.jpeg",
"BDY1V/BDY1V_15.jpeg",
"BDY1V/BDY1V_16.jpeg",
"BDY1V/BDY1V_17.jpeg",
"BDY1V/BDY1V_18.jpeg",
"BDY1V/BDY1V_19.jpeg",
"BDY1V/BDY1V_20.jpeg",
"BDY1V/BDY1V_21.jpeg",
"BDY1V/BDY1V_22.jpeg",
"BDY1V/BDY1V_23.jpeg",
"BDY1V/BDY1V_24.jpeg",
"BDY1V/BDY1V_25.jpeg",
"BDY1V/BDY1V_26.jpeg",
"BDY1V/BDY1V_27.jpeg",
"BDY1V/BDY1V_28.jpeg",
"BDY1V/BDY1V_29.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"19-0.jpg"
]
} | Throughout the entire video. |
20 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}During which part of the video does the action 'a person is sitting on a couch watching tv' occur?",
"images_path": [
"0TKKR/0TKKR_0.jpeg",
"0TKKR/0TKKR_1.jpeg",
"0TKKR/0TKKR_2.jpeg",
"0TKKR/0TKKR_3.jpeg",
"0TKKR/0TKKR_4.jpeg",
"0TKKR/0TKKR_5.jpeg",
"0TKKR/0TKKR_6.jpeg",
"0TKKR/0TKKR_7.jpeg",
"0TKKR/0TKKR_8.jpeg",
"0TKKR/0TKKR_9.jpeg",
"0TKKR/0TKKR_10.jpeg",
"0TKKR/0TKKR_11.jpeg",
"0TKKR/0TKKR_12.jpeg",
"0TKKR/0TKKR_13.jpeg",
"0TKKR/0TKKR_14.jpeg",
"0TKKR/0TKKR_15.jpeg",
"0TKKR/0TKKR_16.jpeg",
"0TKKR/0TKKR_17.jpeg",
"0TKKR/0TKKR_18.jpeg",
"0TKKR/0TKKR_19.jpeg",
"0TKKR/0TKKR_20.jpeg",
"0TKKR/0TKKR_21.jpeg",
"0TKKR/0TKKR_22.jpeg",
"0TKKR/0TKKR_23.jpeg",
"0TKKR/0TKKR_24.jpeg",
"0TKKR/0TKKR_25.jpeg",
"0TKKR/0TKKR_26.jpeg",
"0TKKR/0TKKR_27.jpeg",
"0TKKR/0TKKR_28.jpeg",
"0TKKR/0TKKR_29.jpeg",
"0TKKR/0TKKR_30.jpeg",
"0TKKR/0TKKR_31.jpeg",
"0TKKR/0TKKR_32.jpeg",
"0TKKR/0TKKR_33.jpeg",
"0TKKR/0TKKR_34.jpeg",
"0TKKR/0TKKR_35.jpeg",
"0TKKR/0TKKR_36.jpeg",
"0TKKR/0TKKR_37.jpeg",
"0TKKR/0TKKR_38.jpeg",
"0TKKR/0TKKR_39.jpeg",
"0TKKR/0TKKR_40.jpeg",
"0TKKR/0TKKR_41.jpeg",
"0TKKR/0TKKR_42.jpeg",
"0TKKR/0TKKR_43.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"20-0.jpg"
]
} | Throughout the entire video. |
21 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}When in the video sequence do we observe the action 'person starts laughing'?",
"images_path": [
"AX46Z/AX46Z_0.jpeg",
"AX46Z/AX46Z_1.jpeg",
"AX46Z/AX46Z_2.jpeg",
"AX46Z/AX46Z_3.jpeg",
"AX46Z/AX46Z_4.jpeg",
"AX46Z/AX46Z_5.jpeg",
"AX46Z/AX46Z_6.jpeg",
"AX46Z/AX46Z_7.jpeg",
"AX46Z/AX46Z_8.jpeg",
"AX46Z/AX46Z_9.jpeg",
"AX46Z/AX46Z_10.jpeg",
"AX46Z/AX46Z_11.jpeg",
"AX46Z/AX46Z_12.jpeg",
"AX46Z/AX46Z_13.jpeg",
"AX46Z/AX46Z_14.jpeg",
"AX46Z/AX46Z_15.jpeg",
"AX46Z/AX46Z_16.jpeg",
"AX46Z/AX46Z_17.jpeg",
"AX46Z/AX46Z_18.jpeg",
"AX46Z/AX46Z_19.jpeg",
"AX46Z/AX46Z_20.jpeg",
"AX46Z/AX46Z_21.jpeg",
"AX46Z/AX46Z_22.jpeg",
"AX46Z/AX46Z_23.jpeg",
"AX46Z/AX46Z_24.jpeg",
"AX46Z/AX46Z_25.jpeg",
"AX46Z/AX46Z_26.jpeg",
"AX46Z/AX46Z_27.jpeg",
"AX46Z/AX46Z_28.jpeg",
"AX46Z/AX46Z_29.jpeg",
"AX46Z/AX46Z_30.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"21-0.jpg"
]
} | Throughout the entire video. |
22 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}During which part of the video does the action 'person walking through a doorway in the living room' occur?",
"images_path": [
"81VSN/81VSN_0.jpeg",
"81VSN/81VSN_1.jpeg",
"81VSN/81VSN_2.jpeg",
"81VSN/81VSN_3.jpeg",
"81VSN/81VSN_4.jpeg",
"81VSN/81VSN_5.jpeg",
"81VSN/81VSN_6.jpeg",
"81VSN/81VSN_7.jpeg",
"81VSN/81VSN_8.jpeg",
"81VSN/81VSN_9.jpeg",
"81VSN/81VSN_10.jpeg",
"81VSN/81VSN_11.jpeg",
"81VSN/81VSN_12.jpeg",
"81VSN/81VSN_13.jpeg",
"81VSN/81VSN_14.jpeg",
"81VSN/81VSN_15.jpeg",
"81VSN/81VSN_16.jpeg",
"81VSN/81VSN_17.jpeg",
"81VSN/81VSN_18.jpeg",
"81VSN/81VSN_19.jpeg",
"81VSN/81VSN_20.jpeg",
"81VSN/81VSN_21.jpeg",
"81VSN/81VSN_22.jpeg",
"81VSN/81VSN_23.jpeg",
"81VSN/81VSN_24.jpeg",
"81VSN/81VSN_25.jpeg",
"81VSN/81VSN_26.jpeg",
"81VSN/81VSN_27.jpeg",
"81VSN/81VSN_28.jpeg",
"81VSN/81VSN_29.jpeg",
"81VSN/81VSN_30.jpeg",
"81VSN/81VSN_31.jpeg",
"81VSN/81VSN_32.jpeg",
"81VSN/81VSN_33.jpeg",
"81VSN/81VSN_34.jpeg",
"81VSN/81VSN_35.jpeg",
"81VSN/81VSN_36.jpeg",
"81VSN/81VSN_37.jpeg",
"81VSN/81VSN_38.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"22-0.jpg"
]
} | Throughout the entire video. |
23 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}{image#46}When in the video sequence do we observe the action 'person quickly undressing'?",
"images_path": [
"RQRRD/RQRRD_0.jpeg",
"RQRRD/RQRRD_1.jpeg",
"RQRRD/RQRRD_2.jpeg",
"RQRRD/RQRRD_3.jpeg",
"RQRRD/RQRRD_4.jpeg",
"RQRRD/RQRRD_5.jpeg",
"RQRRD/RQRRD_6.jpeg",
"RQRRD/RQRRD_7.jpeg",
"RQRRD/RQRRD_8.jpeg",
"RQRRD/RQRRD_9.jpeg",
"RQRRD/RQRRD_10.jpeg",
"RQRRD/RQRRD_11.jpeg",
"RQRRD/RQRRD_12.jpeg",
"RQRRD/RQRRD_13.jpeg",
"RQRRD/RQRRD_14.jpeg",
"RQRRD/RQRRD_15.jpeg",
"RQRRD/RQRRD_16.jpeg",
"RQRRD/RQRRD_17.jpeg",
"RQRRD/RQRRD_18.jpeg",
"RQRRD/RQRRD_19.jpeg",
"RQRRD/RQRRD_20.jpeg",
"RQRRD/RQRRD_21.jpeg",
"RQRRD/RQRRD_22.jpeg",
"RQRRD/RQRRD_23.jpeg",
"RQRRD/RQRRD_24.jpeg",
"RQRRD/RQRRD_25.jpeg",
"RQRRD/RQRRD_26.jpeg",
"RQRRD/RQRRD_27.jpeg",
"RQRRD/RQRRD_28.jpeg",
"RQRRD/RQRRD_29.jpeg",
"RQRRD/RQRRD_30.jpeg",
"RQRRD/RQRRD_31.jpeg",
"RQRRD/RQRRD_32.jpeg",
"RQRRD/RQRRD_33.jpeg",
"RQRRD/RQRRD_34.jpeg",
"RQRRD/RQRRD_35.jpeg",
"RQRRD/RQRRD_36.jpeg",
"RQRRD/RQRRD_37.jpeg",
"RQRRD/RQRRD_38.jpeg",
"RQRRD/RQRRD_39.jpeg",
"RQRRD/RQRRD_40.jpeg",
"RQRRD/RQRRD_41.jpeg",
"RQRRD/RQRRD_42.jpeg",
"RQRRD/RQRRD_43.jpeg",
"RQRRD/RQRRD_44.jpeg",
"RQRRD/RQRRD_45.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"23-0.jpg"
]
} | Throughout the entire video. |
24 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}Can you identify when the action 'a person is dressing in front of a mirror' happens in the video?",
"images_path": [
"3P055/3P055_0.jpeg",
"3P055/3P055_1.jpeg",
"3P055/3P055_2.jpeg",
"3P055/3P055_3.jpeg",
"3P055/3P055_4.jpeg",
"3P055/3P055_5.jpeg",
"3P055/3P055_6.jpeg",
"3P055/3P055_7.jpeg",
"3P055/3P055_8.jpeg",
"3P055/3P055_9.jpeg",
"3P055/3P055_10.jpeg",
"3P055/3P055_11.jpeg",
"3P055/3P055_12.jpeg",
"3P055/3P055_13.jpeg",
"3P055/3P055_14.jpeg",
"3P055/3P055_15.jpeg",
"3P055/3P055_16.jpeg",
"3P055/3P055_17.jpeg",
"3P055/3P055_18.jpeg",
"3P055/3P055_19.jpeg",
"3P055/3P055_20.jpeg",
"3P055/3P055_21.jpeg",
"3P055/3P055_22.jpeg",
"3P055/3P055_23.jpeg",
"3P055/3P055_24.jpeg",
"3P055/3P055_25.jpeg",
"3P055/3P055_26.jpeg",
"3P055/3P055_27.jpeg",
"3P055/3P055_28.jpeg",
"3P055/3P055_29.jpeg",
"3P055/3P055_30.jpeg",
"3P055/3P055_31.jpeg",
"3P055/3P055_32.jpeg",
"3P055/3P055_33.jpeg",
"3P055/3P055_34.jpeg",
"3P055/3P055_35.jpeg",
"3P055/3P055_36.jpeg",
"3P055/3P055_37.jpeg",
"3P055/3P055_38.jpeg",
"3P055/3P055_39.jpeg",
"3P055/3P055_40.jpeg",
"3P055/3P055_41.jpeg",
"3P055/3P055_42.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"24-0.jpg"
]
} | Throughout the entire video. |
25 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}When in the video sequence do we observe the action 'person putting away groceries'?",
"images_path": [
"S4P5J/S4P5J_0.jpeg",
"S4P5J/S4P5J_1.jpeg",
"S4P5J/S4P5J_2.jpeg",
"S4P5J/S4P5J_3.jpeg",
"S4P5J/S4P5J_4.jpeg",
"S4P5J/S4P5J_5.jpeg",
"S4P5J/S4P5J_6.jpeg",
"S4P5J/S4P5J_7.jpeg",
"S4P5J/S4P5J_8.jpeg",
"S4P5J/S4P5J_9.jpeg",
"S4P5J/S4P5J_10.jpeg",
"S4P5J/S4P5J_11.jpeg",
"S4P5J/S4P5J_12.jpeg",
"S4P5J/S4P5J_13.jpeg",
"S4P5J/S4P5J_14.jpeg",
"S4P5J/S4P5J_15.jpeg",
"S4P5J/S4P5J_16.jpeg",
"S4P5J/S4P5J_17.jpeg",
"S4P5J/S4P5J_18.jpeg",
"S4P5J/S4P5J_19.jpeg",
"S4P5J/S4P5J_20.jpeg",
"S4P5J/S4P5J_21.jpeg",
"S4P5J/S4P5J_22.jpeg",
"S4P5J/S4P5J_23.jpeg",
"S4P5J/S4P5J_24.jpeg",
"S4P5J/S4P5J_25.jpeg",
"S4P5J/S4P5J_26.jpeg",
"S4P5J/S4P5J_27.jpeg",
"S4P5J/S4P5J_28.jpeg",
"S4P5J/S4P5J_29.jpeg",
"S4P5J/S4P5J_30.jpeg",
"S4P5J/S4P5J_31.jpeg",
"S4P5J/S4P5J_32.jpeg",
"S4P5J/S4P5J_33.jpeg",
"S4P5J/S4P5J_34.jpeg",
"S4P5J/S4P5J_35.jpeg",
"S4P5J/S4P5J_36.jpeg",
"S4P5J/S4P5J_37.jpeg",
"S4P5J/S4P5J_38.jpeg",
"S4P5J/S4P5J_39.jpeg",
"S4P5J/S4P5J_40.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"25-0.jpg"
]
} | Throughout the entire video. |
26 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}During which part of the video does the action 'person sitting on bed' occur?",
"images_path": [
"V1WN7/V1WN7_0.jpeg",
"V1WN7/V1WN7_1.jpeg",
"V1WN7/V1WN7_2.jpeg",
"V1WN7/V1WN7_3.jpeg",
"V1WN7/V1WN7_4.jpeg",
"V1WN7/V1WN7_5.jpeg",
"V1WN7/V1WN7_6.jpeg",
"V1WN7/V1WN7_7.jpeg",
"V1WN7/V1WN7_8.jpeg",
"V1WN7/V1WN7_9.jpeg",
"V1WN7/V1WN7_10.jpeg",
"V1WN7/V1WN7_11.jpeg",
"V1WN7/V1WN7_12.jpeg",
"V1WN7/V1WN7_13.jpeg",
"V1WN7/V1WN7_14.jpeg",
"V1WN7/V1WN7_15.jpeg",
"V1WN7/V1WN7_16.jpeg",
"V1WN7/V1WN7_17.jpeg",
"V1WN7/V1WN7_18.jpeg",
"V1WN7/V1WN7_19.jpeg",
"V1WN7/V1WN7_20.jpeg",
"V1WN7/V1WN7_21.jpeg",
"V1WN7/V1WN7_22.jpeg",
"V1WN7/V1WN7_23.jpeg",
"V1WN7/V1WN7_24.jpeg",
"V1WN7/V1WN7_25.jpeg",
"V1WN7/V1WN7_26.jpeg",
"V1WN7/V1WN7_27.jpeg",
"V1WN7/V1WN7_28.jpeg",
"V1WN7/V1WN7_29.jpeg",
"V1WN7/V1WN7_30.jpeg",
"V1WN7/V1WN7_31.jpeg",
"V1WN7/V1WN7_32.jpeg",
"V1WN7/V1WN7_33.jpeg",
"V1WN7/V1WN7_34.jpeg",
"V1WN7/V1WN7_35.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"26-0.jpg"
]
} | Throughout the entire video. |
27 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}During which part of the video does the action 'person looks out the window' occur?",
"images_path": [
"VINM0/VINM0_0.jpeg",
"VINM0/VINM0_1.jpeg",
"VINM0/VINM0_2.jpeg",
"VINM0/VINM0_3.jpeg",
"VINM0/VINM0_4.jpeg",
"VINM0/VINM0_5.jpeg",
"VINM0/VINM0_6.jpeg",
"VINM0/VINM0_7.jpeg",
"VINM0/VINM0_8.jpeg",
"VINM0/VINM0_9.jpeg",
"VINM0/VINM0_10.jpeg",
"VINM0/VINM0_11.jpeg",
"VINM0/VINM0_12.jpeg",
"VINM0/VINM0_13.jpeg",
"VINM0/VINM0_14.jpeg",
"VINM0/VINM0_15.jpeg",
"VINM0/VINM0_16.jpeg",
"VINM0/VINM0_17.jpeg",
"VINM0/VINM0_18.jpeg",
"VINM0/VINM0_19.jpeg",
"VINM0/VINM0_20.jpeg",
"VINM0/VINM0_21.jpeg",
"VINM0/VINM0_22.jpeg",
"VINM0/VINM0_23.jpeg",
"VINM0/VINM0_24.jpeg",
"VINM0/VINM0_25.jpeg",
"VINM0/VINM0_26.jpeg",
"VINM0/VINM0_27.jpeg",
"VINM0/VINM0_28.jpeg",
"VINM0/VINM0_29.jpeg",
"VINM0/VINM0_30.jpeg",
"VINM0/VINM0_31.jpeg",
"VINM0/VINM0_32.jpeg",
"VINM0/VINM0_33.jpeg",
"VINM0/VINM0_34.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"27-0.jpg"
]
} | Throughout the entire video. |
28 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}In the given video, when does the action 'person pour a glass of soda' take place?",
"images_path": [
"D0AGO/D0AGO_0.jpeg",
"D0AGO/D0AGO_1.jpeg",
"D0AGO/D0AGO_2.jpeg",
"D0AGO/D0AGO_3.jpeg",
"D0AGO/D0AGO_4.jpeg",
"D0AGO/D0AGO_5.jpeg",
"D0AGO/D0AGO_6.jpeg",
"D0AGO/D0AGO_7.jpeg",
"D0AGO/D0AGO_8.jpeg",
"D0AGO/D0AGO_9.jpeg",
"D0AGO/D0AGO_10.jpeg",
"D0AGO/D0AGO_11.jpeg",
"D0AGO/D0AGO_12.jpeg",
"D0AGO/D0AGO_13.jpeg",
"D0AGO/D0AGO_14.jpeg",
"D0AGO/D0AGO_15.jpeg",
"D0AGO/D0AGO_16.jpeg",
"D0AGO/D0AGO_17.jpeg",
"D0AGO/D0AGO_18.jpeg",
"D0AGO/D0AGO_19.jpeg",
"D0AGO/D0AGO_20.jpeg",
"D0AGO/D0AGO_21.jpeg",
"D0AGO/D0AGO_22.jpeg",
"D0AGO/D0AGO_23.jpeg",
"D0AGO/D0AGO_24.jpeg",
"D0AGO/D0AGO_25.jpeg",
"D0AGO/D0AGO_26.jpeg",
"D0AGO/D0AGO_27.jpeg",
"D0AGO/D0AGO_28.jpeg",
"D0AGO/D0AGO_29.jpeg",
"D0AGO/D0AGO_30.jpeg",
"D0AGO/D0AGO_31.jpeg",
"D0AGO/D0AGO_32.jpeg",
"D0AGO/D0AGO_33.jpeg",
"D0AGO/D0AGO_34.jpeg",
"D0AGO/D0AGO_35.jpeg",
"D0AGO/D0AGO_36.jpeg",
"D0AGO/D0AGO_37.jpeg",
"D0AGO/D0AGO_38.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"28-0.jpg"
]
} | Throughout the entire video. |
29 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}When in the video sequence do we observe the action 'the person takes a picture out'?",
"images_path": [
"Q948H/Q948H_0.jpeg",
"Q948H/Q948H_1.jpeg",
"Q948H/Q948H_2.jpeg",
"Q948H/Q948H_3.jpeg",
"Q948H/Q948H_4.jpeg",
"Q948H/Q948H_5.jpeg",
"Q948H/Q948H_6.jpeg",
"Q948H/Q948H_7.jpeg",
"Q948H/Q948H_8.jpeg",
"Q948H/Q948H_9.jpeg",
"Q948H/Q948H_10.jpeg",
"Q948H/Q948H_11.jpeg",
"Q948H/Q948H_12.jpeg",
"Q948H/Q948H_13.jpeg",
"Q948H/Q948H_14.jpeg",
"Q948H/Q948H_15.jpeg",
"Q948H/Q948H_16.jpeg",
"Q948H/Q948H_17.jpeg",
"Q948H/Q948H_18.jpeg",
"Q948H/Q948H_19.jpeg",
"Q948H/Q948H_20.jpeg",
"Q948H/Q948H_21.jpeg",
"Q948H/Q948H_22.jpeg",
"Q948H/Q948H_23.jpeg",
"Q948H/Q948H_24.jpeg",
"Q948H/Q948H_25.jpeg",
"Q948H/Q948H_26.jpeg",
"Q948H/Q948H_27.jpeg",
"Q948H/Q948H_28.jpeg",
"Q948H/Q948H_29.jpeg",
"Q948H/Q948H_30.jpeg",
"Q948H/Q948H_31.jpeg",
"Q948H/Q948H_32.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"29-0.jpg"
]
} | Throughout the entire video. |
30 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}In the given video, when does the action 'person begins to undress' take place?",
"images_path": [
"JVOM3/JVOM3_0.jpeg",
"JVOM3/JVOM3_1.jpeg",
"JVOM3/JVOM3_2.jpeg",
"JVOM3/JVOM3_3.jpeg",
"JVOM3/JVOM3_4.jpeg",
"JVOM3/JVOM3_5.jpeg",
"JVOM3/JVOM3_6.jpeg",
"JVOM3/JVOM3_7.jpeg",
"JVOM3/JVOM3_8.jpeg",
"JVOM3/JVOM3_9.jpeg",
"JVOM3/JVOM3_10.jpeg",
"JVOM3/JVOM3_11.jpeg",
"JVOM3/JVOM3_12.jpeg",
"JVOM3/JVOM3_13.jpeg",
"JVOM3/JVOM3_14.jpeg",
"JVOM3/JVOM3_15.jpeg",
"JVOM3/JVOM3_16.jpeg",
"JVOM3/JVOM3_17.jpeg",
"JVOM3/JVOM3_18.jpeg",
"JVOM3/JVOM3_19.jpeg",
"JVOM3/JVOM3_20.jpeg",
"JVOM3/JVOM3_21.jpeg",
"JVOM3/JVOM3_22.jpeg",
"JVOM3/JVOM3_23.jpeg",
"JVOM3/JVOM3_24.jpeg",
"JVOM3/JVOM3_25.jpeg",
"JVOM3/JVOM3_26.jpeg",
"JVOM3/JVOM3_27.jpeg",
"JVOM3/JVOM3_28.jpeg",
"JVOM3/JVOM3_29.jpeg",
"JVOM3/JVOM3_30.jpeg",
"JVOM3/JVOM3_31.jpeg",
"JVOM3/JVOM3_32.jpeg",
"JVOM3/JVOM3_33.jpeg",
"JVOM3/JVOM3_34.jpeg",
"JVOM3/JVOM3_35.jpeg",
"JVOM3/JVOM3_36.jpeg",
"JVOM3/JVOM3_37.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"30-0.jpg"
]
} | Throughout the entire video. |
31 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}Can you identify when the action 'a person is smiling' happens in the video?",
"images_path": [
"O8T6G/O8T6G_0.jpeg",
"O8T6G/O8T6G_1.jpeg",
"O8T6G/O8T6G_2.jpeg",
"O8T6G/O8T6G_3.jpeg",
"O8T6G/O8T6G_4.jpeg",
"O8T6G/O8T6G_5.jpeg",
"O8T6G/O8T6G_6.jpeg",
"O8T6G/O8T6G_7.jpeg",
"O8T6G/O8T6G_8.jpeg",
"O8T6G/O8T6G_9.jpeg",
"O8T6G/O8T6G_10.jpeg",
"O8T6G/O8T6G_11.jpeg",
"O8T6G/O8T6G_12.jpeg",
"O8T6G/O8T6G_13.jpeg",
"O8T6G/O8T6G_14.jpeg",
"O8T6G/O8T6G_15.jpeg",
"O8T6G/O8T6G_16.jpeg",
"O8T6G/O8T6G_17.jpeg",
"O8T6G/O8T6G_18.jpeg",
"O8T6G/O8T6G_19.jpeg",
"O8T6G/O8T6G_20.jpeg",
"O8T6G/O8T6G_21.jpeg",
"O8T6G/O8T6G_22.jpeg",
"O8T6G/O8T6G_23.jpeg",
"O8T6G/O8T6G_24.jpeg",
"O8T6G/O8T6G_25.jpeg",
"O8T6G/O8T6G_26.jpeg",
"O8T6G/O8T6G_27.jpeg",
"O8T6G/O8T6G_28.jpeg",
"O8T6G/O8T6G_29.jpeg",
"O8T6G/O8T6G_30.jpeg",
"O8T6G/O8T6G_31.jpeg",
"O8T6G/O8T6G_32.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"31-0.jpg"
]
} | Throughout the entire video. |
32 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}During which part of the video does the action 'a person is smiling' occur?",
"images_path": [
"XMYXI/XMYXI_0.jpeg",
"XMYXI/XMYXI_1.jpeg",
"XMYXI/XMYXI_2.jpeg",
"XMYXI/XMYXI_3.jpeg",
"XMYXI/XMYXI_4.jpeg",
"XMYXI/XMYXI_5.jpeg",
"XMYXI/XMYXI_6.jpeg",
"XMYXI/XMYXI_7.jpeg",
"XMYXI/XMYXI_8.jpeg",
"XMYXI/XMYXI_9.jpeg",
"XMYXI/XMYXI_10.jpeg",
"XMYXI/XMYXI_11.jpeg",
"XMYXI/XMYXI_12.jpeg",
"XMYXI/XMYXI_13.jpeg",
"XMYXI/XMYXI_14.jpeg",
"XMYXI/XMYXI_15.jpeg",
"XMYXI/XMYXI_16.jpeg",
"XMYXI/XMYXI_17.jpeg",
"XMYXI/XMYXI_18.jpeg",
"XMYXI/XMYXI_19.jpeg",
"XMYXI/XMYXI_20.jpeg",
"XMYXI/XMYXI_21.jpeg",
"XMYXI/XMYXI_22.jpeg",
"XMYXI/XMYXI_23.jpeg",
"XMYXI/XMYXI_24.jpeg",
"XMYXI/XMYXI_25.jpeg",
"XMYXI/XMYXI_26.jpeg",
"XMYXI/XMYXI_27.jpeg",
"XMYXI/XMYXI_28.jpeg",
"XMYXI/XMYXI_29.jpeg",
"XMYXI/XMYXI_30.jpeg",
"XMYXI/XMYXI_31.jpeg",
"XMYXI/XMYXI_32.jpeg",
"XMYXI/XMYXI_33.jpeg",
"XMYXI/XMYXI_34.jpeg",
"XMYXI/XMYXI_35.jpeg",
"XMYXI/XMYXI_36.jpeg",
"XMYXI/XMYXI_37.jpeg",
"XMYXI/XMYXI_38.jpeg",
"XMYXI/XMYXI_39.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"32-0.jpg"
]
} | Throughout the entire video. |
33 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}In the given video, when does the action 'person working on a laptop' take place?",
"images_path": [
"TDAY1/TDAY1_0.jpeg",
"TDAY1/TDAY1_1.jpeg",
"TDAY1/TDAY1_2.jpeg",
"TDAY1/TDAY1_3.jpeg",
"TDAY1/TDAY1_4.jpeg",
"TDAY1/TDAY1_5.jpeg",
"TDAY1/TDAY1_6.jpeg",
"TDAY1/TDAY1_7.jpeg",
"TDAY1/TDAY1_8.jpeg",
"TDAY1/TDAY1_9.jpeg",
"TDAY1/TDAY1_10.jpeg",
"TDAY1/TDAY1_11.jpeg",
"TDAY1/TDAY1_12.jpeg",
"TDAY1/TDAY1_13.jpeg",
"TDAY1/TDAY1_14.jpeg",
"TDAY1/TDAY1_15.jpeg",
"TDAY1/TDAY1_16.jpeg",
"TDAY1/TDAY1_17.jpeg",
"TDAY1/TDAY1_18.jpeg",
"TDAY1/TDAY1_19.jpeg",
"TDAY1/TDAY1_20.jpeg",
"TDAY1/TDAY1_21.jpeg",
"TDAY1/TDAY1_22.jpeg",
"TDAY1/TDAY1_23.jpeg",
"TDAY1/TDAY1_24.jpeg",
"TDAY1/TDAY1_25.jpeg",
"TDAY1/TDAY1_26.jpeg",
"TDAY1/TDAY1_27.jpeg",
"TDAY1/TDAY1_28.jpeg",
"TDAY1/TDAY1_29.jpeg",
"TDAY1/TDAY1_30.jpeg",
"TDAY1/TDAY1_31.jpeg",
"TDAY1/TDAY1_32.jpeg",
"TDAY1/TDAY1_33.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"33-0.jpg"
]
} | Throughout the entire video. |
34 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}{image#46}{image#47}{image#48}{image#49}{image#50}During which part of the video does the action 'person playing a game on a laptop' occur?",
"images_path": [
"Q5YDL/Q5YDL_0.jpeg",
"Q5YDL/Q5YDL_1.jpeg",
"Q5YDL/Q5YDL_2.jpeg",
"Q5YDL/Q5YDL_3.jpeg",
"Q5YDL/Q5YDL_4.jpeg",
"Q5YDL/Q5YDL_5.jpeg",
"Q5YDL/Q5YDL_6.jpeg",
"Q5YDL/Q5YDL_7.jpeg",
"Q5YDL/Q5YDL_8.jpeg",
"Q5YDL/Q5YDL_9.jpeg",
"Q5YDL/Q5YDL_10.jpeg",
"Q5YDL/Q5YDL_11.jpeg",
"Q5YDL/Q5YDL_12.jpeg",
"Q5YDL/Q5YDL_13.jpeg",
"Q5YDL/Q5YDL_14.jpeg",
"Q5YDL/Q5YDL_15.jpeg",
"Q5YDL/Q5YDL_16.jpeg",
"Q5YDL/Q5YDL_17.jpeg",
"Q5YDL/Q5YDL_18.jpeg",
"Q5YDL/Q5YDL_19.jpeg",
"Q5YDL/Q5YDL_20.jpeg",
"Q5YDL/Q5YDL_21.jpeg",
"Q5YDL/Q5YDL_22.jpeg",
"Q5YDL/Q5YDL_23.jpeg",
"Q5YDL/Q5YDL_24.jpeg",
"Q5YDL/Q5YDL_25.jpeg",
"Q5YDL/Q5YDL_26.jpeg",
"Q5YDL/Q5YDL_27.jpeg",
"Q5YDL/Q5YDL_28.jpeg",
"Q5YDL/Q5YDL_29.jpeg",
"Q5YDL/Q5YDL_30.jpeg",
"Q5YDL/Q5YDL_31.jpeg",
"Q5YDL/Q5YDL_32.jpeg",
"Q5YDL/Q5YDL_33.jpeg",
"Q5YDL/Q5YDL_34.jpeg",
"Q5YDL/Q5YDL_35.jpeg",
"Q5YDL/Q5YDL_36.jpeg",
"Q5YDL/Q5YDL_37.jpeg",
"Q5YDL/Q5YDL_38.jpeg",
"Q5YDL/Q5YDL_39.jpeg",
"Q5YDL/Q5YDL_40.jpeg",
"Q5YDL/Q5YDL_41.jpeg",
"Q5YDL/Q5YDL_42.jpeg",
"Q5YDL/Q5YDL_43.jpeg",
"Q5YDL/Q5YDL_44.jpeg",
"Q5YDL/Q5YDL_45.jpeg",
"Q5YDL/Q5YDL_46.jpeg",
"Q5YDL/Q5YDL_47.jpeg",
"Q5YDL/Q5YDL_48.jpeg",
"Q5YDL/Q5YDL_49.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"34-0.jpg"
]
} | Throughout the entire video. |
35 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}Can you identify when the action 'person closes a window' happens in the video?",
"images_path": [
"ZJRCS/ZJRCS_0.jpeg",
"ZJRCS/ZJRCS_1.jpeg",
"ZJRCS/ZJRCS_2.jpeg",
"ZJRCS/ZJRCS_3.jpeg",
"ZJRCS/ZJRCS_4.jpeg",
"ZJRCS/ZJRCS_5.jpeg",
"ZJRCS/ZJRCS_6.jpeg",
"ZJRCS/ZJRCS_7.jpeg",
"ZJRCS/ZJRCS_8.jpeg",
"ZJRCS/ZJRCS_9.jpeg",
"ZJRCS/ZJRCS_10.jpeg",
"ZJRCS/ZJRCS_11.jpeg",
"ZJRCS/ZJRCS_12.jpeg",
"ZJRCS/ZJRCS_13.jpeg",
"ZJRCS/ZJRCS_14.jpeg",
"ZJRCS/ZJRCS_15.jpeg",
"ZJRCS/ZJRCS_16.jpeg",
"ZJRCS/ZJRCS_17.jpeg",
"ZJRCS/ZJRCS_18.jpeg",
"ZJRCS/ZJRCS_19.jpeg",
"ZJRCS/ZJRCS_20.jpeg",
"ZJRCS/ZJRCS_21.jpeg",
"ZJRCS/ZJRCS_22.jpeg",
"ZJRCS/ZJRCS_23.jpeg",
"ZJRCS/ZJRCS_24.jpeg",
"ZJRCS/ZJRCS_25.jpeg",
"ZJRCS/ZJRCS_26.jpeg",
"ZJRCS/ZJRCS_27.jpeg",
"ZJRCS/ZJRCS_28.jpeg",
"ZJRCS/ZJRCS_29.jpeg",
"ZJRCS/ZJRCS_30.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"35-0.jpg"
]
} | Throughout the entire video. |
36 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}When in the video sequence do we observe the action 'person opens the closet'?",
"images_path": [
"L5YHH/L5YHH_0.jpeg",
"L5YHH/L5YHH_1.jpeg",
"L5YHH/L5YHH_2.jpeg",
"L5YHH/L5YHH_3.jpeg",
"L5YHH/L5YHH_4.jpeg",
"L5YHH/L5YHH_5.jpeg",
"L5YHH/L5YHH_6.jpeg",
"L5YHH/L5YHH_7.jpeg",
"L5YHH/L5YHH_8.jpeg",
"L5YHH/L5YHH_9.jpeg",
"L5YHH/L5YHH_10.jpeg",
"L5YHH/L5YHH_11.jpeg",
"L5YHH/L5YHH_12.jpeg",
"L5YHH/L5YHH_13.jpeg",
"L5YHH/L5YHH_14.jpeg",
"L5YHH/L5YHH_15.jpeg",
"L5YHH/L5YHH_16.jpeg",
"L5YHH/L5YHH_17.jpeg",
"L5YHH/L5YHH_18.jpeg",
"L5YHH/L5YHH_19.jpeg",
"L5YHH/L5YHH_20.jpeg",
"L5YHH/L5YHH_21.jpeg",
"L5YHH/L5YHH_22.jpeg",
"L5YHH/L5YHH_23.jpeg",
"L5YHH/L5YHH_24.jpeg",
"L5YHH/L5YHH_25.jpeg",
"L5YHH/L5YHH_26.jpeg",
"L5YHH/L5YHH_27.jpeg",
"L5YHH/L5YHH_28.jpeg",
"L5YHH/L5YHH_29.jpeg",
"L5YHH/L5YHH_30.jpeg",
"L5YHH/L5YHH_31.jpeg",
"L5YHH/L5YHH_32.jpeg",
"L5YHH/L5YHH_33.jpeg",
"L5YHH/L5YHH_34.jpeg",
"L5YHH/L5YHH_35.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"36-0.jpg"
]
} | Throughout the entire video. |
37 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}When in the video sequence do we observe the action 'person opens the oven door'?",
"images_path": [
"Z4Y04/Z4Y04_0.jpeg",
"Z4Y04/Z4Y04_1.jpeg",
"Z4Y04/Z4Y04_2.jpeg",
"Z4Y04/Z4Y04_3.jpeg",
"Z4Y04/Z4Y04_4.jpeg",
"Z4Y04/Z4Y04_5.jpeg",
"Z4Y04/Z4Y04_6.jpeg",
"Z4Y04/Z4Y04_7.jpeg",
"Z4Y04/Z4Y04_8.jpeg",
"Z4Y04/Z4Y04_9.jpeg",
"Z4Y04/Z4Y04_10.jpeg",
"Z4Y04/Z4Y04_11.jpeg",
"Z4Y04/Z4Y04_12.jpeg",
"Z4Y04/Z4Y04_13.jpeg",
"Z4Y04/Z4Y04_14.jpeg",
"Z4Y04/Z4Y04_15.jpeg",
"Z4Y04/Z4Y04_16.jpeg",
"Z4Y04/Z4Y04_17.jpeg",
"Z4Y04/Z4Y04_18.jpeg",
"Z4Y04/Z4Y04_19.jpeg",
"Z4Y04/Z4Y04_20.jpeg",
"Z4Y04/Z4Y04_21.jpeg",
"Z4Y04/Z4Y04_22.jpeg",
"Z4Y04/Z4Y04_23.jpeg",
"Z4Y04/Z4Y04_24.jpeg",
"Z4Y04/Z4Y04_25.jpeg",
"Z4Y04/Z4Y04_26.jpeg",
"Z4Y04/Z4Y04_27.jpeg",
"Z4Y04/Z4Y04_28.jpeg",
"Z4Y04/Z4Y04_29.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"37-0.jpg"
]
} | Throughout the entire video. |
38 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}When in the video sequence do we observe the action 'person start working on their laptop'?",
"images_path": [
"VXOE4/VXOE4_0.jpeg",
"VXOE4/VXOE4_1.jpeg",
"VXOE4/VXOE4_2.jpeg",
"VXOE4/VXOE4_3.jpeg",
"VXOE4/VXOE4_4.jpeg",
"VXOE4/VXOE4_5.jpeg",
"VXOE4/VXOE4_6.jpeg",
"VXOE4/VXOE4_7.jpeg",
"VXOE4/VXOE4_8.jpeg",
"VXOE4/VXOE4_9.jpeg",
"VXOE4/VXOE4_10.jpeg",
"VXOE4/VXOE4_11.jpeg",
"VXOE4/VXOE4_12.jpeg",
"VXOE4/VXOE4_13.jpeg",
"VXOE4/VXOE4_14.jpeg",
"VXOE4/VXOE4_15.jpeg",
"VXOE4/VXOE4_16.jpeg",
"VXOE4/VXOE4_17.jpeg",
"VXOE4/VXOE4_18.jpeg",
"VXOE4/VXOE4_19.jpeg",
"VXOE4/VXOE4_20.jpeg",
"VXOE4/VXOE4_21.jpeg",
"VXOE4/VXOE4_22.jpeg",
"VXOE4/VXOE4_23.jpeg",
"VXOE4/VXOE4_24.jpeg",
"VXOE4/VXOE4_25.jpeg",
"VXOE4/VXOE4_26.jpeg",
"VXOE4/VXOE4_27.jpeg",
"VXOE4/VXOE4_28.jpeg",
"VXOE4/VXOE4_29.jpeg",
"VXOE4/VXOE4_30.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"38-0.jpg"
]
} | Throughout the entire video. |
39 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}When in the video sequence do we observe the action 'person holding a pillow'?",
"images_path": [
"PN1F2/PN1F2_0.jpeg",
"PN1F2/PN1F2_1.jpeg",
"PN1F2/PN1F2_2.jpeg",
"PN1F2/PN1F2_3.jpeg",
"PN1F2/PN1F2_4.jpeg",
"PN1F2/PN1F2_5.jpeg",
"PN1F2/PN1F2_6.jpeg",
"PN1F2/PN1F2_7.jpeg",
"PN1F2/PN1F2_8.jpeg",
"PN1F2/PN1F2_9.jpeg",
"PN1F2/PN1F2_10.jpeg",
"PN1F2/PN1F2_11.jpeg",
"PN1F2/PN1F2_12.jpeg",
"PN1F2/PN1F2_13.jpeg",
"PN1F2/PN1F2_14.jpeg",
"PN1F2/PN1F2_15.jpeg",
"PN1F2/PN1F2_16.jpeg",
"PN1F2/PN1F2_17.jpeg",
"PN1F2/PN1F2_18.jpeg",
"PN1F2/PN1F2_19.jpeg",
"PN1F2/PN1F2_20.jpeg",
"PN1F2/PN1F2_21.jpeg",
"PN1F2/PN1F2_22.jpeg",
"PN1F2/PN1F2_23.jpeg",
"PN1F2/PN1F2_24.jpeg",
"PN1F2/PN1F2_25.jpeg",
"PN1F2/PN1F2_26.jpeg",
"PN1F2/PN1F2_27.jpeg",
"PN1F2/PN1F2_28.jpeg",
"PN1F2/PN1F2_29.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"39-0.jpg"
]
} | Throughout the entire video. |
40 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}During which part of the video does the action 'person looking at a book' occur?",
"images_path": [
"7LWW3/7LWW3_0.jpeg",
"7LWW3/7LWW3_1.jpeg",
"7LWW3/7LWW3_2.jpeg",
"7LWW3/7LWW3_3.jpeg",
"7LWW3/7LWW3_4.jpeg",
"7LWW3/7LWW3_5.jpeg",
"7LWW3/7LWW3_6.jpeg",
"7LWW3/7LWW3_7.jpeg",
"7LWW3/7LWW3_8.jpeg",
"7LWW3/7LWW3_9.jpeg",
"7LWW3/7LWW3_10.jpeg",
"7LWW3/7LWW3_11.jpeg",
"7LWW3/7LWW3_12.jpeg",
"7LWW3/7LWW3_13.jpeg",
"7LWW3/7LWW3_14.jpeg",
"7LWW3/7LWW3_15.jpeg",
"7LWW3/7LWW3_16.jpeg",
"7LWW3/7LWW3_17.jpeg",
"7LWW3/7LWW3_18.jpeg",
"7LWW3/7LWW3_19.jpeg",
"7LWW3/7LWW3_20.jpeg",
"7LWW3/7LWW3_21.jpeg",
"7LWW3/7LWW3_22.jpeg",
"7LWW3/7LWW3_23.jpeg",
"7LWW3/7LWW3_24.jpeg",
"7LWW3/7LWW3_25.jpeg",
"7LWW3/7LWW3_26.jpeg",
"7LWW3/7LWW3_27.jpeg",
"7LWW3/7LWW3_28.jpeg",
"7LWW3/7LWW3_29.jpeg",
"7LWW3/7LWW3_30.jpeg",
"7LWW3/7LWW3_31.jpeg",
"7LWW3/7LWW3_32.jpeg",
"7LWW3/7LWW3_33.jpeg",
"7LWW3/7LWW3_34.jpeg",
"7LWW3/7LWW3_35.jpeg",
"7LWW3/7LWW3_36.jpeg",
"7LWW3/7LWW3_37.jpeg",
"7LWW3/7LWW3_38.jpeg",
"7LWW3/7LWW3_39.jpeg",
"7LWW3/7LWW3_40.jpeg",
"7LWW3/7LWW3_41.jpeg",
"7LWW3/7LWW3_42.jpeg",
"7LWW3/7LWW3_43.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"40-0.jpg"
]
} | Throughout the entire video. |
41 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}During which part of the video does the action 'person cooks some food on the stove' occur?",
"images_path": [
"3QL7J/3QL7J_0.jpeg",
"3QL7J/3QL7J_1.jpeg",
"3QL7J/3QL7J_2.jpeg",
"3QL7J/3QL7J_3.jpeg",
"3QL7J/3QL7J_4.jpeg",
"3QL7J/3QL7J_5.jpeg",
"3QL7J/3QL7J_6.jpeg",
"3QL7J/3QL7J_7.jpeg",
"3QL7J/3QL7J_8.jpeg",
"3QL7J/3QL7J_9.jpeg",
"3QL7J/3QL7J_10.jpeg",
"3QL7J/3QL7J_11.jpeg",
"3QL7J/3QL7J_12.jpeg",
"3QL7J/3QL7J_13.jpeg",
"3QL7J/3QL7J_14.jpeg",
"3QL7J/3QL7J_15.jpeg",
"3QL7J/3QL7J_16.jpeg",
"3QL7J/3QL7J_17.jpeg",
"3QL7J/3QL7J_18.jpeg",
"3QL7J/3QL7J_19.jpeg",
"3QL7J/3QL7J_20.jpeg",
"3QL7J/3QL7J_21.jpeg",
"3QL7J/3QL7J_22.jpeg",
"3QL7J/3QL7J_23.jpeg",
"3QL7J/3QL7J_24.jpeg",
"3QL7J/3QL7J_25.jpeg",
"3QL7J/3QL7J_26.jpeg",
"3QL7J/3QL7J_27.jpeg",
"3QL7J/3QL7J_28.jpeg",
"3QL7J/3QL7J_29.jpeg",
"3QL7J/3QL7J_30.jpeg",
"3QL7J/3QL7J_31.jpeg",
"3QL7J/3QL7J_32.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"41-0.jpg"
]
} | Throughout the entire video. |
42 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}When in the video sequence do we observe the action 'person putting on their shoes'?",
"images_path": [
"1HZGH/1HZGH_0.jpeg",
"1HZGH/1HZGH_1.jpeg",
"1HZGH/1HZGH_2.jpeg",
"1HZGH/1HZGH_3.jpeg",
"1HZGH/1HZGH_4.jpeg",
"1HZGH/1HZGH_5.jpeg",
"1HZGH/1HZGH_6.jpeg",
"1HZGH/1HZGH_7.jpeg",
"1HZGH/1HZGH_8.jpeg",
"1HZGH/1HZGH_9.jpeg",
"1HZGH/1HZGH_10.jpeg",
"1HZGH/1HZGH_11.jpeg",
"1HZGH/1HZGH_12.jpeg",
"1HZGH/1HZGH_13.jpeg",
"1HZGH/1HZGH_14.jpeg",
"1HZGH/1HZGH_15.jpeg",
"1HZGH/1HZGH_16.jpeg",
"1HZGH/1HZGH_17.jpeg",
"1HZGH/1HZGH_18.jpeg",
"1HZGH/1HZGH_19.jpeg",
"1HZGH/1HZGH_20.jpeg",
"1HZGH/1HZGH_21.jpeg",
"1HZGH/1HZGH_22.jpeg",
"1HZGH/1HZGH_23.jpeg",
"1HZGH/1HZGH_24.jpeg",
"1HZGH/1HZGH_25.jpeg",
"1HZGH/1HZGH_26.jpeg",
"1HZGH/1HZGH_27.jpeg",
"1HZGH/1HZGH_28.jpeg",
"1HZGH/1HZGH_29.jpeg",
"1HZGH/1HZGH_30.jpeg",
"1HZGH/1HZGH_31.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"42-0.jpg"
]
} | Throughout the entire video. |
43 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}Can you identify when the action 'the person is sitting in the couch with the laptop' happens in the video?",
"images_path": [
"X9M5B/X9M5B_0.jpeg",
"X9M5B/X9M5B_1.jpeg",
"X9M5B/X9M5B_2.jpeg",
"X9M5B/X9M5B_3.jpeg",
"X9M5B/X9M5B_4.jpeg",
"X9M5B/X9M5B_5.jpeg",
"X9M5B/X9M5B_6.jpeg",
"X9M5B/X9M5B_7.jpeg",
"X9M5B/X9M5B_8.jpeg",
"X9M5B/X9M5B_9.jpeg",
"X9M5B/X9M5B_10.jpeg",
"X9M5B/X9M5B_11.jpeg",
"X9M5B/X9M5B_12.jpeg",
"X9M5B/X9M5B_13.jpeg",
"X9M5B/X9M5B_14.jpeg",
"X9M5B/X9M5B_15.jpeg",
"X9M5B/X9M5B_16.jpeg",
"X9M5B/X9M5B_17.jpeg",
"X9M5B/X9M5B_18.jpeg",
"X9M5B/X9M5B_19.jpeg",
"X9M5B/X9M5B_20.jpeg",
"X9M5B/X9M5B_21.jpeg",
"X9M5B/X9M5B_22.jpeg",
"X9M5B/X9M5B_23.jpeg",
"X9M5B/X9M5B_24.jpeg",
"X9M5B/X9M5B_25.jpeg",
"X9M5B/X9M5B_26.jpeg",
"X9M5B/X9M5B_27.jpeg",
"X9M5B/X9M5B_28.jpeg",
"X9M5B/X9M5B_29.jpeg",
"X9M5B/X9M5B_30.jpeg",
"X9M5B/X9M5B_31.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"43-0.jpg"
]
} | Throughout the entire video. |
44 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}Can you identify when the action 'person takes out the phone' happens in the video?",
"images_path": [
"47RAA/47RAA_0.jpeg",
"47RAA/47RAA_1.jpeg",
"47RAA/47RAA_2.jpeg",
"47RAA/47RAA_3.jpeg",
"47RAA/47RAA_4.jpeg",
"47RAA/47RAA_5.jpeg",
"47RAA/47RAA_6.jpeg",
"47RAA/47RAA_7.jpeg",
"47RAA/47RAA_8.jpeg",
"47RAA/47RAA_9.jpeg",
"47RAA/47RAA_10.jpeg",
"47RAA/47RAA_11.jpeg",
"47RAA/47RAA_12.jpeg",
"47RAA/47RAA_13.jpeg",
"47RAA/47RAA_14.jpeg",
"47RAA/47RAA_15.jpeg",
"47RAA/47RAA_16.jpeg",
"47RAA/47RAA_17.jpeg",
"47RAA/47RAA_18.jpeg",
"47RAA/47RAA_19.jpeg",
"47RAA/47RAA_20.jpeg",
"47RAA/47RAA_21.jpeg",
"47RAA/47RAA_22.jpeg",
"47RAA/47RAA_23.jpeg",
"47RAA/47RAA_24.jpeg",
"47RAA/47RAA_25.jpeg",
"47RAA/47RAA_26.jpeg",
"47RAA/47RAA_27.jpeg",
"47RAA/47RAA_28.jpeg",
"47RAA/47RAA_29.jpeg",
"47RAA/47RAA_30.jpeg",
"47RAA/47RAA_31.jpeg",
"47RAA/47RAA_32.jpeg",
"47RAA/47RAA_33.jpeg",
"47RAA/47RAA_34.jpeg",
"47RAA/47RAA_35.jpeg",
"47RAA/47RAA_36.jpeg",
"47RAA/47RAA_37.jpeg",
"47RAA/47RAA_38.jpeg",
"47RAA/47RAA_39.jpeg",
"47RAA/47RAA_40.jpeg",
"47RAA/47RAA_41.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"44-0.jpg"
]
} | Throughout the entire video. |
45 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}When in the video sequence do we observe the action 'a person laughs to themselves'?",
"images_path": [
"4BIMI/4BIMI_0.jpeg",
"4BIMI/4BIMI_1.jpeg",
"4BIMI/4BIMI_2.jpeg",
"4BIMI/4BIMI_3.jpeg",
"4BIMI/4BIMI_4.jpeg",
"4BIMI/4BIMI_5.jpeg",
"4BIMI/4BIMI_6.jpeg",
"4BIMI/4BIMI_7.jpeg",
"4BIMI/4BIMI_8.jpeg",
"4BIMI/4BIMI_9.jpeg",
"4BIMI/4BIMI_10.jpeg",
"4BIMI/4BIMI_11.jpeg",
"4BIMI/4BIMI_12.jpeg",
"4BIMI/4BIMI_13.jpeg",
"4BIMI/4BIMI_14.jpeg",
"4BIMI/4BIMI_15.jpeg",
"4BIMI/4BIMI_16.jpeg",
"4BIMI/4BIMI_17.jpeg",
"4BIMI/4BIMI_18.jpeg",
"4BIMI/4BIMI_19.jpeg",
"4BIMI/4BIMI_20.jpeg",
"4BIMI/4BIMI_21.jpeg",
"4BIMI/4BIMI_22.jpeg",
"4BIMI/4BIMI_23.jpeg",
"4BIMI/4BIMI_24.jpeg",
"4BIMI/4BIMI_25.jpeg",
"4BIMI/4BIMI_26.jpeg",
"4BIMI/4BIMI_27.jpeg",
"4BIMI/4BIMI_28.jpeg",
"4BIMI/4BIMI_29.jpeg",
"4BIMI/4BIMI_30.jpeg",
"4BIMI/4BIMI_31.jpeg",
"4BIMI/4BIMI_32.jpeg",
"4BIMI/4BIMI_33.jpeg",
"4BIMI/4BIMI_34.jpeg",
"4BIMI/4BIMI_35.jpeg",
"4BIMI/4BIMI_36.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"45-0.jpg"
]
} | Throughout the entire video. |
46 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}When in the video sequence do we observe the action 'person tidying clothes'?",
"images_path": [
"TU9K1/TU9K1_0.jpeg",
"TU9K1/TU9K1_1.jpeg",
"TU9K1/TU9K1_2.jpeg",
"TU9K1/TU9K1_3.jpeg",
"TU9K1/TU9K1_4.jpeg",
"TU9K1/TU9K1_5.jpeg",
"TU9K1/TU9K1_6.jpeg",
"TU9K1/TU9K1_7.jpeg",
"TU9K1/TU9K1_8.jpeg",
"TU9K1/TU9K1_9.jpeg",
"TU9K1/TU9K1_10.jpeg",
"TU9K1/TU9K1_11.jpeg",
"TU9K1/TU9K1_12.jpeg",
"TU9K1/TU9K1_13.jpeg",
"TU9K1/TU9K1_14.jpeg",
"TU9K1/TU9K1_15.jpeg",
"TU9K1/TU9K1_16.jpeg",
"TU9K1/TU9K1_17.jpeg",
"TU9K1/TU9K1_18.jpeg",
"TU9K1/TU9K1_19.jpeg",
"TU9K1/TU9K1_20.jpeg",
"TU9K1/TU9K1_21.jpeg",
"TU9K1/TU9K1_22.jpeg",
"TU9K1/TU9K1_23.jpeg",
"TU9K1/TU9K1_24.jpeg",
"TU9K1/TU9K1_25.jpeg",
"TU9K1/TU9K1_26.jpeg",
"TU9K1/TU9K1_27.jpeg",
"TU9K1/TU9K1_28.jpeg",
"TU9K1/TU9K1_29.jpeg",
"TU9K1/TU9K1_30.jpeg",
"TU9K1/TU9K1_31.jpeg",
"TU9K1/TU9K1_32.jpeg",
"TU9K1/TU9K1_33.jpeg",
"TU9K1/TU9K1_34.jpeg",
"TU9K1/TU9K1_35.jpeg",
"TU9K1/TU9K1_36.jpeg",
"TU9K1/TU9K1_37.jpeg",
"TU9K1/TU9K1_38.jpeg",
"TU9K1/TU9K1_39.jpeg",
"TU9K1/TU9K1_40.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"46-0.jpg"
]
} | Throughout the entire video. |
47 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}In the given video, when does the action 'person pour it into a glass' take place?",
"images_path": [
"C10FA/C10FA_0.jpeg",
"C10FA/C10FA_1.jpeg",
"C10FA/C10FA_2.jpeg",
"C10FA/C10FA_3.jpeg",
"C10FA/C10FA_4.jpeg",
"C10FA/C10FA_5.jpeg",
"C10FA/C10FA_6.jpeg",
"C10FA/C10FA_7.jpeg",
"C10FA/C10FA_8.jpeg",
"C10FA/C10FA_9.jpeg",
"C10FA/C10FA_10.jpeg",
"C10FA/C10FA_11.jpeg",
"C10FA/C10FA_12.jpeg",
"C10FA/C10FA_13.jpeg",
"C10FA/C10FA_14.jpeg",
"C10FA/C10FA_15.jpeg",
"C10FA/C10FA_16.jpeg",
"C10FA/C10FA_17.jpeg",
"C10FA/C10FA_18.jpeg",
"C10FA/C10FA_19.jpeg",
"C10FA/C10FA_20.jpeg",
"C10FA/C10FA_21.jpeg",
"C10FA/C10FA_22.jpeg",
"C10FA/C10FA_23.jpeg",
"C10FA/C10FA_24.jpeg",
"C10FA/C10FA_25.jpeg",
"C10FA/C10FA_26.jpeg",
"C10FA/C10FA_27.jpeg",
"C10FA/C10FA_28.jpeg",
"C10FA/C10FA_29.jpeg",
"C10FA/C10FA_30.jpeg",
"C10FA/C10FA_31.jpeg",
"C10FA/C10FA_32.jpeg",
"C10FA/C10FA_33.jpeg",
"C10FA/C10FA_34.jpeg",
"C10FA/C10FA_35.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"47-0.jpg"
]
} | Throughout the entire video. |
48 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}At what moment in the video does the action 'person eating something' occur?",
"images_path": [
"UKSCV/UKSCV_0.jpeg",
"UKSCV/UKSCV_1.jpeg",
"UKSCV/UKSCV_2.jpeg",
"UKSCV/UKSCV_3.jpeg",
"UKSCV/UKSCV_4.jpeg",
"UKSCV/UKSCV_5.jpeg",
"UKSCV/UKSCV_6.jpeg",
"UKSCV/UKSCV_7.jpeg",
"UKSCV/UKSCV_8.jpeg",
"UKSCV/UKSCV_9.jpeg",
"UKSCV/UKSCV_10.jpeg",
"UKSCV/UKSCV_11.jpeg",
"UKSCV/UKSCV_12.jpeg",
"UKSCV/UKSCV_13.jpeg",
"UKSCV/UKSCV_14.jpeg",
"UKSCV/UKSCV_15.jpeg",
"UKSCV/UKSCV_16.jpeg",
"UKSCV/UKSCV_17.jpeg",
"UKSCV/UKSCV_18.jpeg",
"UKSCV/UKSCV_19.jpeg",
"UKSCV/UKSCV_20.jpeg",
"UKSCV/UKSCV_21.jpeg",
"UKSCV/UKSCV_22.jpeg",
"UKSCV/UKSCV_23.jpeg",
"UKSCV/UKSCV_24.jpeg",
"UKSCV/UKSCV_25.jpeg",
"UKSCV/UKSCV_26.jpeg",
"UKSCV/UKSCV_27.jpeg",
"UKSCV/UKSCV_28.jpeg",
"UKSCV/UKSCV_29.jpeg",
"UKSCV/UKSCV_30.jpeg",
"UKSCV/UKSCV_31.jpeg",
"UKSCV/UKSCV_32.jpeg",
"UKSCV/UKSCV_33.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"48-0.jpg"
]
} | Throughout the entire video. |
49 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}During which part of the video does the action 'person eating a sandwich' occur?",
"images_path": [
"9JZO2/9JZO2_0.jpeg",
"9JZO2/9JZO2_1.jpeg",
"9JZO2/9JZO2_2.jpeg",
"9JZO2/9JZO2_3.jpeg",
"9JZO2/9JZO2_4.jpeg",
"9JZO2/9JZO2_5.jpeg",
"9JZO2/9JZO2_6.jpeg",
"9JZO2/9JZO2_7.jpeg",
"9JZO2/9JZO2_8.jpeg",
"9JZO2/9JZO2_9.jpeg",
"9JZO2/9JZO2_10.jpeg",
"9JZO2/9JZO2_11.jpeg",
"9JZO2/9JZO2_12.jpeg",
"9JZO2/9JZO2_13.jpeg",
"9JZO2/9JZO2_14.jpeg",
"9JZO2/9JZO2_15.jpeg",
"9JZO2/9JZO2_16.jpeg",
"9JZO2/9JZO2_17.jpeg",
"9JZO2/9JZO2_18.jpeg",
"9JZO2/9JZO2_19.jpeg",
"9JZO2/9JZO2_20.jpeg",
"9JZO2/9JZO2_21.jpeg",
"9JZO2/9JZO2_22.jpeg",
"9JZO2/9JZO2_23.jpeg",
"9JZO2/9JZO2_24.jpeg",
"9JZO2/9JZO2_25.jpeg",
"9JZO2/9JZO2_26.jpeg",
"9JZO2/9JZO2_27.jpeg",
"9JZO2/9JZO2_28.jpeg",
"9JZO2/9JZO2_29.jpeg",
"9JZO2/9JZO2_30.jpeg",
"9JZO2/9JZO2_31.jpeg",
"9JZO2/9JZO2_32.jpeg",
"9JZO2/9JZO2_33.jpeg",
"9JZO2/9JZO2_34.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"49-0.jpg"
]
} | Throughout the entire video. |
50 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}At what moment in the video does the action 'person put down broom' occur?",
"images_path": [
"JT537/JT537_0.jpeg",
"JT537/JT537_1.jpeg",
"JT537/JT537_2.jpeg",
"JT537/JT537_3.jpeg",
"JT537/JT537_4.jpeg",
"JT537/JT537_5.jpeg",
"JT537/JT537_6.jpeg",
"JT537/JT537_7.jpeg",
"JT537/JT537_8.jpeg",
"JT537/JT537_9.jpeg",
"JT537/JT537_10.jpeg",
"JT537/JT537_11.jpeg",
"JT537/JT537_12.jpeg",
"JT537/JT537_13.jpeg",
"JT537/JT537_14.jpeg",
"JT537/JT537_15.jpeg",
"JT537/JT537_16.jpeg",
"JT537/JT537_17.jpeg",
"JT537/JT537_18.jpeg",
"JT537/JT537_19.jpeg",
"JT537/JT537_20.jpeg",
"JT537/JT537_21.jpeg",
"JT537/JT537_22.jpeg",
"JT537/JT537_23.jpeg",
"JT537/JT537_24.jpeg",
"JT537/JT537_25.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"50-0.jpg"
]
} | At the end of the video. |
51 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}In the given video, when does the action 'the person opens a cabinet' take place?",
"images_path": [
"1NJOQ/1NJOQ_0.jpeg",
"1NJOQ/1NJOQ_1.jpeg",
"1NJOQ/1NJOQ_2.jpeg",
"1NJOQ/1NJOQ_3.jpeg",
"1NJOQ/1NJOQ_4.jpeg",
"1NJOQ/1NJOQ_5.jpeg",
"1NJOQ/1NJOQ_6.jpeg",
"1NJOQ/1NJOQ_7.jpeg",
"1NJOQ/1NJOQ_8.jpeg",
"1NJOQ/1NJOQ_9.jpeg",
"1NJOQ/1NJOQ_10.jpeg",
"1NJOQ/1NJOQ_11.jpeg",
"1NJOQ/1NJOQ_12.jpeg",
"1NJOQ/1NJOQ_13.jpeg",
"1NJOQ/1NJOQ_14.jpeg",
"1NJOQ/1NJOQ_15.jpeg",
"1NJOQ/1NJOQ_16.jpeg",
"1NJOQ/1NJOQ_17.jpeg",
"1NJOQ/1NJOQ_18.jpeg",
"1NJOQ/1NJOQ_19.jpeg",
"1NJOQ/1NJOQ_20.jpeg",
"1NJOQ/1NJOQ_21.jpeg",
"1NJOQ/1NJOQ_22.jpeg",
"1NJOQ/1NJOQ_23.jpeg",
"1NJOQ/1NJOQ_24.jpeg",
"1NJOQ/1NJOQ_25.jpeg",
"1NJOQ/1NJOQ_26.jpeg",
"1NJOQ/1NJOQ_27.jpeg",
"1NJOQ/1NJOQ_28.jpeg",
"1NJOQ/1NJOQ_29.jpeg",
"1NJOQ/1NJOQ_30.jpeg",
"1NJOQ/1NJOQ_31.jpeg",
"1NJOQ/1NJOQ_32.jpeg",
"1NJOQ/1NJOQ_33.jpeg",
"1NJOQ/1NJOQ_34.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"51-0.jpg"
]
} | At the end of the video. |
52 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}Can you identify when the action 'person opens the door' happens in the video?",
"images_path": [
"FMZOY/FMZOY_0.jpeg",
"FMZOY/FMZOY_1.jpeg",
"FMZOY/FMZOY_2.jpeg",
"FMZOY/FMZOY_3.jpeg",
"FMZOY/FMZOY_4.jpeg",
"FMZOY/FMZOY_5.jpeg",
"FMZOY/FMZOY_6.jpeg",
"FMZOY/FMZOY_7.jpeg",
"FMZOY/FMZOY_8.jpeg",
"FMZOY/FMZOY_9.jpeg",
"FMZOY/FMZOY_10.jpeg",
"FMZOY/FMZOY_11.jpeg",
"FMZOY/FMZOY_12.jpeg",
"FMZOY/FMZOY_13.jpeg",
"FMZOY/FMZOY_14.jpeg",
"FMZOY/FMZOY_15.jpeg",
"FMZOY/FMZOY_16.jpeg",
"FMZOY/FMZOY_17.jpeg",
"FMZOY/FMZOY_18.jpeg",
"FMZOY/FMZOY_19.jpeg",
"FMZOY/FMZOY_20.jpeg",
"FMZOY/FMZOY_21.jpeg",
"FMZOY/FMZOY_22.jpeg",
"FMZOY/FMZOY_23.jpeg",
"FMZOY/FMZOY_24.jpeg",
"FMZOY/FMZOY_25.jpeg",
"FMZOY/FMZOY_26.jpeg",
"FMZOY/FMZOY_27.jpeg",
"FMZOY/FMZOY_28.jpeg",
"FMZOY/FMZOY_29.jpeg",
"FMZOY/FMZOY_30.jpeg",
"FMZOY/FMZOY_31.jpeg",
"FMZOY/FMZOY_32.jpeg",
"FMZOY/FMZOY_33.jpeg",
"FMZOY/FMZOY_34.jpeg",
"FMZOY/FMZOY_35.jpeg",
"FMZOY/FMZOY_36.jpeg",
"FMZOY/FMZOY_37.jpeg",
"FMZOY/FMZOY_38.jpeg",
"FMZOY/FMZOY_39.jpeg",
"FMZOY/FMZOY_40.jpeg",
"FMZOY/FMZOY_41.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"52-0.jpg"
]
} | At the end of the video. |
53 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}When in the video sequence do we observe the action 'person open a box'?",
"images_path": [
"HHNTA/HHNTA_0.jpeg",
"HHNTA/HHNTA_1.jpeg",
"HHNTA/HHNTA_2.jpeg",
"HHNTA/HHNTA_3.jpeg",
"HHNTA/HHNTA_4.jpeg",
"HHNTA/HHNTA_5.jpeg",
"HHNTA/HHNTA_6.jpeg",
"HHNTA/HHNTA_7.jpeg",
"HHNTA/HHNTA_8.jpeg",
"HHNTA/HHNTA_9.jpeg",
"HHNTA/HHNTA_10.jpeg",
"HHNTA/HHNTA_11.jpeg",
"HHNTA/HHNTA_12.jpeg",
"HHNTA/HHNTA_13.jpeg",
"HHNTA/HHNTA_14.jpeg",
"HHNTA/HHNTA_15.jpeg",
"HHNTA/HHNTA_16.jpeg",
"HHNTA/HHNTA_17.jpeg",
"HHNTA/HHNTA_18.jpeg",
"HHNTA/HHNTA_19.jpeg",
"HHNTA/HHNTA_20.jpeg",
"HHNTA/HHNTA_21.jpeg",
"HHNTA/HHNTA_22.jpeg",
"HHNTA/HHNTA_23.jpeg",
"HHNTA/HHNTA_24.jpeg",
"HHNTA/HHNTA_25.jpeg",
"HHNTA/HHNTA_26.jpeg",
"HHNTA/HHNTA_27.jpeg",
"HHNTA/HHNTA_28.jpeg",
"HHNTA/HHNTA_29.jpeg",
"HHNTA/HHNTA_30.jpeg",
"HHNTA/HHNTA_31.jpeg",
"HHNTA/HHNTA_32.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"53-0.jpg"
]
} | At the end of the video. |
54 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}At what moment in the video does the action 'person takes a drink from a coffee cup' occur?",
"images_path": [
"DOYQE/DOYQE_0.jpeg",
"DOYQE/DOYQE_1.jpeg",
"DOYQE/DOYQE_2.jpeg",
"DOYQE/DOYQE_3.jpeg",
"DOYQE/DOYQE_4.jpeg",
"DOYQE/DOYQE_5.jpeg",
"DOYQE/DOYQE_6.jpeg",
"DOYQE/DOYQE_7.jpeg",
"DOYQE/DOYQE_8.jpeg",
"DOYQE/DOYQE_9.jpeg",
"DOYQE/DOYQE_10.jpeg",
"DOYQE/DOYQE_11.jpeg",
"DOYQE/DOYQE_12.jpeg",
"DOYQE/DOYQE_13.jpeg",
"DOYQE/DOYQE_14.jpeg",
"DOYQE/DOYQE_15.jpeg",
"DOYQE/DOYQE_16.jpeg",
"DOYQE/DOYQE_17.jpeg",
"DOYQE/DOYQE_18.jpeg",
"DOYQE/DOYQE_19.jpeg",
"DOYQE/DOYQE_20.jpeg",
"DOYQE/DOYQE_21.jpeg",
"DOYQE/DOYQE_22.jpeg",
"DOYQE/DOYQE_23.jpeg",
"DOYQE/DOYQE_24.jpeg",
"DOYQE/DOYQE_25.jpeg",
"DOYQE/DOYQE_26.jpeg",
"DOYQE/DOYQE_27.jpeg",
"DOYQE/DOYQE_28.jpeg",
"DOYQE/DOYQE_29.jpeg",
"DOYQE/DOYQE_30.jpeg",
"DOYQE/DOYQE_31.jpeg",
"DOYQE/DOYQE_32.jpeg",
"DOYQE/DOYQE_33.jpeg",
"DOYQE/DOYQE_34.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"54-0.jpg"
]
} | At the end of the video. |
55 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}Can you identify when the action 'person opens cabinets above it' happens in the video?",
"images_path": [
"ZMY8M/ZMY8M_0.jpeg",
"ZMY8M/ZMY8M_1.jpeg",
"ZMY8M/ZMY8M_2.jpeg",
"ZMY8M/ZMY8M_3.jpeg",
"ZMY8M/ZMY8M_4.jpeg",
"ZMY8M/ZMY8M_5.jpeg",
"ZMY8M/ZMY8M_6.jpeg",
"ZMY8M/ZMY8M_7.jpeg",
"ZMY8M/ZMY8M_8.jpeg",
"ZMY8M/ZMY8M_9.jpeg",
"ZMY8M/ZMY8M_10.jpeg",
"ZMY8M/ZMY8M_11.jpeg",
"ZMY8M/ZMY8M_12.jpeg",
"ZMY8M/ZMY8M_13.jpeg",
"ZMY8M/ZMY8M_14.jpeg",
"ZMY8M/ZMY8M_15.jpeg",
"ZMY8M/ZMY8M_16.jpeg",
"ZMY8M/ZMY8M_17.jpeg",
"ZMY8M/ZMY8M_18.jpeg",
"ZMY8M/ZMY8M_19.jpeg",
"ZMY8M/ZMY8M_20.jpeg",
"ZMY8M/ZMY8M_21.jpeg",
"ZMY8M/ZMY8M_22.jpeg",
"ZMY8M/ZMY8M_23.jpeg",
"ZMY8M/ZMY8M_24.jpeg",
"ZMY8M/ZMY8M_25.jpeg",
"ZMY8M/ZMY8M_26.jpeg",
"ZMY8M/ZMY8M_27.jpeg",
"ZMY8M/ZMY8M_28.jpeg",
"ZMY8M/ZMY8M_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"55-0.jpg"
]
} | At the end of the video. |
56 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}At what moment in the video does the action 'person open a box' occur?",
"images_path": [
"FNBYE/FNBYE_0.jpeg",
"FNBYE/FNBYE_1.jpeg",
"FNBYE/FNBYE_2.jpeg",
"FNBYE/FNBYE_3.jpeg",
"FNBYE/FNBYE_4.jpeg",
"FNBYE/FNBYE_5.jpeg",
"FNBYE/FNBYE_6.jpeg",
"FNBYE/FNBYE_7.jpeg",
"FNBYE/FNBYE_8.jpeg",
"FNBYE/FNBYE_9.jpeg",
"FNBYE/FNBYE_10.jpeg",
"FNBYE/FNBYE_11.jpeg",
"FNBYE/FNBYE_12.jpeg",
"FNBYE/FNBYE_13.jpeg",
"FNBYE/FNBYE_14.jpeg",
"FNBYE/FNBYE_15.jpeg",
"FNBYE/FNBYE_16.jpeg",
"FNBYE/FNBYE_17.jpeg",
"FNBYE/FNBYE_18.jpeg",
"FNBYE/FNBYE_19.jpeg",
"FNBYE/FNBYE_20.jpeg",
"FNBYE/FNBYE_21.jpeg",
"FNBYE/FNBYE_22.jpeg",
"FNBYE/FNBYE_23.jpeg",
"FNBYE/FNBYE_24.jpeg",
"FNBYE/FNBYE_25.jpeg",
"FNBYE/FNBYE_26.jpeg",
"FNBYE/FNBYE_27.jpeg",
"FNBYE/FNBYE_28.jpeg",
"FNBYE/FNBYE_29.jpeg",
"FNBYE/FNBYE_30.jpeg",
"FNBYE/FNBYE_31.jpeg",
"FNBYE/FNBYE_32.jpeg",
"FNBYE/FNBYE_33.jpeg",
"FNBYE/FNBYE_34.jpeg",
"FNBYE/FNBYE_35.jpeg",
"FNBYE/FNBYE_36.jpeg",
"FNBYE/FNBYE_37.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"56-0.jpg"
]
} | At the end of the video. |
57 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}In the given video, when does the action 'person close a cabinet' take place?",
"images_path": [
"2ZICJ/2ZICJ_0.jpeg",
"2ZICJ/2ZICJ_1.jpeg",
"2ZICJ/2ZICJ_2.jpeg",
"2ZICJ/2ZICJ_3.jpeg",
"2ZICJ/2ZICJ_4.jpeg",
"2ZICJ/2ZICJ_5.jpeg",
"2ZICJ/2ZICJ_6.jpeg",
"2ZICJ/2ZICJ_7.jpeg",
"2ZICJ/2ZICJ_8.jpeg",
"2ZICJ/2ZICJ_9.jpeg",
"2ZICJ/2ZICJ_10.jpeg",
"2ZICJ/2ZICJ_11.jpeg",
"2ZICJ/2ZICJ_12.jpeg",
"2ZICJ/2ZICJ_13.jpeg",
"2ZICJ/2ZICJ_14.jpeg",
"2ZICJ/2ZICJ_15.jpeg",
"2ZICJ/2ZICJ_16.jpeg",
"2ZICJ/2ZICJ_17.jpeg",
"2ZICJ/2ZICJ_18.jpeg",
"2ZICJ/2ZICJ_19.jpeg",
"2ZICJ/2ZICJ_20.jpeg",
"2ZICJ/2ZICJ_21.jpeg",
"2ZICJ/2ZICJ_22.jpeg",
"2ZICJ/2ZICJ_23.jpeg",
"2ZICJ/2ZICJ_24.jpeg",
"2ZICJ/2ZICJ_25.jpeg",
"2ZICJ/2ZICJ_26.jpeg",
"2ZICJ/2ZICJ_27.jpeg",
"2ZICJ/2ZICJ_28.jpeg",
"2ZICJ/2ZICJ_29.jpeg",
"2ZICJ/2ZICJ_30.jpeg",
"2ZICJ/2ZICJ_31.jpeg",
"2ZICJ/2ZICJ_32.jpeg",
"2ZICJ/2ZICJ_33.jpeg",
"2ZICJ/2ZICJ_34.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"57-0.jpg"
]
} | At the end of the video. |
58 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}In the given video, when does the action 'person eats a sandwich' take place?",
"images_path": [
"OKHVL/OKHVL_0.jpeg",
"OKHVL/OKHVL_1.jpeg",
"OKHVL/OKHVL_2.jpeg",
"OKHVL/OKHVL_3.jpeg",
"OKHVL/OKHVL_4.jpeg",
"OKHVL/OKHVL_5.jpeg",
"OKHVL/OKHVL_6.jpeg",
"OKHVL/OKHVL_7.jpeg",
"OKHVL/OKHVL_8.jpeg",
"OKHVL/OKHVL_9.jpeg",
"OKHVL/OKHVL_10.jpeg",
"OKHVL/OKHVL_11.jpeg",
"OKHVL/OKHVL_12.jpeg",
"OKHVL/OKHVL_13.jpeg",
"OKHVL/OKHVL_14.jpeg",
"OKHVL/OKHVL_15.jpeg",
"OKHVL/OKHVL_16.jpeg",
"OKHVL/OKHVL_17.jpeg",
"OKHVL/OKHVL_18.jpeg",
"OKHVL/OKHVL_19.jpeg",
"OKHVL/OKHVL_20.jpeg",
"OKHVL/OKHVL_21.jpeg",
"OKHVL/OKHVL_22.jpeg",
"OKHVL/OKHVL_23.jpeg",
"OKHVL/OKHVL_24.jpeg",
"OKHVL/OKHVL_25.jpeg",
"OKHVL/OKHVL_26.jpeg",
"OKHVL/OKHVL_27.jpeg",
"OKHVL/OKHVL_28.jpeg",
"OKHVL/OKHVL_29.jpeg",
"OKHVL/OKHVL_30.jpeg",
"OKHVL/OKHVL_31.jpeg",
"OKHVL/OKHVL_32.jpeg",
"OKHVL/OKHVL_33.jpeg",
"OKHVL/OKHVL_34.jpeg",
"OKHVL/OKHVL_35.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"58-0.jpg"
]
} | At the end of the video. |
59 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}At what moment in the video does the action 'person turns on the light' occur?",
"images_path": [
"LGJAR/LGJAR_0.jpeg",
"LGJAR/LGJAR_1.jpeg",
"LGJAR/LGJAR_2.jpeg",
"LGJAR/LGJAR_3.jpeg",
"LGJAR/LGJAR_4.jpeg",
"LGJAR/LGJAR_5.jpeg",
"LGJAR/LGJAR_6.jpeg",
"LGJAR/LGJAR_7.jpeg",
"LGJAR/LGJAR_8.jpeg",
"LGJAR/LGJAR_9.jpeg",
"LGJAR/LGJAR_10.jpeg",
"LGJAR/LGJAR_11.jpeg",
"LGJAR/LGJAR_12.jpeg",
"LGJAR/LGJAR_13.jpeg",
"LGJAR/LGJAR_14.jpeg",
"LGJAR/LGJAR_15.jpeg",
"LGJAR/LGJAR_16.jpeg",
"LGJAR/LGJAR_17.jpeg",
"LGJAR/LGJAR_18.jpeg",
"LGJAR/LGJAR_19.jpeg",
"LGJAR/LGJAR_20.jpeg",
"LGJAR/LGJAR_21.jpeg",
"LGJAR/LGJAR_22.jpeg",
"LGJAR/LGJAR_23.jpeg",
"LGJAR/LGJAR_24.jpeg",
"LGJAR/LGJAR_25.jpeg",
"LGJAR/LGJAR_26.jpeg",
"LGJAR/LGJAR_27.jpeg",
"LGJAR/LGJAR_28.jpeg",
"LGJAR/LGJAR_29.jpeg",
"LGJAR/LGJAR_30.jpeg",
"LGJAR/LGJAR_31.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"59-0.jpg"
]
} | At the end of the video. |
60 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}During which part of the video does the action 'person takes a blanket' occur?",
"images_path": [
"9POJB/9POJB_0.jpeg",
"9POJB/9POJB_1.jpeg",
"9POJB/9POJB_2.jpeg",
"9POJB/9POJB_3.jpeg",
"9POJB/9POJB_4.jpeg",
"9POJB/9POJB_5.jpeg",
"9POJB/9POJB_6.jpeg",
"9POJB/9POJB_7.jpeg",
"9POJB/9POJB_8.jpeg",
"9POJB/9POJB_9.jpeg",
"9POJB/9POJB_10.jpeg",
"9POJB/9POJB_11.jpeg",
"9POJB/9POJB_12.jpeg",
"9POJB/9POJB_13.jpeg",
"9POJB/9POJB_14.jpeg",
"9POJB/9POJB_15.jpeg",
"9POJB/9POJB_16.jpeg",
"9POJB/9POJB_17.jpeg",
"9POJB/9POJB_18.jpeg",
"9POJB/9POJB_19.jpeg",
"9POJB/9POJB_20.jpeg",
"9POJB/9POJB_21.jpeg",
"9POJB/9POJB_22.jpeg",
"9POJB/9POJB_23.jpeg",
"9POJB/9POJB_24.jpeg",
"9POJB/9POJB_25.jpeg",
"9POJB/9POJB_26.jpeg",
"9POJB/9POJB_27.jpeg",
"9POJB/9POJB_28.jpeg",
"9POJB/9POJB_29.jpeg",
"9POJB/9POJB_30.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"60-0.jpg"
]
} | At the end of the video. |
61 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}In the given video, when does the action 'the person in the doorway walks away holding the towel' take place?",
"images_path": [
"NI15V/NI15V_0.jpeg",
"NI15V/NI15V_1.jpeg",
"NI15V/NI15V_2.jpeg",
"NI15V/NI15V_3.jpeg",
"NI15V/NI15V_4.jpeg",
"NI15V/NI15V_5.jpeg",
"NI15V/NI15V_6.jpeg",
"NI15V/NI15V_7.jpeg",
"NI15V/NI15V_8.jpeg",
"NI15V/NI15V_9.jpeg",
"NI15V/NI15V_10.jpeg",
"NI15V/NI15V_11.jpeg",
"NI15V/NI15V_12.jpeg",
"NI15V/NI15V_13.jpeg",
"NI15V/NI15V_14.jpeg",
"NI15V/NI15V_15.jpeg",
"NI15V/NI15V_16.jpeg",
"NI15V/NI15V_17.jpeg",
"NI15V/NI15V_18.jpeg",
"NI15V/NI15V_19.jpeg",
"NI15V/NI15V_20.jpeg",
"NI15V/NI15V_21.jpeg",
"NI15V/NI15V_22.jpeg",
"NI15V/NI15V_23.jpeg",
"NI15V/NI15V_24.jpeg",
"NI15V/NI15V_25.jpeg",
"NI15V/NI15V_26.jpeg",
"NI15V/NI15V_27.jpeg",
"NI15V/NI15V_28.jpeg",
"NI15V/NI15V_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"61-0.jpg"
]
} | At the end of the video. |
62 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}During which part of the video does the action 'person closes the door' occur?",
"images_path": [
"6JLD4/6JLD4_0.jpeg",
"6JLD4/6JLD4_1.jpeg",
"6JLD4/6JLD4_2.jpeg",
"6JLD4/6JLD4_3.jpeg",
"6JLD4/6JLD4_4.jpeg",
"6JLD4/6JLD4_5.jpeg",
"6JLD4/6JLD4_6.jpeg",
"6JLD4/6JLD4_7.jpeg",
"6JLD4/6JLD4_8.jpeg",
"6JLD4/6JLD4_9.jpeg",
"6JLD4/6JLD4_10.jpeg",
"6JLD4/6JLD4_11.jpeg",
"6JLD4/6JLD4_12.jpeg",
"6JLD4/6JLD4_13.jpeg",
"6JLD4/6JLD4_14.jpeg",
"6JLD4/6JLD4_15.jpeg",
"6JLD4/6JLD4_16.jpeg",
"6JLD4/6JLD4_17.jpeg",
"6JLD4/6JLD4_18.jpeg",
"6JLD4/6JLD4_19.jpeg",
"6JLD4/6JLD4_20.jpeg",
"6JLD4/6JLD4_21.jpeg",
"6JLD4/6JLD4_22.jpeg",
"6JLD4/6JLD4_23.jpeg",
"6JLD4/6JLD4_24.jpeg",
"6JLD4/6JLD4_25.jpeg",
"6JLD4/6JLD4_26.jpeg",
"6JLD4/6JLD4_27.jpeg",
"6JLD4/6JLD4_28.jpeg",
"6JLD4/6JLD4_29.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"62-0.jpg"
]
} | At the end of the video. |
63 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}During which part of the video does the action 'person sits down at the table' occur?",
"images_path": [
"BI6Y4/BI6Y4_0.jpeg",
"BI6Y4/BI6Y4_1.jpeg",
"BI6Y4/BI6Y4_2.jpeg",
"BI6Y4/BI6Y4_3.jpeg",
"BI6Y4/BI6Y4_4.jpeg",
"BI6Y4/BI6Y4_5.jpeg",
"BI6Y4/BI6Y4_6.jpeg",
"BI6Y4/BI6Y4_7.jpeg",
"BI6Y4/BI6Y4_8.jpeg",
"BI6Y4/BI6Y4_9.jpeg",
"BI6Y4/BI6Y4_10.jpeg",
"BI6Y4/BI6Y4_11.jpeg",
"BI6Y4/BI6Y4_12.jpeg",
"BI6Y4/BI6Y4_13.jpeg",
"BI6Y4/BI6Y4_14.jpeg",
"BI6Y4/BI6Y4_15.jpeg",
"BI6Y4/BI6Y4_16.jpeg",
"BI6Y4/BI6Y4_17.jpeg",
"BI6Y4/BI6Y4_18.jpeg",
"BI6Y4/BI6Y4_19.jpeg",
"BI6Y4/BI6Y4_20.jpeg",
"BI6Y4/BI6Y4_21.jpeg",
"BI6Y4/BI6Y4_22.jpeg",
"BI6Y4/BI6Y4_23.jpeg",
"BI6Y4/BI6Y4_24.jpeg",
"BI6Y4/BI6Y4_25.jpeg",
"BI6Y4/BI6Y4_26.jpeg",
"BI6Y4/BI6Y4_27.jpeg",
"BI6Y4/BI6Y4_28.jpeg",
"BI6Y4/BI6Y4_29.jpeg",
"BI6Y4/BI6Y4_30.jpeg",
"BI6Y4/BI6Y4_31.jpeg",
"BI6Y4/BI6Y4_32.jpeg",
"BI6Y4/BI6Y4_33.jpeg"
],
"choice_list": [
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"63-0.jpg"
]
} | At the end of the video. |
64 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}During which part of the video does the action 'person put the bottle in the garbage' occur?",
"images_path": [
"JIUH7/JIUH7_0.jpeg",
"JIUH7/JIUH7_1.jpeg",
"JIUH7/JIUH7_2.jpeg",
"JIUH7/JIUH7_3.jpeg",
"JIUH7/JIUH7_4.jpeg",
"JIUH7/JIUH7_5.jpeg",
"JIUH7/JIUH7_6.jpeg",
"JIUH7/JIUH7_7.jpeg",
"JIUH7/JIUH7_8.jpeg",
"JIUH7/JIUH7_9.jpeg",
"JIUH7/JIUH7_10.jpeg",
"JIUH7/JIUH7_11.jpeg",
"JIUH7/JIUH7_12.jpeg",
"JIUH7/JIUH7_13.jpeg",
"JIUH7/JIUH7_14.jpeg",
"JIUH7/JIUH7_15.jpeg",
"JIUH7/JIUH7_16.jpeg",
"JIUH7/JIUH7_17.jpeg",
"JIUH7/JIUH7_18.jpeg",
"JIUH7/JIUH7_19.jpeg",
"JIUH7/JIUH7_20.jpeg",
"JIUH7/JIUH7_21.jpeg",
"JIUH7/JIUH7_22.jpeg",
"JIUH7/JIUH7_23.jpeg",
"JIUH7/JIUH7_24.jpeg",
"JIUH7/JIUH7_25.jpeg",
"JIUH7/JIUH7_26.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"64-0.jpg"
]
} | At the end of the video. |
65 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}Can you identify when the action 'person puts the remote on a shelf' happens in the video?",
"images_path": [
"LGPWK/LGPWK_0.jpeg",
"LGPWK/LGPWK_1.jpeg",
"LGPWK/LGPWK_2.jpeg",
"LGPWK/LGPWK_3.jpeg",
"LGPWK/LGPWK_4.jpeg",
"LGPWK/LGPWK_5.jpeg",
"LGPWK/LGPWK_6.jpeg",
"LGPWK/LGPWK_7.jpeg",
"LGPWK/LGPWK_8.jpeg",
"LGPWK/LGPWK_9.jpeg",
"LGPWK/LGPWK_10.jpeg",
"LGPWK/LGPWK_11.jpeg",
"LGPWK/LGPWK_12.jpeg",
"LGPWK/LGPWK_13.jpeg",
"LGPWK/LGPWK_14.jpeg",
"LGPWK/LGPWK_15.jpeg",
"LGPWK/LGPWK_16.jpeg",
"LGPWK/LGPWK_17.jpeg",
"LGPWK/LGPWK_18.jpeg",
"LGPWK/LGPWK_19.jpeg",
"LGPWK/LGPWK_20.jpeg",
"LGPWK/LGPWK_21.jpeg",
"LGPWK/LGPWK_22.jpeg",
"LGPWK/LGPWK_23.jpeg",
"LGPWK/LGPWK_24.jpeg",
"LGPWK/LGPWK_25.jpeg",
"LGPWK/LGPWK_26.jpeg",
"LGPWK/LGPWK_27.jpeg",
"LGPWK/LGPWK_28.jpeg",
"LGPWK/LGPWK_29.jpeg",
"LGPWK/LGPWK_30.jpeg",
"LGPWK/LGPWK_31.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"65-0.jpg"
]
} | At the end of the video. |
66 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}In the given video, when does the action 'person put on their shoes' take place?",
"images_path": [
"759MY/759MY_0.jpeg",
"759MY/759MY_1.jpeg",
"759MY/759MY_2.jpeg",
"759MY/759MY_3.jpeg",
"759MY/759MY_4.jpeg",
"759MY/759MY_5.jpeg",
"759MY/759MY_6.jpeg",
"759MY/759MY_7.jpeg",
"759MY/759MY_8.jpeg",
"759MY/759MY_9.jpeg",
"759MY/759MY_10.jpeg",
"759MY/759MY_11.jpeg",
"759MY/759MY_12.jpeg",
"759MY/759MY_13.jpeg",
"759MY/759MY_14.jpeg",
"759MY/759MY_15.jpeg",
"759MY/759MY_16.jpeg",
"759MY/759MY_17.jpeg",
"759MY/759MY_18.jpeg",
"759MY/759MY_19.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"66-0.jpg"
]
} | At the end of the video. |
67 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}At what moment in the video does the action 'person puts the cup on a table' occur?",
"images_path": [
"IBWAW/IBWAW_0.jpeg",
"IBWAW/IBWAW_1.jpeg",
"IBWAW/IBWAW_2.jpeg",
"IBWAW/IBWAW_3.jpeg",
"IBWAW/IBWAW_4.jpeg",
"IBWAW/IBWAW_5.jpeg",
"IBWAW/IBWAW_6.jpeg",
"IBWAW/IBWAW_7.jpeg",
"IBWAW/IBWAW_8.jpeg",
"IBWAW/IBWAW_9.jpeg",
"IBWAW/IBWAW_10.jpeg",
"IBWAW/IBWAW_11.jpeg",
"IBWAW/IBWAW_12.jpeg",
"IBWAW/IBWAW_13.jpeg",
"IBWAW/IBWAW_14.jpeg",
"IBWAW/IBWAW_15.jpeg",
"IBWAW/IBWAW_16.jpeg",
"IBWAW/IBWAW_17.jpeg",
"IBWAW/IBWAW_18.jpeg",
"IBWAW/IBWAW_19.jpeg",
"IBWAW/IBWAW_20.jpeg",
"IBWAW/IBWAW_21.jpeg",
"IBWAW/IBWAW_22.jpeg",
"IBWAW/IBWAW_23.jpeg",
"IBWAW/IBWAW_24.jpeg",
"IBWAW/IBWAW_25.jpeg",
"IBWAW/IBWAW_26.jpeg",
"IBWAW/IBWAW_27.jpeg",
"IBWAW/IBWAW_28.jpeg",
"IBWAW/IBWAW_29.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"67-0.jpg"
]
} | At the end of the video. |
68 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}During which part of the video does the action 'person takes a drink from a cup' occur?",
"images_path": [
"QOYH2/QOYH2_0.jpeg",
"QOYH2/QOYH2_1.jpeg",
"QOYH2/QOYH2_2.jpeg",
"QOYH2/QOYH2_3.jpeg",
"QOYH2/QOYH2_4.jpeg",
"QOYH2/QOYH2_5.jpeg",
"QOYH2/QOYH2_6.jpeg",
"QOYH2/QOYH2_7.jpeg",
"QOYH2/QOYH2_8.jpeg",
"QOYH2/QOYH2_9.jpeg",
"QOYH2/QOYH2_10.jpeg",
"QOYH2/QOYH2_11.jpeg",
"QOYH2/QOYH2_12.jpeg",
"QOYH2/QOYH2_13.jpeg",
"QOYH2/QOYH2_14.jpeg",
"QOYH2/QOYH2_15.jpeg",
"QOYH2/QOYH2_16.jpeg",
"QOYH2/QOYH2_17.jpeg",
"QOYH2/QOYH2_18.jpeg",
"QOYH2/QOYH2_19.jpeg",
"QOYH2/QOYH2_20.jpeg",
"QOYH2/QOYH2_21.jpeg",
"QOYH2/QOYH2_22.jpeg",
"QOYH2/QOYH2_23.jpeg",
"QOYH2/QOYH2_24.jpeg",
"QOYH2/QOYH2_25.jpeg",
"QOYH2/QOYH2_26.jpeg",
"QOYH2/QOYH2_27.jpeg",
"QOYH2/QOYH2_28.jpeg",
"QOYH2/QOYH2_29.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"68-0.jpg"
]
} | At the end of the video. |
69 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}When in the video sequence do we observe the action 'person opens a laptop up'?",
"images_path": [
"AB2V6/AB2V6_0.jpeg",
"AB2V6/AB2V6_1.jpeg",
"AB2V6/AB2V6_2.jpeg",
"AB2V6/AB2V6_3.jpeg",
"AB2V6/AB2V6_4.jpeg",
"AB2V6/AB2V6_5.jpeg",
"AB2V6/AB2V6_6.jpeg",
"AB2V6/AB2V6_7.jpeg",
"AB2V6/AB2V6_8.jpeg",
"AB2V6/AB2V6_9.jpeg",
"AB2V6/AB2V6_10.jpeg",
"AB2V6/AB2V6_11.jpeg",
"AB2V6/AB2V6_12.jpeg",
"AB2V6/AB2V6_13.jpeg",
"AB2V6/AB2V6_14.jpeg",
"AB2V6/AB2V6_15.jpeg",
"AB2V6/AB2V6_16.jpeg",
"AB2V6/AB2V6_17.jpeg",
"AB2V6/AB2V6_18.jpeg",
"AB2V6/AB2V6_19.jpeg",
"AB2V6/AB2V6_20.jpeg",
"AB2V6/AB2V6_21.jpeg",
"AB2V6/AB2V6_22.jpeg",
"AB2V6/AB2V6_23.jpeg",
"AB2V6/AB2V6_24.jpeg",
"AB2V6/AB2V6_25.jpeg",
"AB2V6/AB2V6_26.jpeg",
"AB2V6/AB2V6_27.jpeg",
"AB2V6/AB2V6_28.jpeg",
"AB2V6/AB2V6_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"69-0.jpg"
]
} | At the end of the video. |
70 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}In the given video, when does the action 'the person closes the book' take place?",
"images_path": [
"QLFR5/QLFR5_0.jpeg",
"QLFR5/QLFR5_1.jpeg",
"QLFR5/QLFR5_2.jpeg",
"QLFR5/QLFR5_3.jpeg",
"QLFR5/QLFR5_4.jpeg",
"QLFR5/QLFR5_5.jpeg",
"QLFR5/QLFR5_6.jpeg",
"QLFR5/QLFR5_7.jpeg",
"QLFR5/QLFR5_8.jpeg",
"QLFR5/QLFR5_9.jpeg",
"QLFR5/QLFR5_10.jpeg",
"QLFR5/QLFR5_11.jpeg",
"QLFR5/QLFR5_12.jpeg",
"QLFR5/QLFR5_13.jpeg",
"QLFR5/QLFR5_14.jpeg",
"QLFR5/QLFR5_15.jpeg",
"QLFR5/QLFR5_16.jpeg",
"QLFR5/QLFR5_17.jpeg",
"QLFR5/QLFR5_18.jpeg",
"QLFR5/QLFR5_19.jpeg",
"QLFR5/QLFR5_20.jpeg",
"QLFR5/QLFR5_21.jpeg",
"QLFR5/QLFR5_22.jpeg",
"QLFR5/QLFR5_23.jpeg",
"QLFR5/QLFR5_24.jpeg",
"QLFR5/QLFR5_25.jpeg",
"QLFR5/QLFR5_26.jpeg",
"QLFR5/QLFR5_27.jpeg",
"QLFR5/QLFR5_28.jpeg",
"QLFR5/QLFR5_29.jpeg",
"QLFR5/QLFR5_30.jpeg",
"QLFR5/QLFR5_31.jpeg",
"QLFR5/QLFR5_32.jpeg",
"QLFR5/QLFR5_33.jpeg",
"QLFR5/QLFR5_34.jpeg",
"QLFR5/QLFR5_35.jpeg",
"QLFR5/QLFR5_36.jpeg",
"QLFR5/QLFR5_37.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"70-0.jpg"
]
} | At the end of the video. |
71 | Evaluate the presented graphics and infer the timing of the action in the question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}In the given video, when does the action 'the person puts the book down' take place?",
"images_path": [
"IKJGO/IKJGO_0.jpeg",
"IKJGO/IKJGO_1.jpeg",
"IKJGO/IKJGO_2.jpeg",
"IKJGO/IKJGO_3.jpeg",
"IKJGO/IKJGO_4.jpeg",
"IKJGO/IKJGO_5.jpeg",
"IKJGO/IKJGO_6.jpeg",
"IKJGO/IKJGO_7.jpeg",
"IKJGO/IKJGO_8.jpeg",
"IKJGO/IKJGO_9.jpeg",
"IKJGO/IKJGO_10.jpeg",
"IKJGO/IKJGO_11.jpeg",
"IKJGO/IKJGO_12.jpeg",
"IKJGO/IKJGO_13.jpeg",
"IKJGO/IKJGO_14.jpeg",
"IKJGO/IKJGO_15.jpeg",
"IKJGO/IKJGO_16.jpeg",
"IKJGO/IKJGO_17.jpeg",
"IKJGO/IKJGO_18.jpeg",
"IKJGO/IKJGO_19.jpeg",
"IKJGO/IKJGO_20.jpeg",
"IKJGO/IKJGO_21.jpeg",
"IKJGO/IKJGO_22.jpeg",
"IKJGO/IKJGO_23.jpeg",
"IKJGO/IKJGO_24.jpeg",
"IKJGO/IKJGO_25.jpeg",
"IKJGO/IKJGO_26.jpeg",
"IKJGO/IKJGO_27.jpeg",
"IKJGO/IKJGO_28.jpeg",
"IKJGO/IKJGO_29.jpeg",
"IKJGO/IKJGO_30.jpeg",
"IKJGO/IKJGO_31.jpeg",
"IKJGO/IKJGO_32.jpeg",
"IKJGO/IKJGO_33.jpeg",
"IKJGO/IKJGO_34.jpeg",
"IKJGO/IKJGO_35.jpeg",
"IKJGO/IKJGO_36.jpeg",
"IKJGO/IKJGO_37.jpeg",
"IKJGO/IKJGO_38.jpeg",
"IKJGO/IKJGO_39.jpeg",
"IKJGO/IKJGO_40.jpeg",
"IKJGO/IKJGO_41.jpeg",
"IKJGO/IKJGO_42.jpeg",
"IKJGO/IKJGO_43.jpeg",
"IKJGO/IKJGO_44.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"71-0.jpg"
]
} | At the end of the video. |
72 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}Can you identify when the action 'person close the closet door' happens in the video?",
"images_path": [
"WKPQ3/WKPQ3_0.jpeg",
"WKPQ3/WKPQ3_1.jpeg",
"WKPQ3/WKPQ3_2.jpeg",
"WKPQ3/WKPQ3_3.jpeg",
"WKPQ3/WKPQ3_4.jpeg",
"WKPQ3/WKPQ3_5.jpeg",
"WKPQ3/WKPQ3_6.jpeg",
"WKPQ3/WKPQ3_7.jpeg",
"WKPQ3/WKPQ3_8.jpeg",
"WKPQ3/WKPQ3_9.jpeg",
"WKPQ3/WKPQ3_10.jpeg",
"WKPQ3/WKPQ3_11.jpeg",
"WKPQ3/WKPQ3_12.jpeg",
"WKPQ3/WKPQ3_13.jpeg",
"WKPQ3/WKPQ3_14.jpeg",
"WKPQ3/WKPQ3_15.jpeg",
"WKPQ3/WKPQ3_16.jpeg",
"WKPQ3/WKPQ3_17.jpeg",
"WKPQ3/WKPQ3_18.jpeg",
"WKPQ3/WKPQ3_19.jpeg",
"WKPQ3/WKPQ3_20.jpeg",
"WKPQ3/WKPQ3_21.jpeg",
"WKPQ3/WKPQ3_22.jpeg",
"WKPQ3/WKPQ3_23.jpeg",
"WKPQ3/WKPQ3_24.jpeg",
"WKPQ3/WKPQ3_25.jpeg",
"WKPQ3/WKPQ3_26.jpeg",
"WKPQ3/WKPQ3_27.jpeg",
"WKPQ3/WKPQ3_28.jpeg",
"WKPQ3/WKPQ3_29.jpeg",
"WKPQ3/WKPQ3_30.jpeg",
"WKPQ3/WKPQ3_31.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"72-0.jpg"
]
} | At the end of the video. |
73 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}During which part of the video does the action 'person open book' occur?",
"images_path": [
"XXS99/XXS99_0.jpeg",
"XXS99/XXS99_1.jpeg",
"XXS99/XXS99_2.jpeg",
"XXS99/XXS99_3.jpeg",
"XXS99/XXS99_4.jpeg",
"XXS99/XXS99_5.jpeg",
"XXS99/XXS99_6.jpeg",
"XXS99/XXS99_7.jpeg",
"XXS99/XXS99_8.jpeg",
"XXS99/XXS99_9.jpeg",
"XXS99/XXS99_10.jpeg",
"XXS99/XXS99_11.jpeg",
"XXS99/XXS99_12.jpeg",
"XXS99/XXS99_13.jpeg",
"XXS99/XXS99_14.jpeg",
"XXS99/XXS99_15.jpeg",
"XXS99/XXS99_16.jpeg",
"XXS99/XXS99_17.jpeg",
"XXS99/XXS99_18.jpeg",
"XXS99/XXS99_19.jpeg",
"XXS99/XXS99_20.jpeg",
"XXS99/XXS99_21.jpeg",
"XXS99/XXS99_22.jpeg",
"XXS99/XXS99_23.jpeg",
"XXS99/XXS99_24.jpeg",
"XXS99/XXS99_25.jpeg",
"XXS99/XXS99_26.jpeg",
"XXS99/XXS99_27.jpeg",
"XXS99/XXS99_28.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"73-0.jpg"
]
} | At the end of the video. |
74 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}Can you identify when the action 'person close a laptop' happens in the video?",
"images_path": [
"7MRKY/7MRKY_0.jpeg",
"7MRKY/7MRKY_1.jpeg",
"7MRKY/7MRKY_2.jpeg",
"7MRKY/7MRKY_3.jpeg",
"7MRKY/7MRKY_4.jpeg",
"7MRKY/7MRKY_5.jpeg",
"7MRKY/7MRKY_6.jpeg",
"7MRKY/7MRKY_7.jpeg",
"7MRKY/7MRKY_8.jpeg",
"7MRKY/7MRKY_9.jpeg",
"7MRKY/7MRKY_10.jpeg",
"7MRKY/7MRKY_11.jpeg",
"7MRKY/7MRKY_12.jpeg",
"7MRKY/7MRKY_13.jpeg",
"7MRKY/7MRKY_14.jpeg",
"7MRKY/7MRKY_15.jpeg",
"7MRKY/7MRKY_16.jpeg",
"7MRKY/7MRKY_17.jpeg",
"7MRKY/7MRKY_18.jpeg",
"7MRKY/7MRKY_19.jpeg",
"7MRKY/7MRKY_20.jpeg",
"7MRKY/7MRKY_21.jpeg",
"7MRKY/7MRKY_22.jpeg",
"7MRKY/7MRKY_23.jpeg",
"7MRKY/7MRKY_24.jpeg",
"7MRKY/7MRKY_25.jpeg",
"7MRKY/7MRKY_26.jpeg",
"7MRKY/7MRKY_27.jpeg",
"7MRKY/7MRKY_28.jpeg",
"7MRKY/7MRKY_29.jpeg",
"7MRKY/7MRKY_30.jpeg",
"7MRKY/7MRKY_31.jpeg",
"7MRKY/7MRKY_32.jpeg",
"7MRKY/7MRKY_33.jpeg",
"7MRKY/7MRKY_34.jpeg",
"7MRKY/7MRKY_35.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"74-0.jpg"
]
} | At the end of the video. |
75 | Analyze the provided visuals and determine the timing of the event in question. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}At what moment in the video does the action 'person putting glasses on' occur?",
"images_path": [
"O3HV7/O3HV7_0.jpeg",
"O3HV7/O3HV7_1.jpeg",
"O3HV7/O3HV7_2.jpeg",
"O3HV7/O3HV7_3.jpeg",
"O3HV7/O3HV7_4.jpeg",
"O3HV7/O3HV7_5.jpeg",
"O3HV7/O3HV7_6.jpeg",
"O3HV7/O3HV7_7.jpeg",
"O3HV7/O3HV7_8.jpeg",
"O3HV7/O3HV7_9.jpeg",
"O3HV7/O3HV7_10.jpeg",
"O3HV7/O3HV7_11.jpeg",
"O3HV7/O3HV7_12.jpeg",
"O3HV7/O3HV7_13.jpeg",
"O3HV7/O3HV7_14.jpeg",
"O3HV7/O3HV7_15.jpeg",
"O3HV7/O3HV7_16.jpeg",
"O3HV7/O3HV7_17.jpeg",
"O3HV7/O3HV7_18.jpeg",
"O3HV7/O3HV7_19.jpeg",
"O3HV7/O3HV7_20.jpeg",
"O3HV7/O3HV7_21.jpeg",
"O3HV7/O3HV7_22.jpeg",
"O3HV7/O3HV7_23.jpeg",
"O3HV7/O3HV7_24.jpeg",
"O3HV7/O3HV7_25.jpeg",
"O3HV7/O3HV7_26.jpeg",
"O3HV7/O3HV7_27.jpeg",
"O3HV7/O3HV7_28.jpeg",
"O3HV7/O3HV7_29.jpeg",
"O3HV7/O3HV7_30.jpeg",
"O3HV7/O3HV7_31.jpeg",
"O3HV7/O3HV7_32.jpeg",
"O3HV7/O3HV7_33.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video.",
"At the end of the video."
],
"combined_1_images": [
"75-0.jpg"
]
} | At the end of the video. |
76 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}In the given video, when does the action 'person opening a door' take place?",
"images_path": [
"HIKIC/HIKIC_0.jpeg",
"HIKIC/HIKIC_1.jpeg",
"HIKIC/HIKIC_2.jpeg",
"HIKIC/HIKIC_3.jpeg",
"HIKIC/HIKIC_4.jpeg",
"HIKIC/HIKIC_5.jpeg",
"HIKIC/HIKIC_6.jpeg",
"HIKIC/HIKIC_7.jpeg",
"HIKIC/HIKIC_8.jpeg",
"HIKIC/HIKIC_9.jpeg",
"HIKIC/HIKIC_10.jpeg",
"HIKIC/HIKIC_11.jpeg",
"HIKIC/HIKIC_12.jpeg",
"HIKIC/HIKIC_13.jpeg",
"HIKIC/HIKIC_14.jpeg",
"HIKIC/HIKIC_15.jpeg",
"HIKIC/HIKIC_16.jpeg",
"HIKIC/HIKIC_17.jpeg",
"HIKIC/HIKIC_18.jpeg",
"HIKIC/HIKIC_19.jpeg",
"HIKIC/HIKIC_20.jpeg",
"HIKIC/HIKIC_21.jpeg",
"HIKIC/HIKIC_22.jpeg",
"HIKIC/HIKIC_23.jpeg",
"HIKIC/HIKIC_24.jpeg",
"HIKIC/HIKIC_25.jpeg",
"HIKIC/HIKIC_26.jpeg",
"HIKIC/HIKIC_27.jpeg",
"HIKIC/HIKIC_28.jpeg",
"HIKIC/HIKIC_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"76-0.jpg"
]
} | At the end of the video. |
77 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}During which part of the video does the action 'the person was putting the bag into the cabinet' occur?",
"images_path": [
"J48N6/J48N6_0.jpeg",
"J48N6/J48N6_1.jpeg",
"J48N6/J48N6_2.jpeg",
"J48N6/J48N6_3.jpeg",
"J48N6/J48N6_4.jpeg",
"J48N6/J48N6_5.jpeg",
"J48N6/J48N6_6.jpeg",
"J48N6/J48N6_7.jpeg",
"J48N6/J48N6_8.jpeg",
"J48N6/J48N6_9.jpeg",
"J48N6/J48N6_10.jpeg",
"J48N6/J48N6_11.jpeg",
"J48N6/J48N6_12.jpeg",
"J48N6/J48N6_13.jpeg",
"J48N6/J48N6_14.jpeg",
"J48N6/J48N6_15.jpeg",
"J48N6/J48N6_16.jpeg",
"J48N6/J48N6_17.jpeg",
"J48N6/J48N6_18.jpeg",
"J48N6/J48N6_19.jpeg",
"J48N6/J48N6_20.jpeg",
"J48N6/J48N6_21.jpeg",
"J48N6/J48N6_22.jpeg",
"J48N6/J48N6_23.jpeg",
"J48N6/J48N6_24.jpeg",
"J48N6/J48N6_25.jpeg",
"J48N6/J48N6_26.jpeg",
"J48N6/J48N6_27.jpeg",
"J48N6/J48N6_28.jpeg",
"J48N6/J48N6_29.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"77-0.jpg"
]
} | At the end of the video. |
78 | Using the images at hand, infer when the action in the question takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}At what moment in the video does the action 'person turns off a light' occur?",
"images_path": [
"VCYH8/VCYH8_0.jpeg",
"VCYH8/VCYH8_1.jpeg",
"VCYH8/VCYH8_2.jpeg",
"VCYH8/VCYH8_3.jpeg",
"VCYH8/VCYH8_4.jpeg",
"VCYH8/VCYH8_5.jpeg",
"VCYH8/VCYH8_6.jpeg",
"VCYH8/VCYH8_7.jpeg",
"VCYH8/VCYH8_8.jpeg",
"VCYH8/VCYH8_9.jpeg",
"VCYH8/VCYH8_10.jpeg",
"VCYH8/VCYH8_11.jpeg",
"VCYH8/VCYH8_12.jpeg",
"VCYH8/VCYH8_13.jpeg",
"VCYH8/VCYH8_14.jpeg",
"VCYH8/VCYH8_15.jpeg",
"VCYH8/VCYH8_16.jpeg",
"VCYH8/VCYH8_17.jpeg",
"VCYH8/VCYH8_18.jpeg",
"VCYH8/VCYH8_19.jpeg",
"VCYH8/VCYH8_20.jpeg",
"VCYH8/VCYH8_21.jpeg",
"VCYH8/VCYH8_22.jpeg",
"VCYH8/VCYH8_23.jpeg",
"VCYH8/VCYH8_24.jpeg",
"VCYH8/VCYH8_25.jpeg",
"VCYH8/VCYH8_26.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"78-0.jpg"
]
} | At the end of the video. |
79 | Examine the given illustrations and deduce when the action in the inquiry happens. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}{image#46}{image#47}{image#48}{image#49}{image#50}{image#51}{image#52}{image#53}{image#54}{image#55}{image#56}{image#57}{image#58}During which part of the video does the action 'person stand up again' occur?",
"images_path": [
"RBQ9Y/RBQ9Y_0.jpeg",
"RBQ9Y/RBQ9Y_1.jpeg",
"RBQ9Y/RBQ9Y_2.jpeg",
"RBQ9Y/RBQ9Y_3.jpeg",
"RBQ9Y/RBQ9Y_4.jpeg",
"RBQ9Y/RBQ9Y_5.jpeg",
"RBQ9Y/RBQ9Y_6.jpeg",
"RBQ9Y/RBQ9Y_7.jpeg",
"RBQ9Y/RBQ9Y_8.jpeg",
"RBQ9Y/RBQ9Y_9.jpeg",
"RBQ9Y/RBQ9Y_10.jpeg",
"RBQ9Y/RBQ9Y_11.jpeg",
"RBQ9Y/RBQ9Y_12.jpeg",
"RBQ9Y/RBQ9Y_13.jpeg",
"RBQ9Y/RBQ9Y_14.jpeg",
"RBQ9Y/RBQ9Y_15.jpeg",
"RBQ9Y/RBQ9Y_16.jpeg",
"RBQ9Y/RBQ9Y_17.jpeg",
"RBQ9Y/RBQ9Y_18.jpeg",
"RBQ9Y/RBQ9Y_19.jpeg",
"RBQ9Y/RBQ9Y_20.jpeg",
"RBQ9Y/RBQ9Y_21.jpeg",
"RBQ9Y/RBQ9Y_22.jpeg",
"RBQ9Y/RBQ9Y_23.jpeg",
"RBQ9Y/RBQ9Y_24.jpeg",
"RBQ9Y/RBQ9Y_25.jpeg",
"RBQ9Y/RBQ9Y_26.jpeg",
"RBQ9Y/RBQ9Y_27.jpeg",
"RBQ9Y/RBQ9Y_28.jpeg",
"RBQ9Y/RBQ9Y_29.jpeg",
"RBQ9Y/RBQ9Y_30.jpeg",
"RBQ9Y/RBQ9Y_31.jpeg",
"RBQ9Y/RBQ9Y_32.jpeg",
"RBQ9Y/RBQ9Y_33.jpeg",
"RBQ9Y/RBQ9Y_34.jpeg",
"RBQ9Y/RBQ9Y_35.jpeg",
"RBQ9Y/RBQ9Y_36.jpeg",
"RBQ9Y/RBQ9Y_37.jpeg",
"RBQ9Y/RBQ9Y_38.jpeg",
"RBQ9Y/RBQ9Y_39.jpeg",
"RBQ9Y/RBQ9Y_40.jpeg",
"RBQ9Y/RBQ9Y_41.jpeg",
"RBQ9Y/RBQ9Y_42.jpeg",
"RBQ9Y/RBQ9Y_43.jpeg",
"RBQ9Y/RBQ9Y_44.jpeg",
"RBQ9Y/RBQ9Y_45.jpeg",
"RBQ9Y/RBQ9Y_46.jpeg",
"RBQ9Y/RBQ9Y_47.jpeg",
"RBQ9Y/RBQ9Y_48.jpeg",
"RBQ9Y/RBQ9Y_49.jpeg",
"RBQ9Y/RBQ9Y_50.jpeg",
"RBQ9Y/RBQ9Y_51.jpeg",
"RBQ9Y/RBQ9Y_52.jpeg",
"RBQ9Y/RBQ9Y_53.jpeg",
"RBQ9Y/RBQ9Y_54.jpeg",
"RBQ9Y/RBQ9Y_55.jpeg",
"RBQ9Y/RBQ9Y_56.jpeg",
"RBQ9Y/RBQ9Y_57.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video."
],
"combined_1_images": [
"79-0.jpg"
]
} | At the end of the video. |
80 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}During which part of the video does the action 'the person takes a sandwich from the refrigerator' occur?",
"images_path": [
"J2XFQ/J2XFQ_0.jpeg",
"J2XFQ/J2XFQ_1.jpeg",
"J2XFQ/J2XFQ_2.jpeg",
"J2XFQ/J2XFQ_3.jpeg",
"J2XFQ/J2XFQ_4.jpeg",
"J2XFQ/J2XFQ_5.jpeg",
"J2XFQ/J2XFQ_6.jpeg",
"J2XFQ/J2XFQ_7.jpeg",
"J2XFQ/J2XFQ_8.jpeg",
"J2XFQ/J2XFQ_9.jpeg",
"J2XFQ/J2XFQ_10.jpeg",
"J2XFQ/J2XFQ_11.jpeg",
"J2XFQ/J2XFQ_12.jpeg",
"J2XFQ/J2XFQ_13.jpeg",
"J2XFQ/J2XFQ_14.jpeg",
"J2XFQ/J2XFQ_15.jpeg",
"J2XFQ/J2XFQ_16.jpeg",
"J2XFQ/J2XFQ_17.jpeg",
"J2XFQ/J2XFQ_18.jpeg",
"J2XFQ/J2XFQ_19.jpeg",
"J2XFQ/J2XFQ_20.jpeg",
"J2XFQ/J2XFQ_21.jpeg",
"J2XFQ/J2XFQ_22.jpeg",
"J2XFQ/J2XFQ_23.jpeg",
"J2XFQ/J2XFQ_24.jpeg",
"J2XFQ/J2XFQ_25.jpeg",
"J2XFQ/J2XFQ_26.jpeg",
"J2XFQ/J2XFQ_27.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"80-0.jpg"
]
} | At the end of the video. |
81 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}{image#38}{image#39}{image#40}{image#41}{image#42}{image#43}{image#44}{image#45}At what moment in the video does the action 'the person sits on a couch' occur?",
"images_path": [
"OY3LS/OY3LS_0.jpeg",
"OY3LS/OY3LS_1.jpeg",
"OY3LS/OY3LS_2.jpeg",
"OY3LS/OY3LS_3.jpeg",
"OY3LS/OY3LS_4.jpeg",
"OY3LS/OY3LS_5.jpeg",
"OY3LS/OY3LS_6.jpeg",
"OY3LS/OY3LS_7.jpeg",
"OY3LS/OY3LS_8.jpeg",
"OY3LS/OY3LS_9.jpeg",
"OY3LS/OY3LS_10.jpeg",
"OY3LS/OY3LS_11.jpeg",
"OY3LS/OY3LS_12.jpeg",
"OY3LS/OY3LS_13.jpeg",
"OY3LS/OY3LS_14.jpeg",
"OY3LS/OY3LS_15.jpeg",
"OY3LS/OY3LS_16.jpeg",
"OY3LS/OY3LS_17.jpeg",
"OY3LS/OY3LS_18.jpeg",
"OY3LS/OY3LS_19.jpeg",
"OY3LS/OY3LS_20.jpeg",
"OY3LS/OY3LS_21.jpeg",
"OY3LS/OY3LS_22.jpeg",
"OY3LS/OY3LS_23.jpeg",
"OY3LS/OY3LS_24.jpeg",
"OY3LS/OY3LS_25.jpeg",
"OY3LS/OY3LS_26.jpeg",
"OY3LS/OY3LS_27.jpeg",
"OY3LS/OY3LS_28.jpeg",
"OY3LS/OY3LS_29.jpeg",
"OY3LS/OY3LS_30.jpeg",
"OY3LS/OY3LS_31.jpeg",
"OY3LS/OY3LS_32.jpeg",
"OY3LS/OY3LS_33.jpeg",
"OY3LS/OY3LS_34.jpeg",
"OY3LS/OY3LS_35.jpeg",
"OY3LS/OY3LS_36.jpeg",
"OY3LS/OY3LS_37.jpeg",
"OY3LS/OY3LS_38.jpeg",
"OY3LS/OY3LS_39.jpeg",
"OY3LS/OY3LS_40.jpeg",
"OY3LS/OY3LS_41.jpeg",
"OY3LS/OY3LS_42.jpeg",
"OY3LS/OY3LS_43.jpeg",
"OY3LS/OY3LS_44.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"81-0.jpg"
]
} | At the end of the video. |
82 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}When in the video sequence do we observe the action 'person drinking water from a glass'?",
"images_path": [
"5OV3M/5OV3M_0.jpeg",
"5OV3M/5OV3M_1.jpeg",
"5OV3M/5OV3M_2.jpeg",
"5OV3M/5OV3M_3.jpeg",
"5OV3M/5OV3M_4.jpeg",
"5OV3M/5OV3M_5.jpeg",
"5OV3M/5OV3M_6.jpeg",
"5OV3M/5OV3M_7.jpeg",
"5OV3M/5OV3M_8.jpeg",
"5OV3M/5OV3M_9.jpeg",
"5OV3M/5OV3M_10.jpeg",
"5OV3M/5OV3M_11.jpeg",
"5OV3M/5OV3M_12.jpeg",
"5OV3M/5OV3M_13.jpeg",
"5OV3M/5OV3M_14.jpeg",
"5OV3M/5OV3M_15.jpeg",
"5OV3M/5OV3M_16.jpeg",
"5OV3M/5OV3M_17.jpeg",
"5OV3M/5OV3M_18.jpeg",
"5OV3M/5OV3M_19.jpeg",
"5OV3M/5OV3M_20.jpeg",
"5OV3M/5OV3M_21.jpeg",
"5OV3M/5OV3M_22.jpeg",
"5OV3M/5OV3M_23.jpeg",
"5OV3M/5OV3M_24.jpeg",
"5OV3M/5OV3M_25.jpeg",
"5OV3M/5OV3M_26.jpeg",
"5OV3M/5OV3M_27.jpeg",
"5OV3M/5OV3M_28.jpeg",
"5OV3M/5OV3M_29.jpeg",
"5OV3M/5OV3M_30.jpeg",
"5OV3M/5OV3M_31.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"82-0.jpg"
]
} | At the end of the video. |
83 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}When in the video sequence do we observe the action 'person put the plate on a table'?",
"images_path": [
"21WN7/21WN7_0.jpeg",
"21WN7/21WN7_1.jpeg",
"21WN7/21WN7_2.jpeg",
"21WN7/21WN7_3.jpeg",
"21WN7/21WN7_4.jpeg",
"21WN7/21WN7_5.jpeg",
"21WN7/21WN7_6.jpeg",
"21WN7/21WN7_7.jpeg",
"21WN7/21WN7_8.jpeg",
"21WN7/21WN7_9.jpeg",
"21WN7/21WN7_10.jpeg",
"21WN7/21WN7_11.jpeg",
"21WN7/21WN7_12.jpeg",
"21WN7/21WN7_13.jpeg",
"21WN7/21WN7_14.jpeg",
"21WN7/21WN7_15.jpeg",
"21WN7/21WN7_16.jpeg",
"21WN7/21WN7_17.jpeg",
"21WN7/21WN7_18.jpeg",
"21WN7/21WN7_19.jpeg",
"21WN7/21WN7_20.jpeg",
"21WN7/21WN7_21.jpeg",
"21WN7/21WN7_22.jpeg",
"21WN7/21WN7_23.jpeg",
"21WN7/21WN7_24.jpeg",
"21WN7/21WN7_25.jpeg",
"21WN7/21WN7_26.jpeg",
"21WN7/21WN7_27.jpeg",
"21WN7/21WN7_28.jpeg",
"21WN7/21WN7_29.jpeg",
"21WN7/21WN7_30.jpeg",
"21WN7/21WN7_31.jpeg",
"21WN7/21WN7_32.jpeg",
"21WN7/21WN7_33.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"83-0.jpg"
]
} | At the end of the video. |
84 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}During which part of the video does the action 'person closes the cabinet' occur?",
"images_path": [
"HS14N/HS14N_0.jpeg",
"HS14N/HS14N_1.jpeg",
"HS14N/HS14N_2.jpeg",
"HS14N/HS14N_3.jpeg",
"HS14N/HS14N_4.jpeg",
"HS14N/HS14N_5.jpeg",
"HS14N/HS14N_6.jpeg",
"HS14N/HS14N_7.jpeg",
"HS14N/HS14N_8.jpeg",
"HS14N/HS14N_9.jpeg",
"HS14N/HS14N_10.jpeg",
"HS14N/HS14N_11.jpeg",
"HS14N/HS14N_12.jpeg",
"HS14N/HS14N_13.jpeg",
"HS14N/HS14N_14.jpeg",
"HS14N/HS14N_15.jpeg",
"HS14N/HS14N_16.jpeg",
"HS14N/HS14N_17.jpeg",
"HS14N/HS14N_18.jpeg",
"HS14N/HS14N_19.jpeg",
"HS14N/HS14N_20.jpeg",
"HS14N/HS14N_21.jpeg",
"HS14N/HS14N_22.jpeg",
"HS14N/HS14N_23.jpeg",
"HS14N/HS14N_24.jpeg",
"HS14N/HS14N_25.jpeg",
"HS14N/HS14N_26.jpeg",
"HS14N/HS14N_27.jpeg",
"HS14N/HS14N_28.jpeg",
"HS14N/HS14N_29.jpeg",
"HS14N/HS14N_30.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"84-0.jpg"
]
} | At the end of the video. |
85 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}In the given video, when does the action 'a person opens a box' take place?",
"images_path": [
"AHBE8/AHBE8_0.jpeg",
"AHBE8/AHBE8_1.jpeg",
"AHBE8/AHBE8_2.jpeg",
"AHBE8/AHBE8_3.jpeg",
"AHBE8/AHBE8_4.jpeg",
"AHBE8/AHBE8_5.jpeg",
"AHBE8/AHBE8_6.jpeg",
"AHBE8/AHBE8_7.jpeg",
"AHBE8/AHBE8_8.jpeg",
"AHBE8/AHBE8_9.jpeg",
"AHBE8/AHBE8_10.jpeg",
"AHBE8/AHBE8_11.jpeg",
"AHBE8/AHBE8_12.jpeg",
"AHBE8/AHBE8_13.jpeg",
"AHBE8/AHBE8_14.jpeg",
"AHBE8/AHBE8_15.jpeg",
"AHBE8/AHBE8_16.jpeg",
"AHBE8/AHBE8_17.jpeg",
"AHBE8/AHBE8_18.jpeg",
"AHBE8/AHBE8_19.jpeg",
"AHBE8/AHBE8_20.jpeg",
"AHBE8/AHBE8_21.jpeg",
"AHBE8/AHBE8_22.jpeg",
"AHBE8/AHBE8_23.jpeg",
"AHBE8/AHBE8_24.jpeg",
"AHBE8/AHBE8_25.jpeg",
"AHBE8/AHBE8_26.jpeg",
"AHBE8/AHBE8_27.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"85-0.jpg"
]
} | At the end of the video. |
86 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}At what moment in the video does the action 'the person throws their clothes onto the shelf' occur?",
"images_path": [
"OK45U/OK45U_0.jpeg",
"OK45U/OK45U_1.jpeg",
"OK45U/OK45U_2.jpeg",
"OK45U/OK45U_3.jpeg",
"OK45U/OK45U_4.jpeg",
"OK45U/OK45U_5.jpeg",
"OK45U/OK45U_6.jpeg",
"OK45U/OK45U_7.jpeg",
"OK45U/OK45U_8.jpeg",
"OK45U/OK45U_9.jpeg",
"OK45U/OK45U_10.jpeg",
"OK45U/OK45U_11.jpeg",
"OK45U/OK45U_12.jpeg",
"OK45U/OK45U_13.jpeg",
"OK45U/OK45U_14.jpeg",
"OK45U/OK45U_15.jpeg",
"OK45U/OK45U_16.jpeg",
"OK45U/OK45U_17.jpeg",
"OK45U/OK45U_18.jpeg",
"OK45U/OK45U_19.jpeg",
"OK45U/OK45U_20.jpeg",
"OK45U/OK45U_21.jpeg",
"OK45U/OK45U_22.jpeg",
"OK45U/OK45U_23.jpeg",
"OK45U/OK45U_24.jpeg",
"OK45U/OK45U_25.jpeg",
"OK45U/OK45U_26.jpeg",
"OK45U/OK45U_27.jpeg",
"OK45U/OK45U_28.jpeg",
"OK45U/OK45U_29.jpeg",
"OK45U/OK45U_30.jpeg",
"OK45U/OK45U_31.jpeg",
"OK45U/OK45U_32.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"86-0.jpg"
]
} | At the end of the video. |
87 | Based on the given images, identify when does the action in the question happen You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}During which part of the video does the action 'person puts down the box' occur?",
"images_path": [
"S5KQ1/S5KQ1_0.jpeg",
"S5KQ1/S5KQ1_1.jpeg",
"S5KQ1/S5KQ1_2.jpeg",
"S5KQ1/S5KQ1_3.jpeg",
"S5KQ1/S5KQ1_4.jpeg",
"S5KQ1/S5KQ1_5.jpeg",
"S5KQ1/S5KQ1_6.jpeg",
"S5KQ1/S5KQ1_7.jpeg",
"S5KQ1/S5KQ1_8.jpeg",
"S5KQ1/S5KQ1_9.jpeg",
"S5KQ1/S5KQ1_10.jpeg",
"S5KQ1/S5KQ1_11.jpeg",
"S5KQ1/S5KQ1_12.jpeg",
"S5KQ1/S5KQ1_13.jpeg",
"S5KQ1/S5KQ1_14.jpeg",
"S5KQ1/S5KQ1_15.jpeg",
"S5KQ1/S5KQ1_16.jpeg",
"S5KQ1/S5KQ1_17.jpeg",
"S5KQ1/S5KQ1_18.jpeg",
"S5KQ1/S5KQ1_19.jpeg",
"S5KQ1/S5KQ1_20.jpeg",
"S5KQ1/S5KQ1_21.jpeg",
"S5KQ1/S5KQ1_22.jpeg",
"S5KQ1/S5KQ1_23.jpeg",
"S5KQ1/S5KQ1_24.jpeg",
"S5KQ1/S5KQ1_25.jpeg",
"S5KQ1/S5KQ1_26.jpeg",
"S5KQ1/S5KQ1_27.jpeg",
"S5KQ1/S5KQ1_28.jpeg",
"S5KQ1/S5KQ1_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"87-0.jpg"
]
} | At the end of the video. |
88 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}During which part of the video does the action 'person runs out' occur?",
"images_path": [
"9VF2C/9VF2C_0.jpeg",
"9VF2C/9VF2C_1.jpeg",
"9VF2C/9VF2C_2.jpeg",
"9VF2C/9VF2C_3.jpeg",
"9VF2C/9VF2C_4.jpeg",
"9VF2C/9VF2C_5.jpeg",
"9VF2C/9VF2C_6.jpeg",
"9VF2C/9VF2C_7.jpeg",
"9VF2C/9VF2C_8.jpeg",
"9VF2C/9VF2C_9.jpeg",
"9VF2C/9VF2C_10.jpeg",
"9VF2C/9VF2C_11.jpeg",
"9VF2C/9VF2C_12.jpeg",
"9VF2C/9VF2C_13.jpeg",
"9VF2C/9VF2C_14.jpeg",
"9VF2C/9VF2C_15.jpeg",
"9VF2C/9VF2C_16.jpeg",
"9VF2C/9VF2C_17.jpeg",
"9VF2C/9VF2C_18.jpeg",
"9VF2C/9VF2C_19.jpeg",
"9VF2C/9VF2C_20.jpeg",
"9VF2C/9VF2C_21.jpeg",
"9VF2C/9VF2C_22.jpeg"
],
"choice_list": [
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video.",
"At the beginning of the video."
],
"combined_1_images": [
"88-0.jpg"
]
} | At the end of the video. |
89 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}At what moment in the video does the action 'person laughs at a tv' occur?",
"images_path": [
"SW5TC/SW5TC_0.jpeg",
"SW5TC/SW5TC_1.jpeg",
"SW5TC/SW5TC_2.jpeg",
"SW5TC/SW5TC_3.jpeg",
"SW5TC/SW5TC_4.jpeg",
"SW5TC/SW5TC_5.jpeg",
"SW5TC/SW5TC_6.jpeg",
"SW5TC/SW5TC_7.jpeg",
"SW5TC/SW5TC_8.jpeg",
"SW5TC/SW5TC_9.jpeg",
"SW5TC/SW5TC_10.jpeg",
"SW5TC/SW5TC_11.jpeg",
"SW5TC/SW5TC_12.jpeg",
"SW5TC/SW5TC_13.jpeg",
"SW5TC/SW5TC_14.jpeg",
"SW5TC/SW5TC_15.jpeg",
"SW5TC/SW5TC_16.jpeg",
"SW5TC/SW5TC_17.jpeg",
"SW5TC/SW5TC_18.jpeg",
"SW5TC/SW5TC_19.jpeg",
"SW5TC/SW5TC_20.jpeg",
"SW5TC/SW5TC_21.jpeg",
"SW5TC/SW5TC_22.jpeg",
"SW5TC/SW5TC_23.jpeg",
"SW5TC/SW5TC_24.jpeg",
"SW5TC/SW5TC_25.jpeg",
"SW5TC/SW5TC_26.jpeg",
"SW5TC/SW5TC_27.jpeg",
"SW5TC/SW5TC_28.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video."
],
"combined_1_images": [
"89-0.jpg"
]
} | At the end of the video. |
90 | Inspect the presented illustrations and conclude when the action in the inquiry occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}At what moment in the video does the action 'person takes a glass out from the refrigerator' occur?",
"images_path": [
"ZFT06/ZFT06_0.jpeg",
"ZFT06/ZFT06_1.jpeg",
"ZFT06/ZFT06_2.jpeg",
"ZFT06/ZFT06_3.jpeg",
"ZFT06/ZFT06_4.jpeg",
"ZFT06/ZFT06_5.jpeg",
"ZFT06/ZFT06_6.jpeg",
"ZFT06/ZFT06_7.jpeg",
"ZFT06/ZFT06_8.jpeg",
"ZFT06/ZFT06_9.jpeg",
"ZFT06/ZFT06_10.jpeg",
"ZFT06/ZFT06_11.jpeg",
"ZFT06/ZFT06_12.jpeg",
"ZFT06/ZFT06_13.jpeg",
"ZFT06/ZFT06_14.jpeg",
"ZFT06/ZFT06_15.jpeg",
"ZFT06/ZFT06_16.jpeg",
"ZFT06/ZFT06_17.jpeg",
"ZFT06/ZFT06_18.jpeg",
"ZFT06/ZFT06_19.jpeg",
"ZFT06/ZFT06_20.jpeg",
"ZFT06/ZFT06_21.jpeg",
"ZFT06/ZFT06_22.jpeg",
"ZFT06/ZFT06_23.jpeg",
"ZFT06/ZFT06_24.jpeg",
"ZFT06/ZFT06_25.jpeg",
"ZFT06/ZFT06_26.jpeg",
"ZFT06/ZFT06_27.jpeg",
"ZFT06/ZFT06_28.jpeg",
"ZFT06/ZFT06_29.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"Throughout the entire video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"90-0.jpg"
]
} | At the end of the video. |
91 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}When in the video sequence do we observe the action 'person closes the box'?",
"images_path": [
"NAZ52/NAZ52_0.jpeg",
"NAZ52/NAZ52_1.jpeg",
"NAZ52/NAZ52_2.jpeg",
"NAZ52/NAZ52_3.jpeg",
"NAZ52/NAZ52_4.jpeg",
"NAZ52/NAZ52_5.jpeg",
"NAZ52/NAZ52_6.jpeg",
"NAZ52/NAZ52_7.jpeg",
"NAZ52/NAZ52_8.jpeg",
"NAZ52/NAZ52_9.jpeg",
"NAZ52/NAZ52_10.jpeg",
"NAZ52/NAZ52_11.jpeg",
"NAZ52/NAZ52_12.jpeg",
"NAZ52/NAZ52_13.jpeg",
"NAZ52/NAZ52_14.jpeg",
"NAZ52/NAZ52_15.jpeg",
"NAZ52/NAZ52_16.jpeg",
"NAZ52/NAZ52_17.jpeg",
"NAZ52/NAZ52_18.jpeg",
"NAZ52/NAZ52_19.jpeg",
"NAZ52/NAZ52_20.jpeg",
"NAZ52/NAZ52_21.jpeg",
"NAZ52/NAZ52_22.jpeg",
"NAZ52/NAZ52_23.jpeg",
"NAZ52/NAZ52_24.jpeg",
"NAZ52/NAZ52_25.jpeg",
"NAZ52/NAZ52_26.jpeg",
"NAZ52/NAZ52_27.jpeg",
"NAZ52/NAZ52_28.jpeg",
"NAZ52/NAZ52_29.jpeg",
"NAZ52/NAZ52_30.jpeg",
"NAZ52/NAZ52_31.jpeg",
"NAZ52/NAZ52_32.jpeg",
"NAZ52/NAZ52_33.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"In the middle of the video."
],
"combined_1_images": [
"91-0.jpg"
]
} | At the end of the video. |
92 | Given the visuals, discern the timing of the event in the query. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}At what moment in the video does the action 'person begins laughing' occur?",
"images_path": [
"VXZBA/VXZBA_0.jpeg",
"VXZBA/VXZBA_1.jpeg",
"VXZBA/VXZBA_2.jpeg",
"VXZBA/VXZBA_3.jpeg",
"VXZBA/VXZBA_4.jpeg",
"VXZBA/VXZBA_5.jpeg",
"VXZBA/VXZBA_6.jpeg",
"VXZBA/VXZBA_7.jpeg",
"VXZBA/VXZBA_8.jpeg",
"VXZBA/VXZBA_9.jpeg",
"VXZBA/VXZBA_10.jpeg",
"VXZBA/VXZBA_11.jpeg",
"VXZBA/VXZBA_12.jpeg",
"VXZBA/VXZBA_13.jpeg",
"VXZBA/VXZBA_14.jpeg",
"VXZBA/VXZBA_15.jpeg",
"VXZBA/VXZBA_16.jpeg",
"VXZBA/VXZBA_17.jpeg",
"VXZBA/VXZBA_18.jpeg",
"VXZBA/VXZBA_19.jpeg",
"VXZBA/VXZBA_20.jpeg",
"VXZBA/VXZBA_21.jpeg",
"VXZBA/VXZBA_22.jpeg",
"VXZBA/VXZBA_23.jpeg",
"VXZBA/VXZBA_24.jpeg",
"VXZBA/VXZBA_25.jpeg",
"VXZBA/VXZBA_26.jpeg",
"VXZBA/VXZBA_27.jpeg",
"VXZBA/VXZBA_28.jpeg",
"VXZBA/VXZBA_29.jpeg",
"VXZBA/VXZBA_30.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video."
],
"combined_1_images": [
"92-0.jpg"
]
} | At the end of the video. |
93 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}At what moment in the video does the action 'person smiling at a camera' occur?",
"images_path": [
"W5ZY8/W5ZY8_0.jpeg",
"W5ZY8/W5ZY8_1.jpeg",
"W5ZY8/W5ZY8_2.jpeg",
"W5ZY8/W5ZY8_3.jpeg",
"W5ZY8/W5ZY8_4.jpeg",
"W5ZY8/W5ZY8_5.jpeg",
"W5ZY8/W5ZY8_6.jpeg",
"W5ZY8/W5ZY8_7.jpeg",
"W5ZY8/W5ZY8_8.jpeg",
"W5ZY8/W5ZY8_9.jpeg",
"W5ZY8/W5ZY8_10.jpeg",
"W5ZY8/W5ZY8_11.jpeg",
"W5ZY8/W5ZY8_12.jpeg",
"W5ZY8/W5ZY8_13.jpeg",
"W5ZY8/W5ZY8_14.jpeg",
"W5ZY8/W5ZY8_15.jpeg",
"W5ZY8/W5ZY8_16.jpeg",
"W5ZY8/W5ZY8_17.jpeg",
"W5ZY8/W5ZY8_18.jpeg",
"W5ZY8/W5ZY8_19.jpeg",
"W5ZY8/W5ZY8_20.jpeg",
"W5ZY8/W5ZY8_21.jpeg",
"W5ZY8/W5ZY8_22.jpeg",
"W5ZY8/W5ZY8_23.jpeg",
"W5ZY8/W5ZY8_24.jpeg",
"W5ZY8/W5ZY8_25.jpeg",
"W5ZY8/W5ZY8_26.jpeg",
"W5ZY8/W5ZY8_27.jpeg",
"W5ZY8/W5ZY8_28.jpeg",
"W5ZY8/W5ZY8_29.jpeg",
"W5ZY8/W5ZY8_30.jpeg",
"W5ZY8/W5ZY8_31.jpeg",
"W5ZY8/W5ZY8_32.jpeg"
],
"choice_list": [
"In the middle of the video.",
"At the end of the video.",
"At the beginning of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"93-0.jpg"
]
} | At the end of the video. |
94 | From the images presented, ascertain the moment the action in the query occurs. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}When in the video sequence do we observe the action 'the person closes the refrigerator'?",
"images_path": [
"WM6RQ/WM6RQ_0.jpeg",
"WM6RQ/WM6RQ_1.jpeg",
"WM6RQ/WM6RQ_2.jpeg",
"WM6RQ/WM6RQ_3.jpeg",
"WM6RQ/WM6RQ_4.jpeg",
"WM6RQ/WM6RQ_5.jpeg",
"WM6RQ/WM6RQ_6.jpeg",
"WM6RQ/WM6RQ_7.jpeg",
"WM6RQ/WM6RQ_8.jpeg",
"WM6RQ/WM6RQ_9.jpeg",
"WM6RQ/WM6RQ_10.jpeg",
"WM6RQ/WM6RQ_11.jpeg",
"WM6RQ/WM6RQ_12.jpeg",
"WM6RQ/WM6RQ_13.jpeg",
"WM6RQ/WM6RQ_14.jpeg",
"WM6RQ/WM6RQ_15.jpeg",
"WM6RQ/WM6RQ_16.jpeg",
"WM6RQ/WM6RQ_17.jpeg",
"WM6RQ/WM6RQ_18.jpeg",
"WM6RQ/WM6RQ_19.jpeg",
"WM6RQ/WM6RQ_20.jpeg",
"WM6RQ/WM6RQ_21.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video.",
"At the end of the video."
],
"combined_1_images": [
"94-0.jpg"
]
} | At the end of the video. |
95 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}At what moment in the video does the action 'person begins to eat it' occur?",
"images_path": [
"1GII3/1GII3_0.jpeg",
"1GII3/1GII3_1.jpeg",
"1GII3/1GII3_2.jpeg",
"1GII3/1GII3_3.jpeg",
"1GII3/1GII3_4.jpeg",
"1GII3/1GII3_5.jpeg",
"1GII3/1GII3_6.jpeg",
"1GII3/1GII3_7.jpeg",
"1GII3/1GII3_8.jpeg",
"1GII3/1GII3_9.jpeg",
"1GII3/1GII3_10.jpeg",
"1GII3/1GII3_11.jpeg",
"1GII3/1GII3_12.jpeg",
"1GII3/1GII3_13.jpeg",
"1GII3/1GII3_14.jpeg",
"1GII3/1GII3_15.jpeg",
"1GII3/1GII3_16.jpeg",
"1GII3/1GII3_17.jpeg",
"1GII3/1GII3_18.jpeg",
"1GII3/1GII3_19.jpeg",
"1GII3/1GII3_20.jpeg",
"1GII3/1GII3_21.jpeg",
"1GII3/1GII3_22.jpeg",
"1GII3/1GII3_23.jpeg",
"1GII3/1GII3_24.jpeg",
"1GII3/1GII3_25.jpeg",
"1GII3/1GII3_26.jpeg",
"1GII3/1GII3_27.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the end of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"95-0.jpg"
]
} | At the end of the video. |
96 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}{image#31}{image#32}{image#33}{image#34}{image#35}{image#36}{image#37}During which part of the video does the action 'person opens the dryer door' occur?",
"images_path": [
"I52A6/I52A6_0.jpeg",
"I52A6/I52A6_1.jpeg",
"I52A6/I52A6_2.jpeg",
"I52A6/I52A6_3.jpeg",
"I52A6/I52A6_4.jpeg",
"I52A6/I52A6_5.jpeg",
"I52A6/I52A6_6.jpeg",
"I52A6/I52A6_7.jpeg",
"I52A6/I52A6_8.jpeg",
"I52A6/I52A6_9.jpeg",
"I52A6/I52A6_10.jpeg",
"I52A6/I52A6_11.jpeg",
"I52A6/I52A6_12.jpeg",
"I52A6/I52A6_13.jpeg",
"I52A6/I52A6_14.jpeg",
"I52A6/I52A6_15.jpeg",
"I52A6/I52A6_16.jpeg",
"I52A6/I52A6_17.jpeg",
"I52A6/I52A6_18.jpeg",
"I52A6/I52A6_19.jpeg",
"I52A6/I52A6_20.jpeg",
"I52A6/I52A6_21.jpeg",
"I52A6/I52A6_22.jpeg",
"I52A6/I52A6_23.jpeg",
"I52A6/I52A6_24.jpeg",
"I52A6/I52A6_25.jpeg",
"I52A6/I52A6_26.jpeg",
"I52A6/I52A6_27.jpeg",
"I52A6/I52A6_28.jpeg",
"I52A6/I52A6_29.jpeg",
"I52A6/I52A6_30.jpeg",
"I52A6/I52A6_31.jpeg",
"I52A6/I52A6_32.jpeg",
"I52A6/I52A6_33.jpeg",
"I52A6/I52A6_34.jpeg",
"I52A6/I52A6_35.jpeg",
"I52A6/I52A6_36.jpeg"
],
"choice_list": [
"At the end of the video.",
"Throughout the entire video.",
"In the middle of the video.",
"At the beginning of the video."
],
"combined_1_images": [
"96-0.jpg"
]
} | At the end of the video. |
97 | Observe the given images and deduce when the action in the query takes place. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}At what moment in the video does the action 'person throws the book down' occur?",
"images_path": [
"ZPRJH/ZPRJH_0.jpeg",
"ZPRJH/ZPRJH_1.jpeg",
"ZPRJH/ZPRJH_2.jpeg",
"ZPRJH/ZPRJH_3.jpeg",
"ZPRJH/ZPRJH_4.jpeg",
"ZPRJH/ZPRJH_5.jpeg",
"ZPRJH/ZPRJH_6.jpeg",
"ZPRJH/ZPRJH_7.jpeg",
"ZPRJH/ZPRJH_8.jpeg",
"ZPRJH/ZPRJH_9.jpeg",
"ZPRJH/ZPRJH_10.jpeg",
"ZPRJH/ZPRJH_11.jpeg",
"ZPRJH/ZPRJH_12.jpeg",
"ZPRJH/ZPRJH_13.jpeg",
"ZPRJH/ZPRJH_14.jpeg",
"ZPRJH/ZPRJH_15.jpeg",
"ZPRJH/ZPRJH_16.jpeg",
"ZPRJH/ZPRJH_17.jpeg",
"ZPRJH/ZPRJH_18.jpeg",
"ZPRJH/ZPRJH_19.jpeg",
"ZPRJH/ZPRJH_20.jpeg",
"ZPRJH/ZPRJH_21.jpeg",
"ZPRJH/ZPRJH_22.jpeg",
"ZPRJH/ZPRJH_23.jpeg",
"ZPRJH/ZPRJH_24.jpeg",
"ZPRJH/ZPRJH_25.jpeg",
"ZPRJH/ZPRJH_26.jpeg",
"ZPRJH/ZPRJH_27.jpeg",
"ZPRJH/ZPRJH_28.jpeg",
"ZPRJH/ZPRJH_29.jpeg"
],
"choice_list": [
"In the middle of the video.",
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video."
],
"combined_1_images": [
"97-0.jpg"
]
} | At the end of the video. |
98 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}{image#24}{image#25}{image#26}{image#27}{image#28}{image#29}{image#30}Can you identify when the action 'the person takes a paper towel from the shelf' happens in the video?",
"images_path": [
"YT2C3/YT2C3_0.jpeg",
"YT2C3/YT2C3_1.jpeg",
"YT2C3/YT2C3_2.jpeg",
"YT2C3/YT2C3_3.jpeg",
"YT2C3/YT2C3_4.jpeg",
"YT2C3/YT2C3_5.jpeg",
"YT2C3/YT2C3_6.jpeg",
"YT2C3/YT2C3_7.jpeg",
"YT2C3/YT2C3_8.jpeg",
"YT2C3/YT2C3_9.jpeg",
"YT2C3/YT2C3_10.jpeg",
"YT2C3/YT2C3_11.jpeg",
"YT2C3/YT2C3_12.jpeg",
"YT2C3/YT2C3_13.jpeg",
"YT2C3/YT2C3_14.jpeg",
"YT2C3/YT2C3_15.jpeg",
"YT2C3/YT2C3_16.jpeg",
"YT2C3/YT2C3_17.jpeg",
"YT2C3/YT2C3_18.jpeg",
"YT2C3/YT2C3_19.jpeg",
"YT2C3/YT2C3_20.jpeg",
"YT2C3/YT2C3_21.jpeg",
"YT2C3/YT2C3_22.jpeg",
"YT2C3/YT2C3_23.jpeg",
"YT2C3/YT2C3_24.jpeg",
"YT2C3/YT2C3_25.jpeg",
"YT2C3/YT2C3_26.jpeg",
"YT2C3/YT2C3_27.jpeg",
"YT2C3/YT2C3_28.jpeg",
"YT2C3/YT2C3_29.jpeg"
],
"choice_list": [
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video.",
"Throughout the entire video."
],
"combined_1_images": [
"98-0.jpg"
]
} | At the end of the video. |
99 | Review the supplied visuals and ascertain the timing of the action in the inquiry. You must choose your answer from the Choice List. | {
"context": "{image#1}{image#2}{image#3}{image#4}{image#5}{image#6}{image#7}{image#8}{image#9}{image#10}{image#11}{image#12}{image#13}{image#14}{image#15}{image#16}{image#17}{image#18}{image#19}{image#20}{image#21}{image#22}{image#23}In the given video, when does the action 'person closed the book' take place?",
"images_path": [
"JTXAM/JTXAM_0.jpeg",
"JTXAM/JTXAM_1.jpeg",
"JTXAM/JTXAM_2.jpeg",
"JTXAM/JTXAM_3.jpeg",
"JTXAM/JTXAM_4.jpeg",
"JTXAM/JTXAM_5.jpeg",
"JTXAM/JTXAM_6.jpeg",
"JTXAM/JTXAM_7.jpeg",
"JTXAM/JTXAM_8.jpeg",
"JTXAM/JTXAM_9.jpeg",
"JTXAM/JTXAM_10.jpeg",
"JTXAM/JTXAM_11.jpeg",
"JTXAM/JTXAM_12.jpeg",
"JTXAM/JTXAM_13.jpeg",
"JTXAM/JTXAM_14.jpeg",
"JTXAM/JTXAM_15.jpeg",
"JTXAM/JTXAM_16.jpeg",
"JTXAM/JTXAM_17.jpeg",
"JTXAM/JTXAM_18.jpeg",
"JTXAM/JTXAM_19.jpeg",
"JTXAM/JTXAM_20.jpeg",
"JTXAM/JTXAM_21.jpeg",
"JTXAM/JTXAM_22.jpeg"
],
"choice_list": [
"Throughout the entire video.",
"At the beginning of the video.",
"At the end of the video.",
"In the middle of the video."
],
"combined_1_images": [
"99-0.jpg"
]
} | At the end of the video. |
MileBench
Introduction
We introduce MileBench, a pioneering benchmark designed to test the MultImodal Long-contExt capabilities of MLLMs. This benchmark comprises not only multimodal long contexts, but also multiple tasks requiring both comprehension and generation. We establish two distinct evaluation sets, diagnostic and realistic, to systematically assess MLLMs’ long-context adaptation capacity and their ability to completetasks in long-context scenarios

To construct our evaluation sets, we gather 6,440 multimodal long-context samples from 21 pre-existing or self-constructed datasets, with an average of 15.2 images and 422.3 words each, as depicted in the figure, and we categorize them into their respective subsets.


How to use?
Please download MileBench.zip and refer to Code for MileBench.
Links
- Homepage: MileBench Homepage
- Repository: MileBench GitHub
- Paper: Arxiv
- Point of Contact: Dingjie Song
Citation
If you find this project useful in your research, please consider cite:
- Downloads last month
- 1