0,0 should be the center of 0:0 (None)
1,0 should be +00:00:02:00 when video ends
2,0 should be +00:00:02:00 with white bands on top/bottom when video ends
3,0 should be the center part of 0:00:02:0 when video ends
0,1 should be empty (no video image, not even the rectangle border)
1,1 should be empty (no video image, not even the rectangle border)
2,1 should be the center of 0:0 (like 0,0)
3,1 should be the center part of 0:00:02:0 when video ends (like 3,0)