Maybe the issue is that there is nothing to concretely establish a sense of scale, since none of the figures have human features. Another issue is probably the frame rate of the video this still was taken from, since the cloaks have no motion blur and thus look motionless and staged.
More likely it is a production photo and not a still frame.