Recognition Of Complex Events In Open-Source Web-Scale Videos: Features, Intermediate Representations And Their Temporal Interactions