Learning disentangled representations in deep visual analysis