Publication: Bootstrapped Self-Supervised Training with Monocular Video for Semantic Segmentation and Depth Estimation.