Publication: Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection.