[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
Why do you think that https://github.com/The-ML-Hero/Robo-Semantic-Segmentation is a good alternative to MiVOS?