Suggest an alternative to

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Why do you think that https://github.com/matterport/Mask_RCNN is a good alternative to multimodal-maestro