The Grounding DINO model works only with the version of supervision==0.4.0 (as mentioned by supervision) However, using this version, I cant use the supervision's mask annotator function.