NVIDIA Learning and Perception Research6/1/2025Omni-RGPT: Unifying Image and Video Region-level Understanding via Token MarksPublication IEEE Conference on Computer Vision and Pattern Recognition (CVPR)Read at NVIDIA Learning and Perception Research