AI applications are moving beyond text generation to multimodal systems that can perceive, search, and reason across images, documents, video, and...