
A new type of multimodal large language model (MLLM) from Apple that excels in both image understanding and language processing, particularly demonstrating significant advantages in understanding spatial references.
Ferret awards
1
Launches
1
Awards
Ferret
January 2nd, 2024