On Monday, researchers from Microsoft introduced Kosmos-1, a multimodal model that can reportedly analyze images for content, solve visual puzzles, perform visual text recognition, pass visual IQ ...
Alibaba expands its AI live speech translation model from 18 to 60 languages, adding real-time voice cloning and reducing ...