V, a multimodal model that has introduced native visual function calling to bypass text conversion in agentic workflows.
Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models ...
After seizing the summer with a blitz of powerful, freely available new open-source language- and coding-focused AI models that matched or, in some cases, bested closed ...
In this paper, we introduce the world’s first 8K 120-Hz real-time video encoder and decoder that complies with ARIB STD-B32 1). We evaluated the coding efficiency and demonstrated that 8K 120-Hz ...