Multimodal Model with Image Text

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

Apple AI research shows how MLLMs understand, generate, search for images

Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, ...

3don MSN

Zhipu AI breaks US chip reliance with first major model trained on Huawei stack

Zhipu claims GLM-Image achieved industry-leading scores among open-source models for text rendering and Chinese character ...

Network World

Chinese AI firm trains state-of-the-art model entirely on Huawei chips

Chinese company Zhipu AI has trained image generation model entirely on Huawei processors, demonstrating that Chinese firms ...

InfoQ

Mistral AI Releases Pixtral Large: a Multimodal Model for Advanced Image and Text Analysis

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Forbes

Beyond Large Language Models: How Multimodal AI Is Unlocking Human-Like Intelligence

The AI industry has long been dominated by text-based large language models (LLMs), but the future lies beyond the written word. Multimodal AI represents the next major wave in artificial intelligence ...

SiliconANGLE

Amazon introduces Nova family of multimodal AI foundation models

Amazon Web Services Inc., the cloud division of Amazon.com Inc., today announced a new family of multimodal, generative artificial intelligence models called Nova. Amazon Chief Executive Andy Jassy ...

Semiconductor Engineering

NPU Acceleration For Multimodal LLMs

Transformer-based models have rapidly spread from text to speech, vision, and other modalities. This has created challenges for the development of Neural Processing Units (NPUs). NPUs must now ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results