Meta Unveils Open-Source Multimodal Generative AI System

Source: PetaPixel

Meta says ImageBind is the first AI model to combine six types of data into a single 'embedding space.'

Example of ImageBind showing an image generated from a photo and an audio clip.

Meta has announced a new generative AI model called “ImageBind” that links together six different modalities: images, text, audio, depth, thermal, and IMU. The concept is similar to how AI image generators like Midjourney, DALL-E, and Stable Diffusion work. These systems link together text and images so the AI can understand patterns in the visual data.

Facebook owner Meta says that ImageBind is the first AI model to combine six types of data into a single “embedding space.” For example, it can create an image from an audio clip, such as an image based on the sounds of a rainforest or a bustling market. The system is intended to mimic the way humans interpret the world, which is a multi-sensory experience.
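To make the idea of a shared “embedding space” more concrete, here is a minimal sketch in Python. It is not Meta's code. The three-number vectors are hand-made, hypothetical stand-ins for what trained encoders would produce, and the names `image_embeddings`, `audio_embedding_rainforest`, and `nearest` are invented for illustration. The sketch only shows the retrieval half of the idea: once audio and images live in the same space, the image closest to an audio clip can be found by comparing vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings: a real system would compute these with
# learned encoders, and they would have far more than 3 dimensions.
image_embeddings = {
    "rainforest_photo": [0.9, 0.1, 0.2],
    "market_photo": [0.1, 0.9, 0.3],
}
audio_embedding_rainforest = [0.8, 0.2, 0.1]


def nearest(query, candidates):
    """Return the name of the candidate vector most similar to the query."""
    return max(
        candidates,
        key=lambda name: cosine_similarity(query, candidates[name]),
    )


best_match = nearest(audio_embedding_rainforest, image_embeddings)
print(best_match)
```

Because both kinds of data are represented as vectors in one space, the same comparison would work between any pair of modalities, which is what makes a single embedding space useful.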

“Today, we’re introducing an approach that brings machines one step closer to humans’ ability to learn simultaneously, holistically, and directly from many different forms of information, without the need for explicit supervision,” Meta says.

The one modality that you may not recognize is IMU (inertial measurement unit). This technology is found in phones and smartwatches, where it performs a range of tasks, including switching a phone from landscape to portrait when the device is physically rotated.
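As a rough illustration of the screen-rotation task mentioned above, the sketch below guesses a phone's orientation from two accelerometer readings. It is a simplified assumption, not how any particular phone implements it: the function name `screen_orientation` and the sample readings are hypothetical, the x axis is assumed to run along the short edge of the device and the y axis along the long edge, and real devices typically add filtering and other sensor data.

```python
def screen_orientation(accel_x, accel_y):
    """Guess orientation from gravity measured along the device axes.

    accel_x: acceleration along the short edge, in m/s^2 (assumed axis).
    accel_y: acceleration along the long edge, in m/s^2 (assumed axis).
    Gravity pulls mostly along the long edge when the phone is upright.
    """
    if abs(accel_y) >= abs(accel_x):
        return "portrait"
    return "landscape"


# Hypothetical sample readings, not real sensor logs.
upright = screen_orientation(accel_x=0.3, accel_y=9.7)
on_its_side = screen_orientation(accel_x=9.6, accel_y=0.5)
print(upright, on_its_side)
```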




