Meta open-sources a multi-sensory artificial intelligence model that integrates six types of data, including text, audio and visual

2026-02-09 Update From: SLTechnology News&Howtos

Shulou(Shulou.com)11/24 Report--

CTOnews.com, May 9 / PRNewswire-FirstCall-Asianet /-- Meta has released a new open source artificial intelligence model, ImageBind, which links together multiple data streams, including text, audio, visual data, temperature and motion readings. The model is currently only a research project with no immediate consumer or practical applications, but it points to a future of generative artificial intelligence systems that can create immersive, multi-sensory experiences. It also shows that Meta is continuing to share its artificial intelligence research openly, while competitors such as OpenAI and Google are becoming increasingly closed.

The core concept of this study is to link multiple types of data in a single multidimensional index (or, in artificial intelligence terms, an "embedding space"). The concept may sound abstract, but it underpins the recent boom in generative artificial intelligence. For example, artificial intelligence image generators such as DALL-E, Stable Diffusion, and Midjourney rely on systems that link text and images during the training phase. They look for patterns in the visual data while connecting that information to descriptions of the images. This is what allows these systems to generate pictures from a user's text input. The same is true of many artificial intelligence tools that generate video or audio in the same way.
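To make the idea of a shared embedding space concrete, here is a minimal sketch. It assumes that text and images have already been turned into vectors of the same length; the file names, captions, and three-dimensional vectors below are hand-written for illustration and do not come from any real model. The sketch only shows how a caption can be matched to the nearest image by cosine similarity once both live in the same space.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical, hand-written embeddings. A trained model would learn these
# vectors (typically with hundreds of dimensions) from paired data.
IMAGE_EMBEDDINGS = {
    "photo_of_dog.jpg": [0.9, 0.1, 0.0],
    "photo_of_ship.jpg": [0.0, 0.2, 0.9],
}
TEXT_EMBEDDINGS = {
    "a dog playing fetch": [0.8, 0.2, 0.1],
    "a ship at sea": [0.1, 0.1, 0.95],
}


def best_image_for(caption):
    """Pick the image whose embedding is closest to the caption's embedding."""
    query = TEXT_EMBEDDINGS[caption]
    return max(
        IMAGE_EMBEDDINGS,
        key=lambda name: cosine_similarity(query, IMAGE_EMBEDDINGS[name]),
    )


if __name__ == "__main__":
    for caption in TEXT_EMBEDDINGS:
        print(caption, "->", best_image_for(caption))
```

Because both kinds of data share one space, the same comparison works in the other direction (image to caption) without any extra machinery.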

Meta says ImageBind is the first model to combine six types of data into a single embedding space. The six types are: visual (in the form of both images and video); thermal (infrared images); text; audio; depth information; and, most intriguing of all, motion readings generated by an inertial measurement unit (IMU). (IMUs are found in mobile phones and smartwatches, where they are used for a range of tasks, from switching a phone from landscape to portrait to distinguishing between different types of physical activity.)
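The structure described above, six kinds of input ending up in one space, can be sketched as one encoder per modality, each mapping its own raw feature size to a common output size. This is only a structural illustration under stated assumptions: the raw feature sizes, the embedding size, and the fixed random linear projections standing in for encoders are all hypothetical, and random projections do not produce the meaningful alignment that a trained model would. It is not ImageBind's actual architecture.

```python
import random

EMBED_DIM = 4  # hypothetical size of the shared embedding space

# Hypothetical raw feature sizes for the six modalities named in the article.
RAW_DIMS = {
    "visual": 8,
    "thermal": 6,
    "text": 5,
    "audio": 7,
    "depth": 6,
    "imu": 3,
}


def make_encoder(raw_dim, rng):
    """Build a stand-in encoder: a fixed random linear projection.

    A real system would use a trained neural network per modality.
    """
    weights = [
        [rng.uniform(-1.0, 1.0) for _ in range(raw_dim)]
        for _ in range(EMBED_DIM)
    ]

    def encode(features):
        if len(features) != raw_dim:
            raise ValueError(f"expected {raw_dim} features, got {len(features)}")
        # Each output coordinate is a weighted sum of the raw features.
        return [sum(w * x for w, x in zip(row, features)) for row in weights]

    return encode


_rng = random.Random(0)  # seeded so the sketch is reproducible
ENCODERS = {name: make_encoder(dim, _rng) for name, dim in RAW_DIMS.items()}


def embed(modality, features):
    """Map raw features of any supported modality into the shared space."""
    return ENCODERS[modality](features)


if __name__ == "__main__":
    for name, dim in RAW_DIMS.items():
        vector = embed(name, [0.5] * dim)
        print(name, "->", len(vector), "dimensions")
```

Whatever the input type, the output always has `EMBED_DIM` coordinates, which is what lets vectors from different modalities be compared directly.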

The idea is that future artificial intelligence systems will be able to cross-reference this data in the same way that current systems do for text input. For example, imagine a future virtual reality device that generates not only audio and visual input but also your environment and your movement on a physical platform. You could ask it to simulate a long sea voyage, and it would not only place you on a ship with the sound of the waves in the background, but also make you feel the deck rocking under your feet and the sea breeze blowing.

Meta noted in a blog post that future models could also add other streams of sensory input, including "touch, speech, smell, and brain fMRI signals". The company also claims that the research "brings machines one step closer to humans' ability to learn simultaneously, holistically, and directly from many different forms of information."

Of course, much of this is speculative, and the immediate applications of this research are likely to be far more limited. Last year, for example, Meta demonstrated an artificial intelligence model that generates short, blurry videos from text descriptions. Research like ImageBind shows how future versions of that system could incorporate other data streams, for example by generating audio to match the video output.

The study is also interesting to industry watchers because, CTOnews.com notes, Meta has open-sourced the underlying model, a practice that is drawing growing scrutiny in the field of artificial intelligence.
