Intel Labs Advances Computer Vision Development with Two New AI Models

March 24, 2023 | Intel

Estimated reading time: 2 minutes

Depth estimation is a challenging computer vision task required to create a wide range of applications in robotics, augmented reality (AR) and virtual reality (VR). Existing solutions often struggle to correctly estimate distances, which is a crucial aspect in helping plan motion and avoiding obstacles when it comes to visual navigation. Researchers at Intel Labs are addressing this issue by releasing two AI models1 for monocular depth estimation: one for visual-inertial depth estimation and one for robust relative depth estimation (RDE).

The latest RDE model, MiDaS version 3.1, predicts robust relative depth using only a single image as an input. Due to its training on a large and diverse dataset, it can efficiently perform on a wider range of tasks and environments. The latest version of MiDaS improves model accuracy for RDE by about 30% with its larger training set and updated encoder backbones.

MiDaS has been incorporated into many projects, most notably Stable Diffusion 2.0, where it enables the depth-to-image feature that infers the depth of an input image and then generates new images using both the text and depth information. For example, digital creator Scottie Fox used a combination of Stable Diffusion and MiDaS to create a 360-degree VR environment. This technology could lead to new virtual applications, including crime scene reconstruction for court cases, therapeutic environments for healthcare and immersive gaming experiences.

Intel Introduces MiDaS 3.1 for Computer Vision

While RDE has good generalizability and is useful, the lack of scale decreases its utility for downstream tasks requiring metric depth, such as mapping, planning, navigation, object recognition, 3D reconstruction and image editing. Researchers at Intel Labs are addressing this issue by releasing VI-Depth, another AI model that provides accurate depth estimation.

VI-Depth is a visual-inertial depth estimation pipeline that integrates monocular depth estimation and visual-inertial odometry (VIO) to produce dense depth estimates with a metric scale. This approach provides accurate depth estimation, which can aid in scene reconstruction, mapping and object manipulation.

Incorporating inertial data can help resolve scale ambiguity. Most mobile devices already contain inertial measurement units (IMUs). Global alignment determines appropriate global scale, while dense scale alignment (SML) operates locally and pushes or pulls regions toward correct metric depth. The SML network leverages MiDaS as an encoder backbone. In the modular pipeline, VI-Depth combines data-driven depth estimation with the MiDaS relative depth prediction model, alongside the IMU sensor measurement unit. The combination of data sources allows VI-Depth to generate more reliable dense metric depth for every pixel in an image.

Share on:

Testimonial

"Advertising in PCB007 Magazine has been a great way to showcase our bare board testers to the right audience. The I-Connect007 team makes the process smooth and professional. We’re proud to be featured in such a trusted publication."

Klaus Koziol - atg

Suggested Items

VeriSilicon’s AI-ISP Custom Chip Solution Enables Mass Production of Customer’s Smartphones

06/09/2025 | BUSINESS WIRE
VeriSilicon recently announced that its AI-ISP custom chip solution has been successfully adopted in a customer’s mass-produced smartphones, reaffirming the company’s comprehensive one-stop custom silicon service capabilities in AI vision processing.

Orbel Corporation Integrates Schmoll Direct Imaging

06/04/2025 | Schmoll America
Orbel Corporation in Easton, PA, proudly becomes the first PCM facility in the U.S. equipped with Schmoll’s MDI Direct Imaging system. This installation empowers Orbel to support customers with greater precision and quality.

ICEYE, Safran Announce Strategic Partnership on Persistent Surveillance Capabilities

05/20/2025 | PRNewswire
Safran.AI and ICEYE announce a long-term strategic partnership, leveraging their combined expertise to deliver advanced multisensor Artificial Intelligence (AI) solutions that enable governments to achieve faster and more accurate decision-making in geospatial intelligence.

Summit Interconnect Hollister Elevates PCB Prototyping with New TiTAN Direct Imaging System from Technica USA

05/01/2025 | Summit Interconnect, Inc.
Summit Interconnect’s Hollister facility has recently enhanced its quick-turn PCB prototyping capabilities by installing the TiTAN PSR-H Direct Imaging (DI) system.

The Knowledge Base: Unlocking the Invisible—The Critical Role of X-ray Technology

04/29/2025 | Mike Konrad -- Column: The Knowledge Base
From detecting voids under BGAs to solder defects in high-reliability applications, X-ray inspection has become an indispensable tool in modern manufacturing. But how is the technology evolving? What challenges do experts face in deploying X-ray inspection effectively and what does the future hold for this critical quality assurance method?

News Highlights

More News

Featured Books

Article Highlights

More Articles

Latest Columns

See all of our columnists

Media Kit - Choose Your Primary Marketing Focus: