OpenCV

Calibration is one of those problems every computer vision practitioner knows and knows well. Getting multi-sensor, multi-modal systems to agree on a shared view of the world is hard work, and doing it reliably at production scale has historically meant stitching together brittle pipelines, fighting edge cases, and hoping nothing drifts the next time you […] The post Tangram Vision and OpenCV Are…

computer-sciencecomputer-vision

2.5 billion model inferences every day across robotics, healthcare, manufacturing, and beyond. That’s the scale at which Ultralytics YOLO operates, and at ​OSCCA on May 4th in Los Angeles​, you’ll hear directly from the person who built it. We’re excited to announce that Glenn Jocher, Founder and CEO of Ultralytics, will be delivering an pre-recorded […] The post Glenn Jocher of Ultralytics (YOLO…

Over 1.5 billion downloads. Used in everything from self-driving cars to medical imaging to robotics. OpenCV has become the backbone of modern computer vision — and it all started with one person. Gary Bradski, the founder of OpenCV, is coming to OSCCA, the OpenCV-SID Conference on Computer Vision & AI, on May 4th in Los […] The post The Founder of OpenCV Is Speaking at OSCCA, Don’t Miss It! appe…

aicomputer-vision

What does it look like when computer vision and AI power experiences for millions of guests at Disney scale, from AI-driven robotic characters to conversational droids in a galaxy far, far away? Find out at OSCCA, the OpenCV-SID Conference on Computer Vision & AI, happening May 4th in Los Angeles as part of Display Week […] The post Behind the Magic: Disney Research Imagineering’s Doug Fidaleo Co…

aicomputer-vision

What does it take to build an AI that competes in professional motorsports — no driver, no remote control, just autonomous decision-making at race speed? Find out at OSCCA, the OpenCV-SID Conference on Computer Vision & AI, happening May 4th in Los Angeles as part of Display Week 2026. One of our featured speakers is […] The post When the Track Is Your Lab: Meet the Team Racing Without a Driver a…

aiautonomous-systemscomputer-vision

OpenCV is continuing our partnership with the awesome Display Week conference, joining them in Los Angeles this May 4th for a special one-day event packed with insights from Computer Vision & AI visionaries and a 3-hour workshop from OpenCV CEO Dr. Satya Mallick. OSCCA is back for 2026! Last year was the first time we’d […] The post Attend The OpenCV-SID Conference On Computer Vision & AI This Ma…

aicomputer-sciencecomputer-vision

Join the organizers of the Embedded Vision Summit on this preview webinar for an insider look at the premier conference on practical computer vision and edge AI, highlighting key trends, sessions, and what to expect at this year’s event organized by the Edge AI and Vision Alliance. OpenCV has been going to Embedded Vision Summit […] The post Preview The Embedded Vision Summit 2026 Conference On O…

aicomputer-vision

A real-world robotics challenge with a $180K prize pool, where innovation and industry impact collide. We’re standing at an inflection point in robotics: electronics assembly, especially dexterous manipulation remains one of the biggest open problems in industry today. Tasks like handling flexible cables or inserting connectors during electronics assembly, are still exceedingly hard for robots [……

airobotics

This project controls a Universal Robots UR5 using real-time face tracking built with OpenCV. A standard webcam provides a live video stream that detects a human face, computes its position relative to the image center, and maps this offset into the robot’s Cartesian workspace. The robot’s tool center point (TCP) is then updated continuously, resulting in smooth, responsive motion that follows th…

aicomputer-visionroboticstechnology

Note: This event has been rescheduled but the links still work. Simultaneous Localization & Mapping (SLAM) is one of the most active and contentious areas of CV & robotics. Should you use purely visual SLAM? Do you need LiDAR? What about indoor .vs. outdoor use cases? We’ll cover all these and more with OpenCV community […] The post Part 3: Simultaneous Localization & Mapping: Which SLAM Is For Y…

airobotics

This year the Low-Power Computer Vision Challenge (LPCV) has three tracks with serious prize money including Image-to-Text Retrieval, Action Recognition in Video and AI Generated Images Detection. Each track has over $10,000 in prizes up for grabs, and is open for participation! On this week’s episode we welcome back the LPCV organizers to give us […] The post OpenCV Live: The Low-Power Computer …

aicomputer-vision

In this blog, we explore Visual Place Recognition (VPR) with hands-on examples using OpenCV and lightweight Python tools. You will create a practical VPR pipeline that includes visual descriptor extraction, global image encoding, similarity-based image retrieval, and optional geometric verification. By the end, you’ll understand how VPR works in practice and have a standalone system capable of de…

aicomputer-sciencecomputer-visionmachine-learning

Explore the elegant intersection of nature-inspired algorithms and computer vision. This comprehensive technical guide unveils the powerful watershed segmentation technique, demonstrating how a simple topographic analogy translates into sophisticated image analysis capabilities using OpenCV. The post Watershed Segmentation Using OpenCV appeared first on OpenCV .

algorithmscomputer-sciencecomputer-vision

In this blog post, we'll tackle this challenge head-on with a practical approach to shadow correction using OpenCV. Our method leverages Multi-Scale Retinex (MSR) for illumination normalization, combined with adaptive shadow masking in LAB and HSV color spaces. This technique not only removes shadows effectively but also preserves natural colors and textures. The post Enhancing Images: Adaptive S…

algorithmscomputer-science

This blog explores how to build a smart, browser-based document scanner using OpenCV.js and live OCR. It covers document detection, perspective correction, interactive preprocessing, and client-side text extraction—all running entirely in the browser for maximum privacy and performance. The post Smart Document Scanning with Live OCR using OpenCV.js appeared first on OpenCV .

computer-sciencehcisoftware-engineeringtechnology

Explore OpenCV G-API and how it transforms image-processing pipelines from imperative to declarative with graph-based execution. The post OpenCV G-API: From Imperative to Declarative Pipelines appeared first on OpenCV .

algorithmscomputer-science

EgoX introduces a novel framework for translating third-person (exocentric) videos into realistic first-person (egocentric) videos using only a single input video. The work tackles a highly challenging problem of extreme viewpoint transformation with minimal view overlap, leveraging pretrained video diffusion models and explicit geometric reasoning to generate coherent, high-fidelity egocentric v…

aicomputer-vision

Underwater images often suffer from color loss, low contrast, and haze due to light absorption and scattering. This blog presents a multi-stage OpenCV pipeline in Python to enhance underwater images and videos using white balance correction, red channel restoration, CLAHE, dehazing, sharpening, and gamma correction, with real-time interactive tuning. The post Guide to Underwater Image Enhancement…

computer-scienceprogramming-languages

Omni-Attribute introduces a new paradigm for fine-grained visual concept personalization, solving a long-standing problem in image generation: how to transfer only the desired attribute (identity, hairstyle, lighting, style, etc.) without leaking irrelevant visual details. Developed by researchers from Snap Inc., UC Merced, and CMU, this work proposes the first open-vocabulary image attribute enc…

aicomputer-visionmachine-learning

We capture the world with cameras that compress depth, texture, and geometry into flat pixel grids, yet our minds effortlessly reconstruct the 3D structure behind them. What if computers could do the same? Structure-from-Motion (SfM) is the technique that enables this. By analyzing how features shift across multiple images, SfM simultaneously recovers the camera motion and the scene’s underlying …

computer-sciencecomputer-vision
research.ioresearch.io

Sign up to keep scrolling

Create your feed subscriptions, save articles, keep scrolling.

Already have an account?