computer-vision
Research on target recognition method for machine-plucked fresh tea leaves based on improved YOLOv11
Non-invasive eye scans allow doctors a zoomed-in, three-dimensional look beneath the eye's surface without causing discomfort or pain to the patient.
Background and objectives: Accurate medical image segmentation remains a challenging task in computer-aided diagnosis because of the intricacies and the variability in the biomedical data in terms of the anatomical complexity, inter-patient diversity, class imbalance, and irregular morphological patterns. Methods: In the present work, a Context Aware Adaptive Progressive Network (CA2PNet) is proposed.…
The scale wall: A computer vision pipeline that works on one image at one resolution isn't a pipeline. It's a prototype. The moment you move beyond controlled inputs, you hit the reality of production images: a 4K video frame, a satellite capture, a whole-slide pathology image, a high-resolution document scan. These images don't fit in a single model call. They're too large, too detailed, and too …
Stella is the companion app for Meta's smart glasses. Inspecting version 273.0.0.21 of the Android build (com.facebook.stella), I found the entire computational and storage stack for on-device facial recognition: three face models, a local database schema, a cosine-similarity vector index dimensioned to match the models, a write path that stages biometric records to disk, a fully wired notificat…
Recognition and classification tasks have become increasingly popular for automation in several fields. These tasks are commonly carried out using convolutional neural networks (CNNs) and feedforward neural networks (FFNNs). Their adaptability and feature extraction lead to high-accuracy image recognition results, despite being computationally expensive. However, high computational demands, large…


AMD is now an OpenCV 5 Launch Partner and will become an OpenCV Gold Sponsor as part of a collaboration focused on OpenCV 5 CPU and GPU acceleration. PALO ALTO, CA. – June 4th 2026 OpenCV, the world’s leading open-source computer vision library, today announced a new engineering collaboration with AMD to make AMD hardware a […] The post OpenCV and AMD Announce Collaboration to Accelerate Computer V…
Augmented reality (AR) devices such as smart glasses may soon become much smarter. Researchers have developed a new technology that can predict where a person will look next, potentially allowing AR systems to prepare information and graphics before the user even turns their eyes. The research was led by Fiona Ryan, a Ph.D. student in […] The post Smart glasses could soon predict what you’ll look…
By Jack Kao, author of mk-qa-master, an MCP-native QA toolkit. Most "AI testing" stops at calling an API and asserting the response isn't empty. Edge AI (a model running on a live camera feed) doesn't fit that mold. You can't assert exact bounding-box coordinates (the output is fuzzy by design), and "correct but 200ms too late" is a production failure, not a pass. When I added the edge runner…
The new model family covers reasoning, coding, image generation, voice, and transcription, with Microsoft Frontier Tuning designed to let organizations adapt models to their own workflows. Microsoft AI has launched seven new MAI models covering reasoning, coding, image generation, voice, and transcription. Photo Credit: Mustafa Suleyman. Microsoft AI has launched seven new in-house MAI models acr…
The Bellevue startup's funding will accelerate its FeaturePrint technology, which uses optical AI to create a unique digital "fingerprint" for physical objects: no barcodes, tags, or labels required.
Amazon will use visual search and AI to show AI-generated product images that match your search queries. The retailer says this will help guide users to products.
Journal of Computer Science, Published online: 3 June 2026; doi:10.3844/jcssp.2026.1785.1796 Early detection of lung cancer remains challenging due to high intra-class variation and inter-class similarity in Computed Tomography (CT) images. In this paper, we propose a hybrid deep learning mod...

At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems. The core challenge in physical AI research isn’t simply developing stronger models. It’s building a full workflow around them — reconstructing real-world scenes, generating edge-case scenarios, training policies, evaluating […
AI Identifies a Ray’s Prey From the Sound of Cracking Shells A whitespotted eagle ray feeds on hard-shelled mollusks in its natural habitat. The distinctive crunching sounds produced during feeding are helping FAU researchers develop AI-powered tools to monitor predator-prey interactions beneath the ocean surface.
Reconstructing Physically Stable 3D Scenes from a Single Image Reconstructing physically stable 3D scenes from a single RGB image enables casual images to be converted into simulation-ready digital assets for applications such as immersive interaction and content creation. However, existing single-image reconstruction methods fall short in capturing the physical structure of a scene. As a result,…

