computer-vision

Ever stared at a cryptic medicine bottle, wondering if it interacts with your morning coffee or that other pill you're taking? For the elderly or those with visual impairments, reading tiny labels on medication packaging is more than a nuisance—it’s a safety hazard. In this tutorial, we are building a Medication Safety Assistant . This isn't just a simple OCR tool; we are implementing a Multimoda…
Bengali OCR is challenging due to complex characters and handwriting variations. This paper proposes a lightweight CNN with 40×40 input for real-time Bengali OCR, achieving 98.29% accuracy with reduced computational complexity for edge deployment.
Static portfolios blend together. Every designer's site has the same grid of JPEGs. We wanted something different for our own product pages at Inithouse, a studio shipping a growing portfolio of products in parallel, so we started experimenting with living photos: short AI-generated animations that make a still image breathe. Here's how we did it, what we learned about performance, and the code y…
Autonomous AI models can bring specialist-level retinal screening to clinics worldwide, aiding the diagnosis of diabetic retinopathy and other eye conditions.
Obviously, Instagram does not want you to automate engagement. Their HTML is a mess of randomly generated class names and deeply nested divs. The structure changes every deployment. Any script that relies on DOM selectors breaks within weeks because the class name doesn't exist anymore. But it doesn't matter anyway. Instagram can obfuscate their code all they want because code is for machines. Bu…
In high-resolution remote sensing imagery, near-shore water bodies typically exhibit tortuous shorelines, fragmented lakeshore coves, and superimposed disturbances such as building reflections, vegetation shadows, and mixed substrates, posing significant challenges to the fine extraction of water boundaries. Although deep learning-based semantic segmentation has substantially improved water body …
BackgroundAutomated analysis of Pap-smear images plays an important role in cervical cancer screening, particularly in low-resource settings where manual cytology remains labour-intensive, subjective, and prone to inter-observer variability. On the other hand, accurate segmentation of the nucleus and cytoplasm is a fundamental step in computer-aided diagnosis systems because it enables quantitati…
ViT-ConvGAN: a hybrid model for spatiotemporal action recognition using video transformer and 3D CNN
Train a Roboflow localization model, isolate printed label fields, and verify batch numbers and expiry dates with Google Gemini.
Scientific Reports, Published online: 11 June 2026; doi:10.1038/s41598-026-52345-6 Lightweight CNN SE transformer for robust weed classification with optimizer aware performance
We are excited to announce support for YOLO26 semantic segmentation in Roboflow.
Claude Fable 5 is a strong reasoning model for visual understand but not a state-of-the-art vision model.
Combine RF-DETR and LMMs to build an AI pipeline that perceives, reasons, and acts.
Scientific Reports, Published online: 11 June 2026; doi:10.1038/s41598-026-55337-8 Language-assisted multimodal convolutional transformer pipeline for retinal lesions segmentation
Robert Dillon was arrested at home in Florida despite living 300 miles away, and charges were later dropped Sign up for the Breaking News US newsletter email A Florida man is suing several law enforcement agencies for his arrest and prosecution for allegedly luring a child after he was wrongly identified using faulty AI facial recognition software. According to the Jacksonville Beach police depar…

research.ioSign up to keep scrolling
Create your feed subscriptions, save articles, keep scrolling.







