DEV Community

# computervision

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Unlock the Secrets of Unlabeled Videos: A Deep Dive into Zero-Effort AI Training

Unlock the Secrets of Unlabeled Videos: A Deep Dive into Zero-Effort AI Training

1
Comments
2 min read
Forget Labels: AI Learns Continuously From Raw Video (and It's a Game Changer)

Forget Labels: AI Learns Continuously From Raw Video (and It's a Game Changer)

1
Comments
2 min read
Seeing in the Dark: Unveiling Hidden Details with Adaptive Image Processing

Seeing in the Dark: Unveiling Hidden Details with Adaptive Image Processing

1
Comments
2 min read
Vision Transform

Vision Transform

Comments
16 min read
See the Unseen: Blending Fisheye and Pinhole Vision for Next-Gen 3D Scanning

See the Unseen: Blending Fisheye and Pinhole Vision for Next-Gen 3D Scanning

Comments
2 min read
Unlock the World in 3D: AI Bridges the Depth Gap Between Phone Cameras

Unlock the World in 3D: AI Bridges the Depth Gap Between Phone Cameras

Comments
2 min read
Challenges to adapt AI-based Video Codecs

Challenges to adapt AI-based Video Codecs

Comments 1
5 min read
Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy

Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy

Comments
2 min read
Unlocking AI's Inner Artist: A New Way to Sculpt 3D with Neural Networks

Unlocking AI's Inner Artist: A New Way to Sculpt 3D with Neural Networks

Comments
2 min read
Sculpting Reality: High-Fidelity 3D Models from Neural Nets by Arvind Sundararajan

Sculpting Reality: High-Fidelity 3D Models from Neural Nets by Arvind Sundararajan

1
Comments
2 min read
AI Sees the Forest for the Trees: Revolutionizing Plant Counting for a Greener Future

AI Sees the Forest for the Trees: Revolutionizing Plant Counting for a Greener Future

Comments
2 min read
I Built an AI That Reads My Blinks and Speaks Morse Code

I Built an AI That Reads My Blinks and Speaks Morse Code

12
Comments
5 min read
Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality

Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality

Comments
9 min read
From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend

From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend

Comments
4 min read
How I Built an AI-Powered Face Recognition App from Scratch

How I Built an AI-Powered Face Recognition App from Scratch

Comments
1 min read
Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes

Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes

Comments
5 min read
Smart Stable Monitoring System for Premium Remote Horse Care

Smart Stable Monitoring System for Premium Remote Horse Care

1
Comments
9 min read
[memo]SafeVLA: Towards Safety Alignment of VisionLanguage-Action Model via Constrained Learning

[memo]SafeVLA: Towards Safety Alignment of VisionLanguage-Action Model via Constrained Learning

Comments
1 min read
Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H

Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H

Comments
7 min read
Building a Motion Tracking Balloon Burst Game with Python & OpenCV

Building a Motion Tracking Balloon Burst Game with Python & OpenCV

Comments
3 min read
Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup

Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup

Comments
3 min read
Does DINO loss compare the [CLS] tokens from both teacher and student?

Does DINO loss compare the [CLS] tokens from both teacher and student?

Comments
1 min read
Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)

Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)

Comments
5 min read
[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Comments
1 min read
Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.

Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.

2
Comments 1
3 min read
loading...