#ComputerVision

2025-12-14
2025-12-13

🔬 Excited to present our latest research at the #MAIN2025 conference today!
🔗 laurentperrinet.github.io/talk

👁️ What if CNNs could see like humans? Our new work shows how foveated vision—concentrating processing at gaze center—makes networks more robust to perturbations & great at localization. Inspired by human vision's architecture (high-resolution foveal center, low-resolution periphery), we embedded this retinotopic transformation into CNN architectures, allowing to actively scan the image. This gives literally a new look to #ConvNets !

📄 Paper: "Foveated Retinotopy Improves Classification and Localization in CNNs"
🔗 laurentperrinet.github.io/publ

#DeepLearning #ComputerVision #AI #Research #NeuralNetworks #NeuroAI #OpenScience I love #Montreal

OpenCVopencv
2025-12-11

We're live at 9am Pacific with Joseph Nelson, CEO of Roboflow, who will be showing and telling us all about Roboflow Rapid, the game-changer for image labeling. Train & Deploy custom vision models in just minutes, with as few as 10 images.

Watch live on YouTube for your chance to win a free OpenCV University course and participate in the Q&A youtube.com/live/JpmTsokeYjM

Poster for a live webinar with the title "Train & Deploy in Minutes with Roboflow Rapid" and the face of Joseph Nelson, CEO of Roboflow.
Gareth Halfacreeghalfacree
2025-12-11

And finally: computer vision specialist has a new fourth-generation OAK smart camera family out, available with or without stereo vision capabilities - and now with the option of an interchangeable lens version with rolling or global shutter sensor. 52 TOPS of INT8 compute on each, I'm told(!)

hackster.io/news/luxonis-launc

2025-12-10
FOSS Advent Calendar - Door 11: Read Any Text with EasyOCR

Meet EasyOCR, a lightweight open source optical character recognition (OCR) engine that makes extracting text from images and documents almost effortless. Supporting over 80 languages, including those with complex scripts and mixed language text, it's designed to be powerful, accurate, and incredibly straightforward to use.

Built on PyTorch and integrating deep learning models, EasyOCR delivers high recognition accuracy even on challenging images, low resolution, skewed text, or complex backgrounds. What sets it apart is its simplicity: with just a few lines of code, you can have a fully functional OCR pipeline running locally, without needing an internet connection or external APIs. Your data remains completely private.

Whether you're digitizing printed material, extracting text from screenshots (for example, lyrics from L’âme Immortelle, an Austrian dark wave band), automating document workflows, or analyzing visual data, EasyOCR gets the job done quickly and reliably.

Pro tip: Use it to create searchable PDFs, translate foreign text in images, or even capture and digitize handwritten notes with the right training data.

Link: https://github.com/JaidedAI/EasyOCR

What text would you like to extract from images? Scanned books, street signs, or maybe your old family documents?

#FOSS #OpenSource #OCR #EasyOCR #TextRecognition #AI #DeepLearning #Python #ComputerVision #DocumentDigitization #DataExtraction #Privacy #LocalAI #Multilingual #OpenTools #Fediverse #TechNerds #AdventCalendar #adventkalender #adventskalender #TextExtraktion #KI #PyTorch #DevCommunity #Automation #OfflineAI #PythonProgramming
Yellow Papayaexoticroot
2025-12-10

Some robot stuff to back up my profile claims.

2025-12-10

[CẬP NHẬT] Công cụ **im-vid-detector** dựa trên YOLOE giúp phát hiện ảnh và video tự động. Bài đăng từ /u/1krzysiek01 trên Reddit (subreddit: r/computervision). #AI #ComputerVision #YoLoE #CôngNghệTríTuệNhânTạo #NhậnDạngHìnhẢnh

reddit.com/r/opensource/commen

OpenCVopencv
2025-12-08

Training and deploying a computer vision model doesn't have to be hard, just ask Roboflow! Roboflow Rapid is a system that enables users to train vision models in under 5 minutes with just a single video or 10 images. youtube.com/live/JpmTsokeYjM

2025-12-08

Ph.D. position - Dynamic changes in material appearance
UTIA AV CR, v.v.i.

We are opening a PhD position at our institute in Prague with focus on modelling dynamic effects of material appearance within the EC MSCA DN.

See the full job description on jobRxiv: jobrxiv.org/job/utia-av-cr-v-v

#computervision #da...
jobrxiv.org/job/utia-av-cr-v-v

2025-12-08
2025-12-08

Реализуем компьютерное зрение на практике

На тему компьютерного зрения есть множество различных публикаций, которые в основном рассказывают о применении этой технологии в разных отраслях. Однако, зачастую публикации содержат лишь общую информацию о том, что реализовано и для каких задач, но при этом отсутствует описание того, как это можно сделать. В нашей статье мы поговорим о том, как можно реализовать на Python навигационную систему на основе машинного зрения для автономных транспортных средств, проанализировать медицинские изображения и выполнить генерацию новых изображений из набора данных уже существующих.

habr.com/ru/companies/otus/art

#ai #computervision #ml #компьютерное_зрение #обработка_изображений #автономная_навигация #сегментация_изображений #генерация_изображений #нейронные_сети #глубокое_обучение

HabileDatahabiledata
2025-12-08

Top 10 Image Annotation Companies to Outsource in 2026

Image annotation services make it easier to scale computer-vision projects by delivering accurate, high-quality labeled datasets. Outsourcing helps reduce costs and ensures consistent annotation standards.

Trained specialists, strong quality checks, and flexible capacity can speed up model development while maintaining reliable data foundations.

Read more: differ.blog/p/top-10-image-ann

Image Annotation
2025-12-08

Full writeup @opensource.block.xyz@bsky.brid.gy's Advent of AI 2025 - Day 5: I Built a Touchless Flight Tracker You Control With Hand Gestures dev.to/nickytonline... #adventofai #computervision #ai

Advent of AI 2025 - Day 5: I B...

2025-12-07

After a few fruitless attempts it becomes apparent that the problem is probably
in the lack of degrees of freedom of the robot movements in the 3d space. The #handeye calibration solves equations in 6 dimensions: 3 translational + 3 rotational. Our robot, however, cannot jump up and down. So there's no variability in the Z axis translation. Also the robot cannot tilt. It simply rolls on a flat floor and can only rotate left and right. So there's no rotations of the robot, and Therefore of the camera, around X and Y axes. So, out of 6 possible degrees of freedom in the 3d space, the 3 translational dimensions and 3 rotations we can only utilize 3 of them:
- translation in X directions,
- translation in Y directions, and
- rotation around Z axis.

This really sounds that we are very short of measurements variability. We gotta get more creative on how we can enrich our measurements, or maybe introduce the constraints into the algorithm somehow. Think. Think...

#PhotonVision #openCV #computerVision #Limelight #WPILib

2025-12-05

It's been a while I did not play with #OpenCV . It's quite refreshing since I was mostly busy with high level programming, devops and sysadmin this past two years, and I missed #ComputerVision #programming.

Still, here's something I did NOT miss:
- cv::Mat::size returns the actual N-dimensional size of the matrix
- cv::Mat::size() - with the parentheses - returns a two-dimensional size [width, height] only, and [-1, -1] in case it"s not 2D.

Who the hell designed this fracking #API?

stackoverflow.com/questions/14

Grrmmmbll.

2025-12-05

Цифровые культиваторы, теплицы и мотоблоки или мультиагентная трансформация АПК

Миронов В.О., Кальченко С.Н. Приветствую вас, бравые хаброжители ;)) В наше время искусственный интеллект очень быстро развивается, при этом, вносит значительные коррективы в развитие различных профессий, диктуя там свои правила и виденье. При этом основные козыри — это скорость, время и профит. В этом контексте мы и будем говорить о сложившейся ситуации, а именно, о дифференцированной трансформации профессий. Да-да, все видели, эти километровые лонгриды, когда ИИшка выкатывает список профессий, которые попадают под трансформацию. При этом какие-то прогнозы сбываются какие-то нет, как и в целом всё в жизни. Однако, почему именно дифференцированной, да всё потому что, профессии даже не столько дифференцируются, сколько видоизменяются, но их суть остаётся той же. Бывает даже так, что не всегда удаётся охватить весь спектр нововведений.

habr.com/ru/articles/973682/

#analytics #analysis #agrohack #agrocode #machinelearning #computervision #computer_science #data_science #data_analysis #data_engineering

Gareth Halfacreeghalfacree
2025-12-02

And lastly, at least until the other four pop off the stack, I've been following @libbymiller's progress on this project and it's ace: a -powered OpenCV-driven synth-of-a-sorts which turns a printed sheet of blocks and some LEGO bricks into music.

hackster.io/news/libby-miller-

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst