Computer Vision

December 19, 2017December 19, 2017

Image recognition and AI on a Raspberry Pi 3 using MobileNets and Neural Compute Stick

If you are building a robot driven by Raspberry Pi and want to use image recognition and object detection you may want to look into Googles Mobile Nets platform which lets you do use a several mobile-first computer vision models for TensorFlow, combined with an Intel Movidius Neural Compute Stick on a Rasrberry PI 3. The MobileNets platform is designed to be run on resource conservative devices while maintaining accuracy and the latter will give you an order of magnitude more compute power than running the detection on the raspberrys CPU.

November 1, 2017

Course 4 [deeplearning.ai] has been released!

The fourth course, Convolutional Neural Networks of Deeplearning.ai has now been released on coursera. People have been waiting for this one, but i think that the delay was to make the material very up to date with current research results. The four weeks of learning deals with:

Foundations of Convolutional Neural Networks
Deep convolutional models: case studies
Object detection
Special applications: Face recognition & Neural style transfer

September 26, 2017

You only look once

The “YOLO9000: Better, Faster, Stronger” paper describes the improvements to the YOLO, You only look once, architecture that enables realtime object detection and classification. It can classify over 9000 object categories and outperforms Faster RCNN with ResNet and SSD while being significantly faster. They train on both COCO dataset for detection simultaneously with ImageNet for Classification and combine it with a wordtree so that they can also fallback to “dog” if they cannot classify for instance a specific dog breed.

The first version, and architecture can be seen in this paper.

Here is a video presentation: https://www.youtube.com/watch?v=GBu2jofRJtk

August 30, 2017

Stanford CS231n 2017 – Convolutional Neural Networks for Visual Recognition

The video lectures for Stanfords very popular CS231n (Convolutional Neural Networks for Visual Recognition) that was held in Spring 2017 was released this month. (According to their twitter page, the cs231n website gets over 10 000 views per day. The reading material on their page is really good at explaining CNNs)

Here are the video lectures:

These are the assignments for the course:

Also Make sure to check out last years student reports. note: one is about improving the state of the art of detecting the Higgs Boson.

June 26, 2017

Stanford CS231n – Convolutional Neural Networks for Visual Recognition

Here is a link to the most recent cs231n course at standford

The course page is here: http://cs231n.stanford.edu/