Rosella       Machine Intelligence & Data Mining

Convolutional Neural Network for Computer Vision

Convolutional Neural Network (CNN) is a very powerful image classification modeling techniques. It can classify objects in images. As in the following figure, a CMSR CNN model consists of a number of streams and final fully connected layers. The number of streams is normally one. Advanced models can have multiple streams. A stream is a sequence of convolutional layers and pooling layers, normally pairs of convolutional and pooling layers. Final fully connected layers are made up with three layers: a merge layer, a fully connected ANN (Artificial Neural Network) layer and an output layer which is another ANN layer. Input is an image data. Give an image data, CNN network classify into class labels.

Convolutional Neural Netwrk - CNN

Accuracy of models are mainly determined by the following factors;

  • Initial kernel/weight assignment: CMSR uses zero sum randomization. Some randomizations can lead to dead or poor accuracy networks. Dead networks are dead when their accuracy is low and and cannot improve accuracy. When happened, start training with refresh option or recreate a new model.
  • Network configuration: Good network configuration that fits for training data is vital. Start with good guess and improve upon it. It's quite common to see many networks go dead. Bad configuration often leads to dead networks.
  • Training data: Good training data is essential for good training results.
  • Training techniques: Good training technique is essential for good results.

The first step of CNN modeling is configuring network and creation of a network based on configuration. Next step is training of the created network. Network training is an iterative process that involves verification using test datasets. You can train network using various compatible datasets and verify.

Convolutional Neural Network Layers

Convolutional neural network consists of the following layers;

  • Convolution layers: Convolution layers perform convolution operation. They are special filters that can be trained automatically with training data. You can consider them as special camera filters. For more, please read Convolution Layer section.
  • Pooling layers: Like convolution layers, pooling layers are also special filters. But they are not trainable. They are of three types: average (AVG), maximum (MAX) and global average (GAP) pooling. For more, please read Pooling Layer section.
  • Residual layers:Residual layer combines/adds two pairs of values. One from the previous layer and the other from the (k+1)-th prevous layer. In other words, k is the number of layers to skip. Residual layers can make training easier.
  • Merge layer: Only single merge layer exists in a neural network. It flattens final values of streams and feeds to the first fully connected layer.
  • Fully connected layers: Fully connected layers are Artificial Neural Network (ANN) layers. They are the ones that perform real network functions. For more, please read Neural Network Predictive Modeling.

Network Creation and Configuration

The first step of CNN is to configure network and create a network based on configuration. Once created configuration cannot be changed. Only training will be possible. Configuration will differ based on layer types such as convolution, pooling, merge, fully connected and output layers.

Network Training

Network training is an iterative process. You can train a network model using various datasets and verify. After training, you can save models. And later reopen it and start training again, using the same or different compatible datasets. Compatible datasets are having same width, height, color system and classification labels. All training dataset classification label names must exist in model's configuration datasets, thus, models classification label names.

Applying Model to Datasets

Models can be applied to a dataset to classify images in the dataset. This classification can be viewed from image dataset browser. Select image size of 128 or bigger, then you will be able to see original label as well as predicted label. In addition, place the cursor to an image, then hover will appear showing classification information.

To apply a model to a dataset, open both of them. Make the model as the front. Then press the "Apply model to Dataset" button. Select the dataset and model version. Once applied, resulting classification can be viewed from image dataset browser. Note that maximum one classification is applicable to each dataset. When a model is applied again, previous classification will be replaced new classification.

Exporting Network for Embedded Applications

Once fully trained, you can generate program source codes in Java and C/C++. It's fully independent program codes that do not require additional libraries. It only uses standard math libraries which is always included in target systems. This feature is designed for embedded application developers. Just copy it and paste into your applications.

Note that Java codes may include a bunch of class definitions. This is because Java limits the number of literal values for each class. So CMSR generates a bunch of classes. From your main program, just call "modelname.execute(, , , )".


Skin Cancer Detection by Neural Network

A large number of people suffer from skin cancer. Advanced powerful neural network models can detect skin cancer. For more, visit CMSR Powered Check Skin Cancer website.


Convolutional Neural Network for Object Detection

Object Detection Convolutional Neural Network is a special deep learning modeling designed to detect objects in images. The following figure shows Object Detection CNN in action. Red boxes are ground objects. Green boxes are detected objects by a CMSR OD-CNN model. Numbers in green shows object class indices. The model shows high accuracy!

Object Detection Convolutional Neural Netwrk - OD-CNN


Wildfire/Bushfire Detection by Machine Learning

Wildfires (aka bushfires in Australia) can occur naturally by lightening strikes. Sometimes caused by arson or human activities. Wildfires can destroy homes, kill people and natural habitats, destroy infrastructure, and pollute air.

The following images show fire detection examples. This ML Deep Learning model was developed using OD-CNN (Object Detection - Convolutional Neural Network) of CMSR Machine Learning Studio. Green boxes are detected fire areas. As seen in these examples, CMSR OD-CNN can accurately detect fires.

wildfire/bushfire detection.


For more information, please read Wildfire detection by Machine Learning.