ResNet (34, 50, 101): Residual CNNs for Image Classification Tasks

23 January 2019
resnet

ResNet (34, 50, 101): Residual CNNs for Image Classification Tasks

ResNet is a short name for a residual network, but what’s residual learning? Deep convolutional neural networks have achieved the human level image classification result. Deep networks extract low, middle…

R-CNN – Neural Network for Object Detection and Semantic Segmentation

29 November 2018
r-cnn object detection

R-CNN – Neural Network for Object Detection and Semantic Segmentation

Computer vision is an interdisciplinary field that has been gaining huge amounts of traction in recent years (since CNN), and self-driving cars have taken center stage. One of the most…

U-Net: Image Segmentation Network

23 November 2018
u-net

U-Net: Image Segmentation Network

U-Net is considered one of the standard CNN architectures for image classification tasks, when we need not only to define the whole image by its class but also to segment areas of…

VGG16 – Convolutional Network for Classification and Detection

20 November 2018
vgg16

VGG16 – Convolutional Network for Classification and Detection

VGG16 is a convolutional neural network model proposed by K. Simonyan and A. Zisserman from the University of Oxford in the paper “Very Deep Convolutional Networks for Large-Scale Image Recognition”.…

AlexNet – ImageNet Classification with Deep Convolutional Neural Networks

29 October 2018

AlexNet – ImageNet Classification with Deep Convolutional Neural Networks

AlexNet is the name of a convolutional neural network which has had a large impact on the field of machine learning, specifically in the application of deep learning to machine vision. It famously won the…

Temporal Relational Reasoning in Videos

25 September 2018
temporal relation network

Temporal Relational Reasoning in Videos

The ability to reason about the relations between entities over time is crucial for intelligent decision-making. Temporal relational reasoning allows intelligent species to analyze the current situation relative to the…

Inferring a 3D Human Pose out of a 2D Image with FBI

16 July 2018
3D pose estimation based on 2D joints and Forward-or-Backward Information (FBI) for each bone

Inferring a 3D Human Pose out of a 2D Image with FBI

Autonomous driving, virtual reality, human-computer interaction and video surveillance — these are all application scenarios, where you would like to derive a 3D human pose out of a single RGB…

A New Method for Image De-blurring

21 June 2018
устранения размытия

A New Method for Image De-blurring

When we use a camera, we want the recorded image to be a faithful representation of what we see in front of us. However, very often images contain blur, and…

Human Rights Violations Recognition in Images Using CNN Features

18 May 2018
Human Rights Violations Recognition in Images Using CNN Features

Human Rights Violations Recognition in Images Using CNN Features

Human rights violations have been unfolding during the entire human history, while nowadays they increasingly appear in many different forms around the world. Human rights violations refer to the actions…

Image Inpainting for Irregular Holes Using Partial Convolutions

8 May 2018
Image Inpainting for Irregular Holes Using Partial Convolutions

Image Inpainting for Irregular Holes Using Partial Convolutions

Deep learning is growing very fast and it is one of the fast-growing areas of artificial intelligence. It has been used in many fields extensively including real-time object detection, image…

Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone Image

3 May 2018
SVBRDF Acquisition with a Single Mobile Phone Image

Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone Image

A wide variety of images around us are the outcome of interactions between lighting, shapes and materials. In recent years, the advent of convolutional neural networks (CNN) has led to…

Neural Network Has Learned to Separate Individuals’ Speech on Video

13 April 2018
Neural Network Has Learned to Separate Individuals’ Speech on Video

Neural Network Has Learned to Separate Individuals’ Speech on Video

The fact that our brain in a noisy environment can effectively focus on a particular speaker, “turning off” background sounds — is no secret. This phenomenon even received the popular…

How Has MS Voxel Deep Network Managed to Improve 3D Objects Recognition Using Cloud Map Only

12 April 2018
Objects Recognition Using Cloud Map Only

How Has MS Voxel Deep Network Managed to Improve 3D Objects Recognition Using Cloud Map Only

Mobile Laser Scanning (MLS) systems can now scan large areas, like cities or even countries. The produced 3D point clouds can be used as maps for autonomous systems. To do…