ResNet (34, 50, 101): Residual CNNs for Image Classification Tasks

23 January 2019
resnet

ResNet (34, 50, 101): Residual CNNs for Image Classification Tasks

ResNet is a short name for a residual network, but what’s residual learning? Deep convolutional neural networks have achieved the human level image classification result. Deep networks extract low, middle…

R-CNN – Neural Network for Object Detection and Semantic Segmentation

29 November 2018
r-cnn object detection

R-CNN – Neural Network for Object Detection and Semantic Segmentation

Computer vision is an interdisciplinary field that has been gaining huge amounts of traction in recent years (since CNN), and self-driving cars have taken center stage. One of the most…

Pix2Pix – Image-to-Image Translation Neural Network

27 November 2018
pix2pix network

Pix2Pix – Image-to-Image Translation Neural Network

Pix2pix architecture was presented in 2016 by researchers from Berkeley in their work “Image-to-Image Translation with Conditional Adversarial Networks.” Most of the problems in image processing and computer vision can…

U-Net: Image Segmentation Network

23 November 2018
u-net

U-Net: Image Segmentation Network

U-Net is considered one of the standard CNN architectures for image classification tasks, when we need not only to define the whole image by its class but also to segment areas of…

VGG16 – Convolutional Network for Classification and Detection

20 November 2018
vgg16

VGG16 – Convolutional Network for Classification and Detection

VGG16 is a convolutional neural network model proposed by K. Simonyan and A. Zisserman from the University of Oxford in the paper “Very Deep Convolutional Networks for Large-Scale Image Recognition”.…

AlexNet – ImageNet Classification with Deep Convolutional Neural Networks

29 October 2018

AlexNet – ImageNet Classification with Deep Convolutional Neural Networks

AlexNet is the name of a convolutional neural network which has had a large impact on the field of machine learning, specifically in the application of deep learning to machine vision. It famously won the…

Head Reconstruction from Internet Photos

15 October 2018
head reconstruction internet photos

Head Reconstruction from Internet Photos

Methods that reconstruct 3D models of people’s heads from images need to account for varying 3D pose, lighting, non-rigid changes due to expressions, relatively smooth surfaces of faces, ears, and…

This Neural Network Evaluates Natural Scene Memorability

1 October 2018
natural scene memorability score by neural network

This Neural Network Evaluates Natural Scene Memorability

One hallmark of human cognition is the splendid capacity of recalling thousands of different images, some in details, after only a single view. Not all photos are remembered equally in…

Temporal Relational Reasoning in Videos

25 September 2018
temporal relation network

Temporal Relational Reasoning in Videos

The ability to reason about the relations between entities over time is crucial for intelligent decision-making. Temporal relational reasoning allows intelligent species to analyze the current situation relative to the…

Method for Automatic Forensic Facial Reconstruction

31 August 2018
facial reconstruction

Method for Automatic Forensic Facial Reconstruction

Facial reconstruction is mainly used in two principal branches of science: forensic science and anthropology. Remains of a human skull act as input to reconstruct the most likely corresponding facial…

DeepWrinkles: Accurate and Realistic Clothing Modeling

28 August 2018

DeepWrinkles: Accurate and Realistic Clothing Modeling

Realistic garment reconstruction is notoriously a complex problem and its importance is undeniable in many research work and applications, such as accurate body shape and pose estimation in the wild…

A Style-Aware Content Loss for Real-time HD Style Transfer

14 August 2018

A Style-Aware Content Loss for Real-time HD Style Transfer

A picture may be worth a thousand words, but at least it contains a lot of very diverse information. This not only comprises what is portrayed, e.g., a composition of…