New CNN Model For Lane Detection Using Self Attention

18 August 2019

New CNN Model For Lane Detection Using Self Attention

A group of researchers, led by Yuenan Hou from the Chinese University of Hong Kong has released a new state-of-the-art method for lane markings detection. The problem of lane markings…

Gated-SCNN: New State-of-the-art Method for Semantic Segmentation

31 July 2019

Gated-SCNN: New State-of-the-art Method for Semantic Segmentation

A group of researchers from NVIDIA, the University of Waterloo, the University of Toronto and the Vector Institute have published a new state-of-the-art method for semantic segmentation. The novel method…

Open Source 3D Human Pose Estimation Approach with Self-Supervised Learning

14 March 2019
3d human pose estimation

Open Source 3D Human Pose Estimation Approach with Self-Supervised Learning

Human pose estimation is a fundamental problem in Computer Vision. Deriving a 3D Human pose out of single RGB image is needed in many real-world application scenarios, especially within the…

The StyleGAN Code Released: Neural Network for Faces Generation by NVIDIA

18 February 2019
stylegan

The StyleGAN Code Released: Neural Network for Faces Generation by NVIDIA

NVIDIA released the StyleGAN code, the GAN for faces generation that has never existed which is the state-of-the-art method in terms of interpolation capabilities and disentanglement power. On the 18th of December…

Who Leads the Self-Driving Cars Race? State-of-Affairs in Autonomous Driving

30 January 2019
self-driving cars

Who Leads the Self-Driving Cars Race? State-of-Affairs in Autonomous Driving

Within just a couple of years, self-driving cars have gone from science fiction to “now commercially available” road-bound reality. This year’s CES (the largest consumer electronics show) was “flooded” with…

EXAM – State-of-The-Art Method for Text Classification

24 December 2018
text classification EXAM

EXAM – State-of-The-Art Method for Text Classification

One of the widely used Natural Language Processing & Supervised Machine Learning (ML) task in different problems and used cases is the so-called Text Classification. It is an example of…

Dissecting GANs for Better Understanding and Visualization

5 December 2018
dissecting gan paper

Dissecting GANs for Better Understanding and Visualization

GANs can be taught to create (or generate) worlds similar to our own in any domain: images, music, speech, etc. Since 2014, a large number of improvements of GANs have…

PIFR: Pose Invariant 3D Face Reconstruction

26 November 2018
pifr reconstruction

PIFR: Pose Invariant 3D Face Reconstruction

3D face geometry needs to be recovered from 2D images in many real-world applications, including face recognition, face landmark detection, 3D emoticon animation etc. However, this task remains challenging especially…

BrainNet – Brain-to-Brain Interface for Direct Collaboration Between Brains

23 October 2018
brain-to-brain interface

BrainNet – Brain-to-Brain Interface for Direct Collaboration Between Brains

In the past few years, the brain to direct computer communication started to gain more and more attention. A few breakthroughs have traced the path for researchers towards building a…

Fooling Facial Recognition: Fast Method for Generating Adversarial Faces

2 October 2018
Fooling Facial Recognition Fast Method for Generating Adversarial Faces

Fooling Facial Recognition: Fast Method for Generating Adversarial Faces

With the rapid progress and state-of-the-art performance in a wide range of tasks, deep learning based methods are in use in a large number of security-sensitive and critical applications. However,…

This Neural Network Evaluates Natural Scene Memorability

1 October 2018
natural scene memorability score by neural network

This Neural Network Evaluates Natural Scene Memorability

One hallmark of human cognition is the splendid capacity of recalling thousands of different images, some in details, after only a single view. Not all photos are remembered equally in…

Temporal Relational Reasoning in Videos

25 September 2018
temporal relation network

Temporal Relational Reasoning in Videos

The ability to reason about the relations between entities over time is crucial for intelligent decision-making. Temporal relational reasoning allows intelligent species to analyze the current situation relative to the…

Identity Verification with Deep Learning: ID-Selfie Matching Method

24 September 2018
ID selfie verification

Identity Verification with Deep Learning: ID-Selfie Matching Method

A large number of daily activities in our lives require identity verification. Identity verification provides a security mechanism starting from access control to systems all the way to at border crossing…

Deep Clustering Approach for Image Classification Task

20 September 2018
deepcluster facebook

Deep Clustering Approach for Image Classification Task

Clustering of images seems to be a well-researched topic. But in fact, little work has been done to adapt it to the end-to-end training of visual features on large-scale datasets.…

Realistic Exemplar-Based Image Colorization

18 September 2018
colorization method

Realistic Exemplar-Based Image Colorization

Image colorization is a widespread problem within computer vision. The ultimate objective of image colorization is to map a gray-scale image to a visually plausible and perceptually meaningful color image.…

AlphaGAN: Natural Image Matting

11 September 2018
AlphaGAN

AlphaGAN: Natural Image Matting

Many image-editing and film post-production applications rely on natural image matting as one of the processing steps. The task of the matting algorithm is to estimate the opacity of a…

Learning 3D Face Morphable Model Out of 2D Images

5 September 2018
3D morphable model out of single image

Learning 3D Face Morphable Model Out of 2D Images

The 3D Morphable Model (3DMM) is a statistical model of 3D facial shape and texture. 3D Morphable Models have various applications in many fields including computer vision, computer graphics, human…

Vid2Vid – Conditional GANs for Video-to-Video Synthesis

3 September 2018
vid2vid-video-to-video-synthesis-e1535641547242

Vid2Vid – Conditional GANs for Video-to-Video Synthesis

Researchers from NVIDIA and MIT’s Computer Science and Artificial Intelligence Lab have proposed a novel method for video-to-video synthesis, showing impressive results. The proposed method – Vid2Vid – can synthesize…

Method for Automatic Forensic Facial Reconstruction

31 August 2018
facial reconstruction

Method for Automatic Forensic Facial Reconstruction

Facial reconstruction is mainly used in two principal branches of science: forensic science and anthropology. Remains of a human skull act as input to reconstruct the most likely corresponding facial…

Everybody Dance Now: a New Approach to “Do As I Do” Motion Transfer

30 August 2018
everybody dance now

Everybody Dance Now: a New Approach to “Do As I Do” Motion Transfer

Not very good at dancing? Not a problem anymore! Now you can easily impress your friends with a stunning video, where you dance like a superstar. Researchers from UC Berkeley…

DeepWrinkles: Accurate and Realistic Clothing Modeling

28 August 2018

DeepWrinkles: Accurate and Realistic Clothing Modeling

Realistic garment reconstruction is notoriously a complex problem and its importance is undeniable in many research work and applications, such as accurate body shape and pose estimation in the wild…