RxR: Google Released New Dataset and Challenge On Robot Navigation Using Language

22 January 2021

RxR: Google Released New Dataset and Challenge On Robot Navigation Using Language

A group of researchers from Google has announced the release of a new dataset and benchmark for navigation instruction called Room-across-Room (RxR). Robotic navigation is one of the major challenges…

New AI System Can Predict If a COVID Patient Will Need Intensive Care

21 January 2021

New AI System Can Predict If a COVID Patient Will Need Intensive Care

Researchers from Facebook AI have open-sourced a method and a system that can predict if a Covid patient will need intensive care, using chest x-ray images and self-supervised learning. In…

PaddleSeg: A New Toolkit for Efficient Image Segmentation

20 January 2021

PaddleSeg: A New Toolkit for Efficient Image Segmentation

Researchers from Baidu Inc. have announced the release of a novel image segmentation toolkit called PaddleSeg. The development kit was built in order to help researchers and engineers working in…

Switch Transformer: Google’s New Language Model Features Trillion Parameters

19 January 2021

Switch Transformer: Google’s New Language Model Features Trillion Parameters

There’s been a lot of hype around OpenAI’s powerful GPT-3 model, which proved to be able to spin novel super-realistic human-like articles but also tackle many different NLP tasks using…

Researchers Re-labeled ImageNet Introducing Multi-labels and Localized Annotations

17 January 2021

Researchers Re-labeled ImageNet Introducing Multi-labels and Localized Annotations

A large portion of the research experiments in Computer Vision in the past decade were using the ImageNet dataset. Over the years it has become the default and standard benchmark…

Pr-VIPE: New Method Successfully Recognizes 3D Poses in 2D Images

15 January 2021

Pr-VIPE: New Method Successfully Recognizes 3D Poses in 2D Images

Researchers from Google AI have presented a new method for recognizing poses and pose similarity in images and videos. Images and videos contain 2D information about some portion of the…

Novel Neural Network Generates Segmentation Masks from Bounding Boxes in Videos

13 January 2021

Novel Neural Network Generates Segmentation Masks from Bounding Boxes in Videos

A group of researchers from ETH University has proposed a novel method for object semantic segmentation that exploits video data and spatio-temporal consistencies. Arguing that current approaches for object segmentation…

Researchers Design a Plain Simple Network that Achieves Over 80% Accuracy on ImageNet

13 January 2021

Researchers Design a Plain Simple Network that Achieves Over 80% Accuracy on ImageNet

In a joint project, researchers from several universities in China and UK have proposed a new powerful neural network which has a VGG-like plain and simple architecture using only a…

DALL-E: OpenAI’s New 12 Billion Parameter Model

7 January 2021

DALL-E: OpenAI’s New 12 Billion Parameter Model

Researchers from OpenAI have presented their new model that can generate realistic images given text descriptions. The model, called DALL-E is based on OpenAI’s powerful GPT-3 model and can generate…

Lambda Networks: New State-of-the-art Architecture for Image Recognition

2 January 2021

Lambda Networks: New State-of-the-art Architecture for Image Recognition

LambdaNetworks – researchers propose a new type of deep neural networks that are computationally efficient while maintaining on-par performance with existing classification models. Based on an interesting idea, Lambda Networks…

Soft-IntroVAE: Improving Training Stability and Image Generation Quality

31 December 2020

Soft-IntroVAE: Improving Training Stability and Image Generation Quality

Recently, a novel and powerful deep variational autoencoder was proposed, called IntroVAE. The model was able to learn how to generate highly-realistic images. The key feature of this model was…

Data-efficient Image Transformers: Transformers Arrive in Computer Vision

24 December 2020

Data-efficient Image Transformers: Transformers Arrive in Computer Vision

We have written before about Google’s new Vision Transformer model which successfully applied the powerful Transformer architecture to a computer vision problem. Originally designed for natural language processing tasks, transformers…

Generating New Person Identities With A GAN Network

24 December 2020

Generating New Person Identities With A GAN Network

Generative Adversarial Networks are known to be capable of generating highly-realistic synthetic images due to their representation learning capabilities. However, many of the GAN models fail to disentangle identity and…

Facebook AI Open-sourced Its State-of-the-art Voice Separation Model

24 December 2020

Facebook AI Open-sourced Its State-of-the-art Voice Separation Model

Researchers from Facebook AI Research, have open-sourced the implementation of the state-of-the-art voice model that can separate up to five different voices in a simultaneous conversation. In July this year,…

CML – Continuous Integration (CI) and Development for Machine Learning

20 December 2020

CML – Continuous Integration (CI) and Development for Machine Learning

CML or Continuous Machine Learning is a novel open-source library for continuous integration and continuous delivery (CI/CD) specifically tailored for machine learning projects. The new CI library was developed by…

Removing the NMS: Researchers Propose New End-to-end Object Detector

13 December 2020

Removing the NMS: Researchers Propose New End-to-end Object Detector

Object detectors based on deep convolutional neural networks have excelled in their job in the past few years. Most of these models are actually fully-convolutional neural networks that use one…

ReBeL: Facebook’s AI System That Can Play Chess, Poker and Go

8 December 2020

ReBeL: Facebook’s AI System That Can Play Chess, Poker and Go

Researchers from Facebook AI have developed a general AI algorithm that can play multiple different games including Chess, Poker, and Go. Beating humans in Chess and other similar games such…

Rel3D – A Large-scale Benchmark for Spatial Relations in 3D

8 December 2020

Rel3D – A Large-scale Benchmark for Spatial Relations in 3D

Rel3D – A novel large-scale benchmark for spatial relations in 3D was recently released by researchers from the University of Michigan and Princeton University. The new benchmark tries to overcome…

Discovering Visual Effects by Navigating GAN’s Parameter Space

6 December 2020

Discovering Visual Effects by Navigating GAN’s Parameter Space

Researchers from Yandex research have proposed a new method for semantic image editing in which a GAN output can be controlled by navigating its parameter space. The name of the…

DeepMind’s AI Solves a Grand Challenge – The Protein Folding Problem

2 December 2020

DeepMind’s AI Solves a Grand Challenge – The Protein Folding Problem

A major scientific breakthrough was announced by DeepMind – their advanced AlphaFold AI System solved the 50-year old challenge in biology known as the “protein folding problem”. In a recent…

Neural Network Replaces the Green Screen For Human Matting

29 November 2020

Neural Network Replaces the Green Screen For Human Matting

The so-called “green screens” are the most common technique for adding custom backgrounds to a foreground object picture, and they have been widely adopted in areas such as film production.…