The neural network generates images with clothes try on

14 March 2021

The neural network generates images with clothes try on

PF-AFN is a neural network that generates images of people trying on different kinds of clothes. The model accepts an image of a person and an image of a garment…

SEER: a self-supervised neural network with a billion parameters from FAIR

9 March 2021

SEER: a self-supervised neural network with a billion parameters from FAIR

SEER is FAIR’s self-supervised billion-parameter neural network for computer vision applications. The model pre-trained on the Instagram pictures can be further trained on your tasks. The developers have published the…

Robot trained to manipulate tissue at University of California

5 March 2021

Robot trained to manipulate tissue at University of California

Researchers from the University of California and the Honda Research Institute trained a robot to fold fabric. The algorithm is based on a framework for teaching visual dynamics of objects…

MLS: FAIR’s Multilingual Speech Recognition Dataset

4 March 2021

MLS: FAIR’s Multilingual Speech Recognition Dataset

Facebook AI published a multilingual dataset used to train speech recognition models. Multilingual LibriSpeech (MLS) contains 50 thousand hours of audio with people speaking in 8 languages: English, German, Spanish,…

Google AI neural network simulates camera movement

3 March 2021

Google AI neural network simulates camera movement

Google AI neural network simulates camera movement and parallax for photos. The Cinematic photos system is used in the Google Photos app. Image Depth Estimation Along with the latest photography…

GraphGallery: a library for graph neural networks on PyTorch and TensorFlow

2 March 2021

GraphGallery: a library for graph neural networks on PyTorch and TensorFlow

GraphGallery is a library for training and testing graph neural networks. GraphGallery implements adversarial attacks on graph neural networks. The library is compatible with PyTorch, TensorFlow 2.x, Pytorch Geometric (PyG),…

CSTR neural network recognizes text in scene images

1 March 2021

CSTR neural network recognizes text in scene images

CSTR is a convolutional neural network that recognizes text in scene images. The previous work considers the problem of text recognition on a scene image as a segmentation and seq2seq…

TransGAN: two Transformer models as one GAN

26 February 2021

TransGAN: two Transformer models as one GAN

TransGAN is a GAN model in which the generator and discriminator are composed of two Transformer architectures. GAN architecture traditionally uses convolutions. In TransGAN, convolutions are replaced with Transformer. The…

Google presented the framework for ML-model architecture automatic search

23 February 2021

Google presented the framework for ML-model architecture automatic search

Model search (MS) is a library that uses ML model architecture automatic search algorithms. The developers claim that the framework scales in cases when the state search space appears large.…

Twitter Opens Tweet Archive for Scientific Researchers

20 February 2021

Twitter Opens Tweet Archive for Scientific Researchers

Twitter has opened an archive of tweets for scientific researchers. This way the IT-company supports research on online discourse and trends on the platform. More data and access to them…

DAF:re – new public dataset for recognizing anime characters

20 February 2021

DAF:re – new public dataset for recognizing anime characters

DAF:re is a public dataset for recognizing anime characters. The dataset consists of 500 thousand images with 3000 object classes. Data across classes is not evenly distributed. Besides, the researchers…

Robot manage objects from video tutorials using RL

19 February 2021

Robot manage objects from video tutorials using RL

In FAIR, the RL-agent was trained to manage objects using video tutorials. Standard RL algorithms are trained to a problem iteratively through learning from errors. The proposed algorithm learns a…

SAM: the neural network changes the age on the image of a person’s face

17 February 2021

SAM: the neural network changes the age on the image of a person’s face

SAM is a neural network model that changes the age of a person in an image. The model takes as input an image of a person’s face and target age.…

MeInGame neural network generates game character from a face image

15 February 2021

MeInGame neural network generates game character from a face image

MeInGame is a neural network model that generates a character in the game from one face image. The neural network predicts the shape of the face and its texture. The…

JigsawGAN: Generative neural network model solves jigsaw puzzles

11 February 2021

JigsawGAN: Generative neural network model solves jigsaw puzzles

JigsawGAN is a self-supervised generative neural network model that has been trained on a puzzle-solving task. The model accepts chaotically located parts of the image as input and outputs the…

TracIn: a way to evaluate the impact of specific data on model predictions

10 February 2021

TracIn: a way to evaluate the impact of specific data on model predictions

TracIn is a scalable method for assessing the impact of individual features in data on predictions. The idea behind TracIn is to track the learning process of the model to…

TAPAS neural network looks for answers in tabular data

30 January 2021

TAPAS neural network looks for answers in tabular data

TAPAS is a neural network model for finding answers to questions in tabular data. The neural network is an extension of the BERT bi-directional Transformer model with special embeddings looking…

FaceX-Zoo: PyTorch library for face recognition in images

29 January 2021

FaceX-Zoo: PyTorch library for face recognition in images

FaceX-Zoo is an open-source library on PyTorch for recognizing faces in images. The library provides a module for training models with different configurations of error functionality and basic architecture. In…

Pile: 825-gigabyte open-source dataset for language models training

28 January 2021

Pile: 825-gigabyte open-source dataset for language models training

Pile is an 825 gigabyte dataset for teaching language models. The dataset consists of 22 smaller datasets, which are combined into one. In addition to the dataset, the creators published…

Pixellib: library for object segmentation in photos and videos

28 January 2021

Pixellib: library for object segmentation in photos and videos

Pixellib is a library for the task of segmenting objects in images and videos. The library supports two main types of object segmentation: semantic and instance segmentation. The complexity of…