Google Imagen: text-to-image model

29 May 2022

Google Imagen: text-to-image model

Google has introduced Imagen, a model that transforms a text description into an image with a resolution of 1024 x 1024 pixels. Imagen surpassed OpenAI DALL-E 2 in terms of…

Lang: analysis of customer dialogues with the support service

28 May 2022

Lang: analysis of customer dialogues with the support service

Startup Lang has developed a system that integrates with customer support and CRM to automatically recognize the topic of conversation and identify trends in the reasons for customer requests. The…

LAION-5B: the largest dataset of image-text pairs

28 May 2022

LAION-5B: the largest dataset of image-text pairs

LAION-5B — dataset of image-text pairs collected on the Internet. LAION-5B contains more than 5 billion pairs, which makes it the largest among similar datasets. AION-5B was assembled by parsing…

Deepmind has introduced a universal Gato model

28 May 2022

Deepmind has introduced a universal Gato model

DeepMind has introduced a cross-modal universal model with 1.2 billion Gato parameters. Gato can perform more than 600 tasks, such as playing video games, creating subtitles for images and controlling…

Mastercard has launched payments via biometry

28 May 2022

Mastercard has launched payments via biometry

Mastercard has started testing a program for retail stores offering payment for purchases using facial recognition or fingerprint scanning. The company plans to deploy a new payment scheme for the…

The model was trained to perform a cross-modal search for actions

9 May 2022

The model was trained to perform a cross-modal search for actions

MIT has developed a model of cross-modal search for actions in text, audio and video content. The model allows you to determine where a certain action takes place in the…

Flamingo: DeepMind multimodal model

9 May 2022

Flamingo: DeepMind multimodal model

Flamingo is a multimodal DeepMind model that generates a text description of photos, videos and sounds. The model surpasses the previous state-of-the-art models in 16 tasks, and its feature is…

GraphWorld: benchmark for graph neural networks

9 May 2022

GraphWorld: benchmark for graph neural networks

Google AI has introduced a benchmark for graph neural networks GraphWorld. The benchmark uses several million synthetic datasets reproducing a wide class of graphs and generates a generalized estimate of…

Israeli startup simplifies hiring using natural language processing

9 May 2022

Israeli startup simplifies hiring using natural language processing

Myinterview is an Israeli startup developing machine learning tools to speed up and simplify hiring processes for companies. The My interview platform transcribes candidates’ video interviews, evaluates their skills and…

Google Cloud Manufacturing: advanced analytics in manufacturing

9 May 2022

Google Cloud Manufacturing: advanced analytics in manufacturing

Google and Ford have developed a Google Cloud Manufacturing tool aimed at combining and unifying disparate data in manufacturing. The tool provides an opportunity to analyze production processes and train…

MIT’s drone algorithm Predicts object Trajectories

29 April 2022

MIT’s drone algorithm Predicts object Trajectories

MIT researchers have developed an algorithm to improve the safety of self-driving cars. The model predicts the trajectories of road users moving near the drone in real time. Modern methods…

MASSIVE: Amazon dataset for multilingual model training

29 April 2022

MASSIVE: Amazon dataset for multilingual model training

Amazon has introduced the MASSIVE open-source dataset with translations of texts into 51 languages. The dataset is aimed at creating natural language processing models that can be easily generalized to…

SORDI: dataset of synthetic images of production resource

20 April 2022

SORDI: dataset of synthetic images of production resource

BMW Group presented SORDI, the largest open-source dataset of marked-up photorealistic images of factories and other industries. SORDI contains more than 800,000 images in 80 categories and is aimed at…

The model was trained to detect seismic activity against the urban noise

18 April 2022

The model was trained to detect seismic activity against the urban noise

Researchers at Stanford University have developed an algorithm for removing background noise from data coming from seismic activity sensors. The model allows you to register four times more earthquake signals.…

The model predicts the risk of cardiac arrest for ten years ahead

14 April 2022

The model predicts the risk of cardiac arrest for ten years ahead

Johns Hopkins University has developed a model that predicts the risk of cardiac arrest based on MRI images. The researchers claim that the analysis of the structure of scar tissue…

DALL-E 2: text-to-image OpenAI model

13 April 2022

DALL-E 2: text-to-image OpenAI model

OpenAI has introduced a new version of the DALL-E text-to-image conversion model. Compared to the first version, DALL-E 2 generates images in higher quality with less delay, and also allows…

PaLM: Google’s language model with 540 billion parameters

8 April 2022

PaLM: Google’s language model with 540 billion parameters

Google has introduced a PaLM – language model with 540 billion parameters. PaLM has surpassed existing language models in most benchmarks. The model is trained using 6144 Google TPU tensor…

Synthetic image generator for training classification models

4 April 2022

Synthetic image generator for training classification models

MIT researchers have developed a method in which a controlled model of synthetic image generation is integrated into a classification model. The method allows you to reduce the cost of…

Jigsaw: Microsoft tool for text-to-code models

1 April 2022

Jigsaw: Microsoft tool for text-to-code models

Microsoft has introduced Jigsaw, a tool for laying out the output of text–to-code models by providing examples of output data. When working with Python Pandas, the tool made it possible…

Instant NeRF: ultrafast 3D-scene reconstruction

28 March 2022

Instant NeRF: ultrafast 3D-scene reconstruction

Nvidia has introduced Instant NeRF, an algorithm for ultrafast reconstruction of three-dimensional scenes from multiple images. Instant NeRF is aimed at use in autonomous driving systems and when creating metaverses.…

The surgical robot determines the place of needle insertion

24 March 2022

The surgical robot determines the place of needle insertion

AI-Guide is a hand-held surgical robot developed at MIT that allows automating the process of inserting a needle or catheter into a blood vessel. The device is aimed at providing…