The model was trained to perform a cross-modal search for actions
9 May 2022
The model was trained to perform a cross-modal search for actions
MIT has developed a model of cross-modal search for actions in text, audio and video content. The model allows you to determine where a certain action takes place in the…
Flamingo: DeepMind multimodal model
9 May 2022
Flamingo: DeepMind multimodal model
Flamingo is a multimodal DeepMind model that generates a text description of photos, videos and sounds. The model surpasses the previous state-of-the-art models in 16 tasks, and its feature is…
GraphWorld: benchmark for graph neural networks
9 May 2022
GraphWorld: benchmark for graph neural networks
Google AI has introduced a benchmark for graph neural networks GraphWorld. The benchmark uses several million synthetic datasets reproducing a wide class of graphs and generates a generalized estimate of…
Israeli startup simplifies hiring using natural language processing
9 May 2022
Israeli startup simplifies hiring using natural language processing
Myinterview is an Israeli startup developing machine learning tools to speed up and simplify hiring processes for companies. The My interview platform transcribes candidates’ video interviews, evaluates their skills and…
Google Cloud Manufacturing: advanced analytics in manufacturing
9 May 2022
Google Cloud Manufacturing: advanced analytics in manufacturing
Google and Ford have developed a Google Cloud Manufacturing tool aimed at combining and unifying disparate data in manufacturing. The tool provides an opportunity to analyze production processes and train…
MIT’s drone algorithm Predicts object Trajectories
29 April 2022
MIT’s drone algorithm Predicts object Trajectories
MIT researchers have developed an algorithm to improve the safety of self-driving cars. The model predicts the trajectories of road users moving near the drone in real time. Modern methods…
MASSIVE: Amazon dataset for multilingual model training
29 April 2022
MASSIVE: Amazon dataset for multilingual model training
Amazon has introduced the MASSIVE open-source dataset with translations of texts into 51 languages. The dataset is aimed at creating natural language processing models that can be easily generalized to…
SORDI: dataset of synthetic images of production resource
20 April 2022
SORDI: dataset of synthetic images of production resource
BMW Group presented SORDI, the largest open-source dataset of marked-up photorealistic images of factories and other industries. SORDI contains more than 800,000 images in 80 categories and is aimed at…
The model was trained to detect seismic activity against the urban noise
18 April 2022
The model was trained to detect seismic activity against the urban noise
Researchers at Stanford University have developed an algorithm for removing background noise from data coming from seismic activity sensors. The model allows you to register four times more earthquake signals.…
The model predicts the risk of cardiac arrest for ten years ahead
14 April 2022
The model predicts the risk of cardiac arrest for ten years ahead
Johns Hopkins University has developed a model that predicts the risk of cardiac arrest based on MRI images. The researchers claim that the analysis of the structure of scar tissue…
DALL-E 2: text-to-image OpenAI model
13 April 2022
DALL-E 2: text-to-image OpenAI model
OpenAI has introduced a new version of the DALL-E text-to-image conversion model. Compared to the first version, DALL-E 2 generates images in higher quality with less delay, and also allows…
PaLM: Google’s language model with 540 billion parameters
8 April 2022
PaLM: Google’s language model with 540 billion parameters
Google has introduced a PaLM – language model with 540 billion parameters. PaLM has surpassed existing language models in most benchmarks. The model is trained using 6144 Google TPU tensor…
Synthetic image generator for training classification models
4 April 2022
Synthetic image generator for training classification models
MIT researchers have developed a method in which a controlled model of synthetic image generation is integrated into a classification model. The method allows you to reduce the cost of…
Jigsaw: Microsoft tool for text-to-code models
1 April 2022
Jigsaw: Microsoft tool for text-to-code models
Microsoft has introduced Jigsaw, a tool for laying out the output of text–to-code models by providing examples of output data. When working with Python Pandas, the tool made it possible…
Instant NeRF: ultrafast 3D-scene reconstruction
28 March 2022
Instant NeRF: ultrafast 3D-scene reconstruction
Nvidia has introduced Instant NeRF, an algorithm for ultrafast reconstruction of three-dimensional scenes from multiple images. Instant NeRF is aimed at use in autonomous driving systems and when creating metaverses.…
The surgical robot determines the place of needle insertion
24 March 2022
The surgical robot determines the place of needle insertion
AI-Guide is a hand-held surgical robot developed at MIT that allows automating the process of inserting a needle or catheter into a blood vessel. The device is aimed at providing…
Applications of machine learning in the field of nature conservation
28 February 2022
Applications of machine learning in the field of nature conservation
Machine learning has become one of the three leading technologies in the field of nature protection. The article provides an overview of the tasks solved with the help of machine…
Computer vision system reduces delays in aircraft departures
28 February 2022
Computer vision system reduces delays in aircraft departures
Israeli startup IntellAct has developed a system for monitoring the actions of airport employees to reduce flight delays. Preliminary tests of the system conducted by El Al Airline at Ben Gurion…
Datasets for music generation and analysis
27 February 2022
Datasets for music generation and analysis
The article provides an overview of datasets with musical compositions. Datasets are designed to train models of music generation, recognition and analysis. NSynth The largest dataset consisting of 305,979 musical…
Machine learning was used to help anesthesiologists
18 February 2022
Machine learning was used to help anesthesiologists
MIT scientists have demonstrated a machine learning algorithm for continuous automation of dosing of the anesthetic drug propofol. The algorithm can improve the process of tracking the condition of patients during…
Reinforcement training to control thermonuclear reactions
17 February 2022
Reinforcement training to control thermonuclear reactions
DeepMind has announced the use of reinforcement learning to control the plasma state during a thermonuclear reaction. The DeepMind algorithm made it possible to increase the stability of the process…