News / Neural Networks and Deep Learning

NVIDIA Isaac 5.0: Enhanced Sensor Physics and Expanded Synthetic Data Generation

19 May 2025

19 May 2025

NVIDIA Isaac robotics platform showing a humanoid robot interacting with objects

NVIDIA Isaac 5.0: Enhanced Sensor Physics and Expanded Synthetic Data Generation

19 May 2025

NVIDIA continues to push the boundaries of AI-driven robotics with significant updates to its Isaac ecosystem, announced at COMPUTEX 2025. These innovations address key challenges in robotics development by enhancing…

Claude for Education: Revolutionizing Higher Education with AI-Powered Learning

3 April 2025

3 April 2025

Claude for Education: Revolutionizing Higher Education with AI-Powered Learning

3 April 2025

Anthropic has released Claude for Education, specifically designed for implementation in universities and other higher education institutions. While the classic chatbot provides direct answers to questions, Claude for Education uses…

Adobe Drops Industry-First Commercially Safe AI Video Model

13 February 2025

13 February 2025

Adobe Drops Industry-First Commercially Safe AI Video Model

13 February 2025

Adobe has launched its Firefly video model in public, marking the first generative AI tool built specifically for commercial use, addressing enterprise concerns about IP rights and legal safety in…

Nvidia Drops Monster AI Update at CES 2025: Local Foundation Models Meet New RTX 50 Series

7 January 2025

7 January 2025

Nvidia Drops Monster AI Update at CES 2025: Local Foundation Models Meet New RTX 50 Series

7 January 2025

Nvidia just announced a major shift in consumer AI computing at CES 2025, combining new GPUs with a platform for running foundation models locally. The announcement includes next-gen RTX 50…

SynthID: DeepMind’s Open Source Approach for Generated Text Watermarking

31 October 2024

31 October 2024

synthID deepmind text generator watermark

SynthID: DeepMind’s Open Source Approach for Generated Text Watermarking

31 October 2024

DeepMind has released SynthID Text, expanding their established AI content authentication ecosystem to include text watermarking. This release, now available in Hugging Face Transformers v4.46.0+, follows DeepMind’s deployment of SynthID…

Google PH-LLM: A Language Model for Health Monitoring

16 June 2024

16 June 2024

Google PH-LLM: A Language Model for Health Monitoring

16 June 2024

Google developed the PH-LLM language model to analyze medical data collected from wearable devices such as smartwatches and heart rate monitors. During experiments, the model answered health-related questions and predicted…

ElevenLabs’ AI Sound Generation Transforms Audio Production

1 June 2024

1 June 2024

ElevenLabs’ AI Sound Generation Transforms Audio Production

1 June 2024

In the ever-evolving landscape of digital content creation, ElevenLabs is making waves with its AI-driven sound generation tool. Designed to streamline the audio production process, this new technology allows users…

Google RecurrentGemma: Next-Gen Local Language Model

14 April 2024

14 April 2024

Google RecurrentGemma: Next-Gen Local Language Model

14 April 2024

Google has introduced the RecurrentGemma language model, designed to operate locally on devices with limited resources such as smartphones, personal computers, and smart speakers. The new architecture from Google significantly…

Voice Engine: OpenAI’s Voice Synthesis Model

1 April 2024

1 April 2024

Voice Engine: OpenAI’s Voice Synthesis Model

1 April 2024

OpenAI has unveiled Voice Engine, a model capable of voice cloning from a 15-second audio recording. Among the users of the model, the company mentions podcasters, announcers, audiobook authors, advertisers,…

Startup Insilico Medicine Introduces First Drug Developed with Generative Models

10 March 2024

10 March 2024

искусственный интеллект разрабатывает лекарство

Startup Insilico Medicine Introduces First Drug Developed with Generative Models

10 March 2024

Startup Insilico Medicine has unveiled the first drug developed using generative models. This innovative approach to creation enabled the drug to pass its initial clinical trial phases in just two…

Microsoft ViSNet: Predicting Molecule Activity

3 March 2024

3 March 2024

Microsoft ViSNet: Predicting Molecule Activity

3 March 2024

Microsoft has unveiled ViSNet – a graph neural network modeling the geometry of complex molecules to predict their activity. ViSNet has the potential to significantly expedite the search for and…

Sora: OpenAI’s Groundbreaking Text-to-Image Diffusion Model

18 February 2024

18 February 2024

Sora: OpenAI’s Groundbreaking Text-to-Image Diffusion Model

18 February 2024

OpenAI has unveiled Sora, a diffusion-based text-to-image model capable of generating 60-second videos. Compared to competitors like Runway, Pika, Stability AI, and Google, OpenAI’s model boasts high-resolution (Full HD) output,…

Google MobileDiffusion: Generating Images on Mobile Devices

4 February 2024

4 February 2024

Google MobileDiffusion: Generating Images on Mobile Devices

4 February 2024

Google has introduced MobileDiffusion, a real-time text-to-image generation model that operates entirely on mobile devices. On Android and iOS devices with the latest generation processors, image generation at a resolution…

Pika 1.0: A Web Platform for Video Generation

7 January 2024

7 January 2024

Pika 1.0: A Web Platform for Video Generation

7 January 2024

Pika Labs startup has launched Pika 1.0 – a free web platform for generating and editing videos using text-based queries. The service creates both realistic videos and 3D animation in…

Google MusicFX: Transform Text Into Unique Soundscapes with AI

17 December 2023

17 December 2023

Google MusicFX: Transform Text Into Unique Soundscapes with AI

17 December 2023

Google has launched MusicFX, an online service that generates music based on text queries. Furthermore, the product utilizes Google’s MusicLM model, and each audio file contains an inaudible watermark created…

Microsoft LeMa: Boosting Language Model Accuracy in Math

4 November 2023

4 November 2023

Microsoft LeMa: Boosting Language Model Accuracy in Math

4 November 2023

Microsoft researchers have introduced LeMa (Learning from Mistakes), an open-source algorithm designed to enhance the ability of large language models to solve mathematical problems. LeMa encourages models to learn from…

NVIDIA Eureka: Agent for Autonomous Robot Learning

22 October 2023

22 October 2023

NVIDIA Eureka: Agent for Autonomous Robot Learning

22 October 2023

NVIDIA has unveiled Eureka, an open-source agent based on GPT-4, designed to teach robots complex skills such as performing tricks and handling scissors. This breakthrough leverages the power of large…

Google Introduces Image Generation in Search

15 October 2023

15 October 2023

Google Introduces Image Generation in Search

15 October 2023

Google has announced the integration of image generation in search results based on descriptions and several other AI features. Furthermore, this tool is built on the Imagen model and allows…

Microsoft AutoGen: A Framework for Configuring LLM Agents

8 October 2023

8 October 2023

Microsoft AutoGen: A Framework for Configuring LLM Agents

8 October 2023

Microsoft has unveiled AutoGen, an open-source library designed for creating and configuring LLM agents. Moreover, these are individual sessions of large language models that can collaborate for collective problem-solving. LLM…

Microsoft Copilot and 150 Other AI Features in Windows 11

1 October 2023

1 October 2023

Microsoft Copilot and 150 Other AI Features in Windows 11

1 October 2023

Microsoft has released an update for Windows 11, bringing over 150 AI features and the Copilot chatbot, designed to support most of the operating system’s applications. Copilot serves as a…

ChatGPT Enhancements: Voice Conversations and Image Recognition

25 September 2023

25 September 2023

ChatGPT conversations and image recognition

ChatGPT Enhancements: Voice Conversations and Image Recognition

25 September 2023

ChatGPT will be able to engage in voice conversations and recognize objects in images. For instance, ChatGPT is ready to read bedtime stories, assist in creating recipes from photos of…