Deep Learning / Neural Networks and Deep Learning

Claude Sonnet 4.5 Leads on Comprehensive Backend Benchmark, Outperforming in Both Code and Environment Configuration

22 January 2026

A team of researchers from Fudan University and Shanghai Qiji Zhifeng Co. introduced ABC-Bench, the first benchmark that tests the ability of AI agents to solve full-fledged backend development…

From Millions Spent on “Thank You” to Efficient Inference: Boilerplate Detection in a Single Token

31 October 2025

Researchers from JFrog published a study demonstrating a method for early detection of boilerplate responses in large language models after generating just a single token. The method enables computational cost…

How To Choose A Generative AI Platform

26 August 2025

Most teams outgrow single-model tools once they need governance, repeatability, and multi-model routing. This guide shows what belongs in a Generative AI platform and how to evaluate options with architecture-level…

MiniCPM4: Open Local Model Achieves Qwen3-8B Performance with 7x Inference Acceleration

15 June 2025

The OpenBMB research team presented MiniCPM4 — a highly efficient language model designed specifically for local devices. MiniCPM4-8B achieves comparable performance to Qwen3-8B (81.13 vs 80.55), while requiring 4.5 times…

Strict On-Policy Training with Optimal Baseline: Microsoft Introduces Simplified Algorithm for RLHF

4 June 2025

The Microsoft Research team introduced On-Policy RL with Optimal reward baseline (OPO) — a simplified reinforcement learning algorithm for aligning large language models. The new method addresses key problems of…

ZEROSEARCH: A Framework That Cuts LLM Search Training Costs by 88%

9 May 2025

Alibaba’s NLP research team has officially open-sourced ZEROSEARCH, a complete framework for training LLMs to search without using real search engines. ZEROSEARCH builds on a key insight: LLMs have already…

DeepMath-103K: Advancing AI Reasoning Through Challenge

21 April 2025

Mathematical reasoning stands as a crucial benchmark for artificial intelligence systems, requiring logical deduction, symbolic manipulation, and multi-step problem-solving. Recent breakthroughs in AI reasoning have been significantly driven by reinforcement…

Building an AI-Powered Game: A Deep Dive into DeepLearning.AI’s Latest Free Course Making AI Game Development Accessible

2 December 2024

DeepLearning.AI’s newly released course, “Building an AI-Powered Game,” represents a significant step forward in making AI game development accessible to developers and enthusiasts alike. This comprehensive analysis explores how the…

MinerU: Open-Source AI Solution Significantly Boosts Document Extraction Accuracy

30 September 2024

Researchers from the Shanghai Artificial Intelligence Laboratory have developed MinerU, a cutting-edge open-source solution for precise document content extraction. MinerU is designed to extract and structure content from diverse document…

Molmo: Open Source Multimodal Vision-Language Models Outperform Gemini 1.5 and Claude 3.5

26 September 2024

Molmo is a new series of multimodal vision-language models (VLMs) created by researchers at the Allen Institute for AI and the University of Washington. The Molmo family outperforms many state-of-the-art…

EzAudio: Open Source Hyperrealistic Text-to-Audio Model

19 September 2024

EzAudio is a new transformer-based text-to-audio (T2A) diffusion model developed by researchers from Tencent AI Lab and Johns Hopkins University. EzAudio addresses key challenges in T2A generation, including generation quality, computational…

Scaling Test-Time Compute: A New Paradigm in LLM Performance

27 August 2024

Researchers from UC Berkeley and Google DeepMind published a groundbreaking paper titled “Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.” This paper introduces a transformative…

LongWriter: Open-Source Framework for Generating Texts Beyond 10,000 Words

19 August 2024

LongWriter is a framework and a set of large language models (LLMs) designed specifically to enable ultra-long text generation, often exceeding 10,000 words while maintaining coherence, quality, and relevance. It…

CRAM: Cutting AI Energy Consumption by 1,000 Times

30 July 2024

Researchers at the University of Minnesota Twin Cities have unveiled the Computational Random-Access Memory (CRAM) hardware architecture, poised to transform AI computing by drastically reducing energy consumption. CRAM promises to…

DenseAV Algorithm Learns Language from Videos

23 June 2024

The algorithm DenseAV, developed at MIT, learns to understand the meaning of words and sentences by watching videos of people conversing. DenseAV outperformed other algorithms in tasks involving identifying objects…

Zyda: 1.3T Dataset for Open Language Modeling

12 June 2024

Zyda is a 1.3 trillion-token open-source dataset designed for open language modeling. Zyda integrates a range of high-quality open datasets, including RefinedWeb, Starcoder, C4, Pile, enhancing them through comprehensive filtering…

Hugging Face and Pollen Robotics Introduce Reachy2 – an Open-Source Robot for Household Tasks

10 June 2024

Hugging Face and Pollen Robotics unveiled the anthropomorphic robot Reachy2, whose training dataset and model are open-source. Reachy2 performs household tasks and interacts safely with people and pets. Pollen Robotics…

Gretel: The Largest Open Text-to-SQL Dataset

7 April 2024

Gretel, a startup specializing in generating high-quality synthetic data, has announced the creation of the largest open text-to-SQL dataset aimed at accelerating the development of no-code analytics tools. The dataset…

Shopping Muse: Mastercard’s Recommender System

10 December 2023

Mastercard has unveiled Shopping Muse, a chatbot-format module for online stores that recommends products to shoppers based on their purchase and search history, region, and other factors. Operating on the…

DeepMind GNoME Discovered 2 Million New Materials

3 December 2023

DeepMind has developed GNoME, a graph neural network that predicts material stability. GNoME has identified 2.2 million new materials, with 380 thousand deemed stable for application in developing computer chips, batteries,…

MIT Releases Free Lecture Course on TinyML & Efficient DL Computing on YouTube

29 September 2023

In recent years, large language and diffusion models have showcased impressive results. However, their demands on computational resources and memory consumption pose significant challenges for researchers and developers. The TinyML…