Deep Reinforcement Learning / Neural Networks and Deep Learning

ClawGUI: the first open-source end-to-end framework for GUI agents — from training to real device

15 April 2026

ClawGUI: the first open-source end-to-end framework for GUI agents — from training to real device

Researchers from Zhejiang University have published ClawGUI — a fully open-source framework for building GUI agents that control applications through their visual interface, just like a human would: taps, swipes,…

OpenClaw-RL: A Framework That Updates an Agent’s Weights on the Fly, Learning from User and Environment Feedback

17 March 2026

OpenClaw-RL: A Framework That Updates an Agent’s Weights on the Fly, Learning from User and Environment Feedback

Researchers from Princeton University have introduced OpenClaw-RL, a framework that allows an AI agent to improve in real time — without a separate data collection stage and without manual annotation.…

P1: First Open-Source Model to Win Gold at the International Physics Olympiad

30 November 2025

P1: First Open-Source Model to Win Gold at the International Physics Olympiad

P1-235B-A22B from Shanghai AI Laboratory became the first open-source model to win a gold medal at the latest International Physics Olympiad IPhO 2025, scoring 21.2 out of 30 points and…

QeRL: Training 32B Models on Single H100 vs Three GPUs, Beating LoRA in Accuracy

16 October 2025

QeRL rainforcement learning quantization training speedup

QeRL: Training 32B Models on Single H100 vs Three GPUs, Beating LoRA in Accuracy

QeRL is a framework for training language models using reinforcement learning that simultaneously reduces GPU requirements and surpasses traditional LoRA and QLoRA methods in accuracy. On the Qwen2.5-7B-Instruct model, QeRL…

14 Free Courses on Machine Learning, Data Science, Data Analysis, and Python

30 August 2023

free machine learning data science analysis python courses

14 Free Courses on Machine Learning, Data Science, Data Analysis, and Python

The prevailing trend in online education is the rise of Massive Open Online Courses (MOOCs). Free courses on machine learning, data science, data analysis, and Python are readily available, based…

Robot manage objects from video tutorials using RL

19 February 2021

19 February 2021

Robot manage objects from video tutorials using RL

19 February 2021

In FAIR, the RL-agent was trained to manage objects using video tutorials. Standard RL algorithms are trained to a problem iteratively through learning from errors. The proposed algorithm learns a…

Evolving Simple Programs for Playing Atari Games

2 August 2018

2 August 2018

Evolving Simple Programs for Playing Atari Games

2 August 2018

While a great number of researchers in Artificial Intelligence have focused their efforts on Deep Reinforcement Learning trying to beat human players on Atari games, researchers from the University of…