Trinity-Large-Thinking 400B: an open model matching Claude Opus-4.6 on agentic benchmarks at 28x lower price
3 April 2026
Arcee AI has released Trinity-Large-Thinking — an open-weight reasoning model for complex multi-turn agentic tasks. On PinchBench — a comprehensive benchmark for AI agents — it ranks second among all…
PixelSmile: Open Model for Facial Expression Editing with Smooth Intensity Control
31 March 2026
Researchers from Fudan University and StepFun have published PixelSmile — a diffusion model for precise facial expression editing in portraits and anime images. Instead of training on discrete labels like…
RealRestorer: Open-Source Image Enhancement Model Outperforms Nano Banana Pro on Real-World Benchmark
30 March 2026
A team of researchers from StepFun, Southern University of Science and Technology, and the Chinese Academy of Sciences has published RealRestorer — an open-source image quality enhancement model that removes…
MinerU-Diffusion: A New Approach to OCR via Diffusion Decoding Speeds Up PDF Parsing 3× Without Accuracy Loss
27 March 2026
A team from Shanghai Artificial Intelligence Laboratory and Peking University published MinerU-Diffusion — a document OCR framework that abandons classical autoregressive generation in favor of diffusion-based decoding. The project is…
daVinci-MagiHuman: Open 15B Model Generates a 5-Second Lip Sync Video in 2 Seconds on a Single H100
24 March 2026
SII-GAIR and Sand.ai have published daVinci-MagiHuman — an open-source multimodal 15B model based on a single-stream transformer that simultaneously generates video with precise lip sync and synchronized audio, producing a…
OpenClaw: The Lobster That Took Over the World — How One Developer Built the Most Popular Open-Source AI Agent in History
18 March 2026
OpenClaw is a free and open-source AI agent created by Austrian developer Peter Steinberger in November 2025. An AI agent is a software wrapper around a language model that does…
OpenClaw-RL: A Framework That Updates an Agent’s Weights on the Fly, Learning from User and Environment Feedback
17 March 2026
Researchers from Princeton University have introduced OpenClaw-RL, a framework that allows an AI agent to improve in real time — without a separate data collection stage and without manual annotation.…
Helios: 14B Model Generates Videos Longer Than 60 Seconds at 19.5 FPS on a Single H100
11 March 2026
A team of researchers from Peking University and ByteDance published Helios — an autoregressive diffusion transformer with 14 billion parameters that generates video at 19.5 frames per second on a…
VBVR: 2 Million Videos for Reasoning Training — an Open Dataset That Changes the Rules
26 February 2026
A team of more than 50 researchers from around the world — from Berkeley, Stanford, CMU, Oxford and other universities — has published Very Big Video Reasoning (VBVR), a massive…
Choosing Between Vendor AI, In-House Builds and Hybrid Delivery
20 February 2026
Most organisations now face a practical choice about how to deliver AI. Should they rely on vendor tools and platforms? Should they build capabilities in-house? Or should they blend both…
GLM-5: Top-1 Open-Weight Model for Code and Text Generation, Competing with Claude and GPT on Agentic Tasks
19 February 2026
Zhipu AI and Tsinghua University have published a GLM-5 technical report — currently the top-performing open-weight language model by benchmarks: first place among open-weight models on Artificial Analysis and top-1…
Baichuan-M3: An Open Medical Model That Conducts Consultations Like a Real Doctor and Outperforms GPT-5.2 on Benchmarks
10 February 2026
A research team from the Chinese company Baichuan has introduced Baichuan-M3 — an open medical language model that, instead of the traditional question-and-answer mode, conducts a full clinical dialogue, actively…
Hyper-Personalized Email Marketing: How AI is Killing the “Blast” Method
Email marketing isn’t dead—but the spray-and-pray approach certainly is. Sending identical messages to your entire list and hoping for the best no longer cuts it when consumers expect personalized experiences…
Claude Sonnet 4.5 Leads on Comprehensive Backend Benchmark, Outperforming in Both Code and Environment Configuration
22 January 2026
A team of researchers from Fudan University and Shanghai Qiji Zhifeng Co. introduced ABC-Bench — the first benchmark that tests the ability of AI agents to solve full-fledged backend development…
Multiplex Thinking: Sampling 3 Tokens Instead of 1 Increases Olympiad Problem-Solving Accuracy from 40% to 55%
22 January 2026
Researchers from the University of Pennsylvania and Microsoft Research introduced Multiplex Thinking — a new reasoning method for large language models. The idea is to generate not one token at…
Yume1.5: An Open Model for Creating Interactive Virtual Worlds with Keyboard Control
5 January 2026
Researchers from Shanghai AI Laboratory and Fudan University published Yume1.5 — a model for generating interactive virtual worlds that can be controlled directly from the keyboard. Unlike regular video generation,…
AI Models Are 13% Worse Than Humans at Detecting Generated ASMR Videos
18 December 2025
Researchers from CUHK, NUS, University of Oxford, and Video Rebirth introduced Video Reality Test — the first benchmark that tests whether modern AI models can create videos indistinguishable from real…
Wan-Move: Open-Source Alternative to Kling 1.5 Pro for Motion-Controllable Video Generation
13 December 2025
A team of researchers from Tongyi Lab (Alibaba Group), Tsinghua University, and the University of Hong Kong presented Wan-Move — a new approach to precise motion control in generative video…
P1: First Open-Source Model to Win Gold at the International Physics Olympiad
30 November 2025
P1-235B-A22B from Shanghai AI Laboratory became the first open-source model to win a gold medal at the International Physics Olympiad (IPhO 2025), scoring 21.2 out of 30 points and…
MiroThinker v1.0: Open-Source AI Research Agent Learns to Make Up to 600 Tool Calls Per Task
20 November 2025
The MiroMind team introduced MiroThinker v1.0 — an AI research agent capable of performing up to 600 tool calls per task with a 256K token context window. On four key…
Which AI Can Play a Villain: Comparing Alignment Algorithms Across 17 Models
13 November 2025
Researchers from Tencent Multimodal Department and Sun Yat-Sen University published a study on how large language models handle role-playing. It turns out that AI models are mediocre at role-playing: even…



















