MinerU: Open-Source AI Solution Significantly Boosts Document Extraction Accuracy

30 September 2024

Researchers from the Shanghai Artificial Intelligence Laboratory have developed MinerU, a cutting-edge open-source solution for precise document content extraction. MinerU is designed to extract and structure content from diverse document…

Molmo: Open Source Multimodal Vision-Language Models Outperform Gemini 1.5 and Claude 3.5

26 September 2024

Molmo is a new series of multimodal vision-language models (VLMs) created by researchers at the Allen Institute for AI and the University of Washington. The Molmo family outperforms many state-of-the-art…

EzAudio: Open Source Hyperrealistic Text-to-Audio Model

19 September 2024

EzAudio is a new transformer-based text-to-audio (T2A) diffusion model developed by researchers from Tencent AI Lab and Johns Hopkins University. EzAudio addresses key challenges in T2A generation, including generation quality, computational…

Scaling Test-Time Compute: A New Paradigm in LLM Performance

27 August 2024

Researchers from UC Berkeley and Google DeepMind published a groundbreaking paper titled “Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.” This paper introduces a transformative…

LongWriter: Open-Source Framework for Generating Texts Beyond 10,000 Words

19 August 2024

LongWriter is a framework and a set of large language models (LLMs) designed specifically to enable ultra-long text generation, often exceeding 10,000 words while maintaining coherence, quality, and relevance. It…

CRAM: Cutting AI Energy Consumption by 1,000 Times

30 July 2024

Researchers at the University of Minnesota Twin Cities have unveiled the Computational Random-Access Memory (CRAM) hardware architecture, poised to transform AI computing by drastically reducing energy consumption. CRAM promises to…

DenseAV Algorithm Learns Language from Videos

23 June 2024

The algorithm DenseAV, developed at MIT, learns to understand the meaning of words and sentences by watching videos of people conversing. DenseAV outperformed other algorithms in tasks involving identifying objects…

Zyda: 1.3T Dataset for Open Language Modeling

12 June 2024

Zyda is a 1.3 trillion-token open-source dataset designed for open language modeling. Zyda integrates a range of high-quality open datasets, including RefinedWeb, Starcoder, C4, and Pile, enhancing them through comprehensive filtering…

Hugging Face and Pollen Robotics Introduce Reachy2 – an Open-Source Robot for Household Tasks

10 June 2024

Hugging Face and Pollen Robotics unveiled the anthropomorphic robot Reachy2, whose training dataset and model are open-source. Reachy2 performs household tasks and interacts safely with people and pets. Pollen Robotics…

Gretel: The Largest Open Text-to-SQL Dataset

7 April 2024

Gretel, a startup specializing in generating high-quality synthetic data, has announced the creation of the largest open text-to-SQL dataset aimed at accelerating the development of no-code analytics tools. The dataset…

Shopping Muse: Mastercard’s Recommender System

10 December 2023

Mastercard has unveiled Shopping Muse, a chatbot-format module for online stores that recommends products to shoppers based on their purchase and search history, region, and other factors. Operating on the…

DeepMind GNoME Discovered 2 Million New Materials

3 December 2023

DeepMind has developed GNoME, a graph neural network that predicts material stability. GNoME has identified 2.2 million new materials, with 380,000 deemed stable for application in developing computer chips, batteries,…

MIT Releases Free Lecture Course on TinyML & Efficient DL Computing on YouTube

29 September 2023

In recent years, large language and diffusion models have showcased impressive results. However, their demands on computational resources and memory consumption pose significant challenges for researchers and developers. The TinyML…

Würstchen: An Open-Source Text-to-Image Model Consuming 16 Times Less GPU than Stable Diffusion 1.4

14 September 2023

Würstchen is an open text-to-image model that generates images faster than diffusion models like Stable Diffusion while consuming significantly less memory, achieving comparable results. The approach is based on a…

Persimmon-8B: An Open Model with a 16k Token Context, Running on a Single GPU

11 September 2023

Researchers from Adept have introduced the open-source language model Persimmon-8B with a 16k token context, four times longer than that of the most compact Llama 2 and text-davinci-002 used in…

14 Free Courses on Machine Learning, Data Science, Data Analysis, and Python

30 August 2023

The prevailing trend in online education is the rise of Massive Open Online Courses (MOOCs). Free courses on machine learning, data science, data analysis, and Python are readily available, based…

OpenAI Unlocks New Potential with GPT-3.5 Turbo Model Fine-Tuning

22 August 2023

OpenAI has introduced a significant update to its GPT-3.5 Turbo model, allowing developers to fine-tune the model for their specific tasks and applications. This enhancement opens up the opportunity for…

ReLoRA: Method for Enhancing Performance in Training Large Language Models

16 August 2023

ReLoRA is a technique for training large transformer-based language models using low-rank matrices, aimed at boosting training efficiency. The effectiveness of this method increases with the scale of the models.…

AI Photo Enhancer Online Apps Review: Improve Image Quality for Free

2 August 2023

In this article, we will explore AI photo enhancer online apps that improve image quality for free. The limit for free upscaling typically ranges from just 5 attempts to several…

Wix AI: Building Websites Using Chatbot Technology

23 July 2023

Wix, the website creation service, has announced the launch of its chatbot, Wix AI, which allows users to create and modify websites using natural language queries. Additionally, the tool will…

Llama 2 and Llama-2-Chat: A New Generation of Open Source Language Models

19 July 2023

The new generation of Llama models comprises three large language models, namely Llama 2 with 7, 13, and 70 billion parameters, along with the fine-tuned conversational models Llama-2-Chat 7B, 13B,…