Mistral Large 2: Leading the Way in Open Source AI Code Generation

25 July 2024
Performance accuracy on code generation benchmarks (all models were benchmarked through the same evaluation pipeline)

Mistral Large 2: Leading the Way in Open Source AI Code Generation

Mistral AI has announced Mistral Large 2, the latest iteration of its flagship model, setting a new state of the art (SOTA) in open-source code generation models. This new model…

LLaMA 3.1 Models Released: Open Source and Comparable to GPT-4

24 July 2024
llama 3.1 human evaluation

LLaMA 3.1 Models Released: Open Source and Comparable to GPT-4

LLAMA 3,1 models has been officially released, including the massive 405 billion-parameter LLaMA 3.1 405B model. Expanded context length to 128K, support for eight languages, and the introduction of LLaMA…

Google PH-LLM: A Language Model for Health Monitoring

16 June 2024
Google PH-LLM pipeline

Google PH-LLM: A Language Model for Health Monitoring

Google developed the PH-LLM language model to analyze medical data collected from wearable devices such as smartwatches and heart rate monitors. During experiments, the model answered health-related questions and predicted…

Zyda: 1.3T Dataset for Open Language Modeling

12 June 2024
zyda dataset composition

Zyda: 1.3T Dataset for Open Language Modeling

Zyda is a 1.3 trillion-token open-source dataset designed for open language modeling. Zyda integrates a range of high-quality open datasets, including RefinedWeb, Starcoder, C4, Pile, enhancing them through comprehensive filtering…

Apple Unveils “Apple Intelligence” and OpenAI Partnership at WWDC

11 June 2024
Apple-WWDC24-Apple-Intelligence-OpenAI-deal

Apple Unveils “Apple Intelligence” and OpenAI Partnership at WWDC

Apple’s Worldwide Developers Conference (WWDC) saw a major focus on artificial intelligence, introducing “Apple Intelligence” and a strategic partnership with OpenAI. These announcements highlight Apple’s commitment to integrating AI across…

Qwen2: A Leap Forward in Open Source Language Models

7 June 2024
qwen2-72b comparison

Qwen2: A Leap Forward in Open Source Language Models

In a momentous development, the highly anticipated transition from Qwen1.5 to Qwen2 has finally arrived, marking a significant milestone in the realm of language models. Qwen 2 outperforms Llama 3…

GPT-4 Trained to Predict Financial Metrics Better Than Analysts

26 May 2024
finance market analisys ai model

GPT-4 Trained to Predict Financial Metrics Better Than Analysts

Scientists from the University of Chicago have demonstrated that large language models can conduct financial statement analysis of companies with greater accuracy than professional analysts. These research findings could impact…

Google RecurrentGemma: Next-Gen Local Language Model

14 April 2024
recurrentgemma пщщпду

Google RecurrentGemma: Next-Gen Local Language Model

Google has introduced the RecurrentGemma language model, designed to operate locally on devices with limited resources such as smartphones, personal computers, and smart speakers. The new architecture from Google significantly…

Apple MGIE: Multimodal Models for Image Editing

12 February 2024
apple mgie

Apple MGIE: Multimodal Models for Image Editing

Apple, in collaboration with the University of California, has developed the open-source MGIE model for image editing based on text input. This model tackles various editing tasks, including Photoshop-style image…

Google Introduces Gemini, a Cutting-Edge Language Model Set

7 December 2023

Google Introduces Gemini, a Cutting-Edge Language Model Set

Google has announced the creation of Gemini, a set of three language models surpassing competitors in 30 out of 32 benchmarks. The top-tier model, Gemini Ultra, is available through an…

Microsoft AutoGen: A Framework for Configuring LLM Agents

8 October 2023
AutoGen framework

Microsoft AutoGen: A Framework for Configuring LLM Agents

Microsoft has unveiled AutoGen, an open-source library designed for creating and configuring LLM agents. Moreover, these are individual sessions of large language models that can collaborate for collective problem-solving. LLM…

FLM-101B: Training 101 Billion Parameter Language Model with a $100K Budget

24 September 2023
FLM 101B evaluating growth strategy

FLM-101B: Training 101 Billion Parameter Language Model with a $100K Budget

Researchers from Beijing University present FLM-101B, an open-source large language model (LLM) with 101 billion parameters trained from scratch with a budget of only $100K. Training LLMs at large scales…

Falcon 180B: The Largest Open Language Model Surpasses Llama 2 and GPT 3.5

6 September 2023
falcon 180b model intro

Falcon 180B: The Largest Open Language Model Surpasses Llama 2 and GPT 3.5

The Institute of Technological Innovations from the UAE has unveiled Falcon 180B, the largest open language model, displacing Llama 2 from the top spot in the rankings of pre-trained open-access…

OpenAI Suggests Teachers Use ChatGPT for Lesson Preparation and Assessment

5 September 2023
chatgpt for teachers

OpenAI Suggests Teachers Use ChatGPT for Lesson Preparation and Assessment

OpenAI, in anticipation of the upcoming academic year, has revealed how teachers can leverage ChatGPT to streamline the teaching process. In the article “Teaching with AI,” the company presents four…

Google VRDU: Advancing Document Content Understanding with Dataset and Benchmark

27 August 2023
google vrdu 2

Google VRDU: Advancing Document Content Understanding with Dataset and Benchmark

Google has publicly released VRDU, a dataset and benchmark designed for training models in understanding document content. VRDU aims to accelerate the development of models capable of processing complex documents…

OpenAI Unlocks New Potential with GPT-3.5 Turbo Model Fine-Tuning

22 August 2023
GPT 3.5 turbo finetuning

OpenAI Unlocks New Potential with GPT-3.5 Turbo Model Fine-Tuning

OpenAI has introduced a significant update to its GPT-3.5 Turbo model, allowing developers to fine-tune the model for their specific tasks and applications. This enhancement opens up the opportunity for…

Arthur Bench: Framework for Evaluating Language Models

20 August 2023
arthur bench

Arthur Bench: Framework for Evaluating Language Models

American startup Arthur has released an open-source framework called Bench for evaluating and comparing the performance of large language models. This tool enables users to select the most suitable language…

ConPLex: Language Model for Drug Development

11 June 2023
ConPLex

ConPLex: Language Model for Drug Development

ConPLex is a language model trained to analyze chemical databases and search for potential drug molecules that interact best with specific target proteins. The model enables the exploration of over…

LIMA: Pretraining Method on 1000 Examples Achieved GPT4-Level Accuracy

31 May 2023
LIMA LLAMA

LIMA: Pretraining Method on 1000 Examples Achieved GPT4-Level Accuracy

Language models typically undergo two stages of training: unsupervised pretraining and fine-tuning to specific tasks and user preferences. The novel LIMA method (Less Is More for Alignment) challenges the traditional…