Google RecurrentGemma: Next-Gen Local Language Model

14 April 2024
recurrentgemma пщщпду

Google RecurrentGemma: Next-Gen Local Language Model

Google has introduced the RecurrentGemma language model, designed to operate locally on devices with limited resources such as smartphones, personal computers, and smart speakers. The new architecture from Google significantly…

Apple MGIE: Multimodal Models for Image Editing

12 February 2024
apple mgie

Apple MGIE: Multimodal Models for Image Editing

Apple, in collaboration with the University of California, has developed the open-source MGIE model for image editing based on text input. This model tackles various editing tasks, including Photoshop-style image…

Google Introduces Gemini, a Cutting-Edge Language Model Set

7 December 2023

Google Introduces Gemini, a Cutting-Edge Language Model Set

Google has announced the creation of Gemini, a set of three language models surpassing competitors in 30 out of 32 benchmarks. The top-tier model, Gemini Ultra, is available through an…

Microsoft AutoGen: A Framework for Configuring LLM Agents

8 October 2023
AutoGen framework

Microsoft AutoGen: A Framework for Configuring LLM Agents

Microsoft has unveiled AutoGen, an open-source library designed for creating and configuring LLM agents. Moreover, these are individual sessions of large language models that can collaborate for collective problem-solving. LLM…

FLM-101B: Training 101 Billion Parameter Language Model with a $100K Budget

24 September 2023
FLM 101B evaluating growth strategy

FLM-101B: Training 101 Billion Parameter Language Model with a $100K Budget

Researchers from Beijing University present FLM-101B, an open-source large language model (LLM) with 101 billion parameters trained from scratch with a budget of only $100K. Training LLMs at large scales…

Falcon 180B: The Largest Open Language Model Surpasses Llama 2 and GPT 3.5

6 September 2023
falcon 180b model intro

Falcon 180B: The Largest Open Language Model Surpasses Llama 2 and GPT 3.5

The Institute of Technological Innovations from the UAE has unveiled Falcon 180B, the largest open language model, displacing Llama 2 from the top spot in the rankings of pre-trained open-access…

OpenAI Suggests Teachers Use ChatGPT for Lesson Preparation and Assessment

5 September 2023
chatgpt for teachers

OpenAI Suggests Teachers Use ChatGPT for Lesson Preparation and Assessment

OpenAI, in anticipation of the upcoming academic year, has revealed how teachers can leverage ChatGPT to streamline the teaching process. In the article “Teaching with AI,” the company presents four…

Google VRDU: Advancing Document Content Understanding with Dataset and Benchmark

27 August 2023
google vrdu 2

Google VRDU: Advancing Document Content Understanding with Dataset and Benchmark

Google has publicly released VRDU, a dataset and benchmark designed for training models in understanding document content. VRDU aims to accelerate the development of models capable of processing complex documents…

OpenAI Unlocks New Potential with GPT-3.5 Turbo Model Fine-Tuning

22 August 2023
GPT 3.5 turbo finetuning

OpenAI Unlocks New Potential with GPT-3.5 Turbo Model Fine-Tuning

OpenAI has introduced a significant update to its GPT-3.5 Turbo model, allowing developers to fine-tune the model for their specific tasks and applications. This enhancement opens up the opportunity for…

Arthur Bench: Framework for Evaluating Language Models

20 August 2023
arthur bench

Arthur Bench: Framework for Evaluating Language Models

American startup Arthur has released an open-source framework called Bench for evaluating and comparing the performance of large language models. This tool enables users to select the most suitable language…

ConPLex: Language Model for Drug Development

11 June 2023
ConPLex

ConPLex: Language Model for Drug Development

ConPLex is a language model trained to analyze chemical databases and search for potential drug molecules that interact best with specific target proteins. The model enables the exploration of over…

LIMA: Pretraining Method on 1000 Examples Achieved GPT4-Level Accuracy

31 May 2023
LIMA LLAMA

LIMA: Pretraining Method on 1000 Examples Achieved GPT4-Level Accuracy

Language models typically undergo two stages of training: unsupervised pretraining and fine-tuning to specific tasks and user preferences. The novel LIMA method (Less Is More for Alignment) challenges the traditional…