Ideogram 2.0: Generating Text on Images with Unmatched Accuracy

22 August 2024

Ideogram launched its groundbreaking Ideogram 2.0 model, setting new standards in the text-to-image generation space. Trained from scratch, Ideogram 2.0 significantly outperforms existing models in key quality metrics such as…

Midjourney Introduces Character Transfer Feature to New Images

17 March 2024

The image generation service Midjourney now lets users transfer a character to new images by including a link to an existing image of that character in the request. This functionality…

Apple MGIE: Multimodal Models for Image Editing

12 February 2024

Apple, in collaboration with the University of California, has developed the open-source MGIE model for image editing based on text input. This model tackles various editing tasks, including Photoshop-style image…

Microsoft DragNUWA: Video Generation via Object Trajectories

15 January 2024

Microsoft has released the DragNUWA weights – a cross-domain video generation model that offers more precise control over the resulting output compared to similar models. Control is achieved by simultaneously…

OpenAI Announced the Release of Dall-E 3 in Early October

20 September 2023

OpenAI announced the release of Dall-E 3 in the ChatGPT interface in early October. Researchers revealed that the new version of the text-to-image model surpasses Dall-E 2 in several key aspects.…

Würstchen: An Open-Source Text-to-Image Model Consuming 16 Times Less GPU than Stable Diffusion 1.4

14 September 2023

Würstchen is an open text-to-image model that generates images faster than diffusion models like Stable Diffusion while consuming significantly less memory, achieving comparable results. The approach is based on a…

Best AI Photo Generator Apps: Top 10 Selection

12 September 2023

Which AI can draw pictures from words with maximum quality and minimal time investment? We have conducted research to find out the best AI photo generator apps that create images…

AI Photo Enhancer Online Apps Review: Improve Image Quality for Free

2 August 2023

In this article, we will explore AI photo enhancer online apps that improve image quality for free. The limit for free upscaling typically ranges from just 5 attempts to several…

Stability AI Introduces Stable Diffusion SDXL 1.0 Model

26 July 2023

Stability AI has announced the release of Stable Diffusion SDXL 1.0, a new version of the popular image generation model. SDXL 1.0 is a foundational model with 3.5 billion parameters…

Google Bard Update: Image Processing and New Language Support

16 July 2023

Google Bard has undergone an update, expanding its functionality to 46 languages across more than 200 countries, including countries in Europe and Brazil. The latest features include image processing, dialog…

PACGen: Personalized and Controllable Text-to-Image Generation

7 July 2023

Researchers from the University of Wisconsin-Madison have introduced a text-to-image diffusion model called PACGen (Personalized and Controllable Text-to-Image Generation) for transferring objects from one image to a new scene generated…

NVIDIA neural network generates realistic 3D worlds based on Minecraft

22 April 2021

NVIDIA has unveiled GANcraft, a neural network for creating photorealistic images based on 3D block worlds, similar to the worlds in Minecraft. GANcraft creates a visualization of a world, taking…

The StyleCLIP neural network sets picture style based on a text description

9 April 2021

StyleCLIP is a combination of CLIP and StyleGAN models designed to manipulate image style with text prompts. The open-source code is available, including Google Colab notebooks. Why is it needed? StyleGAN…

SAM: the neural network changes the age on the image of a person’s face

17 February 2021

SAM is a neural network model that changes the age of a person in an image. The model takes as input an image of a person’s face and a target age.…

Semantic Data Augmentation Improves Neural Network’s Generalization

24 July 2020

A group of researchers from the University of Beijing has proposed a novel implicit semantic data augmentation method that improves the generalization capabilities of deep neural networks. Data augmentation has…

Dissecting GANs for Better Understanding and Visualization

5 December 2018

GANs can be taught to create (or generate) worlds similar to our own in any domain: images, music, speech, etc. Since 2014, a large number of improvements to GANs have…

PIFR: Pose Invariant 3D Face Reconstruction

26 November 2018

3D face geometry needs to be recovered from 2D images in many real-world applications, including face recognition, face landmark detection, and 3D emoticon animation. However, this task remains challenging, especially…

This Neural Network Evaluates Natural Scene Memorability

1 October 2018

One hallmark of human cognition is the remarkable capacity to recall thousands of different images, some in detail, after only a single viewing. Not all photos are remembered equally in…

True Face Super-Resolution Upscaling with the Facial Component Heatmaps

1 October 2018

The performance of most facial analysis techniques relies on the resolution of the corresponding image. Face alignment or face identification is not going to work correctly when the resolution…

Anatomically-Aware Facial Animation from a Single Image

7 August 2018

Let’s say you have a picture of Hugh Jackman for an advertisement. He looks great, but the client wants him to look a little bit happier. No, you don’t…

Unsupervised Attention-Guided Image-to-Image Translation

30 July 2018

Image-to-image translation is the task of mapping an image from a source domain to a target domain. Applications include image colorization, image super-resolution, style transfer, domain adaptation and data augmentation. Most of the approaches require…