February 8, 2024Anmol Nagpal

Apple launches AI image tool that lets you make edits by just describing them

Apple researchers have introduced an AI model enabling users to write desired changes to a photograph using simple language, eliminating the need for traditional photo editing software.

Source: Apple

The groundbreaking model, named MGIE (MLLM-Guided Image Editing), is the result of a collaboration between Apple and the University of California, Santa Barbara.

Source: Representative

Capable of executing various editing tasks including cropping, resizing, flipping, and applying filters solely through text prompts, MGIE represents a significant advancement in image editing.

Source: Pexels

This innovation can handle both straightforward and intricate editing requests, such as altering specific objects within a photo or enhancing brightness levels.

Source: ANI

MGIE harnesses the power of multimodal language models, first deciphering user prompts and then generating corresponding edits.

Source: X

For instance, a request for a "bluer sky" translates to adjusting the brightness of the sky portion of an image.

Source: Unsplash

This approach ensures precise interpretation and execution of editing instructions.

Source: Twitter

For example, a prompt to "make it more healthy" when editing an image of a pepperoni pizza results in the addition of vegetable toppings.

Source: LIL’ LUNA

Similarly, instructing the model to "add more contrast to simulate more light" enhances the brightness of a dark image, such as tigers in the Sahara.

Source: Naveen Patnaik on X

In a statement accompanying the release, the researchers highlighted the model's ability to derive explicit visual-aware intentions, leading to meaningful image enhancements.

Source: Unsplash

They conducted extensive studies validating MGIE's effectiveness across various editing scenarios, emphasising its improved performance while maintaining competitive efficiency.

Source: Unsplash

Furthermore, they envision the MLLM-guided framework contributing to future advancements in vision-and-language research.

Source: Pexels

Apple has made MGIE accessible for download via GitHub, with a web demo also available on Hugging Face Spaces.

Source: Freepik

However, the company has not disclosed its plans for the model beyond research purposes.

Source: Apple

While some image generation platforms like OpenAI's DALL-E 3 offer similar capabilities, and Adobe's Firefly AI model powers generative fill in its Photoshop software.

Source: Shutterstock

Apple's foray into the generative AI space signifies its commitment to incorporating advanced AI features into its products.

Source: unsplash