Anmol Nagpal

Apple launches AI image tool that lets you make edits by just describing them

Apple researchers have introduced an AI model enabling users to write desired changes to a photograph using simple language, eliminating the need for traditional photo editing software.

Source: Apple

The groundbreaking model, named MGIE (MLLM-Guided Image Editing), is the result of a collaboration between Apple and the University of California, Santa Barbara.

Source: Representative

Capable of executing various editing tasks including cropping, resizing, flipping, and applying filters solely through text prompts, MGIE represents a significant advancement in image editing.

Source: Pexels

This innovation can handle both straightforward and intricate editing requests, such as altering specific objects within a photo or enhancing brightness levels.

Source: ANI

MGIE harnesses the power of multimodal language models, first deciphering user prompts and then generating corresponding edits.

Source: X

For instance, a request for a "bluer sky" translates to adjusting the brightness of the sky portion of an image.

Source: Unsplash

This approach ensures precise interpretation and execution of editing instructions.

Source: Twitter

For example, a prompt to "make it more healthy" when editing an image of a pepperoni pizza results in the addition of vegetable toppings.

Source: LIL’ LUNA

Similarly, instructing the model to "add more contrast to simulate more light" enhances the brightness of a dark image, such as tigers in the Sahara.

Source: Naveen Patnaik on X

In a statement accompanying the release, the researchers highlighted the model's ability to derive explicit visual-aware intentions, leading to meaningful image enhancements.

Source: Unsplash

They conducted extensive studies validating MGIE's effectiveness across various editing scenarios, emphasising its improved performance while maintaining competitive efficiency.

Source: Unsplash

Furthermore, they envision the MLLM-guided framework contributing to future advancements in vision-and-language research.

Source: Pexels

Apple has made MGIE accessible for download via GitHub, with a web demo also available on Hugging Face Spaces.

Source: Freepik

However, the company has not disclosed its plans for the model beyond research purposes.

Source: Apple

While some image generation platforms like OpenAI's DALL-E 3 offer similar capabilities, and Adobe's Firefly AI model powers generative fill in its Photoshop software.

Source: Shutterstock

Apple's foray into the generative AI space signifies its commitment to incorporating advanced AI features into its products.

Source: unsplash