{"id":3234,"date":"2025-08-20T15:08:04","date_gmt":"2025-08-20T15:08:04","guid":{"rendered":"https:\/\/violethoward.com\/new\/qwen-image-edit-gives-photoshop-a-run-for-its-money-with-ai-powered-text-to-image-edits-that-work-in-seconds\/"},"modified":"2025-08-20T15:08:04","modified_gmt":"2025-08-20T15:08:04","slug":"qwen-image-edit-gives-photoshop-a-run-for-its-money-with-ai-powered-text-to-image-edits-that-work-in-seconds","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/qwen-image-edit-gives-photoshop-a-run-for-its-money-with-ai-powered-text-to-image-edits-that-work-in-seconds\/","title":{"rendered":"Qwen-Image Edit gives Photoshop a run for its money with AI-powered text-to-image edits that work in seconds"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders.<\/em> <em>Subscribe Now<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p>Adobe Photoshop is among the most recognizable pieces of software ever created, used by more than 90% of the world\u2019s creative professionals, according to Photutorial.<\/p>\n\n\n\n<p>So the fact that a <strong>new open source AI model<\/strong> \u2014 Qwen-Image Edit, released yesterday by Chinese e-commerce giant Alibaba\u2019s Qwen Team of AI researchers \u2014 is<strong> now able to accomplish a huge number of Photoshop-like editing jobs with text inputs alone<\/strong>, is a notable achievement.<\/p>\n\n\n\n<p>Built on the 20-billion-parameter Qwen-Image foundation model released earlier this month, Qwen-Image-Edit extends the system\u2019s unique strengths in text rendering to cover a wide spectrum of editing tasks, from subtle appearance changes to broader semantic transformations.<\/p>\n\n\n\n<p>Simply upload a starting image \u2014 I 
tried one of myself from VentureBeat\u2019s last annual Transform conference in San Francisco \u2014 and then type instructions of what you want to change, and Qwen-Image-Edit will return a new image with those edits applied.<\/p>\n\n\n\n<div id=\"boilerplate_2803147\" class=\"post-boilerplate boilerplate-speedbump\">\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong\/><strong>AI Scaling Hits Its Limits<\/strong><\/p>\n\n\n\n<p>Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Turning energy into a strategic advantage<\/li>\n\n\n\n<li>Architecting efficient inference for real throughput gains<\/li>\n\n\n\n<li>Unlocking competitive ROI with sustainable AI systems<\/li>\n<\/ul>\n\n\n\n<p><strong>Secure your spot to stay ahead<\/strong>: https:\/\/bit.ly\/4mwGngO<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n<\/div><p>Input image example:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"461\" height=\"470\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png\" alt=\"\" class=\"wp-image-3015788\" style=\"width:840px;height:auto\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png 461w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png?resize=300,306 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png?resize=52,52 52w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png?resize=400,408 400w\" sizes=\"(max-width: 461px) 100vw, 461px\"\/><figcaption class=\"wp-element-caption\">Photo credit: Michael O\u2019Donnell 
Photography<\/figcaption><\/figure>\n\n\n\n<p>Output image example with prompt: \u201cMake the man wearing a tuxedo.\u201d<\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" height=\"600\" width=\"600\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?w=600\" alt=\"\" class=\"wp-image-3015789\" style=\"width:838px;height:auto\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png 1024w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=300,300 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=768,768 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=600,600 600w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=52,52 52w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=160,160 160w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=400,400 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=750,750 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=578,578 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/1755620145-2.png?resize=930,930 930w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\"\/><\/figure>\n\n\n\n<p>The model is available now across several platforms, including <strong>Qwen Chat<\/strong>, <strong>Hugging Face<\/strong>, <strong>ModelScope<\/strong>, <strong>GitHub<\/strong>, and through the <strong>Alibaba Cloud application programming interface (API)<\/strong>, the latter which allows any third-party developer or enterprise to integrate this new model into their own applications and workflows. 
<\/p>\n\n\n\n<p>I created my examples above on Qwen Chat, the Qwen Team\u2019s rival to OpenAI\u2019s ChatGPT, however, it should be noted for any aspiring users that generations are limited to about 8 free jobs (input\/outputs) per 12 hour period before it resets. Paying users can have access to more jobs.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"600\" width=\"737\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12%E2%80%AFPM.png?w=737\" alt=\"\" class=\"wp-image-3015793\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png 940w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=300,244 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=768,625 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=737,600 737w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=400,326 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=750,610 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=578,470 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-19-at-4.14.12\u202fPM.png?resize=930,757 930w\" sizes=\"auto, (max-width: 737px) 100vw, 737px\"\/><\/figure>\n\n\n\n<p>With support for both English and Chinese inputs, and a dual focus on both semantic meaning and visual fidelity, Qwen-Image-Edit aims to lower barriers to professional-grade visual content creation.<\/p>\n\n\n\n<p>And given that the model is available as an open source code under an Apache 2.0 license, it\u2019s safe for enterprises to take, 
download and set up for free on their own hardware or virtual clouds\/machines, potentially resulting in a huge cost savings from proprietary software like Photoshop. <\/p>\n\n\n\n<p>As <strong>Junyang Lin, a Qwen Team researcher wrote on X, \u201cit can remove a strand of hair, very delicate image modification.\u201d<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"\/>\n\n\n\n<p>The team\u2019s announcement echoes this sentiment, presenting Qwen-Image-Edit not as an entirely new system, but as a natural extension of Qwen-Image that applies its unique text rendering and dual-encoding approach directly to editing tasks.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-dual-encodings-allow-for-edits-preserving-style-and-content-of-original-image\">Dual encodings allow for edits preserving style and content of original image<\/h2>\n\n\n\n<p>Qwen-Image-Edit builds on the foundation established by <strong>Qwen-Image<\/strong>, which was introduced earlier this year as a large-scale model specializing in both image generation and text rendering. <\/p>\n\n\n\n<p>Qwen-Image\u2019s technical report highlighted its ability to handle complex tasks like paragraph-level text rendering, Chinese and English characters, and multi-line layouts with accuracy. <\/p>\n\n\n\n<p>The report also emphasized a <strong>dual-encoding mechanism<\/strong>, feeding images simultaneously into Qwen2.5-VL for semantic control and a variational autoencoder (VAE) for reconstructive detail. This approach allows edits that remain faithful to both the intent of the prompt and the look of the original image.<\/p>\n\n\n\n<p>Those same architectural choices underpin Qwen-Image-Edit. By leveraging dual encodings, the model can adjust at two levels: <strong>semantic edits<\/strong> that change the meaning or structure of a scene, and <strong>appearance edits<\/strong> that introduce or remove elements while keeping the rest untouched. 
<\/p>\n\n\n\n<p><strong>Semantic editing<\/strong> includes creating new intellectual property, rotating objects 90 or 180 degrees to reveal different views, or transforming an input into another style such as Studio Ghibli-inspired art. These edits typically modify many pixels but preserve the underlying identity of objects.<\/p>\n\n\n\n<p>Here\u2019s an example of semantic editing from Shridhar Athinarayanan, an engineer at AI applications platform Replicate, who used a Replicate-hosted implementation or \u201cinference\u201d of Qwen to reskin a photo of Manhattan to look like a toy Lego set.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"\/>\n\n\n\n<p><strong>Appearance editing<\/strong> focuses on precise, local changes. In these cases, most of the image remains unchanged while specific objects are altered. Demonstrations include adding a signboard that generates a reflection in water, removing stray hair strands from a portrait, and changing the color of a single letter in a text image.<\/p>\n\n\n\n<p>One good example of appearance editing with Qwen-Image Edit comes from AnswerAI co-founder and CEO Thomas Hill who posted a side-by-side on X showing his wife in her wedding dress below an archway and another with the same archway covered with graffiti:<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"\/>\n\n\n\n<p>Combined with Qwen\u2019s established strength in rendering Chinese and English text, the editing-focused system is positioned as a flexible tool for creators who need more than simple generative imagery.<\/p>\n\n\n\n<p>The dual control over semantic scope and appearance fidelity means the same tool can serve very different needs, from creative IP development to production-level photo retouching.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-adding-or-removing-text-to-images\">Adding or removing text to images<\/h2>\n\n\n\n<p>Another standout 
capability is <strong>bilingual text editing<\/strong>. Qwen-Image-Edit allows users to add, remove, or modify text in both Chinese and English while preserving font, size, and style. <\/p>\n\n\n\n<p>This expands on Qwen-Image\u2019s reputation for strong text rendering, particularly in challenging scenarios like intricate Chinese characters.<\/p>\n\n\n\n<p>In practice, this allows for accurate editing of posters, signs, T-shirts, or calligraphy artworks where small text details matter, as seen in another example from Replicate below.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"\/>\n\n\n\n<p>One demonstration involved correcting errors in a piece of generated Chinese calligraphy through a step-by-step chained editing process. <\/p>\n\n\n\n<p>Users could highlight incorrect regions, instruct the system to fix them, and then further refine details until the correct characters were rendered. This iterative approach shows how the model can be applied to high-stakes editing tasks where precision is essential.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-applications-and-use-cases\">Applications and use cases<\/h2>\n\n\n\n<p>The Qwen team has highlighted a range of potential applications:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Creative design and IP expansion<\/strong>, such as generating mascot-based emoji packs.<\/li>\n\n\n\n<li><strong>Advertising and content creation<\/strong>, where logos, signage, and text-heavy visuals can be customized.<\/li>\n\n\n\n<li><strong>Virtual avatars and art<\/strong>, with style transfer supporting unique character representations.<\/li>\n\n\n\n<li><strong>Photography and personal use<\/strong>, including background adjustments, clothing changes, and object removal.<\/li>\n\n\n\n<li><strong>Cultural preservation<\/strong>, demonstrated through correcting classical calligraphy works.<\/li>\n<\/ul>\n\n\n\n<p>By bridging fine-grained editing with broader creative 
transformations, Qwen-Image-Edit caters to professionals who need control while remaining approachable for casual experimentation.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-benchmarking-and-performance\">Benchmarking and performance<\/h2>\n\n\n\n<p>According to the Qwen team, evaluations across public benchmarks indicate that Qwen-Image-Edit delivers <strong>state-of-the-art performance<\/strong> in image editing. <\/p>\n\n\n\n<p>This follows from the broader Qwen-Image technical evaluations, where the base model achieved leading results in both general image generation and text rendering tasks. <\/p>\n\n\n\n<p>While specific editing benchmark figures were not detailed in the release, Qwen-Image itself ranked highly in independent evaluations such as AI Arena, where human raters compared outputs across models from different providers.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-api-pricing-and-availability\">API pricing and availability<\/h2>\n\n\n\n<p>Through <strong>Alibaba Cloud Model Studio<\/strong>, developers can access Qwen-Image-Edit as an API. Pricing is set at <strong>$0.045 per image<\/strong>, with a free quota of <strong>100 images valid for 180 days<\/strong> after activation. <\/p>\n\n\n\n<p>The service is initially available in the <strong>Singapore region<\/strong>, with a rate limit of <strong>five requests per second<\/strong> and up to <strong>two concurrent tasks per account<\/strong>.<\/p>\n\n\n\n<p>To use the API, developers must obtain a Model Studio API key and can call the model via HTTP or through the DashScope SDK in Python or Java. <\/p>\n\n\n\n<p>Images can be submitted as URLs or in Base64 format, with supported resolutions ranging from 512 to 4,096 pixels and file sizes up to 10 MB. 
Output images are hosted on Alibaba Cloud Object Storage with links valid for 24 hours, requiring users to download and save results promptly.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-s-next-for-qwen\">What\u2019s next for Qwen?<\/h2>\n\n\n\n<p>Qwen positions Image-Edit as a step <strong>toward lowering barriers for visual content creation.<\/strong> By making precise, style-consistent editing more accessible, the model <strong>could support applications from design studios to casual users refining personal projects.<\/strong><\/p>\n\n\n\n<p>The system also signals a broader trend in AI development: moving beyond single-purpose generation toward tools that integrate editing, correction, and refinement. <\/p>\n\n\n\n<p>With both semantic flexibility and appearance-level precision, Qwen-Image-Edit reflects this shift, blending the generative strengths of large models with the reliability required for professional editing.<\/p>\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. 
Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/qwen-image-edit-gives-photoshop-a-run-for-its-money-with-ai-powered-text-to-image-edits-that-work-in-seconds\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Adobe Photoshop is among the most recognizable pieces of software ever created, used by more than 90% of the world\u2019s creative professionals, according to Photutorial. So the fact that [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3235,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-3234","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/08\/tumblr_inline_syz91yFD9E1rnhd8o_500.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/3234","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post
=3234"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/3234\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/3235"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=3234"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=3234"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=3234"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}