Alibaba releases Qwen-VLo, its latest AI image model rivaling OpenAI’s GPT-4o

Alibaba has launched a new AI image generation model called Qwen-VLo that is said to be able to understand context and generate images based on that understanding.

“Today, we are excited to introduce a new model, Qwen VLo, a unified multimodal understanding and generation model. This newly upgraded model not only ‘understands’ the world but also generates high-quality recreations based on that understanding, truly bridging the gap between perception and creation,” the company said in a blog post published on June 26.

Unlike earlier Alibaba models such as Qwen-VL, Qwen-VLo can give the user more detailed images with significantly greater accuracy. While earlier models altered unrelated details within the image when the user requested only minor changes (such as colour), Qwen-VLo is able to preserve the original structure of the image and make only the requested changes, according to the e-commerce giant.


The model is also able to understand open-ended requests, such as changing the artistic style, altering the weather, or even making the image resemble a specific time period. Alibaba also announced that the model would support multiple languages in addition to Chinese and English.

One of the model’s notable features is Multiple Image Input. The model takes existing images provided by the user, alters the text within them, and is even able to manipulate them to become part of the generated image. For instance, in an example given by the company, the user provided images of individual bathing products and a basket, then asked Qwen-VLo to put the products into the basket.

The Multiple Image Input feature in Qwen-VLo. (Image: Alibaba)

However, this feature has not been officially rolled out within the model yet.

Qwen-VLo uses dynamic resolution training, allowing the user to resize their images to the required dimensions, including aspect ratios of 1:1, 3:4, and 16:9. The model also uses a progressive top-to-bottom, left-to-right generation process, which helps in tasks requiring fine control. However, in its blog post, the company has said that the model is still in the preview stage and users might encounter errors such as inconsistency and non-compliance.


The company further theorised that, in the future, its AI models could be capable of conveying ideas and meanings through the images they create. Alibaba also proposed having the model produce segmentation and detection maps to further improve the performance of Qwen-VLo.

Widely known for its e-commerce business in China, Alibaba has thrown its hat into the AI race. The company’s CEO, Eddie Wu, even said that Alibaba is now fully focused on AI model development and aims to build AI systems with human-level intellectual capabilities.

(This article has been curated by Purv Ashar, who is an intern with The Indian Express)
