Chinese technology company ByteDance has recently launched its new multimodal artificial intelligence (AI) model, named Bagel. It is a visual language model (VLM) that can not only understand pictures but can also generate and edit them. The biggest thing is that the company has made it open-source and now it can be downloaded from popular AI platforms like GitHub and Hugging Face.

Features of Bagel

Multimodal input: Capable of understanding and processing both text and images simultaneously.

14 billion parameters: 7 billion of which are active at a time.

Interleaved training data: Text and images were trained together, allowing Bagel to make better connections between the two.

Advanced image editing capabilities

ByteDance claims that Bagel does better image editing than other existing open-source VLMs. It can easily do things like adding emotions to the image, removing, changing, or adding an element, style transfer, and free-form editing, i.e. making changes without any limited framework.

Also capable of world modeling

Bagel has been trained in such a way that it can understand the world in visual form - such as the relationship between objects, the effect of natural factors like light or gravity, etc. ByteDance says that in their internal tests, Bagel has surpassed Qwen2.5-VL-7B (better in understanding images), Janus-Pro-7B and Flux-1-dev (better in image generation), Gemini-2-exp (better performance in image editing in GEdit-Bench test) AI models.

PC Social media

Read more
Anees Bazmee shares 14-year-old BTS pictures of Salman Khan-starrer 'Ready': "When I look back... I don't just see a film"
Newspoint
Ramones fans floored to discover what band's name actually means
Newspoint
'This brightening vitamin C serum is the only one that doesn't irritate my problem skin'
Newspoint
5.8-magnitude earthquake shakes Turkish Mediterranean coast, injuring 7 people
Newspoint
Virat Kohli In Big Bash League? All You Need To Know
Abplive
Telangana To Develop 80-Acre Eco-Town Inspired By Japan’s Kitakyushu, Eyes Green Growth
Abplive
Musk launches new chatting feature Xchat, WhatsApp’s increased tension
Tezzbuzz
RCB fans increased before the final
Tezzbuzz
RCB vs PBKS: Final match will be held on this pitch, know who will benefit?
Tezzbuzz
World’s tallest railway arch bridge over Chenab set for inauguration by PM Modi on June 6
Tezzbuzz