Chinese technology company ByteDance has recently launched its new multimodal artificial intelligence (AI) model, named Bagel. It is a visual language model (VLM) that can not only understand pictures but can also generate and edit them. The biggest thing is that the company has made it open-source and now it can be downloaded from popular AI platforms like GitHub and Hugging Face.

Features of Bagel

Multimodal input: Capable of understanding and processing both text and images simultaneously.

14 billion parameters: 7 billion of which are active at a time.

Interleaved training data: Text and images were trained together, allowing Bagel to make better connections between the two.

Advanced image editing capabilities

ByteDance claims that Bagel does better image editing than other existing open-source VLMs. It can easily do things like adding emotions to the image, removing, changing, or adding an element, style transfer, and free-form editing, i.e. making changes without any limited framework.

Also capable of world modeling

Bagel has been trained in such a way that it can understand the world in visual form - such as the relationship between objects, the effect of natural factors like light or gravity, etc. ByteDance says that in their internal tests, Bagel has surpassed Qwen2.5-VL-7B (better in understanding images), Janus-Pro-7B and Flux-1-dev (better in image generation), Gemini-2-exp (better performance in image editing in GEdit-Bench test) AI models.

PC Social media

Read more
'Housefull 5' director Tarun Mansukhani reacts to Deepika Padukone's alleged 8-day work demand: “You can't come to me at 11 am and say 'I need to leave at 5'
Newspoint
'Fake tans make me look orange– but this cool toned one promises sunkissed results for pale skin'
Newspoint
Amazon flash sale slashes price of a Nintendo Switch by £60 less than full price
Newspoint
From matcha runs to cookie tasting: Things to do in UAE from June 6-8
Newspoint
Anupamaa: Anu to get bullied while Prem to cheat on Rahi
Newspoint
Arun Govil questions modern-day Ramayan adaptations: 'None of the stars today fit the part'
Newspoint
Damascus faces quiet Eid as economic struggles persist
Newspoint
Amd acquires Brona to Supercharge Open AI Innovation in 2025
Tezzbuzz
Elon Musk’s Starlink prepared for entry in India, how much will be the monthly expenses of satellite internet?
Tezzbuzz
Google Deepmind Working on Ai To Simplife Email: Ceo Demis Hassabis
Tezzbuzz