Chinese technology company ByteDance has launched a new multimodal artificial intelligence (AI) model named Bagel. It is a vision-language model (VLM) that can understand images as well as generate and edit them. Notably, the company has released it as open source, and it can be downloaded from popular AI platforms such as GitHub and Hugging Face.

Features of Bagel

Multimodal input: Bagel can understand and process text and images together.

14 billion parameters: Only 7 billion of these are active at a time.

Interleaved training data: The model was trained on text and images together, allowing Bagel to make better connections between the two.

Advanced image editing capabilities

ByteDance claims that Bagel edits images better than other existing open-source VLMs. It can handle tasks such as adding emotion to an image; removing, changing, or adding an element; style transfer; and free-form editing, that is, making changes that are not restricted to a predefined framework.

Also capable of world modeling

Bagel has been trained to understand the visual world, including the relationships between objects and the effects of physical factors such as light and gravity. ByteDance says that in its internal tests, Bagel surpassed Qwen2.5-VL-7B in image understanding, Janus-Pro-7B and Flux-1-dev in image generation, and Gemini-2-exp in image editing on the GEdit-Bench test.

