ARTIFICIAL INTELLIGENCE | VISUAL LANGUAGE MODEL

BLIP-2: when ChatGPT meets images

BLIP-2, a new visual language model capable to dialogue about images

Salvatore Raieli
Level Up Coding
Published in
11 min readMar 8, 2023

--

salesforce BLIP-2 visual language model
Image by the author using OpenAI DALL-E

ChatGPT shocked the world with its ability to converse naturally. However, ChatGPT cannot see. is it possible to have a model that can read an image and discuss it with a user?

--

--

Senior data scientist | about science, machine learning, and AI. Top writer in Artificial Intelligence