BLIP

Gradio demo for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation (Salesforce Research). To use it, simply upload your image, or click one of the examples to load them. Read more at the links below.

Task
Caption Decoding Strategy
Examples
raw_image Task Question Caption Decoding Strategy