How to make the description longer and more detailed? #228

CaledoniaProject · 2025-01-31T01:06:23Z

I have a small script that captions an image:

import argparse
import glob
import os
from PIL import Image, UnidentifiedImageError
from transformers import pipeline

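# Load the BLIP large captioning model once and reuse it for every image.
captioner = pipeline("image-to-text", "Salesforce/blip-image-captioning-large")
# jpegFile is the image to caption; it is set elsewhere in the full script.
captionText = captioner(jpegFile, max_new_tokens=100)[0]["generated_text"]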

However, the generated text is only about 20 tokens long and never reaches 100 tokens. The description of the image is also not precise enough: many details are missing.

For example, this picture generates: araffe walking down the sidewalk in a city with a backpack

Many details, such as the backpack color and the background scene, are missing, even with the large model. How can I ask BLIP to be more detailed?
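
One thing that may explain the short output: `max_new_tokens` is only an upper bound, so generation still stops as soon as the model emits its end-of-sequence token. A minimal sketch of one way to push for longer captions is below, assuming the `image-to-text` pipeline's `generate_kwargs` argument is used to pass standard generation settings such as `min_new_tokens`, `num_beams`, and `repetition_penalty`. The helper names `build_generate_kwargs` and `caption_image` and the default values are hypothetical, chosen only for illustration and not tuned for this model.

```python
from typing import Any, Callable, Dict


def build_generate_kwargs(
    min_new_tokens: int = 30,
    max_new_tokens: int = 100,
    num_beams: int = 5,
    repetition_penalty: float = 1.5,
) -> Dict[str, Any]:
    """Collect generation settings that push the decoder toward longer output.

    The defaults are illustrative assumptions, not recommended values.
    """
    if min_new_tokens > max_new_tokens:
        raise ValueError("min_new_tokens must not exceed max_new_tokens")
    return {
        # Lower bound: the end-of-sequence token is blocked until this many tokens exist.
        "min_new_tokens": min_new_tokens,
        # Upper bound only: it caps the length but never forces it.
        "max_new_tokens": max_new_tokens,
        # Beam search explores several candidate captions instead of one greedy path.
        "num_beams": num_beams,
        # Values above 1.0 discourage repeating tokens when the caption is forced to be long.
        "repetition_penalty": repetition_penalty,
    }


def caption_image(captioner: Callable[..., Any], image: Any, **overrides: Any) -> str:
    """Call an image-to-text pipeline (or a compatible callable) with the settings above."""
    # All length settings go through generate_kwargs, so max_new_tokens is not passed twice.
    result = captioner(image, generate_kwargs=build_generate_kwargs(**overrides))
    return result[0]["generated_text"]
```

With the `captioner` from the script above, this could be called as `caption_image(captioner, jpegFile, min_new_tokens=40)`. Forcing a minimum length does not guarantee extra detail: the model may pad the caption with repetition instead, so the output should be checked by eye.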

CaledoniaProject changed the title ~~Make it longer~~ Make the description longer Jan 31, 2025

CaledoniaProject changed the title ~~Make the description longer~~ How to make the description longer and more detailed? Jan 31, 2025
