Some badcase. #9

ApolloRay · 2024-10-30T02:51:59Z

I tried to use models to infer some cases, but found that the model's handling of details is not very good. For example, for this link http, my prompt is “When was the goal scored in the game and provide a specific match time“, but the output result is "The goal was scored towards the end of the match, specifically at a match time of 86.42". Thank you for using clip for encoding, the effect loss is still quite significant.

shuyansy · 2024-10-30T14:12:27Z

I acknowledge the current version of VideoXL still holds limited capacity in some domains. As for the case you provided, it is weak in video text recognition and sports event spotting. We will add more data to improve its ability in the future.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some badcase. #9

Some badcase. #9

ApolloRay commented Oct 30, 2024

shuyansy commented Oct 30, 2024

Some badcase. #9

Some badcase. #9

Comments

ApolloRay commented Oct 30, 2024

shuyansy commented Oct 30, 2024