Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increase corpus size by 90x #270

Merged
merged 1 commit into from
Feb 5, 2025
Merged

Increase corpus size by 90x #270

merged 1 commit into from
Feb 5, 2025

Conversation

dyastremsky
Copy link
Contributor

@dyastremsky dyastremsky commented Feb 5, 2025

As more users use GenAI-Perf to benchmark at the datacenter scale, increase the corpus size from 20k to 1.8M tokens (sample tokenization scheme: GPT2). This adds a modest overhead of .5s. This changes the corpus from all of Shakespeare's sonnets to his full set of works.

Also, add a warning if the number of tokens requested exceeds the corpus size.

New corpus size:
image

Pre-update time for 10 prompts with 1000 tokens:
image

Post-update time for 10 prompts with 1000 tokens:
image

Pre-update time for 100 prompts with 10000 tokens:
image

Post-update time for 100 prompts with 10000 tokens:
image

Warning when exceeding corpus size:
image

Remove unused logging statement
@dyastremsky dyastremsky self-assigned this Feb 5, 2025
@dyastremsky dyastremsky changed the title Increase corpus size Increase corpus size by 90x Feb 5, 2025
ganeshku1
ganeshku1 previously approved these changes Feb 5, 2025
@dyastremsky dyastremsky merged commit 3416d4f into main Feb 5, 2025
6 checks passed
@dyastremsky dyastremsky deleted the dyas-larger-corpus branch February 5, 2025 01:09
cquil11 pushed a commit to cquil11/perf_analyzer that referenced this pull request Feb 5, 2025
cquil11 pushed a commit to cquil11/perf_analyzer that referenced this pull request Feb 5, 2025
lkomali pushed a commit that referenced this pull request Feb 5, 2025
Remove unused logging statement
tedzhouhk pushed a commit that referenced this pull request Feb 6, 2025
Remove unused logging statement
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants