Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

Merged

Conversation

renxida
Copy link
Contributor

@renxida renxida commented Jan 28, 2025

Problem

Currently, prefix sharing is read as a config key from the exported-model's config.json.

There is no reason for it to be there - - this is not a property of the model, but rather a property of the server instance that's being launched.

This overcomplicates the tests: rather than just inserting an additional cmdline arg, the tests have to make an edited copy of the exported config files in order to specify the prefix-sharing algorithm.

Solution

This PR puts in a ServerParam class for storing config options that don't require re-exporting the model. Right now it just affects the prefix-sharing algorithm but this makes space for things like:

  • decoding / token search options
  • multi-token prediction

Why this?

Deepseek v3 will have many run-time options that aren't relevant at export/compile, e.g. MTP. These changes will make it easier to set those up.

@renxida renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch 2 times, most recently from fd72536 to 4a83957 Compare January 29, 2025 17:03
@renxida renxida marked this pull request as ready for review January 29, 2025 17:40
@renxida renxida requested a review from stbaione January 29, 2025 18:16
@renxida renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch from 197fa67 to 2d3d198 Compare January 29, 2025 19:10
@renxida renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch from 2d3d198 to 2e2911b Compare January 29, 2025 21:27
@renxida renxida requested a review from rsuderman January 29, 2025 21:36
@renxida renxida changed the title Move prefix-sharing algorithm out of model config and into server config through cmdline args. Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py Jan 29, 2025
@renxida renxida merged commit 2e27a97 into nod-ai:main Jan 29, 2025
36 of 37 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants