Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

renxida · 2025-01-28T23:50:42Z

Problem

Currently, prefix sharing is read as a config key from the exported-model's config.json.

There is no reason for it to be there - - this is not a property of the model, but rather a property of the server instance that's being launched.

This overcomplicates the tests: rather than just inserting an additional cmdline arg, the tests have to make an edited copy of the exported config files in order to specify the prefix-sharing algorithm.

Solution

This PR puts in a ServerParam class for storing config options that don't require re-exporting the model. Right now it just affects the prefix-sharing algorithm but this makes space for things like:

decoding / token search options
multi-token prediction

Why this?

Deepseek v3 will have many run-time options that aren't relevant at export/compile, e.g. MTP. These changes will make it easier to set those up.

renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch 2 times, most recently from fd72536 to 4a83957 Compare January 29, 2025 17:03

renxida marked this pull request as ready for review January 29, 2025 17:40

renxida requested a review from stbaione January 29, 2025 18:16

renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch from 197fa67 to 2d3d198 Compare January 29, 2025 19:10

renxida added 6 commits January 29, 2025 13:27

refacotring a bunch of things

c98029b

missing import

7aba702

remove residual arg

16862a6

linting changes

3dd5760

move config structs together

fd2c50d

reorganize ServerParams to merge into config_struct.py

2e2911b

renxida force-pushed the move-prefix-sharing-algo-out-of-model-config branch from 2d3d198 to 2e2911b Compare January 29, 2025 21:27

renxida requested a review from rsuderman January 29, 2025 21:36

renxida added 2 commits January 29, 2025 21:56

docstring improvements

c75be75

simplify ServerParams config struct

854c933

renxida changed the title ~~Move prefix-sharing algorithm out of model config and into server config through cmdline args.~~ Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py Jan 29, 2025

comments and docstring improvements

9cc707f

stbaione approved these changes Jan 29, 2025

View reviewed changes

renxida merged commit 2e27a97 into nod-ai:main Jan 29, 2025
36 of 37 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

renxida commented Jan 28, 2025 •

edited

Loading

Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

Move server-specific config args out of ModelParams and into new ServerParams in config_struct.py #877

Conversation

renxida commented Jan 28, 2025 • edited Loading

Problem

Solution

Why this?

renxida commented Jan 28, 2025 •

edited

Loading