Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request complete training_notebook.ipynb for v2 model training #55

Open
yf-fan-org opened this issue Jan 20, 2025 · 3 comments
Open

Request complete training_notebook.ipynb for v2 model training #55

yf-fan-org opened this issue Jan 20, 2025 · 3 comments

Comments

@yf-fan-org
Copy link

yf-fan-org commented Jan 20, 2025

@kahrendt Hi, I tested training_notebook.ipynb with tag 2.1 but it doesn't work, can you give a full ipynb, thanks a lot! (Or Can you share the commands to train the v2 model architecture, much appreciated!)

#31

I see from your previous reply that the current input of 1x5x40 is as expected? I checked the v2 model in esp and the input is 1x3x40

@yf-fan-org
Copy link
Author

I use the Master branch with the following training command:
“python -m microwakeword.model_train_eval --training_config='training_parameters.yaml' --train 1 --restore_checkpoint 1 --test_tf_nonstreaming 0 --test_tflite_nonstreaming 0 --test_tflite_streaming 0 --test_tflite_streaming_quantized 1 --use_weights "last_weights" inception --cnn1_filters '32' --cnn1_kernel_sizes '5' --cnn1_subspectral_groups '1' --cnn2_filters1 '24,24,24' --cnn2_filters2 '32,64,96' --cnn2_kernel_sizes '3,5,5' --cnn2_subspectral_groups '1,1,1' --cnn2_dilation '1,1,1' --dropout 0.8”

, and finally get the input of the model as 1x1x40, not 1x3x40 as in v2, do I need to change any parameter?

@kahrendt
Copy link
Owner

The version 2.1 tag refers to the model version, not the mWW version (this is confusing... I should make the distinction more clear). Try using basic_training_notebook.ipynb in the main repo branch. You are trying to train with the old Inception architecture, but the that notebook uses the newer, more accurate Mixednet architecture.

@yf-fan-org
Copy link
Author

The version 2.1 tag refers to the model version, not the mWW version (this is confusing... I should make the distinction more clear). Try using basic_training_notebook.ipynb in the main repo branch. You are trying to train with the old Inception architecture, but the that notebook uses the newer, more accurate Mixednet architecture.

Okay, thanks a lot. One more question, the model I trained with default parameters is not as accurate as the pre-trained model you published, do we need to debug this ourselves? Or can you also give a training configuration reference?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants