Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Load the pre-trained neural network model and save the trained model. #104

Open
beomyeol opened this issue Sep 1, 2015 · 4 comments
Open

Comments

@beomyeol
Copy link
Contributor

beomyeol commented Sep 1, 2015

dolphin DNN has not supported the feature of loading and saving the neural network model.
We discussed this feature in #77. I'd like to continue our discussion such as how to implement this and how to load and save the model.

@dongjoon-hyun
Copy link
Contributor

I think we can implement saving the model by using 'output' service.
For loading, I have no idea at all so far.

@beomyeol
Copy link
Contributor Author

beomyeol commented Sep 1, 2015

As we discussed in #77, we can implement this by saving parameter of each layer and loading the saved parameter using parameter initializer.

We can save the parameters in Numpy compatible plain text file format like we are using this format as input matrix format. The layer parameter consists of the weight matrix and the bias vector. These can be put into a single matrix. However, in some case, bias vector may not be used. So, I think we can introduce another delimiter to distinguish weight matrix and bias vector explicitly. (The delimiter must be different to one that is used as the delimiter of the matrix.)

@dongjoon-hyun
Copy link
Contributor

Do you think each layer save and load its own file, e.g. layer1.txt? If we can handle separate files for each layers, why don't we split them into two subfiles? e.g. layer1.weight.txt, layer1.bias.txt? It's just question, @beomyeol .

@beomyeol
Copy link
Contributor Author

beomyeol commented Sep 2, 2015

@dongjoon-hyun Thank you for your idea. It will be better if we can handle multiple files. With this, we can rule out the possibility of the problem when the delimiter for matrices is same to the one for parameters. :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants