New name suggestion notebook using tensor2tensor #8

bzz · 2019-12-16T13:51:45Z

Previous name suggestion notebook using OpenNMT-tf + youtokenme for BPE was overexposing the accidental complexity of "tenzorisation" of the source code.

This uses https://github.com/tensorflow/tensor2tensor as a library to archive the same, and thus also works on TPU with Colab.

This version is not the final, workshop-ready one but rather an intermediate one that was used on Colab.

We agreed that the scarce time of workshop prep would better be spent not improving this one, but rather including a pyTourch version instead, where it should be easy to incorporate custom models e.g based on GGNN, and contrast the results to seq2seq ones.

@m09 Please, review the structure of the notebook though - I'm planing to reuse it for a new version, so any methodological feedback on it would be very appreciated.

Signed-off-by: Alexander Bezzubov <[email protected]>

review-notebook-app · 2019-12-16T13:51:51Z

Check out this pull request on

You'll be able to see Jupyter notebook diff and discuss changes. Powered by ReviewNB.

bzz · 2019-12-16T14:13:36Z

I figured that reviewing ToC in a notebook diff is not easy, so here it is, the collapsable cell structure with titles:

And here it is customized for a new workshop, and what we discussed already:

Data: exploration
Data: problem definition for CodeSearchNet dataset
Data: generate tensor representation
Data: inspect
  Plot subtoken sequence lengths
Train the model
Visualize the training
Inference using trained model
Visualize the attention
  Attention Utils
  Display Attention
Serving: export the model
Serving: serve predictions over HTTP
Interactive predictions (local webapp)
Compare the results with the literature
Change the model to GGNN (advanced)

bzz · 2019-12-16T14:23:04Z

Experiment results

bzz · 2020-01-29T12:49:21Z

Shall we merge this?

m09 · 2020-01-31T16:59:36Z

Sorry @bzz for the lack of review, I had in mind that it was WIP for some reason even though you asked for review super long time ago, my bad. I think we can merge as is: it seems to need a rebase against the new docker image but so do the other notebooks so we can take care of that in a future PR. As for the experimental plan, we can also discuss it together with the PyTorch version and come to a parallel plan that will be great for both, using this one as the starting point (it seems very good to me).

names: new t2t notebook

355b5a2

Signed-off-by: Alexander Bezzubov <[email protected]>

m09 force-pushed the master branch 3 times, most recently from 3237d51 to 4d66d44 Compare January 27, 2020 13:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New name suggestion notebook using tensor2tensor #8

New name suggestion notebook using tensor2tensor #8

bzz commented Dec 16, 2019

review-notebook-app bot commented Dec 16, 2019

bzz commented Dec 16, 2019 •

edited

Loading

bzz commented Dec 16, 2019

bzz commented Jan 29, 2020

m09 commented Jan 31, 2020

New name suggestion notebook using tensor2tensor #8

Are you sure you want to change the base?

New name suggestion notebook using tensor2tensor #8

Conversation

bzz commented Dec 16, 2019

review-notebook-app bot commented Dec 16, 2019

bzz commented Dec 16, 2019 • edited Loading

bzz commented Dec 16, 2019

Experiment results

bzz commented Jan 29, 2020

m09 commented Jan 31, 2020

bzz commented Dec 16, 2019 •

edited

Loading