transformer-implementation: Implementing a Transformer from scratch with TensorFlow, as proposed in the paper 'Attention Is All You Need'.