Sentence AutoEncoder using Keras
The text preprocessing filters can be edited in the function
class TextProcessor:
def get_word_list(self,sent):
<
return < list of word >
Dataset can be easily loaded from
-
List of Files
tokenizer = Tokenizer() data = tokenizer.process_text(file_names= <list of : filenames> )
-
From a large text
tokenizer = Tokenizer() data = tokenizer.process_text(text=<str : text>)
-
Using a custom Function
You can make your function to use Tokenizer object
encoder = keras.models.load_model("sentence_encoder.h5")
encoder.predict(<(batch_length,seq_length)>)