You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to serialise a object of ExLlamaV2TokenEnforcerFilter class to cache it for future use, but it seems to be not possible. Has anyone of you achieved this?
The text was updated successfully, but these errors were encountered:
the class TokenEnforcerTokenizerData (which can be given to the ExLlamaV2TokenEnforcerFilter constructor instead of ExLlamaV2Tokenizer) is cacheable, and will provide the fast initialization which you are likely trying to achieve. Let me know if this works, or if it doesn't cover your usecase.
tokenizer_data = build_token_enforcer_tokenizer_data(tokenzier)
# save for caching, and load next time
filter = ExLlamaV2TokenEnforcerFilter(parser, tokenizer_data)
I am trying to serialise a object of ExLlamaV2TokenEnforcerFilter class to cache it for future use, but it seems to be not possible. Has anyone of you achieved this?
The text was updated successfully, but these errors were encountered: