Where do I find a function like: #11

Closed
HuXinjing opened this issue Aug 26, 2023 · 2 comments
Comments

@HuXinjing

def mem_attn_layer(Ql, Kl, Vl, Cl, Km, Vm, Kp, Vp, attn_scf, mode):

@CStanKonrad
Owner

We plan to release the official FoT large-scale continual pre-training (FoT fine-tuning) code within two weeks. It will be written in JAX. The instruction fine-tuning code does not use FoT. Strictly speaking, it uses a modified version with cross_batch=1, which is not the version used to tune the base models (see #12 for details).
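
Until the official code is available, the rough shape of a memory-style attention call like the one in the question can be sketched as follows. This is only an illustration in NumPy, not the FoT implementation and not JAX: the function name `mem_attn_sketch`, its argument names, the single-head 2-D shapes, and the choice to make the extra keys visible to every query are all assumptions made for the example. The one idea it shows is a single softmax taken jointly over causally masked local keys and an additional set of keys and values (for example from a memory or from other sequences in the batch).

```python
import numpy as np


def mem_attn_sketch(q, k_local, v_local, k_extra, v_extra, scale):
    """Hypothetical single-head attention over local plus extra keys.

    q, k_local, v_local: arrays of shape [ctx_len, dim] (local context).
    k_extra, v_extra:    arrays of shape [num_extra, dim] (assumed to be
                         memory-like entries visible to every query).
    scale:               scalar multiplied into the attention logits.
    """
    ctx_len = q.shape[0]

    # Local logits with a causal mask: position i sees positions <= i.
    local_logits = (q @ k_local.T) * scale
    causal = np.tril(np.ones((ctx_len, ctx_len), dtype=bool))
    local_logits = np.where(causal, local_logits, -np.inf)

    # Logits against the extra keys; no mask is applied here (assumption).
    extra_logits = (q @ k_extra.T) * scale

    # One softmax over the concatenation of extra and local logits.
    logits = np.concatenate([extra_logits, local_logits], axis=-1)
    logits = logits - logits.max(axis=-1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=-1, keepdims=True)

    # Values are concatenated in the same order as the logits.
    values = np.concatenate([v_extra, v_local], axis=0)
    return weights @ values
```

With an empty extra set (`num_extra = 0`) this reduces to ordinary causal self-attention, which is a convenient sanity check when experimenting with it.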

@HuXinjing
Author

Cool, looking forward to your complete code for continual pre-training of the model.
