In the Methods section of this paper, the secondary structure and block adjacency tensors are described as one-hot tensors:
an [L,4] secondary one-hot tensor (0 = α-helix, 1 = β-strand, 2 = loop and 3 = masked secondary structure identity) to indicate the secondary structure classification of each residue in the binder–target complex
an [L,L,3] adjacency one-hot tensor (0 = non-adjacent, 1 = adjacent and 2 = masked adjacency) to indicate interacting partner residues for each residue in the binder–target complex
However, the tensors in the examples/target_folds example use label encoding, not one-hot encoding. I also tried the provided script to generate these inputs, and its output is label-encoded as well. Of note, the generated secondary structure encoding is in floats rather than ints, as in the given example. Please see the output at the end.
I wonder whether the paper uses a different version of RFdiffusion. Could the run commands and the inputs for the case studies in this paper also be added? I believe a reproduction guide for this paper would greatly benefit the research community. Many thanks!
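To make the two encodings concrete, here is a minimal sketch of converting label-encoded arrays into the one-hot shapes quoted from the paper ([L,4] and [L,L,3]). It uses NumPy on toy data. The helper name `labels_to_one_hot` and the toy arrays are hypothetical, and nothing here claims to be what RFdiffusion does internally. Float labels are cast to integers first, since the generated secondary structure file stores floats.

```python
import numpy as np


def labels_to_one_hot(labels, num_classes):
    """Convert a label-encoded array into a one-hot array.

    Hypothetical helper for illustration only. A trailing axis of size
    num_classes is appended, so [L] -> [L, C] and [L, L] -> [L, L, C].
    """
    labels = np.asarray(labels)
    int_labels = labels.astype(np.int64)
    # Reject fractional values such as 1.5, which are not valid class labels.
    if not np.array_equal(int_labels, labels):
        raise ValueError("labels must be whole numbers")
    if int_labels.size and (int_labels.min() < 0 or int_labels.max() >= num_classes):
        raise ValueError("labels must lie in [0, num_classes)")
    # Indexing the identity matrix with the labels picks one row per label.
    return np.eye(num_classes, dtype=np.float32)[int_labels]


# Toy secondary structure labels stored as floats:
# 0 = helix, 1 = strand, 2 = loop, 3 = masked.
ss_labels = np.array([0.0, 0.0, 1.0, 2.0, 3.0])
ss_one_hot = labels_to_one_hot(ss_labels, num_classes=4)

# Toy adjacency labels: 0 = non-adjacent, 1 = adjacent, 2 = masked.
adj_labels = np.array([[0, 1, 2],
                       [1, 0, 0],
                       [2, 0, 0]])
adj_one_hot = labels_to_one_hot(adj_labels, num_classes=3)

print(ss_one_hot.shape, adj_one_hot.shape)
```

Taking `argmax` over the last axis recovers the label encoding, so the two forms carry the same information. The open question is only which form the inference code expects as input.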
If I understand the README and the Methods in the paper correctly, the difference is actually the following.
In the README, at the fold conditioning section with a target structure, the secondary structure and block adjacency tensors for the target and the scaffold are independently set. scaffoldguided.target_ss and scaffoldguided.target_adj are responsible for the target, while scaffoldguided.scaffold_dir has the ss and adj tensors for the scaffold. I checked in the given examples/ppi_scaffolds and found some scaffolds are shorter than the insulin_target.pdb, which means the scaffold does not include the target itself.
In the new paper, by contrast, these tensors describe the whole "binder–target complex".
This raises a few questions:
How do we set the adjacency tensor values between the binder's amino acids and the target's?
With such tensors generated, either one-hot encoding or label encoding, how do we feed them into the RFdiffusion model?
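To illustrate what the first question is asking, here is a minimal sketch of how separate binder and target label matrices could be combined into one complex-level matrix. Everything in it is an assumption: the helper name `assemble_complex_adj`, the binder-first residue ordering, and in particular the value used for the binder–target blocks, which is exactly the unknown. The sketch leaves that value as a parameter and defaults it to 2 (masked) only as a placeholder.

```python
import numpy as np

MASKED_ADJ = 2  # label for "masked adjacency" in the paper's description


def assemble_complex_adj(binder_adj, target_adj, inter_chain_value=MASKED_ADJ):
    """Combine two label-encoded [L, L] matrices into one complex matrix.

    Hypothetical helper. Assumes binder residues come first. The two
    off-diagonal (binder-target) blocks are filled with inter_chain_value,
    which is a placeholder and not a confirmed convention.
    """
    binder_adj = np.asarray(binder_adj, dtype=np.int64)
    target_adj = np.asarray(target_adj, dtype=np.int64)
    for name, mat in (("binder_adj", binder_adj), ("target_adj", target_adj)):
        if mat.ndim != 2 or mat.shape[0] != mat.shape[1]:
            raise ValueError(f"{name} must be a square 2D matrix")
    n_binder = binder_adj.shape[0]
    n_total = n_binder + target_adj.shape[0]
    # Start with every entry set to the inter-chain placeholder value.
    complex_adj = np.full((n_total, n_total), inter_chain_value, dtype=np.int64)
    # Copy each chain's own adjacency into its diagonal block.
    complex_adj[:n_binder, :n_binder] = binder_adj
    complex_adj[n_binder:, n_binder:] = target_adj
    return complex_adj


binder_adj = [[0, 1],
              [1, 0]]
target_adj = [[0, 0, 1],
              [0, 0, 1],
              [1, 1, 0]]
complex_adj = assemble_complex_adj(binder_adj, target_adj)
print(complex_adj)
```

Passing `inter_chain_value=1` for selected residue pairs instead would mark specific binder–target contacts as adjacent. Whether the model accepts that, and through which input, is what the maintainers would need to confirm.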