Unfair comparison between ProtBert and ESM #9

ww-rm · 2024-02-29T11:07:50Z

In ProtTrans, the authors state:

No auxiliary tasks like BERT's next-sentence prediction were used for any model described here.

In PEER, however, ProtBert's [CLS] token is used as the protein-level embedding. Since ProtBert was pretrained without any sequence-level objective, the [CLS] token may not have learned a meaningful representation of the whole sequence.

For ProtBert, should we use the same strategy as for ESM (i.e., mean pooling over all residues) to get a fairer comparison?
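
To make the two strategies concrete, here is a minimal pure-Python sketch of [CLS] pooling versus masked mean pooling. It is not taken from the PEER or ProtTrans code: the function names `cls_pool` and `mean_pool`, the toy values, and the token layout ([CLS], residues, [SEP], padding) are assumptions for illustration only. In practice the same operations would be applied to the model's output tensors.

```python
from typing import List, Sequence

Vector = List[float]


def cls_pool(hidden: Sequence[Sequence[float]]) -> Vector:
    """Return the vector at position 0, assumed here to be the [CLS] token."""
    return list(hidden[0])


def mean_pool(hidden: Sequence[Sequence[float]], residue_mask: Sequence[int]) -> Vector:
    """Average the hidden vectors at positions where residue_mask is 1.

    Special tokens and padding are expected to carry a 0 in the mask.
    """
    if len(hidden) != len(residue_mask):
        raise ValueError("hidden and residue_mask must have the same length")
    selected = [vec for vec, keep in zip(hidden, residue_mask) if keep]
    if not selected:
        raise ValueError("residue_mask selects no positions")
    dim = len(selected[0])
    return [sum(vec[d] for vec in selected) / len(selected) for d in range(dim)]


# Toy input (hypothetical values): [CLS], three residues, [SEP], one padding slot.
hidden = [
    [9.0, 9.0],  # [CLS]
    [1.0, 2.0],  # residue 1
    [3.0, 4.0],  # residue 2
    [5.0, 6.0],  # residue 3
    [0.0, 0.0],  # [SEP]
    [7.0, 7.0],  # padding
]
residue_mask = [0, 1, 1, 1, 0, 0]

cls_vec = cls_pool(hidden)                  # depends only on position 0
mean_vec = mean_pool(hidden, residue_mask)  # depends only on the residues
```

The point of the mask is that both models would be summarized from the same set of positions (the residues), so any difference in downstream performance is less likely to come from the pooling choice itself.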
