Protein Language Model
27 papers with code • 1 benchmarks • 1 datasets
Most implemented papers
PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications
Moreover, despite the wealth of benchmarks and studies in the natural language community, there remains a lack of a comprehensive benchmark for systematically evaluating protein language model quality.
Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model
Signal peptide (SP) is a short peptide located in the N-terminus of proteins.
CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues
Accurate identification of protein nucleic-acid-binding residues poses a significant challenge with important implications for various biological processes and drug design.
Structure-Informed Protein Language Model
To address this issue, we introduce the integration of remote homology detection to distill structural information into protein language models without requiring explicit protein structures as input.
Decoupled Sequence and Structure Generation for Realistic Antibody Design
Antibody design plays a pivotal role in advancing therapeutics.
On Recovering Higher-order Interactions from Protein Language Models
We demonstrate that ESM2 is dominated by three regions in the sparsity-ruggedness plane, two of which are better suited for sparse Fourier transforms.
Detection of circular permutations by Protein Language Models
The protein language model can help us extract structural information from sequences.
AntiFold: Improved antibody structure-based design using inverse folding
AntiFold outperforms existing inverse folding tools on sequence recovery across complementarity-determining regions, with designed sequences showing high structural similarity to their solved counterpart.