Protein Language Model

27 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Datasets


Most implemented papers

PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications

ginnm/proteinpretraining 26 Oct 2023

Moreover, despite the wealth of benchmarks and studies in the natural language community, there remains a lack of a comprehensive benchmark for systematically evaluating protein language model quality.

Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model

ml4bio/uspnet 14 Dec 2023

Signal peptide (SP) is a short peptide located in the N-terminus of proteins.

CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues

beam-labs/crossbind 19 Dec 2023

Accurate identification of protein nucleic-acid-binding residues poses a significant challenge with important implications for various biological processes and drug design.

Structure-Informed Protein Language Model

deepgraphlearning/esm-s 7 Feb 2024

To address this issue, we introduce the integration of remote homology detection to distill structural information into protein language models without requiring explicit protein structures as input.

Decoupled Sequence and Structure Generation for Realistic Antibody Design

no code yet • 8 Feb 2024

Antibody design plays a pivotal role in advancing therapeutics.

On Recovering Higher-order Interactions from Protein Language Models

amirgroup-codes/interactionrecovery 15 Mar 2024

We demonstrate that ESM2 is dominated by three regions in the sparsity-ruggedness plane, two of which are better suited for sparse Fourier transforms.

Detection of circular permutations by Protein Language Models

yuehulab/plmcp 23 Apr 2024

The protein language model can help us extract structural information from sequences.

AntiFold: Improved antibody structure-based design using inverse folding

oxpig/antifold 6 May 2024

AntiFold outperforms existing inverse folding tools on sequence recovery across complementarity-determining regions, with designed sequences showing high structural similarity to their solved counterpart.