Membrane Protein-Lipid Interaction Database Pipeline

Automated pipeline for generating residue-level lipid contact annotations from PDB structures. The workflow queries RCSB PDB for structures containing lipid chemical components (117 recognized codes), downloads original PDB files, calculates all-atom heavy-atom contacts at 4.0 Angstrom cutoff, clusters sequences at 30% identity using MMseqs2, and partitions into train/validation/test splits. Produces a complete dataset of 4,704 proteins with 8,055,325 annotated residues.

Space: Independent Teams

SEEK ID: https://workflowhub.eu/projects/456

Public web page: https://github.com/omagebright/MPLID

Organisms: No Organisms specified

WorkflowHub PALs: No PALs for this Team

Team created: 24th May 2026

help Tags

This item has not yet been tagged.

Powered by
(v.1.17.3)
Copyright © 2008 - 2026 The University of Manchester and HITS gGmbH