Skip to content

Latest commit

 

History

History
18 lines (13 loc) · 1.12 KB

File metadata and controls

18 lines (13 loc) · 1.12 KB

DIONYSUS_data

Additional source data & code used to generate and analyze content of DIONYSUS database : www.dsimb.inserm.fr/DIONYSUS/.

When using please cite the following publication:

Gheeraert A, Bailly T, Ren Y, Hamraoui A, Te J, Vander Meersche Y, Cretin G, Leon Foun Lin R, Gelly J-C, Pérez S, Guyon Frédéric & Galochkina T. DIONYSUS: a database of protein-carbohydrate interfaces. Nucleic Acids Res 2024, DOI: 10.1093/nar/gkae890

Data:

  • Sequence clusters in terms of MMseqs2 score: data/binding_sites_mmseqs2_clusters.csv
  • Table downloadable from Clusters page: data/binding_sites_mmseqs2_clusters.csv
  • List of core carbohydrates: data/core_carbohydrates.json
  • List of all the carbohydrates: data/carbohydrate_components.csv
  • Pairwise scores of high-quality interfaces in terms of geometrical resemblance (source code for the comparison script was already available at www.github.com/DSIMB/CompareCBS): data/ava.score

Code:

  • Script for selection of protein residues involved in binding site formation: PDBParser.py
  • Source code for clustering algorithm used in our study hierarchical_spectral_clustering.py