biointerface package¶
Submodules¶
biointerface.core module¶
Core module for extracting Protein-DNA interfaces.
- class biointerface.core.Interface(structure, protein_chain_id, search_radius=5.0)[source]¶
Bases:
objectExtract Protein-DNA interface.
Parameters¶
- structureBio.PDB.Structure
Biopython Structure entity.
- protein_chain_idstr
Chain id of a protein that may interact with DNA.
- search_radiusfloat | int, optional
Search radius, measured in Armstrong, within which Protein-DNA interactions are found. Default is 5.0
- get_aminoacids() list[Residue][source]¶
Get only protein residues in the protein-DNA interface.
Returns¶
- list[Residue]
List of protein reisudes in the interface.
- get_atomic_contacts() list[tuple[Atom]][source]¶
Get interface contacts as pairs of atoms.
Returns¶
- list[tuple[Atom]]
List of pairs of atoms, first one is DNA, second is proteic.
- get_bound_double_strands() list[DoubleStrandNucleicAcid][source]¶
Get all double-strand nucleic acids bound by the protein.
The output double stranded nucleic acids (DSNAs) are subsequences of the full DSNAs found in the structure, since proteins usually do not bind the whole DSNA.
This method allows for “gaps” of unbound base-pairs inside the DSNA, only the base pairs at the ends are trimmed accourding to being protein-bound or not.
- A visual example of “gaps”:
` Input full DSNA: GATATACAAGCCA Protein-bound: **** ** Output protein-bound DSNA: TATACAAG `
Returns¶
- bound_dsna_listlist[DoubleStrandNucleicAcid]
List of double-strand nucleic acids bound by the protein.
- get_dna_atoms() list[Atom][source]¶
Get only DNA atoms in the protein-DNA interface.
Returns¶
- list[Atom]
List of DNA atoms in the interface.
- get_interface_data() DataFrame[source]¶
Get all data from the interface, as a dataframe.
- Contains the following data fields:
Residue hetero field Residue number Residue insertion code Residue name Atom name Atom alternate location Atom element Atomic coordinates (x, y, z) From both protein and DNA atoms Euclidean distance
Returns¶
- dfpd.DataFrame
All data from the interface.
- biointerface.core.build_interfaces(structure, search_radius=5.0) list[source]¶
Extract all Protein-DNA interfaces found in a structure.
Parameters¶
- structureBio.PDB.Structure
Biopython Structure entity.
- search_radiusfloat | int, optional
Search radius, measured in Armstrong, within which Protein-DNA interactions are found. Default is 5.0
Returns¶
- list
List of all Protein-DNA interfaces found in a structure.
Module contents¶
Top-level package for BioInterface.
- class biointerface.Interface(structure, protein_chain_id, search_radius=5.0)[source]¶
Bases:
objectExtract Protein-DNA interface.
Parameters¶
- structureBio.PDB.Structure
Biopython Structure entity.
- protein_chain_idstr
Chain id of a protein that may interact with DNA.
- search_radiusfloat | int, optional
Search radius, measured in Armstrong, within which Protein-DNA interactions are found. Default is 5.0
- get_aminoacids() list[Residue][source]¶
Get only protein residues in the protein-DNA interface.
Returns¶
- list[Residue]
List of protein reisudes in the interface.
- get_atomic_contacts() list[tuple[Atom]][source]¶
Get interface contacts as pairs of atoms.
Returns¶
- list[tuple[Atom]]
List of pairs of atoms, first one is DNA, second is proteic.
- get_bound_double_strands() list[DoubleStrandNucleicAcid][source]¶
Get all double-strand nucleic acids bound by the protein.
The output double stranded nucleic acids (DSNAs) are subsequences of the full DSNAs found in the structure, since proteins usually do not bind the whole DSNA.
This method allows for “gaps” of unbound base-pairs inside the DSNA, only the base pairs at the ends are trimmed accourding to being protein-bound or not.
- A visual example of “gaps”:
` Input full DSNA: GATATACAAGCCA Protein-bound: **** ** Output protein-bound DSNA: TATACAAG `
Returns¶
- bound_dsna_listlist[DoubleStrandNucleicAcid]
List of double-strand nucleic acids bound by the protein.
- get_dna_atoms() list[Atom][source]¶
Get only DNA atoms in the protein-DNA interface.
Returns¶
- list[Atom]
List of DNA atoms in the interface.
- get_interface_data() DataFrame[source]¶
Get all data from the interface, as a dataframe.
- Contains the following data fields:
Residue hetero field Residue number Residue insertion code Residue name Atom name Atom alternate location Atom element Atomic coordinates (x, y, z) From both protein and DNA atoms Euclidean distance
Returns¶
- dfpd.DataFrame
All data from the interface.
- biointerface.build_interfaces(structure, search_radius=5.0) list[source]¶
Extract all Protein-DNA interfaces found in a structure.
Parameters¶
- structureBio.PDB.Structure
Biopython Structure entity.
- search_radiusfloat | int, optional
Search radius, measured in Armstrong, within which Protein-DNA interactions are found. Default is 5.0
Returns¶
- list
List of all Protein-DNA interfaces found in a structure.