terminator.data.data.load_file

terminator.data.data.load_file(in_folder, pdb_id, min_protein_len=30)[source]

Load the data specified in the proper .features file and return them. If the read sequence length is less than min_protein_len, instead return None.

Parameters:
  • in_folder (str) – folder to find TERM file.

  • pdb_id (str) – PDB ID to load.

  • min_protein_len (int) – minimum cutoff for loading TERM file.

Returns:

  • data (dict) – Data from TERM file (as dict)

  • total_term_len (int) – Sum of lengths of all TERMs

  • seq_len (int) – Length of protein sequence