scripts.data.preprocessing.parseTERM.parseTERMdata

scripts.data.preprocessing.parseTERM.parseTERMdata(filename)[source]

Function that parses all relavent data from TERM data dumps.

Returns the sequence numerically encoded, the selection, full sequence ppoe, and all TERMs found.

Parameters:

filename (str) – path to .dat file

Returns:

output – Dictionary containing information about the dTERMen run e.g. sequence, structural information, and chain lengths, as well as a list of all data mined from TERM matches.

Return type:

dict