terminator.utils.model.default_hparams¶

Default hyperparameter set for TERMinator

Parameters:

matches (str, default=’transformer’) – How to processes singleton statistics.

Options

‘resnet’
process using a convolutional ResNet

‘transformer’
process using MatchAttention

‘ablate’
perform no processing
term_hidden_dim (int, default=32) – Hidden dimensionality for TERM Information Condenser (e.g. net1, self.bot, or CondenseMSA variant)
enrgies_hidden_dim (int, default=32) – Hidden dimensionality for GNN Potts Model Encoder (e.g. net2, self.top, or PairEnergies variant)
gradient_checkpointing (bool, default=True) – Enable gradient checkpointing at most memory-intensive steps (currently, MatchAttention)
cov_features (str, default=’all_raw’) – What features to include for covariance matrix computation. See terminator.models.layers.condense.EdgeFeatures for more information.

Options

‘shared_learned’:
use those produced by the ResidueFeatures module.

‘all_raw’:
concatenate a one-hot encoding of residue identity and the fed-in additional features.

‘all_learned’:
‘all_raw’, but fed through a dense layer.

‘aa_learned’:
use an embedding matrix for residue identity

‘aa_counts’:
use a one-hot encoding of residue identity

‘cnn’:
use a 2D convolutional neural net on fed-in matches (WARNING: not tested due to extreme memory consumption)
cov_compress (str, default=’ffn’) – The method the covariance matrix is compressed into a vector.

Options

‘ffn’
Use a 2-layer dense network

‘project’
Use a linear transformation

‘ablate’
Use a 0 vector
num_pair_stats (int, default=28) – [DEPRECIATED] Number of precomputed pairwise match statistics fed into TERMinator
num_sing_stats (int, default=0) – [DEPRECIATED] Number of precomputed singleton match statistics fed into TERMinator
resnet_blocks (int, default=4) – Number of ResNet blocks to use if matches='resnet'
term_layers (int, default=4) – Number of TERM MPNN layers to use.
term_heads (int, default=4) – Number of heads to use in TERMAttention if term_use_mpnn=False
conv_filter (int, default=3) – Length of convolutional filter if code:matches=’resnet’
matches_layers (int, default=4) – Number of Transformer layers to use in MatchesCondensor if matches='transformer'
matches_num_heads (int, default=4) – Number of heads to use in MatchAttention if matches='transformer'
k_neighbors (int, default=30) – What k is for kNN computation
k_cutoff (int, default=None) – When outputting a kNN potts model, take the top k_cutoff edges and output the truncated etab
contact_idx (bool, default=True) – Whether or not to include contact indices in computation
cie_dropout (float, default=0.1) – Dropout rate for sinusoidal encoding of contact index
cie_scaling (int, default=500) – Multiplicative factor by which to scale contact indices
cie_offset (int, default=0) – Additive factor by which to offset contact indices
transformer_dropout (float, default=0.1) – Dropout rate for Transformers used in the TERM Information Condensor
term_use_mpnn (bool, default=True) – If set to True, use a feedforward network to compute TERM graph messages. Otherwise, update TERM graph representations using an Attention-based mechanism.
energies_protein_features (str, default=’full’) – Feature set for coordinates fed into the GNN Potts Model Encoder
energies_augment_eps (float, default=0) – Scaling factor for Gaussian noise added to coordinates before featurization
energies_encoder_layers (int, default=6) – Number of {node_update, edge_update} layers to include in the GNN Potts Model Encoder
energies_dropout (float, default=0.1) – Dropout rate in the GNN Potts Model Encoder
energies_use_mpnn (bool, default=False) – If set to True, use a feedforward network to compute kNN graph messages. Otherwise, update kNN graph representations using an Attention-based mechanism.
energies_output_dim (int, default=400) – Output dimension of GNN Potts Model Encoder
energies_geometric (bool, default=False) – Use Torch Geometric version of GNN Potts Model Encoder instead
energies_gvp (bool, default=False) – Use GVP version of GNN Potts Model Encoder instead
energies_full_graph (bool, default=True) – [DEPRECIATED] Update both node and edge representations in the GNN Potts Model Encoder. GNN Potts Model Encoder always updates node and edge representations now, making this option do nothing.
res_embed_linear (bool, default=False) – Replace the singleton matches residue embedding layer with a linear layer.
matches_linear (bool, default=False) – Remove the Matches Condensor
term_mpnn_linear (bool, default=False) – Remove the TERM MPNN
struct2seq_linear (bool, default=False) – Linearize the GNN Potts Model Encoder
use_terms (bool, default=True) – Whether or not to use the TERM Information Condensor / net1
term_matches_cutoff (int or None, default=None) – Use the top term_matches_cutoff TERM matches for featurization. If None, apply no cutoff.
test_term_matches_cutoff (int, optional) – Apply a different term_matches_cutoff for validation/evaluation
use_coords (bool, default=True) – Whether or not to use coordinate-based features in the GNN Potts Model Encoder
train_batch_size (int or None, default=16) – Batch size for training
shuffle (bool, default=True) – Whether to do a complete shuffle of the data
sort_data (bool, default=False) – Create deterministic batches by sorting the data according to the specified length metric and creating batches from the sorted data. Incompatible with shuffle=True and semi_shuffle=True.
shuffle (bool, default=True) – Shuffle the data completely before creating batches. Incompatible with sort_data=True and semi_shuffle=True.
semi_shuffle (bool, default=False) – Sort the data according to the specified length metric, then partition the data into semi_shuffle_cluster_size-sized partitions. Within each partition perform a complete shuffle. The upside is that batching with similar lengths reduces padding making for more efficient computation, but the downside is that it does a less complete shuffle.
regularization (float, default=0) – Amount of L2 regularization to apply to the internal Adam optimizer
max_term_res (int or None, default=55000) – When train_batch_size=None, max_term_res>0, max_seq_tokens=None, batch by fitting as many datapoints as possible with the total number of TERM residues included below max_term_res. Calibrated using nn.DataParallel on two V100 GPUs.
max_seq_tokens (int or None, default=None) – When train_batch_size=None, max_term_res=None, max_seq_tokens>0, batch by fitting as many datapoints as possible with the total number of sequence residues included below max_seq_tokens.
term_dropout (str or None, default=None) – Let t be the number of TERM matches in the given datapoint. Select a random int n from 1 to t, and take a random subset n of the given TERM matches to keep. If term_dropout='keep_first', keep the first match and choose n-1 from the rest. If term_dropout='all', choose n matches from all matches.
num_features (int, default=9) – The number of non-sequence TERM-based features included per TERM residue.
loss_config (dict of str (loss component) -> float (scaling factor)) – Dictionary that describes how to construct a loss function. An example dictionary follows: .. code-block :

{
‘nlcpl’: 1, ‘etab_norm_penalty’: 0.01

}
finetune (bool) – Whether or not to train the model in finetuning mode (i.e. freezing all weights but the output layer)