terminator.models.layers.term.graph.s2s.TERMNodeTransformerLayer¶

class terminator.models.layers.term.graph.s2s.TERMNodeTransformerLayer(num_hidden, num_in, num_heads=4, dropout=0.1)[source]¶

Bases: Module

TERM Node Transformer Layer

A TERM Node Transformer Layer that updates nodes via TERMNeighborAttention

Variables:

__init__(num_hidden, num_in, num_heads=4, dropout=0.1)[source]¶

Parameters:

num_hidden (int) – Hidden dimension, and dimensionality of querys in TERMNeighborAttention
num_in (int) – Dimensionality of keys and values
num_heads (int, default=4) – Number of heads to use in TERMNeighborAttention
dropout (float, default=0.1) – Dropout rate

Methods

__init__(num_hidden, num_in[, num_heads, ...])

Parameters:

num_hidden (int) -- Hidden dimension, and dimensionality of querys in TERMNeighborAttention

forward(h_V, h_VE[, mask_V, mask_attend])

Apply one Transformer update on nodes in a TERM graph

Attributes

`T_destination`	alias of TypeVar('T_destination', bound=`Mapping`[`str`, `Tensor`])
`dump_patches`	This allows better BC support for `load_state_dict()`.

forward(h_V, h_VE, mask_V=None, mask_attend=None)[source]¶

Apply one Transformer update on nodes in a TERM graph

Parameters:

h_V (torch.Tensor) – Central node features Shape: n_batch x n_terms x n_nodes x n_hidden
h_VE (torch.Tensor) – Neighbor features, which includes the node vector concatenated onto the edge connecting the central node to the neighbor node Shape: n_batch x n_terms x n_nodes x n_neighbors x n_in
mask_V (torch.ByteTensor or None) – Mask for attention regarding TERM residues Shape : n_batch x n_terms x n_nodes
mask_attend (torch.ByteTensor or None) – Mask for attention regarding neighbors Shape: n_batch x n_terms x n_nodes x k

Returns:

h_V – Updated node embeddings Shape: n_batch x n_terms x n_nodes x n_hidden

Return type:

torch.Tensor