neuralogic.nn packageο
Subpackagesο
- neuralogic.nn.module package
- Subpackages
- neuralogic.nn.module.general package
- Submodules
- neuralogic.nn.module.general.attention module
- neuralogic.nn.module.general.gru module
- neuralogic.nn.module.general.linear module
- neuralogic.nn.module.general.lstm module
- neuralogic.nn.module.general.mlp module
- neuralogic.nn.module.general.pooling module
- neuralogic.nn.module.general.positional_encoding module
- neuralogic.nn.module.general.rnn module
- neuralogic.nn.module.general.rvnn module
- neuralogic.nn.module.general.transformer module
- Module contents
- neuralogic.nn.module.gnn package
- Submodules
- neuralogic.nn.module.gnn.appnp module
- neuralogic.nn.module.gnn.gatv2 module
- neuralogic.nn.module.gnn.gcn module
- neuralogic.nn.module.gnn.gen module
- neuralogic.nn.module.gnn.gin module
- neuralogic.nn.module.gnn.gine module
- neuralogic.nn.module.gnn.gsage module
- neuralogic.nn.module.gnn.res_gated module
- neuralogic.nn.module.gnn.rgcn module
- neuralogic.nn.module.gnn.sg module
- neuralogic.nn.module.gnn.tag module
- Module contents
- neuralogic.nn.module.meta package
- neuralogic.nn.module.general package
- Submodules
- neuralogic.nn.module.module module
- Module contents
APPNPConvAttentionAvgPoolingGATv2ConvGCNConvGENConvGINConvGINEConvGRULSTMLinearMAGNNLinearMAGNNMeanMLPMaxPoolingMetaConvModuleMultiheadAttentionPoolingPositionalEncodingRGCNConvRNNResGatedGraphConvRvNNSAGEConvSGConvSumPoolingTAGConvTransformerTransformerDecoderTransformerEncoder
- Subpackages
- neuralogic.nn.optim package
- neuralogic.nn.trainer package
- Submodules
- neuralogic.nn.trainer.callbacks module
- neuralogic.nn.trainer.helpers module
- neuralogic.nn.trainer.history module
TrainerHistoryTrainerHistory.train_lossesTrainerHistory.val_lossesTrainerHistory.train_metricsTrainerHistory.val_metricsTrainerHistory.learning_ratesTrainerHistory.best_epochTrainerHistory.best_val_lossTrainerHistory.stopped_earlyTrainerHistory.best_epochTrainerHistory.best_val_lossTrainerHistory.learning_ratesTrainerHistory.stopped_earlyTrainerHistory.train_lossesTrainerHistory.train_metricsTrainerHistory.val_lossesTrainerHistory.val_metrics
- neuralogic.nn.trainer.metrics module
- neuralogic.nn.trainer.trainer module
- Module contents
CheckpointCallbackEarlyStoppingCallbackMetricProgressCallbackTrainerTrainerCallbackTrainerHistoryTrainerHistory.train_lossesTrainerHistory.val_lossesTrainerHistory.train_metricsTrainerHistory.val_metricsTrainerHistory.learning_ratesTrainerHistory.best_epochTrainerHistory.best_val_lossTrainerHistory.stopped_earlyTrainerHistory.best_epochTrainerHistory.best_val_lossTrainerHistory.learning_ratesTrainerHistory.stopped_earlyTrainerHistory.train_lossesTrainerHistory.train_metricsTrainerHistory.val_lossesTrainerHistory.val_metrics
compute_metrics()
Submodulesο
neuralogic.nn.init moduleο
- class Constant(value: float = 0.1)[source]ο
Bases:
InitializerInitializes learnable parameters with the
value.- Parameters:
value (float) β Value to fill weights with. Default:
0.1
- class Glorot(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with samples from a uniform distribution (from the interval
[-scale / 2, scale / 2]) using the Glorot method.- Parameters:
scale (float) β Scale of a uniform distribution interval
[-scale / 2, scale / 2]. Default:2
- class He(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with samples from a uniform distribution (from the interval
[-scale / 2, scale / 2]) using the He method.- Parameters:
scale (float) β Scale of a uniform distribution interval
[-scale / 2, scale / 2]. Default:2
- class InitializerNames[source]ο
Bases:
object- CONSTANT = 'CONSTANT'ο
- GLOROT = 'GLOROT'ο
- HE = 'HE'ο
- LONGTAIL = 'LONGTAIL'ο
- NORMAL = 'NORMAL'ο
- UNIFORM = 'UNIFORM'ο
- class Longtail[source]ο
Bases:
InitializerInitializes learnable parameters with random samples from a long tail distribution
- class Normal[source]ο
Bases:
InitializerInitializes learnable parameters with random samples from a normal (Gaussian) distribution
- class Uniform(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with random uniformly distributed samples from the interval
[-scale / 2, scale / 2].- Parameters:
scale (float) β Scale of the distribution interval
[-scale / 2, scale / 2]. Default:2
neuralogic.nn.loss moduleο
- class CrossEntropy(with_logits: bool = True)[source]ο
Bases:
ErrorFunctionCross Entropy loss function. Suitable for classification tasks.
- class ErrorFunction[source]ο
Bases:
objectBase class for error (loss) functions in the neural network.
- class ErrorFunctionNames[source]ο
Bases:
object- CROSSENTROPY = 'CROSSENTROPY'ο
- MSE = 'SQUARED_DIFF'ο
- SOFTENTROPY = 'SOFTENTROPY'ο
- class MSE[source]ο
Bases:
ErrorFunctionMean Squared Error (SQUARED_DIFF) loss function. Suitable for regression tasks.
- class SoftEntropy[source]ο
Bases:
ErrorFunctionSoft Entropy loss function. Similar to Cross Entropy but usually applied with a soft layer at the end.
neuralogic.nn.torch_function moduleο
Module contentsο
Neural network building blocks β modules, losses, initializers, and optimizers.
All primary symbols are importable directly from neuralogic.nn:
from neuralogic.nn import GCNConv, LSTM, MSE, Uniform, Adam
- class APPNPConv(output_name: str, feature_name: str, edge_name: str, k: int, alpha: float, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleApproximate Personalized Propagation of Neural Predictions layer from βPredict then Propagate: Graph Neural Networks meet Personalized PageRankβ. Which can be expressed as:
\[\mathbf{x}^{0}_i = \mathbf{x}_i\]\[\mathbf{x}^{k}_i = \alpha \cdot \mathbf{x}^0_i + (1 - \alpha) \cdot {agg}_{j \in \mathcal{N}(i)}(\mathbf{x}^{k - 1}_j)\]\[\mathbf{x}^{\prime}_i = act(\mathbf{x}^{K}_i)\]Where act is an activation function and agg aggregation function.
The first part of the second equation that is β\(\alpha \cdot \mathbf{x}^0_i\)β is expressed in the logic form as:
R.<output_name>__<k>(V.I) <= R.<feature_name>(V.I)[<alpha>].fixed()
The second part of the second equation that is β\((1 - \alpha) \cdot {agg}_{j \in \mathcal{N}(i)}(\mathbf{x}^{k - 1}_j)\)β is expressed as:
R.<output_name>__<k>(V.I) <= (R.<output_name>__<k-1>(V.J)[1 - <alpha>].fixed(), R.<edge_name>(V.J, V.I))
Examples
The whole computation of this module (parametrized as
APPNPConv("h1", "h0", "_edge", 3, 0.1, Transformation.SIGMOID)) is as follows:metadata = Metadata(transformation=Transformation.IDENTITY, aggregation=Aggregation.SUM) (R.h1__1(V.I) <= R.h0(V.I)[0.1].fixed()) | metadata (R.h1__1(V.I) <= (R.h0(V.J)[0.9].fixed(), R._edge(V.J, V.I))) | metadata R.h1__1/1 [Transformation.IDENTITY] (R.h1__2(V.I) <= <0.1> R.h0(V.I)) | metadata (R.h1__2(V.I) <= (<0.9> R.h1__1(V.J), R._edge(V.J, V.I))) | metadata R.h1__2/1 [Transformation.IDENTITY] (R.h1(V.I) <= <0.1> R.h0(V.I)) | metadata (R.h1(V.I) <= (<0.9> R.h1__2(V.J), R._edge(V.J, V.I))) | metadata R.h1 / 1 [Transformation.SIGMOID]
- Parameters:
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
k (int) β Number of iterations
alpha (float) β Teleport probability
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUM
- class Adam(lr: float = 0.001, betas: tuple[float, float] = (0.9, 0.999), eps: float = 1e-08, lr_decay: LRDecay | None = None)[source]ο
Bases:
OptimizerAdam optimizer. It implements the Adaptive Moment Estimation (Adam) algorithm.
- property betas: tuple[float, float]ο
- property eps: floatο
- class Attention(embed_dim: int, output_name: str, query_name: str, key_name: str, value_name: str, mask_name: str | None = None, arity: int = 1)[source]ο
Bases:
ModuleA single-head attention module based on βAttention Is All You Needβ.
- Parameters:
embed_dim (int) β The number of expected features.
output_name (str) β Output (head) predicate name of the module.
query_name (str) β The name of the queries predicate.
key_name (str) β The name of the keys predicate.
value_name (str) β The name of the values predicate.
mask_name (str, optional) β The name of the input mask predicate. Default:
Nonearity (int) β Arity of the input and output predicates. Default:
1
- class AvgPooling(output_name: str, input_name: str, input_arity: int = 1)[source]ο
Bases:
PoolingApply average pooling over the input specified by the input_name and the input arity parameters. Can be expressed as:
\[h = \frac{1}{|N|}\sum_{i_{0}, .., i_{n} \in N} x_{(i_{0}, .., i_{n})}\]Where \(N\) is a set of tuples of length \(n\) (specified by the input arity parameter) that are valid arguments for the input predicate.
This module extends the generic pooling
Pooling.Examples
The whole computation of this module (parametrized as
AvgPooling("h1", "h0")) is as follows:(R.h1 <= R.h0(V.X0)) | [Aggregation.AVG, Transformation.IDENTITY] R.h1 / 0 | [Transformation.IDENTITY]
- Parameters:
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
input_arity (int) β Arity of the input predicate
input_name. Default:1
- class CheckpointCallback(directory: str | Path, filename: str = 'best.pkl')[source]ο
Bases:
TrainerCallbackSave the model whenever validation loss improves.
- Parameters:
directory (str or Path) β Directory to save checkpoints into (created if missing).
filename (str) β Filename for the checkpoint file (default
"best.pkl").
- on_epoch_end(trainer: Trainer, epoch: int, logs: dict[str, Any]) None[source]ο
Called after every epoch.
- Parameters:
trainer (Trainer) β The trainer instance (access
trainer.model, etc.).epoch (int) β 0-indexed epoch number that just finished.
logs (dict) β Dictionary with keys
"train_loss","val_loss"(if available),"lr", and per-metric keys like"train_accuracy","val_mae", etc.
- property path: Pathο
- class Constant(value: float = 0.1)[source]ο
Bases:
InitializerInitializes learnable parameters with the
value.- Parameters:
value (float) β Value to fill weights with. Default:
0.1
- class CrossEntropy(with_logits: bool = True)[source]ο
Bases:
ErrorFunctionCross Entropy loss function. Suitable for classification tasks.
- class EarlyStoppingCallback(patience: int, min_delta: float = 0.0)[source]ο
Bases:
TrainerCallbackStop training when validation loss stops improving.
- Parameters:
patience (int) β Number of epochs with no improvement after which training stops.
min_delta (float) β Minimum absolute change to qualify as an improvement.
- on_epoch_end(trainer: Trainer, epoch: int, logs: dict[str, Any]) None[source]ο
Called after every epoch.
- Parameters:
trainer (Trainer) β The trainer instance (access
trainer.model, etc.).epoch (int) β 0-indexed epoch number that just finished.
logs (dict) β Dictionary with keys
"train_loss","val_loss"(if available),"lr", and per-metric keys like"train_accuracy","val_mae", etc.
- class ErrorFunction[source]ο
Bases:
objectBase class for error (loss) functions in the neural network.
- class GATv2Conv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, share_weights: bool = False, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>)[source]ο
Bases:
ModuleGATv2 layer from βHow Attentive are Graph Attention Networks?β.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
share_weights (bool) β Share weights in attention. Default:
Falseactivation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITY
- class GCNConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>, add_self_loops: bool | None = None, normalize: bool = True)[source]ο
Bases:
ModuleGraph Convolutional layer from βSemi-supervised Classification with Graph Convolutional Networksβ.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUMadd_self_loops (bool | None) β Add self loops if either set to True or None (if normalize is True). Default:
Nonenormalize (bool) β Add normalization. Default :
True
- class GENConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, aggregation: AggregationFunction = <neuralogic.core.constructs.function.softmax.SoftmaxAggregation object>, num_layers: int = 2, expansion: int = 2, eps: float = 1e-07, train_eps: bool = False, edge_dim: int | None = None)[source]ο
Bases:
ModuleGENConv layer from βDeeperGCN: All You Need to Train Deeper GCNsβ.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
aggregation (AggregationFunction) β The aggregation function. Default:
Aggregation.SOFTMAXnum_layers (int) β The number of MLP layers. Default:
2expansion (int) β The expansion factor of hidden channels in MLP. Default:
2eps (float) β \(\epsilon\)-value. Default:
0.0train_eps (bool) β Is
epstrainable parameter. Default:falseedge_dim (int | None) β Dimension of edge features (
Noneis projection toin_channelsis not needed). Default:None
- class GINConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleImplements the Graph Isomorphism Network (GIN) convolution layer. GIN is a powerful GNN layer that can distinguish between different graph structures.
- class GINEConv(in_channels: int, feature_name: str, edge_name: str, nn_name: str, eps: float = 0.0, train_eps: bool = False, edge_dim: int | None = None)[source]ο
Bases:
ModuleGINEConv layer from βStrategies for Pre-training Graph Neural Networksβ.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
nn_name (str) β Neural network predicate name.
eps (float) β \(\epsilon\)-value. Default:
0.0train_eps (bool) β Is
epstrainable parameter. Default:falseedge_dim (int | None) β Dimension of edge features (
Noneis projection toin_channelsis not needed). Default:None
- class GRU(input_size: int, hidden_size: int, output_name: str, input_name: str, hidden_0_name: str, arity: int = 1, next_name: str = '_next__positive')[source]ο
Bases:
ModuleOne-layer Gated Recurrent Unit (GRU) module which is computed as:
\[\begin{split}r_t = \sigma(\mathbf{W}_{xr} \mathbf{x}_t + \mathbf{W}_{hr} \mathbf{h}_{t-1}) \\\end{split}\]\[\begin{split}z_t = \sigma(\mathbf{W}_{xz} \mathbf{x}_t + \mathbf{W}_{hz} \mathbf{h}_{t-1}) \\\end{split}\]\[\begin{split}n_t = \tanh(\mathbf{W}_{xn} \mathbf{x}_t + r_t \odot (\mathbf{W}_{hn} \mathbf{h}_{t-1})) \\\end{split}\]\[h_t = (1 - z_t) \odot n_t + z_t \odot h_{t-1}\]where \(t \in (1, sequence\_length + 1)\) is a time step. In the template, the \(t\) is referred to as
V.T, and \(t - 1\) is referred to asV.Z. This module expresses the first equation as:(R.<output_name>__r(<...terms>, V.T) <= ( R.<input_name>(<...terms>, V.T)[<hidden_size>, <input_size>], R.<hidden_input_name>(<...terms>, V.Z)[<hidden_size>, <hidden_size>], R.<next_name>(V.Z, V.T), )) | [Transformation.SIGMOID] R.<output_name>__r / <arity> + 1 | [Transformation.IDENTITY]
The second equation is expressed in the same way, except for a different head predicate name. The third equation is split into three rules. The first two computes the element-wise product - \(r_t * (\mathbf{W}_{hn} \mathbf{h}_{t-1})\).
(R.<output_name>__n_helper_weighted(<...terms>, V.T) <= ( R.<hidden_input_name>(<...terms>, V.Z)[<hidden_size>, <hidden_size>], R.<next_name>(V.Z, V.T), )) | [Transformation.IDENTITY], R.<output_name>__n_helper_weighted / (<arity> + 1) | [Transformation.IDENTITY], (R.<output_name>__n_helper(<...terms>, V.T) <= ( R.<output_name>__r(<..terms>, V.T), R.<>__n_helper_weighted(<...terms>, V.T) )) | [Transformation.IDENTITY, Combination.ELPRODUCT], R.<output_name>__n_helper / (<arity> + 1) | [Transformation.IDENTITY],
The third one computes the sum and applies the \(tanh\) activation function.
(R.<output_name>__n(<...terms>, V.T) <= ( R.<input_name>(<...terms>, V.T)[<hidden_size>, <input_size>], R.<output_name>__n_helper(<...terms>, V.T) )) | [Transformation.TANH] R.<output_name>__n / (<arity> + 1) | [Transformation.IDENTITY],
The last equation is computed via three rules. The first two rules computes element-wise products. That is:
(R.<output_name>__left(<...terms>, V.T) <= ( R.<output_name>__z(<...terms>, V.T), R.<output_name>__n(<...terms>, V.T) )) | [Transformation.IDENTITY, Combination.ELPRODUCT] (R.<output_name>__right(<...terms>, V.T) <= ( R.<output_name>__z(<...terms>, V.T), R.<hidden_input_name>(<...terms>, V.Z), R.<next_name>(V.Z, V.T),, )) | [Transformation.IDENTITY, Combination.ELPRODUCT] R.<output_name>__left / <arity> + 1 | [Transformation.IDENTITY] R.<output_name>__right / <arity> + 1 | [Transformation.IDENTITY]
The last output rule sums up the element-wise products.
(R.<output_name>(<...terms>, V.T) <= ( R.<output_name>__left(<...terms>, V.T), R.<output_name>__right(<...terms>, V.T) )) | [Transformation.IDENTITY] R.<output_name> / <arity> + 1 | [Transformation.IDENTITY],
Additionally, we define a rule for the βstop conditionβ, that is:
(R.<output_name>(<...terms>, 0) <= R.<hidden_0_name>(<...terms>)) | [Transformation.IDENTITY]
- Parameters:
input_size (int) β Input feature size.
hidden_size (int) β Output and hidden feature size.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input feature predicate name to get features from.
hidden_0_name (str) β Predicate name to get initial hidden state from.
arity (int) β Arity of the input and output predicate. Default:
1
- class Glorot(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with samples from a uniform distribution (from the interval
[-scale / 2, scale / 2]) using the Glorot method.- Parameters:
scale (float) β Scale of a uniform distribution interval
[-scale / 2, scale / 2]. Default:2
- class He(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with samples from a uniform distribution (from the interval
[-scale / 2, scale / 2]) using the He method.- Parameters:
scale (float) β Scale of a uniform distribution interval
[-scale / 2, scale / 2]. Default:2
- class LSTM(input_size: int, hidden_size: int, output_name: str, input_name: str, hidden_0_name: str, cell_state_0_name: str, arity: int = 1)[source]ο
Bases:
ModuleOne-layer Long Short-Term Memory (LSTM) RNN module which is computed as:
\[i_t = \sigma(\mathbf{W}_{xi} \mathbf{x}_t + \mathbf{W}_{hi} \mathbf{h}_{t-1})\]\[f_t = \sigma(\mathbf{W}_{xf} \mathbf{x}_t + \mathbf{W}_{hf} \mathbf{h}_{t-1})\]\[o_t = \sigma(\mathbf{W}_{xo} \mathbf{x}_t + \mathbf{W}_{ho} \mathbf{h}_{t-1})\]\[\begin{split}g_t = \tanh(\mathbf{W}_{xg} \mathbf{x}_t + \mathbf{W}_{hg} \mathbf{h}_{t-1}) \\\end{split}\]\[c_t = f_t \odot c_{t-1} + i_t \odot g_t\]\[h_t = o_t \odot \tanh(c_t)\]- Parameters:
input_size (int) β Input feature size.
hidden_size (int) β Output and hidden feature size.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input feature predicate name to get features from.
hidden_0_name (str) β Predicate name to get initial hidden state from.
cell_state_0_name (str) β Predicate name to get initial cell state from.
arity (int) β Arity of the input and output predicate. Default:
1
- class Linear(in_channels: int, out_channels: int, output_name: str, input_name: str, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, arity: int = 1)[source]ο
Bases:
ModuleApply linear transformation on the input. Can be expressed as:
\[h_{i_0, .., i_{n}} = W \cdot x_{i_0, .., i_{n}}\]Where \(x\) is the input, \(W \in R^{(out\_channels \times in\_channels)}\) is a learnable parameter, and \(n\) is the arity of the input and output.
It is also possible to attach non-linearity via the activation parameter and compute:
\[h_{i_0, .., i_{n}} = act(W \cdot x_{i_0, .., i_{n}})\]Example
The whole computation of this module (parametrized as
Linear(1, 2, "h1", "h0")) is as follows:(R.h1(V.X0)[2, 1] <= R.h0(V.X0)) | [Transformation.IDENTITY] R.h1 / 1 | [Transformation.IDENTITY]
Module parametrized as
Linear(1, 2, "h1", "h0", Transformation.SIGMOID, 2)translates into:(R.h1(V.X0, V.X1)[2, 1] <= R.h0(V.X0, V.X1)) | [Transformation.IDENTITY] R.h1 / 2 | [Transformation.SIGMOID]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYarity (int) β Arity of the input and output predicate. Default:
1
- class Longtail[source]ο
Bases:
InitializerInitializes learnable parameters with random samples from a long tail distribution
- class MAGNNLinear(in_channels: int, out_channels: int, output_name: str, feature_name: str, relation_name: str, type_name: str | None, meta_paths: ~typing.List[str], activation: ~neuralogic.core.constructs.function.function.TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: ~neuralogic.core.constructs.function.function.AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
MAGNNMeanIntra-metapath Aggregation module with Linear encoder from βMAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embeddingβ. Which can be expressed as:
\[\mathbf{h}_{P(v,u)} = \mathbf{W}_p \cdot MEAN(\{\mathbf{x}_t | \forall t \in P(v,u) \})\]\[\mathbf{h}^P_{v} = act(\sum_{u \in N^P_v} \mathbf{h}_{P(v,u)})\]Where act is an activation function, \(P(v,u)\) is a single metapath instance, \(N^P_{v}\) is set of metapath-based neighbors.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
relation_name (str) β Relation predicate name for connectivity checks between entities.
type_name (Optional[str]) β Metapath type predicate name. If none,
meta_pathswill be used instead.meta_paths (List[str]) β Name of types forming a single metapath.
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.SIGMOIDaggregation (AggregationFunction) β Aggregation function of the output. Default:
Aggreagtion.SUM
- class MAGNNMean(output_name: str, feature_name: str, relation_name: str, type_name: str | None, meta_paths: ~typing.List[str], activation: ~neuralogic.core.constructs.function.function.TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: ~neuralogic.core.constructs.function.function.AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleIntra-metapath Aggregation module with Mean encoder from βMAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embeddingβ. Which can be expressed as:
\[\mathbf{h}_{P(v,u)} = MEAN(\{\mathbf{x}_t | \forall t \in P(v,u) \})\]\[\mathbf{h}^P_{v} = act(\sum_{u \in N^P_v} \mathbf{h}_{P(v,u)})\]Where act is an activation function, \(P(v,u)\) is a single metapath instance, \(N^P_{v}\) is set of metapath-based neighbors.
- Parameters:
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
relation_name (str) β Relation predicate name for connectivity checks between entities.
type_name (Optional[str]) β Metapath type predicate name. If none,
meta_pathswill be used instead.meta_paths (List[str]) β Name of types forming a single metapath.
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.SIGMOIDaggregation (AggregationFunction) β Aggregation function of the output. Default:
Aggreagtion.SUM
- class MLP(units: ~typing.List[int], output_name: str, input_name: str, activation: ~neuralogic.core.constructs.function.function.TransformationFunction | ~typing.List[~neuralogic.core.constructs.function.function.TransformationFunction] = <neuralogic.core.constructs.function.function.TransformationFunction object>, arity: int = 1)[source]ο
Bases:
Module- Parameters:
units (List[int]) β List of layer sizes.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
activation (Union[TransformationFunction, List[TransformationFunction]]) β Activation function of all layers or list of activations for each layer. Default:
Transformation.RELUarity (int) β Default:
-1
- class MSE[source]ο
Bases:
ErrorFunctionMean Squared Error (SQUARED_DIFF) loss function. Suitable for regression tasks.
- class MaxPooling(output_name: str, input_name: str, input_arity: int = 1)[source]ο
Bases:
PoolingApply max pooling over the input specified by the input_name and the input arity parameters. Can be expressed as:
\[h = max_{i_{0}, .., i_{n} \in N}(x_{(i_{0}, .., i_{n})})\]Where \(N\) is a set of tuples of length \(n\) (specified by the input arity parameter) that are valid arguments for the input predicate.
This module extends the generic pooling
Pooling.Examples
The whole computation of this module (parametrized as
MaxPooling("h1", "h0")) is as follows:(R.h1 <= R.h0(V.X0)) | [Aggregation.MAX, Transformation.IDENTITY] R.h1 / 0 | [Transformation.IDENTITY]
- Parameters:
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
input_arity (int) β Arity of the input predicate
input_name. Default:1
- class MetaConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, role_name: str | None, roles: ~typing.List[str], activation: ~neuralogic.core.constructs.function.function.TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: ~neuralogic.core.constructs.function.function.AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleMetagraph Convolutional Unit layer.
From Meta-GNN: metagraph neural network for semi-supervised learning in attributed heterogeneous information networks. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\mathbf{W_0} \cdot \mathbf{x}_i + {agg}_{j \in \mathcal{N}_r(i)} \sum_{k \in \mathcal{K}} (\mathbf{W_k} \cdot \mathbf{x}_j))\]Where act is an activation function, agg aggregation function (by default average), \(W_0\) is a learnable root parameter and \(W_k\) is a learnable parameter for each role.
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
role_name (str, optional) β Role predicate name to use for role relations. When
None, elements fromrolesare used instead.roles (List[str]) β List of relationsβ names.
activation (TransformationFunction, optional) β Activation function of the output. Default:
Transformation.SIGMOID.aggregation (AggregationFunction, optional) β Aggregation function of nodesβ neighbors. Default:
Aggregation.AVG.
- class Metric(value)[source]ο
Bases:
str,EnumEnum of available metric names.
Inherits
strso members can be used directly where a string is expected:>>> Metric.ACCURACY == "accuracy" True >>> trainer.fit(..., metrics=[Metric.ACCURACY, Metric.F1_MACRO])
- ACCURACY = 'accuracy'ο
- F1_MACRO = 'f1_macro'ο
- MAE = 'mae'ο
- MSE = 'mse'ο
- PRECISION_MACRO = 'precision_macro'ο
- R2 = 'r2'ο
- RECALL_MACRO = 'recall_macro'ο
- RMSE = 'rmse'ο
- class MultiheadAttention(embed_dim: int, num_heads: int, output_name: str, query_name: str, key_name: str, value_name: str, vdim: int | None = None, kdim: int | None = None, mask_name: str | None = None, arity: int = 1)[source]ο
Bases:
ModuleA multi-head attention module based on βAttention Is All You Needβ.
- Parameters:
embed_dim (int) β The number of expected features.
num_heads (int) β The number of heads.
output_name (str) β Output (head) predicate name of the module.
query_name (str) β The name of the queries predicate.
key_name (str) β The name of the keys predicate.
value_name (str) β The name of the values predicate.
vdim (Optional[int]) β Total number of features for values.
kdim (Optional[int]) β Total number of features for keys.
mask_name (str, optional) β The name of the input mask predicate. Default:
Nonearity (int) β Arity of the input and output predicates. Default:
1
- class Normal[source]ο
Bases:
InitializerInitializes learnable parameters with random samples from a normal (Gaussian) distribution
- class Optimizer(lr: float, lr_decay: LRDecay | None = None)[source]ο
Bases:
objectBase class for all optimizers. Optimizers are used to update the weights of the neural network during training.
- get_lr_decay() Any | None[source]ο
Initializes and returns the learning rate decay object.
- Returns:
The Java learning rate decay object, or None if no decay is set.
- Return type:
Any
- initialize() Any[source]ο
Initializes the Java representation of the optimizer.
- Returns:
The Java optimizer object.
- Return type:
Any
- property lr: floatο
Returns the current learning rate.
- Returns:
Current learning rate.
- Return type:
float
- class Pooling(output_name: str, input_name: str, aggregation: AggregationFunction, input_arity: int = 1)[source]ο
Bases:
ModuleApply generic pooling over the input specified by the input_name and the input arity parameters. Can be expressed as:
\[h = agg_{i_{0}, .., i_{n} \in N}(x_{(i_{0}, .., i_{n})})\]Where \(N\) is a set of tuples of length \(n\) (specified by the input arity parameter) that are valid arguments for the input predicate.
For example, a classic pooling over graph nodes represented by relations of arity 1 (node id) would be calculated as:
\[h = agg_{i \in N}(x_{(i)})\]Here \(N\) refers to a set of all node ids. Lifting the restriction of the input arity via the input_arity parameter allows for pooling not only nodes but also edges (
input_arity=2) and other objects (hyperedges etc.)Examples
The whole computation of this module (parametrized as
Pooling("h1", "h0", Aggregation.AVG)) is as follows:(R.h1 <= R.h0(V.X0)) | [Aggregation.AVG, Transformation.IDENTITY] R.h1 / 0 | [Transformation.IDENTITY]
Module parametrized as
Pooling("h1", "h0", Aggregation.MAX, 2)translates into:(R.h1 <= R.h0(V.X0, V.X1)) | [Aggregation.MAX, Transformation.IDENTITY] R.h1 / 0 | [Transformation.IDENTITY]
- Parameters:
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
aggregation (AggregationFunction) β Aggregation function.
input_arity (int) β Arity of the input predicate
input_name. Default:1
- class PositionalEncoding(embed_dim: int, max_len: int, output_name: str, input_name: str, arity: int = 1, learnable: bool = False)[source]ο
Bases:
ModuleImplements positional encoding for logic sequences. It generates a set of rules that add fixed or learnable positional embeddings to the input features.
- class ProgressCallback(epochs: int)[source]ο
Bases:
TrainerCallbackShow a tqdm progress bar during training.
- Parameters:
epochs (int) β Total number of epochs (used for progress bar length).
- on_epoch_end(trainer: Trainer, epoch: int, logs: dict[str, Any]) None[source]ο
Called after every epoch.
- Parameters:
trainer (Trainer) β The trainer instance (access
trainer.model, etc.).epoch (int) β 0-indexed epoch number that just finished.
logs (dict) β Dictionary with keys
"train_loss","val_loss"(if available),"lr", and per-metric keys like"train_accuracy","val_mae", etc.
- class RGCNConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str | None, relations: ~typing.List[str], activation: ~neuralogic.core.constructs.function.function.TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: ~neuralogic.core.constructs.function.function.AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleRelational Graph Convolutional layer from Modeling Relational Data with Graph Convolutional Networks. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\mathbf{W_0} \cdot \mathbf{x}_i + \sum_{r \in \mathcal{R}} {agg}_{j \in \mathcal{N}_r(i)}(\mathbf{W_r} \cdot \mathbf{x}_j))\]Where act is an activation function, agg aggregation function (by default average), \(W_0\) is a learnable root parameter and \(W_r\) is a learnable parameter for each relation.
The first part of the equation that is β\(\mathbf{W_0} \cdot \mathbf{x}_i\)β can be expressed in the logic form as:
R.<output_name>(V.I) <= R.<feature_name>(V.I)[<W0>]
Another part of the equation that is β\({agg}_{j \in \mathcal{N}_r(i)}(\mathbf{W_r} \cdot \mathbf{x}_j)\)β can be expressed as:
R.<output_name>(V.I) <= (R.<feature_name>(V.J)[<Wr>], R.<edge_name>(V.J, relation, V.I))
where βrelationβ is a constant name, or as:
R.<output_name>(V.I) <= (R.<feature_name>(V.J)[<Wr>], R.<relation>(V.J, V.I))
The outer summation, together with summing it with the first part, is handled by aggregation of all rules with the same head (and substitution).
Examples
The whole computation of this module (parametrized as
RGCNConv(1, 2, "h1", "h0", "_edge", ["sibling", "parent"])) is as follows:metadata = Metadata(activation=Transformation.IDENTITY, aggregation=Aggregation.AVG) (R.h1(V.I) <= R.h0(V.I)[2, 1]) | metadata (R.h1(V.I) <= (R.h0(V.J)[2, 1], R._edge(V.J, sibling, V.I))) | metadata (R.h1(V.I) <= (R.h0(V.J)[2, 1], R._edge(V.J, parent, V.I))) | metadata R.h1 / 1 [Transformation.IDENTITY]
Module parametrized as
RGCNConv(1, 2, "h1", "h0", None, ["sibling", "parent"])translates into:metadata = Metadata(activation=Transformation.IDENTITY, aggregation=Aggregation.AVG) (R.h1(V.I) <= R.h0(V.I)[2, 1]) | metadata (R.h1(V.I) <= (R.h0(V.J)[2, 1], R.sibling(V.J, V.I))) | metadata (R.h1(V.I) <= (R.h0(V.J)[2, 1], R.parent(V.J, V.I))) | metadata R.h1 / 1 [Transformation.IDENTITY]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str | None) β Edge predicate name to use for neighborhood relations. When
None, elements fromrelationsare used instead.relations (List[str]) β List of relationsβ names
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUM
- class RNN(input_size: int, hidden_size: int, output_name: str, input_name: str, hidden_0_name: str, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, arity: int = 1)[source]ο
Bases:
ModuleOne-layer Recurrent Neural Network (RNN) module which is computed as:
\[h_t = act(\mathbf{W}_{ih} \mathbf{x}_t + \mathbf{W}_{hh} \mathbf{h}_{t-1})\]where \(t \in (1, sequence\_length + 1)\) is a time step. In the template, the \(t\) is referred to as
V.T, and \(t - 1\) is referred to asV.Z. This module expresses the first equation as:(R.<output_name>(<...terms>, V.T) <= ( R.<input_name>(<...terms>, V.T)[<hidden_size>, <input_size>], R.<hidden_input_name>(<...terms>, V.Z)[<hidden_size>, <hidden_size>], R.special.next(V.Z, V.T), )) | [<activation>] R.<output_name> / <arity> + 1 | [Transformation.IDENTITY]
Additionally, we define a rule for the βstop conditionβ, that is:
(R.<output_name>(<...terms>, 0) <= R.<hidden_0_name>(<...terms>)) | [Transformation.IDENTITY]
- Parameters:
input_size (int) β Input feature size.
hidden_size (int) β Output and hidden feature size.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input feature predicate name to get features from.
hidden_0_name (str) β Predicate name to get initial hidden state from.
activation (TransformationFunction) β Activation function. Default:
Transformation.TANHarity (int) β Arity of the input and output predicate. Default:
1
- class ResGatedGraphConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, gating_activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleResidual Gated Graph Convolutional layer from βResidual Gated Graph ConvNetsβ. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\mathbf{W}_1 \mathbf{x}_i + {agg}_{j \in \mathcal{N}(i)}(\eta_{i,j} \odot \mathbf{W}_2 \mathbf{x}_j))\]\[\mathbf{\eta}_{i,j} = gating\_act(\mathbf{W}_3 \mathbf{x}_i + \mathbf{W}_4 \mathbf{x}_j)\]Where act is an activation function, agg aggregation function, gating_act is a gating activation function and \(W_n\) are learnable parameters. This equation is translated into the logic form as:
(R.<output_name>__gate(V.I, V.J) <= (R.<feature_name>(V.I)[<W>], R.<feature_name>(V.J)[<W>])) | [Transformation.IDENTITY] R.<output_name>__gate / 2 | [<activation>] (R.<output_name>(V.I) <= R.<feature_name>(V.I)[<W>]) | [Transformation.IDENTITY] (R.<output_name>(V.I) <= ( R.<output_name>__gate(V.I, V.J), R.<feature_name>(V.J)[<W>], R.<edge_name>(V.J, V.I)) ) | Metadata(activation="elementproduct-identity", aggregation=<aggregation>) R.<output_name> / 1 | [<activation>]
Examples
The whole computation of this module (parametrized as
ResGatedGraphConv(1, 2, "h1", "h0", "_edge")) is as follows:metadata = Metadata(activation="elementproduct-identity", aggregation=Aggregation.SUM) (R.h1__gate(V.I, V.J) <= (R.h0(V.I)[2, 1], R.h0(V.J)[2, 1])) | [Transformation.IDENTITY] R.h1__gate / 2 | [Transformation.SIGMOID] (R.h1(V.I) <= R.h0(V.I)[2, 1]) | [Transformation.IDENTITY] (R.h1(V.I) <= (R.h1__gate(V.I, V.J), R.h0(V.J)[2, 1], R._edge(V.J, V.I))) | metadata R.h1 / 1 | [Transformation.IDENTITY]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
gating_activation (TransformationFunction) β Gating activation function. Default:
Transformation.SIGMOIDactivation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUM
- class RvNN(input_size: int, output_name: str, input_name: str, parent_map_name: str, max_children: int = 2, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>, arity: int = 1)[source]ο
Bases:
ModuleRecursive Neural Network (RvNN) module which is computed as:
\[\mathbf{h}_i = act(agg_{j \in \mathcal{Ch(i)}}(\mathbf{W_{id(j)}} \mathbf{h}_j))\]Where \(act\) is an activation function, \(agg\) aggregation function and \(\mathbf{W}\)βs are learnable parameters. \(\mathcal{Ch(i)}\) represents the ordered list of children of node \(i\). The \(id(j)\) function maps node \(j\) to its index (position) in its parentβs children list.
- Parameters:
input_size (int) β Input feature size.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input feature predicate name to get leaf features from.
parent_map_name (str) β Name of the predicate to get mapping from parent to children
max_children (int) β Maximum number of children (specify which <max_children>-ary tree will be considered). Default:
2activation (TransformationFunction) β Activation function of all layers. Default:
Transformation.TANHaggregation (AggregationFunction) β Aggregation function of a layer. Default:
Aggregation.SUMarity (int) β Arity of the input and output predicate (doesnβt include the node id term). Default:
1
- class SAGEConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleGraphSAGE layer from βInductive Representation Learning on Large Graphsβ. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\mathbf{W}_1 \mathbf{x}_i + \mathbf{W}_2 \cdot {agg}_{j \in \mathcal{N}(i)}(\mathbf{x}_j)))\]Where act is an activation function, agg aggregation function and Wβs are learnable parameters. This equation is translated into the logic form as:
(R.<output_name>(V.I)[<W1>] <= (R.<feature_name>(V.J), R.<edge_name>(V.J, V.I))) | [<aggregation>, Transformation.IDENTITY] (R.<output_name>(V.I)[<W2>] <= R.<feature_name>(V.I)) | [Transformation.IDENTITY] R.<output_name> / 1 | [<activation>]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.AVG
- class SGConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, k: int = 1, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleSimple Graph Convolutional layer from βSimplifying Graph Convolutional Networksβ. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\mathbf{W} \cdot {agg}_{j \in \mathcal{N}^k(i)}(\mathbf{x}_j))\]Where act is an activation function, agg aggregation function, W is a learnable parameter and \(\mathcal{N}^k(i)\) denotes nodes that are k hops away from the node i. This equation is translated into the logic form as:
(R.<output_name>(V.I)[<W>] <= ( R.<feature_name>(V.I<k>), R.<edge_name>(V.I<1>, V.I<0>), R.<edge_name>(V.I<2>, V.I<1>), ..., R.<edge_name>(V.I<k>, V.I<k-1>), )) | [<aggregation>, Transformation.IDENTITY] R.<output_name> / 1 | [<activation>]
Examples
The whole computation of this module (parametrized as
SGConv(2, 3, "h1", "h0", "_edge", 2)) is as follows:(R.h1(V.I0)[3, 2] <= (R.h0(V.I2), R._edge(V.I1, V.I0), R._edge(V.I2, V.I1))) | [Transformation.IDENTITY, Aggregation.SUM] R.h1 / 1 | [Transformation.IDENTITY]
Module parametrized as
SGConv(2, 3, "h1", "h0", "_edge", 1)translates into:(R.h1(V.I0)[3, 2] <= (R.h0(V.I1), R._edge(V.I1, V.I0))) | [Transformation.IDENTITY, Aggregation.SUM] R.h1 / 1 | [Transformation.IDENTITY]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
k (int) β Number of hops. Default:
1activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUM
- class SGD(lr: float = 0.1, lr_decay: LRDecay | None = None)[source]ο
Bases:
OptimizerStochastic Gradient Descent (SGD) optimizer.
- class SoftEntropy[source]ο
Bases:
ErrorFunctionSoft Entropy loss function. Similar to Cross Entropy but usually applied with a soft layer at the end.
- class SumPooling(output_name: str, input_name: str, input_arity: int = 1)[source]ο
Bases:
PoolingApply sum pooling over the input specified by the input_name and the input arity parameters. Can be expressed as:
\[h = \sum_{i_{0}, .., i_{n} \in N} x_{(i_{0}, .., i_{n})}\]Where \(N\) is a set of tuples of length \(n\) (specified by the input arity parameter) that are valid arguments for the input predicate.
This module extends the generic pooling
Pooling.Examples
The whole computation of this module (parametrized as
SumPooling("h1", "h0")) is as follows:(R.h1 <= R.h0(V.X0)) | [Aggregation.SUM, Transformation.IDENTITY] R.h1 / 0 | [Transformation.IDENTITY]
- Parameters:
output_name (str) β Output (head) predicate name of the module.
input_name (str) β Input name.
input_arity (int) β Arity of the input predicate
input_name. Default:1
- class TAGConv(in_channels: int, out_channels: int, output_name: str, feature_name: str, edge_name: str, k: int = 2, activation: TransformationFunction = <neuralogic.core.constructs.function.function.TransformationFunction object>, aggregation: AggregationFunction = <neuralogic.core.constructs.function.function.AggregationFunction object>)[source]ο
Bases:
ModuleTopology Adaptive Graph Convolutional layer from βTopology Adaptive Graph Convolutional Networksβ. Which can be expressed as:
\[\mathbf{x}^{\prime}_i = act(\sum_{k=0}^K \mathbf{W}_k \cdot {agg}_{j \in \mathcal{N}^k(i)}(\mathbf{x}_j))\]Where act is an activation function, agg aggregation function, Wk are learnable parameters and \(\mathcal{N}^k(i)\) denotes nodes that are k hops away from the node i. This equation is translated into the logic form as:
This equation is translated into the logic form as:
(R.<output_name>(V.I0)[<W0>] <= R.<feature_name>(V.I0)) | [<aggregation>, Transformation.IDENTITY] (R.<output_name>(V.I0)[<W1>] <= (R.<feature_name>(V.I1), R.<edge_name>(V.I1, V.I0))) | [<aggregation>, Transformation.IDENTITY] (R.<output_name>(V.I0)[<W2>] <= (R.<feature_name>(V.I2), R.<edge_name>(V.I1, V.I0), R.<edge_name>(V.I2, V.I1)) | [<aggregation>, Transformation.IDENTITY] ... (R.<output_name>(V.I0)[<Wk>] <= (R.<feature_name>(V.I<k>), R.<edge_name>(V.I1, V.I0), ..., R.<edge_name>(V.I<k>, V.I<k-1>)) | [<aggregation>, Transformation.IDENTITY] R.<output_name> / 1 | [<activation>]
Examples
The whole computation of this module (parametrized as
TAGConv(1, 2, "h1", "h0", "_edge")) is as follows:(R.h1(V.I0)[2, 2] <= R.h0(V.I0)) | [Aggregation.SUM, Transformation.IDENTITY] (R.h1(V.I0)[2, 1] <= (R.h0(V.I1), R._edge(V.I1, V.I0)) | [Aggregation.SUM, Transformation.IDENTITY] (R.h1(V.I0)[2, 1] <= (R.h0(V.I2), R._edge(V.I1, V.I0), R._edge(V.I2, V.I1)) | [Aggregation.SUM, Transformation.IDENTITY] R.h1 / 1 | [Transformation.IDENTITY]
Module parametrized as
TAGConv(1, 2, "h1", "h0", "_edge", 1)translates into:(R.h1(V.I0)[2, 1] <= R.h0(V.I0)) | [Aggregation.SUM, Transformation.IDENTITY] (R.h1(V.I0)[2, 1] <= (R.h0(V.I1), R._edge(V.I1, V.I0)) | [Aggregation.SUM, Transformation.IDENTITY] R.h1 / 1 | [Transformation.IDENTITY]
- Parameters:
in_channels (int) β Input feature size.
out_channels (int) β Output feature size.
output_name (str) β Output (head) predicate name of the module.
feature_name (str) β Feature predicate name to get features from.
edge_name (str) β Edge predicate name to use for neighborhood relations.
k (int) β Number of hops. Default:
2activation (TransformationFunction) β Activation function of the output. Default:
Transformation.IDENTITYaggregation (AggregationFunction) β Aggregation function of nodesβ neighbors. Default:
Aggregation.SUM
- class Trainer(module: NeuralModule)[source]ο
Bases:
object- fit(train_dataset: Dataset | GroundedDataset | BuiltDataset, val_dataset: Dataset | GroundedDataset | BuiltDataset | None = None, *, epochs: int = 1, batch_size: int = 1, early_stopping_patience: int | None = None, min_delta: float = 0.0, checkpoint_dir: str | Path | None = None, metrics: Sequence[str | Metric] | None = None, silent: bool = False, callbacks: Sequence[TrainerCallback] | None = None) TrainerHistory[source]ο
Run the training loop.
- Parameters:
train_dataset β Training data. Raw
Datasetobjects are built automatically; pass aBuiltDatasetto skip repeated grounding.val_dataset β Optional validation data. When provided, validation loss (and any requested metrics) are computed after every epoch. Early stopping and checkpointing depend on validation loss.
epochs (int) β Number of epochs to train. Default 1.
batch_size (int) β Batch size when building raw datasets. Default 1.
early_stopping_patience (int or None) β Stop after this many epochs without validation-loss improvement. Requires
val_dataset. DefaultNone(no early stopping).min_delta (float) β Minimum absolute change in validation loss to count as improvement. Default 0.0.
checkpoint_dir (str, Path, or None) β Directory to save the best model (by validation loss). A file named
best.pklis written on every improvement. DefaultNone(no checkpointing).metrics (Sequence[str or Metric] or None) β Extra metrics to compute, e.g.
[Metric.ACCURACY]or["mae", "r2"]. Loss is always tracked. DefaultNone.silent (bool) β If
True, suppress the tqdm progress bar. DefaultFalse.callbacks (Sequence[TrainerCallback] or None) β Additional callbacks to invoke. Built-in callbacks (early stopping, checkpoint, progress) are appended automatically based on the other arguments.
- Returns:
Losses and metrics for every epoch.
- Return type:
- test(dataset: Dataset | GroundedDataset | BuiltDataset, *, batch_size: int = 1) list[source]ο
Evaluate the model on a dataset (no weight updates).
- Parameters:
dataset β Test data. Raw
Datasetobjects are built automatically.batch_size (int) β Batch size when building raw datasets. Default 1.
- Returns:
Model outputs for every sample.
- Return type:
list
- class TrainerCallback[source]ο
Bases:
objectBase class for training callbacks.
Override any of the hooks. The trainer calls them in the order they were passed to
Trainer.fit().- on_epoch_end(trainer: Trainer, epoch: int, logs: dict[str, Any]) None[source]ο
Called after every epoch.
- Parameters:
trainer (Trainer) β The trainer instance (access
trainer.model, etc.).epoch (int) β 0-indexed epoch number that just finished.
logs (dict) β Dictionary with keys
"train_loss","val_loss"(if available),"lr", and per-metric keys like"train_accuracy","val_mae", etc.
- class TrainerHistory(train_losses: list[float] = <factory>, val_losses: list[float] = <factory>, train_metrics: dict[str, list[float]]=<factory>, val_metrics: dict[str, list[float]]=<factory>, learning_rates: list[float] = <factory>, best_epoch: int = 0, best_val_loss: float = inf, stopped_early: bool = False)[source]ο
Bases:
objectTraining history collected during a
Trainer.fit()run.- train_lossesο
Mean training loss per epoch.
- Type:
list[float]
- val_lossesο
Mean validation loss per epoch (empty if no validation set).
- Type:
list[float]
- train_metricsο
Per-epoch extra metrics on the training set (each key maps to a list of epoch-level means).
- Type:
dict[str, list[float]]
- val_metricsο
Per-epoch extra metrics on the validation set.
- Type:
dict[str, list[float]]
- learning_ratesο
Learning rate at each epoch.
- Type:
list[float]
- best_epochο
Epoch (0-indexed) that achieved the lowest validation loss.
- Type:
int
- best_val_lossο
Lowest validation loss observed.
- Type:
float
- stopped_earlyο
Trueif early stopping fired.- Type:
bool
- best_epoch: int = 0ο
- best_val_loss: float = infο
- learning_rates: list[float]ο
- stopped_early: bool = Falseο
- train_losses: list[float]ο
- train_metrics: dict[str, list[float]]ο
- val_losses: list[float]ο
- val_metrics: dict[str, list[float]]ο
- class Transformer(input_dim: int, num_heads: int, dim_feedforward: int, output_name: str, src_name: str, tgt_name: str, src_mask_name: str | None = None, tgt_mask_name: str | None = None, memory_mask_name: str | None = None, arity: int = 1)[source]ο
Bases:
ModuleA transformer module based on βAttention Is All You Needβ.
- Parameters:
input_dim (int) β The number of expected features.
num_heads (int) β The number of heads in the multi-head attention module.
dim_feedforward (int) β The dimension of the feedforward network.
output_name (str) β Output (head) predicate name of the module.
src_name (str) β The name of the predicate of the input to the encoder.
tgt_name (str) β The name of the predicate of the input to the decoder.
src_mask_name (str, optional) β The name of the predicate of the encoder input mask. Default:
Nonetgt_mask_name (str, optional) β The name of the predicate of the decoder input mask. Default:
Nonememory_mask_name (str, optional) β The name of the predicate of the encoder output mask. Default:
Nonearity (int) β Arity of the input and output predicate. Default:
1
- class TransformerDecoder(input_dim: int, num_heads: int, dim_feedforward: int, output_name: str, input_name: str, encoder_name: str, mask_name: str | None = None, memory_mask_name: str | None = None, arity: int = 1)[source]ο
Bases:
ModuleA transformer decoder module based on βAttention Is All You Needβ.
- Parameters:
input_dim (int) β The number of expected features.
num_heads (int) β The number of heads in the multi-head attention module.
dim_feedforward (int) β The dimension of the feedforward network.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β The name of the predicate of the input sequence.
input_name β The name of the input encoder.
mask_name (str, optional) β The name of the predicate of the decoder input sequence mask. Default:
Nonememory_mask_name (str, optional) β The name of the predicate of the encoder output mask. Default:
Nonearity (int) β Arity of the input and output predicate. Default:
1
- class TransformerEncoder(input_dim: int, num_heads: int, dim_feedforward: int, output_name: str, input_name: str, mask_name: str | None = None, arity: int = 1)[source]ο
Bases:
EncoderBlockA transformer encoder module based on βAttention Is All You Needβ.
- Parameters:
input_dim (int) β The number of expected features.
num_heads (int) β The number of heads in the multi-head attention module.
dim_feedforward (int) β The dimension of the feedforward network.
output_name (str) β Output (head) predicate name of the module.
input_name (str) β The name of the predicate of the input sequence.
mask_name (str, optional) β The name of the predicate of the input sequence mask. Default:
Nonearity (int) β Arity of the input and output predicate. Default:
1
- class Uniform(scale: float = 2)[source]ο
Bases:
InitializerInitializes learnable parameters with random uniformly distributed samples from the interval
[-scale / 2, scale / 2].- Parameters:
scale (float) β Scale of the distribution interval
[-scale / 2, scale / 2]. Default:2
- compute_metrics(targets: list, outputs: list, names: Sequence[str | Metric]) dict[str, float][source]ο
Compute named metrics over a batch of (target, output) pairs.
Each metric receives the full batch and returns a single float.
- Parameters:
targets (list) β Per-sample target values (floats, lists, or 2D lists).
outputs (list) β Per-sample output values (same shapes as targets).
names (Sequence[str or Metric]) β Metric names to compute, e.g.
["accuracy"]or[Metric.MAE, Metric.R2].
- Returns:
Mapping from metric name to its value across the batch.
- Return type:
dict[str, float]