Added metadata node to all NIRNodes #79

Merged · 10 commits · Mar 28, 2024
2 changes: 1 addition & 1 deletion .github/workflows/build.yml
@@ -9,7 +9,7 @@ jobs:
strategy:
matrix:
os: [ubuntu-latest]
python-version: ["3.7", "3.8", "3.9", "3.10", "3.11", "3.12"]
python-version: ["3.9", "3.10", "3.11", "3.12"]

runs-on: ${{ matrix.os }}

2 changes: 1 addition & 1 deletion docs/source/conf.py
@@ -56,7 +56,7 @@
]

# MyST settings
nb_execution_mode = "off" # this can be turned to 'auto' once the package is stable
nb_execution_timeout = 300
nb_execution_show_tb = True

100 changes: 90 additions & 10 deletions docs/source/primitives.md
@@ -1,6 +1,14 @@
# Primitives

At its core, NIR is simply a [directed graph](https://en.wikipedia.org/wiki/Directed_graph) (using the [`NIRGraph` primitive](https://github.com/neuromorphs/NIR/blob/main/nir/ir/graph.py)).
The nodes of the graph are computational units, and the edges are the (directed) connections between them.
There are no restrictions on the graph structure, so it can be a simple feedforward network, a recurrent network, a graph with cycles, or even one with duplicated connections, if needed.

However, if you plan to execute the graph on restricted neuromorphic hardware, please **verify that the graph is compatible with the hardware**.

## NIR computational primitives

NIR defines 16 fundamental primitives, listed in the table below, which backends are free to implement as they see fit; this can lead to varying outputs across platforms. While discrepancies could be minimized by constraining implementations or by making backends aware of each other's discretization choices, NIR does not do this because it is declarative, specifying only the necessary inputs and outputs. Constraining implementations would cause hardware incompatibilities, and making backends aware of each other would create O(N^2) overhead for N backends. The primitives are nonetheless computationally expressive and able to solve complex partial differential equations (PDEs).

| Primitive | Parameters | Computation | Reset |
|-|-|-|-|
@@ -21,17 +29,89 @@ NIR defines 16 fundamental primitives listed in the table below, which backends
| **AvgPooling** | $p$ | **SumPooling**; **Scale** | - |
| **Threshold** | $\theta_\text{thr}$ | $H(I - \theta_\text{thr})$ | - |
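
To make the table concrete, the **Threshold** row translates directly into code. Here is a minimal NumPy sketch (assuming $H$ denotes the Heaviside step function, written as `np.heaviside` below):

```python
import numpy as np

# Threshold primitive: H(I - theta_thr), with H the Heaviside step function
def threshold(I: np.ndarray, theta_thr: np.ndarray) -> np.ndarray:
    return np.heaviside(I - theta_thr, 0.0)
```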

Each primitive is defined by its own dynamical equation, specified in the [API docs](https://nnir.readthedocs.io/en/latest/).

## Connectivity

Each computational unit is a node in a static graph.
In the graph, each node has a name like "Neuron 1" or, in some cases, simply an index such as "1".
Connections between nodes are tuples of strings describing the source and the target.
As an example, `("A", "B")` tells us that the output of node `A` is sent to node `B`.

Describing the full connectivity is as simple as listing all the connections in the graph:
```python
[
    ("A", "B"),
    ("B", "C"),
    ("C", "D"),
    ...
]
```

## Input and output nodes
Given a graph, how do we know which nodes should receive inputs? And which nodes should provide outputs?
For that, we define two special nodes: `Input` and `Output`.
Both nodes are "dummies" in the sense that they do not perform any computation; they only mark the beginning and the end of the graph.
Note that a single node can be both an input and an output node.

To clarify the dimensionality/input types of the input and output nodes, we require the user to specify the shape *and* name of the input, like so:
```python
import numpy as np

import nir

nir.Input(
    input_type={"input": np.array([28, 28])}
)
nir.Output(
    output_type={"output": np.array([2])}
)
```

## A Graph Example in Python
To illustrate how a computational graph can be defined using the NIR Python primitives, here is an example of a graph with a single `LIF` neuron, together with input and output nodes:

```python
import numpy as np

import nir

nir.NIRGraph(
    nodes={
        "input": nir.Input({"input": np.array([1])}),
        "lif": nir.LIF(...),
        "output": nir.Output({"output": np.array([1])}),
    },
    edges=[
        ("input", "lif"),
        ("lif", "output"),
    ],
)
```

## Metadata

Each node in the graph can have metadata attached to it.
The metadata is a dictionary that can contain any information that may be helpful for the user or backend.
Any dictionary entries can be added, although we recommend restricting the entries to strings, numbers, and arrays.
Here is an example of a metadata dictionary attached to a graph:

```python
import nir

nir.NIRGraph(
    ...,
    metadata={"some": "metadata", "info": 1},
)
```
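
Metadata can likewise be attached to individual nodes, since this PR adds a `metadata` field to every `NIRNode`. A minimal sketch (the `LIF` parameter values and metadata entries are purely illustrative):

```python
import numpy as np

import nir

# Attach free-form metadata to a single node (hypothetical values)
lif = nir.LIF(
    tau=np.array([0.02]),
    r=np.array([1.0]),
    v_leak=np.array([0.0]),
    v_threshold=np.array([1.0]),
    metadata={"discretization": "euler", "dt": 1e-4},
)
```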


```{admonition} Do not rely on the metadata
:class: warning
It is vital that **no backend relies on this metadata**.
Metadata entries should contain non-essential meta-information about nodes or graphs, such as the discretization scheme with which the graph was trained, timestamps, and so on: tidbits that can improve the model or its execution, but are not necessary for the execution itself.

If a backend were to rely strictly on this metadata, everyone else would be forced to adhere to a standard that is not enforced.
NIR graphs should be self-contained and unambiguous, such that the graph itself (without the metadata) contains all the information necessary to execute it.
```

## Importing and exporting
While the NIR library is written in Python, the graph can be defined and used in any language.
We provide import and export functions to and from the [Hierarchical Data Format](https://en.wikipedia.org/wiki/Hierarchical_Data_Format), which allows for easy storage and retrieval of the graph.
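
As a minimal sketch (assuming the top-level `nir.write` and `nir.read` helpers exposed by the Python package), a graph can be round-tripped through a `.nir` file like so:

```python
import numpy as np

import nir

# Build a trivial graph and round-trip it through an HDF5-backed file
graph = nir.NIRGraph(
    nodes={
        "input": nir.Input({"input": np.array([1])}),
        "output": nir.Output({"output": np.array([1])}),
    },
    edges=[("input", "output")],
)
nir.write("graph.nir", graph)
restored = nir.read("graph.nir")
```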

## Format
The intermediate representation can be stored as an HDF5 file, which benefits from compression.
See [the usage page](usage) for more information.
7 changes: 5 additions & 2 deletions nir/ir/conv.py
@@ -1,5 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional, Tuple, Union

import numpy as np

@@ -41,6 +41,9 @@ class Conv1d(NIRNode):
dilation: int # Dilation
groups: int # Groups
bias: np.ndarray # Bias C_out
input_type: Optional[Dict[str, np.ndarray]] = None  # Input type (name -> shape)
output_type: Optional[Dict[str, np.ndarray]] = None  # Output type (name -> shape)
metadata: Dict[str, Any] = field(default_factory=dict)  # Free-form metadata dictionary

def __post_init__(self):
if isinstance(self.padding, str) and self.padding not in ["same", "valid"]:
6 changes: 5 additions & 1 deletion nir/ir/delay.py
@@ -1,4 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -16,6 +17,9 @@ class Delay(NIRNode):
"""

delay: np.ndarray # Delay
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
# set input and output shape, if not set by user
8 changes: 5 additions & 3 deletions nir/ir/flatten.py
@@ -1,5 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -21,6 +21,9 @@ class Flatten(NIRNode):
input_type: Types
start_dim: int = 1 # First dimension to flatten
end_dim: int = -1 # Last dimension to flatten
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
self.input_type = parse_shape_argument(self.input_type, "input")
@@ -41,7 +44,6 @@ def __post_init__(self):

def to_dict(self) -> Dict[str, Any]:
ret = super().to_dict()
del ret["input_type"]
ret["input_type"] = self.input_type["input"]
return ret

9 changes: 5 additions & 4 deletions nir/ir/graph.py
@@ -1,6 +1,6 @@
from collections import Counter
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -27,6 +27,9 @@ class NIRGraph(NIRNode):

nodes: Nodes # List of computational nodes
edges: Edges # List of edges between nodes
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

@property
def inputs(self):
@@ -456,7 +459,6 @@ def __post_init__(self):

def to_dict(self) -> Dict[str, Any]:
ret = super().to_dict()
del ret["input_type"]
ret["shape"] = self.input_type["input"]
return ret

@@ -484,7 +486,6 @@ def __post_init__(self):

def to_dict(self) -> Dict[str, Any]:
ret = super().to_dict()
del ret["output_type"]
ret["shape"] = self.output_type["output"]
return ret

6 changes: 5 additions & 1 deletion nir/ir/linear.py
@@ -1,4 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -20,6 +21,9 @@ class Affine(NIRNode):

weight: np.ndarray # Weight term
bias: np.ndarray # Bias term
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
assert len(self.weight.shape) >= 2, "Weight must be at least 2D"
18 changes: 17 additions & 1 deletion nir/ir/neuron.py
@@ -1,4 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -45,6 +46,9 @@ class CubaLIF(NIRNode):
v_leak: np.ndarray # Leak voltage
v_threshold: np.ndarray # Firing threshold
w_in: np.ndarray = 1.0 # Input current weight
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
assert (
@@ -71,6 +75,9 @@ class I(NIRNode):  # noqa: E742
"""

r: np.ndarray
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
self.input_type = {"input": np.array(self.r.shape)}
@@ -101,6 +108,9 @@ class IF(NIRNode):

r: np.ndarray # Resistance
v_threshold: np.ndarray # Firing threshold
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
assert (
@@ -127,6 +137,9 @@ class LI(NIRNode):
tau: np.ndarray # Time constant
r: np.ndarray # Resistance
v_leak: np.ndarray # Leak voltage
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
assert (
@@ -166,6 +179,9 @@ class LIF(NIRNode):
r: np.ndarray # Resistance
v_leak: np.ndarray # Leak voltage
v_threshold: np.ndarray # Firing threshold
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
assert (
12 changes: 9 additions & 3 deletions nir/ir/node.py
@@ -10,16 +10,22 @@ class NIRNode:
instantiated.
"""

# Note: Adding input/output types and metadata as follows is ideal, but requires Python 3.10
# TODO: implement this in 2025 when 3.9 is EOL
# input_type: Dict[str, np.ndarray] = field(init=False, kw_only=True)
# output_type: Dict[str, np.ndarray] = field(init=False, kw_only=True)
# metadata: Dict[str, Any] = field(init=True, default_factory=dict)

def __eq__(self, other):
return self is other

def to_dict(self) -> Dict[str, Any]:
"""Serialize into a dictionary."""
ret = asdict(self)
if "input_type" in ret.keys():
del ret["input_type"]
if "output_type" in ret.keys():
del ret["output_type"]
# Note: The customization below won't be automatically done recursively for nested NIRNode.
# Therefore, classes with nested NIRNodes, e.g. NIRGraph, must implement their own to_dict
ret["type"] = type(self).__name__
6 changes: 5 additions & 1 deletion nir/ir/pooling.py
@@ -1,4 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -12,6 +13,9 @@ class SumPool2d(NIRNode):
kernel_size: np.ndarray # (Height, Width)
stride: np.ndarray # (Height, width)
padding: np.ndarray # (Height, width)
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
self.input_type = {"input": None}
6 changes: 5 additions & 1 deletion nir/ir/surrogate_gradient.py
@@ -1,4 +1,5 @@
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

import numpy as np

@@ -19,6 +20,9 @@ class Threshold(NIRNode):
"""

threshold: np.ndarray # Firing threshold
input_type: Optional[Dict[str, np.ndarray]] = None
output_type: Optional[Dict[str, np.ndarray]] = None
metadata: Dict[str, Any] = field(default_factory=dict)

def __post_init__(self):
self.input_type = {"input": np.array(self.threshold.shape)}
2 changes: 1 addition & 1 deletion nir/ir/utils.py
@@ -62,7 +62,7 @@ def calculate_conv_output(
/ _index_tuple(stride, i)
+ 1
)
shapes.append(int(shape.item()))
return np.array(shapes)

