From a58d8af300dc99a652982735f98cf5ac0a5a53b1 Mon Sep 17 00:00:00 2001
From: "Documenter.jl" <documenter@juliadocs.github.io>
Date: Fri, 27 Oct 2023 11:25:30 +0000
Subject: [PATCH] build based on b53d8d9

---
 dev/.documenter-siteinfo.json |  2 +-
 dev/api/index.html            | 82 +++++++++++++++++------------------
 dev/api_overview/index.html   |  2 +-
 dev/changelog/index.html      |  2 +-
 dev/examples/index.html       |  2 +-
 dev/index.html                |  2 +-
 dev/license/index.html        |  2 +-
 dev/overview/index.html       |  2 +-
 8 files changed, 48 insertions(+), 48 deletions(-)
diff --git a/dev/.documenter-siteinfo.json b/dev/.documenter-siteinfo.json
index 25ae3ed8..e9482aa5 100644
--- a/dev/.documenter-siteinfo.json
+++ b/dev/.documenter-siteinfo.json
@@ -1 +1 @@
-{"documenter":{"julia_version":"1.9.3","generation_timestamp":"2023-10-27T11:13:22","documenter_version":"1.1.2"}}
\ No newline at end of file
+{"documenter":{"julia_version":"1.9.3","generation_timestamp":"2023-10-27T11:25:16","documenter_version":"1.1.2"}}
\ No newline at end of file
diff --git a/dev/api/index.html b/dev/api/index.html
index d7f6cca4..e79b96e2 100644
--- a/dev/api/index.html
+++ b/dev/api/index.html
@@ -1,5 +1,5 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Reference · NNHelferlein.jl</title><meta name="title" content="API Reference · NNHelferlein.jl"/><meta property="og:title" content="API Reference · NNHelferlein.jl"/><meta property="twitter:title" content="API Reference · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li class="is-active"><a class="tocitem" href>API Reference</a><ul class="internal"><li class="toplevel"><a class="tocitem" href="#Layers"><span>Layers</span></a></li><li><a class="tocitem" href="#Fully-connected-layers"><span>Fully connected layers</span></a></li><li><a class="tocitem" href="#Convolutional"><span>Convolutional</span></a></li><li><a class="tocitem" href="#Recurrent"><span>Recurrent</span></a></li><li><a class="tocitem" href="#Transformers"><span>Transformers</span></a></li><li><a class="tocitem" href="#Others"><span>Others</span></a></li><li><a class="tocitem" href="#Attention-Mechanisms"><span>Attention Mechanisms</span></a></li><li class="toplevel"><a class="tocitem" href="#Data-providers"><span>Data providers</span></a></li><li><a class="tocitem" href="#Iteration-utilities"><span>Iteration utilities</span></a></li><li><a class="tocitem" href="#Tabular-data"><span>Tabular data</span></a></li><li><a class="tocitem" href="#Image-data"><span>Image data</span></a></li><li><a class="tocitem" href="#Text-data"><span>Text data</span></a></li><li class="toplevel"><a class="tocitem" href="#Training"><span>Training</span></a></li><li class="toplevel"><a class="tocitem" href="#Evaluation-and-accuracy"><span>Evaluation and accuracy</span></a></li><li class="toplevel"><a class="tocitem" href="#ImageNet-tools"><span>ImageNet tools</span></a></li><li class="toplevel"><a class="tocitem" href="#Other-utils"><span>Other utils</span></a></li><li><a class="tocitem" href="#Layers-and-helpers-for-transformers"><span>Layers and helpers for transformers</span></a></li><li><a class="tocitem" href="#Utils-for-array-manipulation"><span>Utils for array manipulation</span></a></li><li><a class="tocitem" href="#Utils-for-fixing-types-in-GPU-context"><span>Utils for fixing types in GPU context</span></a></li><li><a class="tocitem" href="#Utils-for-Bioinformatics"><span>Utils for Bioinformatics</span></a></li><li><a class="tocitem" href="#Saving,-loading-and-inspection-of-models"><span>Saving, loading and inspection of models</span></a></li><li><a class="tocitem" href="#Datasets"><span>Datasets</span></a></li><li class="toplevel"><a class="tocitem" href="#Pretrained-networks"><span>Pretrained networks</span></a></li></ul></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Reference</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Reference</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/api.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><p>API doc of all exported functions are listed here:</p><h1 id="Chains"><a class="docs-heading-anchor" href="#Chains">Chains</a><a id="Chains-1"></a><a class="docs-heading-anchor-permalink" href="#Chains" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractNN" href="#NNHelferlein.AbstractNN"><code>NNHelferlein.AbstractNN</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractNN</code></pre><p>Mother type for AbstractNN hierarchy with implementation for a chain of layers.</p><p><strong>Signatures:</strong></p><ul><li><code>(m::AbstractNN)(x)</code>: run the AbstractArray <code>x</code> througth all layers and return                       the output</li><li><code>(m::AbstractNN)(x,y)</code>: Calculate the loss for one minibatch <code>x</code> and teaching input <code>y</code></li><li><code>(m::AbstractNN)(d::Knet.Data)</code>: Calculate the loss for all minibatches in <code>d</code></li><li><code>(m::AbstractNN)(d::Tuple)</code>: Calculate the loss for all minibatches in <code>d</code></li><li><code>(m::AbstractNN)(d::NNHelferlein.DataLoader)</code>: Calculate the loss for all minibatches in <code>d</code>                        if teaching input is included (i.e. elements of d are tuples).                       Otherwise return the out of all minibatches as one array with                        samples as columns.</li></ul><p>```</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L6-L22">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractChain" href="#NNHelferlein.AbstractChain"><code>NNHelferlein.AbstractChain</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractChain</code></pre><p>Mother type for AbstractChain hierarchy with implementation for a chain of layers. By default every <code>AbstractChain</code> has a property <code>layers</code> with a iterable list of  <code>AbstractLayer</code>s or <code>AbstractChain</code>s that are executed recursively.</p><p>Non-standard Chains in which Layers are not execueted sequnetially (such as ResnetBlocks) must provide a custom implementation with the signature <code>chain(x)</code>.</p><p><strong>Signatures:</strong></p><ul><li><code>(m::AbstractChain)(x)</code>: run the AbstractArray <code>x</code> througth all layers and return                       the output</li></ul><p>```</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L37-L51">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.add_layer!" href="#NNHelferlein.add_layer!"><code>NNHelferlein.add_layer!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function add_layer!(n::Union{NNHelferlein.AbstractNN, NNHelferlein.AbstractChain}, l)</code></pre><p>Add a layer <code>l</code> or a chain to a model <code>n</code>. The layer is always added  at the end of the chains.  The modified model is returned.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L125-L131">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="Base.:+" href="#Base.:+"><code>Base.:+</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function +(n::Union{NNHelferlein.AbstractNN, NNHelferlein.AbstractChain}, l::Union{AbstractLayer, AbstractChain})
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Reference · NNHelferlein.jl</title><meta name="title" content="API Reference · NNHelferlein.jl"/><meta property="og:title" content="API Reference · NNHelferlein.jl"/><meta property="twitter:title" content="API Reference · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li class="is-active"><a class="tocitem" href>API Reference</a><ul class="internal"><li class="toplevel"><a class="tocitem" href="#Layers"><span>Layers</span></a></li><li><a class="tocitem" href="#Fully-connected-layers"><span>Fully connected layers</span></a></li><li><a class="tocitem" href="#Convolutional"><span>Convolutional</span></a></li><li><a class="tocitem" href="#Recurrent"><span>Recurrent</span></a></li><li><a class="tocitem" href="#Transformers"><span>Transformers</span></a></li><li><a class="tocitem" href="#Others"><span>Others</span></a></li><li><a class="tocitem" href="#Attention-Mechanisms"><span>Attention Mechanisms</span></a></li><li class="toplevel"><a class="tocitem" href="#Data-providers"><span>Data providers</span></a></li><li><a class="tocitem" href="#Iteration-utilities"><span>Iteration utilities</span></a></li><li><a class="tocitem" href="#Tabular-data"><span>Tabular data</span></a></li><li><a class="tocitem" href="#Image-data"><span>Image data</span></a></li><li><a class="tocitem" href="#Text-data"><span>Text data</span></a></li><li class="toplevel"><a class="tocitem" href="#Training"><span>Training</span></a></li><li class="toplevel"><a class="tocitem" href="#Evaluation-and-accuracy"><span>Evaluation and accuracy</span></a></li><li class="toplevel"><a class="tocitem" href="#ImageNet-tools"><span>ImageNet tools</span></a></li><li class="toplevel"><a class="tocitem" href="#Other-utils"><span>Other utils</span></a></li><li><a class="tocitem" href="#Layers-and-helpers-for-transformers"><span>Layers and helpers for transformers</span></a></li><li><a class="tocitem" href="#Utils-for-array-manipulation"><span>Utils for array manipulation</span></a></li><li><a class="tocitem" href="#Utils-for-fixing-types-in-GPU-context"><span>Utils for fixing types in GPU context</span></a></li><li><a class="tocitem" href="#Utils-for-Bioinformatics"><span>Utils for Bioinformatics</span></a></li><li><a class="tocitem" href="#Saving,-loading-and-inspection-of-models"><span>Saving, loading and inspection of models</span></a></li><li><a class="tocitem" href="#Datasets"><span>Datasets</span></a></li><li class="toplevel"><a class="tocitem" href="#Pretrained-networks"><span>Pretrained networks</span></a></li></ul></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Reference</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Reference</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/api.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><p>API doc of all exported functions are listed here:</p><h1 id="Chains"><a class="docs-heading-anchor" href="#Chains">Chains</a><a id="Chains-1"></a><a class="docs-heading-anchor-permalink" href="#Chains" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractNN" href="#NNHelferlein.AbstractNN"><code>NNHelferlein.AbstractNN</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractNN</code></pre><p>Mother type for AbstractNN hierarchy with implementation for a chain of layers.</p><p><strong>Signatures:</strong></p><ul><li><code>(m::AbstractNN)(x)</code>: run the AbstractArray <code>x</code> througth all layers and return                       the output</li><li><code>(m::AbstractNN)(x,y)</code>: Calculate the loss for one minibatch <code>x</code> and teaching input <code>y</code></li><li><code>(m::AbstractNN)(d::Knet.Data)</code>: Calculate the loss for all minibatches in <code>d</code></li><li><code>(m::AbstractNN)(d::Tuple)</code>: Calculate the loss for all minibatches in <code>d</code></li><li><code>(m::AbstractNN)(d::NNHelferlein.DataLoader)</code>: Calculate the loss for all minibatches in <code>d</code>                        if teaching input is included (i.e. elements of d are tuples).                       Otherwise return the out of all minibatches as one array with                        samples as columns.</li></ul><p>```</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L6-L22">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractChain" href="#NNHelferlein.AbstractChain"><code>NNHelferlein.AbstractChain</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractChain</code></pre><p>Mother type for AbstractChain hierarchy with implementation for a chain of layers. By default every <code>AbstractChain</code> has a property <code>layers</code> with a iterable list of  <code>AbstractLayer</code>s or <code>AbstractChain</code>s that are executed recursively.</p><p>Non-standard Chains in which Layers are not execueted sequnetially (such as ResnetBlocks) must provide a custom implementation with the signature <code>chain(x)</code>.</p><p><strong>Signatures:</strong></p><ul><li><code>(m::AbstractChain)(x)</code>: run the AbstractArray <code>x</code> througth all layers and return                       the output</li></ul><p>```</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L37-L51">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.add_layer!" href="#NNHelferlein.add_layer!"><code>NNHelferlein.add_layer!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function add_layer!(n::Union{NNHelferlein.AbstractNN, NNHelferlein.AbstractChain}, l)</code></pre><p>Add a layer <code>l</code> or a chain to a model <code>n</code>. The layer is always added  at the end of the chains.  The modified model is returned.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L125-L131">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="Base.:+" href="#Base.:+"><code>Base.:+</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function +(n::Union{NNHelferlein.AbstractNN, NNHelferlein.AbstractChain}, l::Union{AbstractLayer, AbstractChain})
 function +(l1::AbstractLayer, l2::Union{AbstractLayer, AbstractChain})</code></pre><p>The <code>plus</code>-operator is overloaded to be able to add layers and chains  to a network.</p><p>The second form returns a new chain if 2 Layers are added.</p><p><strong>Example:</strong></p><pre><code class="language-julia hljs">julia&gt; mdl = Classifier() + Dense(2,5)
 julia&gt; print_network(mdl)
 
@@ -25,33 +25,33 @@
     Dense layer 5 → 1 with identity,                                 6 params
  
 Total number of layers: 3
-Total number of parameters: 51</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L138-L177">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Classifier" href="#NNHelferlein.Classifier"><code>NNHelferlein.Classifier</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Classifier &lt;: AbstractNN</code></pre><p>Classifier with default nll loss. An alternative loss function can be supplied as keyword argument. The function must provide a signature to be called as  <code>loss(model(x), y)</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Classifier(layers...; loss=Knet.nll)</code></pre><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(m::Classifier)(x,y) = m.loss(m(x), y)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L59-L72">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Regressor" href="#NNHelferlein.Regressor"><code>NNHelferlein.Regressor</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Regressor &lt;: AbstractNN</code></pre><p>Regression network with square loss as loss function.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Regressor(layers...; loss=mean_squared_error.nll)</code></pre><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(m::Regression)(x,y) = mean(abs2, Array(m(x)) - y)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L85-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Transformer" href="#NNHelferlein.Transformer"><code>NNHelferlein.Transformer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct Transformer</code></pre><p>A Bert-like transformer network consisting of an encoder and a decoder stack.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">Transformer(n_layers, depth, heads; drop_rate=0.1)</code></pre><ul><li><code>n_layers</code>: number of layers in encoder and decoder</li><li><code>depth</code>: embedding depth</li><li><code>heads</code>: number of heads for the multi-head attention</li><li><code>drop_rate</code>: dropout rate used in all layers</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(tf::Transformer)(x, y; enc_mask=nothing, dec_mask=nothing)</code></pre><p>The transformer is called with two 3-d-arrays of embedded sequences <code>x</code> and <code>y</code> of size <code>[depth, seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len_y, n_minibatch]</code>.  Sequences <code>x</code> and <code>y</code> may be of different lengths; output has always the same dimensions as <code>y</code>.</p><p>Attention factors of the last run  are stored in the field <code>α</code> of the transformer object.</p><p><code>enc_mask</code> and <code>dec_mask</code> are optional padding masks for the encoder and decoder input, respectively. They must be of size <code>[seq_len, n_minibatch]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L497-L527">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TokenTransformer" href="#NNHelferlein.TokenTransformer"><code>NNHelferlein.TokenTransformer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct TokenTransformer</code></pre><p>A wrapper around the <code>Transformer</code> object that takes sequences of token ids as input.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TokenTransformer(n_layers, depth, heads, 
+Total number of parameters: 51</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L138-L177">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Classifier" href="#NNHelferlein.Classifier"><code>NNHelferlein.Classifier</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Classifier &lt;: AbstractNN</code></pre><p>Classifier with default nll loss. An alternative loss function can be supplied as keyword argument. The function must provide a signature to be called as  <code>loss(model(x), y)</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Classifier(layers...; loss=Knet.nll)</code></pre><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(m::Classifier)(x,y) = m.loss(m(x), y)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L59-L72">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Regressor" href="#NNHelferlein.Regressor"><code>NNHelferlein.Regressor</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Regressor &lt;: AbstractNN</code></pre><p>Regression network with square loss as loss function.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Regressor(layers...; loss=mean_squared_error.nll)</code></pre><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(m::Regression)(x,y) = mean(abs2, Array(m(x)) - y)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L85-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Transformer" href="#NNHelferlein.Transformer"><code>NNHelferlein.Transformer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct Transformer</code></pre><p>A Bert-like transformer network consisting of an encoder and a decoder stack.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">Transformer(n_layers, depth, heads; drop_rate=0.1)</code></pre><ul><li><code>n_layers</code>: number of layers in encoder and decoder</li><li><code>depth</code>: embedding depth</li><li><code>heads</code>: number of heads for the multi-head attention</li><li><code>drop_rate</code>: dropout rate used in all layers</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(tf::Transformer)(x, y; enc_mask=nothing, dec_mask=nothing)</code></pre><p>The transformer is called with two 3-d-arrays of embedded sequences <code>x</code> and <code>y</code> of size <code>[depth, seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len_y, n_minibatch]</code>.  Sequences <code>x</code> and <code>y</code> may be of different lengths; output has always the same dimensions as <code>y</code>.</p><p>Attention factors of the last run  are stored in the field <code>α</code> of the transformer object.</p><p><code>enc_mask</code> and <code>dec_mask</code> are optional padding masks for the encoder and decoder input, respectively. They must be of size <code>[seq_len, n_minibatch]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L497-L527">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TokenTransformer" href="#NNHelferlein.TokenTransformer"><code>NNHelferlein.TokenTransformer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct TokenTransformer</code></pre><p>A wrapper around the <code>Transformer</code> object that takes sequences of token ids as input.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TokenTransformer(n_layers, depth, heads, 
                  x_vocab, y_vocab;
                  drop_rate=0.1)</code></pre><ul><li><code>n_layers</code>: number of layers in encoder and decoder</li><li><code>depth</code>: embedding depth</li><li><code>heads</code>: number of heads for the multi-head attention</li><li><code>x_vocab</code>: vocabulary size of the input sequences as integer value            or a <code>WordTokenizer</code> object</li><li><code>y_vocab</code>: vocabulary size of the output sequences as integer value               or a <code>WordTokenizer</code> object</li><li><code>drop_rate</code>: dropout rate used in all layers</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">    (tt::TokenTransformer)(x, y; enc_mask=nothing, dec_mask=nothing
-                           embedded=true)</code></pre><p>The transformer is called with two 2-d-arrays of token ids <code>x</code> and <code>y</code> of size <code>[seq_len, n_minibatch]</code> which may be of  different lengths. It returns a tensor of size <code>[y_vocab, seq_len_y, n_minibatch]</code> with the raw activations  of output neurons or, if <code>embedded</code> is set to <code>false</code>, a 2-d-array of size <code>[seq_len_y, n_minibatch]</code> with the sequences of generated tokens.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L552-L585">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Chain" href="#NNHelferlein.Chain"><code>NNHelferlein.Chain</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Chain &lt;: AbstractChain</code></pre><p>Simple wrapper to chain layers and execute them one after another.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L108-L112">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.VAE" href="#NNHelferlein.VAE"><code>NNHelferlein.VAE</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct VAE   &lt;: AbstractNN</code></pre><p>Type for a generic variational autoencoder.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">VAE(encoder, decoder)</code></pre><p>Separate predefind chains (ideally, but not necessarily of type <code>Chain</code>)  for encoder and decoder must be specified. The VAE needs the 2 parameters mean and variance to define the distribution of each code-neuron in the bottleneck-layer. In consequence the encoder output must be 2 times  the size of the decoder input (in case of dense layers: if encoder output is a 8-value vector, 4 codes are defined and the decoder input is a 4-value vector; in case of convolutional layers the number of encoder output channels must be 2 times the number of the encoder input channels - see the examples). </p><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(vae::VAE)(x)
+                           embedded=true)</code></pre><p>The transformer is called with two 2-d-arrays of token ids <code>x</code> and <code>y</code> of size <code>[seq_len, n_minibatch]</code> which may be of  different lengths. It returns a tensor of size <code>[y_vocab, seq_len_y, n_minibatch]</code> with the raw activations  of output neurons or, if <code>embedded</code> is set to <code>false</code>, a 2-d-array of size <code>[seq_len_y, n_minibatch]</code> with the sequences of generated tokens.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L552-L585">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Chain" href="#NNHelferlein.Chain"><code>NNHelferlein.Chain</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Chain &lt;: AbstractChain</code></pre><p>Simple wrapper to chain layers and execute them one after another.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L108-L112">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.VAE" href="#NNHelferlein.VAE"><code>NNHelferlein.VAE</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct VAE   &lt;: AbstractNN</code></pre><p>Type for a generic variational autoencoder.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">VAE(encoder, decoder)</code></pre><p>Separate predefind chains (ideally, but not necessarily of type <code>Chain</code>)  for encoder and decoder must be specified. The VAE needs the 2 parameters mean and variance to define the distribution of each code-neuron in the bottleneck-layer. In consequence the encoder output must be 2 times  the size of the decoder input (in case of dense layers: if encoder output is a 8-value vector, 4 codes are defined and the decoder input is a 4-value vector; in case of convolutional layers the number of encoder output channels must be 2 times the number of the encoder input channels - see the examples). </p><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">(vae::VAE)(x)
 (vae::VAE)(x,y)</code></pre><p>Called with one argument, predict will be executed;  with two arguments (args x and y should be identical for the autoencoder) the loss will be returned.    </p><p><strong>Details:</strong></p><p>The loss is calculated as the sum of element-wise error squares plus the <em>Kullback-Leibler-Divergence</em> to adapt the distributions of the bottleneck codes:</p><p class="math-container">\[\mathcal{L} = \frac{1}{2} \sum_{i=1}^{n_{outputs}} (t_{i}-o_{i})^{2} - 
-               \frac{1}{2} \sum_{j=1}^{n_{codes}}(1 + ln\sigma_{c_j}^{2}-\mu_{c_j}^{2}-\sigma_{c_j}^{2}) \]</p><p>Output of the autoencoder is cropped to the size of input before loss calculation (and before prediction); i.e. the output has always the same dimensions as the input, even if the last layer generates a bigger shape.</p><p><strong>KL-training parameters:</strong></p><p>The parameter β is by default set to 1.0, i.e. mean-squared error and KL  has the same weights. The functions <code>set_beta(vae, beta)</code> and <code>get_beta(vae)</code> can be used to set and get the β used in training. With β=0.0 no KL-loss will be used.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L293-L336">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_beta" href="#NNHelferlein.get_beta"><code>NNHelferlein.get_beta</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_beta(vae::VAE; ramp=false)</code></pre><p>Return a <code>Dict</code> with the current VAE-parameters beta and ramp-up.</p><p><strong>Arguments:</strong></p><ul><li><code>ramp=false</code>: if <code>true</code>, a vector of β for all ramp-up steps is returned.               This way, the ramp-up phase can be visualised:               &lt;img src=&quot;./assets/vae-beta-range.png&quot;/&gt;</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L344-L353">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_beta!" href="#NNHelferlein.set_beta!"><code>NNHelferlein.set_beta!</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function set<em>beta!(vae::VAE, β</em>max; ramp_up=false, steps=0)</p><p>Helper to set the current value of the VAE-parameter beta and ramp-up settings.</p><p>VAE loss is calculated as (mean of error squares) + β * (mean of KL divergence).</p><p><strong>Ramp-up:</strong></p><p>In case of <code>ramp_up=true</code>, β starts with almost 0.0 (<code>sigm(-10.0)</code> ≈4.5e-5) and  reaches almost 1.0 after <code>steps</code> steps, following a sigmoid curve. <code>steps</code> should be more than 25, to avoid rounding errors in the calculation of the derivative of the sigmoid function.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L372-L385">source</a></section></article><h1 id="Layers"><a class="docs-heading-anchor" href="#Layers">Layers</a><a id="Layers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractLayer" href="#NNHelferlein.AbstractLayer"><code>NNHelferlein.AbstractLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractLayer
-abstract type Layer</code></pre><p>Mother type for layers hierarchy. (The type <code>Layer</code> is kept for backward compatibility)</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/types.jl#L1-L7">source</a></section></article><h2 id="Fully-connected-layers"><a class="docs-heading-anchor" href="#Fully-connected-layers">Fully connected layers</a><a id="Fully-connected-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Fully-connected-layers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Dense" href="#NNHelferlein.Dense"><code>NNHelferlein.Dense</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Dense  &lt;: AbstractLayer</code></pre><p>Default Dense layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Dense(w, b, actf)</code>: default constructor, <code>w</code> are the weights and <code>b</code> the bias.</li><li><code>Dense(i::Int, j::Int; actf=sigm, init=..)</code>: layer of <code>j</code> neurons with       <code>i</code> inputs. Initialiser is xavier<em>uniform for  <code>actf=sigm</code> and       xaview</em>normal otherwise.</li><li><code>Dense(h5::HDF5.File, group::String; trainable=false, actf=sigm)</code>: kernel and bias are loaded by the specified <code>group</code>.</li><li><code>Dense(h5::HDF5.File, kernel::String, bias::String;       trainable=false, actf=sigm)</code>: layer       imported from a hdf5-file from TensorFlow with the       hdf-object h5 and the group name group.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L7-L22">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Linear" href="#NNHelferlein.Linear"><code>NNHelferlein.Linear</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Linear  &lt;: AbstractLayer</code></pre><p>Almost standard dense layer, but functionality inspired by the TensorFlow-layer:</p><ul><li>capable to work with input tensors of any number of dimensions</li><li>default activation function <code>identity</code></li><li>optionally without biases.</li></ul><p>The shape of the input tensor is preserved; only the size of the first dim is changed from in to out.</p><p><strong>Constructors:</strong></p><ul><li><code>Linear(i::Int, j::Int; bias=true, actf=identity, init=xaview_normal)</code>        where <code>i</code> is fan-in and <code>j</code> is fan-out.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>bias=true</code>: if false biases are fixed to 0.0</li><li><code>actf=identity</code>: activation function.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L77-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Embed" href="#NNHelferlein.Embed"><code>NNHelferlein.Embed</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Embed &lt;: AbstractLayer</code></pre><p>Simple type for an embedding layer to embed a virtual onehot-vector into a smaller number of neurons by linear combination. The onehot-vector is virtual, because not the vector, but only the index of the &quot;one&quot; in the vector has to be provided as Integer value (or a minibatch of integers) with values between 1 and the vocab size.</p><p><strong>Constructors:</strong></p><ul><li><code>Embed(v,d; actf=identity, mask=nothing):</code> with   vocab size <code>v</code>, embedding depth <code>d</code> and default activation function identity.   <code>mask</code> defines the padding token (see below).</li></ul><p><strong>Signatures:</strong></p><ul><li><code>(l::Embed)(x)</code>: default embedding of input tensor <code>x</code>.</li></ul><p><strong>Value:</strong></p><p>The embedding is constructed by adding a first dimension to the input tensor with number of rows = embedding depth. If <code>x</code> is a column vector, the value is a matrix. If <code>x</code> is as row-vector or a matrix, the value is a 3-d array, etc.</p><p><strong>Padding and masking:</strong></p><p>If a token value is defined as <code>mask</code>, occurences are embedded as zero vector. This can be used for padding sequence with zeros. The masking/padding token counts to the vocab size. If padding tokens are not masked, their embedding will be optimised during training (which is not recommended but still possible for many applications).</p><p>Zero may be used as padding token, but it must count to the vocab size  (i.e. the vocab size must be one larger than the number of tokens) and the keyword arg <code>mask=0</code> must be specified.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L703-L737">source</a></section></article><h2 id="Convolutional"><a class="docs-heading-anchor" href="#Convolutional">Convolutional</a><a id="Convolutional-1"></a><a class="docs-heading-anchor-permalink" href="#Convolutional" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Conv" href="#NNHelferlein.Conv"><code>NNHelferlein.Conv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Conv  &lt;: AbstractLayer</code></pre><p>Default Conv layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Conv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu; kwargs...)</code>: layer with   o kernels of size (w1,w2) for an input of i channels.</li><li><code>Conv(w1::Int, w2::Int, w3::Int, i::Int, o::Int; actf=relu; kwargs...)</code>: layer        with 3-dimensional kernels for 3D convolution        (requires 5-dimensional input)</li><li><code>Conv(w1::Int,  i::Int, o::Int; actf=relu; kwargs...)</code>: layer with   o kernels of size (1,w1) for an input of i channels.   This 1-dimensional convolution uses a 2-dimensional kernel with a first    dimension of size 1. Input and output contain an empty firfst dimension   of size 1. If <code>padding</code>, <code>stride</code> or <code>dilation</code> are specified, 2-tuples   must be specified to correspond with the 2-dimensional kernel   (e.g. <code>padding=(0,1)</code> for a 1-padding along the 1D sequence).</li></ul><p><strong>Constructors to read parameters from Tensorflow/Keras HDF-files:</strong></p><ul><li><code>Conv(h5::HDF5.File, kernel::String, bias::String; trainable=false, actf=Knet.relu,   use_bias=true, kwargs...)</code>:       Import parameters from HDF file <code>h5</code> with <code>kernel</code> and <code>bias</code> specifying       the full path to weights and biases, respectively.</li><li><code>Conv(h5::HDF5.File, group::String; trainable=false, actf=relu, tf=true, use_bias=true)</code>:       Import a conv-layer from a default TF/Keras HDF5 file.        If <code>tf=false</code>, <code>group</code> defines the full path to the parameters       <code>group/kernel:0</code> and <code>group/bias:0</code>.        If <code>tf=true</code>, <code>group</code> defines the  only the group name and        parameters are addressed as <code>model_weights/group/group/kernel:0</code> and       <code>model_weights/group/group/bias:0</code>.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension.</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window.</li><li><code>dilation=1</code>: dilation factor for each dimension.</li><li><code>...</code> See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>conv4()</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L159-L200">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DeConv" href="#NNHelferlein.DeConv"><code>NNHelferlein.DeConv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct DeConv  &lt;: AbstractLayer</code></pre><p>Default deconvolution layer.</p><p><strong>Constructors:</strong></p><ul><li><code>DeConv(w, b, actf, kwargs...)</code>: default constructor</li><li><code>DeConv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   o kernels of size (w1,w2) for an input of i channels.</li><li><code>DeConv(w1::Int, w2::Int, w3::Int, i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   o kernels of size (w1,w2,w3) for an input of i channels.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension (applied to the output).</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window       (applied to the output).</li><li><code>...</code> See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>deconv4()</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L550-L571">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.ResNetBlock" href="#NNHelferlein.ResNetBlock"><code>NNHelferlein.ResNetBlock</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct ResNetBlock &lt;: AbstractChain</code></pre><p>Executable type for one block of a ResNet-type network.</p><p><strong>Constructors:</strong></p><ul><li><code>ResNetBlock(layers; shortcut=[identity], post=[identity])</code>:       3 chains to form the block:        the main chain, the shortcut and a chain of layers        to be added after the confluence.       All chains must be specified as lists, even if they are        empty (<code>[]</code>) or comprise only one layer       (<code>[BatchNorm]</code>).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/pretrained.jl#L463-L476">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DepthwiseConv" href="#NNHelferlein.DepthwiseConv"><code>NNHelferlein.DepthwiseConv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">DepthwiseConv  &lt;: AbstractLayer</code></pre><p>Conv layer with seperate filters per input channel.  <em>o</em> output feature maps will be created by performing a convolution  on only one input channel. <code>o</code> must be a multiple of <code>i</code>.</p><p><strong>Constructors:</strong></p><ul><li><code>DepthwiseConv(w, b, actf; kwargs)</code>: default constructor</li><li><code>Conv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   <code>o</code> kernels of size (w1,w2) for every input channel of an 2-d input of <code>i</code> layers.   <code>o</code> must be a multiple of <code>i</code>; if <code>o == i</code>, each output feature map is    generated from one channel. If <code>o == n*i</code>, <code>n</code> feature maps are    generated from each channel.    </li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension.</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window.</li><li><code>dilation=1</code>: dilation factor for each dimension.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L286-L306">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Pool" href="#NNHelferlein.Pool"><code>NNHelferlein.Pool</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Pool &lt;: AbstractLayer</code></pre><p>Pooling layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Pool(;kwargs...)</code>: max pooling; without <code>kwargs</code>, 2-pooling       is performed.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>window=2</code>: pooling <code>window</code> size (same for all directions)</li><li><code>...</code>: See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>pool</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L395-L409">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.UnPool" href="#NNHelferlein.UnPool"><code>NNHelferlein.UnPool</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct UnPool &lt;: AbstractLayer</code></pre><p>Unpooling layer.</p><p><strong>Constructors:</strong></p><ul><li><code>UnPool(;kwargs...)</code>: user-defined unpooling</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L608-L615">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Pad" href="#NNHelferlein.Pad"><code>NNHelferlein.Pad</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Pad     &lt;: AbstractLayer</code></pre><p>Pad an n-dimensional array along dimensions with one of the types &#39;:zeros&#39; (default), &#39;:ones&#39;.</p><p><strong>Constructors:</strong></p><ul><li><code>Pad(padding::Int...; mode=:zeros)</code>: Pad with <code>padding</code>           along all specified dims.           If <code>padding</code> is a single integer, it is applied to all            but the last 2 dims (i.e. in context of a CNN the channel and            minibatch dimension will be excluded from padding).            If more then one padding value is            specified, the values will be applied to the dims in the           order they are specified and missing values will be filled           with zeros.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>mode</code>: one of <ul><li><code>:zeros</code>: zero-padding</li><li><code>:ones</code>: one-padding</li></ul></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L433-L454">source</a></section></article><h2 id="Recurrent"><a class="docs-heading-anchor" href="#Recurrent">Recurrent</a><a id="Recurrent-1"></a><a class="docs-heading-anchor-permalink" href="#Recurrent" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.RecurrentUnit" href="#NNHelferlein.RecurrentUnit"><code>NNHelferlein.RecurrentUnit</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type RecurrentUnit end</code></pre><p>Supertype for all recurrent unit types. Self-defined recurrent units which are a child of <code>RecurrentUnit</code> can be used inside the &#39;Recurrent&#39; layer.</p><p><strong>Interface</strong></p><p>All subtypes of <code>RecurrentUnit</code> must provide the followning:</p><ul><li>a constructor with signature <code>Type(n_inputs, n_units; kwargs)</code> and   arbitrary keyword arguments.</li><li>an implementation of signature <code>(o::Recurrent)(x)</code>   where <code>x</code> is a 3d- or 2d-array of shape [fan-in, mb-size, 1] or    [fan-in, mb-size].   The function must return the result of one forward    computation for one step and return the hidden state   and set the internal fields <code>h</code> and optionally <code>c</code>.</li><li>a field <code>h</code> (to store the last hidden state)</li><li>an optional field <code>c</code>, if the cell state is to be stored   such as in a lstm unit.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/types.jl#L22-L43">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Recurrent" href="#NNHelferlein.Recurrent"><code>NNHelferlein.Recurrent</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Recurrent &lt;: AbstractLayer</code></pre><p>One layer RNN that works with minibatches of (time) series data. Minibatch can be a 2- or 3-dimensional Array. If 2-d, inputs for one step are in one column and the Array has as many colums as steps. If 3-d, the last dimension iterates the samples of the minibatch.</p><p>Result is an array matrix with the output of the units of all steps for all smaples of the minibatch (with model depth as first and samples of the minimatch as last dimension).</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Recurrent(n_inputs::Int, n_units::Int; u_type=:lstm, 
+               \frac{1}{2} \sum_{j=1}^{n_{codes}}(1 + ln\sigma_{c_j}^{2}-\mu_{c_j}^{2}-\sigma_{c_j}^{2}) \]</p><p>Output of the autoencoder is cropped to the size of input before loss calculation (and before prediction); i.e. the output has always the same dimensions as the input, even if the last layer generates a bigger shape.</p><p><strong>KL-training parameters:</strong></p><p>The parameter β is by default set to 1.0, i.e. mean-squared error and KL  has the same weights. The functions <code>set_beta(vae, beta)</code> and <code>get_beta(vae)</code> can be used to set and get the β used in training. With β=0.0 no KL-loss will be used.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L293-L336">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_beta" href="#NNHelferlein.get_beta"><code>NNHelferlein.get_beta</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_beta(vae::VAE; ramp=false)</code></pre><p>Return a <code>Dict</code> with the current VAE-parameters beta and ramp-up.</p><p><strong>Arguments:</strong></p><ul><li><code>ramp=false</code>: if <code>true</code>, a vector of β for all ramp-up steps is returned.               This way, the ramp-up phase can be visualised:               &lt;img src=&quot;./assets/vae-beta-range.png&quot;/&gt;</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L344-L353">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_beta!" href="#NNHelferlein.set_beta!"><code>NNHelferlein.set_beta!</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function set<em>beta!(vae::VAE, β</em>max; ramp_up=false, steps=0)</p><p>Helper to set the current value of the VAE-parameter beta and ramp-up settings.</p><p>VAE loss is calculated as (mean of error squares) + β * (mean of KL divergence).</p><p><strong>Ramp-up:</strong></p><p>In case of <code>ramp_up=true</code>, β starts with almost 0.0 (<code>sigm(-10.0)</code> ≈4.5e-5) and  reaches almost 1.0 after <code>steps</code> steps, following a sigmoid curve. <code>steps</code> should be more than 25, to avoid rounding errors in the calculation of the derivative of the sigmoid function.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L372-L385">source</a></section></article><h1 id="Layers"><a class="docs-heading-anchor" href="#Layers">Layers</a><a id="Layers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AbstractLayer" href="#NNHelferlein.AbstractLayer"><code>NNHelferlein.AbstractLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AbstractLayer
+abstract type Layer</code></pre><p>Mother type for layers hierarchy. (The type <code>Layer</code> is kept for backward compatibility)</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/types.jl#L1-L7">source</a></section></article><h2 id="Fully-connected-layers"><a class="docs-heading-anchor" href="#Fully-connected-layers">Fully connected layers</a><a id="Fully-connected-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Fully-connected-layers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Dense" href="#NNHelferlein.Dense"><code>NNHelferlein.Dense</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Dense  &lt;: AbstractLayer</code></pre><p>Default Dense layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Dense(w, b, actf)</code>: default constructor, <code>w</code> are the weights and <code>b</code> the bias.</li><li><code>Dense(i::Int, j::Int; actf=sigm, init=..)</code>: layer of <code>j</code> neurons with       <code>i</code> inputs. Initialiser is xavier<em>uniform for  <code>actf=sigm</code> and       xaview</em>normal otherwise.</li><li><code>Dense(h5::HDF5.File, group::String; trainable=false, actf=sigm)</code>: kernel and bias are loaded by the specified <code>group</code>.</li><li><code>Dense(h5::HDF5.File, kernel::String, bias::String;       trainable=false, actf=sigm)</code>: layer       imported from a hdf5-file from TensorFlow with the       hdf-object h5 and the group name group.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L7-L22">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Linear" href="#NNHelferlein.Linear"><code>NNHelferlein.Linear</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Linear  &lt;: AbstractLayer</code></pre><p>Almost standard dense layer, but functionality inspired by the TensorFlow-layer:</p><ul><li>capable to work with input tensors of any number of dimensions</li><li>default activation function <code>identity</code></li><li>optionally without biases.</li></ul><p>The shape of the input tensor is preserved; only the size of the first dim is changed from in to out.</p><p><strong>Constructors:</strong></p><ul><li><code>Linear(i::Int, j::Int; bias=true, actf=identity, init=xaview_normal)</code>        where <code>i</code> is fan-in and <code>j</code> is fan-out.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>bias=true</code>: if false biases are fixed to 0.0</li><li><code>actf=identity</code>: activation function.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L77-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Embed" href="#NNHelferlein.Embed"><code>NNHelferlein.Embed</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Embed &lt;: AbstractLayer</code></pre><p>Simple type for an embedding layer to embed a virtual onehot-vector into a smaller number of neurons by linear combination. The onehot-vector is virtual, because not the vector, but only the index of the &quot;one&quot; in the vector has to be provided as Integer value (or a minibatch of integers) with values between 1 and the vocab size.</p><p><strong>Constructors:</strong></p><ul><li><code>Embed(v,d; actf=identity, mask=nothing):</code> with   vocab size <code>v</code>, embedding depth <code>d</code> and default activation function identity.   <code>mask</code> defines the padding token (see below).</li></ul><p><strong>Signatures:</strong></p><ul><li><code>(l::Embed)(x)</code>: default embedding of input tensor <code>x</code>.</li></ul><p><strong>Value:</strong></p><p>The embedding is constructed by adding a first dimension to the input tensor with number of rows = embedding depth. If <code>x</code> is a column vector, the value is a matrix. If <code>x</code> is as row-vector or a matrix, the value is a 3-d array, etc.</p><p><strong>Padding and masking:</strong></p><p>If a token value is defined as <code>mask</code>, occurences are embedded as zero vector. This can be used for padding sequence with zeros. The masking/padding token counts to the vocab size. If padding tokens are not masked, their embedding will be optimised during training (which is not recommended but still possible for many applications).</p><p>Zero may be used as padding token, but it must count to the vocab size  (i.e. the vocab size must be one larger than the number of tokens) and the keyword arg <code>mask=0</code> must be specified.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L703-L737">source</a></section></article><h2 id="Convolutional"><a class="docs-heading-anchor" href="#Convolutional">Convolutional</a><a id="Convolutional-1"></a><a class="docs-heading-anchor-permalink" href="#Convolutional" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Conv" href="#NNHelferlein.Conv"><code>NNHelferlein.Conv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Conv  &lt;: AbstractLayer</code></pre><p>Default Conv layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Conv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu; kwargs...)</code>: layer with   o kernels of size (w1,w2) for an input of i channels.</li><li><code>Conv(w1::Int, w2::Int, w3::Int, i::Int, o::Int; actf=relu; kwargs...)</code>: layer        with 3-dimensional kernels for 3D convolution        (requires 5-dimensional input)</li><li><code>Conv(w1::Int,  i::Int, o::Int; actf=relu; kwargs...)</code>: layer with   o kernels of size (1,w1) for an input of i channels.   This 1-dimensional convolution uses a 2-dimensional kernel with a first    dimension of size 1. Input and output contain an empty firfst dimension   of size 1. If <code>padding</code>, <code>stride</code> or <code>dilation</code> are specified, 2-tuples   must be specified to correspond with the 2-dimensional kernel   (e.g. <code>padding=(0,1)</code> for a 1-padding along the 1D sequence).</li></ul><p><strong>Constructors to read parameters from Tensorflow/Keras HDF-files:</strong></p><ul><li><code>Conv(h5::HDF5.File, kernel::String, bias::String; trainable=false, actf=Knet.relu,   use_bias=true, kwargs...)</code>:       Import parameters from HDF file <code>h5</code> with <code>kernel</code> and <code>bias</code> specifying       the full path to weights and biases, respectively.</li><li><code>Conv(h5::HDF5.File, group::String; trainable=false, actf=relu, tf=true, use_bias=true)</code>:       Import a conv-layer from a default TF/Keras HDF5 file.        If <code>tf=false</code>, <code>group</code> defines the full path to the parameters       <code>group/kernel:0</code> and <code>group/bias:0</code>.        If <code>tf=true</code>, <code>group</code> defines the  only the group name and        parameters are addressed as <code>model_weights/group/group/kernel:0</code> and       <code>model_weights/group/group/bias:0</code>.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension.</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window.</li><li><code>dilation=1</code>: dilation factor for each dimension.</li><li><code>...</code> See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>conv4()</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L159-L200">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DeConv" href="#NNHelferlein.DeConv"><code>NNHelferlein.DeConv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct DeConv  &lt;: AbstractLayer</code></pre><p>Default deconvolution layer.</p><p><strong>Constructors:</strong></p><ul><li><code>DeConv(w, b, actf, kwargs...)</code>: default constructor</li><li><code>DeConv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   o kernels of size (w1,w2) for an input of i channels.</li><li><code>DeConv(w1::Int, w2::Int, w3::Int, i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   o kernels of size (w1,w2,w3) for an input of i channels.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension (applied to the output).</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window       (applied to the output).</li><li><code>...</code> See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>deconv4()</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L550-L571">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.ResNetBlock" href="#NNHelferlein.ResNetBlock"><code>NNHelferlein.ResNetBlock</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct ResNetBlock &lt;: AbstractChain</code></pre><p>Executable type for one block of a ResNet-type network.</p><p><strong>Constructors:</strong></p><ul><li><code>ResNetBlock(layers; shortcut=[identity], post=[identity])</code>:       3 chains to form the block:        the main chain, the shortcut and a chain of layers        to be added after the confluence.       All chains must be specified as lists, even if they are        empty (<code>[]</code>) or comprise only one layer       (<code>[BatchNorm]</code>).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/pretrained.jl#L463-L476">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DepthwiseConv" href="#NNHelferlein.DepthwiseConv"><code>NNHelferlein.DepthwiseConv</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">DepthwiseConv  &lt;: AbstractLayer</code></pre><p>Conv layer with seperate filters per input channel.  <em>o</em> output feature maps will be created by performing a convolution  on only one input channel. <code>o</code> must be a multiple of <code>i</code>.</p><p><strong>Constructors:</strong></p><ul><li><code>DepthwiseConv(w, b, actf; kwargs)</code>: default constructor</li><li><code>Conv(w1::Int, w2::Int,  i::Int, o::Int; actf=relu, kwargs...)</code>: layer with   <code>o</code> kernels of size (w1,w2) for every input channel of an 2-d input of <code>i</code> layers.   <code>o</code> must be a multiple of <code>i</code>; if <code>o == i</code>, each output feature map is    generated from one channel. If <code>o == n*i</code>, <code>n</code> feature maps are    generated from each channel.    </li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>padding=0</code>: the number of extra zeros implicitly concatenated       at the start and end of each dimension.</li><li><code>stride=1</code>: the number of elements to slide to reach the next filtering window.</li><li><code>dilation=1</code>: dilation factor for each dimension.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L286-L306">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Pool" href="#NNHelferlein.Pool"><code>NNHelferlein.Pool</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Pool &lt;: AbstractLayer</code></pre><p>Pooling layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Pool(;kwargs...)</code>: max pooling; without <code>kwargs</code>, 2-pooling       is performed.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>window=2</code>: pooling <code>window</code> size (same for all directions)</li><li><code>...</code>: See the Knet documentation for Details:       https://denizyuret.github.io/Knet.jl/latest/reference/#Convolution-and-Pooling.       All keywords to the Knet function <code>pool</code> are supported.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L395-L409">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.UnPool" href="#NNHelferlein.UnPool"><code>NNHelferlein.UnPool</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct UnPool &lt;: AbstractLayer</code></pre><p>Unpooling layer.</p><p><strong>Constructors:</strong></p><ul><li><code>UnPool(;kwargs...)</code>: user-defined unpooling</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L608-L615">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Pad" href="#NNHelferlein.Pad"><code>NNHelferlein.Pad</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Pad     &lt;: AbstractLayer</code></pre><p>Pad an n-dimensional array along dimensions with one of the types &#39;:zeros&#39; (default), &#39;:ones&#39;.</p><p><strong>Constructors:</strong></p><ul><li><code>Pad(padding::Int...; mode=:zeros)</code>: Pad with <code>padding</code>           along all specified dims.           If <code>padding</code> is a single integer, it is applied to all            but the last 2 dims (i.e. in context of a CNN the channel and            minibatch dimension will be excluded from padding).            If more then one padding value is            specified, the values will be applied to the dims in the           order they are specified and missing values will be filled           with zeros.</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>mode</code>: one of <ul><li><code>:zeros</code>: zero-padding</li><li><code>:ones</code>: one-padding</li></ul></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L433-L454">source</a></section></article><h2 id="Recurrent"><a class="docs-heading-anchor" href="#Recurrent">Recurrent</a><a id="Recurrent-1"></a><a class="docs-heading-anchor-permalink" href="#Recurrent" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.RecurrentUnit" href="#NNHelferlein.RecurrentUnit"><code>NNHelferlein.RecurrentUnit</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type RecurrentUnit end</code></pre><p>Supertype for all recurrent unit types. Self-defined recurrent units which are a child of <code>RecurrentUnit</code> can be used inside the &#39;Recurrent&#39; layer.</p><p><strong>Interface</strong></p><p>All subtypes of <code>RecurrentUnit</code> must provide the followning:</p><ul><li>a constructor with signature <code>Type(n_inputs, n_units; kwargs)</code> and   arbitrary keyword arguments.</li><li>an implementation of signature <code>(o::Recurrent)(x)</code>   where <code>x</code> is a 3d- or 2d-array of shape [fan-in, mb-size, 1] or    [fan-in, mb-size].   The function must return the result of one forward    computation for one step and return the hidden state   and set the internal fields <code>h</code> and optionally <code>c</code>.</li><li>a field <code>h</code> (to store the last hidden state)</li><li>an optional field <code>c</code>, if the cell state is to be stored   such as in a lstm unit.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/types.jl#L22-L43">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Recurrent" href="#NNHelferlein.Recurrent"><code>NNHelferlein.Recurrent</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Recurrent &lt;: AbstractLayer</code></pre><p>One layer RNN that works with minibatches of (time) series data. Minibatch can be a 2- or 3-dimensional Array. If 2-d, inputs for one step are in one column and the Array has as many colums as steps. If 3-d, the last dimension iterates the samples of the minibatch.</p><p>Result is an array matrix with the output of the units of all steps for all smaples of the minibatch (with model depth as first and samples of the minimatch as last dimension).</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Recurrent(n_inputs::Int, n_units::Int; u_type=:lstm, 
           bidirectional=false, allow_mask=false, o...)</code></pre><ul><li><code>n_inputs</code>: number of inputs</li><li><code>n_units</code>:  number of units </li><li><code>u_type</code> :  unit type can be one of the Knet unit types       (<code>:relu, :tanh, :lstm, :gru</code>) or a type which must be a        subtype of <code>RecurrentUnit</code> and fullfill the respective interface        (see the docs for <code>RecurentUnit</code>).</li><li><code>bidirectional=false</code>: if true, 2 layers of <code>n_units</code> units will be defined       and run in forward and backward direction respectively. The hidden       state is <code>[2*n_units*mb]</code> or <code>[2*n_units,steps,mb]</code> id <code>return_all==true</code>.</li><li><code>allow_mask=false</code>: if masking is allowed, a slower algorithm is used to be        able to ignore any masked step. Arbitrary sequence positions may be        masked for any sequence.</li></ul><p>Any keyword argument of <code>Knet.RNN</code> or  a self-defined <code>RecurrentUnit</code> type may be provided.</p><p><strong>Signatures:</strong></p><pre><code class="nohighlight hljs">function (rnn::Recurrent)(x; c=nothing, h=nothing, return_all=false, 
-          mask=nothing)</code></pre><p>The layer is called either with a 2-dimensional array of the shape [fan-in, steps]  or a 3-dimensional array of [fan-in, steps, batchsize].</p><p><strong>Arguments:</strong></p><ul><li><code>c=0</code>, <code>h=0</code>: inits the hidden and cell state.   If <code>nothing</code>,  states <code>h</code> or <code>c</code> keep their values.    If <code>c=0</code> or <code>h=0</code>, the states are resetted to <code>0</code>;   otherwise an array of states of the correct dimensions can be supplied    to be used as initial states.</li><li><code>return_all=false</code>: if <code>true</code> an array with all hidden states of all steps    is returned (size is [units, time-steps, minibatch]).   Otherwise only the hidden states of the last step are returned   ([units, minibatch]).</li><li><code>mask</code>: optional mask for the input sequence minibatch of shape    [steps, minibatch]. Values in the mask must be 1.0 for masked positions   or 0.0 otherwise and of type <code>Float32</code> or <code>CuArray{Float32}</code> for GPU context.    Appropriate masks can be generated with the NNHelferlein function    <code>mk_padding_mask()</code>.</li></ul><p>Bidirectional layers can be constructed by specifying <code>bidirectional=true</code>, if the unit-type supports it (Knet.RNN does).  Please be aware that the actual number of units is 2 x n_units for  bidirectional layers and the output dimension is [2 x units, steps, mb] or [2 x units, mb].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1197-L1260">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_hidden_states" href="#NNHelferlein.get_hidden_states"><code>NNHelferlein.get_hidden_states</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_hidden_states(l::&lt;RNN_Type&gt;; flatten=true)</code></pre><p>Return the hidden states of one or more layers of an RNN. <code>&lt;RNN_Type&gt;</code> is one of <code>NNHelferlein.Recurrent</code>, <code>Knet.RNN</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>flatten=true</code>: if the states tensor is 3d with a 3rd dim &gt; 1, the        array is transformed to [units, mb, 1] to represent all current states       after the last step.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1438-L1448">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_cell_states" href="#NNHelferlein.get_cell_states"><code>NNHelferlein.get_cell_states</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_cell_states(l::&lt;RNN_Type&gt;; unbox=true, flatten=true)</code></pre><p>Return the cell states of one or more layers of an RNN only if it is a LSTM (Long short-term memory).</p><p><strong>Arguments:</strong></p><ul><li><code>unbox=true</code>: By default, c is unboxed when called in <code>@diff</code> context (while AutoGrad        is recording) to avoid unwanted dependencies of the computation graph       s2s.attn(reset=true)       (backprop should run via the hidden states, not the cell states).</li><li><code>flatten=true</code>: if the states tensor is 3d with a 3rd dim &gt; 1, the        array is transformed to [units, mb, 1] to represent all current states       after the last step.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1468-L1482">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_hidden_states!" href="#NNHelferlein.set_hidden_states!"><code>NNHelferlein.set_hidden_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function set_hidden_states!(l::&lt;RNN_Type&gt;, h)</code></pre><p>Set the hidden states of one or more layers of an RNN to <code>h</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1505-L1510">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_cell_states!" href="#NNHelferlein.set_cell_states!"><code>NNHelferlein.set_cell_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function set_cell_states!(l::&lt;RNN_Type&gt;, c)</code></pre><p>Set the cell states of one or more layers of an RNN to <code>c</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1519-L1524">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.reset_hidden_states!" href="#NNHelferlein.reset_hidden_states!"><code>NNHelferlein.reset_hidden_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function reset_hidden_states!(l::&lt;RNN_Type&gt;)</code></pre><p>Reset the hidden states of one or more layers of an RNN to 0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1535-L1540">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.reset_cell_states!" href="#NNHelferlein.reset_cell_states!"><code>NNHelferlein.reset_cell_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function reset_cell_states!(l::&lt;RNN_Type&gt;)</code></pre><p>Reset the cell states of one or more layers of an RNN to 0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1549-L1554">source</a></section></article><h2 id="Transformers"><a class="docs-heading-anchor" href="#Transformers">Transformers</a><a id="Transformers-1"></a><a class="docs-heading-anchor-permalink" href="#Transformers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFEncoder" href="#NNHelferlein.TFEncoder"><code>NNHelferlein.TFEncoder</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFEncoder</code></pre><p>A Bert-like encoder to be used as part of a tranformer.  The encoder is build as a stack of <code>TFEncoderLayer</code>s  which is entered after embedding, positional encoding and generation of a padding mask.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFEncoder(n_layers, depth, n_heads; drop_rate=0.1)</code></pre><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(e::TFEncoder)(x)</code></pre><p>The encoder is called with a matrix of embedded tokens of size <code>[depth, seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len, n_minibatch]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L300-L319">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFEncoderLayer" href="#NNHelferlein.TFEncoderLayer"><code>NNHelferlein.TFEncoderLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFEncoderLayer</code></pre><p>A Bert-like encoder layer to be used as part of a Bert-like transformer. The layer consists of a multi-head attention sub-layer followed by a feed-forward network of size depth -&gt; 4*depth -&gt; depth.  Both parts have separate residual connections and layer normalisation.</p><p>The design follows the original paper &quot;Attention is all you need&quot;  by Vaswani, 2017.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFEncoderLayer(depth, n_heads, drop)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the multi-head attention</li><li><code>drop_rate</code>: dropout rate</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(el::TFEncoderLayer)(x; mask=nothing)</code></pre><p>Objects of type <code>TFEncoderLayer</code> are callable and expect a  3-dimensional array of size [embedding<em>depth, seq</em>len, minibatch<em>size]  as input.  The optional <code>mask</code> must be of size [seq</em>len, minibatch_size] and mark masked positions with 1.0.</p><p>It returns a tensor of the same size as the input and the self-attention factors of size [seq<em>len, seq</em>len, minibatch_size].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L236-L268">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFDecoder" href="#NNHelferlein.TFDecoder"><code>NNHelferlein.TFDecoder</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFDecoder</code></pre><p>A Bert-like decoder to be used as part of a tranformer.  The decoder is build as a stack of <code>TFDecoderLayer</code>s  which is entered after embedding, positional encoding and generation of a padding mask and a peek-ahead mask.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFDecoder(n_layers, depth, n_heads, vocab_size; 
-          pad_id=NNHelferlein.TOKEN_PAD, drop_rate=0.1)</code></pre><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(e::TFdecoder)(x)</code></pre><p>The decoder is called with a matrix of token ids of size <code>[seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len, n_minibatch]</code> and the attention factors.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L438-L458">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFDecoderLayer" href="#NNHelferlein.TFDecoderLayer"><code>NNHelferlein.TFDecoderLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFDecoderLayer</code></pre><p>A Bert-like decoder layer to be used as part of a Bert-like transformer. The layer consists of a multi-head self-attention sub-layer, a multi-head attention sub-layer followed by a feed-forward network of size depth -&gt; 4*depth -&gt; depth.  All three parts have separate residual connections and layer normalisation.</p><p>The design follows the original paper &quot;Attention is all you need&quot;  by Vaswani, 2017.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFDecoderLayer(depth, n_heads, drop)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the multi-head attention</li><li><code>drop</code>: dropout rate</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(el::TFDecoderLayer)(x, h_encoder; enc_m_pad=nothing, m_combi=nothing)</code></pre><p>Objects of type <code>TFDecoderLayer</code> are callable and expect a  minibatch of embedded sequences as input.</p><ul><li><code>x</code>: 3-dimensional array of size [embedding<em>depth, seq</em>len, minibatch_size] </li><li><code>h_encoder</code>: output of the encoder stack</li><li><code>enc_m_pad</code>: optional padding mask for the encoder output</li><li><code>m_combi</code>: optional mask for the decoder self-attention            combining padding and peek-ahead mask.</li></ul><p>It returns a tensor of the same size as the input, the self-attention factors and the decoder-encoder attention factors.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L355-L391">source</a></section></article><p>These layers are used by the  <a href="#NNHelferlein.Transformer"><code>Transformer</code></a> and <a href="#NNHelferlein.TokenTransformer"><code>TokenTransformer</code></a> types to build Bert-like transformer networks.</p><h2 id="Others"><a class="docs-heading-anchor" href="#Others">Others</a><a id="Others-1"></a><a class="docs-heading-anchor-permalink" href="#Others" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Flat" href="#NNHelferlein.Flat"><code>NNHelferlein.Flat</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Flat &lt;: AbstractLayer</code></pre><p>Default flatten layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Flat()</code>: with no options.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L639-L646">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.flatten" href="#NNHelferlein.flatten"><code>NNHelferlein.flatten</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">flatten(x)</code></pre><p>Flatten a tensor to a matrix, preserving the last dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L652-L656">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PyFlat" href="#NNHelferlein.PyFlat"><code>NNHelferlein.PyFlat</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PyFlat &lt;: AbstractLayer</code></pre><p>Flatten layer with optional Python-stype flattening (row-major). This layer can be used if pre-trained weight matrices from tensorflow are applied after the flatten layer.</p><p><strong>Constructors:</strong></p><ul><li><code>PyFlat(; python=true)</code>: if true, row-major flatten is performed.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L672-L681">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.FeatureSelection" href="#NNHelferlein.FeatureSelection"><code>NNHelferlein.FeatureSelection</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct FeatureSelection  &lt;: AbstractLayer</code></pre><p>Simple feature selection layer that maps input to output with one-by-one connections; i.e. a layer of size 128 has 128 weights (plus optional biases).</p><p>Biases and activation functions are disabled by default.</p><p><strong>Constructors:</strong></p><ul><li><code>FeatureSelection(i; bias=false, actf=identity)</code>: with the same           input- and output-size <code>i</code>, whre <code>i</code> is an integer           or a Tuple of the input dimensions.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L124-L138">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Activation" href="#NNHelferlein.Activation"><code>NNHelferlein.Activation</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Activation &lt;: AbstractLayer</code></pre><p>Simple activation layer with the desired activation function as argument.</p><p><strong>Constructors:</strong></p><ul><li><code>Activation(actf)</code></li><li><code>Relu()</code></li><li><code>Sigm()</code></li><li><code>Swish()</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L833-L843">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Softmax" href="#NNHelferlein.Softmax"><code>NNHelferlein.Softmax</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Softmax &lt;: AbstractLayer</code></pre><p>Simple softmax layer to compute softmax probabilities.</p><p><strong>Constructors:</strong></p><ul><li><code>Softmax()</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L782-L789">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Logistic" href="#NNHelferlein.Logistic"><code>NNHelferlein.Logistic</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Logistic &lt;: AbstractLayer</code></pre><p>Logistic (sigmoid) layer activation with additional Temperature parameter to control the slope of the curve. Low temperatures (such as T=0.001) result in a step-like activation  function, whereas high temperatures (such as T=10) makes the activation almoset linear.</p><p><strong>Constructors:</strong></p><ul><li><code>Logistic(; T=1.0)</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L802-L814">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Dropout" href="#NNHelferlein.Dropout"><code>NNHelferlein.Dropout</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Dropout &lt;: AbstractLayer</code></pre><p>Dropout layer. Implemented with help of Knet&#39;s dropout() function that evaluates AutoGrad.recording() to detect if in training or in prediction. Dropouts are applied only if prediction.</p><p><strong>Constructors:</strong></p><ul><li><code>Dropout(p)</code> with the dropout rate <em>p</em>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L863-L873">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.BatchNorm" href="#NNHelferlein.BatchNorm"><code>NNHelferlein.BatchNorm</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct BatchNorm &lt;: AbstractLayer</code></pre><p>Batchnormalisation layer. Implemented with help of Knet&#39;s batchnorm() function that evaluates AutoGrad.recording() to detect if in training or in prediction. In training the moments are updated to record the running averages; in prediction the moments are applied, but not modified.</p><p>In addition, optional trainable factor <code>a</code> and bias <code>b</code> are applied:</p><p class="math-container">\[y = a \cdot \frac{(x - \mu)}{(\sigma + \epsilon)} + b\]</p><p><strong>Constructors:</strong></p><ul><li><code>BatchNorm(; scale=true, channels=0)</code> will initialise       the moments with <code>Knet.bnmoments()</code> and       trainable parameters <code>β</code> and <code>γ</code> only if       <code>scale==true</code> (in this case, the number of channels must       be defined - for CNNs this is the number of feature maps).</li></ul><p><strong>Constructors to read parameters from Tensorflow/Keras HDF-files:</strong></p><ul><li><p><code>BatchNorm(h5::HDF5.File, β_path, γ_path, μ_path, var_path;                       scale=false, trainable=true, momentum=0.1, ε=1e-5, dims=4)</code>:       Import parameters from HDF file <code>h5</code> with <code>β_path</code>, <code>γ_path</code>,        <code>μ_path</code> and <code>var_path</code> specifying       the full path to β, γ, μ and variance respectively.</p></li><li><p><code>BatchNorm(h5::HDF5.File, group::String; scale=false, trainable=true, momentum=0.1,                        ε=1e-5, dims=4, tf=true)</code>:       Import parameters from HDF file <code>h5</code> with parameters in the group       <code>group</code>. Paths to β, γ, μ and variance are constructed        if <code>tf=true</code> as <code>model_weights/group/group/beta:0</code>, etc.       If <code>tf=false</code> group must define the full group path:       <code>group/beta:0</code>.       <code>dims</code> specifies the number of dimensions of the input and may be       2, 4 or 5. The default (4) applies to standard CNNs        (imgsize, imgsize, channels, batchsize).</p></li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>scale=true</code>: if <code>true</code>, the trainable scale parameters β and γ       are used. </li><li><code>trainable=true</code>. only used with hdf5-import. If <code>true</code> the        parameters β and γ are initialised as <code>Param</code> and trained in training.</li></ul><p><strong>Details:</strong></p><p>2d, 4d and 5d inputs are supported. Mean and variance are computed over dimensions (2), (1,2,4) and (1,2,3,5) for 2d, 4d and 5d arrays, respectively.</p><p>If <code>scale=true</code> and <code>channels != 0</code>, trainable parameters <code>β</code> and <code>γ</code> will be initialised for each channel.</p><p>If <code>scale=true</code> and <code>channels == 0</code> (i.e. <code>BatchNorm(scale=true)</code>), the params <code>β</code> and <code>γ</code> are not initialised by the constructor. Instead, the number of channels is inferred when the first minibatch is normalised as: 2d: <code>size(x)[1]</code> 4d: <code>size(x)[3]</code> 5d: <code>size(x)[4]</code> or <code>0</code> otherwise.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L890-L952">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.LayerNorm" href="#NNHelferlein.LayerNorm"><code>NNHelferlein.LayerNorm</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct LayerNorm  &lt;: AbstractLayer</code></pre><p>Simple layer normalisation (inspired by TFs LayerNormalization). Implementation is from Deniz Yuret&#39;s answer to feature request 429 (https://github.com/denizyuret/Knet.jl/issues/492).</p><p>The layer performs a normalisation within each sample, <em>not</em> batchwise. Normalisation is modified by two trainable parameters <code>a</code> and <code>b</code> (variance and mean) added to every value of the sample vector.</p><p><strong>Constructors:</strong></p><ul><li><code>LayertNorm(depth; eps=1e-6)</code>:  <code>depth</code> is the number       of activations for one sample of the layer.</li></ul><p><strong>Signatures:</strong></p><ul><li><code>function (l::LayerNorm)(x; dims=1)</code>: normalise <code>x</code> along the given dimensions.       The size of the specified dimension must fit with the initialised <code>depth</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1070-L1089">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GaussianNoise" href="#NNHelferlein.GaussianNoise"><code>NNHelferlein.GaussianNoise</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct GaussianNoise</code></pre><p>Gaussian noise layer. Multiplies Gaussian-distributed random values with  <em>mean = 1.0</em> and <em>sigma = σ</em> to each training value.</p><p><strong>Constructors:</strong></p><ul><li><code>aussianNoise(σ; train_only=true)</code></li></ul><p><strong>Arguments:</strong></p><ul><li><code>σ</code>: Standard deviation for the distribution of noise</li><li><code>train_only=true</code>: if <code>true</code>, noise will only be applied in training.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1116-L1128">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GlobalAveragePooling" href="#NNHelferlein.GlobalAveragePooling"><code>NNHelferlein.GlobalAveragePooling</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct GlobalAveragePooling  &lt;: AbstractLayer</code></pre><p>Layer to return a matrix with the mean values of all but the last two dimensions for each sample of the minibatch. If the input is a stack of feature maps from a convolutional layer, the result can be seen as the mean value of each feature map. Number of <em>output</em>-rows equals number of <em>input</em>-featuremaps;  number of <em>output</em>-columns equals size of minibatch. </p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">GlobalAveragePooling()</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1154-L1167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.global_average_pooling" href="#NNHelferlein.global_average_pooling"><code>NNHelferlein.global_average_pooling</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">global_average_pooling(x)</code></pre><p>Function to return a matrix with the mean values of all but the last two dimensions for each sample of the minibatch.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/layers.jl#L1181-L1186">source</a></section></article><h2 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttentionMechanism" href="#NNHelferlein.AttentionMechanism"><code>NNHelferlein.AttentionMechanism</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AttentionMechanism</code></pre><p>Attention mechanisms follow the same interface and common signatures.</p><p>If possible, the algorithm allows precomputing of the projections of the context vector generated by the encoder in a encoder-decoder-architecture (i.e. in case of an RNN encoder the accumulated encoder hidden states).</p><p>By default attention scores are scaled according to Vaswani et al., 2017 <em>(Vaswani et al., Attention Is All You Need, CoRR, 2017)</em>.</p><p>All algorithms use soft attention.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Attn*Mechanism*(dec_units, enc_units; scale=true)
+          mask=nothing)</code></pre><p>The layer is called either with a 2-dimensional array of the shape [fan-in, steps]  or a 3-dimensional array of [fan-in, steps, batchsize].</p><p><strong>Arguments:</strong></p><ul><li><code>c=0</code>, <code>h=0</code>: inits the hidden and cell state.   If <code>nothing</code>,  states <code>h</code> or <code>c</code> keep their values.    If <code>c=0</code> or <code>h=0</code>, the states are resetted to <code>0</code>;   otherwise an array of states of the correct dimensions can be supplied    to be used as initial states.</li><li><code>return_all=false</code>: if <code>true</code> an array with all hidden states of all steps    is returned (size is [units, time-steps, minibatch]).   Otherwise only the hidden states of the last step are returned   ([units, minibatch]).</li><li><code>mask</code>: optional mask for the input sequence minibatch of shape    [steps, minibatch]. Values in the mask must be 1.0 for masked positions   or 0.0 otherwise and of type <code>Float32</code> or <code>CuArray{Float32}</code> for GPU context.    Appropriate masks can be generated with the NNHelferlein function    <code>mk_padding_mask()</code>.</li></ul><p>Bidirectional layers can be constructed by specifying <code>bidirectional=true</code>, if the unit-type supports it (Knet.RNN does).  Please be aware that the actual number of units is 2 x n_units for  bidirectional layers and the output dimension is [2 x units, steps, mb] or [2 x units, mb].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1197-L1260">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_hidden_states" href="#NNHelferlein.get_hidden_states"><code>NNHelferlein.get_hidden_states</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_hidden_states(l::&lt;RNN_Type&gt;; flatten=true)</code></pre><p>Return the hidden states of one or more layers of an RNN. <code>&lt;RNN_Type&gt;</code> is one of <code>NNHelferlein.Recurrent</code>, <code>Knet.RNN</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>flatten=true</code>: if the states tensor is 3d with a 3rd dim &gt; 1, the        array is transformed to [units, mb, 1] to represent all current states       after the last step.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1438-L1448">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_cell_states" href="#NNHelferlein.get_cell_states"><code>NNHelferlein.get_cell_states</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_cell_states(l::&lt;RNN_Type&gt;; unbox=true, flatten=true)</code></pre><p>Return the cell states of one or more layers of an RNN only if it is a LSTM (Long short-term memory).</p><p><strong>Arguments:</strong></p><ul><li><code>unbox=true</code>: By default, c is unboxed when called in <code>@diff</code> context (while AutoGrad        is recording) to avoid unwanted dependencies of the computation graph       s2s.attn(reset=true)       (backprop should run via the hidden states, not the cell states).</li><li><code>flatten=true</code>: if the states tensor is 3d with a 3rd dim &gt; 1, the        array is transformed to [units, mb, 1] to represent all current states       after the last step.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1468-L1482">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_hidden_states!" href="#NNHelferlein.set_hidden_states!"><code>NNHelferlein.set_hidden_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function set_hidden_states!(l::&lt;RNN_Type&gt;, h)</code></pre><p>Set the hidden states of one or more layers of an RNN to <code>h</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1505-L1510">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.set_cell_states!" href="#NNHelferlein.set_cell_states!"><code>NNHelferlein.set_cell_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function set_cell_states!(l::&lt;RNN_Type&gt;, c)</code></pre><p>Set the cell states of one or more layers of an RNN to <code>c</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1519-L1524">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.reset_hidden_states!" href="#NNHelferlein.reset_hidden_states!"><code>NNHelferlein.reset_hidden_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function reset_hidden_states!(l::&lt;RNN_Type&gt;)</code></pre><p>Reset the hidden states of one or more layers of an RNN to 0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1535-L1540">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.reset_cell_states!" href="#NNHelferlein.reset_cell_states!"><code>NNHelferlein.reset_cell_states!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function reset_cell_states!(l::&lt;RNN_Type&gt;)</code></pre><p>Reset the cell states of one or more layers of an RNN to 0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1549-L1554">source</a></section></article><h2 id="Transformers"><a class="docs-heading-anchor" href="#Transformers">Transformers</a><a id="Transformers-1"></a><a class="docs-heading-anchor-permalink" href="#Transformers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFEncoder" href="#NNHelferlein.TFEncoder"><code>NNHelferlein.TFEncoder</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFEncoder</code></pre><p>A Bert-like encoder to be used as part of a tranformer.  The encoder is build as a stack of <code>TFEncoderLayer</code>s  which is entered after embedding, positional encoding and generation of a padding mask.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFEncoder(n_layers, depth, n_heads; drop_rate=0.1)</code></pre><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(e::TFEncoder)(x)</code></pre><p>The encoder is called with a matrix of embedded tokens of size <code>[depth, seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len, n_minibatch]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L300-L319">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFEncoderLayer" href="#NNHelferlein.TFEncoderLayer"><code>NNHelferlein.TFEncoderLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFEncoderLayer</code></pre><p>A Bert-like encoder layer to be used as part of a Bert-like transformer. The layer consists of a multi-head attention sub-layer followed by a feed-forward network of size depth -&gt; 4*depth -&gt; depth.  Both parts have separate residual connections and layer normalisation.</p><p>The design follows the original paper &quot;Attention is all you need&quot;  by Vaswani, 2017.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFEncoderLayer(depth, n_heads, drop)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the multi-head attention</li><li><code>drop_rate</code>: dropout rate</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(el::TFEncoderLayer)(x; mask=nothing)</code></pre><p>Objects of type <code>TFEncoderLayer</code> are callable and expect a  3-dimensional array of size [embedding<em>depth, seq</em>len, minibatch<em>size]  as input.  The optional <code>mask</code> must be of size [seq</em>len, minibatch_size] and mark masked positions with 1.0.</p><p>It returns a tensor of the same size as the input and the self-attention factors of size [seq<em>len, seq</em>len, minibatch_size].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L236-L268">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFDecoder" href="#NNHelferlein.TFDecoder"><code>NNHelferlein.TFDecoder</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFDecoder</code></pre><p>A Bert-like decoder to be used as part of a tranformer.  The decoder is build as a stack of <code>TFDecoderLayer</code>s  which is entered after embedding, positional encoding and generation of a padding mask and a peek-ahead mask.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFDecoder(n_layers, depth, n_heads, vocab_size; 
+          pad_id=NNHelferlein.TOKEN_PAD, drop_rate=0.1)</code></pre><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(e::TFdecoder)(x)</code></pre><p>The decoder is called with a matrix of token ids of size <code>[seq_len, n_minibatch]</code> and returns a tensor of size <code>[depth, seq_len, n_minibatch]</code> and the attention factors.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L438-L458">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.TFDecoderLayer" href="#NNHelferlein.TFDecoderLayer"><code>NNHelferlein.TFDecoderLayer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">TFDecoderLayer</code></pre><p>A Bert-like decoder layer to be used as part of a Bert-like transformer. The layer consists of a multi-head self-attention sub-layer, a multi-head attention sub-layer followed by a feed-forward network of size depth -&gt; 4*depth -&gt; depth.  All three parts have separate residual connections and layer normalisation.</p><p>The design follows the original paper &quot;Attention is all you need&quot;  by Vaswani, 2017.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">TFDecoderLayer(depth, n_heads, drop)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the multi-head attention</li><li><code>drop</code>: dropout rate</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">(el::TFDecoderLayer)(x, h_encoder; enc_m_pad=nothing, m_combi=nothing)</code></pre><p>Objects of type <code>TFDecoderLayer</code> are callable and expect a  minibatch of embedded sequences as input.</p><ul><li><code>x</code>: 3-dimensional array of size [embedding<em>depth, seq</em>len, minibatch_size] </li><li><code>h_encoder</code>: output of the encoder stack</li><li><code>enc_m_pad</code>: optional padding mask for the encoder output</li><li><code>m_combi</code>: optional mask for the decoder self-attention            combining padding and peek-ahead mask.</li></ul><p>It returns a tensor of the same size as the input, the self-attention factors and the decoder-encoder attention factors.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L355-L391">source</a></section></article><p>These layers are used by the  <a href="#NNHelferlein.Transformer"><code>Transformer</code></a> and <a href="#NNHelferlein.TokenTransformer"><code>TokenTransformer</code></a> types to build Bert-like transformer networks.</p><h2 id="Others"><a class="docs-heading-anchor" href="#Others">Others</a><a id="Others-1"></a><a class="docs-heading-anchor-permalink" href="#Others" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Flat" href="#NNHelferlein.Flat"><code>NNHelferlein.Flat</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Flat &lt;: AbstractLayer</code></pre><p>Default flatten layer.</p><p><strong>Constructors:</strong></p><ul><li><code>Flat()</code>: with no options.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L639-L646">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.flatten" href="#NNHelferlein.flatten"><code>NNHelferlein.flatten</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">flatten(x)</code></pre><p>Flatten a tensor to a matrix, preserving the last dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L652-L656">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PyFlat" href="#NNHelferlein.PyFlat"><code>NNHelferlein.PyFlat</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PyFlat &lt;: AbstractLayer</code></pre><p>Flatten layer with optional Python-stype flattening (row-major). This layer can be used if pre-trained weight matrices from tensorflow are applied after the flatten layer.</p><p><strong>Constructors:</strong></p><ul><li><code>PyFlat(; python=true)</code>: if true, row-major flatten is performed.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L672-L681">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.FeatureSelection" href="#NNHelferlein.FeatureSelection"><code>NNHelferlein.FeatureSelection</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct FeatureSelection  &lt;: AbstractLayer</code></pre><p>Simple feature selection layer that maps input to output with one-by-one connections; i.e. a layer of size 128 has 128 weights (plus optional biases).</p><p>Biases and activation functions are disabled by default.</p><p><strong>Constructors:</strong></p><ul><li><code>FeatureSelection(i; bias=false, actf=identity)</code>: with the same           input- and output-size <code>i</code>, whre <code>i</code> is an integer           or a Tuple of the input dimensions.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L124-L138">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Activation" href="#NNHelferlein.Activation"><code>NNHelferlein.Activation</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Activation &lt;: AbstractLayer</code></pre><p>Simple activation layer with the desired activation function as argument.</p><p><strong>Constructors:</strong></p><ul><li><code>Activation(actf)</code></li><li><code>Relu()</code></li><li><code>Sigm()</code></li><li><code>Swish()</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L833-L843">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Softmax" href="#NNHelferlein.Softmax"><code>NNHelferlein.Softmax</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Softmax &lt;: AbstractLayer</code></pre><p>Simple softmax layer to compute softmax probabilities.</p><p><strong>Constructors:</strong></p><ul><li><code>Softmax()</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L782-L789">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Logistic" href="#NNHelferlein.Logistic"><code>NNHelferlein.Logistic</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Logistic &lt;: AbstractLayer</code></pre><p>Logistic (sigmoid) layer activation with additional Temperature parameter to control the slope of the curve. Low temperatures (such as T=0.001) result in a step-like activation  function, whereas high temperatures (such as T=10) makes the activation almoset linear.</p><p><strong>Constructors:</strong></p><ul><li><code>Logistic(; T=1.0)</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L802-L814">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.Dropout" href="#NNHelferlein.Dropout"><code>NNHelferlein.Dropout</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct Dropout &lt;: AbstractLayer</code></pre><p>Dropout layer. Implemented with help of Knet&#39;s dropout() function that evaluates AutoGrad.recording() to detect if in training or in prediction. Dropouts are applied only if prediction.</p><p><strong>Constructors:</strong></p><ul><li><code>Dropout(p)</code> with the dropout rate <em>p</em>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L863-L873">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.BatchNorm" href="#NNHelferlein.BatchNorm"><code>NNHelferlein.BatchNorm</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct BatchNorm &lt;: AbstractLayer</code></pre><p>Batchnormalisation layer. Implemented with help of Knet&#39;s batchnorm() function that evaluates AutoGrad.recording() to detect if in training or in prediction. In training the moments are updated to record the running averages; in prediction the moments are applied, but not modified.</p><p>In addition, optional trainable factor <code>a</code> and bias <code>b</code> are applied:</p><p class="math-container">\[y = a \cdot \frac{(x - \mu)}{(\sigma + \epsilon)} + b\]</p><p><strong>Constructors:</strong></p><ul><li><code>BatchNorm(; scale=true, channels=0)</code> will initialise       the moments with <code>Knet.bnmoments()</code> and       trainable parameters <code>β</code> and <code>γ</code> only if       <code>scale==true</code> (in this case, the number of channels must       be defined - for CNNs this is the number of feature maps).</li></ul><p><strong>Constructors to read parameters from Tensorflow/Keras HDF-files:</strong></p><ul><li><p><code>BatchNorm(h5::HDF5.File, β_path, γ_path, μ_path, var_path;                       scale=false, trainable=true, momentum=0.1, ε=1e-5, dims=4)</code>:       Import parameters from HDF file <code>h5</code> with <code>β_path</code>, <code>γ_path</code>,        <code>μ_path</code> and <code>var_path</code> specifying       the full path to β, γ, μ and variance respectively.</p></li><li><p><code>BatchNorm(h5::HDF5.File, group::String; scale=false, trainable=true, momentum=0.1,                        ε=1e-5, dims=4, tf=true)</code>:       Import parameters from HDF file <code>h5</code> with parameters in the group       <code>group</code>. Paths to β, γ, μ and variance are constructed        if <code>tf=true</code> as <code>model_weights/group/group/beta:0</code>, etc.       If <code>tf=false</code> group must define the full group path:       <code>group/beta:0</code>.       <code>dims</code> specifies the number of dimensions of the input and may be       2, 4 or 5. The default (4) applies to standard CNNs        (imgsize, imgsize, channels, batchsize).</p></li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>scale=true</code>: if <code>true</code>, the trainable scale parameters β and γ       are used. </li><li><code>trainable=true</code>. only used with hdf5-import. If <code>true</code> the        parameters β and γ are initialised as <code>Param</code> and trained in training.</li></ul><p><strong>Details:</strong></p><p>2d, 4d and 5d inputs are supported. Mean and variance are computed over dimensions (2), (1,2,4) and (1,2,3,5) for 2d, 4d and 5d arrays, respectively.</p><p>If <code>scale=true</code> and <code>channels != 0</code>, trainable parameters <code>β</code> and <code>γ</code> will be initialised for each channel.</p><p>If <code>scale=true</code> and <code>channels == 0</code> (i.e. <code>BatchNorm(scale=true)</code>), the params <code>β</code> and <code>γ</code> are not initialised by the constructor. Instead, the number of channels is inferred when the first minibatch is normalised as: 2d: <code>size(x)[1]</code> 4d: <code>size(x)[3]</code> 5d: <code>size(x)[4]</code> or <code>0</code> otherwise.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L890-L952">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.LayerNorm" href="#NNHelferlein.LayerNorm"><code>NNHelferlein.LayerNorm</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct LayerNorm  &lt;: AbstractLayer</code></pre><p>Simple layer normalisation (inspired by TFs LayerNormalization). Implementation is from Deniz Yuret&#39;s answer to feature request 429 (https://github.com/denizyuret/Knet.jl/issues/492).</p><p>The layer performs a normalisation within each sample, <em>not</em> batchwise. Normalisation is modified by two trainable parameters <code>a</code> and <code>b</code> (variance and mean) added to every value of the sample vector.</p><p><strong>Constructors:</strong></p><ul><li><code>LayertNorm(depth; eps=1e-6)</code>:  <code>depth</code> is the number       of activations for one sample of the layer.</li></ul><p><strong>Signatures:</strong></p><ul><li><code>function (l::LayerNorm)(x; dims=1)</code>: normalise <code>x</code> along the given dimensions.       The size of the specified dimension must fit with the initialised <code>depth</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1070-L1089">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GaussianNoise" href="#NNHelferlein.GaussianNoise"><code>NNHelferlein.GaussianNoise</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct GaussianNoise</code></pre><p>Gaussian noise layer. Multiplies Gaussian-distributed random values with  <em>mean = 1.0</em> and <em>sigma = σ</em> to each training value.</p><p><strong>Constructors:</strong></p><ul><li><code>aussianNoise(σ; train_only=true)</code></li></ul><p><strong>Arguments:</strong></p><ul><li><code>σ</code>: Standard deviation for the distribution of noise</li><li><code>train_only=true</code>: if <code>true</code>, noise will only be applied in training.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1116-L1128">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GlobalAveragePooling" href="#NNHelferlein.GlobalAveragePooling"><code>NNHelferlein.GlobalAveragePooling</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct GlobalAveragePooling  &lt;: AbstractLayer</code></pre><p>Layer to return a matrix with the mean values of all but the last two dimensions for each sample of the minibatch. If the input is a stack of feature maps from a convolutional layer, the result can be seen as the mean value of each feature map. Number of <em>output</em>-rows equals number of <em>input</em>-featuremaps;  number of <em>output</em>-columns equals size of minibatch. </p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">GlobalAveragePooling()</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1154-L1167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.global_average_pooling" href="#NNHelferlein.global_average_pooling"><code>NNHelferlein.global_average_pooling</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">global_average_pooling(x)</code></pre><p>Function to return a matrix with the mean values of all but the last two dimensions for each sample of the minibatch.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/layers.jl#L1181-L1186">source</a></section></article><h2 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttentionMechanism" href="#NNHelferlein.AttentionMechanism"><code>NNHelferlein.AttentionMechanism</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type AttentionMechanism</code></pre><p>Attention mechanisms follow the same interface and common signatures.</p><p>If possible, the algorithm allows precomputing of the projections of the context vector generated by the encoder in a encoder-decoder-architecture (i.e. in case of an RNN encoder the accumulated encoder hidden states).</p><p>By default attention scores are scaled according to Vaswani et al., 2017 <em>(Vaswani et al., Attention Is All You Need, CoRR, 2017)</em>.</p><p>All algorithms use soft attention.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">Attn*Mechanism*(dec_units, enc_units; scale=true)
 Attn*Mechanism*(units; scale=true)</code></pre><p>The one-argument version can be used, if encoder dimensions and decoder dimensions are the same.</p><p><strong>Common Signatures:</strong></p><pre><code class="nohighlight hljs">function (attn::AttentionMechanism)(h_t, h_enc; reset=false, mask=nothing)
-function (attn::AttentionMechanism)(; reset=false)</code></pre><p><strong>Arguments:</strong></p><ul><li><code>h_t</code>:    decoder hidden state. If <span>$h_t$</span> is a vector, its length           equals the number of decoder units. If it is a matrix,           <span>$h_t$</span> includes the states for a minibatch of samples and has           the size [units, mb].</li><li><code>h_enc</code>:  encoder hidden states, 2d or 3d. If <span>$h_{enc}$</span> is a           matrix [units, steps] with the hidden states of all encoder steps.           If 3d: [units, mb, steps] encoder states for all minibatches.</li><li><code>mask</code>:   optional mask (e.g. padding mask) for masking input steps           of dimensions [mb, steps]. Attentions factors for masked steps            will be set to 0.0.</li><li><code>reset=false</code>: If the keyword argument is set to <code>true</code>, projections of           the encoder states are computed. By default projections are           stored in the object and reused until the object is resetted.           For attention mechanisms that do not allow precomputation           the argument is ignored.</li></ul><p>The short form <code>(::AttentionMechanism)(reset=true)</code> can be used to reset the precomputed projections.</p><p><strong>Return values</strong></p><p>All functions return <code>c</code> and <code>α</code> where <code>α</code> is a matrix of size [mb,steps] with the attention factors for each step and minibatch. <code>c</code> is a matrix of size [units, mb] with the context vector for each sample of the minibatch, calculated as the α-weighted sum of all encoder hidden states <span>$h_{enc}$</span> for each minibatch.</p><p><strong>Attention Mechanisms:</strong></p><p>All attention mechanisms calculate attention factors α from scores derived from projections of the encoder hidden states:</p><p class="math-container">\[\alpha = \mathrm{softmax}(\mathrm{score}(h_{enc},h_{t}) \cdot 1/\sqrt{n}))\]</p><p>Attention mechanisms implemented:</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L3-L67">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnBahdanau" href="#NNHelferlein.AttnBahdanau"><code>NNHelferlein.AttnBahdanau</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnBahdanau &lt;: AttentionMechanism</code></pre><p>Bahdanau-style (additive, concat) attention mechanism according to the paper:</p><p><em>D. Bahdanau, KH. Co, Y. Bengio, Neural Machine Translation by jointlylearning to align and translate, ICLR, 2015</em>.</p><p class="math-container">\[\mathrm{score}(h_{t},h_{enc}) = v_{a}^{\top}\cdot\tanh(W[h_{t},h_{enc}])\]</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnBahdanau(dec_units, enc_units; scale=true)
-AttnBahdanau(units; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L85-L102">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnLuong" href="#NNHelferlein.AttnLuong"><code>NNHelferlein.AttnLuong</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnLuong &lt;: AttentionMechanism</code></pre><p>Luong-style (multiplicative) attention mechanism according to the paper (referred as <em>General</em>-type attention): <em>M.-T. Luong, H. Pham, C.D. Manning, Effective Approaches to Attention-based Neural Machine Translation, CoRR, 2015</em>.</p><p class="math-container">\[\mathrm{score}(h_{t},h_{enc}) = h_{t}^{\top} W h_{enc}\]</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnLuong(dec_units, enc_units; scale=true)
-AttnLuong(units; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L174-L189">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnDot" href="#NNHelferlein.AttnDot"><code>NNHelferlein.AttnDot</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnDot &lt;: AttentionMechanism</code></pre><p>Dot-product attention (without trainable parameters) according to the Luong, et al. (2015) paper.</p><p><span>$\mathrm{score}(h_{t},h_{enc}) = h_{t}^{\top} h_{enc}$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnDot(; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L232-L242">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnLocation" href="#NNHelferlein.AttnLocation"><code>NNHelferlein.AttnLocation</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnLocation &lt;: AttentionMechanism</code></pre><p>Location-based attention that only depends on the current decoder state <span>$h_t$</span> and not on the encoder states, according to the Luong, et al. (2015) paper.</p><p><span>$\mathrm{score}(h_{t}) = W h_{t}$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnLocation(len, dec_units; scale=true)</code></pre><ul><li><code>len</code>: maximum sequence length of the encoder to be considered       for attention. If the actual length of <span>$h_{enc}$</span> is bigger than the       length of <code>α</code>, attention factors for the remaining states are set to       0.0. If the actual length of h_enc is smaller than <code>α</code>, only the matching       attention factors are applied.</li><li><code>dec_units</code>: number of decoder units.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L275-L293">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnInFeed" href="#NNHelferlein.AttnInFeed"><code>NNHelferlein.AttnInFeed</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnInFeed &lt;: AttentionMechanism</code></pre><p>Input-feeding attention that depends on the current decoder state <span>$h_t$</span> and the next input to the decoder <span>$i_{t+1}$</span>, according to the Luong, et al. (2015) paper.</p><p>Infeed attention provides a semantic attention that depends on the next input token.</p><p><span>$\mathrm{score}(h_{t}, i_{t+1}) = W_h h_{t} + W_i i_{t+1} = W [h_t, i_{t+1}]$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnInFeed(len, dec_units, fan_in; scale=true)</code></pre><ul><li><code>len</code>: maximum sequence length of the encoder to be considered       for attention. If the actual length of <span>$h_{enc}$</span> is bigger than the       length of <code>α</code>, attention factors for the remaining states are set to       0.0. If the actual length of <code>h_enc</code> is smaller than <code>α</code>, only the matching       attention factors are applied.</li><li><code>dec_units</code>: number of decoder units.</li><li><code>fan_in</code>: size of the decoder input.</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">function (attn::AttnInFeed)(h_t, inp, h_enc; mask=nothing)</code></pre><ul><li><code>h_t</code>:    decoder hidden state. If <span>$h_t$</span> is a vector, its length           equals the number of decoder units. If it is a matrix,           <span>$h_t$</span> includes the states for a minibatch of samples and has           the size [units, mb].</li><li><code>inp</code>: next decoder input <span>$i_{t+1}$</span>           (e.g. next embedded token of sequence)</li><li><code>h_enc</code>:  encoder hidden states, 2d or 3d. If <span>$h_{enc}$</span> is a           matrix [units, steps] with the hidden states of all encoder steps.           If 3d: [units, mb, steps] encoder states for all minibatches.</li><li><code>mask</code>:   Optional mask for input states of shape [mb, steps].</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/attn.jl#L348-L385">source</a></section></article><h1 id="Data-providers"><a class="docs-heading-anchor" href="#Data-providers">Data providers</a><a id="Data-providers-1"></a><a class="docs-heading-anchor-permalink" href="#Data-providers" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DataLoader" href="#NNHelferlein.DataLoader"><code>NNHelferlein.DataLoader</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type DataLoader</code></pre><p>Mother type for minibatch iterators.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/types.jl#L48-L52">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.SequenceData" href="#NNHelferlein.SequenceData"><code>NNHelferlein.SequenceData</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct SequenceData &lt;: DataLoader</code></pre><p>Type for a generic minibatch iterator. All NNHelferlein models accept minibatches of type <code>DataLoader</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">SequenceData(x; shuffle=true)</code></pre><ul><li><code>x</code>: List, Array or other iterable object with the minibatches</li><li><code>shuffle</code>: if <code>true</code>, minibatches are shuffled every epoch.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/types.jl#L55-L67">source</a></section></article><h2 id="Iteration-utilities"><a class="docs-heading-anchor" href="#Iteration-utilities">Iteration utilities</a><a id="Iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Iteration-utilities" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PartialIterator" href="#NNHelferlein.PartialIterator"><code>NNHelferlein.PartialIterator</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PartialIterator &lt;: DataLoader</code></pre><p>The <code>PartialIterator</code> wraps any iterator and will only iterate the states specified in the list <code>indices</code>. </p><p><strong>Constuctors</strong></p><pre><code class="nohighlight hljs">PartialIterator(inner, indices; shuffle=true)</code></pre><p>Type of the states must match the states of the wrapped iterator <code>inner</code>. A <code>nothing</code> element may be  given to specify the first iterator element.</p><p>If <code>shuffle==true</code>, the list of indices are shuffled every time the <code>PartialIterator</code> is started.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/iterators.jl#L81-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.split_minibatches" href="#NNHelferlein.split_minibatches"><code>NNHelferlein.split_minibatches</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function split_minibatches(it, at=0.8; shuffle=true)</code></pre><p>Return 2 iterators of type <code>PartialIterator</code> which iterate only parts of the  states of the iterator <code>it</code>.  Be aware that the partial iterators will not contain copies of the data but instead forward the data provided by the iterator <code>it</code>.</p><p>The function can be used to split an iterator of minibatches into train-  and validation iterators, without copying any data. As the <code>PartialIterator</code> objects work with the states of the inner iterator, it is important <em>not</em> to shuffle the inner iterator (in this case the  composition of the partial iterators would change and training and validation data  may be mixed!).</p><p><strong>Arguments:</strong></p><ul><li><code>it</code>: Iterator to be splitted. The list of allowed states is created by       performing a full iteration once.</li><li><code>at</code>: Split point. The first returned iterator will include the given        fraction (default: 80%) of the states.</li><li><code>shuffle</code>: If true, the elements are shuffled at each restart of the iterator.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/iterators.jl#L4-L25">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MBNoiser" href="#NNHelferlein.MBNoiser"><code>NNHelferlein.MBNoiser</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">type MBNoiser</code></pre><p>Iterator to wrap any Knet.Data iterator of minibatches in  order to add random noise.     Each value will be multiplied with a random value form  Gaussian noise with mean=1.0 and sd=σ.</p><p><strong>Construtors:</strong></p><pre><code class="nohighlight hljs">MBNoiser(mbs::Knet.Data, σ)
+function (attn::AttentionMechanism)(; reset=false)</code></pre><p><strong>Arguments:</strong></p><ul><li><code>h_t</code>:    decoder hidden state. If <span>$h_t$</span> is a vector, its length           equals the number of decoder units. If it is a matrix,           <span>$h_t$</span> includes the states for a minibatch of samples and has           the size [units, mb].</li><li><code>h_enc</code>:  encoder hidden states, 2d or 3d. If <span>$h_{enc}$</span> is a           matrix [units, steps] with the hidden states of all encoder steps.           If 3d: [units, mb, steps] encoder states for all minibatches.</li><li><code>mask</code>:   optional mask (e.g. padding mask) for masking input steps           of dimensions [mb, steps]. Attentions factors for masked steps            will be set to 0.0.</li><li><code>reset=false</code>: If the keyword argument is set to <code>true</code>, projections of           the encoder states are computed. By default projections are           stored in the object and reused until the object is resetted.           For attention mechanisms that do not allow precomputation           the argument is ignored.</li></ul><p>The short form <code>(::AttentionMechanism)(reset=true)</code> can be used to reset the precomputed projections.</p><p><strong>Return values</strong></p><p>All functions return <code>c</code> and <code>α</code> where <code>α</code> is a matrix of size [mb,steps] with the attention factors for each step and minibatch. <code>c</code> is a matrix of size [units, mb] with the context vector for each sample of the minibatch, calculated as the α-weighted sum of all encoder hidden states <span>$h_{enc}$</span> for each minibatch.</p><p><strong>Attention Mechanisms:</strong></p><p>All attention mechanisms calculate attention factors α from scores derived from projections of the encoder hidden states:</p><p class="math-container">\[\alpha = \mathrm{softmax}(\mathrm{score}(h_{enc},h_{t}) \cdot 1/\sqrt{n}))\]</p><p>Attention mechanisms implemented:</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L3-L67">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnBahdanau" href="#NNHelferlein.AttnBahdanau"><code>NNHelferlein.AttnBahdanau</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnBahdanau &lt;: AttentionMechanism</code></pre><p>Bahdanau-style (additive, concat) attention mechanism according to the paper:</p><p><em>D. Bahdanau, KH. Co, Y. Bengio, Neural Machine Translation by jointlylearning to align and translate, ICLR, 2015</em>.</p><p class="math-container">\[\mathrm{score}(h_{t},h_{enc}) = v_{a}^{\top}\cdot\tanh(W[h_{t},h_{enc}])\]</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnBahdanau(dec_units, enc_units; scale=true)
+AttnBahdanau(units; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L85-L102">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnLuong" href="#NNHelferlein.AttnLuong"><code>NNHelferlein.AttnLuong</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnLuong &lt;: AttentionMechanism</code></pre><p>Luong-style (multiplicative) attention mechanism according to the paper (referred as <em>General</em>-type attention): <em>M.-T. Luong, H. Pham, C.D. Manning, Effective Approaches to Attention-based Neural Machine Translation, CoRR, 2015</em>.</p><p class="math-container">\[\mathrm{score}(h_{t},h_{enc}) = h_{t}^{\top} W h_{enc}\]</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnLuong(dec_units, enc_units; scale=true)
+AttnLuong(units; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L174-L189">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnDot" href="#NNHelferlein.AttnDot"><code>NNHelferlein.AttnDot</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnDot &lt;: AttentionMechanism</code></pre><p>Dot-product attention (without trainable parameters) according to the Luong, et al. (2015) paper.</p><p><span>$\mathrm{score}(h_{t},h_{enc}) = h_{t}^{\top} h_{enc}$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnDot(; scale=true)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L232-L242">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnLocation" href="#NNHelferlein.AttnLocation"><code>NNHelferlein.AttnLocation</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnLocation &lt;: AttentionMechanism</code></pre><p>Location-based attention that only depends on the current decoder state <span>$h_t$</span> and not on the encoder states, according to the Luong, et al. (2015) paper.</p><p><span>$\mathrm{score}(h_{t}) = W h_{t}$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnLocation(len, dec_units; scale=true)</code></pre><ul><li><code>len</code>: maximum sequence length of the encoder to be considered       for attention. If the actual length of <span>$h_{enc}$</span> is bigger than the       length of <code>α</code>, attention factors for the remaining states are set to       0.0. If the actual length of h_enc is smaller than <code>α</code>, only the matching       attention factors are applied.</li><li><code>dec_units</code>: number of decoder units.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L275-L293">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.AttnInFeed" href="#NNHelferlein.AttnInFeed"><code>NNHelferlein.AttnInFeed</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct AttnInFeed &lt;: AttentionMechanism</code></pre><p>Input-feeding attention that depends on the current decoder state <span>$h_t$</span> and the next input to the decoder <span>$i_{t+1}$</span>, according to the Luong, et al. (2015) paper.</p><p>Infeed attention provides a semantic attention that depends on the next input token.</p><p><span>$\mathrm{score}(h_{t}, i_{t+1}) = W_h h_{t} + W_i i_{t+1} = W [h_t, i_{t+1}]$</span></p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">AttnInFeed(len, dec_units, fan_in; scale=true)</code></pre><ul><li><code>len</code>: maximum sequence length of the encoder to be considered       for attention. If the actual length of <span>$h_{enc}$</span> is bigger than the       length of <code>α</code>, attention factors for the remaining states are set to       0.0. If the actual length of <code>h_enc</code> is smaller than <code>α</code>, only the matching       attention factors are applied.</li><li><code>dec_units</code>: number of decoder units.</li><li><code>fan_in</code>: size of the decoder input.</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">function (attn::AttnInFeed)(h_t, inp, h_enc; mask=nothing)</code></pre><ul><li><code>h_t</code>:    decoder hidden state. If <span>$h_t$</span> is a vector, its length           equals the number of decoder units. If it is a matrix,           <span>$h_t$</span> includes the states for a minibatch of samples and has           the size [units, mb].</li><li><code>inp</code>: next decoder input <span>$i_{t+1}$</span>           (e.g. next embedded token of sequence)</li><li><code>h_enc</code>:  encoder hidden states, 2d or 3d. If <span>$h_{enc}$</span> is a           matrix [units, steps] with the hidden states of all encoder steps.           If 3d: [units, mb, steps] encoder states for all minibatches.</li><li><code>mask</code>:   Optional mask for input states of shape [mb, steps].</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/attn.jl#L348-L385">source</a></section></article><h1 id="Data-providers"><a class="docs-heading-anchor" href="#Data-providers">Data providers</a><a id="Data-providers-1"></a><a class="docs-heading-anchor-permalink" href="#Data-providers" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.DataLoader" href="#NNHelferlein.DataLoader"><code>NNHelferlein.DataLoader</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">abstract type DataLoader</code></pre><p>Mother type for minibatch iterators.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/types.jl#L48-L52">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.SequenceData" href="#NNHelferlein.SequenceData"><code>NNHelferlein.SequenceData</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct SequenceData &lt;: DataLoader</code></pre><p>Type for a generic minibatch iterator. All NNHelferlein models accept minibatches of type <code>DataLoader</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">SequenceData(x; shuffle=true)</code></pre><ul><li><code>x</code>: List, Array or other iterable object with the minibatches</li><li><code>shuffle</code>: if <code>true</code>, minibatches are shuffled every epoch.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/types.jl#L55-L67">source</a></section></article><h2 id="Iteration-utilities"><a class="docs-heading-anchor" href="#Iteration-utilities">Iteration utilities</a><a id="Iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Iteration-utilities" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PartialIterator" href="#NNHelferlein.PartialIterator"><code>NNHelferlein.PartialIterator</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PartialIterator &lt;: DataLoader</code></pre><p>The <code>PartialIterator</code> wraps any iterator and will only iterate the states specified in the list <code>indices</code>. </p><p><strong>Constuctors</strong></p><pre><code class="nohighlight hljs">PartialIterator(inner, indices; shuffle=true)</code></pre><p>Type of the states must match the states of the wrapped iterator <code>inner</code>. A <code>nothing</code> element may be  given to specify the first iterator element.</p><p>If <code>shuffle==true</code>, the list of indices are shuffled every time the <code>PartialIterator</code> is started.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/iterators.jl#L81-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.split_minibatches" href="#NNHelferlein.split_minibatches"><code>NNHelferlein.split_minibatches</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function split_minibatches(it, at=0.8; shuffle=true)</code></pre><p>Return 2 iterators of type <code>PartialIterator</code> which iterate only parts of the  states of the iterator <code>it</code>.  Be aware that the partial iterators will not contain copies of the data but instead forward the data provided by the iterator <code>it</code>.</p><p>The function can be used to split an iterator of minibatches into train-  and validation iterators, without copying any data. As the <code>PartialIterator</code> objects work with the states of the inner iterator, it is important <em>not</em> to shuffle the inner iterator (in this case the  composition of the partial iterators would change and training and validation data  may be mixed!).</p><p><strong>Arguments:</strong></p><ul><li><code>it</code>: Iterator to be splitted. The list of allowed states is created by       performing a full iteration once.</li><li><code>at</code>: Split point. The first returned iterator will include the given        fraction (default: 80%) of the states.</li><li><code>shuffle</code>: If true, the elements are shuffled at each restart of the iterator.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/iterators.jl#L4-L25">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MBNoiser" href="#NNHelferlein.MBNoiser"><code>NNHelferlein.MBNoiser</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">type MBNoiser</code></pre><p>Iterator to wrap any Knet.Data iterator of minibatches in  order to add random noise.     Each value will be multiplied with a random value form  Gaussian noise with mean=1.0 and sd=σ.</p><p><strong>Construtors:</strong></p><pre><code class="nohighlight hljs">MBNoiser(mbs::Knet.Data, σ)
 MBNoiser(mbs::Knet.Data; σ=0.01)</code></pre><ul><li><code>mbs</code>: iterator with minibatches</li><li><code>σ</code>: standard deviation for the Gaussian noise</li></ul><p><strong>Example:</strong></p><pre><code class="language-juliaREPL hljs">julia&gt; trn = minibatch(x)
 julia&gt; tb_train!(mdl, Adam, MBNoiser(trn, σ=0.1))
-julia&gt; mbs_noised = MBNoiser(mbs, 0.05)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/iterators.jl#L133-L155">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MBMasquerade" href="#NNHelferlein.MBMasquerade"><code>NNHelferlein.MBMasquerade</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct MBMasquerade  &lt;: DataLoader</code></pre><p>Iterator wrapper to partially mask training data of a minibatch  iterator of type <code>Knet.Data</code> or <code>NNHelferlein.DataLoader</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">MBMasquerade(it, rho=0.1; mode=:noise, value=0)
+julia&gt; mbs_noised = MBNoiser(mbs, 0.05)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/iterators.jl#L133-L155">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MBMasquerade" href="#NNHelferlein.MBMasquerade"><code>NNHelferlein.MBMasquerade</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct MBMasquerade  &lt;: DataLoader</code></pre><p>Iterator wrapper to partially mask training data of a minibatch  iterator of type <code>Knet.Data</code> or <code>NNHelferlein.DataLoader</code>.</p><p><strong>Constructors:</strong></p><pre><code class="nohighlight hljs">MBMasquerade(it, rho=0.1; mode=:noise, value=0)
 MBMasquerade(it; ρ=0.1, mode=:noise, value=0)</code></pre><p>The constructor may be called with the density <code>rho</code> as normal argument or <code>ρ</code> as keyword argument.</p><p><strong>Arguments:</strong></p><ul><li><code>it</code>: Minibatch iterator that must deliver (x,y)-tuples of        minibatches</li><li><code>ρ=0.1</code> or <code>rho</code>: Density of mask; a value of 1.0 will mask everything,       a value of 0.0 nothing.</li><li><code>value=0</code>: the value with which the masking is done.</li><li><code>mode=:noise</code>: type of masking (only <code>:noise</code> implemented yet):<ul><li><code>:noise</code>: randomly distributed single values of the        training data will be overwitten with <code>value</code>.</li></ul></li></ul><p><strong>Examples:</strong></p><pre><code class="language-juliaREPL hljs">julia&gt; dtrn 
 26-element Knet.Train20.Data{Tuple{CuArray{Float32}, Array{UInt8}}}
 
 julia&gt; mtrn = Masquerade(dtrn, 0.5, value=2.0h)
-Masquerade(26-element Knet.Train20.Data{Tuple{CuArray{Float32}, Array{UInt8}}}, 0.5, 2.0, :noise)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/iterators.jl#L197-L229">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GPUIterator" href="#NNHelferlein.GPUIterator"><code>NNHelferlein.GPUIterator</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">GPUIterator(iterator)</code></pre><p>Wraps any iterator and makes it return CuArrays. Element types  are preserved except of Float-Types, which are casted to <code>Float32</code> for performance reasons).</p><p><strong>Contsructor:</strong></p><p><code>GPUIterator(iterator; y=:cpu)</code>:      + <code>iterator</code>: any iterator     + <code>y</code>: if <code>:gpu</code>, the labels of the iterator are also              converted to <code>CuArray{}</code>. If <code>:cpu</code>, the labels             are not converted.                For a classifier (labels are integers), keeping              labels on the cpu is more efficient. For Regression             (labels are Floats), labels on the gpu is             recommended.</p><p><strong>Deprecation warning:</strong></p><p>Use of <code>GPUIterator</code> is deprecated in favour of  <code>CUDA.CuIterator</code>, which offeres similar functionality.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/iterators.jl#L310-L332">source</a></section></article><h2 id="Tabular-data"><a class="docs-heading-anchor" href="#Tabular-data">Tabular data</a><a id="Tabular-data-1"></a><a class="docs-heading-anchor-permalink" href="#Tabular-data" title="Permalink"></a></h2><p>Tabular data is normally provided in table form (csv, ods) row-wise, i.e. one sample per row. The helper functions can read the tables and generate Knet compatible iterators of minibatches.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_read" href="#NNHelferlein.dataframe_read"><code>NNHelferlein.dataframe_read</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">dataframe_read(fname; o...)</code></pre><p>Read a data table from an CSV-file with one sample per row and return a DataFrame with the data. (ODS-support is removed because of PyCall compatibility issues of the OdsIO package).</p><p>All keyword arguments accepted by CSV.File() can be used.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/dataframes.jl#L5-L14">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_minibatch" href="#NNHelferlein.dataframe_minibatch"><code>NNHelferlein.dataframe_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">dataframe_minibatch(data::DataFrames.DataFrame; size=256, 
+Masquerade(26-element Knet.Train20.Data{Tuple{CuArray{Float32}, Array{UInt8}}}, 0.5, 2.0, :noise)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/iterators.jl#L197-L229">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.GPUIterator" href="#NNHelferlein.GPUIterator"><code>NNHelferlein.GPUIterator</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">GPUIterator(iterator)</code></pre><p>Wraps any iterator and makes it return CuArrays. Element types  are preserved except of Float-Types, which are casted to <code>Float32</code> for performance reasons).</p><p><strong>Contsructor:</strong></p><p><code>GPUIterator(iterator; y=:cpu)</code>:      + <code>iterator</code>: any iterator     + <code>y</code>: if <code>:gpu</code>, the labels of the iterator are also              converted to <code>CuArray{}</code>. If <code>:cpu</code>, the labels             are not converted.                For a classifier (labels are integers), keeping              labels on the cpu is more efficient. For Regression             (labels are Floats), labels on the gpu is             recommended.</p><p><strong>Deprecation warning:</strong></p><p>Use of <code>GPUIterator</code> is deprecated in favour of  <code>CUDA.CuIterator</code>, which offeres similar functionality.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/iterators.jl#L310-L332">source</a></section></article><h2 id="Tabular-data"><a class="docs-heading-anchor" href="#Tabular-data">Tabular data</a><a id="Tabular-data-1"></a><a class="docs-heading-anchor-permalink" href="#Tabular-data" title="Permalink"></a></h2><p>Tabular data is normally provided in table form (csv, ods) row-wise, i.e. one sample per row. The helper functions can read the tables and generate Knet compatible iterators of minibatches.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_read" href="#NNHelferlein.dataframe_read"><code>NNHelferlein.dataframe_read</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">dataframe_read(fname; o...)</code></pre><p>Read a data table from an CSV-file with one sample per row and return a DataFrame with the data. (ODS-support is removed because of PyCall compatibility issues of the OdsIO package).</p><p>All keyword arguments accepted by CSV.File() can be used.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/dataframes.jl#L5-L14">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_minibatch" href="#NNHelferlein.dataframe_minibatch"><code>NNHelferlein.dataframe_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">dataframe_minibatch(data::DataFrames.DataFrame; size=256, 
                     ignore=[], teaching=nothing, 
                     verbose=1, o...)
 
-dataframe_minibatches()</code></pre><p>Make Knet-conform minibatches of type <code>Knet.data</code> from a dataframe with one sample per row.</p><p><code>dataframe_minibatches()</code> is an alieas kept for backward compatibility.</p><p><strong>Arguments:</strong></p><ul><li><code>ignore</code>: defines a list of column names to be ignored</li><li><code>teaching=nothing</code>: defines the column name with teaching input.                <code>teaching</code> is handled differently, depending on its type:               If <code>Int</code>, the teaching input is interpreted as               class IDs and directly used for training (this assumes that               the values range from 1..n). If type is a String, values are               interpreted as class labels and converted to numeric class IDs               by calling <code>mk_class_ids()</code>. The list of valid lables and their               order can be created by calling <code>mk_class_ids(data.y)[2]</code>.               If teaching is a scalar value, regression context is assumed,               and the value is used unchanged for training.                    If <code>teaching</code> is <code>nothing</code>, no teaching input is used and               minibatches of x-data only are returned.</li><li><code>verbose=1</code>: if &gt; 0, a summary of how the dataframe is used is echoed.</li><li>other keyword arguments: all keyword arguments accepted by               <code>Knet.minibatch()</code> may be used.</li></ul><p>Allowed column definitions for <code>ignore</code> and <code>teaching</code> include names (as Strings), column names (as Symbols) or column indices (as Integer values).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/dataframes.jl#L44-L76">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_split" href="#NNHelferlein.dataframe_split"><code>NNHelferlein.dataframe_split</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataframe_split(df::DataFrames.DataFrame;
-                         teaching=&quot;y&quot;, split=0.8, balanced=true)</code></pre><p>Split data, organised row-wise in a DataFrame into train and validation sets.</p><p><strong>Arguments:</strong></p><ul><li><code>df</code>: data</li><li><code>teaching=&quot;y&quot;</code>: name or index of column with teaching input &quot;y&quot;</li><li><code>split=0.8</code>: fraction of data to be used for the first returned              subdataframe</li><li><code>shuffle=true</code>: shuffle the rows of the dataframe.</li><li><code>balanced=true</code>: if <code>true</code>, result datasets will be balanced by oversampling.             Returned datasets will be bigger as expected             but include the same numbers of samples for each class.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/dataframes.jl#L214-L229">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_class_ids" href="#NNHelferlein.mk_class_ids"><code>NNHelferlein.mk_class_ids</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_class_ids(labels)</code></pre><p>Take a list with n class labels for n instances and return a list of n class-IDs (of type Int) and an array of lables with the array index of each label corresponds its ID.</p><p><strong>Arguments:</strong></p><ul><li><code>labels</code>: List of labels (typically Strings)</li></ul><p><strong>Result values:</strong></p><ul><li>array of class-IDs in the same order as the input</li><li>array of unique class-IDs ordered by their ID.</li></ul><p><strong>Examples:</strong></p><pre><code class="nohighlight hljs">julia&gt; labels = [&quot;blue&quot;, &quot;red&quot;, &quot;red&quot;, &quot;red&quot;, &quot;green&quot;, &quot;blue&quot;, &quot;blue&quot;]
+dataframe_minibatches()</code></pre><p>Make Knet-conform minibatches of type <code>Knet.data</code> from a dataframe with one sample per row.</p><p><code>dataframe_minibatches()</code> is an alieas kept for backward compatibility.</p><p><strong>Arguments:</strong></p><ul><li><code>ignore</code>: defines a list of column names to be ignored</li><li><code>teaching=nothing</code>: defines the column name with teaching input.                <code>teaching</code> is handled differently, depending on its type:               If <code>Int</code>, the teaching input is interpreted as               class IDs and directly used for training (this assumes that               the values range from 1..n). If type is a String, values are               interpreted as class labels and converted to numeric class IDs               by calling <code>mk_class_ids()</code>. The list of valid lables and their               order can be created by calling <code>mk_class_ids(data.y)[2]</code>.               If teaching is a scalar value, regression context is assumed,               and the value is used unchanged for training.                    If <code>teaching</code> is <code>nothing</code>, no teaching input is used and               minibatches of x-data only are returned.</li><li><code>verbose=1</code>: if &gt; 0, a summary of how the dataframe is used is echoed.</li><li>other keyword arguments: all keyword arguments accepted by               <code>Knet.minibatch()</code> may be used.</li></ul><p>Allowed column definitions for <code>ignore</code> and <code>teaching</code> include names (as Strings), column names (as Symbols) or column indices (as Integer values).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/dataframes.jl#L44-L76">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataframe_split" href="#NNHelferlein.dataframe_split"><code>NNHelferlein.dataframe_split</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataframe_split(df::DataFrames.DataFrame;
+                         teaching=&quot;y&quot;, split=0.8, balanced=true)</code></pre><p>Split data, organised row-wise in a DataFrame into train and validation sets.</p><p><strong>Arguments:</strong></p><ul><li><code>df</code>: data</li><li><code>teaching=&quot;y&quot;</code>: name or index of column with teaching input &quot;y&quot;</li><li><code>split=0.8</code>: fraction of data to be used for the first returned              subdataframe</li><li><code>shuffle=true</code>: shuffle the rows of the dataframe.</li><li><code>balanced=true</code>: if <code>true</code>, result datasets will be balanced by oversampling.             Returned datasets will be bigger as expected             but include the same numbers of samples for each class.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/dataframes.jl#L214-L229">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_class_ids" href="#NNHelferlein.mk_class_ids"><code>NNHelferlein.mk_class_ids</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_class_ids(labels)</code></pre><p>Take a list with n class labels for n instances and return a list of n class-IDs (of type Int) and an array of lables with the array index of each label corresponds its ID.</p><p><strong>Arguments:</strong></p><ul><li><code>labels</code>: List of labels (typically Strings)</li></ul><p><strong>Result values:</strong></p><ul><li>array of class-IDs in the same order as the input</li><li>array of unique class-IDs ordered by their ID.</li></ul><p><strong>Examples:</strong></p><pre><code class="nohighlight hljs">julia&gt; labels = [&quot;blue&quot;, &quot;red&quot;, &quot;red&quot;, &quot;red&quot;, &quot;green&quot;, &quot;blue&quot;, &quot;blue&quot;]
 7-element Array{String,1}:
  &quot;blue&quot;
  &quot;red&quot;
@@ -75,7 +75,7 @@
 3-element Array{String,1}:
  &quot;blue&quot;
  &quot;green&quot;
- &quot;red&quot;</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/dataframes.jl#L160-L204">source</a></section></article><h2 id="Image-data"><a class="docs-heading-anchor" href="#Image-data">Image data</a><a id="Image-data-1"></a><a class="docs-heading-anchor-permalink" href="#Image-data" title="Permalink"></a></h2><p>Images as data should be provided in directories with the directory names denoting the class labels. The helpers read from the root of a directory tree in which the first level of sub-dirs tell the class label. All images in the tree under a class label are read as instances of the respective class. The following tree will generate the classes <code>daisy</code>, <code>rose</code> and <code>tulip</code>:</p><pre><code class="nohighlight hljs">image_dir/
+ &quot;red&quot;</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/dataframes.jl#L160-L204">source</a></section></article><h2 id="Image-data"><a class="docs-heading-anchor" href="#Image-data">Image data</a><a id="Image-data-1"></a><a class="docs-heading-anchor-permalink" href="#Image-data" title="Permalink"></a></h2><p>Images as data should be provided in directories with the directory names denoting the class labels. The helpers read from the root of a directory tree in which the first level of sub-dirs tell the class label. All images in the tree under a class label are read as instances of the respective class. The following tree will generate the classes <code>daisy</code>, <code>rose</code> and <code>tulip</code>:</p><pre><code class="nohighlight hljs">image_dir/
 ├── daisy
 │   ├── 01
 │   │   ├── 01
@@ -100,10 +100,10 @@
     pre_proc
     pre_load
     i_images
-end</code></pre><p>Iterable image loader to provide minibatches of images as 4-d-arrays (x,y,rgb,mb).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L100-L117">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_image_minibatch" href="#NNHelferlein.mk_image_minibatch"><code>NNHelferlein.mk_image_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_image_minibatch(dir, batchsize; split=false, at=0.8,
+end</code></pre><p>Iterable image loader to provide minibatches of images as 4-d-arrays (x,y,rgb,mb).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L100-L117">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_image_minibatch" href="#NNHelferlein.mk_image_minibatch"><code>NNHelferlein.mk_image_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_image_minibatch(dir, batchsize; split=false, at=0.8,
                             balanced=false, shuffle=true, train=true,
                             pre_load=false,
-                            aug_pipl=nothing, pre_proc=nothing)</code></pre><p>Return one or two iterable image-loader-objects that provides minibatches of images. For training each minibatch is a tupel <code>(x,y)</code> with x: 4-d-array with the minibatch of data and y: vector of class IDs as Int.</p><p><strong>Arguments:</strong></p><ul><li><code>dir</code>: base-directory of the image dataset. The first level of       sub-dirs are used as class names.</li><li><code>batchsize</code>: size of minibatches</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>split</code>: return two iterators for training and validation</li><li><code>at</code>: split fraction (for training; the rest is for validation).</li><li><code>balanced</code>: return balanced data (i.e. same number of instances       for all classes). Balancing is achieved via oversampling</li><li><code>shuffle</code>: if true, shuffle the images everytime the iterator       restarts</li><li><code>train</code>: if true, minibatches with (x,y) tuples are provided,       if false only x (for prediction)</li><li><code>aug_pipl</code>: augmentation pipeline for Augmentor.jl. Augmentation       is performed before the pre_proc-function is applied</li><li><code>pre_proc</code>: function with preprocessing       and augmentation algorithms of type x = f(x). In contrast       to the augmentation that modifies images, is <code>pre_proc</code>       working on Arrays{Float32}.</li><li><code>pre_load=false</code>: read all images from disk once when populating the       loader (requires loads of memory, but speeds up training).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L7-L40">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_class_labels" href="#NNHelferlein.get_class_labels"><code>NNHelferlein.get_class_labels</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_class_labels(d::DataLoader)</code></pre><p>Extracts a list of class labels from a DataLoader.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L91-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.image2array" href="#NNHelferlein.image2array"><code>NNHelferlein.image2array</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function image2array(img)</code></pre><p>Take an image and return a 3d-array for RGB and a 2d-array for grayscale images with the colour channels as last dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L306-L311">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.array2image" href="#NNHelferlein.array2image"><code>NNHelferlein.array2image</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function array2image(arr)</code></pre><p>Take a 3d-array with colour channels as last dimension or a 2d-array and return an array of RGB or of Gray as Image.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L328-L333">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.array2RGB" href="#NNHelferlein.array2RGB"><code>NNHelferlein.array2RGB</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function array2RGB(arr)</code></pre><p>Take a 3d-array with colour channels as last dimension or a 2d-array and return always an array of RGB as Image.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/images.jl#L353-L358">source</a></section></article><h2 id="Text-data"><a class="docs-heading-anchor" href="#Text-data">Text data</a><a id="Text-data-1"></a><a class="docs-heading-anchor-permalink" href="#Text-data" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.WordTokenizer" href="#NNHelferlein.WordTokenizer"><code>NNHelferlein.WordTokenizer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct WordTokenizer
+                            aug_pipl=nothing, pre_proc=nothing)</code></pre><p>Return one or two iterable image-loader-objects that provides minibatches of images. For training each minibatch is a tupel <code>(x,y)</code> with x: 4-d-array with the minibatch of data and y: vector of class IDs as Int.</p><p><strong>Arguments:</strong></p><ul><li><code>dir</code>: base-directory of the image dataset. The first level of       sub-dirs are used as class names.</li><li><code>batchsize</code>: size of minibatches</li></ul><p><strong>Keyword arguments:</strong></p><ul><li><code>split</code>: return two iterators for training and validation</li><li><code>at</code>: split fraction (for training; the rest is for validation).</li><li><code>balanced</code>: return balanced data (i.e. same number of instances       for all classes). Balancing is achieved via oversampling</li><li><code>shuffle</code>: if true, shuffle the images everytime the iterator       restarts</li><li><code>train</code>: if true, minibatches with (x,y) tuples are provided,       if false only x (for prediction)</li><li><code>aug_pipl</code>: augmentation pipeline for Augmentor.jl. Augmentation       is performed before the pre_proc-function is applied</li><li><code>pre_proc</code>: function with preprocessing       and augmentation algorithms of type x = f(x). In contrast       to the augmentation that modifies images, is <code>pre_proc</code>       working on Arrays{Float32}.</li><li><code>pre_load=false</code>: read all images from disk once when populating the       loader (requires loads of memory, but speeds up training).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L7-L40">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_class_labels" href="#NNHelferlein.get_class_labels"><code>NNHelferlein.get_class_labels</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_class_labels(d::DataLoader)</code></pre><p>Extracts a list of class labels from a DataLoader.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L91-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.image2array" href="#NNHelferlein.image2array"><code>NNHelferlein.image2array</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function image2array(img)</code></pre><p>Take an image and return a 3d-array for RGB and a 2d-array for grayscale images with the colour channels as last dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L306-L311">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.array2image" href="#NNHelferlein.array2image"><code>NNHelferlein.array2image</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function array2image(arr)</code></pre><p>Take a 3d-array with colour channels as last dimension or a 2d-array and return an array of RGB or of Gray as Image.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L328-L333">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.array2RGB" href="#NNHelferlein.array2RGB"><code>NNHelferlein.array2RGB</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function array2RGB(arr)</code></pre><p>Take a 3d-array with colour channels as last dimension or a 2d-array and return always an array of RGB as Image.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/images.jl#L353-L358">source</a></section></article><h2 id="Text-data"><a class="docs-heading-anchor" href="#Text-data">Text data</a><a id="Text-data-1"></a><a class="docs-heading-anchor-permalink" href="#Text-data" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.WordTokenizer" href="#NNHelferlein.WordTokenizer"><code>NNHelferlein.WordTokenizer</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">mutable struct WordTokenizer
     len
     w2i
     i2w
@@ -176,12 +176,12 @@
 julia&gt; vocab([&quot;They love Julia&quot;, &quot;I love Julia&quot;])
 2-element Array{Array{Int64,1},1}:
  [7, 5, 8]
- [6, 5, 8]</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L10-L153">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_tatoeba_corpus" href="#NNHelferlein.get_tatoeba_corpus"><code>NNHelferlein.get_tatoeba_corpus</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_tatoeba_corpus(lang; force=false,
-            url=&quot;https://www.manythings.org/anki/&quot;)</code></pre><p>Download and read a bilingual text corpus from Tatoeba (provided) by ManyThings (https://www.manythings.org). All corpi are English-<em>Language</em>-pairs with different size and quality. Considerable languages include:</p><ul><li><code>fra</code>: French-English, 180 000 sentences</li><li><code>deu</code>: German-English, 227 000 sentences</li><li><code>heb</code>: Hebrew-English, 126 000 sentences</li><li><code>por</code>: Portuguese-English, 170 000 sentences</li><li><code>tur</code>: Turkish-English, 514 000 sentences</li></ul><p>The function returns two lists with corresponding sentences in both languages. Sentences are <em>not</em> processed/normalised/cleaned, but exactly as provided by Tatoeba.</p><p>The data is stored in the package directory and only downloaded once.</p><p><strong>Arguments:</strong></p><ul><li><code>lang</code>: languagecode</li><li><code>force=false</code>: if <code>true</code>, the corpus is downloaded even if       a data file is already saved.</li><li><code>url</code>: base url of ManyThings.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L287-L312">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.sequence_minibatch" href="#NNHelferlein.sequence_minibatch"><code>NNHelferlein.sequence_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function sequence_minibatch(x, [y], batchsize; 
+ [6, 5, 8]</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L10-L153">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_tatoeba_corpus" href="#NNHelferlein.get_tatoeba_corpus"><code>NNHelferlein.get_tatoeba_corpus</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_tatoeba_corpus(lang; force=false,
+            url=&quot;https://www.manythings.org/anki/&quot;)</code></pre><p>Download and read a bilingual text corpus from Tatoeba (provided) by ManyThings (https://www.manythings.org). All corpi are English-<em>Language</em>-pairs with different size and quality. Considerable languages include:</p><ul><li><code>fra</code>: French-English, 180 000 sentences</li><li><code>deu</code>: German-English, 227 000 sentences</li><li><code>heb</code>: Hebrew-English, 126 000 sentences</li><li><code>por</code>: Portuguese-English, 170 000 sentences</li><li><code>tur</code>: Turkish-English, 514 000 sentences</li></ul><p>The function returns two lists with corresponding sentences in both languages. Sentences are <em>not</em> processed/normalised/cleaned, but exactly as provided by Tatoeba.</p><p>The data is stored in the package directory and only downloaded once.</p><p><strong>Arguments:</strong></p><ul><li><code>lang</code>: languagecode</li><li><code>force=false</code>: if <code>true</code>, the corpus is downloaded even if       a data file is already saved.</li><li><code>url</code>: base url of ManyThings.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L287-L312">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.sequence_minibatch" href="#NNHelferlein.sequence_minibatch"><code>NNHelferlein.sequence_minibatch</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function sequence_minibatch(x, [y], batchsize; 
                             pad=NNHelferlein.TOKEN_PAD, 
                             seq2seq=true, pad_y=pad,
                             x_padding=false,
-                            shuffle=true, partial=false)</code></pre><p>Return an iterator of type <code>DataLoader</code> with (x,y) sequence minibatches from two lists of sequences.</p><p>All sequences within a minibatch in x and y are brought to the same length by padding with the token provided as <code>pad</code>.</p><p>The sequences are sorted by length before building minibatches in order to  reduce padding (i.e. sequences of similar length are combined to a minibatch). If the same sequence length is needed for all minibatches, the sequences must be truncated or padded before call of <code>sequence_minibatch()</code>  (see functions <code>truncate_seqence()</code> and <code>pad_sequence()</code>).</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: List of sequences of <code>Int</code></li><li><code>y</code>: List of sequences of <code>Int</code> or list of target values (i.e. teaching input)</li><li><code>batchsize</code>: size of minibatches</li><li><code>pad=NNHelferlein.PAD_TOKEN</code>,</li><li><code>pad_y=x</code>: token, used for padding. The token must be compatible       with the type of the sequence elements. If <code>pad_y</code> is omitted, it is set        equal to pad_x.</li><li><code>seq2seq=true</code>: if <code>true</code> and <code>y</code> is provided, sequence-to-sequence minibatches are        created. Otherwise <code>y</code> is treated as scalar teaching input.</li><li><code>shuffle=true</code>: The minibatches are shuffled as last step. If <code>false</code> the minibatches        with short sequences will be at the beginning of the dataset.</li><li><code>partial=false</code>: If <code>true</code>, a partial minibatch will be created if necessaray to        include all input data.</li><li><code>x_padding=false</code>: if <code>true</code>, pad sequences in x to make minibatches of the demanded size,        even if there are not       enougth sequences of the same length in x.       If <code>false</code>, partial minibatches are built (if partial == <code>true</code>) or remaining        sequneces are skipped (if partial == <code>false</code>).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L476-L515">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.pad_sequence" href="#NNHelferlein.pad_sequence"><code>NNHelferlein.pad_sequence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function pad_sequence(s, len; token=NNHelferlein.TOKEN_PAD)</code></pre><p>Stretch a sequence to length <code>len</code> by adding the padding token.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L612-L616">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.truncate_sequence" href="#NNHelferlein.truncate_sequence"><code>NNHelferlein.truncate_sequence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function truncate_sequence(s, len; end_token=nothing)</code></pre><p>Truncate a sequence to the length <code>len</code>.  If not <code>isnothing(end_token)</code>, the last token of the sequence is  overwritten by the token.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L626-L632">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.clean_sentence" href="#NNHelferlein.clean_sentence"><code>NNHelferlein.clean_sentence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function clean_sentence(s)</code></pre><p>Cleaning a sentence in some simple steps:</p><ul><li>normalise Unicode</li><li>remove punctuation</li><li>remove duplicate spaces</li><li>strip</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/texts.jl#L161-L170">source</a></section></article><h1 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.tb_train!" href="#NNHelferlein.tb_train!"><code>NNHelferlein.tb_train!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function tb_train!(mdl, opti, trn, vld=nothing; epochs=1, split=nothing,
+                            shuffle=true, partial=false)</code></pre><p>Return an iterator of type <code>DataLoader</code> with (x,y) sequence minibatches from two lists of sequences.</p><p>All sequences within a minibatch in x and y are brought to the same length by padding with the token provided as <code>pad</code>.</p><p>The sequences are sorted by length before building minibatches in order to  reduce padding (i.e. sequences of similar length are combined to a minibatch). If the same sequence length is needed for all minibatches, the sequences must be truncated or padded before call of <code>sequence_minibatch()</code>  (see functions <code>truncate_seqence()</code> and <code>pad_sequence()</code>).</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: List of sequences of <code>Int</code></li><li><code>y</code>: List of sequences of <code>Int</code> or list of target values (i.e. teaching input)</li><li><code>batchsize</code>: size of minibatches</li><li><code>pad=NNHelferlein.PAD_TOKEN</code>,</li><li><code>pad_y=x</code>: token, used for padding. The token must be compatible       with the type of the sequence elements. If <code>pad_y</code> is omitted, it is set        equal to pad_x.</li><li><code>seq2seq=true</code>: if <code>true</code> and <code>y</code> is provided, sequence-to-sequence minibatches are        created. Otherwise <code>y</code> is treated as scalar teaching input.</li><li><code>shuffle=true</code>: The minibatches are shuffled as last step. If <code>false</code> the minibatches        with short sequences will be at the beginning of the dataset.</li><li><code>partial=false</code>: If <code>true</code>, a partial minibatch will be created if necessaray to        include all input data.</li><li><code>x_padding=false</code>: if <code>true</code>, pad sequences in x to make minibatches of the demanded size,        even if there are not       enougth sequences of the same length in x.       If <code>false</code>, partial minibatches are built (if partial == <code>true</code>) or remaining        sequneces are skipped (if partial == <code>false</code>).</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L476-L515">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.pad_sequence" href="#NNHelferlein.pad_sequence"><code>NNHelferlein.pad_sequence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function pad_sequence(s, len; token=NNHelferlein.TOKEN_PAD)</code></pre><p>Stretch a sequence to length <code>len</code> by adding the padding token.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L612-L616">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.truncate_sequence" href="#NNHelferlein.truncate_sequence"><code>NNHelferlein.truncate_sequence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function truncate_sequence(s, len; end_token=nothing)</code></pre><p>Truncate a sequence to the length <code>len</code>.  If not <code>isnothing(end_token)</code>, the last token of the sequence is  overwritten by the token.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L626-L632">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.clean_sentence" href="#NNHelferlein.clean_sentence"><code>NNHelferlein.clean_sentence</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function clean_sentence(s)</code></pre><p>Cleaning a sentence in some simple steps:</p><ul><li>normalise Unicode</li><li>remove punctuation</li><li>remove duplicate spaces</li><li>strip</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/texts.jl#L161-L170">source</a></section></article><h1 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.tb_train!" href="#NNHelferlein.tb_train!"><code>NNHelferlein.tb_train!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function tb_train!(mdl, opti, trn, vld=nothing; epochs=1, split=nothing,
                   lr_decay=nothing, lrd_steps=5, lrd_linear=false,
                   l2=nothing, l1=nothing,
                   eval_size=0.2, eval_freq=1,
@@ -191,21 +191,21 @@
                   tb_dir=&quot;logs&quot;, tb_name=&quot;run&quot;,
                   tb_text=&quot;&quot;&quot;Description of tb_train!() run.&quot;&quot;&quot;,
                   resume=true, tensorboard=true, return_stats=false,
-                  opti_args...)</code></pre><p>Train function with TensorBoard integration. TB logs are written with the TensorBoardLogger.jl package. The model is updated (in-place) and the trained model is returned.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: model; i.e. forward-function for the net</li><li><code>opti</code>: Knet-stype optimiser type</li><li><code>trn</code>: training data; iterator to provide (x,y)-tuples with       minibatches</li><li><code>vld</code>: validation data; iterator to provide (x,y)-tuples with       minibatches. Set to <code>nothing</code>, if not defined.</li></ul><p><strong>Keyword arguments:</strong></p><p><strong>Optimiser:</strong></p><ul><li><code>epochs=1</code>: number of epochs to train</li><li><code>resume=true</code>: if <code>true</code>, optimiser parameters (momentum or gradient       moving average) from a previous run are used to enable a        seemless continuation of the training.       Be aware that in a <code>resume</code>ed training, the original optimizer        will be used, even if a different one is specified for the continuation. </li><li><code>lr_decay=nothing</code>: do a leraning rate decay if not <code>nothing</code>:       the value given is the final learning rate after <code>lrd_steps</code>       steps of decay (<code>lr_decay</code> may be bigger than <code>lr</code>; in this case       the leraning rate is increased).        <code>lr_decay</code> is only applied if both start learning rate       <code>lr</code> and final learning rate <code>lr_decay</code> are defined explicitly.       Example: <code>lr=0.01, lr_decay=0.001</code> will reduce the lr from       0.01 to 0.001 during the training (by default in 5 steps).            <code>lr_decay</code> is applied to <code>l1</code> and <code>l2</code> with the same decay rate.</li><li><code>lrd_steps=5</code>: number of learning rate decay steps. Default is <code>5</code>, i.e.       modify the lr 4 times during the training (resulting in 5 different        learning rates).</li><li><code>lrd_linear=false</code>: type of learning rate decay;       If <code>false</code>, lr is modified       by a constant factor (e.g. 0.9) resulting in an exponential decay.       If <code>true</code>, lr is modified by the same step size, i.e. linearly.</li><li><code>l1=nothing</code>: L1 regularisation; implemented as weight decay per       parameter. If learning-rate decay is used, L1 and L2 are also decayed.</li><li><code>l2=nothing</code>: L2 regularisation; implemented as weight decay per       parameter</li><li><code>opti_args...</code>: optional keyword arguments for the optimiser can be specified       (i.e. <code>lr</code>, <code>gamma</code>, ...).</li></ul><p><strong>Model evaluation:</strong></p><ul><li><code>split=nothing</code>: if no validation data is specified and split is a        fraction (between 0.0 and 1.0), the training dataset is splitted at the       specified point (e.g.: if <code>split=0.8</code>, 80% of the minibatches are used        for training and 20% for validation).</li><li><code>eval_size=0.2</code>: fraction of validation data to be used for calculating       loss and accuracy for train and validation data during training.</li><li><code>eval_freq=1</code>: frequency of evaluation; default=1 means evaluation is       calculated after each epoch. With eval_freq=10 eveluation is       calculated 10 times per epoch.</li><li><code>acc_fun=nothing</code>: function to calculate accuracy. The function       must implement the following signature: <code>fun(model; data)</code> where       data is an iterator that provides (x,y)-tuples of minibatches.       For classification tasks, <code>accuracy</code> from the Knet package is       a good choice. For regression a correlation or mean error       may be preferred.</li><li><code>mb_loss_freq=100</code>: frequency of training loss reporting. default=100       means that 100 loss-values per epoch will be logged to TensorBoard.       If mb<em>loss</em>freq is greater then the number of minibatches,       loss is logged for each minibatch.</li><li><code>checkpoints=nothing</code>: frequency of model checkpoints written to disk.       Default is <code>nothing</code>, i.e. no checkpoints are written.       To write the model after each epoch with       name <code>model</code> use cp<em>epoch=1; to write every second epochs cp</em>epoch=2,        etc.</li><li><code>cp_dir=&quot;checkpoints&quot;</code>: directory for checkpoints</li><li><code>return_stats=false</code>: if <code>true</code>, a dictionary with losses and accuracies  of       training and validation data is returned instead of the        model. </li></ul><p><strong>TensorBoard:</strong></p><p>TensorBoard log-directory is created from 3 parts: <code>tb_dir/tb_name/&lt;current date time&gt;</code>.</p><ul><li><code>tensorboard=true</code>: if <code>true</code>, TensorBoard logs are written</li><li><code>tb_dir=&quot;logs&quot;</code>: root directory for TensorBoard logs.</li><li><code>tb_name=&quot;run&quot;</code>: name of training run. <code>tb_name</code> will be used as       directory name and should not include whitespace</li><li><code>tb_text</code>:  description       to be included in the TensorBoard log as <em>text</em> log.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/train.jl#L1-L97">source</a></section></article><h1 id="Evaluation-and-accuracy"><a class="docs-heading-anchor" href="#Evaluation-and-accuracy">Evaluation and accuracy</a><a id="Evaluation-and-accuracy-1"></a><a class="docs-heading-anchor-permalink" href="#Evaluation-and-accuracy" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.focal_nll" href="#NNHelferlein.focal_nll"><code>NNHelferlein.focal_nll</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function focal_nll(scores, labels::AbstractArray{&lt;:Integer}; γ=2.0, dims=1)
-function focal_nll(mdl; data, γ=2.0, dims=1)</code></pre><p>Calculate the negative log-likelihood (i.e. cross entropy) with increased weights on  weekly classified samples. <em>focal nll</em> for sample <em>j</em> is defined as</p><p class="math-container">\[- (1 - p_{j})^{\gamma} \cdot \ln p_{j} =\]</p><p class="math-container">\[(1 - p_{j})^{\gamma} \cdot nll(p_{j})\]</p><p>where <em>p</em> is the softmax-scaled likelyhood for the true class of the  <em>j</em>-th sample.  The sample weight is high, if predicted <em>p</em> &lt;&lt; 1.</p><p>The second signature can be used to caclulate the mean <em>focus nll</em> for a dataset of minibatches of (x,y)-tuples.</p><p><strong>Arguments:</strong></p><ul><li><code>scores</code>: unnormalised scores (i.e. activations of output neurons           without applying an activation function), typically of a classifier with            one neuron per class</li><li><code>labels</code>: ground truth as integer values</li><li><code>γ=2.0</code>: The parameter <em>γ</em> controls the strength of the effect:            for <em>γ=0</em>, all weights become exactly 1.0; with higher values            for <em>γ</em>,            focus on mis-classified or weakly classified sample is increased.</li></ul><p><code>dims=1</code>: dimension in which the instances are organised.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L523-L554">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.focal_bce" href="#NNHelferlein.focal_bce"><code>NNHelferlein.focal_bce</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function focal_bce(scores, labels::AbstractArray{&lt;:Integer}; 
-function focal_bce(mdl; data, γ=2.0, dims=1)</code></pre><p>Calculate the biray crossentropywith increased weights on  weekly classified samples. <em>focal bce</em> for sample <em>j</em> is defined as</p><p class="math-container">\[(1 - p_{j})^{\gamma} \cdot bce(p_{j})\]</p><p>where <em>p</em> is the softmax-scaled likelyhood for the true class of the  <em>j</em>-th sample.  The sample weight is high, if predicted <em>p</em> &lt;&lt; 1.</p><p>The second signature can be used to caclulate the mean <em>focus bce</em> for a dataset of minibatches of (x,y)-tuples.</p><p>For arguments and details, please refer to the documentation of  <code>focal_nll</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L573-L593">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict" href="#NNHelferlein.predict"><code>NNHelferlein.predict</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict(mdl; data, softmax=false)
-function predict(mdl, x; softmax=false )</code></pre><p>Return the prediction for minibatches of data.      The signature follows the standard call <code>predict(model, data=xxx)</code>.      The second signature predicts a single Array of data.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: executable network model</li><li><code>data=iterator</code>: iterator providing minibatches       of input data; if the minibatches include y-values        (i.e. teaching input), predictions (i.e. index of class with highest        value <em>and</em> the y-values will be returned. </li><li><code>data</code>: single Array of input data (i.e. input for one minibatch)</li><li><code>softmax</code>: if true or if model is of type <code>Classifier</code> the predicted       softmax probabilities are returned instead of raw       activations.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/train.jl#L576-L595">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict_top5" href="#NNHelferlein.predict_top5"><code>NNHelferlein.predict_top5</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict_top5(mdl; data, top_n=5, classes=nothing)</code></pre><p>Run the model <code>mdl</code> for data in minibatches <code>data</code> and print the top 5 predictions as softmax probabilities.</p><p><strong>Arguments:</strong></p><ul><li><code>top_n</code>: print top <em>n</em> hits</li><li><code>classes</code>: optional list of human readable class labels.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/train.jl#L545-L554">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.minibatch_eval" href="#NNHelferlein.minibatch_eval"><code>NNHelferlein.minibatch_eval</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function minibatch_eval(mdl, fun, data; o...)</code></pre><p>Given an accuracy or loss function <code>fun(p, y)</code> that returns an accuracy meassure for n-dimensional arrays of predictions <code>p</code> and  teaching input <code>y</code> (i.e. one minibatch of data),  <code>minibatch_eval()</code> applies the <code>fun()</code> to all minibatches supplied by  the minibatch iterator <code>data</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: model to compute predictions</li><li><code>fun</code>: evaluation function for one minibatch that returns the mean       of results for all samples of the minibatch</li><li><code>data</code>: iterator that supplies a Tuple of (x,y) for        each minibatch</li></ul><p><code>o...</code>: all additional keyword arguments are forwarded to         <code>fun()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L485-L502">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.squared_error_acc" href="#NNHelferlein.squared_error_acc"><code>NNHelferlein.squared_error_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function squared_error_acc(mdl; data)</code></pre><p>Return the <em>mean squared error</em> between the predictions  of the model <code>mdl</code> and the corresponding teaching input by providung the standard signature  <code>fun(model, data=iterator)</code>.</p><p><strong>Arguments</strong></p><ul><li><code>mdl</code>: model with the signature <code>mdl(x)</code> to generate predictions       for one minibatch (i.e. array) of data.</li><li><code>data</code>: iterator, providing (x,y)-tuples of training or validation        data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L440-L453">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.abs_error_acc" href="#NNHelferlein.abs_error_acc"><code>NNHelferlein.abs_error_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function abs_error_acc(mdl; data)</code></pre><p>Return the <em>mean absolute error</em> between the predictions  of the model <code>mdl</code> and the corresponding teaching input by providung the standard signature  <code>fun(model, data=iterator)</code>.</p><p><strong>Arguments</strong></p><ul><li><code>mdl</code>: model with the signature <code>mdl(x)</code> to generate predictions       for one minibatch (i.e. array) of data.</li><li><code>data</code>: iterator, providing (x,y)-tuples of training or validation        data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L465-L478">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.hamming_dist" href="#NNHelferlein.hamming_dist"><code>NNHelferlein.hamming_dist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function hamming_dist(p, t; accuracy=false, 
+                  opti_args...)</code></pre><p>Train function with TensorBoard integration. TB logs are written with the TensorBoardLogger.jl package. The model is updated (in-place) and the trained model is returned.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: model; i.e. forward-function for the net</li><li><code>opti</code>: Knet-stype optimiser type</li><li><code>trn</code>: training data; iterator to provide (x,y)-tuples with       minibatches</li><li><code>vld</code>: validation data; iterator to provide (x,y)-tuples with       minibatches. Set to <code>nothing</code>, if not defined.</li></ul><p><strong>Keyword arguments:</strong></p><p><strong>Optimiser:</strong></p><ul><li><code>epochs=1</code>: number of epochs to train</li><li><code>resume=true</code>: if <code>true</code>, optimiser parameters (momentum or gradient       moving average) from a previous run are used to enable a        seemless continuation of the training.       Be aware that in a <code>resume</code>ed training, the original optimizer        will be used, even if a different one is specified for the continuation. </li><li><code>lr_decay=nothing</code>: do a leraning rate decay if not <code>nothing</code>:       the value given is the final learning rate after <code>lrd_steps</code>       steps of decay (<code>lr_decay</code> may be bigger than <code>lr</code>; in this case       the leraning rate is increased).        <code>lr_decay</code> is only applied if both start learning rate       <code>lr</code> and final learning rate <code>lr_decay</code> are defined explicitly.       Example: <code>lr=0.01, lr_decay=0.001</code> will reduce the lr from       0.01 to 0.001 during the training (by default in 5 steps).            <code>lr_decay</code> is applied to <code>l1</code> and <code>l2</code> with the same decay rate.</li><li><code>lrd_steps=5</code>: number of learning rate decay steps. Default is <code>5</code>, i.e.       modify the lr 4 times during the training (resulting in 5 different        learning rates).</li><li><code>lrd_linear=false</code>: type of learning rate decay;       If <code>false</code>, lr is modified       by a constant factor (e.g. 0.9) resulting in an exponential decay.       If <code>true</code>, lr is modified by the same step size, i.e. linearly.</li><li><code>l1=nothing</code>: L1 regularisation; implemented as weight decay per       parameter. If learning-rate decay is used, L1 and L2 are also decayed.</li><li><code>l2=nothing</code>: L2 regularisation; implemented as weight decay per       parameter</li><li><code>opti_args...</code>: optional keyword arguments for the optimiser can be specified       (i.e. <code>lr</code>, <code>gamma</code>, ...).</li></ul><p><strong>Model evaluation:</strong></p><ul><li><code>split=nothing</code>: if no validation data is specified and split is a        fraction (between 0.0 and 1.0), the training dataset is splitted at the       specified point (e.g.: if <code>split=0.8</code>, 80% of the minibatches are used        for training and 20% for validation).</li><li><code>eval_size=0.2</code>: fraction of validation data to be used for calculating       loss and accuracy for train and validation data during training.</li><li><code>eval_freq=1</code>: frequency of evaluation; default=1 means evaluation is       calculated after each epoch. With eval_freq=10 eveluation is       calculated 10 times per epoch.</li><li><code>acc_fun=nothing</code>: function to calculate accuracy. The function       must implement the following signature: <code>fun(model; data)</code> where       data is an iterator that provides (x,y)-tuples of minibatches.       For classification tasks, <code>accuracy</code> from the Knet package is       a good choice. For regression a correlation or mean error       may be preferred.</li><li><code>mb_loss_freq=100</code>: frequency of training loss reporting. default=100       means that 100 loss-values per epoch will be logged to TensorBoard.       If mb<em>loss</em>freq is greater then the number of minibatches,       loss is logged for each minibatch.</li><li><code>checkpoints=nothing</code>: frequency of model checkpoints written to disk.       Default is <code>nothing</code>, i.e. no checkpoints are written.       To write the model after each epoch with       name <code>model</code> use cp<em>epoch=1; to write every second epochs cp</em>epoch=2,        etc.</li><li><code>cp_dir=&quot;checkpoints&quot;</code>: directory for checkpoints</li><li><code>return_stats=false</code>: if <code>true</code>, a dictionary with losses and accuracies  of       training and validation data is returned instead of the        model. </li></ul><p><strong>TensorBoard:</strong></p><p>TensorBoard log-directory is created from 3 parts: <code>tb_dir/tb_name/&lt;current date time&gt;</code>.</p><ul><li><code>tensorboard=true</code>: if <code>true</code>, TensorBoard logs are written</li><li><code>tb_dir=&quot;logs&quot;</code>: root directory for TensorBoard logs.</li><li><code>tb_name=&quot;run&quot;</code>: name of training run. <code>tb_name</code> will be used as       directory name and should not include whitespace</li><li><code>tb_text</code>:  description       to be included in the TensorBoard log as <em>text</em> log.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/train.jl#L1-L97">source</a></section></article><h1 id="Evaluation-and-accuracy"><a class="docs-heading-anchor" href="#Evaluation-and-accuracy">Evaluation and accuracy</a><a id="Evaluation-and-accuracy-1"></a><a class="docs-heading-anchor-permalink" href="#Evaluation-and-accuracy" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.focal_nll" href="#NNHelferlein.focal_nll"><code>NNHelferlein.focal_nll</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function focal_nll(scores, labels::AbstractArray{&lt;:Integer}; γ=2.0, dims=1)
+function focal_nll(mdl; data, γ=2.0, dims=1)</code></pre><p>Calculate the negative log-likelihood (i.e. cross entropy) with increased weights on  weekly classified samples. <em>focal nll</em> for sample <em>j</em> is defined as</p><p class="math-container">\[- (1 - p_{j})^{\gamma} \cdot \ln p_{j} =\]</p><p class="math-container">\[(1 - p_{j})^{\gamma} \cdot nll(p_{j})\]</p><p>where <em>p</em> is the softmax-scaled likelyhood for the true class of the  <em>j</em>-th sample.  The sample weight is high, if predicted <em>p</em> &lt;&lt; 1.</p><p>The second signature can be used to caclulate the mean <em>focus nll</em> for a dataset of minibatches of (x,y)-tuples.</p><p><strong>Arguments:</strong></p><ul><li><code>scores</code>: unnormalised scores (i.e. activations of output neurons           without applying an activation function), typically of a classifier with            one neuron per class</li><li><code>labels</code>: ground truth as integer values</li><li><code>γ=2.0</code>: The parameter <em>γ</em> controls the strength of the effect:            for <em>γ=0</em>, all weights become exactly 1.0; with higher values            for <em>γ</em>,            focus on mis-classified or weakly classified sample is increased.</li></ul><p><code>dims=1</code>: dimension in which the instances are organised.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L523-L554">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.focal_bce" href="#NNHelferlein.focal_bce"><code>NNHelferlein.focal_bce</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function focal_bce(scores, labels::AbstractArray{&lt;:Integer}; 
+function focal_bce(mdl; data, γ=2.0, dims=1)</code></pre><p>Calculate the biray crossentropywith increased weights on  weekly classified samples. <em>focal bce</em> for sample <em>j</em> is defined as</p><p class="math-container">\[(1 - p_{j})^{\gamma} \cdot bce(p_{j})\]</p><p>where <em>p</em> is the softmax-scaled likelyhood for the true class of the  <em>j</em>-th sample.  The sample weight is high, if predicted <em>p</em> &lt;&lt; 1.</p><p>The second signature can be used to caclulate the mean <em>focus bce</em> for a dataset of minibatches of (x,y)-tuples.</p><p>For arguments and details, please refer to the documentation of  <code>focal_nll</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L573-L593">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict" href="#NNHelferlein.predict"><code>NNHelferlein.predict</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict(mdl; data, softmax=false)
+function predict(mdl, x; softmax=false )</code></pre><p>Return the prediction for minibatches of data.      The signature follows the standard call <code>predict(model, data=xxx)</code>.      The second signature predicts a single Array of data.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: executable network model</li><li><code>data=iterator</code>: iterator providing minibatches       of input data; if the minibatches include y-values        (i.e. teaching input), predictions (i.e. index of class with highest        value <em>and</em> the y-values will be returned. </li><li><code>data</code>: single Array of input data (i.e. input for one minibatch)</li><li><code>softmax</code>: if true or if model is of type <code>Classifier</code> the predicted       softmax probabilities are returned instead of raw       activations.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/train.jl#L576-L595">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict_top5" href="#NNHelferlein.predict_top5"><code>NNHelferlein.predict_top5</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict_top5(mdl; data, top_n=5, classes=nothing)</code></pre><p>Run the model <code>mdl</code> for data in minibatches <code>data</code> and print the top 5 predictions as softmax probabilities.</p><p><strong>Arguments:</strong></p><ul><li><code>top_n</code>: print top <em>n</em> hits</li><li><code>classes</code>: optional list of human readable class labels.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/train.jl#L545-L554">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.minibatch_eval" href="#NNHelferlein.minibatch_eval"><code>NNHelferlein.minibatch_eval</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function minibatch_eval(mdl, fun, data; o...)</code></pre><p>Given an accuracy or loss function <code>fun(p, y)</code> that returns an accuracy meassure for n-dimensional arrays of predictions <code>p</code> and  teaching input <code>y</code> (i.e. one minibatch of data),  <code>minibatch_eval()</code> applies the <code>fun()</code> to all minibatches supplied by  the minibatch iterator <code>data</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: model to compute predictions</li><li><code>fun</code>: evaluation function for one minibatch that returns the mean       of results for all samples of the minibatch</li><li><code>data</code>: iterator that supplies a Tuple of (x,y) for        each minibatch</li></ul><p><code>o...</code>: all additional keyword arguments are forwarded to         <code>fun()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L485-L502">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.squared_error_acc" href="#NNHelferlein.squared_error_acc"><code>NNHelferlein.squared_error_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function squared_error_acc(mdl; data)</code></pre><p>Return the <em>mean squared error</em> between the predictions  of the model <code>mdl</code> and the corresponding teaching input by providung the standard signature  <code>fun(model, data=iterator)</code>.</p><p><strong>Arguments</strong></p><ul><li><code>mdl</code>: model with the signature <code>mdl(x)</code> to generate predictions       for one minibatch (i.e. array) of data.</li><li><code>data</code>: iterator, providing (x,y)-tuples of training or validation        data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L440-L453">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.abs_error_acc" href="#NNHelferlein.abs_error_acc"><code>NNHelferlein.abs_error_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function abs_error_acc(mdl; data)</code></pre><p>Return the <em>mean absolute error</em> between the predictions  of the model <code>mdl</code> and the corresponding teaching input by providung the standard signature  <code>fun(model, data=iterator)</code>.</p><p><strong>Arguments</strong></p><ul><li><code>mdl</code>: model with the signature <code>mdl(x)</code> to generate predictions       for one minibatch (i.e. array) of data.</li><li><code>data</code>: iterator, providing (x,y)-tuples of training or validation        data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L465-L478">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.hamming_dist" href="#NNHelferlein.hamming_dist"><code>NNHelferlein.hamming_dist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function hamming_dist(p, t; accuracy=false, 
                             ignore_ctls=false, vocab=nothing, 
                             start=nothing, stop=nothing, pad=nothing, unk=nothing)
 
 
 function hamming_acc(p, t; o...)
 
-function hamming_acc(mdl; data=data, o...)</code></pre><p>Return the Hamming distance between two sequences or two minibatches of sequences. Predicted sequences <code>p</code> and teaching input sequences <code>t</code> may be of different length but the number of sequences in the minibatch must be the same.</p><p><strong>Arguments:</strong></p><ul><li><code>p</code>, <code>t</code>: n-dimensional arrays of type <code>Int</code> with predictions       and teaching input for a minibatch of sequences.       Shape of the arrays must be identical except of the first dimension       (i.e. the sequence length) that may differ between <code>p</code> and <code>t</code>.</li><li><code>accuracy=false</code>: if <code>false</code>, the mean Hamming distance in the minibatch       is returned (i.e. the average number of differences in the sequences).       If <code>true</code>, the accuracy is returned       for all not padded positions in a range (0.0 - 1.0).</li><li><code>ignore_ctls=false</code>: a vocab is used to replace all &#39;&lt;start&gt;, &lt;end&gt;, &lt;unknwon&gt;, &lt;pad&gt;&#39;       tokens by <code>&lt;pad&gt;</code>. If true, padding and other control tokens are treated as       normal codes and are not ignored.</li><li><code>vocab=nothing</code>: target laguage vocabulary of type <code>NNHelferlein.WordTokenizer</code>.       If defined,       the padding token of <code>vocab</code> is used to mask all control tokens in the       sequences (i.e. &#39;&lt;start&gt;, &lt;end&gt;, &lt;unknwon&gt;, &lt;pad&gt;&#39;).</li><li><code>start, stop, pad, unk</code>: may be used to define individual control tokens.       default is <code>nothing</code>.</li></ul><p><strong>Details:</strong></p><p>The function <code>hamming_acc()</code> is a shortcut to return the accuracy instead of the distance. The signature <code>hamming_acc(mdl; data=data; o...)</code> is for compatibility with acc functions called by train.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L167-L208">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.peak_finder_acc" href="#NNHelferlein.peak_finder_acc"><code>NNHelferlein.peak_finder_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function peak_finder_acc(p, t; ret=:f1, verbose=0, 
+function hamming_acc(mdl; data=data, o...)</code></pre><p>Return the Hamming distance between two sequences or two minibatches of sequences. Predicted sequences <code>p</code> and teaching input sequences <code>t</code> may be of different length but the number of sequences in the minibatch must be the same.</p><p><strong>Arguments:</strong></p><ul><li><code>p</code>, <code>t</code>: n-dimensional arrays of type <code>Int</code> with predictions       and teaching input for a minibatch of sequences.       Shape of the arrays must be identical except of the first dimension       (i.e. the sequence length) that may differ between <code>p</code> and <code>t</code>.</li><li><code>accuracy=false</code>: if <code>false</code>, the mean Hamming distance in the minibatch       is returned (i.e. the average number of differences in the sequences).       If <code>true</code>, the accuracy is returned       for all not padded positions in a range (0.0 - 1.0).</li><li><code>ignore_ctls=false</code>: a vocab is used to replace all &#39;&lt;start&gt;, &lt;end&gt;, &lt;unknwon&gt;, &lt;pad&gt;&#39;       tokens by <code>&lt;pad&gt;</code>. If true, padding and other control tokens are treated as       normal codes and are not ignored.</li><li><code>vocab=nothing</code>: target laguage vocabulary of type <code>NNHelferlein.WordTokenizer</code>.       If defined,       the padding token of <code>vocab</code> is used to mask all control tokens in the       sequences (i.e. &#39;&lt;start&gt;, &lt;end&gt;, &lt;unknwon&gt;, &lt;pad&gt;&#39;).</li><li><code>start, stop, pad, unk</code>: may be used to define individual control tokens.       default is <code>nothing</code>.</li></ul><p><strong>Details:</strong></p><p>The function <code>hamming_acc()</code> is a shortcut to return the accuracy instead of the distance. The signature <code>hamming_acc(mdl; data=data; o...)</code> is for compatibility with acc functions called by train.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L167-L208">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.peak_finder_acc" href="#NNHelferlein.peak_finder_acc"><code>NNHelferlein.peak_finder_acc</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function peak_finder_acc(p, t; ret=:f1, verbose=0, 
                          tolerance=1, limit=0.5
 
-function peak_finder_acc(mdl; data=data, o...)</code></pre><p>Calculate an accuracy-like measure for data series consisting  mainly of zeros and rare peaks. The function counts the number of peaks in <code>y</code> detected by <code>p</code>  (<em>true positives</em>), peaks not detected (<em>false negatives</em>)  and the number of peaks in <code>p</code> not present in <code>y</code>  (<em>false positives</em>).</p><p>It is assumed that peaks in <code>y</code> are marked by a single value higher as the limit (typically 1.0). Peaks in <code>p</code> may be  broader; and are defined as local maxima with a value above the limit. If the tolerance ist set to &gt; 0, it may happen that the peaks at the first  or last step are not evaluated (because evaluation stops at  <code>end-tolerance</code>).</p><p>If requested, <em>f1</em>, <em>G-mean</em> and <em>intersection over union</em>  are calulated from the raw values .</p><p><strong>Arguments:</strong></p><ul><li><code>p</code>, <code>t</code>: Predictions <code>p</code> and teaching input <code>t</code> (i.e. <code>y</code>) are mini-batches of           1-d series of data. The sequence must be in the 1st dimension           (column). All other dims are treated as separate windows           of length size(p/t,1).</li><li><code>ret</code>: return value as <code>Symbol</code>; one of        <code>:peaks</code>, <code>:recall</code>, <code>:precision</code>, <code>:miss_rate</code>, <code>:f1</code>,       <code>:g_mean</code>, <code>:iou</code> or <code>:all</code>.       If <code>:all</code> a named tuple is returned.</li><li><code>verbose=0</code>: if <code>0</code>, no additional output is generated;       if <code>1</code>, composite measures are printed to stdout;       if <code>2</code>, all raw counts are printed.</li><li><code>tolerance=1</code>: peak finder tolerance: The peak is defined as <em>correct</em>       if it is detected within the tolerance.</li><li><code>limit=0.5</code>: Only maxima with values above the limit are considered.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L8-L48">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.confusion_matrix" href="#NNHelferlein.confusion_matrix"><code>NNHelferlein.confusion_matrix</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function confusion_matrix(mdl; data, labels=nothing, pretty_print=true, accuracy=true)
-function confusion_matrix(y, p; labels=nothing, pretty_print=true, accuracy=true)</code></pre><p>Compute and display the confusion matrix of   (x,y)-minibatches. Predictions are calculated with model <code>mdl</code> for which  a signature <code>mdl(x)</code> must exist.</p><p>The second signature generates the confusion matrix from  the 2 vectors <em>ground truth</em> <code>y</code> and <em>predictions</em> <code>p</code>.</p><p>The function is an interface to the function <code>confusmat</code>  provided by the package <code>MLBase</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: mdl with signature <code>mdl(x)</code> to generate predictions</li><li><code>data</code>: minibatches of (x,y)-tuples</li><li><code>pretty_print=true</code>: if <code>true</code>, the matrix will pe displayed to stdout</li><li><code>labels=nothing</code>: a vecor of human readable labels can be provided</li><li><code>accuracy=true</code>: if <code>true</code>, accuracy, precisiomn and recall is printed        for all classes.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/acc.jl#L312-L334">source</a></section></article><h1 id="ImageNet-tools"><a class="docs-heading-anchor" href="#ImageNet-tools">ImageNet tools</a><a id="ImageNet-tools-1"></a><a class="docs-heading-anchor-permalink" href="#ImageNet-tools" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_vgg" href="#NNHelferlein.preproc_imagenet_vgg"><code>NNHelferlein.preproc_imagenet_vgg</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function preproc_imagenet_vgg(img)
+function peak_finder_acc(mdl; data=data, o...)</code></pre><p>Calculate an accuracy-like measure for data series consisting  mainly of zeros and rare peaks. The function counts the number of peaks in <code>y</code> detected by <code>p</code>  (<em>true positives</em>), peaks not detected (<em>false negatives</em>)  and the number of peaks in <code>p</code> not present in <code>y</code>  (<em>false positives</em>).</p><p>It is assumed that peaks in <code>y</code> are marked by a single value higher as the limit (typically 1.0). Peaks in <code>p</code> may be  broader; and are defined as local maxima with a value above the limit. If the tolerance ist set to &gt; 0, it may happen that the peaks at the first  or last step are not evaluated (because evaluation stops at  <code>end-tolerance</code>).</p><p>If requested, <em>f1</em>, <em>G-mean</em> and <em>intersection over union</em>  are calulated from the raw values .</p><p><strong>Arguments:</strong></p><ul><li><code>p</code>, <code>t</code>: Predictions <code>p</code> and teaching input <code>t</code> (i.e. <code>y</code>) are mini-batches of           1-d series of data. The sequence must be in the 1st dimension           (column). All other dims are treated as separate windows           of length size(p/t,1).</li><li><code>ret</code>: return value as <code>Symbol</code>; one of        <code>:peaks</code>, <code>:recall</code>, <code>:precision</code>, <code>:miss_rate</code>, <code>:f1</code>,       <code>:g_mean</code>, <code>:iou</code> or <code>:all</code>.       If <code>:all</code> a named tuple is returned.</li><li><code>verbose=0</code>: if <code>0</code>, no additional output is generated;       if <code>1</code>, composite measures are printed to stdout;       if <code>2</code>, all raw counts are printed.</li><li><code>tolerance=1</code>: peak finder tolerance: The peak is defined as <em>correct</em>       if it is detected within the tolerance.</li><li><code>limit=0.5</code>: Only maxima with values above the limit are considered.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L8-L48">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.confusion_matrix" href="#NNHelferlein.confusion_matrix"><code>NNHelferlein.confusion_matrix</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function confusion_matrix(mdl; data, labels=nothing, pretty_print=true, accuracy=true)
+function confusion_matrix(y, p; labels=nothing, pretty_print=true, accuracy=true)</code></pre><p>Compute and display the confusion matrix of   (x,y)-minibatches. Predictions are calculated with model <code>mdl</code> for which  a signature <code>mdl(x)</code> must exist.</p><p>The second signature generates the confusion matrix from  the 2 vectors <em>ground truth</em> <code>y</code> and <em>predictions</em> <code>p</code>.</p><p>The function is an interface to the function <code>confusmat</code>  provided by the package <code>MLBase</code>.</p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: mdl with signature <code>mdl(x)</code> to generate predictions</li><li><code>data</code>: minibatches of (x,y)-tuples</li><li><code>pretty_print=true</code>: if <code>true</code>, the matrix will pe displayed to stdout</li><li><code>labels=nothing</code>: a vecor of human readable labels can be provided</li><li><code>accuracy=true</code>: if <code>true</code>, accuracy, precisiomn and recall is printed        for all classes.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/acc.jl#L312-L334">source</a></section></article><h1 id="ImageNet-tools"><a class="docs-heading-anchor" href="#ImageNet-tools">ImageNet tools</a><a id="ImageNet-tools-1"></a><a class="docs-heading-anchor-permalink" href="#ImageNet-tools" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_vgg" href="#NNHelferlein.preproc_imagenet_vgg"><code>NNHelferlein.preproc_imagenet_vgg</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function preproc_imagenet_vgg(img)
 function preproc_imagenet_resnetv2(img)</code></pre><p>Image preprocessing for pre-trained ImageNet examples. Preprocessing includes</p><ul><li>bring RGB colour values into a range 0-255</li><li>standardise of colour values by substracting mean colour values   (103.939, 116.779, 123.68) from RGB</li><li>changing colour channel sequence from RGB to BGR</li><li>normalising or scaling colour values.</li></ul><p>Resize is <strong>not</strong> done, because this may be part of the augmentation pipeline.</p><p><strong>Details</strong></p><p>Unfortunately image preprocessing is not consistent between all  pretrained Tenrflow/Keras applications. As a result, different preprocessing functions must be used for different  pretrained applications:</p><ul><li><strong>VGG16, VGG19</strong>: <code>preproc_imagenet_vgg</code>  (colour space: BGR, values: 0 - 255, centered according to the imagenet training set)</li><li><strong>RESNET</strong>: <code>preproc_imagenet_resnet</code> (identical to vgg)</li><li><strong>RESNET V2</strong>: <code>preproc_imagenet_resnetv2</code> (colour space: RGB,  values: -1.0 - 1.0, scaled for each sample individually) </li></ul><p><strong>Examples:</strong></p><p>The function can be used with the image loader; for prediction with a trained model as:</p><pre><code class="language-julia hljs">pipl = CropRatio(ratio=1.0) |&gt; Resize(224,224)
 images = mk_image_minibatch(&quot;./example_pics&quot;, 16;
                     shuffle=false, train=false,
@@ -219,8 +219,8 @@
                     split=true, at=0.8, balanced=false,
                     shuffle=true, train=true,
                     aug_pipl=pipl,
-                    pre_proc=preproc_imagenet_vgg)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/imagenet.jl#L2-L55">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_resnet" href="#NNHelferlein.preproc_imagenet_resnet"><code>NNHelferlein.preproc_imagenet_resnet</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">preproc_imagenet_resnet(img)</code></pre><p>See documentation of <code>preproc_imagenet_vgg</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/imagenet.jl#L61-L65">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_resnetv2" href="#NNHelferlein.preproc_imagenet_resnetv2"><code>NNHelferlein.preproc_imagenet_resnetv2</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">preproc_imagenet_resnetv2(img)</code></pre><p>See documentation of <code>preproc_imagenet_vgg</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/imagenet.jl#L71-L75">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict_imagenet" href="#NNHelferlein.predict_imagenet"><code>NNHelferlein.predict_imagenet</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict_imagenet(mdl; data, top_n=5)</code></pre><p>Predict the ImageNet-class of images from the predefined list of class labels.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/imagenet.jl#L153-L158">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_imagenet_classes" href="#NNHelferlein.get_imagenet_classes"><code>NNHelferlein.get_imagenet_classes</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_imagenet_classes()</code></pre><p>Return a list of all 1000 ImageNet class labels.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/imagenet.jl#L130-L134">source</a></section></article><h1 id="Other-utils"><a class="docs-heading-anchor" href="#Other-utils">Other utils</a><a id="Other-utils-1"></a><a class="docs-heading-anchor-permalink" href="#Other-utils" title="Permalink"></a></h1><h2 id="Layers-and-helpers-for-transformers"><a class="docs-heading-anchor" href="#Layers-and-helpers-for-transformers">Layers and helpers for transformers</a><a id="Layers-and-helpers-for-transformers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers-and-helpers-for-transformers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PositionalEncoding" href="#NNHelferlein.PositionalEncoding"><code>NNHelferlein.PositionalEncoding</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PositionalEncoding &lt;: AbstractLayer</code></pre><p>Positional encoding layer. Only <em>sincos</em>-style (according to Vaswani, et al., NIPS 2017) is implemented.</p><p>The layer takes an array of any number of dimensions (&gt;=2), calculates the Vaswani-2017-style positional encoding and adds the encoding to each plane of the array.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L27-L36">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.positional_encoding_sincos" href="#NNHelferlein.positional_encoding_sincos"><code>NNHelferlein.positional_encoding_sincos</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function positional_encoding_sincos(n_embed, n_seq)</code></pre><p>Calculate and return a matrix of size <code>[n_embed, n_seq]</code> of positional encoding values following the sin and cos style in the paper <em>Vaswani, A. et al.; Attention Is All You Need; 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 2017.</em></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L7-L16">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_padding_mask" href="#NNHelferlein.mk_padding_mask"><code>NNHelferlein.mk_padding_mask</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_padding_mask(x; pad=TOKEN_PAD, add_dims=false)</code></pre><p>Make a padding mask; i.e. return an Array of type <code>KnetArray{Float32}</code> (or <code>Array{Float32}</code>) similar to <code>x</code> but with two additional dimensions of size 1 in the middle (this will represent the 2nd seq_len and the number of heads) in multi-head attention and the value <code>1.0</code> at each position where <code>x</code> is <code>pad</code> and <code>0.0</code> otherwise.</p><p>The function can be used for creating padding masks for attention mechanisms.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of sequences (typically a matrix with n<em>cols sequences   of length n</em>rows)</li><li><code>pad</code>: value for the token to be masked</li><li><code>add_dims</code>: if <code>true</code>, 2 additional dimensions are inserted to    return a 4-D-array as needed for transformer architectures. Otherwise   the size of the returned array is similar to <code>x</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L51-L71">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_peek_ahead_mask" href="#NNHelferlein.mk_peek_ahead_mask"><code>NNHelferlein.mk_peek_ahead_mask</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_peek_ahead_mask(x; dim=1)
-function mk_peek_ahead_mask(n_seq)</code></pre><p>Return a matrix of size <code>[n_seq, n_seq]</code> filled with 1.0 and the <em>uppper triangle</em> set to 0.0. Type is <code>CuArray{Float32}</code> in GPU context, <code>Array{Float32}</code> otherwise. The matrix can be used as peek-ahead mask in transformers.</p><p><code>dim=1</code> specifies the dimension in which the sequence length is represented. For un-embedded data this is normally <code>1</code>, i.e. the shape of <code>x</code> is [n<em>seq, n</em>mb]. After embedding the shape probably is [depth, n<em>seq, n</em>mb].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L82-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dot_prod_attn" href="#NNHelferlein.dot_prod_attn"><code>NNHelferlein.dot_prod_attn</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dot_prod_attn(q, k, v; mask=nothing)</code></pre><p>Generic scaled dot product attention following the paper of Vaswani et al., (2017), <em>Attention Is All You Need</em>.</p><p><strong>Arguments:</strong></p><ul><li><code>q</code>: query of size <code>[depth, n_seq_q, ...]</code></li><li><code>k</code>: key of size <code>[depth, n_seq_v, ...]</code></li><li><code>v</code>: value of size <code>[depth, n_seq_v, ...]</code></li><li><code>mask</code>: mask for attention factors may have different shapes but must be       broadcastable for addition to the scores tensor (which as the same size as       alpha <code>[n_seq_v, n_seq_q, ...]</code>). In transformer context typical masks are one of:       padding mask of size <code>[n_seq_v, ...]</code> or a peek-ahead mask of size <code>[n_seq_v, n_seq_v]</code>       (which is only possible in case of self-attention when all sequence lengths       are identical).</li></ul><p><code>q, k, v</code> must have matching leading dimensions (i.e. same depth or embedding). <code>k</code> and <code>v</code> must have the same sequence length.</p><p><strong>Return values:</strong></p><ul><li><code>c</code>: context as alpha-weighted sum of values with size [depth, n<em>seq</em>v, ...]</li><li><code>alpha</code>: attention factors of size [n<em>seq</em>v, n<em>seq</em>q, ...]</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L109-L132">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MultiHeadAttn" href="#NNHelferlein.MultiHeadAttn"><code>NNHelferlein.MultiHeadAttn</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct MultiHeadAttn &lt;: AbstractLayer</code></pre><p>Multi-headed attention layer, designed following the Vaswani, 2017 paper.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">MultiHeadAttn(depth, n_heads)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the attention.</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">function(mha::MultiHeadAttn)(q, k, v; mask=nothing)</code></pre><p><code>q, k, v</code> are 3-dimensional tensors of the same size ([depth, seq<em>len, n</em>minibatch]) and the optional mask must be of  size [seq<em>len, n</em>minibatch] and mark masked positions with 1.0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L147-L167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.separate_heads" href="#NNHelferlein.separate_heads"><code>NNHelferlein.separate_heads</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function separate_heads(x, n)</code></pre><p>Helper function for multi-headed attention mechanisms:  an additional second dimension is added to a tensor of minibatches by splitting the first (i.e. depth).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L201-L207">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.merge_heads" href="#NNHelferlein.merge_heads"><code>NNHelferlein.merge_heads</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function merge_heads(x)</code></pre><p>Helper to merge the result of multi-headed attention back to full depth .</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/transformers.jl#L215-L220">source</a></section></article><h2 id="Utils-for-array-manipulation"><a class="docs-heading-anchor" href="#Utils-for-array-manipulation">Utils for array manipulation</a><a id="Utils-for-array-manipulation-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-array-manipulation" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.crop_array" href="#NNHelferlein.crop_array"><code>NNHelferlein.crop_array</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function crop_array(x, crop_sizes)</code></pre><p>Crop a n-dimensional array to the given size. Cropping is always centered (i.e. a margin is removed).</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: n-dim AbstractArray</li><li><code>crop_sizes</code>: Tuple of target sizes to which the array is cropped.       Allowed values are Int or <code>:</code>. If <code>crop_sizes</code> defines less       dims as x has, the remaining dims will not be cropped (assuming <code>:</code>).       If a demanded crop size is bigger as the actual size of x,       it is ignored.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L23-L36">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.blowup_array" href="#NNHelferlein.blowup_array"><code>NNHelferlein.blowup_array</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function blowup_array(x, n)</p><p>Blow up an array <code>x</code> with an additional dimension and repeat the content of the array <code>n</code> times.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of any dimension</li><li><code>n</code>: number of repeats. ´n=1´ will return an</li></ul><p>array with an additional dimension of size 1.</p><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">julia&gt; x = [1,2,3,4]; blowup_array(x, 3)
+                    pre_proc=preproc_imagenet_vgg)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/imagenet.jl#L2-L55">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_resnet" href="#NNHelferlein.preproc_imagenet_resnet"><code>NNHelferlein.preproc_imagenet_resnet</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">preproc_imagenet_resnet(img)</code></pre><p>See documentation of <code>preproc_imagenet_vgg</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/imagenet.jl#L61-L65">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.preproc_imagenet_resnetv2" href="#NNHelferlein.preproc_imagenet_resnetv2"><code>NNHelferlein.preproc_imagenet_resnetv2</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">preproc_imagenet_resnetv2(img)</code></pre><p>See documentation of <code>preproc_imagenet_vgg</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/imagenet.jl#L71-L75">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.predict_imagenet" href="#NNHelferlein.predict_imagenet"><code>NNHelferlein.predict_imagenet</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function predict_imagenet(mdl; data, top_n=5)</code></pre><p>Predict the ImageNet-class of images from the predefined list of class labels.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/imagenet.jl#L153-L158">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_imagenet_classes" href="#NNHelferlein.get_imagenet_classes"><code>NNHelferlein.get_imagenet_classes</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_imagenet_classes()</code></pre><p>Return a list of all 1000 ImageNet class labels.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/imagenet.jl#L130-L134">source</a></section></article><h1 id="Other-utils"><a class="docs-heading-anchor" href="#Other-utils">Other utils</a><a id="Other-utils-1"></a><a class="docs-heading-anchor-permalink" href="#Other-utils" title="Permalink"></a></h1><h2 id="Layers-and-helpers-for-transformers"><a class="docs-heading-anchor" href="#Layers-and-helpers-for-transformers">Layers and helpers for transformers</a><a id="Layers-and-helpers-for-transformers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers-and-helpers-for-transformers" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.PositionalEncoding" href="#NNHelferlein.PositionalEncoding"><code>NNHelferlein.PositionalEncoding</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct PositionalEncoding &lt;: AbstractLayer</code></pre><p>Positional encoding layer. Only <em>sincos</em>-style (according to Vaswani, et al., NIPS 2017) is implemented.</p><p>The layer takes an array of any number of dimensions (&gt;=2), calculates the Vaswani-2017-style positional encoding and adds the encoding to each plane of the array.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L27-L36">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.positional_encoding_sincos" href="#NNHelferlein.positional_encoding_sincos"><code>NNHelferlein.positional_encoding_sincos</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function positional_encoding_sincos(n_embed, n_seq)</code></pre><p>Calculate and return a matrix of size <code>[n_embed, n_seq]</code> of positional encoding values following the sin and cos style in the paper <em>Vaswani, A. et al.; Attention Is All You Need; 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 2017.</em></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L7-L16">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_padding_mask" href="#NNHelferlein.mk_padding_mask"><code>NNHelferlein.mk_padding_mask</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_padding_mask(x; pad=TOKEN_PAD, add_dims=false)</code></pre><p>Make a padding mask; i.e. return an Array of type <code>KnetArray{Float32}</code> (or <code>Array{Float32}</code>) similar to <code>x</code> but with two additional dimensions of size 1 in the middle (this will represent the 2nd seq_len and the number of heads) in multi-head attention and the value <code>1.0</code> at each position where <code>x</code> is <code>pad</code> and <code>0.0</code> otherwise.</p><p>The function can be used for creating padding masks for attention mechanisms.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of sequences (typically a matrix with n<em>cols sequences   of length n</em>rows)</li><li><code>pad</code>: value for the token to be masked</li><li><code>add_dims</code>: if <code>true</code>, 2 additional dimensions are inserted to    return a 4-D-array as needed for transformer architectures. Otherwise   the size of the returned array is similar to <code>x</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L51-L71">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.mk_peek_ahead_mask" href="#NNHelferlein.mk_peek_ahead_mask"><code>NNHelferlein.mk_peek_ahead_mask</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function mk_peek_ahead_mask(x; dim=1)
+function mk_peek_ahead_mask(n_seq)</code></pre><p>Return a matrix of size <code>[n_seq, n_seq]</code> filled with 1.0 and the <em>uppper triangle</em> set to 0.0. Type is <code>CuArray{Float32}</code> in GPU context, <code>Array{Float32}</code> otherwise. The matrix can be used as peek-ahead mask in transformers.</p><p><code>dim=1</code> specifies the dimension in which the sequence length is represented. For un-embedded data this is normally <code>1</code>, i.e. the shape of <code>x</code> is [n<em>seq, n</em>mb]. After embedding the shape probably is [depth, n<em>seq, n</em>mb].</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L82-L95">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dot_prod_attn" href="#NNHelferlein.dot_prod_attn"><code>NNHelferlein.dot_prod_attn</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dot_prod_attn(q, k, v; mask=nothing)</code></pre><p>Generic scaled dot product attention following the paper of Vaswani et al., (2017), <em>Attention Is All You Need</em>.</p><p><strong>Arguments:</strong></p><ul><li><code>q</code>: query of size <code>[depth, n_seq_q, ...]</code></li><li><code>k</code>: key of size <code>[depth, n_seq_v, ...]</code></li><li><code>v</code>: value of size <code>[depth, n_seq_v, ...]</code></li><li><code>mask</code>: mask for attention factors may have different shapes but must be       broadcastable for addition to the scores tensor (which as the same size as       alpha <code>[n_seq_v, n_seq_q, ...]</code>). In transformer context typical masks are one of:       padding mask of size <code>[n_seq_v, ...]</code> or a peek-ahead mask of size <code>[n_seq_v, n_seq_v]</code>       (which is only possible in case of self-attention when all sequence lengths       are identical).</li></ul><p><code>q, k, v</code> must have matching leading dimensions (i.e. same depth or embedding). <code>k</code> and <code>v</code> must have the same sequence length.</p><p><strong>Return values:</strong></p><ul><li><code>c</code>: context as alpha-weighted sum of values with size [depth, n<em>seq</em>v, ...]</li><li><code>alpha</code>: attention factors of size [n<em>seq</em>v, n<em>seq</em>q, ...]</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L109-L132">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.MultiHeadAttn" href="#NNHelferlein.MultiHeadAttn"><code>NNHelferlein.MultiHeadAttn</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">struct MultiHeadAttn &lt;: AbstractLayer</code></pre><p>Multi-headed attention layer, designed following the Vaswani, 2017 paper.</p><p><strong>Constructor:</strong></p><pre><code class="nohighlight hljs">MultiHeadAttn(depth, n_heads)</code></pre><ul><li><code>depth</code>: Embedding depth</li><li><code>n_heads</code>: number of heads for the attention.</li></ul><p><strong>Signature:</strong></p><pre><code class="nohighlight hljs">function(mha::MultiHeadAttn)(q, k, v; mask=nothing)</code></pre><p><code>q, k, v</code> are 3-dimensional tensors of the same size ([depth, seq<em>len, n</em>minibatch]) and the optional mask must be of  size [seq<em>len, n</em>minibatch] and mark masked positions with 1.0.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L147-L167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.separate_heads" href="#NNHelferlein.separate_heads"><code>NNHelferlein.separate_heads</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function separate_heads(x, n)</code></pre><p>Helper function for multi-headed attention mechanisms:  an additional second dimension is added to a tensor of minibatches by splitting the first (i.e. depth).</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L201-L207">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.merge_heads" href="#NNHelferlein.merge_heads"><code>NNHelferlein.merge_heads</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function merge_heads(x)</code></pre><p>Helper to merge the result of multi-headed attention back to full depth .</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/transformers.jl#L215-L220">source</a></section></article><h2 id="Utils-for-array-manipulation"><a class="docs-heading-anchor" href="#Utils-for-array-manipulation">Utils for array manipulation</a><a id="Utils-for-array-manipulation-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-array-manipulation" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.crop_array" href="#NNHelferlein.crop_array"><code>NNHelferlein.crop_array</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function crop_array(x, crop_sizes)</code></pre><p>Crop a n-dimensional array to the given size. Cropping is always centered (i.e. a margin is removed).</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: n-dim AbstractArray</li><li><code>crop_sizes</code>: Tuple of target sizes to which the array is cropped.       Allowed values are Int or <code>:</code>. If <code>crop_sizes</code> defines less       dims as x has, the remaining dims will not be cropped (assuming <code>:</code>).       If a demanded crop size is bigger as the actual size of x,       it is ignored.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L23-L36">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.blowup_array" href="#NNHelferlein.blowup_array"><code>NNHelferlein.blowup_array</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function blowup_array(x, n)</p><p>Blow up an array <code>x</code> with an additional dimension and repeat the content of the array <code>n</code> times.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of any dimension</li><li><code>n</code>: number of repeats. ´n=1´ will return an</li></ul><p>array with an additional dimension of size 1.</p><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">julia&gt; x = [1,2,3,4]; blowup_array(x, 3)
 4×3 Array{Int64,2}:
  1  1  1
  2  2  2
@@ -239,7 +239,7 @@
 
 [:, :, 3] =
  1  2
- 3  4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L176-L212">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.recycle_array" href="#NNHelferlein.recycle_array"><code>NNHelferlein.recycle_array</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function recycle_array(x, n; dims=dims(x))</p><p>Recycle an array <code>x</code> along the specified dimension  (default the last dimension) and repeat the content of the array <code>n</code> times. The number of dims stays unchanged, but the array values are repeated <code>n</code> times.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of any dimension</li><li><code>n</code>: number of repeats. ´n=1´ will return an unchanged       array</li><li><code>dims</code>: dimension to be repeated.</li></ul><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">julia&gt; recycle_array([1,2],3)
+ 3  4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L176-L212">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.recycle_array" href="#NNHelferlein.recycle_array"><code>NNHelferlein.recycle_array</code></a> — <span class="docstring-category">Function</span></header><section><div><p>function recycle_array(x, n; dims=dims(x))</p><p>Recycle an array <code>x</code> along the specified dimension  (default the last dimension) and repeat the content of the array <code>n</code> times. The number of dims stays unchanged, but the array values are repeated <code>n</code> times.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Array of any dimension</li><li><code>n</code>: number of repeats. ´n=1´ will return an unchanged       array</li><li><code>dims</code>: dimension to be repeated.</li></ul><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">julia&gt; recycle_array([1,2],3)
 6-element Array{Int64,1}:
  1
  2
@@ -262,7 +262,7 @@
 3x3 Array{Int64,2}:
  1 2 3
  1 2 3
- 1 2 3</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L226-L269">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.de_embed" href="#NNHelferlein.de_embed"><code>NNHelferlein.de_embed</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function de_embed(x; remove_dim=false)</code></pre><p>Replace the maximum of the first dimension of an n-dimensional array by its index (aka argmax()). If <code>remove_dim</code> is true, the result has the first dimension removed; otherwise the returned array has the first dimension with size 1  (default).</p><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">&gt; x = [1 1 1
+ 1 2 3</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L226-L269">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.de_embed" href="#NNHelferlein.de_embed"><code>NNHelferlein.de_embed</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function de_embed(x; remove_dim=false)</code></pre><p>Replace the maximum of the first dimension of an n-dimensional array by its index (aka argmax()). If <code>remove_dim</code> is true, the result has the first dimension removed; otherwise the returned array has the first dimension with size 1  (default).</p><p><strong>Examples:</strong></p><pre><code class="language-Julia hljs">&gt; x = [1 1 1
        2 1 1
        1 2 1
        1 1 2]
@@ -274,19 +274,19 @@
 3-element Vector{Int64}:
  2
  3
- 4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L294-L319">source</a></section></article><h2 id="Utils-for-fixing-types-in-GPU-context"><a class="docs-heading-anchor" href="#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a><a id="Utils-for-fixing-types-in-GPU-context-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-fixing-types-in-GPU-context" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.init0" href="#NNHelferlein.init0"><code>NNHelferlein.init0</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function init0(siz...)</code></pre><p>Initialise a vector or array of size <code>siz</code> with zeros. If a GPU is detected type of the returned value is <code>KnetArray{Float32}</code>, otherwise <code>Array{Float32}</code>.</p><p><strong>Examples:</strong></p><pre><code class="nohighlight hljs">julia&gt; init0(2,10)
+ 4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L294-L319">source</a></section></article><h2 id="Utils-for-fixing-types-in-GPU-context"><a class="docs-heading-anchor" href="#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a><a id="Utils-for-fixing-types-in-GPU-context-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-fixing-types-in-GPU-context" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.init0" href="#NNHelferlein.init0"><code>NNHelferlein.init0</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function init0(siz...)</code></pre><p>Initialise a vector or array of size <code>siz</code> with zeros. If a GPU is detected type of the returned value is <code>KnetArray{Float32}</code>, otherwise <code>Array{Float32}</code>.</p><p><strong>Examples:</strong></p><pre><code class="nohighlight hljs">julia&gt; init0(2,10)
 2×10 Array{Float32,2}:
  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0
  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0  0.0
 
  julia&gt; init0(0,10)
- 0×10 Array{Float32,2}</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L61-L78">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.convert2CuArray" href="#NNHelferlein.convert2CuArray"><code>NNHelferlein.convert2CuArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
+ 0×10 Array{Float32,2}</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L61-L78">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.convert2CuArray" href="#NNHelferlein.convert2CuArray"><code>NNHelferlein.convert2CuArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
 function convert2KnetArray(x, innerType=Float32)
-function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L131">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.convert2KnetArray" href="#NNHelferlein.convert2KnetArray"><code>NNHelferlein.convert2KnetArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
+function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L131">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.convert2KnetArray" href="#NNHelferlein.convert2KnetArray"><code>NNHelferlein.convert2KnetArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
 function convert2KnetArray(x, innerType=Float32)
-function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L134">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.ifgpu" href="#NNHelferlein.ifgpu"><code>NNHelferlein.ifgpu</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
+function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L134">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.ifgpu" href="#NNHelferlein.ifgpu"><code>NNHelferlein.ifgpu</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function convert2CuArray(x, innerType=Float32)
 function convert2KnetArray(x, innerType=Float32)
-function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L107-L121">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.emptyCuArray" href="#NNHelferlein.emptyCuArray"><code>NNHelferlein.emptyCuArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function emptyCuArray(size...=(0,0);innerType=Float32)
+function ifgpu(x, innerType=Float32)</code></pre><p>Convert an array <code>x</code> to a <code>CuArray{Float32}</code> or whatever specified as innerType only in GPU context (if <code>CUDA.functional()</code>) or to an <code>Array{Float32}</code> otherwise. By converting, the data is copied to the GPU.</p><p><code>convert2KnetArray()</code> is kept as an alias for backward compatibility.    </p><p><code>ifgpu()</code> is an alias/shortcut to <code>convert2KnetArray()</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L107-L121">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.emptyCuArray" href="#NNHelferlein.emptyCuArray"><code>NNHelferlein.emptyCuArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function emptyCuArray(size...=(0,0);innerType=Float32)
 function emptyKnetArray(size...=(0,0);innerType=Float32)</code></pre><p>Return an empty CuArray with the specified dimensions. The  array may be empty (i.e. one dimension 0) or elements will be undefined.</p><p>By default an empty matrix is returned.</p><p><strong>Examples:</strong></p><pre><code class="language-julia hljs">&gt;&gt;&gt; emptyKnetArray(0,0)
 0×0 Knet.KnetArrays.KnetMatrix{Float32}
 
@@ -294,7 +294,7 @@
 0×0 Knet.KnetArrays.KnetMatrix{Float32}
 
 &gt;&gt;&gt; emptyKnetArray(0)
-0-element Knet.KnetArrays.KnetVector{Float32}</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/util.jl#L138-L158">source</a></section></article><h2 id="Utils-for-Bioinformatics"><a class="docs-heading-anchor" href="#Utils-for-Bioinformatics">Utils for Bioinformatics</a><a id="Utils-for-Bioinformatics-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-Bioinformatics" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.aminoacid_tokenizer" href="#NNHelferlein.aminoacid_tokenizer"><code>NNHelferlein.aminoacid_tokenizer</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">aminoacid_tokenizer(sec; ignore_unknown=true)</code></pre><p>Tokenize a protein sequence into amino acids using the following table:</p><pre><code class="nohighlight hljs">    Amino acid | Token | Description
+0-element Knet.KnetArrays.KnetVector{Float32}</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/util.jl#L138-L158">source</a></section></article><h2 id="Utils-for-Bioinformatics"><a class="docs-heading-anchor" href="#Utils-for-Bioinformatics">Utils for Bioinformatics</a><a id="Utils-for-Bioinformatics-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-Bioinformatics" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.aminoacid_tokenizer" href="#NNHelferlein.aminoacid_tokenizer"><code>NNHelferlein.aminoacid_tokenizer</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">aminoacid_tokenizer(sec; ignore_unknown=true)</code></pre><p>Tokenize a protein sequence into amino acids using the following table:</p><pre><code class="nohighlight hljs">    Amino acid | Token | Description
     --------------------------------
     C          | 1     | Cysteine
     S          | 2     | Serine
@@ -322,10 +322,10 @@
     J          | 23    | Leucine or Isoleucine
     U          | 24    | Selenocysteine
     X          | 25    | Unknown amino acid
-    .          | 26    | padding token</code></pre><p><strong>Arguments:</strong></p><ul><li><code>sec</code>: A string containing the protein sequence in uppercase or lowercase.        All other letters or symbols will be converted to the unknwon token.</li><li><code>ignore_unknown</code>: If <code>true</code>, unkown amino acids (i.e. &quot;X&quot;) will be converted                   to the padding token. If <code>false</code>, the embedding for &quot;X&quot; will                   be trained as for all other amino acids.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/bioinformatics.jl#L61-L103">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.embed_blosum62" href="#NNHelferlein.embed_blosum62"><code>NNHelferlein.embed_blosum62</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">embed_blosum62(x)</code></pre><p>Embed a protein sequence into a 21-dimensional vector using the BLOSUM62 amino acid substitution matrix. Aminoacid are encoded as with  <em>NNHelferleins</em> <code>aminoacid tokenizer</code> function. <code>x</code> can be any <code>AbstractArray</code> of <code>Int</code> and a dimension of size 21 will be added as the first dimension. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/bioinformatics.jl#L159-L167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.embed_vhse8" href="#NNHelferlein.embed_vhse8"><code>NNHelferlein.embed_vhse8</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">embed_vhse8(x)</code></pre><p>Embed a protein sequence into a 8-dimensional vector using the VHSE8 amino acid embedding scheme. Aminoacid are encoded as with  <em>NNHelferleins</em> <code>aminoacid tokenizer</code> function. <code>x</code> can be any <code>AbstractArray</code> of <code>Int</code> and a dimension of size 21 will be added as the first dimension. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/bioinformatics.jl#L174-L182">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.EmbedAminoAcids" href="#NNHelferlein.EmbedAminoAcids"><code>NNHelferlein.EmbedAminoAcids</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">EmbedAminoAcids &lt;: AbstractLayer</code></pre><p>Embed a protein sequence into a 21-dimensional vector using the BLOSUM62 amino acid substitution matrix or as a 8-dimensional vector using the VHSE8 parameters. Aminoacids must be encoded acording to  <em>NNHelferlein&#39;s</em> <code>aminoacid tokenizer</code> function.</p><p>Layer input a is a n-dimensional array of an Integer type. Output is a (n+1)-dimensional array of Float32 type with a first (added) dimension  of size 21 or 8.</p><p><strong>Constructor:</strong></p><ul><li><code>EmbedAminoAcids(embedding::Symbol=:blosum62)</code>: <ul><li><code>embedding=:blosum62</code>: Either <code>:blosum62</code> or <code>:vhse8</code>            to select the embedding scheme.</li></ul></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/bioinformatics.jl#L192-L209">source</a></section></article><h2 id="Saving,-loading-and-inspection-of-models"><a class="docs-heading-anchor" href="#Saving,-loading-and-inspection-of-models">Saving, loading and inspection of models</a><a id="Saving,-loading-and-inspection-of-models-1"></a><a class="docs-heading-anchor-permalink" href="#Saving,-loading-and-inspection-of-models" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.save_network" href="#NNHelferlein.save_network"><code>NNHelferlein.save_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">save_network(fname, mdl)</code></pre><p>Save a model as jld2-file.</p><p><strong>Arguments:</strong></p><ul><li><code>fname</code>: filename; if the name does not end with the extension <code>.jld2</code>,          it will be added.</li><li><code>mdl</code>: network model to be saved. The model will be copied to a          cpu-based model via <code>copy_network(mdl, to=:cpu)</code> before         saving, to remove hardware dependencies of          parameters on the gpu.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/io.jl#L75-L87">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.load_network" href="#NNHelferlein.load_network"><code>NNHelferlein.load_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">load_network(fname; to=:gpu)</code></pre><p>Load a model from a jld2-file.</p><p><strong>Arguments:</strong></p><ul><li><code>fname</code>: filename; if the name does not end with the extension <code>.jld2</code>,           it will be added.</li><li><code>to=:gpu</code>: by default, parameters are loaded as CuArrays, if          a functional gpu is detected. If <code>to=:cpu</code> is specified          parameters are loaded as cpu-arrays.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/io.jl#L97-L108">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.copy_network" href="#NNHelferlein.copy_network"><code>NNHelferlein.copy_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">copy_network(mdl::AbstractNN; to=:gpu)</code></pre><p>Returns a copy of a Helferlein model. <em>cave: the copy is generated by <code>Adapt.adapt()</code> and no deep copy!</em></p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: Network model of type <code>AbstractNN</code>.</li><li><code>to=:gpu</code>: by default all parameters of the copy are <code>CuArrays</code> for            GPU usage. If <code>to=:cpu</code> is specified, parameters             are Arrays and the model will be processed in the cpu.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/io.jl#L49-L61">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="Base.summary" href="#Base.summary"><code>Base.summary</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function summary(mdl)</code></pre><p>Print a network summary of any model of Type <code>AbstractNN</code>,  <code>AbstractChain</code> or <code>AbstractLayer</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L223-L228">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.print_network" href="#NNHelferlein.print_network"><code>NNHelferlein.print_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function print_network(mdl::AbstractNN)</code></pre><p>Alias to <code>summary()</code>, kept for backward compatibility only.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/nets.jl#L268-L272">source</a></section></article><h2 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_mit_nsr" href="#NNHelferlein.dataset_mit_nsr"><code>NNHelferlein.dataset_mit_nsr</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_mit_nsr(records=nothing; force=false)</code></pre><p>Retrieve the Physionet ECG data set: &quot;MIT-BIH Normal Sinus Rhythm Database&quot;. If necessary the data is downloaded from Zenodo (and stored in the <em>NNHelferlein</em> data directory,  <a href="https://doi.org/10.5281/zenodo.6526342"><img src="https://zenodo.org/badge/DOI/10.5281/zenodo.6526342.svg" alt="DOI"/></a>).</p><p>All 18 recordings are returned as a list of DataFrames.</p><p>ECGs from the MIT-NSR database with some modifications to make them more  suitable as playground data set for machine learning.</p><ul><li>all 18 ECGs are trimmed to approx. 50000 heart beats from a region  without recording errors</li><li>scaled to a range -1 to 1 (non-linear/tanh)</li><li>heart beats annotation as time series with  value 1.0 at the point of the annotated beat and 0.0 for all other times</li><li>additional heart beat column smoothed by applying a gaussian filter</li><li>provided as csv with columns &quot;time in sec&quot;, &quot;channel 1&quot;, &quot;channel 2&quot;,  &quot;beat&quot; and  &quot;smooth&quot;.</li></ul><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code> the download will be forced and local data will be        overwitten.</li><li><code>records</code>: list of records names to be downloaded.</li></ul><p><strong>Examples:</strong></p><pre><code class="language-juliaREPL hljs">nsr_16265 = dataset_mit_nsr(&quot;16265&quot;)
+    .          | 26    | padding token</code></pre><p><strong>Arguments:</strong></p><ul><li><code>sec</code>: A string containing the protein sequence in uppercase or lowercase.        All other letters or symbols will be converted to the unknwon token.</li><li><code>ignore_unknown</code>: If <code>true</code>, unkown amino acids (i.e. &quot;X&quot;) will be converted                   to the padding token. If <code>false</code>, the embedding for &quot;X&quot; will                   be trained as for all other amino acids.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/bioinformatics.jl#L61-L103">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.embed_blosum62" href="#NNHelferlein.embed_blosum62"><code>NNHelferlein.embed_blosum62</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">embed_blosum62(x)</code></pre><p>Embed a protein sequence into a 21-dimensional vector using the BLOSUM62 amino acid substitution matrix. Aminoacid are encoded as with  <em>NNHelferleins</em> <code>aminoacid tokenizer</code> function. <code>x</code> can be any <code>AbstractArray</code> of <code>Int</code> and a dimension of size 21 will be added as the first dimension. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/bioinformatics.jl#L159-L167">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.embed_vhse8" href="#NNHelferlein.embed_vhse8"><code>NNHelferlein.embed_vhse8</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">embed_vhse8(x)</code></pre><p>Embed a protein sequence into a 8-dimensional vector using the VHSE8 amino acid embedding scheme. Aminoacid are encoded as with  <em>NNHelferleins</em> <code>aminoacid tokenizer</code> function. <code>x</code> can be any <code>AbstractArray</code> of <code>Int</code> and a dimension of size 21 will be added as the first dimension. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/bioinformatics.jl#L174-L182">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.EmbedAminoAcids" href="#NNHelferlein.EmbedAminoAcids"><code>NNHelferlein.EmbedAminoAcids</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">EmbedAminoAcids &lt;: AbstractLayer</code></pre><p>Embed a protein sequence into a 21-dimensional vector using the BLOSUM62 amino acid substitution matrix or as a 8-dimensional vector using the VHSE8 parameters. Aminoacids must be encoded acording to  <em>NNHelferlein&#39;s</em> <code>aminoacid tokenizer</code> function.</p><p>Layer input a is a n-dimensional array of an Integer type. Output is a (n+1)-dimensional array of Float32 type with a first (added) dimension  of size 21 or 8.</p><p><strong>Constructor:</strong></p><ul><li><code>EmbedAminoAcids(embedding::Symbol=:blosum62)</code>: <ul><li><code>embedding=:blosum62</code>: Either <code>:blosum62</code> or <code>:vhse8</code>            to select the embedding scheme.</li></ul></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/bioinformatics.jl#L192-L209">source</a></section></article><h2 id="Saving,-loading-and-inspection-of-models"><a class="docs-heading-anchor" href="#Saving,-loading-and-inspection-of-models">Saving, loading and inspection of models</a><a id="Saving,-loading-and-inspection-of-models-1"></a><a class="docs-heading-anchor-permalink" href="#Saving,-loading-and-inspection-of-models" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.save_network" href="#NNHelferlein.save_network"><code>NNHelferlein.save_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">save_network(fname, mdl)</code></pre><p>Save a model as jld2-file.</p><p><strong>Arguments:</strong></p><ul><li><code>fname</code>: filename; if the name does not end with the extension <code>.jld2</code>,          it will be added.</li><li><code>mdl</code>: network model to be saved. The model will be copied to a          cpu-based model via <code>copy_network(mdl, to=:cpu)</code> before         saving, to remove hardware dependencies of          parameters on the gpu.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/io.jl#L75-L87">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.load_network" href="#NNHelferlein.load_network"><code>NNHelferlein.load_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">load_network(fname; to=:gpu)</code></pre><p>Load a model from a jld2-file.</p><p><strong>Arguments:</strong></p><ul><li><code>fname</code>: filename; if the name does not end with the extension <code>.jld2</code>,           it will be added.</li><li><code>to=:gpu</code>: by default, parameters are loaded as CuArrays, if          a functional gpu is detected. If <code>to=:cpu</code> is specified          parameters are loaded as cpu-arrays.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/io.jl#L97-L108">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.copy_network" href="#NNHelferlein.copy_network"><code>NNHelferlein.copy_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">copy_network(mdl::AbstractNN; to=:gpu)</code></pre><p>Returns a copy of a Helferlein model. <em>cave: the copy is generated by <code>Adapt.adapt()</code> and no deep copy!</em></p><p><strong>Arguments:</strong></p><ul><li><code>mdl</code>: Network model of type <code>AbstractNN</code>.</li><li><code>to=:gpu</code>: by default all parameters of the copy are <code>CuArrays</code> for            GPU usage. If <code>to=:cpu</code> is specified, parameters             are Arrays and the model will be processed in the cpu.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/io.jl#L49-L61">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="Base.summary" href="#Base.summary"><code>Base.summary</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function summary(mdl)</code></pre><p>Print a network summary of any model of Type <code>AbstractNN</code>,  <code>AbstractChain</code> or <code>AbstractLayer</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L223-L228">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.print_network" href="#NNHelferlein.print_network"><code>NNHelferlein.print_network</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function print_network(mdl::AbstractNN)</code></pre><p>Alias to <code>summary()</code>, kept for backward compatibility only.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/nets.jl#L268-L272">source</a></section></article><h2 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_mit_nsr" href="#NNHelferlein.dataset_mit_nsr"><code>NNHelferlein.dataset_mit_nsr</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_mit_nsr(records=nothing; force=false)</code></pre><p>Retrieve the Physionet ECG data set: &quot;MIT-BIH Normal Sinus Rhythm Database&quot;. If necessary the data is downloaded from Zenodo (and stored in the <em>NNHelferlein</em> data directory,  <a href="https://doi.org/10.5281/zenodo.6526342"><img src="https://zenodo.org/badge/DOI/10.5281/zenodo.6526342.svg" alt="DOI"/></a>).</p><p>All 18 recordings are returned as a list of DataFrames.</p><p>ECGs from the MIT-NSR database with some modifications to make them more  suitable as playground data set for machine learning.</p><ul><li>all 18 ECGs are trimmed to approx. 50000 heart beats from a region  without recording errors</li><li>scaled to a range -1 to 1 (non-linear/tanh)</li><li>heart beats annotation as time series with  value 1.0 at the point of the annotated beat and 0.0 for all other times</li><li>additional heart beat column smoothed by applying a gaussian filter</li><li>provided as csv with columns &quot;time in sec&quot;, &quot;channel 1&quot;, &quot;channel 2&quot;,  &quot;beat&quot; and  &quot;smooth&quot;.</li></ul><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code> the download will be forced and local data will be        overwitten.</li><li><code>records</code>: list of records names to be downloaded.</li></ul><p><strong>Examples:</strong></p><pre><code class="language-juliaREPL hljs">nsr_16265 = dataset_mit_nsr(&quot;16265&quot;)
 nsr_16265 = dataset_mit_nsr([&quot;16265&quot;, &quot;19830&quot;])
-nsr_all = dataset_mit_nsr()</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/datasets.jl#L153-L188">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_mnist" href="#NNHelferlein.dataset_mnist"><code>NNHelferlein.dataset_mnist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_mnist(; force=false)</code></pre><p>Download the MNIST dataset with help of <code>MLDatasets.jl</code> from  Yann LeCun&#39;s official website. 4 arrays <code>xtrn, ytrn, xtst, ytst</code> are returned. </p><p><code>xtrn</code> and <code>xtst</code> will be the images as a multi-dimensional array, and <code>ytrn</code> and <code>ytst</code> the corresponding labels as integers.</p><p>The image(s) is/are returned in the horizontal-major memory layout as a single numeric array of eltype <code>Float32</code>.  The values are scaled to be between 0 and 1.  The labels are returned as a vector of <code>Int8</code>.</p><p>In the  teaching input (i.e. <code>y</code>) the digit <code>0</code> is encoded as <code>10</code>.</p><p>The data is stored in the <em>Helferlein</em> data directory and only downloaded the files are not already saved.</p><p>Ref.:  Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. &quot;Gradient-based learning applied to document recognition.&quot; <em>Proceedings of the IEEE,</em> 86(11):2278-2324, November 1998       <a href="http://yann.lecun.com/exdb/mnist/">http://yann.lecun.com/exdb/mnist/</a>.</p><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code>, the dataset download will be forced.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/datasets.jl#L230-L259">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_fashion_mnist" href="#NNHelferlein.dataset_fashion_mnist"><code>NNHelferlein.dataset_fashion_mnist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_fashion_mnist(; force=false)</code></pre><p>Download Zalando&#39;s Fashion-MNIST datset with help of <code>MLDatasets.jl</code>  from https://github.com/zalandoresearch/fashion-mnist.</p><p>4 arrays <code>xtrn, ytrn, xtst, ytst</code> are returned in the  same structure as the original MNIST dataset.</p><p>The data is stored in the <em>Helferlein</em> data directory and only downloaded the files are not already saved.</p><p>Authors: Han Xiao, Kashif Rasul, Roland Vollgraf</p><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code>, the dataset download will be forced.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/datasets.jl#L277-L294">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_iris" href="#NNHelferlein.dataset_iris"><code>NNHelferlein.dataset_iris</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_iris()</code></pre><p>Return Fisher&#39;s <em>iris</em> dataset of 150 records as dataframe.</p><p>Ref: Fisher,R.A.  &quot;The use of multiple measurements in taxonomic problems&quot;  <em>Annual Eugenics</em>, 7, Part II, 179-188 (1936);  also in &quot;Contributions to Mathematical Statistics&quot; (John Wiley, NY, 1950).      <a href="https://archive.ics.uci.edu/ml/datasets/Iris">https://archive.ics.uci.edu/ml/datasets/Iris</a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/datasets.jl#L347-L357">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_pfam" href="#NNHelferlein.dataset_pfam"><code>NNHelferlein.dataset_pfam</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_pfam(records; force=false)</code></pre><p>Retrieve the curated PFAM protein families database from Zenodo including 46872 sequences from 62 families. Sequences are between 100 and 1000 amino acids long and families have between 100 and 200 memebers. Training and test data are padded to a length of 1000 amino acids with the padding token of the amino acid tokenizer (26).</p><p>More information about the data set can be found at  <a href="https://zenodo.org/record/8138939">https://zenodo.org/record/8138939</a>, including PDB sequence IDs for each data table.</p><p><strong>Available records:</strong></p><ul><li><code>:raw</code>: dataframe with all (46872) rows of data and the columns <em>ID</em> (PDB-ID),            <em>family</em> (family name) and <em>sequence</em> (amino acid sequence)</li><li><code>:families</code>: list of all family names as dataframe with the  columns            <em>class</em> (cnumeric class ID 1-62), <em>family</em> (family name) and           and <em>count</em> (number of family members in the dataset)</li><li><code>:aminoacids</code>: list of amino acid tokes as dataframe with the columns           <em>Token</em> (aa token 1-26), <em>One-Letter</em> (one-letter code of the amino acid),           and <em>Amino acid</em> (full name of the amino acid)</li><li><code>:train</code>: dataframe with 42187 rows of training data and labels           with the class ID as first column and the            amino acid tokens as columns 2-1001 (padded to 1000 amino acids)</li><li><code>:test</code>: dataframe with 4687 rows of test data in the same format as the training data</li><li><code>:balanced_train</code>: dataframe with 111601 rows of balanced training data in the same format            as the training data. The data is balanced by sampling 1800 sequences from each family.</li><li><code>:balanced_test</code>: dataframe with 12401 rows of balanced test data in the same format as the training data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/datasets.jl#L36-L65">source</a></section></article><h1 id="Pretrained-networks"><a class="docs-heading-anchor" href="#Pretrained-networks">Pretrained networks</a><a id="Pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-networks" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_vgg16" href="#NNHelferlein.get_vgg16"><code>NNHelferlein.get_vgg16</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_vgg16(; filters_only=false, trainable=true)</code></pre><p>Return a VGG16 model with pretrained parameters from Tensorflow/Keras applications API. For details about original model  and training see <a href="https://keras.io/api/applications/"><code>Keras Applications</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>filters_only=false</code>: if <code>true</code>, only the filterstack is returned              (without Flatten() and classifier) to be integrated in to            any chain.</li><li><code>trainable=true</code>: if <code>true</code>, the filterstack is set trainable, otherwise           only the classifier part is trainable and the filter weights are            fixed.</li></ul><p><strong>Details:</strong></p><p>The model weights are imported from the respective Keras <em>Application</em>, which is trained with preprocessed images of size 224x224 pixel. Image data format must be colour channels <code>BGR</code> and  colour values <code>0.0 - 1.0</code>.</p><p>This can be re-built by using a preprocessing pipeline and the <em>Helferlein</em>-function <code>preproc_imagenet_vgg()</code> from a directory <code>img_path</code> with images:</p><pre><code class="language-julia hljs">pipl = CropRatio(ratio=1.0) |&gt; Resize(224,224)
+nsr_all = dataset_mit_nsr()</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/datasets.jl#L153-L188">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_mnist" href="#NNHelferlein.dataset_mnist"><code>NNHelferlein.dataset_mnist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_mnist(; force=false)</code></pre><p>Download the MNIST dataset with help of <code>MLDatasets.jl</code> from  Yann LeCun&#39;s official website. 4 arrays <code>xtrn, ytrn, xtst, ytst</code> are returned. </p><p><code>xtrn</code> and <code>xtst</code> will be the images as a multi-dimensional array, and <code>ytrn</code> and <code>ytst</code> the corresponding labels as integers.</p><p>The image(s) is/are returned in the horizontal-major memory layout as a single numeric array of eltype <code>Float32</code>.  The values are scaled to be between 0 and 1.  The labels are returned as a vector of <code>Int8</code>.</p><p>In the  teaching input (i.e. <code>y</code>) the digit <code>0</code> is encoded as <code>10</code>.</p><p>The data is stored in the <em>Helferlein</em> data directory and only downloaded the files are not already saved.</p><p>Ref.:  Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. &quot;Gradient-based learning applied to document recognition.&quot; <em>Proceedings of the IEEE,</em> 86(11):2278-2324, November 1998       <a href="http://yann.lecun.com/exdb/mnist/">http://yann.lecun.com/exdb/mnist/</a>.</p><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code>, the dataset download will be forced.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/datasets.jl#L230-L259">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_fashion_mnist" href="#NNHelferlein.dataset_fashion_mnist"><code>NNHelferlein.dataset_fashion_mnist</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_fashion_mnist(; force=false)</code></pre><p>Download Zalando&#39;s Fashion-MNIST datset with help of <code>MLDatasets.jl</code>  from https://github.com/zalandoresearch/fashion-mnist.</p><p>4 arrays <code>xtrn, ytrn, xtst, ytst</code> are returned in the  same structure as the original MNIST dataset.</p><p>The data is stored in the <em>Helferlein</em> data directory and only downloaded the files are not already saved.</p><p>Authors: Han Xiao, Kashif Rasul, Roland Vollgraf</p><p><strong>Arguments:</strong></p><ul><li><code>force=false</code>: if <code>true</code>, the dataset download will be forced.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/datasets.jl#L277-L294">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_iris" href="#NNHelferlein.dataset_iris"><code>NNHelferlein.dataset_iris</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_iris()</code></pre><p>Return Fisher&#39;s <em>iris</em> dataset of 150 records as dataframe.</p><p>Ref: Fisher,R.A.  &quot;The use of multiple measurements in taxonomic problems&quot;  <em>Annual Eugenics</em>, 7, Part II, 179-188 (1936);  also in &quot;Contributions to Mathematical Statistics&quot; (John Wiley, NY, 1950).      <a href="https://archive.ics.uci.edu/ml/datasets/Iris">https://archive.ics.uci.edu/ml/datasets/Iris</a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/datasets.jl#L347-L357">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.dataset_pfam" href="#NNHelferlein.dataset_pfam"><code>NNHelferlein.dataset_pfam</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function dataset_pfam(records; force=false)</code></pre><p>Retrieve the curated PFAM protein families database from Zenodo including 46872 sequences from 62 families. Sequences are between 100 and 1000 amino acids long and families have between 100 and 200 memebers. Training and test data are padded to a length of 1000 amino acids with the padding token of the amino acid tokenizer (26).</p><p>More information about the data set can be found at  <a href="https://zenodo.org/record/8138939">https://zenodo.org/record/8138939</a>, including PDB sequence IDs for each data table.</p><p><strong>Available records:</strong></p><ul><li><code>:raw</code>: dataframe with all (46872) rows of data and the columns <em>ID</em> (PDB-ID),            <em>family</em> (family name) and <em>sequence</em> (amino acid sequence)</li><li><code>:families</code>: list of all family names as dataframe with the  columns            <em>class</em> (cnumeric class ID 1-62), <em>family</em> (family name) and           and <em>count</em> (number of family members in the dataset)</li><li><code>:aminoacids</code>: list of amino acid tokes as dataframe with the columns           <em>Token</em> (aa token 1-26), <em>One-Letter</em> (one-letter code of the amino acid),           and <em>Amino acid</em> (full name of the amino acid)</li><li><code>:train</code>: dataframe with 42187 rows of training data and labels           with the class ID as first column and the            amino acid tokens as columns 2-1001 (padded to 1000 amino acids)</li><li><code>:test</code>: dataframe with 4687 rows of test data in the same format as the training data</li><li><code>:balanced_train</code>: dataframe with 111601 rows of balanced training data in the same format            as the training data. The data is balanced by sampling 1800 sequences from each family.</li><li><code>:balanced_test</code>: dataframe with 12401 rows of balanced test data in the same format as the training data.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/datasets.jl#L36-L65">source</a></section></article><h1 id="Pretrained-networks"><a class="docs-heading-anchor" href="#Pretrained-networks">Pretrained networks</a><a id="Pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-networks" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_vgg16" href="#NNHelferlein.get_vgg16"><code>NNHelferlein.get_vgg16</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_vgg16(; filters_only=false, trainable=true)</code></pre><p>Return a VGG16 model with pretrained parameters from Tensorflow/Keras applications API. For details about original model  and training see <a href="https://keras.io/api/applications/"><code>Keras Applications</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>filters_only=false</code>: if <code>true</code>, only the filterstack is returned              (without Flatten() and classifier) to be integrated in to            any chain.</li><li><code>trainable=true</code>: if <code>true</code>, the filterstack is set trainable, otherwise           only the classifier part is trainable and the filter weights are            fixed.</li></ul><p><strong>Details:</strong></p><p>The model weights are imported from the respective Keras <em>Application</em>, which is trained with preprocessed images of size 224x224 pixel. Image data format must be colour channels <code>BGR</code> and  colour values <code>0.0 - 1.0</code>.</p><p>This can be re-built by using a preprocessing pipeline and the <em>Helferlein</em>-function <code>preproc_imagenet_vgg()</code> from a directory <code>img_path</code> with images:</p><pre><code class="language-julia hljs">pipl = CropRatio(ratio=1.0) |&gt; Resize(224,224)
 mini_batches = mk_image_minibatch(img_path, 2, train=false, 
-        aug_pipl=pipl, pre_proc=preproc_imagenet_vgg)</code></pre><p>Model structure is: <a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/assets/netron-vgg16-w200.png"><code>VGG16 topology plot created by netron</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/pretrained.jl#L81-L119">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_resnet50v2" href="#NNHelferlein.get_resnet50v2"><code>NNHelferlein.get_resnet50v2</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_resnet50v2(; filters_only=false, trainable=true)</code></pre><p>Return a ResNet50 v2 model with pretrained parameters from Tensorflow/Keras applications API. For details about original model  and training see <a href="https://keras.io/api/applications/"><code>Keras Applications</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>filters_only=false</code>: if <code>true</code>, only the filterstack is returned              (without Flatten() and classifier) to be integrated in to            any chain.</li><li><code>trainable=true</code>: if <code>true</code>, the filterstack is set trainable, otherwise           only the classifier part is trainable and the filter weights are            fixed.</li></ul><p><strong>Details:</strong></p><p>The model weights are imported from the respective Keras <em>Application</em>, which is trained with images of size 224x224 pixel.      <em>Cave:</em> The training set images have not been preprocessed with the  imagenet default procedure! In contrats image data format must be colour channels <code>RGB</code> and  colour values <code>0.0 - 1.0</code>.</p><p>This can be re-built by using a preprocessing pipeline with application <code>preproc_imagenet_resnetv2()</code> from a directory <code>img_path</code> with images:</p><pre><code class="language-julia hljs">pipl = CropRatio(ratio=1.0) |&gt; Resize(224,224)
+        aug_pipl=pipl, pre_proc=preproc_imagenet_vgg)</code></pre><p>Model structure is: <a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/assets/netron-vgg16-w200.png"><code>VGG16 topology plot created by netron</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/pretrained.jl#L81-L119">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNHelferlein.get_resnet50v2" href="#NNHelferlein.get_resnet50v2"><code>NNHelferlein.get_resnet50v2</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">function get_resnet50v2(; filters_only=false, trainable=true)</code></pre><p>Return a ResNet50 v2 model with pretrained parameters from Tensorflow/Keras applications API. For details about original model  and training see <a href="https://keras.io/api/applications/"><code>Keras Applications</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>filters_only=false</code>: if <code>true</code>, only the filterstack is returned              (without Flatten() and classifier) to be integrated in to            any chain.</li><li><code>trainable=true</code>: if <code>true</code>, the filterstack is set trainable, otherwise           only the classifier part is trainable and the filter weights are            fixed.</li></ul><p><strong>Details:</strong></p><p>The model weights are imported from the respective Keras <em>Application</em>, which is trained with images of size 224x224 pixel.      <em>Cave:</em> The training set images have not been preprocessed with the  imagenet default procedure! In contrats image data format must be colour channels <code>RGB</code> and  colour values <code>0.0 - 1.0</code>.</p><p>This can be re-built by using a preprocessing pipeline with application <code>preproc_imagenet_resnetv2()</code> from a directory <code>img_path</code> with images:</p><pre><code class="language-julia hljs">pipl = CropRatio(ratio=1.0) |&gt; Resize(224,224)
 mini_batches = mk_image_minibatch(img_path, 2, train=false, 
-        aug_pipl=pipl, pre_proc=preproc_imagenet_resnetv2)</code></pre><p>Model structure is: <a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/assets/netron-resnet50v2.png"><code>ResNet50 V2 topology plot created by netron</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/638b28457e906b0c6e7e93104d97b5c12bd37c2e/src/pretrained.jl#L169-L208">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../api_overview/">« API Overview</a><a class="docs-footer-nextpage" href="../license/">License »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+        aug_pipl=pipl, pre_proc=preproc_imagenet_resnetv2)</code></pre><p>Model structure is: <a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/assets/netron-resnet50v2.png"><code>ResNet50 V2 topology plot created by netron</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/KnetML/NNHelferlein.jl/blob/b53d8d9de345d25a1d193d3a757fe21c620f60dd/src/pretrained.jl#L169-L208">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../api_overview/">« API Overview</a><a class="docs-footer-nextpage" href="../license/">License »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/api_overview/index.html b/dev/api_overview/index.html
index 69a3e348..01f47c1b 100644
--- a/dev/api_overview/index.html
+++ b/dev/api_overview/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Overview · NNHelferlein.jl</title><meta name="title" content="API Overview · NNHelferlein.jl"/><meta property="og:title" content="API Overview · NNHelferlein.jl"/><meta property="twitter:title" content="API Overview · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li class="is-active"><a class="tocitem" href>API Overview</a><ul class="internal"><li class="toplevel"><a class="tocitem" href="#Layers"><span>Layers</span></a></li><li class="toplevel"><a class="tocitem" href="#Activation-functions"><span>Activation functions</span></a></li><li class="toplevel"><a class="tocitem" href="#Data-provider-utilities"><span>Data provider utilities</span></a></li><li class="toplevel"><a class="tocitem" href="#Iteration-utilities"><span>Iteration utilities</span></a></li><li class="toplevel"><a class="tocitem" href="#Training"><span>Training</span></a></li><li class="toplevel"><a class="tocitem" href="#Other-utils"><span>Other utils</span></a></li><li class="toplevel"><a class="tocitem" href="#Pretrained-networks"><span>Pretrained networks</span></a></li></ul></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Overview</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Overview</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/api_overview.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Networks-and-chains"><a class="docs-heading-anchor" href="#Networks-and-chains">Networks and chains</a><a id="Networks-and-chains-1"></a><a class="docs-heading-anchor-permalink" href="#Networks-and-chains" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.AbstractNN"><code>AbstractNN</code></a> - <em>Helferlein</em> network type</li><li><a href="../api/#NNHelferlein.AbstractChain"><code>AbstractChain</code></a> - <em>Helferlein</em> chain type</li></ul><ul><li><p><a href="../api/#NNHelferlein.Classifier"><code>Classifier</code></a> - network with NLL loss</p></li><li><p><a href="../api/#NNHelferlein.Regressor"><code>Regressor</code></a> - network with MSE soll</p></li><li><p><a href="../api/#NNHelferlein.VAE"><code>VAE</code></a> - variational autoencoder wrapper The VAE supports ramp-up of the KL-weight beta via the functions <a href="../api/#NNHelferlein.set_beta!"><code>set_beta!</code></a> and <a href="../api/#NNHelferlein.get_beta"><code>get_beta</code></a>.</p></li><li><p><a href="../api/#NNHelferlein.Chain"><code>Chain</code></a></p></li></ul><h3 id="Network-helpers"><a class="docs-heading-anchor" href="#Network-helpers">Network helpers</a><a id="Network-helpers-1"></a><a class="docs-heading-anchor-permalink" href="#Network-helpers" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.add_layer!"><code>add_layer!</code></a></p></li><li><p><a href="../api/#NNHelferlein.add_layer!"><code>+</code></a></p></li><li><p><a href="../api/#Base.summary"><code>summary</code></a></p></li><li><p><a href="../api/#NNHelferlein.save_network"><code>save_network</code></a> - save as jld2 file</p></li><li><p><a href="../api/#NNHelferlein.load_network"><code>load_network</code></a></p></li><li><p><a href="../api/#NNHelferlein.copy_network"><code>copy_network</code></a> - copy from and to GPU</p></li></ul><h1 id="Layers"><a class="docs-heading-anchor" href="#Layers">Layers</a><a id="Layers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.AbstractLayer"><code>AbstractLayer</code></a></li></ul><h3 id="Fully-connected-layers"><a class="docs-heading-anchor" href="#Fully-connected-layers">Fully connected layers</a><a id="Fully-connected-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Fully-connected-layers" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.Dense"><code>Dense</code></a></li><li><a href="../api/#NNHelferlein.Linear"><code>Linear</code></a></li><li><a href="../api/#NNHelferlein.Embed"><code>Embed</code></a></li><li><a href="../api/#NNHelferlein.FeatureSelection"><code>FeatureSelection</code></a></li></ul><h3 id="Convolutional"><a class="docs-heading-anchor" href="#Convolutional">Convolutional</a><a id="Convolutional-1"></a><a class="docs-heading-anchor-permalink" href="#Convolutional" title="Permalink"></a></h3><p>Layers for convolutional networks:</p><ul><li><a href="../api/#NNHelferlein.Conv"><code>Conv</code></a></li><li><a href="../api/#NNHelferlein.DeConv"><code>DeConv</code></a></li><li><a href="../api/#NNHelferlein.ResNetBlock"><code>ResNetBlock</code></a></li><li><a href="../api/#NNHelferlein.DepthwiseConv"><code>DepthwiseConv</code></a></li><li><a href="../api/#NNHelferlein.Pool"><code>Pool</code></a></li><li><a href="../api/#NNHelferlein.UnPool"><code>UnPool</code></a></li><li><a href="../api/#NNHelferlein.Pad"><code>Pad</code></a></li><li><a href="../api/#NNHelferlein.Flat"><code>Flat</code></a></li><li><a href="../api/#NNHelferlein.PyFlat"><code>PyFlat</code></a></li><li><a href="../api/#NNHelferlein.GlobalAveragePooling"><code>GlobalAveragePooling</code></a></li></ul><h3 id="Recurrent"><a class="docs-heading-anchor" href="#Recurrent">Recurrent</a><a id="Recurrent-1"></a><a class="docs-heading-anchor-permalink" href="#Recurrent" title="Permalink"></a></h3><p>Layers for recurrent networks:</p><ul><li><a href="../api/#NNHelferlein.Recurrent"><code>Recurrent</code></a> - type for recurrent layers</li><li><a href="../api/#NNHelferlein.RecurrentUnit"><code>RecurrentUnit</code></a> - type for recurrent units</li></ul><h4 id="Helpers-for-recurrent-networks"><a class="docs-heading-anchor" href="#Helpers-for-recurrent-networks">Helpers for recurrent networks</a><a id="Helpers-for-recurrent-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Helpers-for-recurrent-networks" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.get_hidden_states"><code>get_hidden_states</code></a></li><li><a href="../api/#NNHelferlein.get_cell_states"><code>get_cell_states</code></a></li><li><a href="../api/#NNHelferlein.set_hidden_states!"><code>set_hidden_states!</code></a></li><li><a href="../api/#NNHelferlein.set_cell_states!"><code>set_cell_states!</code></a>!</li><li><a href="../api/#NNHelferlein.reset_hidden_states!"><code>reset_hidden_states!</code></a></li><li><a href="../api/#NNHelferlein.reset_cell_states!"><code>reset_cell_states!</code></a></li></ul><h3 id="Other-layers"><a class="docs-heading-anchor" href="#Other-layers">Other layers</a><a id="Other-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Other-layers" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.Activation"><code>Activation</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Sigm</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Relu</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Swish</code></a></li><li><a href="../api/#NNHelferlein.Softmax"><code>Softmax</code></a></li><li><a href="../api/#NNHelferlein.Logistic"><code>Logistic</code></a></li><li><a href="../api/#NNHelferlein.Dropout"><code>Dropout</code></a></li><li><a href="../api/#NNHelferlein.BatchNorm"><code>BatchNorm</code></a></li><li><a href="../api/#NNHelferlein.LayerNorm"><code>LayerNorm</code></a></li><li><a href="../api/#NNHelferlein.GaussianNoise"><code>GaussianNoise</code></a></li></ul><h3 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.AttentionMechanism"><code>AttentionMechanism</code></a></li><li><a href="../api/#NNHelferlein.AttnBahdanau"><code>AttnBahdanau</code></a></li><li><a href="../api/#NNHelferlein.AttnLuong"><code>AttnLuong</code></a></li><li><a href="../api/#NNHelferlein.AttnDot"><code>AttnDot</code></a></li><li><a href="../api/#NNHelferlein.AttnLocation"><code>AttnLocation</code></a></li><li><a href="../api/#NNHelferlein.AttnInFeed"><code>AttnInFeed</code></a></li></ul><h3 id="Tranformer-API"><a class="docs-heading-anchor" href="#Tranformer-API">Tranformer API</a><a id="Tranformer-API-1"></a><a class="docs-heading-anchor-permalink" href="#Tranformer-API" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.Transformer"><code>Transformer</code></a> - generic transformer type, works on tensors                         of embedded sequences.</p></li><li><p><a href="../api/#NNHelferlein.TokenTransformer"><code>TokenTransformer</code></a> - generic transformer type, works on                              tokenized sequences.</p></li><li><p><a href="../api/#NNHelferlein.TFEncoderLayer"><code>TFEncoderLayer</code></a></p></li><li><p><a href="../api/#NNHelferlein.TFEncoder"><code>TFEncoder</code></a> - Bert-like transformer encoder</p></li><li><p><a href="../api/#NNHelferlein.TFDecoderLayer"><code>TFDecoderLayer</code></a></p></li><li><p><a href="../api/#NNHelferlein.TFDecoder"><code>TFDecoder</code></a> - Bert-like transformer decoder</p></li><li><p><a href="../api/#NNHelferlein.PositionalEncoding"><code>PositionalEncoding</code></a></p></li><li><p><a href="../api/#NNHelferlein.mk_padding_mask"><code>mk_padding_mask</code></a></p></li><li><p><a href="../api/#NNHelferlein.mk_peek_ahead_mask"><code>mk_peek_ahead_mask</code></a></p></li><li><p><a href="../api/#NNHelferlein.dot_prod_attn"><code>dot_prod_attn</code></a></p></li><li><p><a href="../api/#NNHelferlein.MultiHeadAttn"><code>MultiHeadAttn</code></a></p></li><li><p><a href="../api/#NNHelferlein.separate_heads"><code>separate_heads</code></a></p></li><li><p><a href="../api/#NNHelferlein.merge_heads"><code>merge_heads</code></a></p></li></ul><h1 id="Activation-functions"><a class="docs-heading-anchor" href="#Activation-functions">Activation functions</a><a id="Activation-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Activation-functions" title="Permalink"></a></h1><p><em>Helferlein</em>-style is to provide all functions (such activation  or loss functions) as <code>functions</code>.  Therefore any function from any package or any custom function may be  provided as <code>actf</code> to the layer constructors.</p><ul><li><p>... see <a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Activation-functions"><code>Knet docu</code></a>  for all activation functions provided by Knet (<code>elu</code>, <code>relu</code>, <code>selu</code>, <code>sigm</code>, ...).</p></li><li><p><em>Helferlein</em> provides some derived funs, such as  <code>leaky_relu</code>, <code>leaky_tanh</code>, <code>leaky_sigm</code> or <code>swish</code>.</p></li></ul><h1 id="Data-provider-utilities"><a class="docs-heading-anchor" href="#Data-provider-utilities">Data provider utilities</a><a id="Data-provider-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Data-provider-utilities" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.DataLoader"><code>DataLoader</code></a> - type for iterator of minibatches</li><li><a href="../api/#NNHelferlein.SequenceData"><code>SequenceData</code></a> - type for iterator of minibatches of sequences</li></ul><h3 id="For-tabular-data"><a class="docs-heading-anchor" href="#For-tabular-data">For tabular data</a><a id="For-tabular-data-1"></a><a class="docs-heading-anchor-permalink" href="#For-tabular-data" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.dataframe_read"><code>dataframe_read</code></a></li><li><a href="../api/#NNHelferlein.dataframe_minibatch"><code>dataframe_minibatch</code></a> - turn a dataframe into minibatches</li><li><a href="../api/#NNHelferlein.dataframe_split"><code>dataframe_split</code></a></li><li><a href="../api/#NNHelferlein.mk_class_ids"><code>mk_class_ids</code></a></li></ul><h3 id="For-image-data"><a class="docs-heading-anchor" href="#For-image-data">For image data</a><a id="For-image-data-1"></a><a class="docs-heading-anchor-permalink" href="#For-image-data" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.ImageLoader"><code>ImageLoader</code></a> - turn adirectory structure of image files    into minibatches</p></li><li><p><a href="../api/#NNHelferlein.mk_image_minibatch"><code>mk_image_minibatch</code></a></p></li><li><p><a href="../api/#NNHelferlein.get_class_labels"><code>get_class_labels</code></a></p></li></ul><h4 id="Image-to-array-tools"><a class="docs-heading-anchor" href="#Image-to-array-tools">Image to array tools</a><a id="Image-to-array-tools-1"></a><a class="docs-heading-anchor-permalink" href="#Image-to-array-tools" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.image2array"><code>image2array</code></a></li><li><a href="../api/#NNHelferlein.array2image"><code>array2image</code></a></li><li><a href="../api/#NNHelferlein.array2RGB"><code>array2RGB</code></a></li></ul><h4 id="ImageNet-tools"><a class="docs-heading-anchor" href="#ImageNet-tools">ImageNet tools</a><a id="ImageNet-tools-1"></a><a class="docs-heading-anchor-permalink" href="#ImageNet-tools" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_vgg</code></a></li><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_resnet</code></a></li><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_resnetv2</code></a></li><li><a href="../api/#NNHelferlein.predict_imagenet"><code>predict_imagenet</code></a></li><li><a href="../api/#NNHelferlein.get_imagenet_classes"><code>get_imagenet_classes</code></a></li></ul><h3 id="Text-data"><a class="docs-heading-anchor" href="#Text-data">Text data</a><a id="Text-data-1"></a><a class="docs-heading-anchor-permalink" href="#Text-data" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.WordTokenizer"><code>WordTokenizer</code></a></p></li><li><p><a href="../api/#NNHelferlein.sequence_minibatch"><code>sequence_minibatch</code></a> - turn a text corpus into minibatches</p></li><li><p><a href="../api/#NNHelferlein.pad_sequence"><code>pad_sequence</code></a></p></li><li><p><a href="../api/#NNHelferlein.truncate_sequence"><code>truncate_sequence</code></a></p></li></ul><h4 id="Text-corpus-example-data-download"><a class="docs-heading-anchor" href="#Text-corpus-example-data-download">Text corpus example data download</a><a id="Text-corpus-example-data-download-1"></a><a class="docs-heading-anchor-permalink" href="#Text-corpus-example-data-download" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.get_tatoeba_corpus"><code>get_tatoeba_corpus</code></a></li></ul><h1 id="Iteration-utilities"><a class="docs-heading-anchor" href="#Iteration-utilities">Iteration utilities</a><a id="Iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Iteration-utilities" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.PartialIterator"><code>PartialIterator</code></a></li><li><a href="../api/#NNHelferlein.split_minibatches"><code>split_minibatches</code></a></li><li><a href="../api/#NNHelferlein.MBNoiser"><code>MBNoiser</code></a></li><li><a href="../api/#NNHelferlein.MBMasquerade"><code>MBMasquerade</code></a></li></ul><h1 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.tb_train!"><code>tb_train!</code></a> - high-level training utility with    tenorboard support and (maybe too) many optional arguments</li></ul><h3 id="Evaluation-and-accuracy"><a class="docs-heading-anchor" href="#Evaluation-and-accuracy">Evaluation and accuracy</a><a id="Evaluation-and-accuracy-1"></a><a class="docs-heading-anchor-permalink" href="#Evaluation-and-accuracy" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.predict"><code>predict</code></a></li><li><a href="../api/#NNHelferlein.predict_top5"><code>predict_top5</code></a></li><li><a href="../api/#NNHelferlein.minibatch_eval"><code>minibatch_eval</code></a></li><li><a href="../api/#NNHelferlein.confusion_matrix"><code>confusion_matrix</code></a></li></ul><h3 id="Loss-functions"><a class="docs-heading-anchor" href="#Loss-functions">Loss functions</a><a id="Loss-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Loss-functions" title="Permalink"></a></h3><ul><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.nll"><code>Knet.Ops20.nll</code></a> -  Cross-entropy for classifiers (aka negative log likelihood)</li><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.bce"><code>Knet.Ops20.bce</code></a> -  binary cross-entropy for binary classifiers </li><li><a href="../api/#NNHelferlein.focal_nll"><code>focal_nll</code></a></li><li><a href="../api/#NNHelferlein.focal_bce"><code>focal_bce</code></a></li><li>... see <a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Loss-functions"><code>Knet docu</code></a>  for all loss functions provided by Knet.</li></ul><h3 id="Accuracy-functions"><a class="docs-heading-anchor" href="#Accuracy-functions">Accuracy functions</a><a id="Accuracy-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Accuracy-functions" title="Permalink"></a></h3><ul><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.accuracy"><code>Knet.Ops20.accuracy</code></a> </li><li><a href="../api/#NNHelferlein.squared_error_acc"><code>squared_error_acc</code></a></li><li><a href="../api/#NNHelferlein.abs_error_acc"><code>abs_error_acc</code></a></li><li><a href="../api/#NNHelferlein.hamming_dist"><code>hamming_dist</code></a> - Hamming distance-like accuracy</li><li><a href="../api/#NNHelferlein.peak_finder_acc"><code>peak_finder_acc</code></a> - accuracy, suitable for peak detection</li></ul><h1 id="Other-utils"><a class="docs-heading-anchor" href="#Other-utils">Other utils</a><a id="Other-utils-1"></a><a class="docs-heading-anchor-permalink" href="#Other-utils" title="Permalink"></a></h1><h3 id="Utils-for-array-manipulation"><a class="docs-heading-anchor" href="#Utils-for-array-manipulation">Utils for array manipulation</a><a id="Utils-for-array-manipulation-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-array-manipulation" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.crop_array"><code>crop_array</code></a></li><li><a href="../api/#NNHelferlein.blowup_array"><code>blowup_array</code></a></li><li><a href="../api/#NNHelferlein.recycle_array"><code>recycle_array</code></a></li><li><a href="../api/#NNHelferlein.de_embed"><code>de_embed</code></a> - return argmax for a n-dimensional array</li></ul><h3 id="Utils-for-fixing-types-in-GPU-context"><a class="docs-heading-anchor" href="#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a><a id="Utils-for-fixing-types-in-GPU-context-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-fixing-types-in-GPU-context" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.init0"><code>init0</code></a></li><li><a href="../api/#NNHelferlein.convert2CuArray"><code>convert2CuArray</code></a></li><li><a href="../api/#NNHelferlein.ifgpu"><code>ifgpu</code></a></li><li><a href="../api/#NNHelferlein.emptyCuArray"><code>emptyCuArray</code></a></li></ul><h3 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.dataset_mit_nsr"><code>dataset_mit_nsr</code></a> - logterm ECGs</li><li><a href="../api/#NNHelferlein.dataset_mnist"><code>dataset_mnist</code></a> - MNIST</li><li><a href="../api/#NNHelferlein.dataset_iris"><code>dataset_iris</code></a> - Fisher&#39;s Iris dataset</li><li><a href="../api/#NNHelferlein.get_tatoeba_corpus"><code>get_tatoeba_corpus</code></a> - machine translation text corpi</li><li><a href="../api/#NNHelferlein.dataset_pfam"><code>dataset_pfam</code></a> - protein sequences dataset</li></ul><h1 id="Pretrained-networks"><a class="docs-heading-anchor" href="#Pretrained-networks">Pretrained networks</a><a id="Pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-networks" title="Permalink"></a></h1><p>Pretrained network weights, derived from Keras applications.</p><ul><li><a href="../api/#NNHelferlein.get_vgg16"><code>get_vgg16</code></a></li><li><a href="../api/#NNHelferlein.get_resnet50v2"><code>get_resnet50v2</code></a></li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../examples/">« Examples</a><a class="docs-footer-nextpage" href="../api/">API Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Overview · NNHelferlein.jl</title><meta name="title" content="API Overview · NNHelferlein.jl"/><meta property="og:title" content="API Overview · NNHelferlein.jl"/><meta property="twitter:title" content="API Overview · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li class="is-active"><a class="tocitem" href>API Overview</a><ul class="internal"><li class="toplevel"><a class="tocitem" href="#Layers"><span>Layers</span></a></li><li class="toplevel"><a class="tocitem" href="#Activation-functions"><span>Activation functions</span></a></li><li class="toplevel"><a class="tocitem" href="#Data-provider-utilities"><span>Data provider utilities</span></a></li><li class="toplevel"><a class="tocitem" href="#Iteration-utilities"><span>Iteration utilities</span></a></li><li class="toplevel"><a class="tocitem" href="#Training"><span>Training</span></a></li><li class="toplevel"><a class="tocitem" href="#Other-utils"><span>Other utils</span></a></li><li class="toplevel"><a class="tocitem" href="#Pretrained-networks"><span>Pretrained networks</span></a></li></ul></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Overview</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Overview</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/api_overview.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Networks-and-chains"><a class="docs-heading-anchor" href="#Networks-and-chains">Networks and chains</a><a id="Networks-and-chains-1"></a><a class="docs-heading-anchor-permalink" href="#Networks-and-chains" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.AbstractNN"><code>AbstractNN</code></a> - <em>Helferlein</em> network type</li><li><a href="../api/#NNHelferlein.AbstractChain"><code>AbstractChain</code></a> - <em>Helferlein</em> chain type</li></ul><ul><li><p><a href="../api/#NNHelferlein.Classifier"><code>Classifier</code></a> - network with NLL loss</p></li><li><p><a href="../api/#NNHelferlein.Regressor"><code>Regressor</code></a> - network with MSE soll</p></li><li><p><a href="../api/#NNHelferlein.VAE"><code>VAE</code></a> - variational autoencoder wrapper The VAE supports ramp-up of the KL-weight beta via the functions <a href="../api/#NNHelferlein.set_beta!"><code>set_beta!</code></a> and <a href="../api/#NNHelferlein.get_beta"><code>get_beta</code></a>.</p></li><li><p><a href="../api/#NNHelferlein.Chain"><code>Chain</code></a></p></li></ul><h3 id="Network-helpers"><a class="docs-heading-anchor" href="#Network-helpers">Network helpers</a><a id="Network-helpers-1"></a><a class="docs-heading-anchor-permalink" href="#Network-helpers" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.add_layer!"><code>add_layer!</code></a></p></li><li><p><a href="../api/#NNHelferlein.add_layer!"><code>+</code></a></p></li><li><p><a href="../api/#Base.summary"><code>summary</code></a></p></li><li><p><a href="../api/#NNHelferlein.save_network"><code>save_network</code></a> - save as jld2 file</p></li><li><p><a href="../api/#NNHelferlein.load_network"><code>load_network</code></a></p></li><li><p><a href="../api/#NNHelferlein.copy_network"><code>copy_network</code></a> - copy from and to GPU</p></li></ul><h1 id="Layers"><a class="docs-heading-anchor" href="#Layers">Layers</a><a id="Layers-1"></a><a class="docs-heading-anchor-permalink" href="#Layers" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.AbstractLayer"><code>AbstractLayer</code></a></li></ul><h3 id="Fully-connected-layers"><a class="docs-heading-anchor" href="#Fully-connected-layers">Fully connected layers</a><a id="Fully-connected-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Fully-connected-layers" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.Dense"><code>Dense</code></a></li><li><a href="../api/#NNHelferlein.Linear"><code>Linear</code></a></li><li><a href="../api/#NNHelferlein.Embed"><code>Embed</code></a></li><li><a href="../api/#NNHelferlein.FeatureSelection"><code>FeatureSelection</code></a></li></ul><h3 id="Convolutional"><a class="docs-heading-anchor" href="#Convolutional">Convolutional</a><a id="Convolutional-1"></a><a class="docs-heading-anchor-permalink" href="#Convolutional" title="Permalink"></a></h3><p>Layers for convolutional networks:</p><ul><li><a href="../api/#NNHelferlein.Conv"><code>Conv</code></a></li><li><a href="../api/#NNHelferlein.DeConv"><code>DeConv</code></a></li><li><a href="../api/#NNHelferlein.ResNetBlock"><code>ResNetBlock</code></a></li><li><a href="../api/#NNHelferlein.DepthwiseConv"><code>DepthwiseConv</code></a></li><li><a href="../api/#NNHelferlein.Pool"><code>Pool</code></a></li><li><a href="../api/#NNHelferlein.UnPool"><code>UnPool</code></a></li><li><a href="../api/#NNHelferlein.Pad"><code>Pad</code></a></li><li><a href="../api/#NNHelferlein.Flat"><code>Flat</code></a></li><li><a href="../api/#NNHelferlein.PyFlat"><code>PyFlat</code></a></li><li><a href="../api/#NNHelferlein.GlobalAveragePooling"><code>GlobalAveragePooling</code></a></li></ul><h3 id="Recurrent"><a class="docs-heading-anchor" href="#Recurrent">Recurrent</a><a id="Recurrent-1"></a><a class="docs-heading-anchor-permalink" href="#Recurrent" title="Permalink"></a></h3><p>Layers for recurrent networks:</p><ul><li><a href="../api/#NNHelferlein.Recurrent"><code>Recurrent</code></a> - type for recurrent layers</li><li><a href="../api/#NNHelferlein.RecurrentUnit"><code>RecurrentUnit</code></a> - type for recurrent units</li></ul><h4 id="Helpers-for-recurrent-networks"><a class="docs-heading-anchor" href="#Helpers-for-recurrent-networks">Helpers for recurrent networks</a><a id="Helpers-for-recurrent-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Helpers-for-recurrent-networks" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.get_hidden_states"><code>get_hidden_states</code></a></li><li><a href="../api/#NNHelferlein.get_cell_states"><code>get_cell_states</code></a></li><li><a href="../api/#NNHelferlein.set_hidden_states!"><code>set_hidden_states!</code></a></li><li><a href="../api/#NNHelferlein.set_cell_states!"><code>set_cell_states!</code></a>!</li><li><a href="../api/#NNHelferlein.reset_hidden_states!"><code>reset_hidden_states!</code></a></li><li><a href="../api/#NNHelferlein.reset_cell_states!"><code>reset_cell_states!</code></a></li></ul><h3 id="Other-layers"><a class="docs-heading-anchor" href="#Other-layers">Other layers</a><a id="Other-layers-1"></a><a class="docs-heading-anchor-permalink" href="#Other-layers" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.Activation"><code>Activation</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Sigm</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Relu</code></a></li><li><a href="../api/#NNHelferlein.Activation"><code>Swish</code></a></li><li><a href="../api/#NNHelferlein.Softmax"><code>Softmax</code></a></li><li><a href="../api/#NNHelferlein.Logistic"><code>Logistic</code></a></li><li><a href="../api/#NNHelferlein.Dropout"><code>Dropout</code></a></li><li><a href="../api/#NNHelferlein.BatchNorm"><code>BatchNorm</code></a></li><li><a href="../api/#NNHelferlein.LayerNorm"><code>LayerNorm</code></a></li><li><a href="../api/#NNHelferlein.GaussianNoise"><code>GaussianNoise</code></a></li></ul><h3 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.AttentionMechanism"><code>AttentionMechanism</code></a></li><li><a href="../api/#NNHelferlein.AttnBahdanau"><code>AttnBahdanau</code></a></li><li><a href="../api/#NNHelferlein.AttnLuong"><code>AttnLuong</code></a></li><li><a href="../api/#NNHelferlein.AttnDot"><code>AttnDot</code></a></li><li><a href="../api/#NNHelferlein.AttnLocation"><code>AttnLocation</code></a></li><li><a href="../api/#NNHelferlein.AttnInFeed"><code>AttnInFeed</code></a></li></ul><h3 id="Tranformer-API"><a class="docs-heading-anchor" href="#Tranformer-API">Tranformer API</a><a id="Tranformer-API-1"></a><a class="docs-heading-anchor-permalink" href="#Tranformer-API" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.Transformer"><code>Transformer</code></a> - generic transformer type, works on tensors                         of embedded sequences.</p></li><li><p><a href="../api/#NNHelferlein.TokenTransformer"><code>TokenTransformer</code></a> - generic transformer type, works on                              tokenized sequences.</p></li><li><p><a href="../api/#NNHelferlein.TFEncoderLayer"><code>TFEncoderLayer</code></a></p></li><li><p><a href="../api/#NNHelferlein.TFEncoder"><code>TFEncoder</code></a> - Bert-like transformer encoder</p></li><li><p><a href="../api/#NNHelferlein.TFDecoderLayer"><code>TFDecoderLayer</code></a></p></li><li><p><a href="../api/#NNHelferlein.TFDecoder"><code>TFDecoder</code></a> - Bert-like transformer decoder</p></li><li><p><a href="../api/#NNHelferlein.PositionalEncoding"><code>PositionalEncoding</code></a></p></li><li><p><a href="../api/#NNHelferlein.mk_padding_mask"><code>mk_padding_mask</code></a></p></li><li><p><a href="../api/#NNHelferlein.mk_peek_ahead_mask"><code>mk_peek_ahead_mask</code></a></p></li><li><p><a href="../api/#NNHelferlein.dot_prod_attn"><code>dot_prod_attn</code></a></p></li><li><p><a href="../api/#NNHelferlein.MultiHeadAttn"><code>MultiHeadAttn</code></a></p></li><li><p><a href="../api/#NNHelferlein.separate_heads"><code>separate_heads</code></a></p></li><li><p><a href="../api/#NNHelferlein.merge_heads"><code>merge_heads</code></a></p></li></ul><h1 id="Activation-functions"><a class="docs-heading-anchor" href="#Activation-functions">Activation functions</a><a id="Activation-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Activation-functions" title="Permalink"></a></h1><p><em>Helferlein</em>-style is to provide all functions (such activation  or loss functions) as <code>functions</code>.  Therefore any function from any package or any custom function may be  provided as <code>actf</code> to the layer constructors.</p><ul><li><p>... see <a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Activation-functions"><code>Knet docu</code></a>  for all activation functions provided by Knet (<code>elu</code>, <code>relu</code>, <code>selu</code>, <code>sigm</code>, ...).</p></li><li><p><em>Helferlein</em> provides some derived funs, such as  <code>leaky_relu</code>, <code>leaky_tanh</code>, <code>leaky_sigm</code> or <code>swish</code>.</p></li></ul><h1 id="Data-provider-utilities"><a class="docs-heading-anchor" href="#Data-provider-utilities">Data provider utilities</a><a id="Data-provider-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Data-provider-utilities" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.DataLoader"><code>DataLoader</code></a> - type for iterator of minibatches</li><li><a href="../api/#NNHelferlein.SequenceData"><code>SequenceData</code></a> - type for iterator of minibatches of sequences</li></ul><h3 id="For-tabular-data"><a class="docs-heading-anchor" href="#For-tabular-data">For tabular data</a><a id="For-tabular-data-1"></a><a class="docs-heading-anchor-permalink" href="#For-tabular-data" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.dataframe_read"><code>dataframe_read</code></a></li><li><a href="../api/#NNHelferlein.dataframe_minibatch"><code>dataframe_minibatch</code></a> - turn a dataframe into minibatches</li><li><a href="../api/#NNHelferlein.dataframe_split"><code>dataframe_split</code></a></li><li><a href="../api/#NNHelferlein.mk_class_ids"><code>mk_class_ids</code></a></li></ul><h3 id="For-image-data"><a class="docs-heading-anchor" href="#For-image-data">For image data</a><a id="For-image-data-1"></a><a class="docs-heading-anchor-permalink" href="#For-image-data" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.ImageLoader"><code>ImageLoader</code></a> - turn adirectory structure of image files    into minibatches</p></li><li><p><a href="../api/#NNHelferlein.mk_image_minibatch"><code>mk_image_minibatch</code></a></p></li><li><p><a href="../api/#NNHelferlein.get_class_labels"><code>get_class_labels</code></a></p></li></ul><h4 id="Image-to-array-tools"><a class="docs-heading-anchor" href="#Image-to-array-tools">Image to array tools</a><a id="Image-to-array-tools-1"></a><a class="docs-heading-anchor-permalink" href="#Image-to-array-tools" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.image2array"><code>image2array</code></a></li><li><a href="../api/#NNHelferlein.array2image"><code>array2image</code></a></li><li><a href="../api/#NNHelferlein.array2RGB"><code>array2RGB</code></a></li></ul><h4 id="ImageNet-tools"><a class="docs-heading-anchor" href="#ImageNet-tools">ImageNet tools</a><a id="ImageNet-tools-1"></a><a class="docs-heading-anchor-permalink" href="#ImageNet-tools" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_vgg</code></a></li><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_resnet</code></a></li><li><a href="../api/#NNHelferlein.preproc_imagenet_vgg"><code>preproc_imagenet_resnetv2</code></a></li><li><a href="../api/#NNHelferlein.predict_imagenet"><code>predict_imagenet</code></a></li><li><a href="../api/#NNHelferlein.get_imagenet_classes"><code>get_imagenet_classes</code></a></li></ul><h3 id="Text-data"><a class="docs-heading-anchor" href="#Text-data">Text data</a><a id="Text-data-1"></a><a class="docs-heading-anchor-permalink" href="#Text-data" title="Permalink"></a></h3><ul><li><p><a href="../api/#NNHelferlein.WordTokenizer"><code>WordTokenizer</code></a></p></li><li><p><a href="../api/#NNHelferlein.sequence_minibatch"><code>sequence_minibatch</code></a> - turn a text corpus into minibatches</p></li><li><p><a href="../api/#NNHelferlein.pad_sequence"><code>pad_sequence</code></a></p></li><li><p><a href="../api/#NNHelferlein.truncate_sequence"><code>truncate_sequence</code></a></p></li></ul><h4 id="Text-corpus-example-data-download"><a class="docs-heading-anchor" href="#Text-corpus-example-data-download">Text corpus example data download</a><a id="Text-corpus-example-data-download-1"></a><a class="docs-heading-anchor-permalink" href="#Text-corpus-example-data-download" title="Permalink"></a></h4><ul><li><a href="../api/#NNHelferlein.get_tatoeba_corpus"><code>get_tatoeba_corpus</code></a></li></ul><h1 id="Iteration-utilities"><a class="docs-heading-anchor" href="#Iteration-utilities">Iteration utilities</a><a id="Iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Iteration-utilities" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.PartialIterator"><code>PartialIterator</code></a></li><li><a href="../api/#NNHelferlein.split_minibatches"><code>split_minibatches</code></a></li><li><a href="../api/#NNHelferlein.MBNoiser"><code>MBNoiser</code></a></li><li><a href="../api/#NNHelferlein.MBMasquerade"><code>MBMasquerade</code></a></li></ul><h1 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h1><ul><li><a href="../api/#NNHelferlein.tb_train!"><code>tb_train!</code></a> - high-level training utility with    tenorboard support and (maybe too) many optional arguments</li></ul><h3 id="Evaluation-and-accuracy"><a class="docs-heading-anchor" href="#Evaluation-and-accuracy">Evaluation and accuracy</a><a id="Evaluation-and-accuracy-1"></a><a class="docs-heading-anchor-permalink" href="#Evaluation-and-accuracy" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.predict"><code>predict</code></a></li><li><a href="../api/#NNHelferlein.predict_top5"><code>predict_top5</code></a></li><li><a href="../api/#NNHelferlein.minibatch_eval"><code>minibatch_eval</code></a></li><li><a href="../api/#NNHelferlein.confusion_matrix"><code>confusion_matrix</code></a></li></ul><h3 id="Loss-functions"><a class="docs-heading-anchor" href="#Loss-functions">Loss functions</a><a id="Loss-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Loss-functions" title="Permalink"></a></h3><ul><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.nll"><code>Knet.Ops20.nll</code></a> -  Cross-entropy for classifiers (aka negative log likelihood)</li><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.bce"><code>Knet.Ops20.bce</code></a> -  binary cross-entropy for binary classifiers </li><li><a href="../api/#NNHelferlein.focal_nll"><code>focal_nll</code></a></li><li><a href="../api/#NNHelferlein.focal_bce"><code>focal_bce</code></a></li><li>... see <a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Loss-functions"><code>Knet docu</code></a>  for all loss functions provided by Knet.</li></ul><h3 id="Accuracy-functions"><a class="docs-heading-anchor" href="#Accuracy-functions">Accuracy functions</a><a id="Accuracy-functions-1"></a><a class="docs-heading-anchor-permalink" href="#Accuracy-functions" title="Permalink"></a></h3><ul><li><a href="https://denizyuret.github.io/Knet.jl/latest/reference/#Knet.Ops20.accuracy"><code>Knet.Ops20.accuracy</code></a> </li><li><a href="../api/#NNHelferlein.squared_error_acc"><code>squared_error_acc</code></a></li><li><a href="../api/#NNHelferlein.abs_error_acc"><code>abs_error_acc</code></a></li><li><a href="../api/#NNHelferlein.hamming_dist"><code>hamming_dist</code></a> - Hamming distance-like accuracy</li><li><a href="../api/#NNHelferlein.peak_finder_acc"><code>peak_finder_acc</code></a> - accuracy, suitable for peak detection</li></ul><h1 id="Other-utils"><a class="docs-heading-anchor" href="#Other-utils">Other utils</a><a id="Other-utils-1"></a><a class="docs-heading-anchor-permalink" href="#Other-utils" title="Permalink"></a></h1><h3 id="Utils-for-array-manipulation"><a class="docs-heading-anchor" href="#Utils-for-array-manipulation">Utils for array manipulation</a><a id="Utils-for-array-manipulation-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-array-manipulation" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.crop_array"><code>crop_array</code></a></li><li><a href="../api/#NNHelferlein.blowup_array"><code>blowup_array</code></a></li><li><a href="../api/#NNHelferlein.recycle_array"><code>recycle_array</code></a></li><li><a href="../api/#NNHelferlein.de_embed"><code>de_embed</code></a> - return argmax for a n-dimensional array</li></ul><h3 id="Utils-for-fixing-types-in-GPU-context"><a class="docs-heading-anchor" href="#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a><a id="Utils-for-fixing-types-in-GPU-context-1"></a><a class="docs-heading-anchor-permalink" href="#Utils-for-fixing-types-in-GPU-context" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.init0"><code>init0</code></a></li><li><a href="../api/#NNHelferlein.convert2CuArray"><code>convert2CuArray</code></a></li><li><a href="../api/#NNHelferlein.ifgpu"><code>ifgpu</code></a></li><li><a href="../api/#NNHelferlein.emptyCuArray"><code>emptyCuArray</code></a></li></ul><h3 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h3><ul><li><a href="../api/#NNHelferlein.dataset_mit_nsr"><code>dataset_mit_nsr</code></a> - logterm ECGs</li><li><a href="../api/#NNHelferlein.dataset_mnist"><code>dataset_mnist</code></a> - MNIST</li><li><a href="../api/#NNHelferlein.dataset_iris"><code>dataset_iris</code></a> - Fisher&#39;s Iris dataset</li><li><a href="../api/#NNHelferlein.get_tatoeba_corpus"><code>get_tatoeba_corpus</code></a> - machine translation text corpi</li><li><a href="../api/#NNHelferlein.dataset_pfam"><code>dataset_pfam</code></a> - protein sequences dataset</li></ul><h1 id="Pretrained-networks"><a class="docs-heading-anchor" href="#Pretrained-networks">Pretrained networks</a><a id="Pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-networks" title="Permalink"></a></h1><p>Pretrained network weights, derived from Keras applications.</p><ul><li><a href="../api/#NNHelferlein.get_vgg16"><code>get_vgg16</code></a></li><li><a href="../api/#NNHelferlein.get_resnet50v2"><code>get_resnet50v2</code></a></li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../examples/">« Examples</a><a class="docs-footer-nextpage" href="../api/">API Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/changelog/index.html b/dev/changelog/index.html
index f3830e61..29881c20 100644
--- a/dev/changelog/index.html
+++ b/dev/changelog/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Changelog · NNHelferlein.jl</title><meta name="title" content="Changelog · NNHelferlein.jl"/><meta property="og:title" content="Changelog · NNHelferlein.jl"/><meta property="twitter:title" content="Changelog · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li><li class="is-active"><a class="tocitem" href>Changelog</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Changelog</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Changelog</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/changelog.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="ChangeLog-of-NNHelferlein-package"><a class="docs-heading-anchor" href="#ChangeLog-of-NNHelferlein-package">ChangeLog of NNHelferlein package</a><a id="ChangeLog-of-NNHelferlein-package-1"></a><a class="docs-heading-anchor-permalink" href="#ChangeLog-of-NNHelferlein-package" title="Permalink"></a></h1><h3 id="todo"><a class="docs-heading-anchor" href="#todo">todo</a><a id="todo-1"></a><a class="docs-heading-anchor-permalink" href="#todo" title="Permalink"></a></h3><ul><li>use CUDA.CuIterator in train?</li><li>padding no longer imported from NNlib (incompatibility wirh AutoGrad)</li></ul><h3 id=".3.2"><a class="docs-heading-anchor" href="#.3.2">1.3.2</a><a id=".3.2-1"></a><a class="docs-heading-anchor-permalink" href="#.3.2" title="Permalink"></a></h3><ul><li>tidy-up dependency jungle</li><li>Padding added to emebdding layer</li></ul><h3 id="v1.3.1"><a class="docs-heading-anchor" href="#v1.3.1">v1.3.1</a><a id="v1.3.1-1"></a><a class="docs-heading-anchor-permalink" href="#v1.3.1" title="Permalink"></a></h3><ul><li>l1 and l2 decay always parallel to learning rate decay</li><li>severeal bioinformatics tools (aminoacid embedding, blosum, vhse8)</li><li>dataframe_minibatch default &quot;y&quot; changed to nothing.</li><li>Bioinformatics: Aminoacid tokenisation added</li><li>GPU selection added (not yet exported)</li><li>grouped convolutions fixed</li></ul><h3 id="v1.3"><a class="docs-heading-anchor" href="#v1.3">v1.3</a><a id="v1.3-1"></a><a class="docs-heading-anchor-permalink" href="#v1.3" title="Permalink"></a></h3><ul><li>Transformer API added for Bert-like architectures</li><li>Transformer example</li><li>ramp-up of beta added to VAE</li><li>disambiguate vae signature</li></ul><h3 id="v1.2"><a class="docs-heading-anchor" href="#v1.2">v1.2</a><a id="v1.2-1"></a><a class="docs-heading-anchor-permalink" href="#v1.2" title="Permalink"></a></h3><ul><li>imagenet preprocessing fixed for vgg and resnet</li><li>ResNetBlock added</li><li>ResNet added</li><li>Padding layer added</li><li>print_network changed to summary</li><li>Pretrained nets saved at zenodo and simplified constructors added</li><li>AbstractNN and AbstractLayer added</li><li>copy model and save/load as JLD2 added</li></ul><h3 id="v1.1.2"><a class="docs-heading-anchor" href="#v1.1.2">v1.1.2</a><a id="v1.1.2-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.2" title="Permalink"></a></h3><ul><li>Depthwise conv-layer added (experimental)</li><li>focal loss functions added to classifier </li><li>FeatureSelection layer added</li><li>explicit signature added for 3d-convolution</li><li>train: possibility to disable tensorboard logs</li><li>train: possibility to return losses and accs for  plotting after training</li></ul><h3 id="v1.1.1"><a class="docs-heading-anchor" href="#v1.1.1">v1.1.1</a><a id="v1.1.1-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.1" title="Permalink"></a></h3><ul><li>some docstring cosmetics</li><li>Activation Layers added</li><li>layer GlobalAveragePoling added</li><li>pre-trained vgg example fixed for changed &quot;import-HDF&quot;-interface</li><li>hdf5 import with all kwargs possible</li><li>added: Layer + Layer = Chain</li><li>changelog added to docu</li></ul><h3 id="v1.1.0"><a class="docs-heading-anchor" href="#v1.1.0">v1.1.0</a><a id="v1.1.0-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.0" title="Permalink"></a></h3><ul><li>documentation for release added</li><li>split_minibatches() made stable (never returns an empty iterator)</li><li>docs slightly re-organised</li><li>Gaussian Layer added</li><li>minibatch iterator for masking added</li></ul><h3 id="v1.0.0"><a class="docs-heading-anchor" href="#v1.0.0">v1.0.0</a><a id="v1.0.0-1"></a><a class="docs-heading-anchor-permalink" href="#v1.0.0" title="Permalink"></a></h3><ul><li>initial release</li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../license/">« License</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Changelog · NNHelferlein.jl</title><meta name="title" content="Changelog · NNHelferlein.jl"/><meta property="og:title" content="Changelog · NNHelferlein.jl"/><meta property="twitter:title" content="Changelog · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li><li class="is-active"><a class="tocitem" href>Changelog</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Changelog</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Changelog</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/changelog.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="ChangeLog-of-NNHelferlein-package"><a class="docs-heading-anchor" href="#ChangeLog-of-NNHelferlein-package">ChangeLog of NNHelferlein package</a><a id="ChangeLog-of-NNHelferlein-package-1"></a><a class="docs-heading-anchor-permalink" href="#ChangeLog-of-NNHelferlein-package" title="Permalink"></a></h1><h3 id="todo"><a class="docs-heading-anchor" href="#todo">todo</a><a id="todo-1"></a><a class="docs-heading-anchor-permalink" href="#todo" title="Permalink"></a></h3><ul><li>use CUDA.CuIterator in train?</li><li>padding no longer imported from NNlib (incompatibility wirh AutoGrad)</li></ul><h3 id=".3.2"><a class="docs-heading-anchor" href="#.3.2">1.3.2</a><a id=".3.2-1"></a><a class="docs-heading-anchor-permalink" href="#.3.2" title="Permalink"></a></h3><ul><li>tidy-up dependency jungle</li><li>Padding added to emebdding layer</li></ul><h3 id="v1.3.1"><a class="docs-heading-anchor" href="#v1.3.1">v1.3.1</a><a id="v1.3.1-1"></a><a class="docs-heading-anchor-permalink" href="#v1.3.1" title="Permalink"></a></h3><ul><li>l1 and l2 decay always parallel to learning rate decay</li><li>severeal bioinformatics tools (aminoacid embedding, blosum, vhse8)</li><li>dataframe_minibatch default &quot;y&quot; changed to nothing.</li><li>Bioinformatics: Aminoacid tokenisation added</li><li>GPU selection added (not yet exported)</li><li>grouped convolutions fixed</li></ul><h3 id="v1.3"><a class="docs-heading-anchor" href="#v1.3">v1.3</a><a id="v1.3-1"></a><a class="docs-heading-anchor-permalink" href="#v1.3" title="Permalink"></a></h3><ul><li>Transformer API added for Bert-like architectures</li><li>Transformer example</li><li>ramp-up of beta added to VAE</li><li>disambiguate vae signature</li></ul><h3 id="v1.2"><a class="docs-heading-anchor" href="#v1.2">v1.2</a><a id="v1.2-1"></a><a class="docs-heading-anchor-permalink" href="#v1.2" title="Permalink"></a></h3><ul><li>imagenet preprocessing fixed for vgg and resnet</li><li>ResNetBlock added</li><li>ResNet added</li><li>Padding layer added</li><li>print_network changed to summary</li><li>Pretrained nets saved at zenodo and simplified constructors added</li><li>AbstractNN and AbstractLayer added</li><li>copy model and save/load as JLD2 added</li></ul><h3 id="v1.1.2"><a class="docs-heading-anchor" href="#v1.1.2">v1.1.2</a><a id="v1.1.2-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.2" title="Permalink"></a></h3><ul><li>Depthwise conv-layer added (experimental)</li><li>focal loss functions added to classifier </li><li>FeatureSelection layer added</li><li>explicit signature added for 3d-convolution</li><li>train: possibility to disable tensorboard logs</li><li>train: possibility to return losses and accs for  plotting after training</li></ul><h3 id="v1.1.1"><a class="docs-heading-anchor" href="#v1.1.1">v1.1.1</a><a id="v1.1.1-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.1" title="Permalink"></a></h3><ul><li>some docstring cosmetics</li><li>Activation Layers added</li><li>layer GlobalAveragePoling added</li><li>pre-trained vgg example fixed for changed &quot;import-HDF&quot;-interface</li><li>hdf5 import with all kwargs possible</li><li>added: Layer + Layer = Chain</li><li>changelog added to docu</li></ul><h3 id="v1.1.0"><a class="docs-heading-anchor" href="#v1.1.0">v1.1.0</a><a id="v1.1.0-1"></a><a class="docs-heading-anchor-permalink" href="#v1.1.0" title="Permalink"></a></h3><ul><li>documentation for release added</li><li>split_minibatches() made stable (never returns an empty iterator)</li><li>docs slightly re-organised</li><li>Gaussian Layer added</li><li>minibatch iterator for masking added</li></ul><h3 id="v1.0.0"><a class="docs-heading-anchor" href="#v1.0.0">v1.0.0</a><a id="v1.0.0-1"></a><a class="docs-heading-anchor-permalink" href="#v1.0.0" title="Permalink"></a></h3><ul><li>initial release</li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../license/">« License</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/examples/index.html b/dev/examples/index.html
index 8f14368f..22c5c30d 100644
--- a/dev/examples/index.html
+++ b/dev/examples/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Examples · NNHelferlein.jl</title><meta name="title" content="Examples · NNHelferlein.jl"/><meta property="og:title" content="Examples · NNHelferlein.jl"/><meta property="twitter:title" content="Examples · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li class="is-active"><a class="tocitem" href>Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Examples</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Examples</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/examples.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Examples"><a class="docs-heading-anchor" href="#Examples">Examples</a><a id="Examples-1"></a><a class="docs-heading-anchor-permalink" href="#Examples" title="Permalink"></a></h1><p>Examples may be used as templates for new projects...     All examples are at <a href="https://github.com/KnetML/NNHelferlein.jl/tree/main/examples">GitHub/examples</a>:</p><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/01-simple-mlp.ipynb"><code>Simple MLP</code></a>: A simple multi-layer perceptron for MNIST classification, build with Knet and <em>Helferlein</em>-types in just one line of code (or so).</li></ul><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/10-simple-lenet.ipynb"><code>Simple LeNet</code></a>: A simple LeNet for MNIST classification,  build with help of the <em>Helferlein</em> layers in just two (ok: long) lines of code. </p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/11-focal_loss.ipynb"><code>Training unbalanced data with help of a focal loss function</code></a>: A simple MLP with focal loss demonstrate classification of highly unbalanced data.</p></li></ul><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/30-ae.ipynb"><code>Vanilla Autoencoder</code></a>: A simple autoencoder design with help of <em>Knet</em> in <em>Helferlein</em>-style.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/31-cae.ipynb"><code>Convolutional Autoencoder</code></a>: A convolutional autoencoder design with help of <em>Knet</em> in <em>Helferlein</em>-style.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/32-vae.ipynb"><code>Variational Autoencoder</code></a>: Example for a simple VAE utilising the NNHelferlein-type <code>VAE</code> and demonstrating the fascinating regularisation of a VAE.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/60-s2s-nlp-gru.ipynb"><code>Simple sequence-to-sequence network</code></a>: Simple s2s network to demonstrate how to setup macghine translation with  a rnn.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/61-RNN_bi_attn.ipynb"><code>Sequence-to-sequence RNN for machine translation</code></a>: RNN to demonstrate how to setup machine translation with  a bidirectional encoder RNN and attention.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/62-ECG-tagger.ipynb"><code>RNN Sequence tagger for annotation of ECGs</code></a>: RNN to demonstrate how to set-up a sequence tagger to detect heart beats. Only one layer with 8 units is necessary to achieve almost 100% correct predictions.  The example includes the definition on peephole LSTMs to display how to integrate non-standard rnn-units with the <em>NNHelfrelein</em> framework.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/75-keras-model.ipynb"><code>Import a Keras model</code></a>: The notebook shows the import of a pretrained VGG16 model from Tensorflow/Keras into a Knet-style CNN and its application to example images utilising the <em>Helferlein</em> imagenet-utilities.</p></li></ul><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/80-transformer.ipynb"><code>Transformer for machine translation</code></a>: A simple transformer architecture is set up according to the 2017 Vaswani paper <em>Attention is All You Need</em> with help of  <em>NNHelferlein</em>-utils.</li></ul><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/81-transformer-api.ipynb"><code>Simple Transformer API for Bert-like architectures</code></a>: A simple transformer architecture is set up with the <em>NNHelferlein</em> transformer API.</li></ul><h3 id="Pretrained-Nets"><a class="docs-heading-anchor" href="#Pretrained-Nets">Pretrained Nets</a><a id="Pretrained-Nets-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-Nets" title="Permalink"></a></h3><p>Based on the Keras import constructors, it is easy to  import  pretrained models from the TF/Keras ecosystem.</p><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/70-pretrained_vgg16.ipynb"><code>VGG16</code></a></p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/71-pretrained_resnet50v2.ipynb"><code>ResNet50 V2</code></a></p></li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../overview/">« Overview</a><a class="docs-footer-nextpage" href="../api_overview/">API Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Examples · NNHelferlein.jl</title><meta name="title" content="Examples · NNHelferlein.jl"/><meta property="og:title" content="Examples · NNHelferlein.jl"/><meta property="twitter:title" content="Examples · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li class="is-active"><a class="tocitem" href>Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Examples</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Examples</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/examples.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Examples"><a class="docs-heading-anchor" href="#Examples">Examples</a><a id="Examples-1"></a><a class="docs-heading-anchor-permalink" href="#Examples" title="Permalink"></a></h1><p>Examples may be used as templates for new projects...     All examples are at <a href="https://github.com/KnetML/NNHelferlein.jl/tree/main/examples">GitHub/examples</a>:</p><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/01-simple-mlp.ipynb"><code>Simple MLP</code></a>: A simple multi-layer perceptron for MNIST classification, build with Knet and <em>Helferlein</em>-types in just one line of code (or so).</li></ul><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/10-simple-lenet.ipynb"><code>Simple LeNet</code></a>: A simple LeNet for MNIST classification,  build with help of the <em>Helferlein</em> layers in just two (ok: long) lines of code. </p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/11-focal_loss.ipynb"><code>Training unbalanced data with help of a focal loss function</code></a>: A simple MLP with focal loss demonstrate classification of highly unbalanced data.</p></li></ul><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/30-ae.ipynb"><code>Vanilla Autoencoder</code></a>: A simple autoencoder design with help of <em>Knet</em> in <em>Helferlein</em>-style.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/31-cae.ipynb"><code>Convolutional Autoencoder</code></a>: A convolutional autoencoder design with help of <em>Knet</em> in <em>Helferlein</em>-style.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/32-vae.ipynb"><code>Variational Autoencoder</code></a>: Example for a simple VAE utilising the NNHelferlein-type <code>VAE</code> and demonstrating the fascinating regularisation of a VAE.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/60-s2s-nlp-gru.ipynb"><code>Simple sequence-to-sequence network</code></a>: Simple s2s network to demonstrate how to setup macghine translation with  a rnn.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/61-RNN_bi_attn.ipynb"><code>Sequence-to-sequence RNN for machine translation</code></a>: RNN to demonstrate how to setup machine translation with  a bidirectional encoder RNN and attention.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/62-ECG-tagger.ipynb"><code>RNN Sequence tagger for annotation of ECGs</code></a>: RNN to demonstrate how to set-up a sequence tagger to detect heart beats. Only one layer with 8 units is necessary to achieve almost 100% correct predictions.  The example includes the definition on peephole LSTMs to display how to integrate non-standard rnn-units with the <em>NNHelfrelein</em> framework.</p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/75-keras-model.ipynb"><code>Import a Keras model</code></a>: The notebook shows the import of a pretrained VGG16 model from Tensorflow/Keras into a Knet-style CNN and its application to example images utilising the <em>Helferlein</em> imagenet-utilities.</p></li></ul><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/80-transformer.ipynb"><code>Transformer for machine translation</code></a>: A simple transformer architecture is set up according to the 2017 Vaswani paper <em>Attention is All You Need</em> with help of  <em>NNHelferlein</em>-utils.</li></ul><ul><li><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/81-transformer-api.ipynb"><code>Simple Transformer API for Bert-like architectures</code></a>: A simple transformer architecture is set up with the <em>NNHelferlein</em> transformer API.</li></ul><h3 id="Pretrained-Nets"><a class="docs-heading-anchor" href="#Pretrained-Nets">Pretrained Nets</a><a id="Pretrained-Nets-1"></a><a class="docs-heading-anchor-permalink" href="#Pretrained-Nets" title="Permalink"></a></h3><p>Based on the Keras import constructors, it is easy to  import  pretrained models from the TF/Keras ecosystem.</p><ul><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/70-pretrained_vgg16.ipynb"><code>VGG16</code></a></p></li><li><p><a href="https://github.com/KnetML/NNHelferlein.jl/blob/main/examples/71-pretrained_resnet50v2.ipynb"><code>ResNet50 V2</code></a></p></li></ul></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../overview/">« Overview</a><a class="docs-footer-nextpage" href="../api_overview/">API Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/index.html b/dev/index.html
index 5d9a28c7..f5473cec 100644
--- a/dev/index.html
+++ b/dev/index.html
@@ -135,4 +135,4 @@
 y = Int8[5, 10, 4, 1, 9, 2, 1, 3]
 2.3798099f0</code></pre><p>... or with an iterator of minibatches to get the mean loss for the dataset:</p><pre><code class="language-julia hljs">julia&gt; lenet(dtrn)
 
-2.6070921f0</code></pre><p>The next step is to have a look at the examples in the GitHub repo:</p><ul><li><a href="examples/#Examples">Examples</a></li></ul><h2 id="Overview"><a class="docs-heading-anchor" href="#Overview">Overview</a><a id="Overview-1"></a><a class="docs-heading-anchor-permalink" href="#Overview" title="Permalink"></a></h2><ul><li><a href="overview/#Overview">Overview</a></li><li class="no-marker"><ul><li><a href="overview/#Neural-network-definitions">Neural network definitions</a></li><li><a href="overview/#Layer-definitions">Layer definitions</a></li><li><a href="overview/#Attention-Mechanisms">Attention Mechanisms</a></li><li><a href="overview/#Data-provider">Data provider</a></li><li><a href="overview/#Minibatch-iteration-utilities">Minibatch iteration utilities</a></li><li><a href="overview/#Working-with-pretrained-networks">Working with pretrained networks</a></li><li><a href="overview/#Training">Training</a></li><li><a href="overview/#Utilities">Utilities</a></li><li><a href="overview/#Bioinformatics">Bioinformatics</a></li></ul></li></ul><h2 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h2><p>Some datasets as playground-data are provided with the package. Maybe more will follow...</p><ul><li><p><em>MIT Normal Sinus Rhythm Database</em> is a modified version of the  Physionet dataset, adapted for use in machine leraning (see the docstring of <code>dataset_mit_nsr()</code> for details).</p></li><li><p>the famous <em>MNIST</em> dataset.</p></li><li><p>R.A. Fisher&#39;s <em>Iris</em> dataset.</p></li></ul><h2 id="API-Reference"><a class="docs-heading-anchor" href="#API-Reference">API Reference</a><a id="API-Reference-1"></a><a class="docs-heading-anchor-permalink" href="#API-Reference" title="Permalink"></a></h2><ul><li><a href="api/#Chains">Chains</a></li><li><a href="api/#Layers">Layers</a></li><li class="no-marker"><ul><li><a href="api/#Fully-connected-layers">Fully connected layers</a></li><li><a href="api/#Convolutional">Convolutional</a></li><li><a href="api/#Recurrent">Recurrent</a></li><li><a href="api/#Transformers">Transformers</a></li><li><a href="api/#Others">Others</a></li><li><a href="api/#Attention-Mechanisms">Attention Mechanisms</a></li></ul></li><li><a href="api/#Data-providers">Data providers</a></li><li class="no-marker"><ul><li><a href="api/#Iteration-utilities">Iteration utilities</a></li><li><a href="api/#Tabular-data">Tabular data</a></li><li><a href="api/#Image-data">Image data</a></li><li><a href="api/#Text-data">Text data</a></li></ul></li><li><a href="api/#Training">Training</a></li><li><a href="api/#Evaluation-and-accuracy">Evaluation and accuracy</a></li><li><a href="api/#ImageNet-tools">ImageNet tools</a></li><li><a href="api/#Other-utils">Other utils</a></li><li class="no-marker"><ul><li><a href="api/#Layers-and-helpers-for-transformers">Layers and helpers for transformers</a></li><li><a href="api/#Utils-for-array-manipulation">Utils for array manipulation</a></li><li><a href="api/#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a></li><li><a href="api/#Utils-for-Bioinformatics">Utils for Bioinformatics</a></li><li><a href="api/#Saving,-loading-and-inspection-of-models">Saving, loading and inspection of models</a></li><li><a href="api/#Datasets">Datasets</a></li></ul></li><li><a href="api/#Pretrained-networks">Pretrained networks</a></li></ul><h2 id="Index"><a class="docs-heading-anchor" href="#Index">Index</a><a id="Index-1"></a><a class="docs-heading-anchor-permalink" href="#Index" title="Permalink"></a></h2><ul><li><a href="api/#NNHelferlein.AbstractChain"><code>NNHelferlein.AbstractChain</code></a></li><li><a href="api/#NNHelferlein.AbstractLayer"><code>NNHelferlein.AbstractLayer</code></a></li><li><a href="api/#NNHelferlein.AbstractNN"><code>NNHelferlein.AbstractNN</code></a></li><li><a href="api/#NNHelferlein.Activation"><code>NNHelferlein.Activation</code></a></li><li><a href="api/#NNHelferlein.AttentionMechanism"><code>NNHelferlein.AttentionMechanism</code></a></li><li><a href="api/#NNHelferlein.AttnBahdanau"><code>NNHelferlein.AttnBahdanau</code></a></li><li><a href="api/#NNHelferlein.AttnDot"><code>NNHelferlein.AttnDot</code></a></li><li><a href="api/#NNHelferlein.AttnInFeed"><code>NNHelferlein.AttnInFeed</code></a></li><li><a href="api/#NNHelferlein.AttnLocation"><code>NNHelferlein.AttnLocation</code></a></li><li><a href="api/#NNHelferlein.AttnLuong"><code>NNHelferlein.AttnLuong</code></a></li><li><a href="api/#NNHelferlein.BatchNorm"><code>NNHelferlein.BatchNorm</code></a></li><li><a href="api/#NNHelferlein.Chain"><code>NNHelferlein.Chain</code></a></li><li><a href="api/#NNHelferlein.Classifier"><code>NNHelferlein.Classifier</code></a></li><li><a href="api/#NNHelferlein.Conv"><code>NNHelferlein.Conv</code></a></li><li><a href="api/#NNHelferlein.DataLoader"><code>NNHelferlein.DataLoader</code></a></li><li><a href="api/#NNHelferlein.DeConv"><code>NNHelferlein.DeConv</code></a></li><li><a href="api/#NNHelferlein.Dense"><code>NNHelferlein.Dense</code></a></li><li><a href="api/#NNHelferlein.DepthwiseConv"><code>NNHelferlein.DepthwiseConv</code></a></li><li><a href="api/#NNHelferlein.Dropout"><code>NNHelferlein.Dropout</code></a></li><li><a href="api/#NNHelferlein.Embed"><code>NNHelferlein.Embed</code></a></li><li><a href="api/#NNHelferlein.EmbedAminoAcids"><code>NNHelferlein.EmbedAminoAcids</code></a></li><li><a href="api/#NNHelferlein.FeatureSelection"><code>NNHelferlein.FeatureSelection</code></a></li><li><a href="api/#NNHelferlein.Flat"><code>NNHelferlein.Flat</code></a></li><li><a href="api/#NNHelferlein.GPUIterator"><code>NNHelferlein.GPUIterator</code></a></li><li><a href="api/#NNHelferlein.GaussianNoise"><code>NNHelferlein.GaussianNoise</code></a></li><li><a href="api/#NNHelferlein.GlobalAveragePooling"><code>NNHelferlein.GlobalAveragePooling</code></a></li><li><a href="api/#NNHelferlein.ImageLoader"><code>NNHelferlein.ImageLoader</code></a></li><li><a href="api/#NNHelferlein.LayerNorm"><code>NNHelferlein.LayerNorm</code></a></li><li><a href="api/#NNHelferlein.Linear"><code>NNHelferlein.Linear</code></a></li><li><a href="api/#NNHelferlein.Logistic"><code>NNHelferlein.Logistic</code></a></li><li><a href="api/#NNHelferlein.MBMasquerade"><code>NNHelferlein.MBMasquerade</code></a></li><li><a href="api/#NNHelferlein.MBNoiser"><code>NNHelferlein.MBNoiser</code></a></li><li><a href="api/#NNHelferlein.MultiHeadAttn"><code>NNHelferlein.MultiHeadAttn</code></a></li><li><a href="api/#NNHelferlein.Pad"><code>NNHelferlein.Pad</code></a></li><li><a href="api/#NNHelferlein.PartialIterator"><code>NNHelferlein.PartialIterator</code></a></li><li><a href="api/#NNHelferlein.Pool"><code>NNHelferlein.Pool</code></a></li><li><a href="api/#NNHelferlein.PositionalEncoding"><code>NNHelferlein.PositionalEncoding</code></a></li><li><a href="api/#NNHelferlein.PyFlat"><code>NNHelferlein.PyFlat</code></a></li><li><a href="api/#NNHelferlein.Recurrent"><code>NNHelferlein.Recurrent</code></a></li><li><a href="api/#NNHelferlein.RecurrentUnit"><code>NNHelferlein.RecurrentUnit</code></a></li><li><a href="api/#NNHelferlein.Regressor"><code>NNHelferlein.Regressor</code></a></li><li><a href="api/#NNHelferlein.ResNetBlock"><code>NNHelferlein.ResNetBlock</code></a></li><li><a href="api/#NNHelferlein.SequenceData"><code>NNHelferlein.SequenceData</code></a></li><li><a href="api/#NNHelferlein.Softmax"><code>NNHelferlein.Softmax</code></a></li><li><a href="api/#NNHelferlein.TFDecoder"><code>NNHelferlein.TFDecoder</code></a></li><li><a href="api/#NNHelferlein.TFDecoderLayer"><code>NNHelferlein.TFDecoderLayer</code></a></li><li><a href="api/#NNHelferlein.TFEncoder"><code>NNHelferlein.TFEncoder</code></a></li><li><a href="api/#NNHelferlein.TFEncoderLayer"><code>NNHelferlein.TFEncoderLayer</code></a></li><li><a href="api/#NNHelferlein.TokenTransformer"><code>NNHelferlein.TokenTransformer</code></a></li><li><a href="api/#NNHelferlein.Transformer"><code>NNHelferlein.Transformer</code></a></li><li><a href="api/#NNHelferlein.UnPool"><code>NNHelferlein.UnPool</code></a></li><li><a href="api/#NNHelferlein.VAE"><code>NNHelferlein.VAE</code></a></li><li><a href="api/#NNHelferlein.WordTokenizer"><code>NNHelferlein.WordTokenizer</code></a></li><li><a href="api/#Base.:+"><code>Base.:+</code></a></li><li><a href="api/#Base.summary"><code>Base.summary</code></a></li><li><a href="api/#NNHelferlein.abs_error_acc"><code>NNHelferlein.abs_error_acc</code></a></li><li><a href="api/#NNHelferlein.add_layer!"><code>NNHelferlein.add_layer!</code></a></li><li><a href="api/#NNHelferlein.aminoacid_tokenizer"><code>NNHelferlein.aminoacid_tokenizer</code></a></li><li><a href="api/#NNHelferlein.array2RGB"><code>NNHelferlein.array2RGB</code></a></li><li><a href="api/#NNHelferlein.array2image"><code>NNHelferlein.array2image</code></a></li><li><a href="api/#NNHelferlein.blowup_array"><code>NNHelferlein.blowup_array</code></a></li><li><a href="api/#NNHelferlein.clean_sentence"><code>NNHelferlein.clean_sentence</code></a></li><li><a href="api/#NNHelferlein.confusion_matrix"><code>NNHelferlein.confusion_matrix</code></a></li><li><a href="api/#NNHelferlein.convert2CuArray"><code>NNHelferlein.convert2CuArray</code></a></li><li><a href="api/#NNHelferlein.convert2KnetArray"><code>NNHelferlein.convert2KnetArray</code></a></li><li><a href="api/#NNHelferlein.copy_network"><code>NNHelferlein.copy_network</code></a></li><li><a href="api/#NNHelferlein.crop_array"><code>NNHelferlein.crop_array</code></a></li><li><a href="api/#NNHelferlein.dataframe_minibatch"><code>NNHelferlein.dataframe_minibatch</code></a></li><li><a href="api/#NNHelferlein.dataframe_read"><code>NNHelferlein.dataframe_read</code></a></li><li><a href="api/#NNHelferlein.dataframe_split"><code>NNHelferlein.dataframe_split</code></a></li><li><a href="api/#NNHelferlein.dataset_fashion_mnist"><code>NNHelferlein.dataset_fashion_mnist</code></a></li><li><a href="api/#NNHelferlein.dataset_iris"><code>NNHelferlein.dataset_iris</code></a></li><li><a href="api/#NNHelferlein.dataset_mit_nsr"><code>NNHelferlein.dataset_mit_nsr</code></a></li><li><a href="api/#NNHelferlein.dataset_mnist"><code>NNHelferlein.dataset_mnist</code></a></li><li><a href="api/#NNHelferlein.dataset_pfam"><code>NNHelferlein.dataset_pfam</code></a></li><li><a href="api/#NNHelferlein.de_embed"><code>NNHelferlein.de_embed</code></a></li><li><a href="api/#NNHelferlein.dot_prod_attn"><code>NNHelferlein.dot_prod_attn</code></a></li><li><a href="api/#NNHelferlein.embed_blosum62"><code>NNHelferlein.embed_blosum62</code></a></li><li><a href="api/#NNHelferlein.embed_vhse8"><code>NNHelferlein.embed_vhse8</code></a></li><li><a href="api/#NNHelferlein.emptyCuArray"><code>NNHelferlein.emptyCuArray</code></a></li><li><a href="api/#NNHelferlein.flatten"><code>NNHelferlein.flatten</code></a></li><li><a href="api/#NNHelferlein.focal_bce"><code>NNHelferlein.focal_bce</code></a></li><li><a href="api/#NNHelferlein.focal_nll"><code>NNHelferlein.focal_nll</code></a></li><li><a href="api/#NNHelferlein.get_beta"><code>NNHelferlein.get_beta</code></a></li><li><a href="api/#NNHelferlein.get_cell_states"><code>NNHelferlein.get_cell_states</code></a></li><li><a href="api/#NNHelferlein.get_class_labels"><code>NNHelferlein.get_class_labels</code></a></li><li><a href="api/#NNHelferlein.get_hidden_states"><code>NNHelferlein.get_hidden_states</code></a></li><li><a href="api/#NNHelferlein.get_imagenet_classes"><code>NNHelferlein.get_imagenet_classes</code></a></li><li><a href="api/#NNHelferlein.get_resnet50v2"><code>NNHelferlein.get_resnet50v2</code></a></li><li><a href="api/#NNHelferlein.get_tatoeba_corpus"><code>NNHelferlein.get_tatoeba_corpus</code></a></li><li><a href="api/#NNHelferlein.get_vgg16"><code>NNHelferlein.get_vgg16</code></a></li><li><a href="api/#NNHelferlein.global_average_pooling"><code>NNHelferlein.global_average_pooling</code></a></li><li><a href="api/#NNHelferlein.hamming_dist"><code>NNHelferlein.hamming_dist</code></a></li><li><a href="api/#NNHelferlein.ifgpu"><code>NNHelferlein.ifgpu</code></a></li><li><a href="api/#NNHelferlein.image2array"><code>NNHelferlein.image2array</code></a></li><li><a href="api/#NNHelferlein.init0"><code>NNHelferlein.init0</code></a></li><li><a href="api/#NNHelferlein.load_network"><code>NNHelferlein.load_network</code></a></li><li><a href="api/#NNHelferlein.merge_heads"><code>NNHelferlein.merge_heads</code></a></li><li><a href="api/#NNHelferlein.minibatch_eval"><code>NNHelferlein.minibatch_eval</code></a></li><li><a href="api/#NNHelferlein.mk_class_ids"><code>NNHelferlein.mk_class_ids</code></a></li><li><a href="api/#NNHelferlein.mk_image_minibatch"><code>NNHelferlein.mk_image_minibatch</code></a></li><li><a href="api/#NNHelferlein.mk_padding_mask"><code>NNHelferlein.mk_padding_mask</code></a></li><li><a href="api/#NNHelferlein.mk_peek_ahead_mask"><code>NNHelferlein.mk_peek_ahead_mask</code></a></li><li><a href="api/#NNHelferlein.pad_sequence"><code>NNHelferlein.pad_sequence</code></a></li><li><a href="api/#NNHelferlein.peak_finder_acc"><code>NNHelferlein.peak_finder_acc</code></a></li><li><a href="api/#NNHelferlein.positional_encoding_sincos"><code>NNHelferlein.positional_encoding_sincos</code></a></li><li><a href="api/#NNHelferlein.predict"><code>NNHelferlein.predict</code></a></li><li><a href="api/#NNHelferlein.predict_imagenet"><code>NNHelferlein.predict_imagenet</code></a></li><li><a href="api/#NNHelferlein.predict_top5"><code>NNHelferlein.predict_top5</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_resnet"><code>NNHelferlein.preproc_imagenet_resnet</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_resnetv2"><code>NNHelferlein.preproc_imagenet_resnetv2</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_vgg"><code>NNHelferlein.preproc_imagenet_vgg</code></a></li><li><a href="api/#NNHelferlein.print_network"><code>NNHelferlein.print_network</code></a></li><li><a href="api/#NNHelferlein.recycle_array"><code>NNHelferlein.recycle_array</code></a></li><li><a href="api/#NNHelferlein.reset_cell_states!"><code>NNHelferlein.reset_cell_states!</code></a></li><li><a href="api/#NNHelferlein.reset_hidden_states!"><code>NNHelferlein.reset_hidden_states!</code></a></li><li><a href="api/#NNHelferlein.save_network"><code>NNHelferlein.save_network</code></a></li><li><a href="api/#NNHelferlein.separate_heads"><code>NNHelferlein.separate_heads</code></a></li><li><a href="api/#NNHelferlein.sequence_minibatch"><code>NNHelferlein.sequence_minibatch</code></a></li><li><a href="api/#NNHelferlein.set_beta!"><code>NNHelferlein.set_beta!</code></a></li><li><a href="api/#NNHelferlein.set_cell_states!"><code>NNHelferlein.set_cell_states!</code></a></li><li><a href="api/#NNHelferlein.set_hidden_states!"><code>NNHelferlein.set_hidden_states!</code></a></li><li><a href="api/#NNHelferlein.split_minibatches"><code>NNHelferlein.split_minibatches</code></a></li><li><a href="api/#NNHelferlein.squared_error_acc"><code>NNHelferlein.squared_error_acc</code></a></li><li><a href="api/#NNHelferlein.tb_train!"><code>NNHelferlein.tb_train!</code></a></li><li><a href="api/#NNHelferlein.truncate_sequence"><code>NNHelferlein.truncate_sequence</code></a></li></ul><h2 id="Changelog"><a class="docs-heading-anchor" href="#Changelog">Changelog</a><a id="Changelog-1"></a><a class="docs-heading-anchor-permalink" href="#Changelog" title="Permalink"></a></h2><p>The history can be found here: <a href="changelog/#ChangeLog-of-NNHelferlein-package">ChangeLog of NNHelferlein package</a></p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="overview/">Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+2.6070921f0</code></pre><p>The next step is to have a look at the examples in the GitHub repo:</p><ul><li><a href="examples/#Examples">Examples</a></li></ul><h2 id="Overview"><a class="docs-heading-anchor" href="#Overview">Overview</a><a id="Overview-1"></a><a class="docs-heading-anchor-permalink" href="#Overview" title="Permalink"></a></h2><ul><li><a href="overview/#Overview">Overview</a></li><li class="no-marker"><ul><li><a href="overview/#Neural-network-definitions">Neural network definitions</a></li><li><a href="overview/#Layer-definitions">Layer definitions</a></li><li><a href="overview/#Attention-Mechanisms">Attention Mechanisms</a></li><li><a href="overview/#Data-provider">Data provider</a></li><li><a href="overview/#Minibatch-iteration-utilities">Minibatch iteration utilities</a></li><li><a href="overview/#Working-with-pretrained-networks">Working with pretrained networks</a></li><li><a href="overview/#Training">Training</a></li><li><a href="overview/#Utilities">Utilities</a></li><li><a href="overview/#Bioinformatics">Bioinformatics</a></li></ul></li></ul><h2 id="Datasets"><a class="docs-heading-anchor" href="#Datasets">Datasets</a><a id="Datasets-1"></a><a class="docs-heading-anchor-permalink" href="#Datasets" title="Permalink"></a></h2><p>Some datasets as playground-data are provided with the package. Maybe more will follow...</p><ul><li><p><em>MIT Normal Sinus Rhythm Database</em> is a modified version of the  Physionet dataset, adapted for use in machine leraning (see the docstring of <code>dataset_mit_nsr()</code> for details).</p></li><li><p>the famous <em>MNIST</em> dataset.</p></li><li><p>R.A. Fisher&#39;s <em>Iris</em> dataset.</p></li></ul><h2 id="API-Reference"><a class="docs-heading-anchor" href="#API-Reference">API Reference</a><a id="API-Reference-1"></a><a class="docs-heading-anchor-permalink" href="#API-Reference" title="Permalink"></a></h2><ul><li><a href="api/#Chains">Chains</a></li><li><a href="api/#Layers">Layers</a></li><li class="no-marker"><ul><li><a href="api/#Fully-connected-layers">Fully connected layers</a></li><li><a href="api/#Convolutional">Convolutional</a></li><li><a href="api/#Recurrent">Recurrent</a></li><li><a href="api/#Transformers">Transformers</a></li><li><a href="api/#Others">Others</a></li><li><a href="api/#Attention-Mechanisms">Attention Mechanisms</a></li></ul></li><li><a href="api/#Data-providers">Data providers</a></li><li class="no-marker"><ul><li><a href="api/#Iteration-utilities">Iteration utilities</a></li><li><a href="api/#Tabular-data">Tabular data</a></li><li><a href="api/#Image-data">Image data</a></li><li><a href="api/#Text-data">Text data</a></li></ul></li><li><a href="api/#Training">Training</a></li><li><a href="api/#Evaluation-and-accuracy">Evaluation and accuracy</a></li><li><a href="api/#ImageNet-tools">ImageNet tools</a></li><li><a href="api/#Other-utils">Other utils</a></li><li class="no-marker"><ul><li><a href="api/#Layers-and-helpers-for-transformers">Layers and helpers for transformers</a></li><li><a href="api/#Utils-for-array-manipulation">Utils for array manipulation</a></li><li><a href="api/#Utils-for-fixing-types-in-GPU-context">Utils for fixing types in GPU context</a></li><li><a href="api/#Utils-for-Bioinformatics">Utils for Bioinformatics</a></li><li><a href="api/#Saving,-loading-and-inspection-of-models">Saving, loading and inspection of models</a></li><li><a href="api/#Datasets">Datasets</a></li></ul></li><li><a href="api/#Pretrained-networks">Pretrained networks</a></li></ul><h2 id="Index"><a class="docs-heading-anchor" href="#Index">Index</a><a id="Index-1"></a><a class="docs-heading-anchor-permalink" href="#Index" title="Permalink"></a></h2><ul><li><a href="api/#NNHelferlein.AbstractChain"><code>NNHelferlein.AbstractChain</code></a></li><li><a href="api/#NNHelferlein.AbstractLayer"><code>NNHelferlein.AbstractLayer</code></a></li><li><a href="api/#NNHelferlein.AbstractNN"><code>NNHelferlein.AbstractNN</code></a></li><li><a href="api/#NNHelferlein.Activation"><code>NNHelferlein.Activation</code></a></li><li><a href="api/#NNHelferlein.AttentionMechanism"><code>NNHelferlein.AttentionMechanism</code></a></li><li><a href="api/#NNHelferlein.AttnBahdanau"><code>NNHelferlein.AttnBahdanau</code></a></li><li><a href="api/#NNHelferlein.AttnDot"><code>NNHelferlein.AttnDot</code></a></li><li><a href="api/#NNHelferlein.AttnInFeed"><code>NNHelferlein.AttnInFeed</code></a></li><li><a href="api/#NNHelferlein.AttnLocation"><code>NNHelferlein.AttnLocation</code></a></li><li><a href="api/#NNHelferlein.AttnLuong"><code>NNHelferlein.AttnLuong</code></a></li><li><a href="api/#NNHelferlein.BatchNorm"><code>NNHelferlein.BatchNorm</code></a></li><li><a href="api/#NNHelferlein.Chain"><code>NNHelferlein.Chain</code></a></li><li><a href="api/#NNHelferlein.Classifier"><code>NNHelferlein.Classifier</code></a></li><li><a href="api/#NNHelferlein.Conv"><code>NNHelferlein.Conv</code></a></li><li><a href="api/#NNHelferlein.DataLoader"><code>NNHelferlein.DataLoader</code></a></li><li><a href="api/#NNHelferlein.DeConv"><code>NNHelferlein.DeConv</code></a></li><li><a href="api/#NNHelferlein.Dense"><code>NNHelferlein.Dense</code></a></li><li><a href="api/#NNHelferlein.DepthwiseConv"><code>NNHelferlein.DepthwiseConv</code></a></li><li><a href="api/#NNHelferlein.Dropout"><code>NNHelferlein.Dropout</code></a></li><li><a href="api/#NNHelferlein.Embed"><code>NNHelferlein.Embed</code></a></li><li><a href="api/#NNHelferlein.EmbedAminoAcids"><code>NNHelferlein.EmbedAminoAcids</code></a></li><li><a href="api/#NNHelferlein.FeatureSelection"><code>NNHelferlein.FeatureSelection</code></a></li><li><a href="api/#NNHelferlein.Flat"><code>NNHelferlein.Flat</code></a></li><li><a href="api/#NNHelferlein.GPUIterator"><code>NNHelferlein.GPUIterator</code></a></li><li><a href="api/#NNHelferlein.GaussianNoise"><code>NNHelferlein.GaussianNoise</code></a></li><li><a href="api/#NNHelferlein.GlobalAveragePooling"><code>NNHelferlein.GlobalAveragePooling</code></a></li><li><a href="api/#NNHelferlein.ImageLoader"><code>NNHelferlein.ImageLoader</code></a></li><li><a href="api/#NNHelferlein.LayerNorm"><code>NNHelferlein.LayerNorm</code></a></li><li><a href="api/#NNHelferlein.Linear"><code>NNHelferlein.Linear</code></a></li><li><a href="api/#NNHelferlein.Logistic"><code>NNHelferlein.Logistic</code></a></li><li><a href="api/#NNHelferlein.MBMasquerade"><code>NNHelferlein.MBMasquerade</code></a></li><li><a href="api/#NNHelferlein.MBNoiser"><code>NNHelferlein.MBNoiser</code></a></li><li><a href="api/#NNHelferlein.MultiHeadAttn"><code>NNHelferlein.MultiHeadAttn</code></a></li><li><a href="api/#NNHelferlein.Pad"><code>NNHelferlein.Pad</code></a></li><li><a href="api/#NNHelferlein.PartialIterator"><code>NNHelferlein.PartialIterator</code></a></li><li><a href="api/#NNHelferlein.Pool"><code>NNHelferlein.Pool</code></a></li><li><a href="api/#NNHelferlein.PositionalEncoding"><code>NNHelferlein.PositionalEncoding</code></a></li><li><a href="api/#NNHelferlein.PyFlat"><code>NNHelferlein.PyFlat</code></a></li><li><a href="api/#NNHelferlein.Recurrent"><code>NNHelferlein.Recurrent</code></a></li><li><a href="api/#NNHelferlein.RecurrentUnit"><code>NNHelferlein.RecurrentUnit</code></a></li><li><a href="api/#NNHelferlein.Regressor"><code>NNHelferlein.Regressor</code></a></li><li><a href="api/#NNHelferlein.ResNetBlock"><code>NNHelferlein.ResNetBlock</code></a></li><li><a href="api/#NNHelferlein.SequenceData"><code>NNHelferlein.SequenceData</code></a></li><li><a href="api/#NNHelferlein.Softmax"><code>NNHelferlein.Softmax</code></a></li><li><a href="api/#NNHelferlein.TFDecoder"><code>NNHelferlein.TFDecoder</code></a></li><li><a href="api/#NNHelferlein.TFDecoderLayer"><code>NNHelferlein.TFDecoderLayer</code></a></li><li><a href="api/#NNHelferlein.TFEncoder"><code>NNHelferlein.TFEncoder</code></a></li><li><a href="api/#NNHelferlein.TFEncoderLayer"><code>NNHelferlein.TFEncoderLayer</code></a></li><li><a href="api/#NNHelferlein.TokenTransformer"><code>NNHelferlein.TokenTransformer</code></a></li><li><a href="api/#NNHelferlein.Transformer"><code>NNHelferlein.Transformer</code></a></li><li><a href="api/#NNHelferlein.UnPool"><code>NNHelferlein.UnPool</code></a></li><li><a href="api/#NNHelferlein.VAE"><code>NNHelferlein.VAE</code></a></li><li><a href="api/#NNHelferlein.WordTokenizer"><code>NNHelferlein.WordTokenizer</code></a></li><li><a href="api/#Base.:+"><code>Base.:+</code></a></li><li><a href="api/#Base.summary"><code>Base.summary</code></a></li><li><a href="api/#NNHelferlein.abs_error_acc"><code>NNHelferlein.abs_error_acc</code></a></li><li><a href="api/#NNHelferlein.add_layer!"><code>NNHelferlein.add_layer!</code></a></li><li><a href="api/#NNHelferlein.aminoacid_tokenizer"><code>NNHelferlein.aminoacid_tokenizer</code></a></li><li><a href="api/#NNHelferlein.array2RGB"><code>NNHelferlein.array2RGB</code></a></li><li><a href="api/#NNHelferlein.array2image"><code>NNHelferlein.array2image</code></a></li><li><a href="api/#NNHelferlein.blowup_array"><code>NNHelferlein.blowup_array</code></a></li><li><a href="api/#NNHelferlein.clean_sentence"><code>NNHelferlein.clean_sentence</code></a></li><li><a href="api/#NNHelferlein.confusion_matrix"><code>NNHelferlein.confusion_matrix</code></a></li><li><a href="api/#NNHelferlein.convert2CuArray"><code>NNHelferlein.convert2CuArray</code></a></li><li><a href="api/#NNHelferlein.convert2KnetArray"><code>NNHelferlein.convert2KnetArray</code></a></li><li><a href="api/#NNHelferlein.copy_network"><code>NNHelferlein.copy_network</code></a></li><li><a href="api/#NNHelferlein.crop_array"><code>NNHelferlein.crop_array</code></a></li><li><a href="api/#NNHelferlein.dataframe_minibatch"><code>NNHelferlein.dataframe_minibatch</code></a></li><li><a href="api/#NNHelferlein.dataframe_read"><code>NNHelferlein.dataframe_read</code></a></li><li><a href="api/#NNHelferlein.dataframe_split"><code>NNHelferlein.dataframe_split</code></a></li><li><a href="api/#NNHelferlein.dataset_fashion_mnist"><code>NNHelferlein.dataset_fashion_mnist</code></a></li><li><a href="api/#NNHelferlein.dataset_iris"><code>NNHelferlein.dataset_iris</code></a></li><li><a href="api/#NNHelferlein.dataset_mit_nsr"><code>NNHelferlein.dataset_mit_nsr</code></a></li><li><a href="api/#NNHelferlein.dataset_mnist"><code>NNHelferlein.dataset_mnist</code></a></li><li><a href="api/#NNHelferlein.dataset_pfam"><code>NNHelferlein.dataset_pfam</code></a></li><li><a href="api/#NNHelferlein.de_embed"><code>NNHelferlein.de_embed</code></a></li><li><a href="api/#NNHelferlein.dot_prod_attn"><code>NNHelferlein.dot_prod_attn</code></a></li><li><a href="api/#NNHelferlein.embed_blosum62"><code>NNHelferlein.embed_blosum62</code></a></li><li><a href="api/#NNHelferlein.embed_vhse8"><code>NNHelferlein.embed_vhse8</code></a></li><li><a href="api/#NNHelferlein.emptyCuArray"><code>NNHelferlein.emptyCuArray</code></a></li><li><a href="api/#NNHelferlein.flatten"><code>NNHelferlein.flatten</code></a></li><li><a href="api/#NNHelferlein.focal_bce"><code>NNHelferlein.focal_bce</code></a></li><li><a href="api/#NNHelferlein.focal_nll"><code>NNHelferlein.focal_nll</code></a></li><li><a href="api/#NNHelferlein.get_beta"><code>NNHelferlein.get_beta</code></a></li><li><a href="api/#NNHelferlein.get_cell_states"><code>NNHelferlein.get_cell_states</code></a></li><li><a href="api/#NNHelferlein.get_class_labels"><code>NNHelferlein.get_class_labels</code></a></li><li><a href="api/#NNHelferlein.get_hidden_states"><code>NNHelferlein.get_hidden_states</code></a></li><li><a href="api/#NNHelferlein.get_imagenet_classes"><code>NNHelferlein.get_imagenet_classes</code></a></li><li><a href="api/#NNHelferlein.get_resnet50v2"><code>NNHelferlein.get_resnet50v2</code></a></li><li><a href="api/#NNHelferlein.get_tatoeba_corpus"><code>NNHelferlein.get_tatoeba_corpus</code></a></li><li><a href="api/#NNHelferlein.get_vgg16"><code>NNHelferlein.get_vgg16</code></a></li><li><a href="api/#NNHelferlein.global_average_pooling"><code>NNHelferlein.global_average_pooling</code></a></li><li><a href="api/#NNHelferlein.hamming_dist"><code>NNHelferlein.hamming_dist</code></a></li><li><a href="api/#NNHelferlein.ifgpu"><code>NNHelferlein.ifgpu</code></a></li><li><a href="api/#NNHelferlein.image2array"><code>NNHelferlein.image2array</code></a></li><li><a href="api/#NNHelferlein.init0"><code>NNHelferlein.init0</code></a></li><li><a href="api/#NNHelferlein.load_network"><code>NNHelferlein.load_network</code></a></li><li><a href="api/#NNHelferlein.merge_heads"><code>NNHelferlein.merge_heads</code></a></li><li><a href="api/#NNHelferlein.minibatch_eval"><code>NNHelferlein.minibatch_eval</code></a></li><li><a href="api/#NNHelferlein.mk_class_ids"><code>NNHelferlein.mk_class_ids</code></a></li><li><a href="api/#NNHelferlein.mk_image_minibatch"><code>NNHelferlein.mk_image_minibatch</code></a></li><li><a href="api/#NNHelferlein.mk_padding_mask"><code>NNHelferlein.mk_padding_mask</code></a></li><li><a href="api/#NNHelferlein.mk_peek_ahead_mask"><code>NNHelferlein.mk_peek_ahead_mask</code></a></li><li><a href="api/#NNHelferlein.pad_sequence"><code>NNHelferlein.pad_sequence</code></a></li><li><a href="api/#NNHelferlein.peak_finder_acc"><code>NNHelferlein.peak_finder_acc</code></a></li><li><a href="api/#NNHelferlein.positional_encoding_sincos"><code>NNHelferlein.positional_encoding_sincos</code></a></li><li><a href="api/#NNHelferlein.predict"><code>NNHelferlein.predict</code></a></li><li><a href="api/#NNHelferlein.predict_imagenet"><code>NNHelferlein.predict_imagenet</code></a></li><li><a href="api/#NNHelferlein.predict_top5"><code>NNHelferlein.predict_top5</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_resnet"><code>NNHelferlein.preproc_imagenet_resnet</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_resnetv2"><code>NNHelferlein.preproc_imagenet_resnetv2</code></a></li><li><a href="api/#NNHelferlein.preproc_imagenet_vgg"><code>NNHelferlein.preproc_imagenet_vgg</code></a></li><li><a href="api/#NNHelferlein.print_network"><code>NNHelferlein.print_network</code></a></li><li><a href="api/#NNHelferlein.recycle_array"><code>NNHelferlein.recycle_array</code></a></li><li><a href="api/#NNHelferlein.reset_cell_states!"><code>NNHelferlein.reset_cell_states!</code></a></li><li><a href="api/#NNHelferlein.reset_hidden_states!"><code>NNHelferlein.reset_hidden_states!</code></a></li><li><a href="api/#NNHelferlein.save_network"><code>NNHelferlein.save_network</code></a></li><li><a href="api/#NNHelferlein.separate_heads"><code>NNHelferlein.separate_heads</code></a></li><li><a href="api/#NNHelferlein.sequence_minibatch"><code>NNHelferlein.sequence_minibatch</code></a></li><li><a href="api/#NNHelferlein.set_beta!"><code>NNHelferlein.set_beta!</code></a></li><li><a href="api/#NNHelferlein.set_cell_states!"><code>NNHelferlein.set_cell_states!</code></a></li><li><a href="api/#NNHelferlein.set_hidden_states!"><code>NNHelferlein.set_hidden_states!</code></a></li><li><a href="api/#NNHelferlein.split_minibatches"><code>NNHelferlein.split_minibatches</code></a></li><li><a href="api/#NNHelferlein.squared_error_acc"><code>NNHelferlein.squared_error_acc</code></a></li><li><a href="api/#NNHelferlein.tb_train!"><code>NNHelferlein.tb_train!</code></a></li><li><a href="api/#NNHelferlein.truncate_sequence"><code>NNHelferlein.truncate_sequence</code></a></li></ul><h2 id="Changelog"><a class="docs-heading-anchor" href="#Changelog">Changelog</a><a id="Changelog-1"></a><a class="docs-heading-anchor-permalink" href="#Changelog" title="Permalink"></a></h2><p>The history can be found here: <a href="changelog/#ChangeLog-of-NNHelferlein-package">ChangeLog of NNHelferlein package</a></p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="overview/">Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/license/index.html b/dev/license/index.html
index c79c00a8..ab975fe1 100644
--- a/dev/license/index.html
+++ b/dev/license/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>License · NNHelferlein.jl</title><meta name="title" content="License · NNHelferlein.jl"/><meta property="og:title" content="License · NNHelferlein.jl"/><meta property="twitter:title" content="License · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li class="is-active"><a class="tocitem" href>License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>License</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>License</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/license.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><p>The NNHelferlein.jl package is licensed under the MIT License:</p><p>Copyright (c) 2023 Andreas Dominik, THM University of Applied Sciences, Gießen, Germany</p><p>Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the &quot;Software&quot;), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:</p><p>The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.</p><p>THE SOFTWARE IS PROVIDED &quot;AS IS&quot;, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../api/">« API Reference</a><a class="docs-footer-nextpage" href="../changelog/">Changelog »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>License · NNHelferlein.jl</title><meta name="title" content="License · NNHelferlein.jl"/><meta property="og:title" content="License · NNHelferlein.jl"/><meta property="twitter:title" content="License · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li><a class="tocitem" href="../overview/">Overview</a></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li class="is-active"><a class="tocitem" href>License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>License</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>License</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/license.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><p>The NNHelferlein.jl package is licensed under the MIT License:</p><p>Copyright (c) 2023 Andreas Dominik, THM University of Applied Sciences, Gießen, Germany</p><p>Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the &quot;Software&quot;), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:</p><p>The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.</p><p>THE SOFTWARE IS PROVIDED &quot;AS IS&quot;, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../api/">« API Reference</a><a class="docs-footer-nextpage" href="../changelog/">Changelog »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/overview/index.html b/dev/overview/index.html
index 586279d6..8bd2995a 100644
--- a/dev/overview/index.html
+++ b/dev/overview/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Overview · NNHelferlein.jl</title><meta name="title" content="Overview · NNHelferlein.jl"/><meta property="og:title" content="Overview · NNHelferlein.jl"/><meta property="twitter:title" content="Overview · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li class="is-active"><a class="tocitem" href>Overview</a><ul class="internal"><li><a class="tocitem" href="#Neural-network-definitions"><span>Neural network definitions</span></a></li><li><a class="tocitem" href="#Layer-definitions"><span>Layer definitions</span></a></li><li><a class="tocitem" href="#Attention-Mechanisms"><span>Attention Mechanisms</span></a></li><li><a class="tocitem" href="#Data-provider"><span>Data provider</span></a></li><li><a class="tocitem" href="#Minibatch-iteration-utilities"><span>Minibatch iteration utilities</span></a></li><li><a class="tocitem" href="#Working-with-pretrained-networks"><span>Working with pretrained networks</span></a></li><li><a class="tocitem" href="#Training"><span>Training</span></a></li><li><a class="tocitem" href="#Utilities"><span>Utilities</span></a></li><li><a class="tocitem" href="#Bioinformatics"><span>Bioinformatics</span></a></li></ul></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Overview</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Overview</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/overview.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Overview"><a class="docs-heading-anchor" href="#Overview">Overview</a><a id="Overview-1"></a><a class="docs-heading-anchor-permalink" href="#Overview" title="Permalink"></a></h1><p>The section provides a brief overview of the functionality provided by NNHelferlen. For more details, please visit the API-Section.</p><h2 id="Neural-network-definitions"><a class="docs-heading-anchor" href="#Neural-network-definitions">Neural network definitions</a><a id="Neural-network-definitions-1"></a><a class="docs-heading-anchor-permalink" href="#Neural-network-definitions" title="Permalink"></a></h2><p>The abstract type <code>AbstractNN</code> provides signatures to be called as</p><ul><li><code>(m::AbstractNN)(x)</code>: evaluate x (sample or minibatch)</li><li><code>(m::AbstractNN)(x,y)</code>: evaluate x and calculate the loss</li><li><code>(m::AbstractNN)(d)</code>: return the mean loss for a dataset, if d is an iterator               of type <code>Knet.Data</code> or <code>NNHelferlen.DataLoader</code></li><li><code>(m::AbstractNN)((x,y))</code>: return the mean loss for a x,y-tuple.</li></ul><p>Explicit signatures exist for types <code>Classifier</code> and <code>Regressor</code> with negative log-likelihood and square loss as loss, respectively. For variational autoencoders the type <code>VAE</code> exists.</p><p>The type <code>Chain</code> wraps a list of layers that are executed sequentially.</p><p>Types <code>Transformer</code> and <code>TokenTransformer</code> are provided to build Bert-like transformer networks from the rspective <code>TFEncoder</code>  and <code>TFDecoder</code> layers.</p><p>A network summary can be printed with <code>summary(mdl::AbstractNN)</code> and a more detailed list of all layers with <code>print_network(mdl::AbstractNN)</code>.</p><h2 id="Layer-definitions"><a class="docs-heading-anchor" href="#Layer-definitions">Layer definitions</a><a id="Layer-definitions-1"></a><a class="docs-heading-anchor-permalink" href="#Layer-definitions" title="Permalink"></a></h2><p>Several layers are predefined with executable signatures:</p><ul><li><p><strong>MLPs:</strong> different flavours of the simple layer:       <code>Dense</code>: default layer for a vector (i.e. sample)          or matrix (i.e. mininbatch) as input with logistic          actvation as default.               <code>Linear</code>: TensorFlow-style layer to process high-dimensional         arrays and identity as default activation.       <code>Embed</code>: embedding layer that adds a first dimension with the          embeddings to the input.</p></li><li><p><strong>Convolutional NNs:</strong> to build CNNs <code>Conv</code>, <code>DeConv</code>, <code>Pool</code>       <code>UnPool</code> and <code>Flat</code>             layers are provided with standard functionality.       The utilitys include methods for array manipulation, such as       clipping arrays or adding dimensions.</p></li><li><p><strong>Recurrent Layers:</strong> a <code>Recurrent</code> layer is defined as wrapper        around the basic Knet RNN type.</p></li><li><p><strong>Others:</strong> additional layers include (please see the API-section for       a complete list!):       <code>Softmax</code>, <code>Dropout</code>, trainable <code>BatchNorm</code>, trainable <code>LayerNorm</code>.</p></li></ul><h2 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h2><p>Some attention mechanisms are implemented for use in sequence-to-sequence networks. If possible projections of values are  precomputed to reduce computational cost:</p><ul><li><strong>AttnBahdanau:</strong> concat- or additive-style attention according to       Bahdanau, 2015.</li><li><strong>AttnLuong:</strong> multiplicative-or general-stype attention according to       Luong, 2015.</li><li><strong>AttnDot:</strong> dot-product-style attention according to       Luong, 2015.</li><li><strong>AttnLocation:</strong> dot-product-style attention according to       Luong, 2015.</li><li><strong>AttnInFeed:</strong> input-feeding attention according to       Luong, 2015.</li></ul><p>A generalised dot-product attention can be computed from (Query, Key, Value) tuple: <code>dot_prod_attn(q, k, v)</code>.</p><p>Helpers for transformer networks include functions for positional encoding, generating padding- and peek-akead-masks and computing scaled multi-headed attention, according to Vaswani, 2017.</p><h2 id="Data-provider"><a class="docs-heading-anchor" href="#Data-provider">Data provider</a><a id="Data-provider-1"></a><a class="docs-heading-anchor-permalink" href="#Data-provider" title="Permalink"></a></h2><h3 id="Image-data"><a class="docs-heading-anchor" href="#Image-data">Image data</a><a id="Image-data-1"></a><a class="docs-heading-anchor-permalink" href="#Image-data" title="Permalink"></a></h3><p>The function <code>mk_image_minibatch()</code> can be used to create an iterator over images, organised in directories, with the first directory-level as class labels.</p><p>Helper functions (such as <code>image2array()</code>, <code>array2image()</code>, <code>array2RGB()</code>) can be used to transform image data to arrays. Imagenet-style preprocessing can be achieved with <code>preproc_imagenet()</code>, readable Imagenet class labels of the top predictions are printed by <code>predict_imagenet()</code>.</p><h3 id="DataFrames"><a class="docs-heading-anchor" href="#DataFrames">DataFrames</a><a id="DataFrames-1"></a><a class="docs-heading-anchor-permalink" href="#DataFrames" title="Permalink"></a></h3><p>Helpers for tabular date include:</p><ul><li><code>dataframe_read</code>: read a csv-file and return a DataFrame</li><li><code>dataframe_split</code>: split tabular data in a DataFrame into train and               validation data; optionally with balancing.</li><li><code>dataframe_minibatch</code>: data provider to turn tabular data from               a DataFrame (with one sample per row)               into a Knet-like iterator of minibatches of type <code>Knet.Data</code>.</li><li><code>mk_class_ids(labels)</code>: may be used to turn class label strings into               class-IDs.</li></ul><h3 id="Texts-and-NLP"><a class="docs-heading-anchor" href="#Texts-and-NLP">Texts and NLP</a><a id="Texts-and-NLP-1"></a><a class="docs-heading-anchor-permalink" href="#Texts-and-NLP" title="Permalink"></a></h3><p>Some utilities are provided for NLP data handling:</p><ul><li><code>WordTokenizer</code>: a simple tool to encode words as ids.       The type comes with signatures to en- and decode in both directions.</li><li><code>get_tatoeba_corpus</code>: download dual-language corpi and provide       corresponding lists of sentences in two languages.</li></ul><p><code>sequence_minibatch()</code> function returns an iterator to sequence or sequence-to-secuence minibatches. Also helpers for padding and truncating sequences are provided.</p><h2 id="Minibatch-iteration-utilities"><a class="docs-heading-anchor" href="#Minibatch-iteration-utilities">Minibatch iteration utilities</a><a id="Minibatch-iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Minibatch-iteration-utilities" title="Permalink"></a></h2><p>A number of iterators are provided to wrap and manipulate minibatch iterators:</p><ul><li><code>PartialIterator(it, states)</code> returns an iterator that only       iterates the given <code>states</code> of iterator <code>it</code>.</li><li><code>MBNoiser(it, σ)</code> applies Gaussian noise to the x-values of        minibatches, provided by iterator <code>it</code>.</li><li><code>MBMasquerade(it, ρ)</code> applies a mask to the x-values of        minibatches, provided by iterator <code>it</code>.</li></ul><h2 id="Working-with-pretrained-networks"><a class="docs-heading-anchor" href="#Working-with-pretrained-networks">Working with pretrained networks</a><a id="Working-with-pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Working-with-pretrained-networks" title="Permalink"></a></h2><p>Layers of pre-trained models can be created from TensorFlow HDF5-parameter files. It is possible to build a network from any pretrained TensorFlow model by importing the parameters by HDF5-constructors for the layers <code>Dense</code>, <code>Conv</code>. The flatten-layer <code>PyFlat</code> allows for Python-like row-major-flattening, necessary to make sure, that the parameters of an imported layer after flattening are in the correct order.</p><p><em>NNHelferlein</em> provides an increasing number of pretrained  models from the Tensorflow/Keras model zoo, such as vgg or resnet. Please see the reference section for a up-to-date list.</p><h2 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h2><p>Although Knet-style is to avoid havyweight interfaces and train networks with lightweight and flexible optimisers, a train interface is added that provides TensorBoard logs with online reporting of minibatch loss, training and validation loss and accuracy.</p><h2 id="Utilities"><a class="docs-heading-anchor" href="#Utilities">Utilities</a><a id="Utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Utilities" title="Permalink"></a></h2><p>A number of additional utilities are included. Please have a look at the utilities section of the API documentation.</p><h2 id="Bioinformatics"><a class="docs-heading-anchor" href="#Bioinformatics">Bioinformatics</a><a id="Bioinformatics-1"></a><a class="docs-heading-anchor-permalink" href="#Bioinformatics" title="Permalink"></a></h2><p>A number of utilities for bioinformatics are provided, including an amino acid tokenizer to convert amino acid sequences from String to  vectors of integers and embedding of amino acids with BLOSUM62 or VHSE8 parameter sets.</p><p>Please have a look at the bioinformatics section of the API documentation.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Introduction</a><a class="docs-footer-nextpage" href="../examples/">Examples »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:13">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Overview · NNHelferlein.jl</title><meta name="title" content="Overview · NNHelferlein.jl"/><meta property="og:title" content="Overview · NNHelferlein.jl"/><meta property="twitter:title" content="Overview · NNHelferlein.jl"/><meta name="description" content="Documentation for NNHelferlein.jl."/><meta property="og:description" content="Documentation for NNHelferlein.jl."/><meta property="twitter:description" content="Documentation for NNHelferlein.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.svg" alt="NNHelferlein.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">NNHelferlein.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Introduction</a></li><li class="is-active"><a class="tocitem" href>Overview</a><ul class="internal"><li><a class="tocitem" href="#Neural-network-definitions"><span>Neural network definitions</span></a></li><li><a class="tocitem" href="#Layer-definitions"><span>Layer definitions</span></a></li><li><a class="tocitem" href="#Attention-Mechanisms"><span>Attention Mechanisms</span></a></li><li><a class="tocitem" href="#Data-provider"><span>Data provider</span></a></li><li><a class="tocitem" href="#Minibatch-iteration-utilities"><span>Minibatch iteration utilities</span></a></li><li><a class="tocitem" href="#Working-with-pretrained-networks"><span>Working with pretrained networks</span></a></li><li><a class="tocitem" href="#Training"><span>Training</span></a></li><li><a class="tocitem" href="#Utilities"><span>Utilities</span></a></li><li><a class="tocitem" href="#Bioinformatics"><span>Bioinformatics</span></a></li></ul></li><li><a class="tocitem" href="../examples/">Examples</a></li><li><a class="tocitem" href="../api_overview/">API Overview</a></li><li><a class="tocitem" href="../api/">API Reference</a></li><li><a class="tocitem" href="../license/">License</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Overview</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Overview</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/KnetML/NNHelferlein.jl/blob/main/docs/src/overview.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Overview"><a class="docs-heading-anchor" href="#Overview">Overview</a><a id="Overview-1"></a><a class="docs-heading-anchor-permalink" href="#Overview" title="Permalink"></a></h1><p>The section provides a brief overview of the functionality provided by NNHelferlen. For more details, please visit the API-Section.</p><h2 id="Neural-network-definitions"><a class="docs-heading-anchor" href="#Neural-network-definitions">Neural network definitions</a><a id="Neural-network-definitions-1"></a><a class="docs-heading-anchor-permalink" href="#Neural-network-definitions" title="Permalink"></a></h2><p>The abstract type <code>AbstractNN</code> provides signatures to be called as</p><ul><li><code>(m::AbstractNN)(x)</code>: evaluate x (sample or minibatch)</li><li><code>(m::AbstractNN)(x,y)</code>: evaluate x and calculate the loss</li><li><code>(m::AbstractNN)(d)</code>: return the mean loss for a dataset, if d is an iterator               of type <code>Knet.Data</code> or <code>NNHelferlen.DataLoader</code></li><li><code>(m::AbstractNN)((x,y))</code>: return the mean loss for a x,y-tuple.</li></ul><p>Explicit signatures exist for types <code>Classifier</code> and <code>Regressor</code> with negative log-likelihood and square loss as loss, respectively. For variational autoencoders the type <code>VAE</code> exists.</p><p>The type <code>Chain</code> wraps a list of layers that are executed sequentially.</p><p>Types <code>Transformer</code> and <code>TokenTransformer</code> are provided to build Bert-like transformer networks from the rspective <code>TFEncoder</code>  and <code>TFDecoder</code> layers.</p><p>A network summary can be printed with <code>summary(mdl::AbstractNN)</code> and a more detailed list of all layers with <code>print_network(mdl::AbstractNN)</code>.</p><h2 id="Layer-definitions"><a class="docs-heading-anchor" href="#Layer-definitions">Layer definitions</a><a id="Layer-definitions-1"></a><a class="docs-heading-anchor-permalink" href="#Layer-definitions" title="Permalink"></a></h2><p>Several layers are predefined with executable signatures:</p><ul><li><p><strong>MLPs:</strong> different flavours of the simple layer:       <code>Dense</code>: default layer for a vector (i.e. sample)          or matrix (i.e. mininbatch) as input with logistic          actvation as default.               <code>Linear</code>: TensorFlow-style layer to process high-dimensional         arrays and identity as default activation.       <code>Embed</code>: embedding layer that adds a first dimension with the          embeddings to the input.</p></li><li><p><strong>Convolutional NNs:</strong> to build CNNs <code>Conv</code>, <code>DeConv</code>, <code>Pool</code>       <code>UnPool</code> and <code>Flat</code>             layers are provided with standard functionality.       The utilitys include methods for array manipulation, such as       clipping arrays or adding dimensions.</p></li><li><p><strong>Recurrent Layers:</strong> a <code>Recurrent</code> layer is defined as wrapper        around the basic Knet RNN type.</p></li><li><p><strong>Others:</strong> additional layers include (please see the API-section for       a complete list!):       <code>Softmax</code>, <code>Dropout</code>, trainable <code>BatchNorm</code>, trainable <code>LayerNorm</code>.</p></li></ul><h2 id="Attention-Mechanisms"><a class="docs-heading-anchor" href="#Attention-Mechanisms">Attention Mechanisms</a><a id="Attention-Mechanisms-1"></a><a class="docs-heading-anchor-permalink" href="#Attention-Mechanisms" title="Permalink"></a></h2><p>Some attention mechanisms are implemented for use in sequence-to-sequence networks. If possible projections of values are  precomputed to reduce computational cost:</p><ul><li><strong>AttnBahdanau:</strong> concat- or additive-style attention according to       Bahdanau, 2015.</li><li><strong>AttnLuong:</strong> multiplicative-or general-stype attention according to       Luong, 2015.</li><li><strong>AttnDot:</strong> dot-product-style attention according to       Luong, 2015.</li><li><strong>AttnLocation:</strong> dot-product-style attention according to       Luong, 2015.</li><li><strong>AttnInFeed:</strong> input-feeding attention according to       Luong, 2015.</li></ul><p>A generalised dot-product attention can be computed from (Query, Key, Value) tuple: <code>dot_prod_attn(q, k, v)</code>.</p><p>Helpers for transformer networks include functions for positional encoding, generating padding- and peek-akead-masks and computing scaled multi-headed attention, according to Vaswani, 2017.</p><h2 id="Data-provider"><a class="docs-heading-anchor" href="#Data-provider">Data provider</a><a id="Data-provider-1"></a><a class="docs-heading-anchor-permalink" href="#Data-provider" title="Permalink"></a></h2><h3 id="Image-data"><a class="docs-heading-anchor" href="#Image-data">Image data</a><a id="Image-data-1"></a><a class="docs-heading-anchor-permalink" href="#Image-data" title="Permalink"></a></h3><p>The function <code>mk_image_minibatch()</code> can be used to create an iterator over images, organised in directories, with the first directory-level as class labels.</p><p>Helper functions (such as <code>image2array()</code>, <code>array2image()</code>, <code>array2RGB()</code>) can be used to transform image data to arrays. Imagenet-style preprocessing can be achieved with <code>preproc_imagenet()</code>, readable Imagenet class labels of the top predictions are printed by <code>predict_imagenet()</code>.</p><h3 id="DataFrames"><a class="docs-heading-anchor" href="#DataFrames">DataFrames</a><a id="DataFrames-1"></a><a class="docs-heading-anchor-permalink" href="#DataFrames" title="Permalink"></a></h3><p>Helpers for tabular date include:</p><ul><li><code>dataframe_read</code>: read a csv-file and return a DataFrame</li><li><code>dataframe_split</code>: split tabular data in a DataFrame into train and               validation data; optionally with balancing.</li><li><code>dataframe_minibatch</code>: data provider to turn tabular data from               a DataFrame (with one sample per row)               into a Knet-like iterator of minibatches of type <code>Knet.Data</code>.</li><li><code>mk_class_ids(labels)</code>: may be used to turn class label strings into               class-IDs.</li></ul><h3 id="Texts-and-NLP"><a class="docs-heading-anchor" href="#Texts-and-NLP">Texts and NLP</a><a id="Texts-and-NLP-1"></a><a class="docs-heading-anchor-permalink" href="#Texts-and-NLP" title="Permalink"></a></h3><p>Some utilities are provided for NLP data handling:</p><ul><li><code>WordTokenizer</code>: a simple tool to encode words as ids.       The type comes with signatures to en- and decode in both directions.</li><li><code>get_tatoeba_corpus</code>: download dual-language corpi and provide       corresponding lists of sentences in two languages.</li></ul><p><code>sequence_minibatch()</code> function returns an iterator to sequence or sequence-to-secuence minibatches. Also helpers for padding and truncating sequences are provided.</p><h2 id="Minibatch-iteration-utilities"><a class="docs-heading-anchor" href="#Minibatch-iteration-utilities">Minibatch iteration utilities</a><a id="Minibatch-iteration-utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Minibatch-iteration-utilities" title="Permalink"></a></h2><p>A number of iterators are provided to wrap and manipulate minibatch iterators:</p><ul><li><code>PartialIterator(it, states)</code> returns an iterator that only       iterates the given <code>states</code> of iterator <code>it</code>.</li><li><code>MBNoiser(it, σ)</code> applies Gaussian noise to the x-values of        minibatches, provided by iterator <code>it</code>.</li><li><code>MBMasquerade(it, ρ)</code> applies a mask to the x-values of        minibatches, provided by iterator <code>it</code>.</li></ul><h2 id="Working-with-pretrained-networks"><a class="docs-heading-anchor" href="#Working-with-pretrained-networks">Working with pretrained networks</a><a id="Working-with-pretrained-networks-1"></a><a class="docs-heading-anchor-permalink" href="#Working-with-pretrained-networks" title="Permalink"></a></h2><p>Layers of pre-trained models can be created from TensorFlow HDF5-parameter files. It is possible to build a network from any pretrained TensorFlow model by importing the parameters by HDF5-constructors for the layers <code>Dense</code>, <code>Conv</code>. The flatten-layer <code>PyFlat</code> allows for Python-like row-major-flattening, necessary to make sure, that the parameters of an imported layer after flattening are in the correct order.</p><p><em>NNHelferlein</em> provides an increasing number of pretrained  models from the Tensorflow/Keras model zoo, such as vgg or resnet. Please see the reference section for a up-to-date list.</p><h2 id="Training"><a class="docs-heading-anchor" href="#Training">Training</a><a id="Training-1"></a><a class="docs-heading-anchor-permalink" href="#Training" title="Permalink"></a></h2><p>Although Knet-style is to avoid havyweight interfaces and train networks with lightweight and flexible optimisers, a train interface is added that provides TensorBoard logs with online reporting of minibatch loss, training and validation loss and accuracy.</p><h2 id="Utilities"><a class="docs-heading-anchor" href="#Utilities">Utilities</a><a id="Utilities-1"></a><a class="docs-heading-anchor-permalink" href="#Utilities" title="Permalink"></a></h2><p>A number of additional utilities are included. Please have a look at the utilities section of the API documentation.</p><h2 id="Bioinformatics"><a class="docs-heading-anchor" href="#Bioinformatics">Bioinformatics</a><a id="Bioinformatics-1"></a><a class="docs-heading-anchor-permalink" href="#Bioinformatics" title="Permalink"></a></h2><p>A number of utilities for bioinformatics are provided, including an amino acid tokenizer to convert amino acid sequences from String to  vectors of integers and embedding of amino acids with BLOSUM62 or VHSE8 parameter sets.</p><p>Please have a look at the bioinformatics section of the API documentation.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Introduction</a><a class="docs-footer-nextpage" href="../examples/">Examples »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="auto">Automatic (OS)</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.1.2 on <span class="colophon-date" title="Friday 27 October 2023 11:25">Friday 27 October 2023</span>. Using Julia version 1.9.3.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>