From 35029a0c84e7c9f2c56e2b2256746654a5084432 Mon Sep 17 00:00:00 2001 From: Youngeun Kwon Date: Mon, 7 Oct 2024 10:40:43 -0700 Subject: [PATCH 01/18] long context perf Signed-off-by: Youngeun Kwon --- .../source/performance/performance_summary.md | 19 +++++++++++++++++++ 1 file changed, 19 insertions(+) diff --git a/docs/source/performance/performance_summary.md b/docs/source/performance/performance_summary.md index 98dae2dc0a78..0234e1d7ed42 100644 --- a/docs/source/performance/performance_summary.md +++ b/docs/source/performance/performance_summary.md @@ -40,3 +40,22 @@ | LLAMA2-7B | LoRA | 8 | 32 | 1 | 4096 | 1 | 1 | 24824 | 663 | ***0.8*** | | LLAMA2-13B | LoRA | 8 | 32 | 1 | 4096 | 1 | 1 | 14629 | 757 | ***1.4*** | | LLAMA2-70B | LoRA | 8 | 32 | 1 | 4096 | 2 | 4 | 2621 | 722 | ***7.9*** | + + +### Long Input Sequences + +- The results in the table below show the pre-training performance of the LLAMA2-7B model with various input sequence lengths at FP8 precision. + - Container: [NeMo24.03.01.framework](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo/tags) + - System: DGX-H100 + +| Sequence Length (K)| #-GPUs | GBS | MBS | TP | PP | CP | VP | DP | Tokens / sec / GPU | Model TFLOP / sec / GPU | ***Est. time to train in days (10T tokens, 1K GPUs)*** | +| -------------------| ------ | --- | --- | -- | -- | -- | -- | -- | ------------------ | ----------------------- | ------------------------------------------------------ | +| 4 | 4 | 1024 | 1 | 1 | 1 | 1 | 1 | 4 | 16671 | 768 | ***7*** | +| 8 | 8 | 512 | 1 | 1 | 2 | 1 | 1 | 4 | 13907 | 730 | ***8*** | +| 16 | 16 | 256 | 1 | 2 | 1 | 1 | 1 | 8 | 10082 | 660 | ***11*** | +| 32 | 32 | 128 | 1 | 2 | 1 | 2 | 1 | 8 | 6687 | 610 | ***17*** | +| 64 | 64 | 64 | 1 | 4 | 1 | 2 | 1 | 8 | 4021 | 574 | ***28*** | +| 128 | 128 | 32 | 1 | 4 | 1 | 4 | 1 | 8 | 2260 | 555 | ***50*** | +| 256 | 256 | 16 | 1 | 4 | 1 | 8 | 1 | 8 | 1214 | 549 | ***93*** | +| 512 | 512 | 8 | 1 | 8 | 1 | 16 | 1 | 4 | 635 | 549 | ***178*** | +| 1024 | 1024 | 4 | 1 | 8 | 1 | 32 | 1 | 4 | 318 | 536 | ***356*** | \ No newline at end of file From cd3a808656283d38a415d7917af2e6f2ed519dc6 Mon Sep 17 00:00:00 2001 From: Youngeun Kwon Date: Mon, 7 Oct 2024 13:19:13 -0700 Subject: [PATCH 02/18] update the long context perf Signed-off-by: Youngeun Kwon --- .../performance/performance_long_sequence.md | 155 ++++++++++++++++++ .../source/performance/performance_summary.md | 19 --- docs/source/performance/speedup_figure.png | Bin 0 -> 20611 bytes 3 files changed, 155 insertions(+), 19 deletions(-) create mode 100644 docs/source/performance/performance_long_sequence.md create mode 100644 docs/source/performance/speedup_figure.png diff --git a/docs/source/performance/performance_long_sequence.md b/docs/source/performance/performance_long_sequence.md new file mode 100644 index 000000000000..c2816485b54d --- /dev/null +++ b/docs/source/performance/performance_long_sequence.md @@ -0,0 +1,155 @@ +# Long Sequence Performance + +## LLAMA2-7B (FP8) + +- The results in the table below show the pre-training performance of the LLAMA2-7B model with-CP (context parallelism) and without-CP for various input sequence lengths at FP8 precision. Detailed configurations and the achievable performance are provided for the with-CP configurations. For the without-CP configurations, the best achievable performance is reported within the given memory capacity constraint. 
+  - Container: [NeMo24.03.01.framework](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo/tags)
+  - System: DGX-H100
+
+| SeqLen (K) | # of GPUs | Without-CP TFLOPS / GPU | TP (with-CP) | PP (with-CP) | DP (with-CP) | CP | With-CP TFLOPS / GPU | Speedup (with-CP / without-CP) |
+| ---------- | --------- | ----------------------- | ------------ | ------------ | ------------ | -- | -------------------- | ------------------------------ |
+| 4    | 4    | 768  | 1 | 1 | 4 | 1  | 768 | 1.00  |
+| 8    | 8    | 730  | 1 | 2 | 4 | 1  | 730 | 1.00  |
+| 16   | 16   | 660  | 2 | 1 | 8 | 1  | 660 | 1.00  |
+| 32   | 32   | 595  | 2 | 1 | 8 | 2  | 610 | 1.03  |
+| 64   | 64   | 534  | 4 | 1 | 8 | 2  | 574 | 1.07  |
+| 128  | 128  | 424  | 4 | 1 | 8 | 4  | 555 | 1.31  |
+| 256  | 256  | 392  | 4 | 1 | 8 | 8  | 549 | 1.40  |
+| 512  | 512  | 104  | 8 | 1 | 4 | 16 | 549 | 5.28  |
+| 1024 | 1024 | 26.5 | 8 | 1 | 4 | 32 | 536 | 20.23 |
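The TP, PP, DP, and CP columns of the with-CP configurations map directly onto NeMo 2.0 trainer arguments. The snippet below sketches the 32K-sequence row (32 GPUs with TP=2, PP=1, CP=2, DP=8) using the `nl.MegatronStrategy`, `nl.Trainer`, and `llm` APIs that appear in the recipes and tests elsewhere in this patch series; the `GPTConfig` values and the `bf16-mixed` precision plugin are illustrative stand-ins rather than the exact LLAMA2-7B FP8 setup used to collect these numbers.

```python
# Sketch of the 32K-row configuration (32 GPUs: TP=2, PP=1, CP=2, DP=8).
# Class and argument names follow the NeMo 2.0 APIs used elsewhere in this
# patch series; the GPTConfig below stands in for LLAMA2-7B and bf16-mixed
# stands in for the FP8 recipe used in the benchmark.
import nemo.lightning as nl
from nemo.collections import llm

seq_length = 32 * 1024  # 32K tokens per sample

strategy = nl.MegatronStrategy(
    tensor_model_parallel_size=2,    # TP column
    pipeline_model_parallel_size=1,  # PP column
    context_parallel_size=2,         # CP column
    sequence_parallel=True,          # assumption: SP enabled alongside TP > 1
)

trainer = nl.Trainer(
    accelerator="gpu",
    devices=8,    # GPUs per node
    num_nodes=4,  # 4 nodes x 8 GPUs = 32 GPUs; DP = 32 / (TP * PP * CP) = 8
    max_steps=10,
    strategy=strategy,
    plugins=nl.MegatronMixedPrecision(precision="bf16-mixed"),
)

data = llm.MockDataModule(seq_length=seq_length, global_batch_size=128, micro_batch_size=1)
config = llm.GPTConfig(
    num_layers=32,
    hidden_size=4096,
    ffn_hidden_size=11008,
    num_attention_heads=32,
    seq_length=seq_length,
)
model = llm.GPTModel(config, tokenizer=data.tokenizer)

trainer.fit(model, data)
```

The global batch size of 128 and micro batch size of 1 for the 32K case follow the per-sequence-length settings listed in the earlier summary table; scaling the GPU count only changes the data-parallel degree, since DP = #GPUs / (TP × PP × CP).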
+
+
+### Speedup enabled by the CP
+![Speedup Graph](speedup_figure.png)
\ No newline at end of file
diff --git a/docs/source/performance/performance_summary.md b/docs/source/performance/performance_summary.md
index 0234e1d7ed42..98dae2dc0a78 100644
--- a/docs/source/performance/performance_summary.md
+++ b/docs/source/performance/performance_summary.md
@@ -40,22 +40,3 @@
 | LLAMA2-7B | LoRA | 8 | 32 | 1 | 4096 | 1 | 1 | 24824 | 663 | ***0.8*** |
 | LLAMA2-13B | LoRA | 8 | 32 | 1 | 4096 | 1 | 1 | 14629 | 757 | ***1.4*** |
 | LLAMA2-70B | LoRA | 8 | 32 | 1 | 4096 | 2 | 4 | 2621 | 722 | ***7.9*** |
-
-
-### Long Input Sequences
-
-- The results in the table below show the pre-training performance of the LLAMA2-7B model with various input sequence lengths at FP8 precision.
-  - Container: [NeMo24.03.01.framework](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo/tags)
-  - System: DGX-H100
-
-| Sequence Length (K)| #-GPUs | GBS | MBS | TP | PP | CP | VP | DP | Tokens / sec / GPU | Model TFLOP / sec / GPU | ***Est. time to train in days (10T tokens, 1K GPUs)*** |
-| -------------------| ------ | --- | --- | -- | -- | -- | -- | -- | ------------------ | ----------------------- | ------------------------------------------------------ |
-| 4 | 4 | 1024 | 1 | 1 | 1 | 1 | 1 | 4 | 16671 | 768 | ***7*** |
-| 8 | 8 | 512 | 1 | 1 | 2 | 1 | 1 | 4 | 13907 | 730 | ***8*** |
-| 16 | 16 | 256 | 1 | 2 | 1 | 1 | 1 | 8 | 10082 | 660 | ***11*** |
-| 32 | 32 | 128 | 1 | 2 | 1 | 2 | 1 | 8 | 6687 | 610 | ***17*** |
-| 64 | 64 | 64 | 1 | 4 | 1 | 2 | 1 | 8 | 4021 | 574 | ***28*** |
-| 128 | 128 | 32 | 1 | 4 | 1 | 4 | 1 | 8 | 2260 | 555 | ***50*** |
-| 256 | 256 | 16 | 1 | 4 | 1 | 8 | 1 | 8 | 1214 | 549 | ***93*** |
-| 512 | 512 | 8 | 1 | 8 | 1 | 16 | 1 | 4 | 635 | 549 | ***178*** |
-| 1024 | 1024 | 4 | 1 | 8 | 1 | 32 | 1 | 4 | 318 | 536 | ***356*** |
\ No newline at end of file
diff --git a/docs/source/performance/speedup_figure.png b/docs/source/performance/speedup_figure.png
new file mode 100644
index 0000000000000000000000000000000000000000..af73e6f5375b85f789d10cfa59d40aa2e5f104d2
GIT binary patch
literal 20611
[base85-encoded PNG image data]

literal 0
HcmV?d00001

From 8922a84be7d5048050a50020f94e8c7a3aad326a Mon Sep 17 00:00:00 2001
From: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Date: Mon, 7 Oct 2024 10:06:29 -0700
Subject: [PATCH 03/18] Akoumparouli/mcore microbatch calculator fix (#10780)

* move tests/lightning/{,_}io

Signed-off-by: Alexandros Koumparoulis

* add microbatch calculator context manager

Signed-off-by: Alexandros Koumparoulis

* use microbatch calculator context manager

Signed-off-by: Alexandros
Koumparoulis * add on_load_checkpoint test to ValidateModelRestoration; use ctx manager to reconfigure microbatch calculator; update save/restore path; add cleanup step at the end Signed-off-by: Alexandros Koumparoulis * remove unused var Signed-off-by: Alexandros Koumparoulis * fix Signed-off-by: Alexandros Koumparoulis * Apply isort and black reformatting Signed-off-by: akoumpa --------- Signed-off-by: Alexandros Koumparoulis Signed-off-by: akoumpa Co-authored-by: akoumpa Signed-off-by: Youngeun Kwon --- tests/lightning/{io => _io}/__init__.py | 0 tests/lightning/{io => _io}/test_api.py | 0 tests/lightning/{io => _io}/test_mixin.py | 0 tests/lightning/{io => _io}/test_state.py | 0 tests/lightning/mcore_microbatch_utils.py | 27 +++ tests/lightning/test_dist_ckpt.py | 137 ++++++------- tests/lightning/test_nemo_resume_from_ckpt.py | 55 ++--- tests/lightning/test_state_restoration.py | 190 +++++++++++------- 8 files changed, 246 insertions(+), 163 deletions(-) rename tests/lightning/{io => _io}/__init__.py (100%) rename tests/lightning/{io => _io}/test_api.py (100%) rename tests/lightning/{io => _io}/test_mixin.py (100%) rename tests/lightning/{io => _io}/test_state.py (100%) create mode 100644 tests/lightning/mcore_microbatch_utils.py diff --git a/tests/lightning/io/__init__.py b/tests/lightning/_io/__init__.py similarity index 100% rename from tests/lightning/io/__init__.py rename to tests/lightning/_io/__init__.py diff --git a/tests/lightning/io/test_api.py b/tests/lightning/_io/test_api.py similarity index 100% rename from tests/lightning/io/test_api.py rename to tests/lightning/_io/test_api.py diff --git a/tests/lightning/io/test_mixin.py b/tests/lightning/_io/test_mixin.py similarity index 100% rename from tests/lightning/io/test_mixin.py rename to tests/lightning/_io/test_mixin.py diff --git a/tests/lightning/io/test_state.py b/tests/lightning/_io/test_state.py similarity index 100% rename from tests/lightning/io/test_state.py rename to tests/lightning/_io/test_state.py diff --git a/tests/lightning/mcore_microbatch_utils.py b/tests/lightning/mcore_microbatch_utils.py new file mode 100644 index 000000000000..39b3baee446c --- /dev/null +++ b/tests/lightning/mcore_microbatch_utils.py @@ -0,0 +1,27 @@ +import contextlib + + +# @akoumparouli: use a context manager that saves/restores gbs/mbs when using +# reconfigure_num_microbatches_calculator to avoid interference between tests. 
+@contextlib.contextmanager +def reconfigure_num_microbatches_calculator_manager(*args, **kwargs): + import megatron.core.num_microbatches_calculator as mb_calc + + # Store current mbs, gbs values + if not mb_calc._GLOBAL_NUM_MICROBATCHES_CALCULATOR is None: + _mbs = mb_calc.get_micro_batch_size() + _gbs = mb_calc.get_current_global_batch_size() + + # use user's settings + mb_calc.reconfigure_num_microbatches_calculator(*args, **kwargs) + else: + _mbs, _gbs = 1, 1 + + try: + # run user's code + yield + # @akoumparouli: no catch + finally: + # restore old mbs, gbs + if not mb_calc._GLOBAL_NUM_MICROBATCHES_CALCULATOR is None: + mb_calc.reconfigure_num_microbatches_calculator(0, None, _gbs, _mbs, data_parallel_size=1) diff --git a/tests/lightning/test_dist_ckpt.py b/tests/lightning/test_dist_ckpt.py index e6ea381fdf0b..d5037f0aa573 100644 --- a/tests/lightning/test_dist_ckpt.py +++ b/tests/lightning/test_dist_ckpt.py @@ -24,7 +24,6 @@ def set_env(): import pytest import pytorch_lightning as pl import torch -from megatron.core.num_microbatches_calculator import reconfigure_num_microbatches_calculator import nemo.lightning as nl from nemo.collections import llm @@ -43,13 +42,9 @@ def _get_last_checkpoint_dir(model: pl.LightningModule, suffix: str = '') -> Pat return f'epoch={model.trainer.current_epoch - 1}-step={model.trainer.max_steps - 1}{suffix}' -def get_model_and_data(): - micro_batch_size = 2 - global_batch_size = 2 +def get_model_and_data(mbs=2, gbs=2): seq_length = 128 - data = llm.MockDataModule( - seq_length=seq_length, micro_batch_size=micro_batch_size, global_batch_size=global_batch_size - ) + data = llm.MockDataModule(seq_length=seq_length, micro_batch_size=mbs, global_batch_size=gbs) config = llm.GPTConfig( num_layers=2, @@ -59,13 +54,6 @@ def get_model_and_data(): seq_length=seq_length, apply_query_key_layer_scaling=1, ) - reconfigure_num_microbatches_calculator( - 0, - None, - global_batch_size, - micro_batch_size, - data_parallel_size=1, - ) return llm.GPTModel(config, tokenizer=data.tokenizer), data @@ -76,21 +64,25 @@ def test_dist_ckpt_io_called_for_mcore_models(self, tmp_path): set_env() assert os.environ['NVTE_APPLY_QK_LAYER_SCALING'] == '1' - model, data = get_model_and_data() + gbs, mbs = 2, 2 + model, data = get_model_and_data(mbs, gbs) + from tests.lightning.mcore_microbatch_utils import reconfigure_num_microbatches_calculator_manager - strategy = _get_strategy() + with reconfigure_num_microbatches_calculator_manager(0, None, gbs, mbs, data_parallel_size=1): - trainer = nl.Trainer( - devices=1, - accelerator="gpu", - strategy=strategy, - enable_checkpointing=True, - max_steps=2, - default_root_dir=str(tmp_path), - logger=False, - ) + strategy = _get_strategy() + + trainer = nl.Trainer( + devices=1, + accelerator="gpu", + strategy=strategy, + enable_checkpointing=True, + max_steps=2, + default_root_dir=str(tmp_path), + logger=False, + ) - trainer.fit(model, data) + trainer.fit(model, data) assert isinstance(trainer.strategy.checkpoint_io, MegatronCheckpointIO) # Ckpt path doesn't contain the .ckpt suffix @@ -104,51 +96,54 @@ def test_dist_ckpt_io_called_for_mcore_models(self, tmp_path): def test_async_save_produces_same_checkpoints_as_sync(self, tmp_path): set_env() assert os.environ['NVTE_APPLY_QK_LAYER_SCALING'] == '1' - model, data = get_model_and_data() - - sync_ckpt_dir = tmp_path / 'sync_checkpoints' - async_ckpt_dir = tmp_path / 'async_checkpoints' - - sync_checkpoint_io = MegatronCheckpointIO('torch_dist') - async_checkpoint_io = 
AsyncFinalizableCheckpointIO(MegatronCheckpointIO('torch_dist', async_save=True)) - - # dummy_trainer just to initialize NCCL - dummy_trainer = pl.Trainer( - devices=1, - logger=False, - max_steps=2, - strategy=_get_strategy(), - ) - dummy_trainer.fit(model, data) - strategy = _get_strategy() - tmp_path = strategy.broadcast(tmp_path) - - ## reset the model and data and train with sync checkpointing - model, data = get_model_and_data() - sync_test_trainer = pl.Trainer( - devices=1, - enable_checkpointing=True, - logger=False, - max_steps=2, - strategy=_get_strategy(), - plugins=[sync_checkpoint_io], - default_root_dir=str(sync_ckpt_dir), - ) - sync_test_trainer.fit(model, data) - - ## reset the model and data and train with sync checkpointing - model, data = get_model_and_data() - async_test_trainer = pl.Trainer( - devices=1, - enable_checkpointing=True, - logger=False, - max_steps=2, - strategy=_get_strategy(), - plugins=[async_checkpoint_io], - callbacks=AsyncFinalizerCallback(), - default_root_dir=str(async_ckpt_dir), - ) - async_test_trainer.fit(model, data) + gbs, mbs = 2, 2 + model, data = get_model_and_data(mbs, gbs) + from tests.lightning.mcore_microbatch_utils import reconfigure_num_microbatches_calculator_manager + + with reconfigure_num_microbatches_calculator_manager(0, None, gbs, mbs, data_parallel_size=1): + + sync_ckpt_dir = tmp_path / 'sync_checkpoints' + async_ckpt_dir = tmp_path / 'async_checkpoints' + + sync_checkpoint_io = MegatronCheckpointIO('torch_dist') + async_checkpoint_io = AsyncFinalizableCheckpointIO(MegatronCheckpointIO('torch_dist', async_save=True)) + + # dummy_trainer just to initialize NCCL + dummy_trainer = pl.Trainer( + devices=1, + logger=False, + max_steps=2, + strategy=_get_strategy(), + ) + dummy_trainer.fit(model, data) + strategy = _get_strategy() + + ## reset the model and data and train with sync checkpointing + model, data = get_model_and_data(mbs, gbs) + sync_test_trainer = pl.Trainer( + devices=1, + enable_checkpointing=True, + logger=False, + max_steps=2, + strategy=_get_strategy(), + plugins=[sync_checkpoint_io], + default_root_dir=str(sync_ckpt_dir), + ) + sync_test_trainer.fit(model, data) + + ## reset the model and data and train with sync checkpointing + model, data = get_model_and_data(mbs, gbs) + async_test_trainer = pl.Trainer( + devices=1, + enable_checkpointing=True, + logger=False, + max_steps=2, + strategy=_get_strategy(), + plugins=[async_checkpoint_io], + callbacks=AsyncFinalizerCallback(), + default_root_dir=str(async_ckpt_dir), + ) + async_test_trainer.fit(model, data) checkpoint = {'sharded_state_dict': model.sharded_state_dict()} diff --git a/tests/lightning/test_nemo_resume_from_ckpt.py b/tests/lightning/test_nemo_resume_from_ckpt.py index 31ab88546cb3..e876e6965000 100644 --- a/tests/lightning/test_nemo_resume_from_ckpt.py +++ b/tests/lightning/test_nemo_resume_from_ckpt.py @@ -27,7 +27,6 @@ def set_env(): import pytest import torch -from megatron.core.num_microbatches_calculator import reconfigure_num_microbatches_calculator from megatron.core.optimizer import OptimizerConfig import nemo.lightning as nl @@ -90,7 +89,7 @@ def compare_ckpts(a, b, path=[]): raise ValueError("Unexpected value type " + str(type(a))) -def setup_data_model_optim(log_dir, n_steps, data_path, gbs=2, mbs=1): +def setup_data(log_dir, n_steps, data_path, gbs=2, mbs=1): seq_length = 2048 tokenizer = get_nmt_tokenizer( "megatron", @@ -108,14 +107,11 @@ def setup_data_model_optim(log_dir, n_steps, data_path, gbs=2, mbs=1): tokenizer=tokenizer, 
split='9999,1,1', ) - # Other tests might have different configs, so need to configure explicitly. - reconfigure_num_microbatches_calculator( - 0, - None, - gbs, - mbs, - data_parallel_size=1, - ) + return data + + +def setup_model_optim(log_dir, n_steps, tokenizer, gbs=2, mbs=1): + seq_length = 2048 gpt_config = llm.GPTConfig( num_layers=2, hidden_size=128, @@ -131,7 +127,7 @@ def setup_data_model_optim(log_dir, n_steps, data_path, gbs=2, mbs=1): masked_softmax_fusion=False, ) - model = llm.GPTModel(gpt_config, tokenizer=data.tokenizer) + model = llm.GPTModel(gpt_config, tokenizer=tokenizer) opt_config = OptimizerConfig( optimizer='adam', @@ -148,7 +144,7 @@ def setup_data_model_optim(log_dir, n_steps, data_path, gbs=2, mbs=1): ) optim = MegatronOptimizerModule(config=opt_config) - return gpt_config, data, model, optim + return gpt_config, model, optim def setup_trainer_and_logger(log_dir): @@ -248,18 +244,29 @@ def train(n_steps, resume): log_dir = f'/tmp/mcore_logs_{n_steps}steps' os.makedirs(log_dir, exist_ok=True) data_path = [DATA_PATH] - gpt_config, data, model, optim = setup_data_model_optim(log_dir, n_steps, data_path) - trainer, nemo_logger = setup_trainer_and_logger(log_dir) - llm.train( - model=model, - data=data, - trainer=trainer, - log=nemo_logger, - resume=resume, - tokenizer='data', - optim=optim, - ) - trainer._teardown() + data = setup_data(log_dir, n_steps, data_path, gbs=2, mbs=1) + # Other tests might have different configs, so need to configure explicitly. + from tests.lightning.mcore_microbatch_utils import reconfigure_num_microbatches_calculator_manager + + with reconfigure_num_microbatches_calculator_manager( + 0, + None, + 2, # gbs + 1, # mbs + data_parallel_size=1, + ): + gpt_config, model, optim = setup_model_optim(log_dir, n_steps, data.tokenizer) + trainer, nemo_logger = setup_trainer_and_logger(log_dir) + llm.train( + model=model, + data=data, + trainer=trainer, + log=nemo_logger, + resume=resume, + tokenizer='data', + optim=optim, + ) + trainer._teardown() set_env() assert os.environ['NVTE_FLASH_ATTN'] == '0' diff --git a/tests/lightning/test_state_restoration.py b/tests/lightning/test_state_restoration.py index 2f4c60395725..076a2f931f57 100644 --- a/tests/lightning/test_state_restoration.py +++ b/tests/lightning/test_state_restoration.py @@ -11,9 +11,10 @@ from nemo.collections.llm.api import train from nemo.collections.llm.gpt.data import PreTrainingDataModule from nemo.collections.nlp.modules.common.tokenizer_utils import get_nmt_tokenizer -from nemo.lightning import NeMoLogger +from nemo.lightning import AutoResume, NeMoLogger from nemo.lightning.pytorch.optim.lr_scheduler import CosineAnnealingScheduler from nemo.lightning.pytorch.optim.megatron import MegatronOptimizerModule +from tests.lightning.mcore_microbatch_utils import reconfigure_num_microbatches_calculator_manager VOCAB_PATH = "/home/TestData/nlp/megatron_gpt/data/gpt/vocab.json" MERGES_PATH = "/home/TestData/nlp/megatron_gpt/data/gpt/merges.txt" @@ -21,6 +22,12 @@ EXP_DIR = '/tmp/nemo_exp/' +def teardown(exp_dir=EXP_DIR): + import shutil + + shutil.rmtree(exp_dir) + + class ValidateOptStateRestoration(Callback): def on_fit_start(self, trainer: "pl.Trainer", pl_module: "pl.LightningModule") -> None: # PTL has no on_load_checkpoint_start event to be triggered before @@ -59,7 +66,7 @@ def on_fit_start(self, trainer: "pl.Trainer", pl_module: "pl.LightningModule") - def on_train_start(self, trainer: "pl.Trainer", pl_module: "pl.LightningModule") -> None: for p in pl_module.parameters(): - assert 
torch.all(p == 0), "Expected params to be zero" + assert torch.all(p == 0), "Expected params (scratch) to be zero" with torch.no_grad(): for p in pl_module.parameters(): p.fill_(random.uniform(0, 1)) @@ -69,14 +76,19 @@ class ValidateModelRestoration(Callback): def on_fit_start(self, trainer: "pl.Trainer", pl_module: "pl.LightningModule") -> None: for p in pl_module.parameters(): p.detach().zero_() + self.called_on_load_checkpoint = False + + def on_load_checkpoint(self, trainer, pl_module, checkpoint) -> None: + self.called_on_load_checkpoint = True def on_train_start(self, trainer: "pl.Trainer", pl_module: "pl.LightningModule") -> None: for p in pl_module.parameters(): - assert not torch.all(p == 0), "Expected params to be non-zero" + assert not torch.all(p == 0), "Expected params (resume) to be non-zero" + assert hasattr(self, 'called_on_load_checkpoint') + assert self.called_on_load_checkpoint == True, "Expected to have called on_load_checkpoint" -def make_model_optim_data(): - seq_length = 2048 +def setup_data(mbs=1, gbs=2, seq_length=2048): tokenizer = get_nmt_tokenizer( "megatron", "GPT2BPETokenizer", @@ -87,16 +99,19 @@ def make_model_optim_data(): data = PreTrainingDataModule( paths=DATA_PATH, seq_length=2048, - micro_batch_size=1, - global_batch_size=2, + micro_batch_size=mbs, + global_batch_size=gbs, seed=1234, tokenizer=tokenizer, ) + return data + +def make_model_optim(tokenizer, mbs=1, gbs=2, seq_length=2048): gpt_config = llm.GPTConfig( - num_layers=12, - hidden_size=768, - ffn_hidden_size=3072, + num_layers=2, + hidden_size=128, + ffn_hidden_size=256, num_attention_heads=12, seq_length=seq_length, init_method_std=0.023, @@ -106,7 +121,7 @@ def make_model_optim_data(): make_vocab_size_divisible_by=128, masked_softmax_fusion=False, ) - model = llm.GPTModel(gpt_config, tokenizer=data.tokenizer) + model = llm.GPTModel(gpt_config, tokenizer=tokenizer) opt = MegatronOptimizerModule( config=OptimizerConfig( @@ -125,64 +140,103 @@ def make_model_optim_data(): ), ) - return model, opt, data - - -def run_train_from_scratch(): - model, opt, data = make_model_optim_data() - trainer = nl.Trainer( - devices=2, - max_steps=10, - accelerator="gpu", - strategy=nl.MegatronStrategy(), - callbacks=[ValidateOptStateScratchInit(), ValidateModelScratchInit()], - log_every_n_steps=1, - limit_val_batches=2, - plugins=nl.MegatronMixedPrecision(precision="bf16-mixed"), - ) - - train( - model=model, - data=data, - trainer=trainer, - log=NeMoLogger( - log_dir=EXP_DIR, - ), - tokenizer='data', - optim=opt, - ) - - -def run_resume_train(): - model, opt, data = make_model_optim_data() - trainer = nl.Trainer( - devices=2, - max_steps=1, - accelerator="gpu", - strategy=nl.MegatronStrategy(), - callbacks=[ValidateOptStateRestoration(), ValidateModelRestoration()], - log_every_n_steps=1, - limit_val_batches=2, - plugins=nl.MegatronMixedPrecision(precision="bf16-mixed"), - ) - - train( - model=model, - data=data, - trainer=trainer, - log=NeMoLogger( - log_dir=EXP_DIR, - ), - tokenizer='data', - optim=opt, - resume=nl.AutoResume( - resume_if_exists=True, - resume_ignore_no_checkpoint=True, - ), - ) + return model, opt + + +def run_train_from_scratch(mbs, gbs, num_dev): + data = setup_data(mbs, gbs) + model, opt = make_model_optim(data.tokenizer, mbs, gbs) + # Other tests might have different configs, so need to configure explicitly. 
+ with reconfigure_num_microbatches_calculator_manager( + 0, + None, + gbs, + mbs, + data_parallel_size=num_dev, + ): + trainer = nl.Trainer( + devices=num_dev, + max_steps=10, + accelerator="gpu", + strategy=nl.MegatronStrategy(), + callbacks=[ValidateOptStateScratchInit(), ValidateModelScratchInit()], + log_every_n_steps=1, + limit_val_batches=2, + plugins=nl.MegatronMixedPrecision(precision="bf16-mixed"), + ) + + train( + model=model, + data=data, + trainer=trainer, + log=NeMoLogger( + log_dir=EXP_DIR, + version='v1', + use_datetime_version=True, + update_logger_directory=True, + wandb=None, + ), + resume=AutoResume( + resume_if_exists=True, + resume_ignore_no_checkpoint=True, + ), + tokenizer='data', + optim=opt, + ) + trainer._teardown() + + +def run_resume_train(mbs, gbs, num_dev): + data = setup_data(mbs, gbs) + model, opt = make_model_optim(data.tokenizer, mbs, gbs) + # Other tests might have different configs, so need to configure explicitly. + with reconfigure_num_microbatches_calculator_manager( + 0, + None, + gbs, + mbs, + data_parallel_size=num_dev, + ): + trainer = nl.Trainer( + devices=num_dev, + max_steps=1, + accelerator="gpu", + strategy=nl.MegatronStrategy(), + callbacks=[ValidateOptStateRestoration(), ValidateModelRestoration()], + log_every_n_steps=1, + limit_val_batches=2, + plugins=nl.MegatronMixedPrecision(precision="bf16-mixed"), + ) + from nemo.lightning.pytorch.strategies.utils import RestoreConfig + + train( + model=model, + data=data, + trainer=trainer, + tokenizer='data', + optim=opt, + log=NeMoLogger( + log_dir=EXP_DIR, + version='v1', + use_datetime_version=True, + update_logger_directory=True, + wandb=None, + ), + resume=AutoResume( + resume_if_exists=True, + resume_ignore_no_checkpoint=False, + resume_from_path=f'{EXP_DIR}default/v1/checkpoints/default--None=0.0000-epoch=0/', + ), + ) + trainer._teardown() @pytest.mark.run_only_on('GPU') def test_optim_state_restoration(): - run_train_from_scratch() - run_resume_train() + mbs, gbs = 1, 2 + num_devices = 1 + try: + run_train_from_scratch(mbs, gbs, num_devices) + run_resume_train(mbs, gbs, num_devices) + finally: + teardown() From 4ba92b3608363ee9787c882be0fbf31e001f007c Mon Sep 17 00:00:00 2001 From: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Date: Mon, 7 Oct 2024 10:24:13 -0700 Subject: [PATCH 04/18] remove 8x3b recipes (#10764) * remove 8x3b recipes Signed-off-by: Alexandros Koumparoulis * remove 8x3b from test_nemo_run Signed-off-by: Alexandros Koumparoulis * rm from __init__ Signed-off-by: Alexandros Koumparoulis --------- Signed-off-by: Alexandros Koumparoulis Signed-off-by: Youngeun Kwon --- nemo/collections/llm/recipes/__init__.py | 6 - nemo/collections/llm/recipes/mixtral_8x3b.py | 290 ------------------ .../llm/recipes/mixtral_8x3b_16k.py | 132 -------- .../llm/recipes/mixtral_8x3b_64k.py | 133 -------- .../llm/recipes/test_mixtral_8x3b.py | 110 ------- .../llm/recipes/test_mixtral_8x3b_16k.py | 84 ----- .../llm/recipes/test_mixtral_8x3b_64k.py | 84 ----- tests/lightning/test_nemo_run.py | 4 - 8 files changed, 843 deletions(-) delete mode 100644 nemo/collections/llm/recipes/mixtral_8x3b.py delete mode 100644 nemo/collections/llm/recipes/mixtral_8x3b_16k.py delete mode 100644 nemo/collections/llm/recipes/mixtral_8x3b_64k.py delete mode 100644 tests/collections/llm/recipes/test_mixtral_8x3b.py delete mode 100644 tests/collections/llm/recipes/test_mixtral_8x3b_16k.py delete mode 100644 tests/collections/llm/recipes/test_mixtral_8x3b_64k.py diff --git 
a/nemo/collections/llm/recipes/__init__.py b/nemo/collections/llm/recipes/__init__.py index 43c881110603..6bee8c882ffd 100644 --- a/nemo/collections/llm/recipes/__init__.py +++ b/nemo/collections/llm/recipes/__init__.py @@ -22,9 +22,6 @@ llama3_70b_64k, llama31_405b, mistral, - mixtral_8x3b, - mixtral_8x3b_16k, - mixtral_8x3b_64k, mixtral_8x7b, mixtral_8x7b_16k, mixtral_8x7b_64k, @@ -52,9 +49,6 @@ "llama3_70b_64k", "llama31_405b", "mistral", - "mixtral_8x3b", - "mixtral_8x3b_16k", - "mixtral_8x3b_64k", "mixtral_8x7b", "mixtral_8x7b_16k", "mixtral_8x7b_64k", diff --git a/nemo/collections/llm/recipes/mixtral_8x3b.py b/nemo/collections/llm/recipes/mixtral_8x3b.py deleted file mode 100644 index ca5b4e35039f..000000000000 --- a/nemo/collections/llm/recipes/mixtral_8x3b.py +++ /dev/null @@ -1,290 +0,0 @@ -# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved. -# -# Licensed under the Apache License, Version 2.0 (the "License"); -# you may not use this file except in compliance with the License. -# You may obtain a copy of the License at -# -# http://www.apache.org/licenses/LICENSE-2.0 -# -# Unless required by applicable law or agreed to in writing, software -# distributed under the License is distributed on an "AS IS" BASIS, -# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. -# See the License for the specific language governing permissions and -# limitations under the License. - - -from typing import Callable, Optional - -import nemo_run as run -import pytorch_lightning as pl -import torch -from megatron.core.distributed import DistributedDataParallelConfig -from pytorch_lightning.callbacks.callback import Callback - -from nemo import lightning as nl -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.gpt.model.mixtral import MixtralConfig8x3B, MixtralModel -from nemo.collections.llm.peft.lora import LoRA -from nemo.collections.llm.recipes.log.default import default_log, default_resume, tensorboard_logger -from nemo.collections.llm.recipes.optim.adam import distributed_fused_adam_with_cosine_annealing -from nemo.collections.llm.recipes.precision.mixed_precision import bf16_mixed -from nemo.lightning.pytorch.callbacks.megatron_comm_overlap import MegatronCommOverlapCallback -from nemo.lightning.pytorch.callbacks.moe_token_drop import MegatronTokenDropCallback -from nemo.utils.exp_manager import TimingCallback - -NAME = "mixtral_8x3b" - - -@run.cli.factory(name=NAME) -def model() -> run.Config[pl.LightningModule]: - """ - Factory function to create a Mixtral 8x3B model configuration. - - Returns: - run.Config[pl.LightningModule]: Configuration for the Mixtral 8x3B model. - - Examples: - CLI usage: - $ nemo llm pretrain model=mixtral_8x3b ... 
- - Python API usage: - >>> model_config = model() - >>> print(model_config) - """ - return run.Config(MixtralModel, config=run.Config(MixtralConfig8x3B)) - - -def trainer( - tensor_parallelism: int = 1, - pipeline_parallelism: int = 1, - pipeline_parallelism_type: Optional[torch.dtype] = None, - virtual_pipeline_parallelism: Optional[int] = None, - context_parallelism: int = 1, - sequence_parallelism: bool = False, - expert_parallelism: int = 4, - num_nodes: int = 2, - num_gpus_per_node: int = 8, - max_steps: int = 1168251, - callbacks: Optional[list[run.Config[Callback]]] = None, -) -> run.Config[nl.Trainer]: - """ - Configure the NeMo Lightning Trainer for Mixtral 8x3B model. - - This function sets up the distributed training strategy optimized for the Mixtral 8x3B model. - - Args: - tensor_parallelism (int): Degree of tensor model parallelism. - pipeline_parallelism (int): Degree of pipeline model parallelism. - pipeline_parallelism_type (Optional[torch.dtype]): Data type for pipeline parallelism. - virtual_pipeline_parallelism (Optional[int]): Size of virtual pipeline parallelism. - context_parallelism (int): Degree of context parallelism. - sequence_parallelism (bool): Whether to use sequence parallelism. - expert_parallelism (int): Degree of expert parallelism. - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - max_steps (int): Maximum number of training steps. - callbacks (Optional[list[run.Config[Callback]]]): List of callback configurations. - - Returns: - run.Config[nl.Trainer]: Configuration for the NeMo Lightning Trainer. - - Examples: - CLI usage: - $ nemo llm pretrain trainer=mixtral_8x3b ... - - Python API usage: - >>> trainer_config = trainer(num_nodes=2, num_gpus_per_node=8) - >>> print(trainer_config) - """ - strategy = run.Config( - nl.MegatronStrategy, - tensor_model_parallel_size=tensor_parallelism, - pipeline_model_parallel_size=pipeline_parallelism, - pipeline_dtype=pipeline_parallelism_type, - virtual_pipeline_model_parallel_size=virtual_pipeline_parallelism, - context_parallel_size=context_parallelism, - sequence_parallel=sequence_parallelism, - expert_model_parallel_size=expert_parallelism, - gradient_as_bucket_view=True, - ckpt_async_save=True, - ckpt_parallel_load=True, - ddp=run.Config( - DistributedDataParallelConfig, - check_for_nan_in_grad=True, - grad_reduce_in_fp32=True, - overlap_grad_reduce=True, - overlap_param_gather=True, - ), - ) - - trainer = run.Config( - nl.Trainer, - accelerator="gpu", - accumulate_grad_batches=1, - callbacks=callbacks, - devices=num_gpus_per_node, - limit_test_batches=50, - limit_val_batches=32, - log_every_n_steps=10, - max_steps=max_steps, - num_nodes=num_nodes, - plugins=bf16_mixed(), - strategy=strategy, - use_distributed_sampler=False, - val_check_interval=2000, - ) - - return trainer - - -@run.cli.factory(target=pretrain, name=NAME) -def pretrain_recipe( - dir: Optional[str] = None, name: str = "default", num_nodes: int = 2, num_gpus_per_node: int = 8, fn=pretrain -) -> run.Partial: - """ - Create a pre-training recipe for Mixtral 8x3B model. - - This function sets up a complete configuration for pre-training, including - model, trainer, and data settings. - - Args: - dir (Optional[str]): Directory for saving logs and checkpoints. - name (str): Name of the pre-training run. - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - fn (Callable): Function to use for pre-training (default: nemo.collections.llm.api.pretrain). 
- - Returns: - run.Partial: Partial configuration for pre-training. - - Examples: - CLI usage: - $ nemo llm pretrain --factory mixtral_8x3b - $ nemo llm pretrain --factory "mixtral_8x3b(num_nodes=2, name='my_pretrain')" - - Python API usage: - >>> recipe = pretrain_recipe(name="mixtral_8x3b_pretrain", num_nodes=2) - >>> print(recipe) - """ - return run.Partial( - fn, - model=model(), - trainer=trainer( - num_nodes=num_nodes, - num_gpus_per_node=num_gpus_per_node, - callbacks=[run.Config(TimingCallback)], - ), - data=run.Config(MockDataModule, seq_length=8192, global_batch_size=512, micro_batch_size=1), - log=default_log(dir=dir, name=name, tensorboard_logger=tensorboard_logger(name=name)), - optim=distributed_fused_adam_with_cosine_annealing(max_lr=3e-4), - resume=default_resume(), - ) - - -@run.cli.factory(target=pretrain, name=NAME + "_performance") -def pretrain_recipe_performance( - dir: Optional[str] = None, name: str = "default", num_nodes: int = 2, num_gpus_per_node: int = 8, fn=pretrain -) -> run.Partial: - """ - Create a performance-optimized pre-training recipe for Mixtral 8x3B model. - - This recipe enables performance optimizations that may not be suitable for all use cases. - It builds upon the standard pre-training recipe and adds additional performance enhancements. - - Args: - dir (Optional[str]): Directory for saving logs and checkpoints. - name (str): Name of the pre-training run. - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - fn (Callable): The pre-training function to use. - - Returns: - run.Partial: Partial configuration for performance-optimized pre-training. - - Examples: - CLI usage: - $ nemo llm pretrain --factory "mixtral_8x3b.pretrain_recipe_performance(num_nodes=2, name='perf_pretrain')" - - Python API usage: - >>> recipe = pretrain_recipe_performance(name="mixtral_8x3b", num_nodes=4) - >>> print(recipe) - - Note: - Use this recipe with caution and only when you need maximum performance. - It may not be suitable for all hardware configurations or use cases. - """ - recipe = pretrain_recipe(name=name, dir=dir, num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node, fn=fn) - - recipe.trainer.callbacks.extend( - [ - run.Config(MegatronTokenDropCallback), - run.Config(MegatronCommOverlapCallback), - ] - ) - - return recipe - - -def hf_resume() -> run.Config[nl.AutoResume]: - """ - Configure the Hugging Face model resuming for Mixtral 8x3B model. - - This function sets up the configuration for resuming training from a Hugging Face model. - - Returns: - run.Config[nl.AutoResume]: Configuration for resuming from a Hugging Face model. - - Examples: - CLI usage: - $ nemo llm finetune --factory "mixtral_8x3b(resume=hf_resume())" - - Python API usage: - >>> recipe = finetune_recipe(name="mixtral_8x3b_finetune", num_nodes=2) - >>> recipe.resume = hf_resume() - >>> print(recipe) - """ - return run.Config( - nl.AutoResume, - restore_config=run.Config(nl.RestoreConfig, path="hf://mistralai/Mixtral-8x7B-v0.1"), - ) - - -@run.cli.factory(target=finetune, name=NAME) -def finetune_recipe( - dir: Optional[str] = None, - name: str = "default", - num_nodes: int = 1, - num_gpus_per_node: int = 8, -) -> run.Partial: - """ - Create a fine-tuning recipe for Mixtral 8x3B model. - - This function sets up a complete configuration for fine-tuning, including - model, trainer, and data settings. - - Args: - dir (Optional[str]): Directory for saving logs and checkpoints. - name (str): Name of the fine-tuning run. 
- num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - - Returns: - run.Partial: Partial configuration for fine-tuning. - - Examples: - CLI usage: - $ nemo llm finetune --factory mixtral_8x3b - $ nemo llm finetune --factory "mixtral_8x3b(num_nodes=2, name='my_finetune')" - - Python API usage: - >>> recipe = finetune_recipe(name="mixtral_8x3b_finetune", num_nodes=2) - >>> print(recipe) - """ - recipe = pretrain_recipe(name=name, dir=dir, num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node, fn=finetune) - - recipe.resume = hf_resume() - recipe.peft = run.Config(LoRA, target_modules=['linear_qkv', 'linear_proj'], dim=32) - recipe.data = run.Config(SquadDataModule, seq_length=8192, global_batch_size=512, micro_batch_size=1) - return recipe diff --git a/nemo/collections/llm/recipes/mixtral_8x3b_16k.py b/nemo/collections/llm/recipes/mixtral_8x3b_16k.py deleted file mode 100644 index 13ca1c2d4537..000000000000 --- a/nemo/collections/llm/recipes/mixtral_8x3b_16k.py +++ /dev/null @@ -1,132 +0,0 @@ -# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved. -# -# Licensed under the Apache License, Version 2.0 (the "License"); -# you may not use this file except in compliance with the License. -# You may obtain a copy of the License at -# -# http://www.apache.org/licenses/LICENSE-2.0 -# -# Unless required by applicable law or agreed to in writing, software -# distributed under the License is distributed on an "AS IS" BASIS, -# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. -# See the License for the specific language governing permissions and -# limitations under the License. - - -from typing import Optional - -import nemo_run as run -import pytorch_lightning as pl -import torch - -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.recipes import mixtral_8x3b - -NAME = "mixtral_8x3b_16k" - - -@run.cli.factory(name=NAME) -def model() -> run.Config[pl.LightningModule]: - """ - Factory function to create a Mixtral 8x3B model configuration with 16k sequence length. - - Returns: - run.Config[pl.LightningModule]: Configuration for the Mixtral 8x3B model with 16k sequence length. - - Examples: - CLI usage: - $ nemo llm pretrain model=mixtral_8x3b_16k ... - - Python API usage: - >>> model_config = model() - >>> print(model_config) - """ - model_config = mixtral_8x3b.model() - model_config.config.seq_length = 16384 - model_config.config.max_position_embeddings = 16384 - return model_config - - -def trainer( - num_nodes: int = 1, - num_gpus_per_node: int = 8, -) -> run.Config: - """ - Configure the NeMo Lightning Trainer for Mixtral 8x3B model with 16k sequence length. - - This function sets up the distributed training strategy optimized for longer sequences. - - Args: - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - - Returns: - run.Config: Configuration for the NeMo Lightning Trainer. - - Examples: - CLI usage: - $ nemo llm pretrain trainer=mixtral_8x3b_16k ... - - Python API usage: - >>> trainer_config = trainer(num_nodes=2, num_gpus_per_node=8) - >>> print(trainer_config) - - Note: - This configuration uses increased parallelism to handle the longer sequence length efficiently. 
- """ - return mixtral_8x3b.trainer( - tensor_parallelism=2, - pipeline_parallelism=2, - pipeline_parallelism_type=torch.bfloat16, - virtual_pipeline_parallelism=8, - context_parallelism=2, - sequence_parallelism=True, - expert_parallelism=1, - num_nodes=num_nodes, - num_gpus_per_node=num_gpus_per_node, - ) - - -@run.cli.factory(target=pretrain, name=NAME) -def pretrain_recipe( - dir: Optional[str] = None, - name: str = "default", - num_nodes: int = 1, - num_gpus_per_node: int = 8, -) -> run.Partial: - """ - Create a pre-training recipe for Mixtral 8x3B model with 16k sequence length. - - This function sets up a complete configuration for pre-training, including - model, trainer, and data settings optimized for 16k sequence length. - - Args: - dir (Optional[str]): Directory for saving logs and checkpoints. - name (str): Name of the pre-training run. - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - - Returns: - run.Partial: Partial configuration for pre-training. - - Examples: - CLI usage: - $ nemo llm pretrain --factory mixtral_8x3b_16k - $ nemo llm pretrain --factory "mixtral_8x3b_16k(num_nodes=2, name='my_16k_pretrain')" - - Python API usage: - >>> recipe = pretrain_recipe(name="mixtral_8x3b_16k_pretrain", num_nodes=2) - >>> print(recipe) - - Note: - This recipe is optimized for handling longer sequences (16k) compared to the standard version. - """ - recipe = mixtral_8x3b.pretrain_recipe(name=name, dir=dir, num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - - recipe.model = model() - recipe.trainer = trainer(num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - recipe.data = run.Config(MockDataModule, seq_length=16384, global_batch_size=512, micro_batch_size=1) - - return recipe diff --git a/nemo/collections/llm/recipes/mixtral_8x3b_64k.py b/nemo/collections/llm/recipes/mixtral_8x3b_64k.py deleted file mode 100644 index e21d85a13dcd..000000000000 --- a/nemo/collections/llm/recipes/mixtral_8x3b_64k.py +++ /dev/null @@ -1,133 +0,0 @@ -# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved. -# -# Licensed under the Apache License, Version 2.0 (the "License"); -# you may not use this file except in compliance with the License. -# You may obtain a copy of the License at -# -# http://www.apache.org/licenses/LICENSE-2.0 -# -# Unless required by applicable law or agreed to in writing, software -# distributed under the License is distributed on an "AS IS" BASIS, -# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. -# See the License for the specific language governing permissions and -# limitations under the License. - - -from typing import Optional - -import nemo_run as run -import pytorch_lightning as pl -import torch - -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.recipes import mixtral_8x3b -from nemo.utils.exp_manager import TimingCallback - -NAME = "mixtral_8x3b_64k" - - -@run.cli.factory(name=NAME) -def model() -> run.Config[pl.LightningModule]: - """ - Factory function to create a Mixtral 8x3B model configuration with 64k sequence length. - - Returns: - run.Config[pl.LightningModule]: Configuration for the Mixtral 8x3B model with 64k sequence length. - - Examples: - CLI usage: - $ nemo llm pretrain model=mixtral_8x3b_64k ... 
- - Python API usage: - >>> model_config = model() - >>> print(model_config) - """ - model_config = mixtral_8x3b.model() - model_config.config.seq_length = 65536 - return model_config - - -def trainer( - num_nodes: int = 8, - num_gpus_per_node: int = 8, -) -> run.Config: - """ - Configure the NeMo Lightning Trainer for Mixtral 8x3B model with 64k sequence length. - - This function sets up the distributed training strategy optimized for long sequences. - - Args: - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - - Returns: - run.Config: Configuration for the NeMo Lightning Trainer. - - Examples: - CLI usage: - $ nemo llm pretrain trainer=mixtral_8x3b_64k ... - - Python API usage: - >>> trainer_config = trainer(num_nodes=8, num_gpus_per_node=8) - >>> print(trainer_config) - - Note: - This configuration uses significantly increased parallelism to handle the long sequence length efficiently. - """ - return mixtral_8x3b.trainer( - tensor_parallelism=4, - pipeline_parallelism=4, - pipeline_parallelism_type=torch.bfloat16, - virtual_pipeline_parallelism=8, - context_parallelism=4, - sequence_parallelism=True, - expert_parallelism=1, - num_nodes=num_nodes, - num_gpus_per_node=num_gpus_per_node, - callbacks=[run.Config(TimingCallback)], - ) - - -@run.cli.factory(target=pretrain, name=NAME) -def pretrain_recipe( - dir: Optional[str] = None, - name: str = "default", - num_nodes: int = 8, - num_gpus_per_node: int = 8, -) -> run.Partial: - """ - Create a pre-training recipe for Mixtral 8x3B model with 64k sequence length. - - This function sets up a complete configuration for pre-training, including - model, trainer, and data settings optimized for 64k sequence length. - - Args: - dir (Optional[str]): Directory for saving logs and checkpoints. - name (str): Name of the pre-training run. - num_nodes (int): Number of compute nodes to use. - num_gpus_per_node (int): Number of GPUs per node. - - Returns: - run.Partial: Partial configuration for pre-training. - - Examples: - CLI usage: - $ nemo llm pretrain --factory mixtral_8x3b_64k - $ nemo llm pretrain --factory "mixtral_8x3b_64k(num_nodes=8, name='my_64k_pretrain')" - - Python API usage: - >>> recipe = pretrain_recipe(name="mixtral_8x3b_64k_pretrain", num_nodes=8) - >>> print(recipe) - - Note: - This recipe is optimized for handling long sequences (64k) compared to the standard version. - It requires significant computational resources due to the extended sequence length. 
- """ - recipe = mixtral_8x3b.pretrain_recipe(name=name, dir=dir, num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - - recipe.model = model() - recipe.trainer = trainer(num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - recipe.data = run.Config(MockDataModule, seq_length=65536, global_batch_size=512, micro_batch_size=1) - return recipe diff --git a/tests/collections/llm/recipes/test_mixtral_8x3b.py b/tests/collections/llm/recipes/test_mixtral_8x3b.py deleted file mode 100644 index 238fec74e0e1..000000000000 --- a/tests/collections/llm/recipes/test_mixtral_8x3b.py +++ /dev/null @@ -1,110 +0,0 @@ -import nemo_run as run -import pytest - -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.gpt.model.mixtral import MixtralConfig8x3B, MixtralModel -from nemo.collections.llm.peft.lora import LoRA -from nemo.collections.llm.recipes import mixtral_8x3b -from nemo.lightning import AutoResume, Trainer - - -class TestMixtral8x3B: - @pytest.fixture(scope="class") - def recipe_module(self): - return mixtral_8x3b - - def test_model(self, recipe_module): - model_config = recipe_module.model() - assert isinstance(model_config, run.Config) - assert model_config.__fn_or_cls__ == MixtralModel - assert isinstance(model_config.config, run.Config) - assert model_config.config.__fn_or_cls__ == MixtralConfig8x3B - - def test_trainer(self, recipe_module): - trainer_config = recipe_module.trainer() - assert isinstance(trainer_config, run.Config) - assert trainer_config.__fn_or_cls__ == Trainer - assert trainer_config.accelerator == "gpu" - assert trainer_config.devices == 8 - assert trainer_config.num_nodes == 2 - - # Check strategy configuration - assert isinstance(trainer_config.strategy, run.Config) - assert trainer_config.strategy.__fn_or_cls__.__name__ == "MegatronStrategy" - assert trainer_config.strategy.tensor_model_parallel_size == 1 - assert trainer_config.strategy.pipeline_model_parallel_size == 1 - assert trainer_config.strategy.pipeline_dtype is None - assert trainer_config.strategy.virtual_pipeline_model_parallel_size is None - assert trainer_config.strategy.context_parallel_size == 1 - assert trainer_config.strategy.sequence_parallel is False - assert trainer_config.strategy.expert_model_parallel_size == 4 - - def test_pretrain_recipe(self, recipe_module): - recipe = recipe_module.pretrain_recipe() - assert isinstance(recipe, run.Partial) - assert recipe.__fn_or_cls__ == pretrain - assert isinstance(recipe.model, run.Config) - assert recipe.model.__fn_or_cls__ == MixtralModel - assert isinstance(recipe.trainer, run.Config) - assert recipe.trainer.__fn_or_cls__ == Trainer - assert isinstance(recipe.data, run.Config) - assert recipe.data.__fn_or_cls__ == MockDataModule - assert recipe.data.seq_length == 8192 - assert recipe.data.global_batch_size == 512 - assert recipe.data.micro_batch_size == 1 - - def test_finetune_recipe(self, recipe_module): - recipe = recipe_module.finetune_recipe() - assert isinstance(recipe, run.Partial) - assert recipe.__fn_or_cls__ == finetune - assert isinstance(recipe.model, run.Config) - assert recipe.model.__fn_or_cls__ == MixtralModel - assert isinstance(recipe.trainer, run.Config) - assert recipe.trainer.__fn_or_cls__ == Trainer - assert isinstance(recipe.data, run.Config) - assert recipe.data.__fn_or_cls__ == SquadDataModule - assert recipe.data.seq_length == 8192 - assert 
recipe.data.global_batch_size == 512 - assert recipe.data.micro_batch_size == 1 - assert isinstance(recipe.peft, run.Config) - assert recipe.peft.__fn_or_cls__ == LoRA - assert recipe.peft.target_modules == ['linear_qkv', 'linear_proj'] - assert recipe.peft.dim == 32 - - @pytest.mark.parametrize("num_nodes,num_gpus_per_node", [(1, 8), (2, 4), (4, 2)]) - def test_pretrain_recipe_with_different_configurations(self, recipe_module, num_nodes, num_gpus_per_node): - recipe = recipe_module.pretrain_recipe(num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - assert recipe.trainer.num_nodes == num_nodes - assert recipe.trainer.devices == num_gpus_per_node - - def test_hf_resume(self, recipe_module): - resume_config = recipe_module.hf_resume() - assert isinstance(resume_config, run.Config) - assert resume_config.__fn_or_cls__ == AutoResume - assert isinstance(resume_config.restore_config, run.Config) - assert resume_config.restore_config.path == "hf://mistralai/Mixtral-8x7B-v0.1" - - def test_trainer_parallelism_options(self, recipe_module): - trainer_config = recipe_module.trainer( - tensor_parallelism=8, - pipeline_parallelism=2, - context_parallelism=4, - sequence_parallelism=False, - expert_parallelism=2, - ) - assert trainer_config.strategy.tensor_model_parallel_size == 8 - assert trainer_config.strategy.pipeline_model_parallel_size == 2 - assert trainer_config.strategy.context_parallel_size == 4 - assert trainer_config.strategy.sequence_parallel is False - assert trainer_config.strategy.expert_model_parallel_size == 2 - - def test_model_config_parameters(self, recipe_module): - model_config = recipe_module.model() - mixtral_config = model_config.config - assert mixtral_config.num_layers == 32 - assert mixtral_config.hidden_size == 2560 - assert mixtral_config.num_attention_heads == 32 - assert mixtral_config.seq_length == 4096 - assert mixtral_config.num_moe_experts == 8 diff --git a/tests/collections/llm/recipes/test_mixtral_8x3b_16k.py b/tests/collections/llm/recipes/test_mixtral_8x3b_16k.py deleted file mode 100644 index 1f1b041584d8..000000000000 --- a/tests/collections/llm/recipes/test_mixtral_8x3b_16k.py +++ /dev/null @@ -1,84 +0,0 @@ -import nemo_run as run -import pytest -import torch - -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.gpt.model.mixtral import MixtralConfig8x3B, MixtralModel -from nemo.collections.llm.recipes import mixtral_8x3b_16k -from nemo.lightning import Trainer - - -class TestMixtral8x3B_16k: - @pytest.fixture(scope="class") - def recipe_module(self): - return mixtral_8x3b_16k - - def test_model(self, recipe_module): - model_config = recipe_module.model() - assert isinstance(model_config, run.Config) - assert model_config.__fn_or_cls__ == MixtralModel - assert isinstance(model_config.config, run.Config) - assert model_config.config.__fn_or_cls__ == MixtralConfig8x3B - assert model_config.config.seq_length == 16384 - assert model_config.config.max_position_embeddings == 16384 - - def test_trainer(self, recipe_module): - trainer_config = recipe_module.trainer() - assert isinstance(trainer_config, run.Config) - assert trainer_config.__fn_or_cls__ == Trainer - assert trainer_config.accelerator == "gpu" - assert trainer_config.devices == 8 - assert trainer_config.num_nodes == 1 - - # Check strategy configuration - assert isinstance(trainer_config.strategy, run.Config) - assert 
trainer_config.strategy.__fn_or_cls__.__name__ == "MegatronStrategy" - assert trainer_config.strategy.tensor_model_parallel_size == 2 - assert trainer_config.strategy.pipeline_model_parallel_size == 2 - assert trainer_config.strategy.pipeline_dtype == torch.bfloat16 - assert trainer_config.strategy.virtual_pipeline_model_parallel_size == 8 - assert trainer_config.strategy.context_parallel_size == 2 - assert trainer_config.strategy.sequence_parallel is True - assert trainer_config.strategy.expert_model_parallel_size == 1 - - def test_pretrain_recipe(self, recipe_module): - recipe = recipe_module.pretrain_recipe() - assert isinstance(recipe, run.Partial) - assert recipe.__fn_or_cls__ == pretrain - assert isinstance(recipe.model, run.Config) - assert recipe.model.__fn_or_cls__ == MixtralModel - assert isinstance(recipe.trainer, run.Config) - assert recipe.trainer.__fn_or_cls__ == Trainer - assert isinstance(recipe.data, run.Config) - assert recipe.data.__fn_or_cls__ == MockDataModule - assert recipe.data.seq_length == 16384 - assert recipe.data.global_batch_size == 512 - assert recipe.data.micro_batch_size == 1 - - @pytest.mark.parametrize("num_nodes,num_gpus_per_node", [(1, 8), (2, 4), (4, 2)]) - def test_pretrain_recipe_with_different_configurations(self, recipe_module, num_nodes, num_gpus_per_node): - recipe = recipe_module.pretrain_recipe(num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - assert recipe.trainer.num_nodes == num_nodes - assert recipe.trainer.devices == num_gpus_per_node - - def test_trainer_parallelism_options(self, recipe_module): - trainer_config = recipe_module.trainer() - assert trainer_config.strategy.tensor_model_parallel_size == 2 - assert trainer_config.strategy.pipeline_model_parallel_size == 2 - assert trainer_config.strategy.pipeline_dtype == torch.bfloat16 - assert trainer_config.strategy.virtual_pipeline_model_parallel_size == 8 - assert trainer_config.strategy.context_parallel_size == 2 - assert trainer_config.strategy.sequence_parallel is True - assert trainer_config.strategy.expert_model_parallel_size == 1 - - def test_model_config_parameters(self, recipe_module): - model_config = recipe_module.model() - mixtral_config = model_config.config - assert mixtral_config.num_layers == 32 - assert mixtral_config.hidden_size == 2560 - assert mixtral_config.num_attention_heads == 32 - assert mixtral_config.seq_length == 16384 - assert mixtral_config.max_position_embeddings == 16384 - assert mixtral_config.num_moe_experts == 8 diff --git a/tests/collections/llm/recipes/test_mixtral_8x3b_64k.py b/tests/collections/llm/recipes/test_mixtral_8x3b_64k.py deleted file mode 100644 index d71017649b1b..000000000000 --- a/tests/collections/llm/recipes/test_mixtral_8x3b_64k.py +++ /dev/null @@ -1,84 +0,0 @@ -import nemo_run as run -import pytest -import torch - -from nemo.collections.llm.api import finetune, pretrain -from nemo.collections.llm.gpt.data.mock import MockDataModule -from nemo.collections.llm.gpt.data.squad import SquadDataModule -from nemo.collections.llm.gpt.model.mixtral import MixtralConfig8x3B, MixtralModel -from nemo.collections.llm.recipes import mixtral_8x3b_64k -from nemo.lightning import Trainer - - -class TestMixtral8x3B_64k: - @pytest.fixture(scope="class") - def recipe_module(self): - return mixtral_8x3b_64k - - def test_model(self, recipe_module): - model_config = recipe_module.model() - assert isinstance(model_config, run.Config) - assert model_config.__fn_or_cls__ == MixtralModel - assert isinstance(model_config.config, run.Config) - assert 
model_config.config.__fn_or_cls__ == MixtralConfig8x3B - assert model_config.config.seq_length == 65536 - assert model_config.config.max_position_embeddings == 4096 - - def test_trainer(self, recipe_module): - trainer_config = recipe_module.trainer() - assert isinstance(trainer_config, run.Config) - assert trainer_config.__fn_or_cls__ == Trainer - assert trainer_config.accelerator == "gpu" - assert trainer_config.devices == 8 - assert trainer_config.num_nodes == 8 - - # Check strategy configuration - assert isinstance(trainer_config.strategy, run.Config) - assert trainer_config.strategy.__fn_or_cls__.__name__ == "MegatronStrategy" - assert trainer_config.strategy.tensor_model_parallel_size == 4 - assert trainer_config.strategy.pipeline_model_parallel_size == 4 - assert trainer_config.strategy.pipeline_dtype == torch.bfloat16 - assert trainer_config.strategy.virtual_pipeline_model_parallel_size == 8 - assert trainer_config.strategy.context_parallel_size == 4 - assert trainer_config.strategy.sequence_parallel is True - assert trainer_config.strategy.expert_model_parallel_size == 1 - - def test_pretrain_recipe(self, recipe_module): - recipe = recipe_module.pretrain_recipe() - assert isinstance(recipe, run.Partial) - assert recipe.__fn_or_cls__ == pretrain - assert isinstance(recipe.model, run.Config) - assert recipe.model.__fn_or_cls__ == MixtralModel - assert isinstance(recipe.trainer, run.Config) - assert recipe.trainer.__fn_or_cls__ == Trainer - assert isinstance(recipe.data, run.Config) - assert recipe.data.__fn_or_cls__ == MockDataModule - assert recipe.data.seq_length == 65536 - assert recipe.data.global_batch_size == 512 - assert recipe.data.micro_batch_size == 1 - - @pytest.mark.parametrize("num_nodes,num_gpus_per_node", [(32, 8), (64, 4), (128, 2)]) - def test_pretrain_recipe_with_different_configurations(self, recipe_module, num_nodes, num_gpus_per_node): - recipe = recipe_module.pretrain_recipe(num_nodes=num_nodes, num_gpus_per_node=num_gpus_per_node) - assert recipe.trainer.num_nodes == num_nodes - assert recipe.trainer.devices == num_gpus_per_node - - def test_trainer_parallelism_options(self, recipe_module): - trainer_config = recipe_module.trainer() - assert trainer_config.strategy.tensor_model_parallel_size == 4 - assert trainer_config.strategy.pipeline_model_parallel_size == 4 - assert trainer_config.strategy.pipeline_dtype == torch.bfloat16 - assert trainer_config.strategy.virtual_pipeline_model_parallel_size == 8 - assert trainer_config.strategy.context_parallel_size == 4 - assert trainer_config.strategy.sequence_parallel is True - assert trainer_config.strategy.expert_model_parallel_size == 1 - - def test_model_config_parameters(self, recipe_module): - model_config = recipe_module.model() - mixtral_config = model_config.config - assert mixtral_config.num_layers == 32 - assert mixtral_config.hidden_size == 2560 - assert mixtral_config.num_attention_heads == 32 - assert mixtral_config.seq_length == 65536 - assert mixtral_config.max_position_embeddings == 4096 - assert mixtral_config.num_moe_experts == 8 diff --git a/tests/lightning/test_nemo_run.py b/tests/lightning/test_nemo_run.py index d651890b5fd3..8d7814bfe530 100644 --- a/tests/lightning/test_nemo_run.py +++ b/tests/lightning/test_nemo_run.py @@ -19,10 +19,6 @@ ("llama31_405b", "pretrain_recipe", "llama31_405b_pretrain"), ("mistral", "pretrain_recipe", "mistral_pretrain"), ("mistral", "finetune_recipe", "mistral_finetune"), - ("mixtral_8x3b", "pretrain_recipe", "mixtral_8x3b_pretrain"), - ("mixtral_8x3b", 
"finetune_recipe", "mixtral_8x3b_finetune"), - ("mixtral_8x3b_16k", "pretrain_recipe", "mixtral_8x3b_16k_pretrain"), - ("mixtral_8x3b_64k", "pretrain_recipe", "mixtral_8x3b_64k_pretrain"), ("mixtral_8x7b", "pretrain_recipe", "mixtral_8x7b_pretrain"), ("mixtral_8x7b", "finetune_recipe", "mixtral_8x7b_finetune"), ("mixtral_8x7b_16k", "pretrain_recipe", "mixtral_8x7b_16k_pretrain"), From 0aa267117ae5f1ff9f8d8308ed3d1ba1ce939f82 Mon Sep 17 00:00:00 2001 From: Youngeun Kwon Date: Mon, 7 Oct 2024 13:29:23 -0700 Subject: [PATCH 05/18] change the figure file name Signed-off-by: Youngeun Kwon --- .../{speedup_figure.png => cp_speedup_figure.png} | Bin .../source/performance/performance_long_sequence.md | 2 +- 2 files changed, 1 insertion(+), 1 deletion(-) rename docs/source/performance/{speedup_figure.png => cp_speedup_figure.png} (100%) diff --git a/docs/source/performance/speedup_figure.png b/docs/source/performance/cp_speedup_figure.png similarity index 100% rename from docs/source/performance/speedup_figure.png rename to docs/source/performance/cp_speedup_figure.png diff --git a/docs/source/performance/performance_long_sequence.md b/docs/source/performance/performance_long_sequence.md index c2816485b54d..d73392e6c78b 100644 --- a/docs/source/performance/performance_long_sequence.md +++ b/docs/source/performance/performance_long_sequence.md @@ -152,4 +152,4 @@ ### Speedup enabled by the CP -![Speedup Graph](speedup_figure.png) \ No newline at end of file +![Speedup Graph](cp_speedup_figure.png) \ No newline at end of file From d3e071217cac844fad0206578f967c7491cff6d5 Mon Sep 17 00:00:00 2001 From: Youngeun Kwon Date: Mon, 7 Oct 2024 13:51:27 -0700 Subject: [PATCH 06/18] Accommodating the reviewer's comment Signed-off-by: Youngeun Kwon --- docs/source/performance/performance_long_sequence.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/source/performance/performance_long_sequence.md b/docs/source/performance/performance_long_sequence.md index d73392e6c78b..77e7c9f46e1a 100644 --- a/docs/source/performance/performance_long_sequence.md +++ b/docs/source/performance/performance_long_sequence.md @@ -2,7 +2,7 @@ ## LLAMA2-7B (FP8) -- The results in the table below show the pre-training performance of the LLAMA2-7B model with-CP (context parallelism) and without-CP for various input sequence lengths at FP8 precision. Detailed configurations and the achievable performance are provided for the with-CP configurations. For the without-CP configurations, the best achievable performance is reported within the given memory capacity constraint. +- The table below shows the pre-training performance of the LLAMA2-7B with CP (context parallelism) and compares it against the results without CP at various input sequence lengths. The detailed model-parallel configurations and the achieved performance are shown in the training results with CP. In non-CP training runs, we use the most performant model- and data-parallel configurations without CP given the memory capacity constraint of the H100 GPU system. 
  - Container: [NeMo24.03.01.framework](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo/tags)
   - System: DGX-H100
 
@@ -151,5 +151,5 @@
 
 
 
-### Speedup enabled by the CP
+### Speedup of LLAMA2 7B training with CP over without CP
 ![Speedup Graph](cp_speedup_figure.png)
\ No newline at end of file

From ae18787b2b604e74630a05b853116f3439563e14 Mon Sep 17 00:00:00 2001
From: Youngeun Kwon 
Date: Mon, 7 Oct 2024 14:21:35 -0700
Subject: [PATCH 07/18] update the y-axis title

Signed-off-by: Youngeun Kwon 
---
 docs/source/performance/cp_speedup_figure.png   | Bin 20611 -> 20359 bytes
 .../performance/performance_long_sequence.md    |   6 +++---
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/source/performance/cp_speedup_figure.png b/docs/source/performance/cp_speedup_figure.png
index af73e6f5375b85f789d10cfa59d40aa2e5f104d2..ba4eab5d65a8208d55db0db37ccc9686e4e9b088 100644
GIT binary patch
[base85-encoded binary image data omitted: literal 20359 (new) / literal 20611 (old) for cp_speedup_figure.png]

diff --git a/docs/source/performance/performance_long_sequence.md b/docs/source/performance/performance_long_sequence.md
index 77e7c9f46e1a..b6e15236dfca 100644
--- a/docs/source/performance/performance_long_sequence.md
+++ b/docs/source/performance/performance_long_sequence.md
@@ -34,9 +34,9 @@
         SeqLen (K)
         # of GPUs
-        Without-CP
-        With-CP
-        Speedup with-CP/without-CP
+        Without CP
+        With CP
+        Speedup with CP/without CP
         TFLOPS / GPU
 
From 10701e951e6ec782412883dea26668bd64bb7e3e Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?oliver=20k=C3=B6nig?= Date: Tue, 8 Oct 2024 12:57:39 +0200 Subject: [PATCH 08/18] =?UTF-8?q?[=F0=9F=A4=A0]:=20Howdy=20folks,=20let's?= =?UTF-8?q?=20bump=20`Dockerfile.ci`=20to=203f90b98=20!=20(#10789)?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com> Signed-off-by: Youngeun Kwon --- Dockerfile.ci | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/Dockerfile.ci b/Dockerfile.ci index cf084d91982f..f6132bc6cc49 100644 --- a/Dockerfile.ci +++ b/Dockerfile.ci @@ -59,7 +59,7 @@ RUN pip install nemo_run@git+https://github.com/NVIDIA/NeMo-Run.git@${NEMO_RUN_T # Install NeMo requirements ARG TE_TAG=7d576ed25266a17a7b651f2c12e8498f67e0baea ARG MODELOPT_VERSION=0.17.0 -ARG MCORE_TAG=73e7b58e79df9da521ff31d74053579b7a060c7e +ARG MCORE_TAG=3f90b989c477ba9be5d6011866641eda9d91f588 ARG APEX_TAG=810ffae374a2b9cb4b5c5e28eaeca7d7998fca0c RUN \ From 9c4c13d847fded4685e7c125170845fb02b56e75 Mon Sep 17 00:00:00 2001 From: Shengliang Xu <106840466+shengliangxu@users.noreply.github.com> Date: Tue, 8 Oct 2024 05:06:59 -0700 Subject: [PATCH 09/18] Add ModelOpt transformer model pruning example for Llama models, default to llama3.1-8b-base (#10294) * Add ModelOpt transformer model pruning example for Llama3 model Signed-off-by: Shengliang Xu * Apply isort and black reformatting Signed-off-by: shengliangxu Signed-off-by: Shengliang Xu * examples code is at wrong dir, move them Signed-off-by: Shengliang Xu * changes as suggested in comment remove some logging and unused config code, update example model to llama3.1 Signed-off-by: Shengliang Xu * Add pruning of hidden_size into example Signed-off-by: Shengliang Xu * Apply isort and black reformatting Signed-off-by: shengliangxu Signed-off-by: Shengliang Xu * Update examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Add pruning test to cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Update cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Update cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Update cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Update cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Update cicd-main.yml Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> --------- Signed-off-by: Shengliang Xu Signed-off-by: shengliangxu Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> Co-authored-by: shengliangxu Co-authored-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> Signed-off-by: Youngeun Kwon --- .github/workflows/cicd-main.yml | 24 ++++ .../conf/megatron_gpt_prune.yaml | 41 ++++++ .../language_modeling/megatron_gpt_prune.py | 127 ++++++++++++++++++ 3 files changed, 192 insertions(+) create mode 100644 examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml create mode 100644 examples/nlp/language_modeling/megatron_gpt_prune.py diff --git a/.github/workflows/cicd-main.yml b/.github/workflows/cicd-main.yml index 96d54dbc8324..7aa6cdbfa00a 100644 --- a/.github/workflows/cicd-main.yml +++ b/.github/workflows/cicd-main.yml 
@@ -641,6 +641,29 @@ jobs: AFTER_SCRIPT: | rm -rf examples/nlp/megatron_llama_distill + L2_Prune_Width_Llama2: + needs: [cicd-test-container-setup] + uses: ./.github/workflows/_test_template.yml + if: contains(fromJSON(needs.cicd-test-container-setup.outputs.test_to_run), 'L2_Prune_Width_Llama2') || needs.cicd-test-container-setup.outputs.all == 'true' + with: + RUNNER: self-hosted-azure + SCRIPT: | + python examples/nlp/language_modeling/megatron_gpt_prune.py \ + trainer.devices=2 \ + trainer.num_nodes=1 \ + trainer.precision=bf16 \ + model.restore_from_path=/home/TestData/nlp/megatron_llama/llama_ci.nemo \ + model.tensor_model_parallel_size=1 \ + model.pipeline_model_parallel_size=2 \ + prune.num_calib_size=8 \ + prune.ffn_hidden_size=192 \ + prune.num_attention_heads=2 \ + prune.num_query_groups=2 \ + prune.hidden_size=null \ + export.save_path=examples/nlp/language_modeling/ci_prune_width.nemo + AFTER_SCRIPT: | + rm -rf examples/nlp/language_modeling/ci_prune_width.nemo + # L2: ASR dev run ASR_dev_run_Speech_to_Text: needs: [cicd-test-container-setup] @@ -5350,6 +5373,7 @@ jobs: - L2_Community_LLM_Checkpoints_tests_Llama3 - L2_PTQ_Llama2_Export_Only - L2_Distill_Llama2 + - L2_Prune_Width_Llama2 - L2_Speech_to_Text_AED - L2_Speech_Estimate_Duration_Bins - L2_Speech_Batch_Size_OOMptimizer diff --git a/examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml b/examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml new file mode 100644 index 000000000000..cb26d5744b5b --- /dev/null +++ b/examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml @@ -0,0 +1,41 @@ +inference: + greedy: false # Whether or not to use sampling ; use greedy decoding otherwise + top_k: 0 # The number of highest probability vocabulary tokens to keep for top-k-filtering. + top_p: 0.9 # If set to float < 1, only the most probable tokens with probabilities that add up to top_p or higher are kept for generation. + temperature: 1.0 # sampling temperature + add_BOS: true # add the bos token at the begining of the prompt + tokens_to_generate: 30 # The minimum length of the sequence to be generated. + all_probs: false # whether return the log prob for all the tokens in vocab + repetition_penalty: 1.2 # The parameter for repetition penalty. 1.0 means no penalty. + min_tokens_to_generate: 0 # The minimum length of the sequence to be generated. 
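+  # The settings in this inference block only configure the calibration forward passes
+  # run by megatron_gpt_prune.py (batch size, context length, sampling); no training or
+  # generation-quality tuning happens during pruning.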
+ compute_logprob: false # a flag used to compute logprob of all the input text, a very special case of running inference, default False + batch_size: 64 # batch size for inference + max_context_length: 512 # max length of the context, input sequence will be truncated if it is longer than this + +trainer: + devices: 1 + num_nodes: 1 + accelerator: gpu + logger: false # logger provided by exp_manager + precision: bf16 # 16, 32, or bf16 + enable_checkpointing: false + +model: + tensor_model_parallel_size: 1 # Pruning currently only supports tensor_model_parallel_size=1 + pipeline_model_parallel_size: 1 + restore_from_path: llama3.1-8b-base.nemo # Nemo file path + + ## Activation Checkpoint + activations_checkpoint_granularity: null # 'selective' or 'full' + activations_checkpoint_method: null # 'uniform', 'block', not used with 'selective' + +prune: + calib_dataset: cnn_dailymail # wikitext, cnn_dailymail, or a local dataset + num_calib_size: 512 # number of samples used for calibration + ffn_hidden_size: 3584 # ffn_hidden_size in the pruned model, ffn_hidden_size // 4 + num_attention_heads: 8 # num_attention_heads in the pruned model, num_attention_heads // 4 + num_query_groups: 4 # num_query_groups in the pruned model, num_query_groups // 2 + hidden_size: 2048 # hidden_size in the pruned model, hidden_size // 2 + +export: + save_path: llama3.1-8b-base-pruned.nemo # Path where the pruned model will be saved diff --git a/examples/nlp/language_modeling/megatron_gpt_prune.py b/examples/nlp/language_modeling/megatron_gpt_prune.py new file mode 100644 index 000000000000..b9bf8edbfb1a --- /dev/null +++ b/examples/nlp/language_modeling/megatron_gpt_prune.py @@ -0,0 +1,127 @@ +# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. + +import modelopt.torch.prune as mtp +import torch +import torch.multiprocessing as mp +from datasets import load_dataset +from omegaconf import OmegaConf +from pytorch_lightning.trainer.trainer import Trainer +from tqdm import tqdm + +from nemo.collections.nlp.models.language_modeling.megatron_gpt_model import MegatronGPTModel +from nemo.collections.nlp.parts.nlp_overrides import NLPDDPStrategy +from nemo.core.config import hydra_runner +from nemo.utils.model_utils import load_config + +mp.set_start_method("spawn", force=True) + +""" +Nemo pruning example script. + +Please consult examples/nlp/language_modeling/conf/megatron_gpt_prune.yaml config on available pruning arguments, +models supported as well as how to set up data and inference for calibration (with defaults recommended). 
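+
+Roughly, the mcore_gpt_minitron mode runs the calibration batches through the model to
+collect activation statistics, then drops the least important FFN channels, attention
+heads, query groups, and hidden dimensions until the sizes requested under `prune` are
+reached (axes left as null are not pruned).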
+ +Example usage: +``` +python examples/nlp/language_modeling/megatron_gpt_prune.py \ + model.restore_from_path=llama3.1-8b-base.nemo \ + model.tensor_model_parallel_size=1 \ + model.pipeline_model_parallel_size=8 \ + trainer.num_nodes=1 \ + trainer.precision=bf16 \ + trainer.devices=8 \ + prune.ffn_hidden_size=3584 \ + prune.num_attention_heads=8 \ + prune.num_query_groups=4 \ + prune.hidden_size=2048 \ + export.save_path=llama3.1-8b-base-pruned.nemo +``` +where tensor_model_parallel_size must be 1 because of the current prune API limitation +""" + + +def get_calib_data_iter(data="cnn_dailymail", batch_size=64, calib_size=512, max_sequence_length=512): + if data == "wikitext": + dataset = load_dataset("wikitext", "wikitext-103-v1", split="train") + text_column = "text" + elif data == "cnn_dailymail": + dataset = load_dataset("cnn_dailymail", name="3.0.0", split="train") + text_column = "article" + else: + # Assume a local JSON dataset with a column named "text" + dataset = load_dataset("json", data_files=data, split="train") + text_column = "text" + calib_size = max(min(len(dataset), calib_size), batch_size) + for i in range(calib_size // batch_size): + batch = dataset[i * batch_size : (i + 1) * batch_size][text_column] + for j in range(len(batch)): + batch[j] = batch[j][:max_sequence_length] + yield batch + + +@hydra_runner(config_path="conf", config_name="megatron_gpt_prune") +def main(cfg) -> None: + if not torch.cuda.is_available(): + raise EnvironmentError("GPU is required for the pruning.") + + # Overwrite model config with the one from the model checkpoint and apply pruning modifications + model_cfg = load_config(cfg.model.restore_from_path) + model_cfg.update(cfg.model) + model_cfg.name = "modelopt" # Use modelopt transformer spec for pruning + + assert cfg.model.tensor_model_parallel_size == 1, "Pruning currently only supports tensor_model_parallel_size=1" + assert ( + not hasattr(cfg.model, "sequence_parallel") or not cfg.model.sequence_parallel + ), "Pruning currently does not support sequence parallelism" + + trainer = Trainer(strategy=NLPDDPStrategy(), **cfg.trainer) + model = MegatronGPTModel.restore_from( + restore_path=cfg.model.restore_from_path, override_config_path=model_cfg, trainer=trainer + ) + + data_iter = get_calib_data_iter( + cfg.prune.calib_dataset, + cfg.inference.batch_size, + cfg.prune.num_calib_size, + cfg.inference.max_context_length, + ) + dataloader = [data for data in data_iter] + + def forward_loop(model): + # NOTE: Alternatively you can also use `model.forward_bwd_step(data_iter, forward_only=True)` + # if your model is setup for training. 
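+        # ModelOpt calls this function with the wrapped model so the calibration
+        # batches defined above are pushed through the network before it decides
+        # which channels and heads to drop.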
+ model.set_inference_config(OmegaConf.to_container(cfg.inference)) + for i, batch in enumerate(tqdm(dataloader, desc="Calibrating")): + model.predict_step(batch, i) + + model_pruned, _ = mtp.prune( + model, + mode="mcore_gpt_minitron", + constraints={ + "export_config": { + k: cfg.prune.get(k) + for k in ["ffn_hidden_size", "num_attention_heads", "num_query_groups", "hidden_size"] + if cfg.prune.get(k) is not None + }, + }, + dummy_input=None, # Not used + config={"forward_loop": forward_loop}, + ) + + model_pruned.save_to(cfg.export.save_path) + + +if __name__ == '__main__': + main() From 0264eb2689b76c0a3e64dacfe720570856d98cbb Mon Sep 17 00:00:00 2001 From: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com> Date: Tue, 8 Oct 2024 18:12:04 +0300 Subject: [PATCH 10/18] Update mamba.rst after dist ckpt addition (#10800) Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com> Signed-off-by: Youngeun Kwon --- tutorials/llm/mamba/mamba.rst | 21 --------------------- 1 file changed, 21 deletions(-) diff --git a/tutorials/llm/mamba/mamba.rst b/tutorials/llm/mamba/mamba.rst index 2704c15aa05b..197825c27d58 100644 --- a/tutorials/llm/mamba/mamba.rst +++ b/tutorials/llm/mamba/mamba.rst @@ -80,27 +80,6 @@ Convert the Pytorch Checkpoint to a NeMo Checkpoint * Note: the ``mamba_ssm_ngroups`` parameter should be 1 for the Mamba2 models from the `Transformers are SSMs paper `__ (130m, 370m, 780m, 1.3b, and 2.7b) and 8 for the Mamba2 and Mamba2-Hybrid models by `NVIDIA `__ (both 8b). -Model (Tensor) Parallelism for the 8b Models -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ - -* Note: Distributed checkpointing for the Mamba2 and Mamba2-Hybrid models will be implemented in the near future. For now, you should use the method below for converting to Tensor Parallel (TP) of different sizes. - -The HuggingFace checkpoint for the 8b model is for TP of size 1, and so is the ``.nemo`` checkpoint obtained for the previous step. To shard the model weights for a larger TP size, use the script from