Skip to content

Commit

Permalink
Deployed da8f26e with MkDocs version: 1.5.2
Browse files Browse the repository at this point in the history
  • Loading branch information
macielcalebe committed Aug 26, 2024
1 parent 19f42b8 commit ebb9b72
Show file tree
Hide file tree
Showing 9 changed files with 27 additions and 27 deletions.
4 changes: 2 additions & 2 deletions classes/01-intro/aps01_part_1/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -569,7 +569,7 @@ <h2 id="task-01-opening">Task 01: Opening</h2>
</div>
</form>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/01-intro/aps01_part_1/long_1" id="long_1_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/01-intro/aps01_part_1/long_1" id="long_1_0">
<p class="admonition-title">Question 2<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Explain, in general terms, what is the model predicting?</p>
Expand All @@ -588,7 +588,7 @@ <h2 id="task-01-opening">Task 01: Opening</h2>
</ul>
</div>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/01-intro/aps01_part_1/long_2" id="long_2_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/01-intro/aps01_part_1/long_2" id="long_2_0">
<p class="admonition-title">Question 3<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Considering the <code>everything.ipynb</code> notebook, what is the <strong>target variable</strong> used in training the model?</p>
Expand Down
2 changes: 1 addition & 1 deletion classes/03-batch/aps02_sql/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -570,7 +570,7 @@ <h2 id="task-1-query-file-for-analytical-table">TASK 1: Query file for analytica
<p class="admonition-title">For now...</p>
<p>For now, keep saving and reading the model in the <code>models</code> folder!</p>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/aps02_sql/long_1" id="long_1_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/aps02_sql/long_1" id="long_1_0">
<p class="admonition-title">Question 5<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Why should we avoid using <code>*</code> in production queries?</p>
Expand Down
6 changes: 3 additions & 3 deletions classes/03-batch/data_formats/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -473,7 +473,7 @@ <h2 id="csv">CSV</h2>
<p><strong>Lack of Data Compression</strong>: CSV files do not provide built-in data compression. As a result, they can occupy a lot of disk space.</p>
</li>
</ul>
<div class="admonition exercise text short tag-text-exercise tag-short-text editable" data-slug="classes/03-batch/data_formats/short_1" id="short_1_0">
<div class="admonition exercise text short tag-short-text tag-text-exercise editable" data-slug="classes/03-batch/data_formats/short_1" id="short_1_0">
<p class="admonition-title">Question 1<button class="editable-button"></button></p>
<form class="exercise-form">
<p>In ML, where large datasets are common, give some examples where increased storage requirements can become a concern.</p>
Expand Down Expand Up @@ -679,7 +679,7 @@ <h2 id="exercises">Exercises</h2>
</code></pre></div>
</div>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/data_formats/long_2" id="long_2_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/data_formats/long_2" id="long_2_0">
<p class="admonition-title">Question 8<button class="editable-button"></button></p>
<form class="exercise-form">
<p>What are your conclusions?</p>
Expand All @@ -696,7 +696,7 @@ <h2 id="exercises">Exercises</h2>
<p>For example, when performing <strong>column and row filters</strong>, <strong>pandas</strong> generally needs to <strong>read the entire file</strong> into memory first before it can apply the filter.</p>
<p>On the other hand, <strong>Apache Spark</strong> does not need to read the entire <code>parquet</code> file to filter by a column, thanks to its ability to perform predicate pushdown and the efficient structure of the <code>parquet</code> format.</p>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/data_formats/long_3" id="long_3_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/data_formats/long_3" id="long_3_0">
<p class="admonition-title">Question 9<button class="editable-button"></button></p>
<form class="exercise-form">
<p>What <strong>predicate pushdown</strong> means?!</p>
Expand Down
10 changes: 5 additions & 5 deletions classes/03-batch/intro/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -482,7 +482,7 @@ <h2 id="what-is-batch-prediction">What is batch prediction?</h2>
<h2 id="doing-batch-prediction">Doing batch prediction</h2>
<p>In the first class, when we saw how to standardize a data science project, there were specific folders for data storage, notebooks and source code.</p>
<p><img alt="" src="../ds_project_data_folder.png" /></p>
<div class="admonition exercise short tag-text-exercise tag-short-text editable" data-slug="classes/03-batch/intro/short_1" id="short_1_0">
<div class="admonition exercise short tag-short-text tag-text-exercise editable" data-slug="classes/03-batch/intro/short_1" id="short_1_0">
<p class="admonition-title">Question 2<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Would the <code>data.csv</code> file be used during training? How would you deal with data?</p>
Expand All @@ -497,7 +497,7 @@ <h2 id="doing-batch-prediction">Doing batch prediction</h2>
<p>If the model doensn't exists, we need data to train it! So, probably yes!</p>
</div>
</div>
<div class="admonition exercise short tag-text-exercise tag-short-text editable" data-slug="classes/03-batch/intro/short_2" id="short_2_0">
<div class="admonition exercise short tag-short-text tag-text-exercise editable" data-slug="classes/03-batch/intro/short_2" id="short_2_0">
<p class="admonition-title">Question 3<button class="editable-button"></button></p>
<form class="exercise-form">
<p>During the phase of model construction, would this <code>data.csv</code> file change? Explain it.</p>
Expand All @@ -513,7 +513,7 @@ <h2 id="doing-batch-prediction">Doing batch prediction</h2>
<p>Assuming enough data, it would be transformed (feature engineering, feature selection) but would remain the same during training in the phase of model construction.</p>
</div>
</div>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/intro/long_3" id="long_3_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/intro/long_3" id="long_3_0">
<p class="admonition-title">Question 4<button class="editable-button"></button></p>
<form class="exercise-form">
<p>After model deployment:</p>
Expand All @@ -535,7 +535,7 @@ <h2 id="doing-batch-prediction">Doing batch prediction</h2>
<p>We'll deal with retraining in the next classes, but it's important to start thinking about it!</p>
</div>
</div>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/intro/long_4" id="long_4_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/intro/long_4" id="long_4_0">
<p class="admonition-title">Question 5<button class="editable-button"></button></p>
<form class="exercise-form">
<p>What about when we deploy the model and need to predict with it?</p>
Expand All @@ -552,7 +552,7 @@ <h2 id="doing-batch-prediction">Doing batch prediction</h2>
<p>Absolutely not! In the first class we did this for simplicity. We would like to make predictions on new data.</p>
</div>
</div>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/intro/long_5" id="long_5_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/intro/long_5" id="long_5_0">
<p class="admonition-title">Question 6<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Still on prediction in new data: assuming that the model <strong>reads a batch of data</strong> from a file called <code>predict.csv</code>, would this file remain the same (have the same records) every time the predict script is called?</p>
Expand Down
4 changes: 2 additions & 2 deletions classes/03-batch/practicing/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -979,7 +979,7 @@ <h2 id="task-5-making-predictions">Task 5: Making predictions</h2>
<p>Done! Whenever there is a new file that must be predicted, just call the <code>predict.py</code> script, informing which model to be used and the path of the file with the data! Then, the model will read this batch of information and perform the prediction.</p>
<h2 id="extra-questions">Extra questions!</h2>
<h3 id="categorical-variables">Categorical variables</h3>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/practicing/long_1" id="long_1_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/practicing/long_1" id="long_1_0">
<p class="admonition-title">Question 7<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Are there categorical variables in the training data? If yes, which ones?</p>
Expand Down Expand Up @@ -1017,7 +1017,7 @@ <h3 id="oot-validation">OOT validation</h3>
<p class="admonition-title">Tip! 1</p>
<p>Once you decide the model is good enough, you can retrain with the whole base (January to July) and deploy this new version of the model!</p>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/03-batch/practicing/long_2" id="long_2_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/03-batch/practicing/long_2" id="long_2_0">
<p class="admonition-title">Question 9<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Explain when is it a good idea to use OOT and what would be the consequences if you don't!</p>
Expand Down
4 changes: 2 additions & 2 deletions classes/05-docker/s3/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -617,7 +617,7 @@ <h3 id="submit-file">Submit file</h3>
</div>
<h2 id="refactoring">Refactoring</h2>
<p>In the previous class (SQL) we made a version of the project that uses a PostgreSQL server as the data source.</p>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/05-docker/s3/long_1" id="long_1_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/05-docker/s3/long_1" id="long_1_0">
<p class="admonition-title">Question 7<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Do you consider that S3 could serve as a data source for model training/predict? Explain.</p>
Expand All @@ -634,7 +634,7 @@ <h2 id="refactoring">Refactoring</h2>
</div>
</div>
<p>But the model pickles were always only stored locally in the <code>models</code> folder.</p>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/05-docker/s3/long_2" id="long_2_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/05-docker/s3/long_2" id="long_2_0">
<p class="admonition-title">Question 8<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Do you consider that S3 could serve as storage for model pickles? Explain.</p>
Expand Down
18 changes: 9 additions & 9 deletions classes/06-message-broker/intro/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -414,19 +414,19 @@ <h2 id="message-broker_1">Message Broker</h2>
<p><img alt="" src="../example_delivery.png" /></p>
<p>Suppose that, for some deliveries, he recommends sending a message explaining the situation (reason for the delay). This sending (Telegram, Email, WhatsApp) will not be done by the ML model and will require calling an extra application.</p>
<p>An alternative is to add the model's predict result to a queue, so that the sending application (Telegram, Email, WhatsApp) handles the messages according to their availability. The application responsible for managing this queue is the <strong>message broker</strong>.</p>
<div class="admonition exercise short tag-text-exercise tag-short-text editable" data-slug="classes/06-message-broker/intro/short_1" id="short_1_0">
<div class="admonition exercise short tag-short-text tag-text-exercise editable" data-slug="classes/06-message-broker/intro/short_1" id="short_1_0">
<p class="admonition-title">Question 1<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Could the message broker be used to create a queue of JSON from customers for a model to process? For example:
<div class="language-JSON highlight"><pre><span></span><code><a id="__codelineno-0-1" name="__codelineno-0-1" href="#__codelineno-0-1"></a><span class="p">{</span>
<a id="__codelineno-0-2" name="__codelineno-0-2" href="#__codelineno-0-2"></a><span class="nt">&quot;age&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">42</span><span class="p">,</span>
<a id="__codelineno-0-3" name="__codelineno-0-3" href="#__codelineno-0-3"></a><span class="nt">&quot;job&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;entrepreneur&quot;</span><span class="p">,</span>
<a id="__codelineno-0-4" name="__codelineno-0-4" href="#__codelineno-0-4"></a><span class="nt">&quot;marital&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;married&quot;</span><span class="p">,</span>
<a id="__codelineno-0-5" name="__codelineno-0-5" href="#__codelineno-0-5"></a><span class="nt">&quot;education&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;primary&quot;</span><span class="p">,</span>
<a id="__codelineno-0-6" name="__codelineno-0-6" href="#__codelineno-0-6"></a><span class="nt">&quot;balance&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">558</span><span class="p">,</span>
<a id="__codelineno-0-7" name="__codelineno-0-7" href="#__codelineno-0-7"></a><span class="nt">&quot;housing&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;yes&quot;</span><span class="p">,</span>
<a id="__codelineno-0-8" name="__codelineno-0-8" href="#__codelineno-0-8"></a><span class="nt">&quot;duration&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">186</span><span class="p">,</span>
<a id="__codelineno-0-9" name="__codelineno-0-9" href="#__codelineno-0-9"></a><span class="nt">&quot;campaign&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">2</span>
<a id="__codelineno-0-2" name="__codelineno-0-2" href="#__codelineno-0-2"></a><span class="w"> </span><span class="nt">&quot;age&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">42</span><span class="p">,</span>
<a id="__codelineno-0-3" name="__codelineno-0-3" href="#__codelineno-0-3"></a><span class="w"> </span><span class="nt">&quot;job&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;entrepreneur&quot;</span><span class="p">,</span>
<a id="__codelineno-0-4" name="__codelineno-0-4" href="#__codelineno-0-4"></a><span class="w"> </span><span class="nt">&quot;marital&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;married&quot;</span><span class="p">,</span>
<a id="__codelineno-0-5" name="__codelineno-0-5" href="#__codelineno-0-5"></a><span class="w"> </span><span class="nt">&quot;education&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;primary&quot;</span><span class="p">,</span>
<a id="__codelineno-0-6" name="__codelineno-0-6" href="#__codelineno-0-6"></a><span class="w"> </span><span class="nt">&quot;balance&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">558</span><span class="p">,</span>
<a id="__codelineno-0-7" name="__codelineno-0-7" href="#__codelineno-0-7"></a><span class="w"> </span><span class="nt">&quot;housing&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;yes&quot;</span><span class="p">,</span>
<a id="__codelineno-0-8" name="__codelineno-0-8" href="#__codelineno-0-8"></a><span class="w"> </span><span class="nt">&quot;duration&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">186</span><span class="p">,</span>
<a id="__codelineno-0-9" name="__codelineno-0-9" href="#__codelineno-0-9"></a><span class="w"> </span><span class="nt">&quot;campaign&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">2</span>
<a id="__codelineno-0-10" name="__codelineno-0-10" href="#__codelineno-0-10"></a><span class="p">}</span>
</code></pre></div></p>
<div class="form-elements">
Expand Down
6 changes: 3 additions & 3 deletions classes/06-message-broker/rabbitmq/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -469,7 +469,7 @@ <h4 id="create-docker-composeyml">Create <code>docker-compose.yml</code></h4>
<p>Docker Compose allows defining and running multiple Docker containers using a Compose file that describes the whole application. It makes it easy to collaborate on multi-container apps.</p>
<p>For now our application will only contain RabbitMQ, but it costs nothing to practice!</p>
</div>
<div class="admonition exercise text long tag-text-exercise tag-long-text editable" data-slug="classes/06-message-broker/rabbitmq/long_1" id="long_1_0">
<div class="admonition exercise text long tag-long-text tag-text-exercise editable" data-slug="classes/06-message-broker/rabbitmq/long_1" id="long_1_0">
<p class="admonition-title">Question 2<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Explain what this section means:</p>
Expand Down Expand Up @@ -738,7 +738,7 @@ <h3 id="consume-messages">Consume messages</h3>
</div>
</form>
</div>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/06-message-broker/rabbitmq/long_2" id="long_2_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/06-message-broker/rabbitmq/long_2" id="long_2_0">
<p class="admonition-title">Question 14<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Change the <code>callback</code> function to:
Expand All @@ -765,7 +765,7 @@ <h3 id="consume-messages">Consume messages</h3>
<p>Due to <code>auto_ack=False</code>, if the consumer dies before notifying that the message was correctly processed, then the message is automatically redistributed to other consumers (or waits in the queue if there are no free consumers).</p>
</div>
</div>
<div class="admonition exercise long tag-text-exercise tag-long-text editable" data-slug="classes/06-message-broker/rabbitmq/long_3" id="long_3_0">
<div class="admonition exercise long tag-long-text tag-text-exercise editable" data-slug="classes/06-message-broker/rabbitmq/long_3" id="long_3_0">
<p class="admonition-title">Question 15<button class="editable-button"></button></p>
<form class="exercise-form">
<p>Can you think of some situations where the use of <code>auto_ack=True</code> is indicated?</p>
Expand Down
Binary file modified sitemap.xml.gz
Binary file not shown.

0 comments on commit ebb9b72

Please sign in to comment.