-
Notifications
You must be signed in to change notification settings - Fork 1.1k
/
README.html
18 lines (18 loc) · 2.18 KB
/
README.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
<h1 id="example-code-and-data-for-practical-data-science-with-r-by-nina-zumel-and-john-mount-manning-2014.">Example code and data for "Practical Data Science with R" by Nina Zumel and John Mount, Manning 2014.</h1>
<ul>
<li>The book: <a href="http://www.manning.com/zumel/">"Practical Data Science with R" by Nina Zumel and John Mount, Manning 2014</a></li>
<li>The support site: <a href="https://github.com/WinVector/zmPDSwR">GitHub WinVector/zmPDSwR</a></li>
</ul>
<h2 id="the-code-and-data-in-this-directory-supports-examples-from">The code and data in this directory supports examples from:</h2>
<ul>
<li>Chapter 5: Choosing and Evaluating Models</li>
<li>Chapter 6: Using Memorization Methods</li>
</ul>
<p>A workspace containing most of the results has been saved as KDD2009.Rdata and can be loaded in R with the command:</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"><span class="kw">load</span>(<span class="st">'KDD2009.Rdata'</span>)</code></pre></div>
<p>(note you will have to re-load various libraries like ROCR to perform some of the steps).</p>
<p>6-2-2013 Data from: http://www.sigkdd.org/kdd-cup-2009-customer-relationship-prediction Downloaded: $ shasum * e43a38e3477e38b354943519954b719ec7623c2f orange_small_train.data.zip 8274d23235630717659898900b7f74092ff339ad orange_small_train_appetency.labels.txt ec2de79844657fb892ec9047e6304c12b296ff68 orange_small_train_churn.labels.txt 4cd2d7c9b20fd3638883a91a2fed6a03a4d5d015 orange_small_train_upselling.labels.txt Data to support examples in the chapter on memorization methods in "Practical Data Science with R" ( http://www.manning.com/zumel/ ).</p>
<p>Load data:</p>
<div class="sourceCode"><pre class="sourceCode bash"><code class="sourceCode bash"> <span class="kw">unzip</span> orange_small_train.data.zip
<span class="kw">gzip</span> -9 orange_small_train.data</code></pre></div>
<p>See <a href="KDDmodels.Rmd" class="uri">KDDmodels.Rmd</a> for examples and details and <a href="KDD2009vtreat.Rmd" class="uri">KDD2009vtreat.Rmd</a> for a newer <a href="http://www.win-vector.com/blog/2014/08/vtreat-designing-a-package-for-variable-treatment/">vtreat</a> based demonstration.</p>