Skip to content

verri/dsp-book

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Project: An Inductive Learning Approach

DOI

This repository contains the source of Data Science Project: An inductive Learning Approach. The book is built using XeLaTeX.

The book is a work in progress, and the current version is v0.1.0.

Citing the book

@misc{verri2024datascienceproject,
  author       = {Verri, Filipe Alves Neto},
  title        = {Data Science Project: An Inductive Learning Approach},
  year         = 2024,
  publisher    = {Leanpub},
  version      = {v0.1.0},
  doi          = {10.5281/zenodo.14498011},
  url          = {https://leanpub.com/dsp}
}

Downloading the book

You can download the book from the Leanpub website.

Building the book

make ready

It has been tested in:

  • Ubuntu 22.04
  • TeX Live 2021
  • Latexmk 4.76

Abstract

"Data Science Project: An Inductive Learning Approach" provides a comprehensive methodology for data science project development, emphasizing software engineering principles essential for reliable solutions.

Prof. Dr. Filipe Verri, a senior data science project manager and professor at the Aeronautics Institute of Technology (ITA), guides readers through the origins, scope, and key concepts of data science.

This book covers machine learning, data handling, and rigorous validation techniques, all essential for preparing readers to tackle complex, real-world projects.

Contributions from Prof. Dr. Johnny Marques, also professor at ITA and an expert in critical software development, bring an industry-tested perspective to the software aspects, making this an essential guide for aspiring data scientists, researchers and seasoned professionals alike.

Table of contents

  1. A brief history of data science
  2. Fundamental concepts
  3. Data science project
  4. Structured data
  5. Data handling
  6. Learning from data
  7. Data preprocessing
  8. Solution validation

An appendix provides a brief introduction to the mathematical foundations of data science, including algorithms, set theory, linear algebra, and probability.

About the author

Prof. Dr. Filipe Verri is a senior data science project manager and affiliate professor at the Aeronautics Institute of Technology (ITA) and Federal University of São Paulo (UNIFESP). He has a Ph.D. in computer science and computational mathematics from the University of São Paulo (USP).

License

This project is licensed under the Creative Commons Attribution-NonCommercial NoDerivatives 4.0 International License.

If you want to translate this book, please contact the author.

Code of conduct

If you want to contribute to this project, please read the code of conduct and join the discussion forum.

Significant contributions will be acknowledged in the book.