Introduction¶
Data science blueprint is a nothing more than a data science project structure proposition. Its purpose is very simple :
- help you build a robust project
- help you gain some confidence regarding the reliability of the code
- help you meet the requirements needed to deploy in a production environment
- Bring data scientists and machine learning engineers to work together around a common code base
To reach this goal, the blueprint proposes some interesting features that will be covered in this documentation :
- a personalized backbone for your data science project, thanks to cookiecutter
- a dockerized environment that you can use to work with notebooks
- a code quality focus, with the set of tools that will help you profiling and testing your code
- a set of tools to let you use your project as an app that you can deploy on a production server, or on a remote python
repository like Pypi
The Data Science blueprint works with Python projects, with fully packaged code and also Jupyter notebooks. It is particularly adapted to data science projects.