What does it take to be a data scientist
A data scientist should be:
- naturally curious
- storytellers
- able to be self-taught
Skills needed for a data scientist (in order)
- Passionate about some field
- DS for the sake of DS is generally not helpful - you need something to apply it to
- Pi-Model
- Many of the team leads are PhD’s (Often in fields other than Data Science)
- Social skills
- Sense of humor
- Math knowledge
- algebra, geometry, calculus, basic statistics for DA
- more statistics and linear algebra for DS
- Technical skills
- computational thinking
- some skills can be learned later, but there should at least be a background
CRISP-DM: Cross-Industry Standard Process for Data Mining
- Business Understanding
- Data Understanding
- Data Preparation
- Data Integration and Transformation
- ETL (Extract, Transform, and Load)
- Data Visualization
- Modeling (MATH 3280 and 3480)
- Evaluation (MATH 3280 and 3480)
- Deployment
- Model Monitoring and Assessment
Tools
- Data Asset Management
- Code Asset Management (e.g. GitHub)
- Development Environments (or IDEs)
- Execution Environments