Track Awesome Learn Datascience Updates Daily
📈 Curated list of resources to help you get started with Data Science
😺 siboehm/awesome-learn-datascience · ⭐ 637 · 🏷️ Computer Science
Jun 08, 2024
What is Data Science?
- Data Science Process: A Beginner’s Comprehensive Guide - Technical Skills for Data Science: emphasizes the practical skills needed throughout the data science process.
Jul 22, 2019
Data Science using Python / Various other helpful tools and resources
- spaCy - Open-source toolkit for working with text-based data.
May 13, 2019
What is Data Science?
- Data Science for Business (Book) - An introduction to Data Science and its use as a business asset.
Apr 30, 2019
Data Science using Python / General
- O'Reilly Data Science from Scratch (Book) - Data processing, implementation, and visualization with example code.
- Coursera Applied Data Science - Online Course using Python that covers most of the relevant toolkits.
Sep 25, 2017
What is Data Science?
- Explanation of important vocabulary - Distinguishes between Big Data, Machine Learning, and Data Science.
Common Algorithms and Procedures
- Supervised vs unsupervised learning - The two most common types of Machine Learning algorithms.
- Cross validation - Evaluate the performance of your algorithm/model.
- Feature engineering - Modifying the data to improve model predictions.
- Model ensemble: Explanation - Combine multiple models into one for better performance.
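As a rough illustration of the cross validation entry above, here is a minimal sketch in plain Python. It is not taken from any of the linked resources. The helper names (`k_fold_indices`, `cross_validate`, `fit_mean`, `predict_mean`) and the toy data are hypothetical, and the "model" is just a baseline that predicts the training mean. In practice a library such as scikit-learn would usually handle the splitting.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle range(n_samples) and split it into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(xs, ys, fit, predict, k=5, seed=0):
    """Return one mean-absolute-error score per held-out fold."""
    folds = k_fold_indices(len(xs), k, seed)
    scores = []
    for held_out in folds:
        held_out_set = set(held_out)
        # Train on everything except the held-out fold.
        train = [i for i in range(len(xs)) if i not in held_out_set]
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        # Score only on samples the model has not seen.
        errors = [abs(predict(model, xs[i]) - ys[i]) for i in held_out]
        scores.append(mean(errors))
    return scores


# Hypothetical baseline model: always predict the training-set mean.
def fit_mean(train_x, train_y):
    return mean(train_y)


def predict_mean(model, x):
    return model


# Toy data (made up for illustration): y = 2x + 1.
xs = list(range(20))
ys = [2 * x + 1 for x in xs]
scores = cross_validate(xs, ys, fit_mean, predict_mean, k=5)
print(len(scores), "fold scores, average MAE:", mean(scores))
```

Each sample is held out exactly once, so the averaged score reflects performance on data the model did not see during fitting. Swapping in a different `fit`/`predict` pair lets the same loop compare models.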
Data Science using Python / Learning Python
Data Science using Python / pandas
- DataCamp pandas foundations - Paid course, but 30 free days upon account creation (enough to complete the course).
- Pandas cheatsheet (⭐42k) - Quick overview of the most important functions.
Data Science using Python / Jupyter Notebook
- Seaborn data visualization tutorial - Plot library that works great with Jupyter.
Data Science using Python / Various other helpful tools and resources
- Anaconda Python distribution - Contains most of the important Python packages for Data Science.
- LightGBM gradient boosting framework (⭐16k) - Successfully used in many Kaggle challenges.
- Amazon AWS - Rent cloud servers for more time-consuming calculations (r4.xlarge server is a good place to start).
Data Science Challenges for Beginners / Various other helpful tools and resources
- Walkthrough: House prices challenge - Walks through a simple challenge on house prices.
- Blood Donation Challenge - Predict if a donor will donate again.
- Titanic Challenge - Predict survival on the Titanic.
- Water Pump Challenge - Predict the operating condition of water pumps in Africa.
Aug 06, 2017
What is Data Science?
Common Algorithms and Procedures
Data Science using Python / scikit-learn
Jul 15, 2017
Data Science using Python / numpy
Jul 12, 2017
Common Algorithms and Procedures
Data Science using Python / Learning Python
Data Science using Python / pandas
Data Science using Python / scikit-learn
Data Science using Python / Jupyter Notebook
Data Science using Python / Various other helpful tools and resources
More advanced resources and lists / Various other helpful tools and resources