Skip to content

Latest commit

 

History

History
139 lines (91 loc) · 4.76 KB

Learnings.md

File metadata and controls

139 lines (91 loc) · 4.76 KB

♦️ #21DaysToData Learnings

  1. Day 1

    • It's all about the introductions and intro to the challenge
    • A basic data test
  2. Day 2

    • Today's learnings were on data lifecycle, data vocabulary, types of data, types of analytics.
    • Data lifecycle: It ranges from data collection to deployment. Data collection, data storage, data cleaning, data exploration, data visualisation, data modelling, data communication and deployment.
    • Data Vocabulary: It discussed different words related to the data world. Data Analysis, Data Science, Data Engineering, Machine Learning, Deep Learning, AI, Big Data, Cloud Computing.

    Types of data: Qualitative (Nominal, Ordinal) and Quantitative (Discrete, Continuous)
    Types of analytics: Descriptive, Diagnostic, Predictive, Perspective.

     1. Descriptive: What happened?
     2. Diagnostic: Why did it happen?
     3. Predictive: What will happen? What might happen?
     4. Perspective: What can be done to achieve? Further steps
    
  3. Day 3

    • Today its all about the different data careers available. Data Analyst, Data Engineer and Data Scientist.
    • Path to transition to these careers.
  4. Day 4

    • It's a discussion on data cleaning. There are six types of dirty data.

      • Misentries - > Spellings, typos, inconsistent data logging
      • Duplicates -> No row is repeated
      • Missing Data -> NaN's empty data
      • Combining Data Sources -> Combining two or more sources of data
      • Unwanted outliers -> Deciding upon what to do with the outliers
      • Quality check -> Data Quality Check
    • Open Refine is the tool which I got to know for data cleaning.

    • All the data cleaning steps can be found here

  5. Day 5

    • Descriptive Statistics gives insights about the data. Different columns have been analysed.
    • In general, with respect to descriptive statistics following are depicted
      • Aggregate / Summary statistics
      • Proportions
      • Counts
      • Min/Max
      • Distributions
    • Different factors with respect to number of crimes, gender and race have been analysed.
  6. Day 6

    • It's a discussion on Data Visualization, importance of data visualization
    • Different types of charts
    • Different data viz tools
  7. Day 7

    • Graph Representation
    • Flourish
    • Pivot Table creation
    • Racing bar chart with number of crimes
  8. Day 8

    • Tableau Introduction
  9. Day 9

    • Pie Chart
    • Dashboard.
    • Here's the link to dashboard. It keeps on updating as I proceed further in the challenge.
    • Difference between a bad pie chart and a good pie chart.
  10. Day 10

    • Data Exploration and visualization using the data.
  11. Day 11

    • Today's learning is about creating a map using Tableau with longitude and latitude data
    • Along with that creating a dashboard in Tableau.
    • Here's the link to dashboard. It keeps on updating as I proceed further in the challenge.
  12. Day 12

    • Today's learning is on Presenting and Delivering insights.
    • Different types of delivering presentations - Tableau file, report, export, stand alone graphs, ppt, demo, video
  13. Day 13

    • Kind of a holiday to work with pending tasks
  14. Day 14

    • Introduction to SQL
    • Discussed on all the basic concepts of SQL (LIMIT, IN, Aggregate functions, Joins, group by, Order by)
    • Link to website that convert CSV data to SQL table. Link
  15. Day 15

    • Data Exploration using SQL queries
  16. Day 16

    • Introduction to Python
    • Why Python for Data Science, Advantages of Python
    • Important Libraries (Pandas, Numpy, matplotlib and Seaborn)
  17. Day 17

    • Today's discussion was on Data Wrangling
    • Different functions of Pandas (describe, info, loc, iloc)
  18. Day 18

    • Data visualization using Python
    • Different Libraries (matplotlib, seaborn, plotly, bokeh, dash, seaborn, altair, folium)
  19. Day 19

    • Introduction to Machine Learning
    • Types of ML (Supervised, Unsupervised)
    • Grouping, Quantification, Forecasting way of addressing ML problems.
    • Brief intro to Linear Regression
  20. Day 20

    • Today's discussion is on Classification or Grouping
    • Classification on Iris data
  21. Day 21

    • Wrap up and complete overview
    • Data Quiz
    • Feedback