Skip to content

Commit

Permalink
push new website
Browse files Browse the repository at this point in the history
  • Loading branch information
janismdhanbad committed Jul 28, 2023
1 parent 8da71db commit 65d8a20
Show file tree
Hide file tree
Showing 44 changed files with 3,273 additions and 0 deletions.
Binary file added blogs_md/.DS_Store
Binary file not shown.
71 changes: 71 additions & 0 deletions blogs_md/things_miss.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,71 @@
# UNDERSTANDING THE DATA : Things you might miss!!
Data exploration, the right way

Usually we don’t focus on the data exploration part as a beginner, however, understanding the data just by exploration and running statistical tests on it could provide great insights about it.

The following article deals with the measures of statistical analysis for univarite( analysis on one variable) and bivariate(analysis between two variable). You can easily find codes for these in the language you use for doing exploration.

## UNIVARIATE ANALYSIS :

There are two kind of variables you will come across with, that are :

- Categorical( the ones having discrete values, eg. outcome of a rolling dice, variables showing yes/no type attribute etc.)

- Continuous( the ones taking continuous values, eg. age of a person, weight of the vegetables etc.)

Categorical : You could simply use frequency tables or Bar plot to know about the behavior.

Continuous : To understand more about a continuous variable you could use measures of central tendencies(mean, median and mode). You can also use Inter quartile range for outlier identification.


## BIVARIATE ANALYSIS :

Here we can come across following cases:

- Categorical & Categorical
- Categorical & Continuous
- Continuous and Continuous

I would not go in very much detail of the things used here in this article. The detailed description will be given in the articles following this one.

Categorical & Categorical:

Following are the things we could use for analyzing two categorical variables:

- A two way table of counts, count percent, contingency table.
- Stacked column charts for the two categorical variables.
- Chi-square test of independence.

<figure>
<img src="../images/blog_things_miss/im1.webp" alt="Trulli" style="width:100%">
<figcaption align = "center">two way table of counts</figcaption>
</figure>

<figure>
<img src="../images/blog_things_miss/im2.gif" alt="Trulli" style="width:100%">
<figcaption align = "center">stacked charts</figcaption>
</figure>


### Categorical & Continuous :

Following are the things we could use for analyzing categorical and continuous variables :

- Box plots for each level of categorical variables. Be careful about choosing these levels as if there is a small number in a particular level, the results would be insignificant.

- Z-Test

- T-test

- Anova (analysis of variance)

Continuous & Continuous :

Following are the things we could use for analyzing two continuous variables :

- Scatter plots between the two variables
- Correlation between the variables
<figure>
<img src="../images/blog_things_miss/im3.gif" alt="Trulli" style="width:100%">
<figcaption align = "center">scatter plot</figcaption>
</figure>
Binary file added css/.DS_Store
Binary file not shown.
48 changes: 48 additions & 0 deletions css/html5reset.css
Original file line number Diff line number Diff line change
@@ -0,0 +1,48 @@
/* http://meyerweb.com/eric/tools/css/reset/
v2.0 | 20110126
License: none (public domain)
*/

html, body, div, span, applet, object, iframe,
h1, h2, h3, h4, h5, h6, p, blockquote, pre,
a, abbr, acronym, address, big, cite, code,
del, dfn, em, img, ins, kbd, q, s, samp,
small, strike, strong, sub, sup, tt, var,
b, u, i, center,
dl, dt, dd, ol, ul, li,
fieldset, form, label, legend,
table, caption, tbody, tfoot, thead, tr, th, td,
article, aside, canvas, details, embed,
figure, figcaption, footer, header, hgroup,
menu, nav, output, ruby, section, summary,
time, mark, audio, video {
margin: 0;
padding: 0;
border: 0;
font-size: 100%;
font: inherit;
vertical-align: baseline;
}
/* HTML5 display-role reset for older browsers */
article, aside, details, figcaption, figure,
footer, header, hgroup, menu, nav, section {
display: block;
}
body {
line-height: 1;
}
ol, ul {
list-style: none;
}
blockquote, q {
quotes: none;
}
blockquote:before, blockquote:after,
q:before, q:after {
content: '';
content: none;
}
table {
border-collapse: collapse;
border-spacing: 0;
}
Loading

0 comments on commit 65d8a20

Please sign in to comment.