-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathanalysis.html
327 lines (221 loc) · 13 KB
/
analysis.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
<!DOCTYPE html>
<html lang="en-us">
<head>
<link href="http://gmpg.org/xfn/11" rel="profile">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta http-equiv="content-type" content="text/html; charset=utf-8">
<!-- Enable responsiveness on mobile devices-->
<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1">
<title>
Berkeley Marriage Pact - Data Analysis
</title>
<!-- CSS -->
<link rel="stylesheet" href="./public/css/poole.css">
<link rel="stylesheet" href="./public/css/syntax.css">
<link rel="stylesheet" href="./public/css/lanyon.css">
<link rel="stylesheet" href="https://fonts.googleapis.com/css?family=PT+Serif:400,400italic,700%7CPT+Sans:400">
<!-- Icons -->
<link rel="apple-touch-icon-precomposed" sizes="144x144" href="./public/apple-touch-icon-precomposed.png">
<link rel="shortcut icon" href="./public/favicon.ico">
<!-- Font -->
<link href="https://fonts.googleapis.com/css?family=EB+Garamond&display=swap" rel="stylesheet">
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-157593138-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-157593138-1');
</script>
</head>
<body>
<!-- Wrap is the content to shift when toggling the sidebar. We wrap the
content to avoid any CSS collisions with our real content. -->
<div class="wrap">
<div class="masthead" style="border-bottom: 0px">
</div>
<div class="container content">
<div class="post">
<h1 class="post-title">Berkeley Marriage Pact - Data Analysis</h1>
<span class="post-date">By: Daniel Geng</span>
<span class="post-date">25 Feb 2020</span>
<h2 id="the-berkeley-marriage-pact">The Berkeley Marriage Pact</h2>
<p>Dating apps, at least it seems to me, operate on perverse incentives. Take two Tinder users who are matched together. They go on a date together, sparks fly, and love blossoms. They start dating, get married, and never open Tinder again. Years pass and the app is eventually uninstalled. In their success lies Tinder’s loss.</p>
<p>The true optimal strategy for Tinder and other dating apps is to keep users swiping for as long as possible. Feed users matches who are <em>almost</em> compatible. Keep them crawling back to shoot their shot. Keep them pulling on the lever of the slot machine.</p>
<p>At least, this was the thesis on which the Berkeley Marriage Pact was built. The idea was simple. Write a survey, have Berkeley students take it, and match people based on survey responses. And unlike dating apps, we have no ulterior motive. Our sole goal was to maximize the number of relationships.</p>
<p>All in all, we had more than 3,000 responses, of which about 2500 confirmed their emails. In the end we made about 1300 matches. While the success of these matches is still not clear, we would like to present some interesting findings from the data.</p>
<h2 id="tldr">TL;DR</h2>
<p>If you’re too lazy to read everything, I’ve organized the parts of this post that absolutely scintillate into clickbait titles for your enjoyment:</p>
<ul>
<li><a href="#smarter">Only 30% of women, 26% of Hispanics, and 27% of Blacks think they are smarter than average. 60% of men think they are smarter than average.</a></li>
<li><a href="#depressed">There seems to be a very depressed portion of Berkeley.</a></li>
<li><a href="#extreme">The more extreme your political views, the more extreme your sex life.</a></li>
<li><a href="#choice">Choice of Alcohol.</a></li>
<li><a href="#fish">Fish people are the most confident.</a></li>
<li><a href="#sex">Sex.</a></li>
<li><a href="#politics">The part where we laugh at the Right.</a></li>
<li><a href="#dunning">Dunning-Kruger.</a></li>
</ul>
<h2 id="table-of-contents">Table of Contents</h2>
<ul>
<li><a href="#data">The Data</a></li>
<li><a href="#innocence">Loss of Innocence</a></li>
<li><a href="#sex">Sex, Sexuality, and Sexual Partners</a></li>
<li><a href="#politics">Political Affiliation</a></li>
<li><a href="#health">Health and Lifestyle</a></li>
<li><a href="#smarter">Are you smarter than the Average Bear?</a></li>
</ul>
<h2 id="data">The Data</h2>
<p>We asked 43 questions, ranging from “religious affiliation” to “boba consumption.” All data revealed in this post will be in aggregate and no individual data should be recoverable. If you have any concerns please let us know <a href="mailto:[email protected]" target="_blank">through email</a>.</p>
<p>This post mainly contains interesting correlations and trends in the data. You can see a presentation of just the <a href="https://berkeley206399.typeform.com/report/YvzqQp/hKNOBS5tFVGJ36Y4" target="_blank">aggregate responses here.</a></p>
<div align="middle">
<img src="./public/imgs/tsne.png" align="middle" width="600px" />
<figcaption align="middle">t-SNE visualization of survey responses. <a href="https://distill.pub/2016/misread-tsne/" target="_blank">We honestly have no idea what the clusters mean.</a></figcaption>
</div>
<hr />
<h2 id="innocence">Loss of Innocence</h2>
<blockquote>
<p>“If a body meet a body coming through the rye”</p>
<p>– Rabbie Burns, “Comin thro’ the Rye”</p>
</blockquote>
<p>It’s not entirely surprising that there is a loss of innocence as people go through college, although it’s mildly interesting that we are able to quantify and graph this loss of innocence.</p>
<p>Let’s start with the sexual stuff:</p>
<div align="middle">
<img src="./public/imgs/innocence/num_sexual_year.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/innocence/adventurousness_year.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/innocence/num_dates_year.png" align="middle" width="600px" />
</div>
<p>So all in all, people have had more sexual partners, become more sexually adventurous, and get more desperate as they grow older. It’s probably as good a time as any to point out that the error bars capture a 95% confidence interval.</p>
<div id="choice"></div>
<h3 id="substance-abuse">Substance Abuse</h3>
<p>We can also take a look at various substances that people put in their bodies:</p>
<div align="middle">
<img src="./public/imgs/innocence/alc_choice_year.png" align="middle" width="600px" />
</div>
<p>It’s interesting to note that vodka preference drops drastically with age. Perhaps fueled by the <em>“omg I fucking love to drink, vodka is my favorite, I’m so cool”</em> to <em>“holy shit I’m literally just guzzling industrial grade solvents what am I doing with my life”</em> transition that most people go through.</p>
<p>I also like how “Wine” peaks in Junior year and “Beer” just keeps rising, as if people go from <em>“I love vodka”</em> to <em>“I’m fancy as fuck”</em> to <em>“I just don’t care anymore”</em>.</p>
<div align="middle">
<img src="./public/imgs/innocence/marijuana_year.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/innocence/boba_year.png" align="middle" width="600px" />
</div>
<p>It’s interesting how boba consumption drops off as well, perhaps because seniors have finally come to terms with the fact that they can’t put just anything in to their bodies. In a similar vein, it seems that people actually live healthier lifestyles as they age:</p>
<div align="middle">
<img src="./public/imgs/innocence/exercise_year.png" align="middle" width="600px" />
</div>
<p>As a side note, while these differences seem small you have to remember that it’s really the error bars that tell the story. While the increase in absolute days of exercise is small the error bars show that this increase <em>is</em> statistically significant.</p>
<p>Finally, perhaps the saddest graph in the section, we see Berkeley slowly crushing the intellectual curiosity and academic vitality out of her students. Or at least seniors just care about things a whole lot less:</p>
<div align="middle">
<img src="./public/imgs/innocence/academics_year.png" align="middle" width="600px" />
</div>
<hr />
<h2 id="sex">Sex, Sexuality, and Sexual Partners</h2>
<blockquote>
<p>“He toss my salad like his name Romaine”</p>
<p>– Nicki Minaj, “Anaconda”</p>
</blockquote>
<p>My lawyers advised me not to say too much in this section, so here are some box and whisker plots (with the outliers removed):</p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_gender.png" align="middle" width="600px" />
</div>
<p>And the distributions for male and female:</p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_gender_hist.png" align="middle" width="600px" />
</div>
<p><br /></p>
<p>More boxplots:</p>
<p><br /></p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_race.png" align="middle" width="600px" />
</div>
<p><br /></p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_religion.png" align="middle" width="600px" />
</div>
<p><br /></p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_sexuality.png" align="middle" width="600px" />
</div>
<p><br /></p>
<div id="depressed"></div>
<p>If we plot out “number of sexual partners” against “importance of academics” we get an interesting trend:</p>
<div align="middle">
<img src="./public/imgs/sex/academics_num_sexual.png" align="middle" width="600px" />
</div>
<p>It’s sort of sad to see that there seems to be a rather depressed portion of Berkeley students that are not only apathetic towards academics, but can’t get any sex either. You would think the relationship should be the other way around (students that don’t care about academics should have boat-loads of sex), and this is true except for the students who just don’t care about academics at all.</p>
<div id="extreme"></div>
<p>This is probably one of my favorite plots:</p>
<div align="middle">
<img src="./public/imgs/sex/num_sexual_political.png" align="middle" width="600px" />
</div>
<p>Look at how beautiful this U-shape is. The underlying mechanism here is probably that politically extreme people are also extreme in other parts of their lives.</p>
<hr />
<h2 id="politics">Political Affiliation</h2>
<blockquote>
<p>“the night wind hammering about their siren scream and their mighty metal thunder!”</p>
<p>– Ray Bradbury, “Fahrenheit 451”</p>
</blockquote>
<p>This is the part of the post where we simultaneously marvel at how the Right basically writes our jokes for us and try not to be featured on “Fox and Friends.”</p>
<div align="middle">
<img src="./public/imgs/politics/academics_political.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/politics/num_children_political.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/politics/num_dates_political.png" align="middle" width="600px" />
</div>
<hr />
<h2 id="health">Health and Lifestyle</h2>
<blockquote>
<p>“But if snow is like skin: it pulls away, so easy, dragged from the body / What if all stripped wood is branches and all frozen lakes are water?”</p>
<p>– Jordaan Mason, “Avalanches”</p>
</blockquote>
<p>Berkeley students are surprisingly healthy:</p>
<div align="middle">
<img src="./public/imgs/health/exercise.png" align="middle" width="600px" />
</div>
<div align="middle">
<img src="./public/imgs/health/boba.png" align="middle" width="600px" />
</div>
<hr />
<h2 id="smarter">Are you Smarter than the Average Bear?</h2>
<blockquote>
<p>“I’m smarter than the av-er-age bear!”</p>
<p>– Yogi Bear</p>
</blockquote>
<p>The last question we asked was “Are you smarter than the average (Cal) Bear?” 46.2% of people responded yes to this question, but things start to get interesting when you break down the results by other categories. For example, let’s start with gender. We plot the proportion of people that answered “yes” to the question:</p>
<div align="middle">
<img src="./public/imgs/smarter/smarter_gender.png" align="middle" width="600px" />
</div>
<p>60% of men say they are smarter, while only 30% of women think they are smarter than average. The difference is stark. The percentages are even worse if we look at race:</p>
<div align="middle">
<img src="./public/imgs/smarter/smarter_race.png" align="middle" width="600px" />
</div>
<div id="fish"></div>
<p>On a lighter note it seems that fish-people are for some strange reason very confident:</p>
<div align="middle">
<img src="./public/imgs/smarter/smarter_pet.png" align="middle" width="600px" />
</div>
<div id="dunning"></div>
<p>Finally, if we break the data down by “Perceived importance of academics” we get an interesting Dunning-Kruger-esque trend:</p>
<div align="middle">
<img src="./public/imgs/smarter/smarter_academics.png" align="middle" width="600px" />
</div>
</div>
<!--
<div class="related">
<h2>Related Posts</h2>
<ul class="related-posts">
</ul>
</div>
-->
</div>
</div>
</body>
</html>