diff --git a/.DS_Store b/.DS_Store
index 51c5e16..c3daac9 100644
Binary files a/.DS_Store and b/.DS_Store differ
diff --git a/_site/background.html b/_site/background.html
index e2f0136..b828d00 100644
--- a/_site/background.html
+++ b/_site/background.html
@@ -59,6 +59,8 @@
   }
 }</script>
 
+  <script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script>
+  <script src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml-full.js" type="text/javascript"></script>
 
 <link rel="stylesheet" href="styles.css">
 </head>
@@ -83,6 +85,10 @@
   <li class="nav-item">
     <a class="nav-link" href="./index.html" rel="" target="">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link" href="./codebook.html" rel="" target="">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link active" href="./background.html" rel="" target="" aria-current="page">
@@ -108,7 +114,11 @@
     <h2 id="toc-title">On this page</h2>
    
   <ul>
-  <li><a href="#the-washington-post-fatal-force-database" id="toc-the-washington-post-fatal-force-database" class="nav-link active" data-scroll-target="#the-washington-post-fatal-force-database">The Washington Post Fatal Force Database</a></li>
+  <li><a href="#proposal" id="toc-proposal" class="nav-link active" data-scroll-target="#proposal">Proposal</a>
+  <ul class="collapse">
+  <li><a href="#objectives" id="toc-objectives" class="nav-link" data-scroll-target="#objectives"><strong>Objectives:</strong></a></li>
+  </ul></li>
+  <li><a href="#the-washington-post-fatal-force-database" id="toc-the-washington-post-fatal-force-database" class="nav-link" data-scroll-target="#the-washington-post-fatal-force-database">The Washington Post Fatal Force Database</a></li>
   <li><a href="#most-police-dont-live-in-the-cities-they-serve" id="toc-most-police-dont-live-in-the-cities-they-serve" class="nav-link" data-scroll-target="#most-police-dont-live-in-the-cities-they-serve">Most Police Don’t Live In The Cities They Serve</a></li>
   </ul>
 </nav>
@@ -136,8 +146,60 @@ <h1 class="title">Background</h1>
 <blockquote class="blockquote">
 <h1 id="on-average-police-in-the-united-states-shoot-and-kill-more-than-1000-people-every-year-according-to-an-ongoing-analysis-by-the-washington-post." style="red">On average, police in the United States shoot and kill more than 1,000 people every year, according to an ongoing analysis by The Washington Post.</h1>
 </blockquote>
-<section id="the-washington-post-fatal-force-database" class="level2">
-<h2 class="anchored" data-anchor-id="the-washington-post-fatal-force-database">The Washington Post Fatal Force Database</h2>
+<section id="proposal" class="level1">
+<h1>Proposal</h1>
+<p>We propose a case study to explore the relationship between police residence and fatal police shootings, employing advanced data science methodologies. Focusing on officers residing in the cities they serve, our project aims to uncover insights and patterns that contribute to a nuanced understanding of this complex issue.</p>
+<section id="objectives" class="level3">
+<h3 class="anchored" data-anchor-id="objectives"><strong>Objectives:</strong></h3>
+<ol type="1">
+<li>Investigate the correlation between police residence and fatal police shootings.</li>
+<li>Utilize a comprehensive dataset spanning 2015 to 2023, focusing on police agencies involved in at least one fatal shooting.</li>
+<li>Apply advanced statistical methods and machine learning techniques to identify patterns and potential biases.</li>
+<li>Examine disparities in incident rates based on officers’ residency status, considering demographic, socioeconomic, and policing variables.</li>
+</ol>
+<p><strong>Methodology:</strong></p>
+<ol type="a">
+<li>Data Collection:
+<ul>
+<li>Compile a dataset comprising information on police agencies involved in fatal police shootings.</li>
+<li>Compile a data set of census variables such as officer residency, race, community demographics, and departmental policies.</li>
+</ul></li>
+<li>Analysis:
+<ul>
+<li>Employ advanced statistical methods and machine learning techniques to discern patterns and correlations.</li>
+<li>Conduct a comprehensive exploration of variables influencing fatal police shootings.</li>
+</ul></li>
+</ol>
+<p><strong>Hypothesis and Expected Outcomes:</strong></p>
+<p>We will conduct two hypothesis tests to analyze both:</p>
+<ul>
+<li><p>the numerical relationship between the proportion of officers who live in the city they serve and the number of fatal police shooting deaths, and</p></li>
+<li><p>the categorical difference in fatal police shooting deaths between cities where a majority or a minority of police officers live in the city.</p></li>
+</ul>
+<ol type="1">
+<li><p>Inference for a Difference in Proportions</p>
+<ul>
+<li><p><span class="math inline">\(H_0\)</span>: The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.</p></li>
+<li><p><span class="math inline">\(H_A\)</span>: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.</p>
+<ul>
+<li><span class="math inline">\(H_0 : p_{maj} - p_{min} = 0\)</span>, or equivalently <span class="math inline">\(H_0 : p_{maj} = p_{min}\)</span></li>
+<li><span class="math inline">\(H_A : p_{maj} - p_{min} &lt; 0\)</span>, or equivalently <span class="math inline">\(H_A : p_{maj} &lt; p_{min}\)</span></li>
+</ul></li>
+</ul></li>
+<li><p>Inference for a Correlation</p>
+<ul>
+<li><p><span class="math inline">\(H_0\)</span>: There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p></li>
+<li><p><span class="math inline">\(H_A\)</span>: There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p>
+<ul>
+<li><p><span class="math inline">\(H_0 : \rho = 0\)</span></p></li>
+<li><p><span class="math inline">\(H_A : \rho \neq 0\)</span></p></li>
+</ul></li>
+</ul></li>
+</ol>
+</section>
+</section>
+<section id="the-washington-post-fatal-force-database" class="level1">
+<h1>The Washington Post Fatal Force Database</h1>
 <p>In 2015, The Washington Post <a href="https://www.washingtonpost.com/graphics/investigations/police-shootings-database/">began tracking</a> details about each police-involved killing in the United States — the race of the deceased, the circumstances of the shooting, whether the person was armed and whether the person was experiencing a mental-health crisis — by manually culling local news reports, collecting information from law enforcement websites and social media, and monitoring independent databases such as <a href="https://fatalencounters.org/">Fatal Encounters</a> and the now-defunct Killed by Police project. In many cases, The Post conducts additional reporting.</p>
 <p>In 2022, The Post updated its database to standardize and publish the names of the police agencies involved in each shooting to better measure accountability at the department level.</p>
 <p>The 2014 killing of Michael Brown in Ferguson, Mo. began a protest movement culminating in the Black Lives Matter movement and an increased focus on police accountability nationwide. In this data set, The Post tracks only shootings with circumstances closely paralleling those like the killing of Brown — incidents in which a police officer, in the line of duty, shoots and kills a civilian. The Post is not tracking deaths of people in police custody, fatal shootings by off-duty officers or non-shooting deaths in this data set.</p>
diff --git a/_site/about.html b/_site/codebook.html
similarity index 73%
rename from _site/about.html
rename to _site/codebook.html
index c1b9615..2897ab9 100644
--- a/_site/about.html
+++ b/_site/codebook.html
@@ -7,7 +7,7 @@
 <meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
 
 
-<title>Police Shootings - About</title>
+<title>Police Shootings - Codebook</title>
 <style>
 code{white-space: pre-wrap;}
 span.smallcaps{font-variant: small-caps;}
@@ -20,40 +20,6 @@
   margin: 0 0.8em 0.2em -1em; /* quarto-specific, see https://github.com/quarto-dev/quarto-cli/issues/4556 */ 
   vertical-align: middle;
 }
-/* CSS for syntax highlighting */
-pre > code.sourceCode { white-space: pre; position: relative; }
-pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
-pre > code.sourceCode > span:empty { height: 1.2em; }
-.sourceCode { overflow: visible; }
-code.sourceCode > span { color: inherit; text-decoration: inherit; }
-div.sourceCode { margin: 1em 0; }
-pre.sourceCode { margin: 0; }
-@media screen {
-div.sourceCode { overflow: auto; }
-}
-@media print {
-pre > code.sourceCode { white-space: pre-wrap; }
-pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
-}
-pre.numberSource code
-  { counter-reset: source-line 0; }
-pre.numberSource code > span
-  { position: relative; left: -4em; counter-increment: source-line; }
-pre.numberSource code > span > a:first-child::before
-  { content: counter(source-line);
-    position: relative; left: -1em; text-align: right; vertical-align: baseline;
-    border: none; display: inline-block;
-    -webkit-touch-callout: none; -webkit-user-select: none;
-    -khtml-user-select: none; -moz-user-select: none;
-    -ms-user-select: none; user-select: none;
-    padding: 0 4px; width: 4em;
-  }
-pre.numberSource { margin-left: 3em;  padding-left: 4px; }
-div.sourceCode
-  {   }
-@media screen {
-pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; }
-}
 </style>
 
 
@@ -117,6 +83,10 @@
   <li class="nav-item">
     <a class="nav-link" href="./index.html" rel="" target="">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link active" href="./codebook.html" rel="" target="" aria-current="page">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link" href="./background.html" rel="" target="">
@@ -138,14 +108,25 @@
 <!-- sidebar -->
 <!-- margin-sidebar -->
     <div id="quarto-margin-sidebar" class="sidebar margin-sidebar">
-        
+        <nav id="TOC" role="doc-toc" class="toc-active">
+    <h2 id="toc-title">On this page</h2>
+   
+  <ul>
+  <li><a href="#explanatory-variables" id="toc-explanatory-variables" class="nav-link active" data-scroll-target="#explanatory-variables">Explanatory Variables</a>
+  <ul class="collapse">
+  <li><a href="#incident-information" id="toc-incident-information" class="nav-link" data-scroll-target="#incident-information">Incident Information</a></li>
+  <li><a href="#agency-information" id="toc-agency-information" class="nav-link" data-scroll-target="#agency-information">Agency Information</a></li>
+  </ul></li>
+  <li><a href="#project-thoughts" id="toc-project-thoughts" class="nav-link" data-scroll-target="#project-thoughts">Project thoughts</a></li>
+  </ul>
+</nav>
     </div>
 <!-- main -->
 <main class="content" id="quarto-document-content">
 
 <header id="title-block-header" class="quarto-title-block default">
 <div class="quarto-title">
-<h1 class="title">About</h1>
+<h1 class="title">Codebook</h1>
 </div>
 
 
@@ -160,15 +141,139 @@ <h1 class="title">About</h1>
 
 </header>
 
-<p>About this site</p>
-<div class="cell">
-<div class="sourceCode cell-code" id="cb1"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb1-1"><a href="#cb1-1" aria-hidden="true" tabindex="-1"></a><span class="dv">1</span> <span class="sc">+</span> <span class="dv">1</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code>[1] 2</code></pre>
-</div>
-</div>
+<section id="explanatory-variables" class="level2">
+<h2 class="anchored" data-anchor-id="explanatory-variables">Explanatory Variables</h2>
+<table class="table">
+<colgroup>
+<col style="width: 34%">
+<col style="width: 65%">
+</colgroup>
+<thead>
+<tr class="header">
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr class="odd">
+<td><code>city</code></td>
+<td>U.S. city</td>
+</tr>
+<tr class="even">
+<td><code>police_force_size</code></td>
+<td>Number of police officers serving that city</td>
+</tr>
+<tr class="odd">
+<td><code>all</code></td>
+<td>Percentage of the total police force that lives in the city</td>
+</tr>
+<tr class="even">
+<td><code>white</code></td>
+<td>Percentage of white (non-Hispanic) police officers who live in the city</td>
+</tr>
+<tr class="odd">
+<td><code>non-white</code></td>
+<td>Percentage of non-white police officers who live in the city</td>
+</tr>
+<tr class="even">
+<td><code>black</code></td>
+<td>Percentage of black police officers who live in the city</td>
+</tr>
+<tr class="odd">
+<td><code>hispanic</code></td>
+<td>Percentage of Hispanic police officers who live in the city</td>
+</tr>
+<tr class="even">
+<td><code>asian</code></td>
+<td>Percentage of Asian police officers who live in the city</td>
+</tr>
+</tbody>
+</table>
+<section id="incident-information" class="level3">
+<h3 class="anchored" data-anchor-id="incident-information">Incident Information</h3>
+<table class="table">
+<colgroup>
+<col style="width: 34%">
+<col style="width: 65%">
+</colgroup>
+<thead>
+<tr class="header">
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr class="odd">
+<td><code>id</code></td>
+<td>A unique identifier for each fatal police shooting incident.</td>
+</tr>
+<tr class="even">
+<td><code>date</code></td>
+<td>The date of the fatal shooting.</td>
+</tr>
+<tr class="odd">
+<td><code>body_camera</code></td>
+<td>Whether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.</td>
+</tr>
+<tr class="even">
+<td><code>city</code></td>
+<td>The municipality where the fatal shooting took place.</td>
+</tr>
+<tr class="odd">
+<td><code>county</code></td>
+<td>County where the fatal shooting took place.</td>
+</tr>
+<tr class="even">
+<td><code>state</code></td>
+<td>The two-letter postal code abbreviation for the state in which the fatal shooting took place.</td>
+</tr>
+<tr class="odd">
+<td><code>latitude</code></td>
+<td>The latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates vary depending on the precision of the input address, which is often only available at the block level.</td>
+</tr>
+<tr class="even">
+<td><code>longitude</code></td>
+<td>The longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.</td>
+</tr>
+</tbody>
+</table>
+</section>
+<section id="agency-information" class="level3">
+<h3 class="anchored" data-anchor-id="agency-information">Agency Information</h3>
+<table class="table">
+<thead>
+<tr class="header">
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr class="odd">
+<td><code>id</code></td>
+<td>Department database ID.</td>
+</tr>
+<tr class="even">
+<td><code>name</code></td>
+<td>Department name.</td>
+</tr>
+<tr class="odd">
+<td><code>state</code></td>
+<td>State in which the agency is located.</td>
+</tr>
+</tbody>
+</table>
+</section>
+</section>
+<section id="project-thoughts" class="level2">
+<h2 class="anchored" data-anchor-id="project-thoughts">Project thoughts</h2>
+<p>I am interested in exploring data related to…</p>
+<ul>
+<li>Political Extremism</li>
+<li>Black American Opinion</li>
+</ul>
 
 
+</section>
 
 </main> <!-- /main -->
 <script id="quarto-html-after-body" type="application/javascript">
diff --git a/_site/data.html b/_site/data.html
index 6e3d1f1..e2c8a7b 100644
--- a/_site/data.html
+++ b/_site/data.html
@@ -93,6 +93,8 @@
   }
 }</script>
 
+  <script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script>
+  <script src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml-full.js" type="text/javascript"></script>
 
 <link rel="stylesheet" href="styles.css">
 </head>
@@ -117,6 +119,10 @@
   <li class="nav-item">
     <a class="nav-link" href="./index.html" rel="" target="">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link" href="./codebook.html" rel="" target="">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link" href="./background.html" rel="" target="">
@@ -161,369 +167,165 @@ <h1 class="title">Data</h1>
 </header>
 
 <div class="cell">
-<div class="sourceCode cell-code" id="cb1"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb1-1"><a href="#cb1-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(tidyverse)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: package 'lubridate' was built under R version 4.3.1</code></pre>
-</div>
-<div class="cell-output cell-output-stderr">
-<pre><code>── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
-✔ dplyr     1.1.3     ✔ readr     2.1.4
-✔ forcats   1.0.0     ✔ stringr   1.5.0
-✔ ggplot2   3.4.2     ✔ tibble    3.2.1
-✔ lubridate 1.9.3     ✔ tidyr     1.3.0
-✔ purrr     1.0.2     
-── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
-✖ dplyr::filter() masks stats::filter()
-✖ dplyr::lag()    masks stats::lag()
-ℹ Use the conflicted package (&lt;http://conflicted.r-lib.org/&gt;) to force all conflicts to become errors</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb4"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb4-1"><a href="#cb4-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(usmap)</span>
-<span id="cb4-2"><a href="#cb4-2" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(sf)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Linking to GEOS 3.11.0, GDAL 3.5.3, PROJ 9.1.0; sf_use_s2() is TRUE</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb6"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb6-1"><a href="#cb6-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(infer)</span>
-<span id="cb6-2"><a href="#cb6-2" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(moderndive)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb1"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb1-1"><a href="#cb1-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(tidyverse)</span>
+<span id="cb1-2"><a href="#cb1-2" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(usmap)</span>
+<span id="cb1-3"><a href="#cb1-3" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(sf)</span>
+<span id="cb1-4"><a href="#cb1-4" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(infer)</span>
+<span id="cb1-5"><a href="#cb1-5" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(moderndive)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <div class="cell">
-<div class="sourceCode cell-code" id="cb7"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb7-1"><a href="#cb7-1" aria-hidden="true" tabindex="-1"></a><span class="do">##Tidying Data</span></span>
-<span id="cb7-2"><a href="#cb7-2" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb7-3"><a href="#cb7-3" aria-hidden="true" tabindex="-1"></a><span class="co">#creating dfs from .csv files</span></span>
-<span id="cb7-4"><a href="#cb7-4" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/police-locals.csv"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Rows: 75 Columns: 10
-── Column specification ────────────────────────────────────────────────────────
-Delimiter: ","
-chr (6): city_old, city, state, black, hispanic, asian
-dbl (4): police_force_size, all, white, non-white
-
-ℹ Use `spec()` to retrieve the full column specification for this data.
-ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb9"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb9-1"><a href="#cb9-1" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-agencies.csv"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Rows: 3422 Columns: 6
-── Column specification ────────────────────────────────────────────────────────
-Delimiter: ","
-chr (4): name, type, state, oricodes
-dbl (2): id, total_shootings
-
-ℹ Use `spec()` to retrieve the full column specification for this data.
-ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb11"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb11-1"><a href="#cb11-1" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-data.csv"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Rows: 9129 Columns: 19
-── Column specification ────────────────────────────────────────────────────────
-Delimiter: ","
-chr  (12): threat_type, flee_status, armed_with, city, county, state, locati...
-dbl   (4): id, latitude, longitude, age
-lgl   (2): was_mental_illness_related, body_camera
-date  (1): date
-
-ℹ Use `spec()` to retrieve the full column specification for this data.
-ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb13"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb13-1"><a href="#cb13-1" aria-hidden="true" tabindex="-1"></a><span class="co">#removing old `city` tag from data set that we created when decatenated the city names</span></span>
-<span id="cb13-2"><a href="#cb13-2" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> police_locals <span class="sc">|&gt;</span></span>
-<span id="cb13-3"><a href="#cb13-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>city_old)</span>
-<span id="cb13-4"><a href="#cb13-4" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb13-5"><a href="#cb13-5" aria-hidden="true" tabindex="-1"></a><span class="co"># creating `agencies` df with just police departments</span></span>
-<span id="cb13-6"><a href="#cb13-6" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
-<span id="cb13-7"><a href="#cb13-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="fu">grepl</span>(<span class="st">"department"</span>, <span class="fu">tolower</span>(name))) <span class="sc">|&gt;</span></span>
-<span id="cb13-8"><a href="#cb13-8" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="sc">!</span><span class="fu">grepl</span>(<span class="st">"county"</span>, <span class="fu">tolower</span>(name)))</span>
-<span id="cb13-9"><a href="#cb13-9" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb13-10"><a href="#cb13-10" aria-hidden="true" tabindex="-1"></a><span class="co">#creating binned categorical account of if shooting victim was `armed`</span></span>
-<span id="cb13-11"><a href="#cb13-11" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
-<span id="cb13-12"><a href="#cb13-12" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">armed =</span> <span class="fu">case_when</span>(<span class="fu">is.na</span>(armed_with) <span class="sc">~</span> <span class="st">"NO"</span>,</span>
-<span id="cb13-13"><a href="#cb13-13" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unarmed"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
-<span id="cb13-14"><a href="#cb13-14" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unknown"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
-<span id="cb13-15"><a href="#cb13-15" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"undetermined"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
-<span id="cb13-16"><a href="#cb13-16" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"gun"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
-<span id="cb13-17"><a href="#cb13-17" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"knife"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
-<span id="cb13-18"><a href="#cb13-18" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"blunt_object"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
-<span id="cb13-19"><a href="#cb13-19" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"other"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
-<span id="cb13-20"><a href="#cb13-20" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"replica"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
-<span id="cb13-21"><a href="#cb13-21" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"vehicle"</span> <span class="sc">~</span> <span class="st">"YES"</span>))</span>
-<span id="cb13-22"><a href="#cb13-22" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb13-23"><a href="#cb13-23" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with only agency `names`, `id`, and `state`</span></span>
-<span id="cb13-24"><a href="#cb13-24" aria-hidden="true" tabindex="-1"></a>agencies_ids <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
-<span id="cb13-25"><a href="#cb13-25" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(name, id, state)</span>
-<span id="cb13-26"><a href="#cb13-26" aria-hidden="true" tabindex="-1"></a>agencies_ids</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 2,057 × 3
-   name                                   id state
-   &lt;chr&gt;                               &lt;dbl&gt; &lt;chr&gt;
- 1 Aberdeen Police Department           2576 WA   
- 2 Abilene Police Department            2114 TX   
- 3 Abington Township Police Department  2088 PA   
- 4 Acworth Police Department            3375 GA   
- 5 Ada Police Department                2579 OK   
- 6 Adel Police Department               3107 GA   
- 7 Akron Police Department               815 OH   
- 8 Alamogordo Police Department         1434 NM   
- 9 Alamosa Police Department            2354 CO   
-10 Albany Police Department             1443 GA   
-# ℹ 2,047 more rows</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb15"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb15-1"><a href="#cb15-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city`, `agency`, and `state` info for each shooting</span></span>
-<span id="cb15-2"><a href="#cb15-2" aria-hidden="true" tabindex="-1"></a>shooting_agencies <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
-<span id="cb15-3"><a href="#cb15-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(city, agency_ids, state)</span>
-<span id="cb15-4"><a href="#cb15-4" aria-hidden="true" tabindex="-1"></a>shooting_agencies</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 9,129 × 3
-   city          agency_ids state
-   &lt;chr&gt;         &lt;chr&gt;      &lt;chr&gt;
- 1 Shelton       73         WA   
- 2 Aloha         70         OR   
- 3 Wichita       238        KS   
- 4 San Francisco 196        CA   
- 5 Evans         473        CO   
- 6 Guthrie       101        OK   
- 7 Chandler      195        AZ   
- 8 Assaria       490        KS   
- 9 Burlington    287        IA   
-10 Knoxville     26254      PA   
-# ℹ 9,119 more rows</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb17"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb17-1"><a href="#cb17-1" aria-hidden="true" tabindex="-1"></a><span class="co">#changing `shooting` var in `shooting_agencies` df to numeric</span></span>
-<span id="cb17-2"><a href="#cb17-2" aria-hidden="true" tabindex="-1"></a>shooting_agencies<span class="sc">$</span>agency_ids <span class="ot">&lt;-</span> <span class="fu">as.numeric</span>(shootings<span class="sc">$</span>agency_ids)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: NAs introduced by coercion</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb19"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb19-1"><a href="#cb19-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`</span></span>
-<span id="cb19-2"><a href="#cb19-2" aria-hidden="true" tabindex="-1"></a>agencies_w_cities <span class="ot">&lt;-</span> agencies_ids <span class="sc">|&gt;</span></span>
-<span id="cb19-3"><a href="#cb19-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shooting_agencies, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"id"</span> <span class="ot">=</span> <span class="st">"agency_ids"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
-<span id="cb19-4"><a href="#cb19-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(city) <span class="sc">|&gt;</span></span>
-<span id="cb19-5"><a href="#cb19-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span>
-<span id="cb19-6"><a href="#cb19-6" aria-hidden="true" tabindex="-1"></a>agencies_w_cities</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 1,781 × 4
-   name                                   id state city             
-   &lt;chr&gt;                               &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;            
- 1 Aberdeen Police Department           2576 WA    Aberdeen         
- 2 Abilene Police Department            2114 TX    Abilene          
- 3 Abington Township Police Department  2088 PA    Abington Township
- 4 Acworth Police Department            3375 GA    Acworth          
- 5 Ada Police Department                2579 OK    Ada              
- 6 Adel Police Department               3107 GA    Adel             
- 7 Akron Police Department               815 OH    Akron            
- 8 Alamogordo Police Department         1434 NM    Alamogordo       
- 9 Alamosa Police Department            2354 CO    Alamosa          
-10 Albany Police Department             1443 GA    Albany           
-# ℹ 1,771 more rows</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb21"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb21-1"><a href="#cb21-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`</span></span>
-<span id="cb21-2"><a href="#cb21-2" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_w_cities <span class="sc">|&gt;</span></span>
-<span id="cb21-3"><a href="#cb21-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">full_join</span>(police_locals, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
-<span id="cb21-4"><a href="#cb21-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(police_force_size) <span class="sc">|&gt;</span></span>
-<span id="cb21-5"><a href="#cb21-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
-<span id="cb21-6"><a href="#cb21-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">majority =</span> <span class="fu">if_else</span>(all <span class="sc">&gt;=</span> <span class="fl">0.5</span>, <span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
-<span id="cb21-7"><a href="#cb21-7" aria-hidden="true" tabindex="-1"></a>agencies_census</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 109 × 12
-   name         id state city  police_force_size    all  white `non-white` black
-   &lt;chr&gt;     &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             &lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;       &lt;dbl&gt; &lt;chr&gt;
- 1 Albany P…  2237 NY    Alba…               890 0.185  0.160        0.364 **   
- 2 Albuquer…   508 NM    Albu…              1340 0.616  0.630        0.602 **   
- 3 Amtrak P…  1657 IL    Chic…             12120 0.875  0.872        0.877 0.89…
- 4 Atlanta …   447 GA    Atla…              2950 0.137  0.186        0.111 0.10…
- 5 Austin P…   141 TX    Aust…              1985 0.295  0.195        0.427 0.25 
- 6 Baltimor…  4784 MD    Balt…              2800 0.257  0.133        0.362 0.39…
- 7 Baltimor…   149 MD    Balt…              2800 0.257  0.133        0.362 0.39…
- 8 BART Pol…  2015 CA    Oakl…              1530 0.0948 0.0267       0.160 0.06…
- 9 Baton Ro…  1098 LA    Bato…               980 0.214  0.144        0.321 0.34…
-10 Boston P…     3 MA    Bost…              2560 0.477  0.442        0.583 0.68…
-# ℹ 99 more rows
-# ℹ 3 more variables: hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb23"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb23-1"><a href="#cb23-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df of only shootings involving agencies within `agencies` df</span></span>
-<span id="cb23-2"><a href="#cb23-2" aria-hidden="true" tabindex="-1"></a>shootings_case <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
-<span id="cb23-3"><a href="#cb23-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
-<span id="cb23-4"><a href="#cb23-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>agency_ids) <span class="sc">|&gt;</span></span>
-<span id="cb23-5"><a href="#cb23-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">rename</span>(<span class="at">agency_ids =</span> id.y, <span class="at">id =</span> id.x, <span class="at">agency =</span> name.y, <span class="at">victim =</span> name.x) <span class="sc">|&gt;</span></span>
-<span id="cb23-6"><a href="#cb23-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>location_precision, <span class="sc">-</span>race_source)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning in right_join(shootings, agencies_census, by = c(city = "city", : Detected an unexpected many-to-many relationship between `x` and `y`.
-ℹ Row 4 of `x` matches multiple rows in `y`.
-ℹ Row 29 of `y` matches multiple rows in `x`.
-ℹ If a many-to-many relationship is expected, set `relationship =
-  "many-to-many"` to silence this warning.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb25"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb25-1"><a href="#cb25-1" aria-hidden="true" tabindex="-1"></a>shootings_case</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 3,677 × 27
-      id date       threat_type flee_status armed_with city         county state
-   &lt;dbl&gt; &lt;date&gt;     &lt;chr&gt;       &lt;chr&gt;       &lt;chr&gt;      &lt;chr&gt;        &lt;chr&gt;  &lt;chr&gt;
- 1     5 2015-01-03 move        not         unarmed    Wichita      Sedgw… KS   
- 2     8 2015-01-04 point       not         replica    San Francis… San F… CA   
- 3     8 2015-01-04 point       not         replica    San Francis… San F… CA   
- 4    22 2015-01-07 threat      not         knife      Columbus     Frank… OH   
- 5    22 2015-01-07 threat      not         knife      Columbus     Frank… OH   
- 6    27 2015-01-07 shoot       foot        gun        New Orleans  Orlea… LA   
- 7   325 2015-01-09 point       not         gun        El Paso      El Pa… TX   
- 8    46 2015-01-13 shoot       foot        gun        Albuquerque  Berna… NM   
- 9    46 2015-01-13 shoot       foot        gun        Albuquerque  Berna… NM   
-10    56 2015-01-15 shoot       foot        gun        Indianapolis Marion IN   
-# ℹ 3,667 more rows
-# ℹ 19 more variables: latitude &lt;dbl&gt;, longitude &lt;dbl&gt;, victim &lt;chr&gt;,
-#   age &lt;dbl&gt;, gender &lt;chr&gt;, race &lt;chr&gt;, was_mental_illness_related &lt;lgl&gt;,
-#   body_camera &lt;lgl&gt;, armed &lt;chr&gt;, agency &lt;chr&gt;, agency_ids &lt;dbl&gt;,
-#   police_force_size &lt;dbl&gt;, all &lt;dbl&gt;, white &lt;dbl&gt;, `non-white` &lt;dbl&gt;,
-#   black &lt;chr&gt;, hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;</code></pre>
-</div>
+<div class="sourceCode cell-code" id="cb2"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb2-1"><a href="#cb2-1" aria-hidden="true" tabindex="-1"></a><span class="do">##Tidying Data</span></span>
+<span id="cb2-2"><a href="#cb2-2" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-3"><a href="#cb2-3" aria-hidden="true" tabindex="-1"></a><span class="co">#creating dfs from .csv files</span></span>
+<span id="cb2-4"><a href="#cb2-4" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/police-locals.csv"</span>)</span>
+<span id="cb2-5"><a href="#cb2-5" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-agencies.csv"</span>)</span>
+<span id="cb2-6"><a href="#cb2-6" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-data.csv"</span>)</span>
+<span id="cb2-7"><a href="#cb2-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-8"><a href="#cb2-8" aria-hidden="true" tabindex="-1"></a><span class="co">#removing the old `city_old` column from the data set, which we created when we de-concatenated the city names</span></span>
+<span id="cb2-9"><a href="#cb2-9" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> police_locals <span class="sc">|&gt;</span></span>
+<span id="cb2-10"><a href="#cb2-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>city_old)</span>
+<span id="cb2-11"><a href="#cb2-11" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-12"><a href="#cb2-12" aria-hidden="true" tabindex="-1"></a><span class="co"># creating `agencies` df with just police departments</span></span>
+<span id="cb2-13"><a href="#cb2-13" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
+<span id="cb2-14"><a href="#cb2-14" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="fu">grepl</span>(<span class="st">"department"</span>, <span class="fu">tolower</span>(name))) <span class="sc">|&gt;</span></span>
+<span id="cb2-15"><a href="#cb2-15" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="sc">!</span><span class="fu">grepl</span>(<span class="st">"county"</span>, <span class="fu">tolower</span>(name)))</span>
+<span id="cb2-16"><a href="#cb2-16" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-17"><a href="#cb2-17" aria-hidden="true" tabindex="-1"></a><span class="co">#creating binned categorical variable `armed` for whether the shooting victim was armed</span></span>
+<span id="cb2-18"><a href="#cb2-18" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb2-19"><a href="#cb2-19" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">armed =</span> <span class="fu">case_when</span>(<span class="fu">is.na</span>(armed_with) <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb2-20"><a href="#cb2-20" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unarmed"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb2-21"><a href="#cb2-21" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unknown"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb2-22"><a href="#cb2-22" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"undetermined"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb2-23"><a href="#cb2-23" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"gun"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb2-24"><a href="#cb2-24" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"knife"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb2-25"><a href="#cb2-25" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"blunt_object"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb2-26"><a href="#cb2-26" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"other"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb2-27"><a href="#cb2-27" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"replica"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb2-28"><a href="#cb2-28" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"vehicle"</span> <span class="sc">~</span> <span class="st">"YES"</span>))</span>
+<span id="cb2-29"><a href="#cb2-29" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-30"><a href="#cb2-30" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with only agency `name`, `id`, and `state`</span></span>
+<span id="cb2-31"><a href="#cb2-31" aria-hidden="true" tabindex="-1"></a>agencies_ids <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
+<span id="cb2-32"><a href="#cb2-32" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(name, id, state)</span>
+<span id="cb2-33"><a href="#cb2-33" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-34"><a href="#cb2-34" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city`, `agency`, and `state` info for each shooting</span></span>
+<span id="cb2-35"><a href="#cb2-35" aria-hidden="true" tabindex="-1"></a>shooting_agencies <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb2-36"><a href="#cb2-36" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(city, agency_ids, state)</span>
+<span id="cb2-37"><a href="#cb2-37" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-38"><a href="#cb2-38" aria-hidden="true" tabindex="-1"></a><span class="co">#changing `agency_ids` var in `shooting_agencies` df to numeric</span></span>
+<span id="cb2-39"><a href="#cb2-39" aria-hidden="true" tabindex="-1"></a>shooting_agencies<span class="sc">$</span>agency_ids <span class="ot">&lt;-</span> <span class="fu">as.numeric</span>(shootings<span class="sc">$</span>agency_ids)</span>
+<span id="cb2-40"><a href="#cb2-40" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-41"><a href="#cb2-41" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`</span></span>
+<span id="cb2-42"><a href="#cb2-42" aria-hidden="true" tabindex="-1"></a>agencies_w_cities <span class="ot">&lt;-</span> agencies_ids <span class="sc">|&gt;</span></span>
+<span id="cb2-43"><a href="#cb2-43" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shooting_agencies, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"id"</span> <span class="ot">=</span> <span class="st">"agency_ids"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb2-44"><a href="#cb2-44" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(city) <span class="sc">|&gt;</span></span>
+<span id="cb2-45"><a href="#cb2-45" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span>
+<span id="cb2-46"><a href="#cb2-46" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-47"><a href="#cb2-47" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`</span></span>
+<span id="cb2-48"><a href="#cb2-48" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_w_cities <span class="sc">|&gt;</span></span>
+<span id="cb2-49"><a href="#cb2-49" aria-hidden="true" tabindex="-1"></a>  <span class="fu">full_join</span>(police_locals, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb2-50"><a href="#cb2-50" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(police_force_size) <span class="sc">|&gt;</span></span>
+<span id="cb2-51"><a href="#cb2-51" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb2-52"><a href="#cb2-52" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">majority =</span> <span class="fu">if_else</span>(all <span class="sc">&gt;=</span> <span class="fl">0.5</span>, <span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb2-53"><a href="#cb2-53" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-54"><a href="#cb2-54" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df of only shootings in cities with agencies in the `agencies_census` df</span></span>
+<span id="cb2-55"><a href="#cb2-55" aria-hidden="true" tabindex="-1"></a>shootings_case <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb2-56"><a href="#cb2-56" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb2-57"><a href="#cb2-57" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>agency_ids) <span class="sc">|&gt;</span></span>
+<span id="cb2-58"><a href="#cb2-58" aria-hidden="true" tabindex="-1"></a>  <span class="fu">rename</span>(<span class="at">agency_ids =</span> id.y, <span class="at">id =</span> id.x, <span class="at">agency =</span> name.y, <span class="at">victim =</span> name.x) <span class="sc">|&gt;</span></span>
+<span id="cb2-59"><a href="#cb2-59" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>location_precision, <span class="sc">-</span>race_source)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <div class="cell">
-<div class="sourceCode cell-code" id="cb27"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb27-1"><a href="#cb27-1" aria-hidden="true" tabindex="-1"></a>shootings_by_agency <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
-<span id="cb27-2"><a href="#cb27-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency)</span>
-<span id="cb27-3"><a href="#cb27-3" aria-hidden="true" tabindex="-1"></a>shootings_by_agency</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 108 × 2
-   agency                               n
-   &lt;chr&gt;                            &lt;int&gt;
- 1 Albany Police Department             1
- 2 Albuquerque Police Department       66
- 3 Amtrak Police Department            54
- 4 Atlanta Police Department           36
- 5 Austin Police Department            40
- 6 BART Police Department              13
- 7 Baltimore City Police Department    30
- 8 Baltimore Police Department         30
- 9 Baton Rouge Police Department       15
-10 Boston Police Department            10
-# ℹ 98 more rows</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb29"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb29-1"><a href="#cb29-1" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_case,</span>
-<span id="cb29-2"><a href="#cb29-2" aria-hidden="true" tabindex="-1"></a>       <span class="at">mapping =</span> <span class="fu">aes</span>(<span class="at">x =</span> agency)) <span class="sc">+</span></span>
-<span id="cb29-3"><a href="#cb29-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_bar</span>() <span class="sc">+</span></span>
-<span id="cb29-4"><a href="#cb29-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">theme</span>(<span class="at">axis.text.x =</span> <span class="fu">element_text</span>(<span class="at">angle =</span> <span class="dv">90</span>,</span>
-<span id="cb29-5"><a href="#cb29-5" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">vjust =</span> <span class="dv">1</span>,</span>
-<span id="cb29-6"><a href="#cb29-6" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">hjust =</span> <span class="dv">1</span>,</span>
-<span id="cb29-7"><a href="#cb29-7" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">margin =</span> <span class="fu">margin</span>(<span class="at">t =</span> <span class="dv">5</span>, <span class="at">b =</span> <span class="dv">5</span>)))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb3"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb3-1"><a href="#cb3-1" aria-hidden="true" tabindex="-1"></a><span class="co">#count shootings by agency</span></span>
+<span id="cb3-2"><a href="#cb3-2" aria-hidden="true" tabindex="-1"></a>shootings_by_agency <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb3-3"><a href="#cb3-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency)</span>
+<span id="cb3-4"><a href="#cb3-4" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb3-5"><a href="#cb3-5" aria-hidden="true" tabindex="-1"></a><span class="co">#find top 25 agencies with the most shootings</span></span>
+<span id="cb3-6"><a href="#cb3-6" aria-hidden="true" tabindex="-1"></a>top_25_agencies <span class="ot">&lt;-</span> shootings_by_agency <span class="sc">|&gt;</span></span>
+<span id="cb3-7"><a href="#cb3-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">slice_max</span>(n, <span class="at">n =</span> <span class="dv">25</span>)</span>
+<span id="cb3-8"><a href="#cb3-8" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb3-9"><a href="#cb3-9" aria-hidden="true" tabindex="-1"></a><span class="co"># visualize top 25 agencies with the most shootings</span></span>
+<span id="cb3-10"><a href="#cb3-10" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> top_25_agencies,</span>
+<span id="cb3-11"><a href="#cb3-11" aria-hidden="true" tabindex="-1"></a>       <span class="at">mapping =</span> <span class="fu">aes</span>(<span class="at">x =</span> agency, <span class="at">y =</span> n)) <span class="sc">+</span></span>
+<span id="cb3-12"><a href="#cb3-12" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_col</span>() <span class="sc">+</span></span>
+<span id="cb3-13"><a href="#cb3-13" aria-hidden="true" tabindex="-1"></a>  <span class="fu">theme</span>(<span class="at">axis.text.x =</span> <span class="fu">element_text</span>(<span class="at">angle =</span> <span class="dv">75</span>,</span>
+<span id="cb3-14"><a href="#cb3-14" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">vjust =</span> <span class="dv">1</span>,</span>
+<span id="cb3-15"><a href="#cb3-15" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">hjust =</span> <span class="dv">1</span>,</span>
+<span id="cb3-16"><a href="#cb3-16" aria-hidden="true" tabindex="-1"></a>                                    <span class="at">margin =</span> <span class="fu">margin</span>(<span class="at">t =</span> <span class="dv">5</span>, <span class="at">b =</span> <span class="dv">5</span>)))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-3-1.png" class="img-fluid" width="672"></p>
+<p><img src="data_files/figure-html/Counting Shootings-1.png" class="img-fluid" width="672"></p>
 </div>
 </div>
 <div class="cell">
-<div class="sourceCode cell-code" id="cb30"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb30-1"><a href="#cb30-1" aria-hidden="true" tabindex="-1"></a><span class="co">#mapping Locations of Police-Involved Shootings between 2015 and 2023</span></span>
-<span id="cb30-2"><a href="#cb30-2" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb30-3"><a href="#cb30-3" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(ggmap)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: package 'ggmap' was built under R version 4.3.1</code></pre>
-</div>
-<div class="cell-output cell-output-stderr">
-<pre><code>ℹ Google's Terms of Service: &lt;https://mapsplatform.google.com&gt;
-  Stadia Maps' Terms of Service: &lt;https://stadiamaps.com/terms-of-service/&gt;
-  OpenStreetMap's Tile Usage Policy: &lt;https://operations.osmfoundation.org/policies/tiles/&gt;
-ℹ Please cite ggmap if you use it! Use `citation("ggmap")` for details.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb33"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb33-1"><a href="#cb33-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(maps)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: package 'maps' was built under R version 4.3.1</code></pre>
-</div>
-<div class="cell-output cell-output-stderr">
-<pre><code>
-Attaching package: 'maps'
-
-The following object is masked from 'package:purrr':
-
-    map</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb36"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb36-1"><a href="#cb36-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(mapdata)</span>
-<span id="cb36-2"><a href="#cb36-2" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb36-3"><a href="#cb36-3" aria-hidden="true" tabindex="-1"></a>usa <span class="ot">&lt;-</span> <span class="fu">map_data</span>(<span class="st">"usa"</span>)</span>
-<span id="cb36-4"><a href="#cb36-4" aria-hidden="true" tabindex="-1"></a>states <span class="ot">&lt;-</span> <span class="fu">map_data</span>(<span class="st">"state"</span>)</span>
-<span id="cb36-5"><a href="#cb36-5" aria-hidden="true" tabindex="-1"></a></span>
-<span id="cb36-6"><a href="#cb36-6" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> states) <span class="sc">+</span> </span>
-<span id="cb36-7"><a href="#cb36-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_polygon</span>(<span class="fu">aes</span>(<span class="at">x =</span> long, <span class="at">y =</span> lat, <span class="at">fill =</span> group, <span class="at">group =</span> group), <span class="at">color =</span> <span class="st">"white"</span>) <span class="sc">+</span> </span>
-<span id="cb36-8"><a href="#cb36-8" aria-hidden="true" tabindex="-1"></a>  <span class="fu">coord_fixed</span>(<span class="fl">1.3</span>) <span class="sc">+</span></span>
-<span id="cb36-9"><a href="#cb36-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">guides</span>(<span class="at">fill=</span><span class="cn">FALSE</span>) <span class="sc">+</span>  <span class="co"># do this to leave off the color legend</span></span>
-<span id="cb36-10"><a href="#cb36-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>(<span class="at">data =</span> shootings_case, <span class="fu">aes</span>(<span class="at">x =</span> longitude, <span class="at">y =</span> latitude), <span class="at">color =</span> <span class="st">"black"</span>, <span class="at">size =</span> .<span class="dv">2</span>) <span class="sc">+</span></span>
-<span id="cb36-11"><a href="#cb36-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>(<span class="at">data =</span> shootings_case, <span class="fu">aes</span>(<span class="at">x =</span> longitude, <span class="at">y =</span> latitude), <span class="at">color =</span> <span class="st">"red"</span>, <span class="at">size =</span> .<span class="dv">1</span>) <span class="sc">+</span></span>
-<span id="cb36-12"><a href="#cb36-12" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Locations of Police-Involved Shootings between 2015 and 2023"</span>,</span>
-<span id="cb36-13"><a href="#cb36-13" aria-hidden="true" tabindex="-1"></a>       <span class="at">captions =</span> <span class="st">"This is only includes cities where we have agency census data."</span>,</span>
-<span id="cb36-14"><a href="#cb36-14" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Longitude"</span>,</span>
-<span id="cb36-15"><a href="#cb36-15" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Latitude"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: The `&lt;scale&gt;` argument of `guides()` cannot be `FALSE`. Use "none" instead as
-of ggplot2 3.3.4.</code></pre>
-</div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning: Removed 309 rows containing missing values (`geom_point()`).
-Removed 309 rows containing missing values (`geom_point()`).</code></pre>
-</div>
-<div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-4-1.png" class="img-fluid" width="672"></p>
-</div>
+<div class="sourceCode cell-code" id="cb4"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb4-1"><a href="#cb4-1" aria-hidden="true" tabindex="-1"></a><span class="co">#mapping Locations of Police-Involved Shootings between 2015 and 2023</span></span>
+<span id="cb4-2"><a href="#cb4-2" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb4-3"><a href="#cb4-3" aria-hidden="true" tabindex="-1"></a><span class="co">#load geo-viz libraries</span></span>
+<span id="cb4-4"><a href="#cb4-4" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(ggmap)</span>
+<span id="cb4-5"><a href="#cb4-5" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(maps)</span>
+<span id="cb4-6"><a href="#cb4-6" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(mapdata)</span>
+<span id="cb4-7"><a href="#cb4-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb4-8"><a href="#cb4-8" aria-hidden="true" tabindex="-1"></a><span class="co">#create blank map</span></span>
+<span id="cb4-9"><a href="#cb4-9" aria-hidden="true" tabindex="-1"></a>usa <span class="ot">&lt;-</span> <span class="fu">map_data</span>(<span class="st">"usa"</span>)</span>
+<span id="cb4-10"><a href="#cb4-10" aria-hidden="true" tabindex="-1"></a>states <span class="ot">&lt;-</span> <span class="fu">map_data</span>(<span class="st">"state"</span>)</span>
+<span id="cb4-11"><a href="#cb4-11" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb4-12"><a href="#cb4-12" aria-hidden="true" tabindex="-1"></a><span class="co">#add locations of shootings to maps</span></span>
+<span id="cb4-13"><a href="#cb4-13" aria-hidden="true" tabindex="-1"></a>shot_map <span class="ot">&lt;-</span> <span class="fu">ggplot</span>(<span class="at">data =</span> states) <span class="sc">+</span> </span>
+<span id="cb4-14"><a href="#cb4-14" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_polygon</span>(<span class="fu">aes</span>(<span class="at">x =</span> long, <span class="at">y =</span> lat, <span class="at">fill =</span> group, <span class="at">group =</span> group), <span class="at">color =</span> <span class="st">"white"</span>) <span class="sc">+</span> </span>
+<span id="cb4-15"><a href="#cb4-15" aria-hidden="true" tabindex="-1"></a>  <span class="fu">coord_fixed</span>(<span class="fl">1.3</span>) <span class="sc">+</span></span>
+<span id="cb4-16"><a href="#cb4-16" aria-hidden="true" tabindex="-1"></a>  <span class="fu">guides</span>(<span class="at">fill =</span> <span class="st">"none"</span>) <span class="sc">+</span>  <span class="co"># leave off the fill legend</span></span>
+<span id="cb4-17"><a href="#cb4-17" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>(<span class="at">data =</span> shootings_case, <span class="fu">aes</span>(<span class="at">x =</span> longitude, <span class="at">y =</span> latitude), <span class="at">color =</span> <span class="st">"black"</span>, <span class="at">size =</span> .<span class="dv">2</span>) <span class="sc">+</span></span>
+<span id="cb4-18"><a href="#cb4-18" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>(<span class="at">data =</span> shootings_case, <span class="fu">aes</span>(<span class="at">x =</span> longitude, <span class="at">y =</span> latitude), <span class="at">color =</span> <span class="st">"red"</span>, <span class="at">size =</span> .<span class="dv">1</span>) <span class="sc">+</span></span>
+<span id="cb4-19"><a href="#cb4-19" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Locations of Police-Involved Shootings between 2015 and 2023"</span>,</span>
+<span id="cb4-20"><a href="#cb4-20" aria-hidden="true" tabindex="-1"></a>       <span class="at">caption =</span> <span class="st">"This only includes cities where we have agency census data."</span>,</span>
+<span id="cb4-21"><a href="#cb4-21" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Longitude"</span>,</span>
+<span id="cb4-22"><a href="#cb4-22" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Latitude"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <div class="cell">
-<div class="sourceCode cell-code" id="cb39"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb39-1"><a href="#cb39-1" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
-<span id="cb39-2"><a href="#cb39-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shootings_by_agency, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"name"</span> <span class="ot">=</span> <span class="st">"agency"</span>))</span>
-<span id="cb39-3"><a href="#cb39-3" aria-hidden="true" tabindex="-1"></a>agencies_census</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 109 × 13
-   name         id state city  police_force_size    all  white `non-white` black
-   &lt;chr&gt;     &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             &lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;       &lt;dbl&gt; &lt;chr&gt;
- 1 Albany P…  2237 NY    Alba…               890 0.185  0.160        0.364 **   
- 2 Albuquer…   508 NM    Albu…              1340 0.616  0.630        0.602 **   
- 3 Amtrak P…  1657 IL    Chic…             12120 0.875  0.872        0.877 0.89…
- 4 Atlanta …   447 GA    Atla…              2950 0.137  0.186        0.111 0.10…
- 5 Austin P…   141 TX    Aust…              1985 0.295  0.195        0.427 0.25 
- 6 Baltimor…  4784 MD    Balt…              2800 0.257  0.133        0.362 0.39…
- 7 Baltimor…   149 MD    Balt…              2800 0.257  0.133        0.362 0.39…
- 8 BART Pol…  2015 CA    Oakl…              1530 0.0948 0.0267       0.160 0.06…
- 9 Baton Ro…  1098 LA    Bato…               980 0.214  0.144        0.321 0.34…
-10 Boston P…     3 MA    Bost…              2560 0.477  0.442        0.583 0.68…
-# ℹ 99 more rows
-# ℹ 4 more variables: hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;, n &lt;int&gt;</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb41"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb41-1"><a href="#cb41-1" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="sc">|&gt;</span></span>
-<span id="cb41-2"><a href="#cb41-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">ggplot</span>(<span class="at">mapping =</span> <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n, <span class="at">fill=</span>)) <span class="sc">+</span></span>
-<span id="cb41-3"><a href="#cb41-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>() <span class="sc">+</span></span>
-<span id="cb41-4"><a href="#cb41-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">FALSE</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb5"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb5-1"><a href="#cb5-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with total shootings per agency and census data</span></span>
+<span id="cb5-2"><a href="#cb5-2" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb5-3"><a href="#cb5-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shootings_by_agency, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"name"</span> <span class="ot">=</span> <span class="st">"agency"</span>))</span>
+<span id="cb5-4"><a href="#cb5-4" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb5-5"><a href="#cb5-5" aria-hidden="true" tabindex="-1"></a><span class="co">#prelim visualization of relationship between percentage of officer residency and number of fatal shootings per agency</span></span>
+<span id="cb5-6"><a href="#cb5-6" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb5-7"><a href="#cb5-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">ggplot</span>(<span class="at">mapping =</span> <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n)) <span class="sc">+</span></span>
+<span id="cb5-8"><a href="#cb5-8" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_point</span>() <span class="sc">+</span></span>
+<span id="cb5-9"><a href="#cb5-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">se =</span> <span class="cn">TRUE</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-5-1.png" class="img-fluid" width="672"></p>
-</div>
-<div class="sourceCode cell-code" id="cb42"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb42-1"><a href="#cb42-1" aria-hidden="true" tabindex="-1"></a>shootings_case <span class="sc">|&gt;</span></span>
-<span id="cb42-2"><a href="#cb42-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">ggplot</span>(<span class="fu">aes</span>(<span class="at">x =</span> majority, <span class="at">fill =</span> armed)) <span class="sc">+</span></span>
-<span id="cb42-3"><a href="#cb42-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_bar</span>() <span class="sc">+</span> </span>
-<span id="cb42-4"><a href="#cb42-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Shootings in Cities where a Majority of Officers Reside"</span>,</span>
-<span id="cb42-5"><a href="#cb42-5" aria-hidden="true" tabindex="-1"></a>       <span class="at">captions =</span> <span class="st">"This is only includes shootings where we have agency census data."</span>,</span>
-<span id="cb42-6"><a href="#cb42-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Does a majority a of the total police force live in the city?"</span>,</span>
-<span id="cb42-7"><a href="#cb42-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings"</span>,</span>
-<span id="cb42-8"><a href="#cb42-8" aria-hidden="true" tabindex="-1"></a>       <span class="at">fill =</span> <span class="st">"Victim Armed?"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<p><img src="data_files/figure-html/unnamed-chunk-1-1.png" class="img-fluid" width="672"></p>
+</div>
+<div class="sourceCode cell-code" id="cb6"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb6-1"><a href="#cb6-1" aria-hidden="true" tabindex="-1"></a><span class="co">#compare shootings in cities where a majority vs. a minority of officers reside</span></span>
+<span id="cb6-2"><a href="#cb6-2" aria-hidden="true" tabindex="-1"></a>p0 <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb6-3"><a href="#cb6-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">ggplot</span>(<span class="fu">aes</span>(<span class="at">x =</span> majority, <span class="at">fill =</span> armed)) <span class="sc">+</span></span>
+<span id="cb6-4"><a href="#cb6-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_bar</span>() <span class="sc">+</span> </span>
+<span id="cb6-5"><a href="#cb6-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Shootings in Cities where a Majority of Officers Reside"</span>,</span>
+<span id="cb6-6"><a href="#cb6-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">caption =</span> <span class="st">"This only includes shootings where we have agency census data."</span>,</span>
+<span id="cb6-7"><a href="#cb6-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Does a majority of the total police force live in the city?"</span>,</span>
+<span id="cb6-8"><a href="#cb6-8" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings"</span>,</span>
+<span id="cb6-9"><a href="#cb6-9" aria-hidden="true" tabindex="-1"></a>       <span class="at">fill =</span> <span class="st">"Victim Armed?"</span>)</span>
+<span id="cb6-10"><a href="#cb6-10" aria-hidden="true" tabindex="-1"></a>p0</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-5-2.png" class="img-fluid" width="672"></p>
-</div>
-<div class="sourceCode cell-code" id="cb43"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb43-1"><a href="#cb43-1" aria-hidden="true" tabindex="-1"></a>majority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
-<span id="cb43-2"><a href="#cb43-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
-<span id="cb43-3"><a href="#cb43-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
-<span id="cb43-4"><a href="#cb43-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">maj_mean =</span> <span class="fu">mean</span>(n))</span>
-<span id="cb43-5"><a href="#cb43-5" aria-hidden="true" tabindex="-1"></a>majority_mean</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 1 × 1
-  maj_mean
-     &lt;dbl&gt;
-1     32.4</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb45"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb45-1"><a href="#cb45-1" aria-hidden="true" tabindex="-1"></a>minority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
-<span id="cb45-2"><a href="#cb45-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">FALSE</span>) <span class="sc">|&gt;</span></span>
-<span id="cb45-3"><a href="#cb45-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
-<span id="cb45-4"><a href="#cb45-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">min_mean =</span> <span class="fu">mean</span>(n))</span>
-<span id="cb45-5"><a href="#cb45-5" aria-hidden="true" tabindex="-1"></a>minority_mean</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 1 × 1
-  min_mean
-     &lt;dbl&gt;
-1       35</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb47"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb47-1"><a href="#cb47-1" aria-hidden="true" tabindex="-1"></a>diff_in_means <span class="ot">&lt;-</span> majority_mean <span class="sc">-</span> minority_mean</span>
-<span id="cb47-2"><a href="#cb47-2" aria-hidden="true" tabindex="-1"></a>diff_in_means</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<p><img src="data_files/figure-html/unnamed-chunk-1-2.png" class="img-fluid" width="672"></p>
+</div>
+<div class="sourceCode cell-code" id="cb7"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb7-1"><a href="#cb7-1" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate mean number of shootings per agency in cities where a majority of officers reside in the city</span></span>
+<span id="cb7-2"><a href="#cb7-2" aria-hidden="true" tabindex="-1"></a>majority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb7-3"><a href="#cb7-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb7-4"><a href="#cb7-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb7-5"><a href="#cb7-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">maj_mean =</span> <span class="fu">mean</span>(n))</span>
+<span id="cb7-6"><a href="#cb7-6" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb7-7"><a href="#cb7-7" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate mean number of shootings per agency in cities where a minority of officers reside in the city</span></span>
+<span id="cb7-8"><a href="#cb7-8" aria-hidden="true" tabindex="-1"></a>minority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb7-9"><a href="#cb7-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">FALSE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb7-10"><a href="#cb7-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb7-11"><a href="#cb7-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">min_mean =</span> <span class="fu">mean</span>(n))</span>
+<span id="cb7-12"><a href="#cb7-12" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb7-13"><a href="#cb7-13" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate a difference in means between the `majority` and `minority`</span></span>
+<span id="cb7-14"><a href="#cb7-14" aria-hidden="true" tabindex="-1"></a>diff_in_means <span class="ot">&lt;-</span> majority_mean <span class="sc">-</span> minority_mean</span>
+<span id="cb7-15"><a href="#cb7-15" aria-hidden="true" tabindex="-1"></a>diff_in_means</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output cell-output-stdout">
 <pre><code>  maj_mean
 1   -2.575</code></pre>
 </div>
-<div class="sourceCode cell-code" id="cb49"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb49-1"><a href="#cb49-1" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(diff_in_means))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb9"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb9-1"><a href="#cb9-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy table</span></span>
+<span id="cb9-2"><a href="#cb9-2" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(diff_in_means))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
 <table class="table table-sm table-striped small">
 <thead>
@@ -540,8 +342,9 @@ <h1 class="title">Data</h1>
 </div>
 </div>
 <div class="cell">
-<div class="sourceCode cell-code" id="cb50"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb50-1"><a href="#cb50-1" aria-hidden="true" tabindex="-1"></a>fit <span class="ot">&lt;-</span> <span class="fu">lm</span>(n <span class="sc">~</span> all, <span class="at">data =</span> agencies_census)</span>
-<span id="cb50-2"><a href="#cb50-2" aria-hidden="true" tabindex="-1"></a>fit</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb10"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb10-1"><a href="#cb10-1" aria-hidden="true" tabindex="-1"></a><span class="co">#fit simple linear regression of number of fatal shootings per agency on percentage of officer residency</span></span>
+<span id="cb10-2"><a href="#cb10-2" aria-hidden="true" tabindex="-1"></a>fit <span class="ot">&lt;-</span> <span class="fu">lm</span>(n <span class="sc">~</span> all, <span class="at">data =</span> agencies_census)</span>
+<span id="cb10-3"><a href="#cb10-3" aria-hidden="true" tabindex="-1"></a>fit</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output cell-output-stdout">
 <pre><code>
 Call:
@@ -551,8 +354,9 @@ <h1 class="title">Data</h1>
 (Intercept)          all  
     35.7782      -0.5874  </code></pre>
 </div>
-<div class="sourceCode cell-code" id="cb52"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb52-1"><a href="#cb52-1" aria-hidden="true" tabindex="-1"></a>p1 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit)</span>
-<span id="cb52-2"><a href="#cb52-2" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p1))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb12"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb12-1"><a href="#cb12-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy `fit`</span></span>
+<span id="cb12-2"><a href="#cb12-2" aria-hidden="true" tabindex="-1"></a>p1 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit)</span>
+<span id="cb12-3"><a href="#cb12-3" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p1))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
 <table class="table table-sm table-striped small">
 <colgroup>
@@ -597,46 +401,20 @@ <h1 class="title">Data</h1>
 </tbody>
 </table>
 </div>
-<div class="sourceCode cell-code" id="cb53"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb53-1"><a href="#cb53-1" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_case <span class="sc">%&gt;%</span></span>
-<span id="cb53-2"><a href="#cb53-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">group_by</span>(agency) <span class="sc">%&gt;%</span></span>
-<span id="cb53-3"><a href="#cb53-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(armed) <span class="sc">%&gt;%</span></span>
-<span id="cb53-4"><a href="#cb53-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(n, armed) <span class="sc">%&gt;%</span></span>
-<span id="cb53-5"><a href="#cb53-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"agency"</span> <span class="ot">=</span> <span class="st">"name"</span>)) <span class="sc">|&gt;</span></span>
-<span id="cb53-6"><a href="#cb53-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(armed, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning in right_join(., agencies_census, by = c(agency = "name")): Detected an unexpected many-to-many relationship between `x` and `y`.
-ℹ Row 63 of `x` matches multiple rows in `y`.
-ℹ Row 2 of `y` matches multiple rows in `x`.
-ℹ If a many-to-many relationship is expected, set `relationship =
-  "many-to-many"` to silence this warning.</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb55"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb55-1"><a href="#cb55-1" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stdout">
-<pre><code># A tibble: 202 × 15
-# Groups:   agency [108]
-   agency          armed   n.x    id state city  police_force_size    all  white
-   &lt;chr&gt;           &lt;chr&gt; &lt;int&gt; &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             &lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;
- 1 Albany Police … YES       1  2237 NY    Alba…               890 0.185  0.160 
- 2 Albuquerque Po… NO       10   508 NM    Albu…              1340 0.616  0.630 
- 3 Albuquerque Po… YES      54   508 NM    Albu…              1340 0.616  0.630 
- 4 Amtrak Police … NO        8  1657 IL    Chic…             12120 0.875  0.872 
- 5 Amtrak Police … YES      46  1657 IL    Chic…             12120 0.875  0.872 
- 6 Atlanta Police… NO        5   447 GA    Atla…              2950 0.137  0.186 
- 7 Atlanta Police… YES      31   447 GA    Atla…              2950 0.137  0.186 
- 8 Austin Police … NO        3   141 TX    Aust…              1985 0.295  0.195 
- 9 Austin Police … YES      37   141 TX    Aust…              1985 0.295  0.195 
-10 BART Police De… YES      12  2015 CA    Oakl…              1530 0.0948 0.0267
-# ℹ 192 more rows
-# ℹ 6 more variables: `non-white` &lt;dbl&gt;, black &lt;chr&gt;, hispanic &lt;chr&gt;,
-#   asian &lt;chr&gt;, majority &lt;chr&gt;, n.y &lt;int&gt;</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb57"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb57-1"><a href="#cb57-1" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_by_agency_census <span class="sc">|&gt;</span></span>
-<span id="cb57-2"><a href="#cb57-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(n.x, armed, all) </span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Adding missing grouping variables: `agency`</code></pre>
-</div>
-<div class="sourceCode cell-code" id="cb59"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb59-1"><a href="#cb59-1" aria-hidden="true" tabindex="-1"></a>fit_multi <span class="ot">&lt;-</span> <span class="fu">lm</span>(n.x <span class="sc">~</span> all <span class="sc">+</span> armed, <span class="at">data =</span> shootings_by_agency_census)</span>
-<span id="cb59-2"><a href="#cb59-2" aria-hidden="true" tabindex="-1"></a>fit_multi</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb13"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb13-1"><a href="#cb13-1" aria-hidden="true" tabindex="-1"></a><span class="co">#count shootings per agency by `armed` status and join the agency census data</span></span>
+<span id="cb13-2"><a href="#cb13-2" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb13-3"><a href="#cb13-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">group_by</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb13-4"><a href="#cb13-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(armed) <span class="sc">|&gt;</span></span>
+<span id="cb13-5"><a href="#cb13-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(n, armed) <span class="sc">|&gt;</span></span>
+<span id="cb13-6"><a href="#cb13-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"agency"</span> <span class="ot">=</span> <span class="st">"name"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb13-7"><a href="#cb13-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(armed, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span>
+<span id="cb13-8"><a href="#cb13-8" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb13-9"><a href="#cb13-9" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_by_agency_census <span class="sc">|&gt;</span></span>
+<span id="cb13-10"><a href="#cb13-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(n.x, armed, all) </span>
+<span id="cb13-11"><a href="#cb13-11" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb13-12"><a href="#cb13-12" aria-hidden="true" tabindex="-1"></a><span class="co">#fit multiple linear regression of number of fatal shootings per agency on percentage of officer residency and whether the victim was armed</span></span>
+<span id="cb13-13"><a href="#cb13-13" aria-hidden="true" tabindex="-1"></a>fit_multi <span class="ot">&lt;-</span> <span class="fu">lm</span>(n.x <span class="sc">~</span> all <span class="sc">+</span> armed, <span class="at">data =</span> shootings_by_agency_census)</span>
+<span id="cb13-14"><a href="#cb13-14" aria-hidden="true" tabindex="-1"></a>fit_multi</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output cell-output-stdout">
 <pre><code>
 Call:
@@ -646,8 +424,9 @@ <h1 class="title">Data</h1>
 (Intercept)          all     armedYES  
       4.117        1.211       24.921  </code></pre>
 </div>
-<div class="sourceCode cell-code" id="cb61"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb61-1"><a href="#cb61-1" aria-hidden="true" tabindex="-1"></a>p2 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit_multi)</span>
-<span id="cb61-2"><a href="#cb61-2" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p2))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="sourceCode cell-code" id="cb15"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb15-1"><a href="#cb15-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy `fit_multi`</span></span>
+<span id="cb15-2"><a href="#cb15-2" aria-hidden="true" tabindex="-1"></a>p2 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit_multi)</span>
+<span id="cb15-3"><a href="#cb15-3" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p2))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
 <table class="table table-sm table-striped small">
 <colgroup>
@@ -701,33 +480,125 @@ <h1 class="title">Data</h1>
 </tbody>
 </table>
 </div>
-<div class="sourceCode cell-code" id="cb62"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb62-1"><a href="#cb62-1" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x)) <span class="sc">+</span></span>
-<span id="cb62-2"><a href="#cb62-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">jitter =</span> <span class="dv">15</span>, <span class="at">alpha =</span> <span class="fl">0.5</span>) <span class="sc">+</span></span>
-<span id="cb62-3"><a href="#cb62-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">FALSE</span>) <span class="sc">+</span></span>
-<span id="cb62-4"><a href="#cb62-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
-<span id="cb62-5"><a href="#cb62-5" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
-<span id="cb62-6"><a href="#cb62-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning in geom_jitter(jitter = 15, alpha = 0.5): Ignoring unknown parameters:
-`jitter`</code></pre>
-</div>
+<div class="sourceCode cell-code" id="cb16"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb16-1"><a href="#cb16-1" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize polynomial relationship between percentage of officer residency and number of fatal shootings per agency</span></span>
+<span id="cb16-2"><a href="#cb16-2" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x)) <span class="sc">+</span></span>
+<span id="cb16-3"><a href="#cb16-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">width =</span> <span class="fl">0.10</span>, <span class="at">height =</span> <span class="dv">0</span>, <span class="at">alpha =</span> <span class="fl">0.45</span>) <span class="sc">+</span></span>
+<span id="cb16-4"><a href="#cb16-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">TRUE</span>) <span class="sc">+</span></span>
+<span id="cb16-5"><a href="#cb16-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
+<span id="cb16-6"><a href="#cb16-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
+<span id="cb16-7"><a href="#cb16-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="cell-output-display">
+<p><img src="data_files/figure-html/unnamed-chunk-2-1.png" class="img-fluid" width="672"></p>
+</div>
+<div class="sourceCode cell-code" id="cb17"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb17-1"><a href="#cb17-1" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize polynomial relationship between percentage of officer residency and victim armament and number of fatal shootings per agency</span></span>
+<span id="cb17-2"><a href="#cb17-2" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x, <span class="at">color =</span> armed)) <span class="sc">+</span></span>
+<span id="cb17-3"><a href="#cb17-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">width =</span> <span class="fl">0.10</span>, <span class="at">height =</span> <span class="dv">0</span>, <span class="at">alpha =</span> <span class="fl">0.45</span>) <span class="sc">+</span></span>
+<span id="cb17-4"><a href="#cb17-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">TRUE</span>) <span class="sc">+</span></span>
+<span id="cb17-5"><a href="#cb17-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
+<span id="cb17-6"><a href="#cb17-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
+<span id="cb17-7"><a href="#cb17-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>,</span>
+<span id="cb17-8"><a href="#cb17-8" aria-hidden="true" tabindex="-1"></a>       <span class="at">color =</span> <span class="st">"Victim Armed?"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-6-1.png" class="img-fluid" width="672"></p>
+<p><img src="data_files/figure-html/unnamed-chunk-2-2.png" class="img-fluid" width="672"></p>
 </div>
-<div class="sourceCode cell-code" id="cb64"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb64-1"><a href="#cb64-1" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x, <span class="at">color =</span> armed)) <span class="sc">+</span></span>
-<span id="cb64-2"><a href="#cb64-2" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">jitter =</span> <span class="dv">15</span>, <span class="at">alpha =</span> <span class="fl">0.5</span>) <span class="sc">+</span></span>
-<span id="cb64-3"><a href="#cb64-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">FALSE</span>) <span class="sc">+</span></span>
-<span id="cb64-4"><a href="#cb64-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
-<span id="cb64-5"><a href="#cb64-5" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
-<span id="cb64-6"><a href="#cb64-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
-<div class="cell-output cell-output-stderr">
-<pre><code>Warning in geom_jitter(jitter = 15, alpha = 0.5): Ignoring unknown parameters:
-`jitter`</code></pre>
 </div>
+<p>The model equation for <code>fit</code> is:</p>
+<p><span class="math display">\[\widehat{\text{fatal shootings}} = 35.7782 - 0.5874 \times \text{all}\]</span></p>
+<p>Interpretation:</p>
+<ul>
+<li>The intercept, <span class="math inline">\(35.7782\)</span>, is the estimated number of fatal shootings when the percentage of officer in-city residency (<code>all</code>) is <span class="math inline">\(0\)</span>. The slope, <span class="math inline">\(-0.5874\)</span>, means that for each one-unit increase in the percentage of officer residency, the estimated number of fatal shootings decreases by <span class="math inline">\(0.5874\)</span>.</li>
+</ul>
+<p>This model suggests that there is a negative association between the percentage of officer residency and the number of fatal shootings. However, the result should be interpreted in the context of the data and with potential confounding factors in mind, such as whether or not the victim was armed.</p>
+<p>The model equation for <code>fit_multi</code> considering victim armament (<code>armed</code>) is:</p>
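+<p><span class="math display">\[\widehat{\text{fatal shootings}} = 4.117 + 1.211 \times \text{all} + 24.921 \times \text{armedYES}\]</span></p>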
+<ul>
+<li><p>The intercept, <span class="math inline">\(4.117\)</span>, is the estimated number of fatal shootings when the percentage of officer in-city residency (<code>all</code>) is <span class="math inline">\(0\)</span> and the victim was unarmed. For each one-unit increase in the percentage of officers who live in the city (<code>all</code>), we expect an increase of <span class="math inline">\(1.211\)</span> fatal shootings, assuming the victim’s armament status (<code>armed</code>) remains constant.</p></li>
+<li><p>The coefficient for <code>armedYES</code>, <span class="math inline">\(24.921\)</span>, indicates that when the victim is armed (<code>armed</code> is <code>YES</code>), we expect <span class="math inline">\(24.921\)</span> more fatal shootings than when the victim is not armed (<code>armed</code> is <code>No</code>), assuming the percentage of officer residency (<code>all</code>) remains constant.</p></li>
+</ul>
+<p>In summary, the model suggests that the percentage of officer residency remains associated with the number of fatal shootings per agency even after controlling for victim armament, and that victim armament is itself associated with that number. However, correlation does not imply causation, and other factors not included in the model may influence the outcomes.</p>
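+<p>As a worked check on the "holding constant" reading, here is a sketch that uses only the coefficients printed in the model output: for two cases with the same value of <code>all</code>, one with an armed victim and one without, the fitted values differ by exactly the <code>armedYES</code> coefficient, whatever the shared value of <code>all</code> is. The symbol <span class="math inline">\(a\)</span> is a hypothetical placeholder for that shared value and is not a name from the analysis.</p>
+<p><span class="math display">\[(4.117 + 1.211a + 24.921) - (4.117 + 1.211a) = 24.921\]</span></p>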
+<div class="cell">
+<div class="sourceCode cell-code" id="cb18"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb18-1"><a href="#cb18-1" aria-hidden="true" tabindex="-1"></a><span class="co">#generate null distribution</span></span>
+<span id="cb18-2"><a href="#cb18-2" aria-hidden="true" tabindex="-1"></a>null_dist <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb18-3"><a href="#cb18-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> majority) <span class="sc">|&gt;</span></span>
+<span id="cb18-4"><a href="#cb18-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">hypothesize</span>(<span class="at">null =</span> <span class="st">"independence"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb18-5"><a href="#cb18-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">generate</span>(<span class="at">reps =</span> <span class="dv">1000</span>, <span class="at">type =</span> <span class="st">"permute"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb18-6"><a href="#cb18-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"diff in means"</span>, <span class="at">order =</span> <span class="fu">c</span>(<span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb18-7"><a href="#cb18-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb18-8"><a href="#cb18-8" aria-hidden="true" tabindex="-1"></a><span class="co">#compute observed test statistic</span></span>
+<span id="cb18-9"><a href="#cb18-9" aria-hidden="true" tabindex="-1"></a>test_stat <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb18-10"><a href="#cb18-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> majority) <span class="sc">|&gt;</span></span>
+<span id="cb18-11"><a href="#cb18-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"diff in means"</span>, <span class="at">order =</span> <span class="fu">c</span>(<span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb18-12"><a href="#cb18-12" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb18-13"><a href="#cb18-13" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize p-value</span></span>
+<span id="cb18-14"><a href="#cb18-14" aria-hidden="true" tabindex="-1"></a>null_dist <span class="sc">|&gt;</span></span>
+<span id="cb18-15"><a href="#cb18-15" aria-hidden="true" tabindex="-1"></a>  <span class="fu">visualize</span>() <span class="sc">+</span></span>
+<span id="cb18-16"><a href="#cb18-16" aria-hidden="true" tabindex="-1"></a>  <span class="fu">shade_p_value</span>(<span class="at">obs_stat =</span> test_stat, <span class="at">direction =</span> <span class="st">"less"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
-<p><img src="data_files/figure-html/unnamed-chunk-6-2.png" class="img-fluid" width="672"></p>
+<p><img src="data_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png" class="img-fluid" width="672"></p>
+</div>
+<div class="sourceCode cell-code" id="cb19"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb19-1"><a href="#cb19-1" aria-hidden="true" tabindex="-1"></a><span class="co">#compute p-value</span></span>
+<span id="cb19-2"><a href="#cb19-2" aria-hidden="true" tabindex="-1"></a>  null_dist <span class="sc">|&gt;</span></span>
+<span id="cb19-3"><a href="#cb19-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">get_p_value</span>(<span class="at">obs_stat =</span> test_stat, <span class="at">direction =</span> <span class="st">"less"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="cell-output cell-output-stdout">
+<pre><code># A tibble: 1 × 1
+  p_value
+    &lt;dbl&gt;
+1   0.263</code></pre>
 </div>
 </div>
+<p>Inference for a Difference in Means</p>
+<ul>
+<li><span class="math inline">\(H_0\)</span>: The mean total number of fatal shootings per agency does not differ based on whether or not a majority of the officers live in the city.</li>
+<li><span class="math inline">\(H_A\)</span>: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.</li>
+</ul>
+<ul><li><span class="math inline">\(H_0 : \mu_{maj} - \mu_{min} = 0\)</span>, or equivalently <span class="math inline">\(H_0 : \mu_{maj} = \mu_{min}\)</span></li><li><span class="math inline">\(H_A : \mu_{maj} - \mu_{min} &lt; 0\)</span>, or equivalently <span class="math inline">\(H_A : \mu_{maj} &lt; \mu_{min}\)</span></li></ul>
+<div class="cell">
+<div class="sourceCode cell-code" id="cb21"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb21-1"><a href="#cb21-1" aria-hidden="true" tabindex="-1"></a><span class="co">#generate null distribution</span></span>
+<span id="cb21-2"><a href="#cb21-2" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb21-3"><a href="#cb21-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> white) <span class="sc">|&gt;</span></span>
+<span id="cb21-4"><a href="#cb21-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">hypothesize</span>(<span class="at">null =</span> <span class="st">"independence"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb21-5"><a href="#cb21-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">generate</span>(<span class="at">reps =</span> <span class="dv">1000</span>, <span class="at">type =</span> <span class="st">"permute"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb21-6"><a href="#cb21-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"correlation"</span>)</span>
+<span id="cb21-7"><a href="#cb21-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb21-8"><a href="#cb21-8" aria-hidden="true" tabindex="-1"></a><span class="co">#compute observed test statistic</span></span>
+<span id="cb21-9"><a href="#cb21-9" aria-hidden="true" tabindex="-1"></a>test_stat_cor <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb21-10"><a href="#cb21-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> white) <span class="sc">|&gt;</span></span>
+<span id="cb21-11"><a href="#cb21-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"correlation"</span>)</span>
+<span id="cb21-12"><a href="#cb21-12" aria-hidden="true" tabindex="-1"></a>test_stat_cor</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="cell-output cell-output-stdout">
+<pre><code>Response: n (numeric)
+Explanatory: white (numeric)
+# A tibble: 1 × 1
+     stat
+    &lt;dbl&gt;
+1 -0.0470</code></pre>
+</div>
+<div class="sourceCode cell-code" id="cb23"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb23-1"><a href="#cb23-1" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize p-value</span></span>
+<span id="cb23-2"><a href="#cb23-2" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="sc">|&gt;</span></span>
+<span id="cb23-3"><a href="#cb23-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">visualize</span>() <span class="sc">+</span></span>
+<span id="cb23-4"><a href="#cb23-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">shade_p_value</span>(<span class="at">obs_stat =</span> test_stat_cor, <span class="at">direction =</span> <span class="st">"two.sided"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="cell-output-display">
+<p><img src="data_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png" class="img-fluid" width="672"></p>
+</div>
+<div class="sourceCode cell-code" id="cb24"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb24-1"><a href="#cb24-1" aria-hidden="true" tabindex="-1"></a><span class="co">#compute p-value</span></span>
+<span id="cb24-2"><a href="#cb24-2" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="sc">|&gt;</span></span>
+<span id="cb24-3"><a href="#cb24-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">get_p_value</span>(<span class="at">obs_stat =</span> test_stat_cor, <span class="at">direction =</span> <span class="st">"two.sided"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<div class="cell-output cell-output-stdout">
+<pre><code># A tibble: 1 × 1
+  p_value
+    &lt;dbl&gt;
+1       0</code></pre>
+</div>
+</div>
+<p>Inference for a Correlation</p>
+<ul>
+<li><p><span class="math inline">\(H_0\)</span>: There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p></li>
+<li><p><span class="math inline">\(H_A\)</span>: There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p>
+<ul>
+<li><p><span class="math inline">\(H_0 : \rho = 0\)</span></p></li>
+<li><p><span class="math inline">\(H_A : \rho \neq 0\)</span></p></li>
+</ul></li>
+</ul>
 
 
 
diff --git a/_site/data/README-fatal-police-shoot.html b/_site/data/README-fatal-police-shoot.html
index cf474ad..28d1dd2 100644
--- a/_site/data/README-fatal-police-shoot.html
+++ b/_site/data/README-fatal-police-shoot.html
@@ -83,6 +83,10 @@
   <li class="nav-item">
     <a class="nav-link" href="../index.html" rel="" target="">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link" href="../codebook.html" rel="" target="">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link" href="../background.html" rel="" target="">
diff --git a/_site/data/README-police-locals.html b/_site/data/README-police-locals.html
index 6649f00..81b735c 100644
--- a/_site/data/README-police-locals.html
+++ b/_site/data/README-police-locals.html
@@ -83,6 +83,10 @@
   <li class="nav-item">
     <a class="nav-link" href="../index.html" rel="" target="">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link" href="../codebook.html" rel="" target="">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link" href="../background.html" rel="" target="">
diff --git a/_site/data_files/figure-html/Counting Shootings-1.png b/_site/data_files/figure-html/Counting Shootings-1.png
new file mode 100644
index 0000000..6f6b933
Binary files /dev/null and b/_site/data_files/figure-html/Counting Shootings-1.png differ
diff --git a/_site/data_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png b/_site/data_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png
new file mode 100644
index 0000000..f627cb1
Binary files /dev/null and b/_site/data_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png differ
diff --git a/_site/data_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png b/_site/data_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png
new file mode 100644
index 0000000..4d68852
Binary files /dev/null and b/_site/data_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-1-1.png b/_site/data_files/figure-html/unnamed-chunk-1-1.png
new file mode 100644
index 0000000..40ff615
Binary files /dev/null and b/_site/data_files/figure-html/unnamed-chunk-1-1.png differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-5-2.png b/_site/data_files/figure-html/unnamed-chunk-1-2.png
similarity index 100%
rename from _site/data_files/figure-html/unnamed-chunk-5-2.png
rename to _site/data_files/figure-html/unnamed-chunk-1-2.png
diff --git a/_site/data_files/figure-html/unnamed-chunk-2-1.png b/_site/data_files/figure-html/unnamed-chunk-2-1.png
new file mode 100644
index 0000000..73915ba
Binary files /dev/null and b/_site/data_files/figure-html/unnamed-chunk-2-1.png differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-2-2.png b/_site/data_files/figure-html/unnamed-chunk-2-2.png
new file mode 100644
index 0000000..6abf3ab
Binary files /dev/null and b/_site/data_files/figure-html/unnamed-chunk-2-2.png differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-3-1.png b/_site/data_files/figure-html/unnamed-chunk-3-1.png
deleted file mode 100644
index 8311f55..0000000
Binary files a/_site/data_files/figure-html/unnamed-chunk-3-1.png and /dev/null differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-5-1.png b/_site/data_files/figure-html/unnamed-chunk-5-1.png
deleted file mode 100644
index 44aaebb..0000000
Binary files a/_site/data_files/figure-html/unnamed-chunk-5-1.png and /dev/null differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-6-1.png b/_site/data_files/figure-html/unnamed-chunk-6-1.png
deleted file mode 100644
index d8c247d..0000000
Binary files a/_site/data_files/figure-html/unnamed-chunk-6-1.png and /dev/null differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-6-2.png b/_site/data_files/figure-html/unnamed-chunk-6-2.png
deleted file mode 100644
index f0d2aa7..0000000
Binary files a/_site/data_files/figure-html/unnamed-chunk-6-2.png and /dev/null differ
diff --git a/_site/index.html b/_site/index.html
index 7297dcc..ddeeba9 100644
--- a/_site/index.html
+++ b/_site/index.html
@@ -21,6 +21,40 @@
   margin: 0 0.8em 0.2em -1em; /* quarto-specific, see https://github.com/quarto-dev/quarto-cli/issues/4556 */ 
   vertical-align: middle;
 }
+/* CSS for syntax highlighting */
+pre > code.sourceCode { white-space: pre; position: relative; }
+pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
+pre > code.sourceCode > span:empty { height: 1.2em; }
+.sourceCode { overflow: visible; }
+code.sourceCode > span { color: inherit; text-decoration: inherit; }
+div.sourceCode { margin: 1em 0; }
+pre.sourceCode { margin: 0; }
+@media screen {
+div.sourceCode { overflow: auto; }
+}
+@media print {
+pre > code.sourceCode { white-space: pre-wrap; }
+pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
+}
+pre.numberSource code
+  { counter-reset: source-line 0; }
+pre.numberSource code > span
+  { position: relative; left: -4em; counter-increment: source-line; }
+pre.numberSource code > span > a:first-child::before
+  { content: counter(source-line);
+    position: relative; left: -1em; text-align: right; vertical-align: baseline;
+    border: none; display: inline-block;
+    -webkit-touch-callout: none; -webkit-user-select: none;
+    -khtml-user-select: none; -moz-user-select: none;
+    -ms-user-select: none; user-select: none;
+    padding: 0 4px; width: 4em;
+  }
+pre.numberSource { margin-left: 3em;  padding-left: 4px; }
+div.sourceCode
+  {   }
+@media screen {
+pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; }
+}
 </style>
 
 
@@ -60,6 +94,8 @@
   }
 }</script>
 
+  <script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script>
+  <script src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml-full.js" type="text/javascript"></script>
 
 <link rel="stylesheet" href="styles.css">
 </head>
@@ -84,6 +120,10 @@
   <li class="nav-item">
     <a class="nav-link active" href="./index.html" rel="" target="" aria-current="page">
  <span class="menu-text">Home</span></a>
+  </li>  
+  <li class="nav-item">
+    <a class="nav-link" href="./codebook.html" rel="" target="">
+ <span class="menu-text">Codebook</span></a>
   </li>  
   <li class="nav-item">
     <a class="nav-link" href="./background.html" rel="" target="">
@@ -109,11 +149,21 @@
     <h2 id="toc-title">On this page</h2>
    
   <ul>
-  <li><a href="#proposal" id="toc-proposal" class="nav-link active" data-scroll-target="#proposal">Proposal</a>
+  <li><a href="#sec-abstract" id="toc-sec-abstract" class="nav-link active" data-scroll-target="#sec-abstract">Abstract</a></li>
+  <li><a href="#hypotheses" id="toc-hypotheses" class="nav-link" data-scroll-target="#hypotheses">Hypotheses</a></li>
+  <li><a href="#methods" id="toc-methods" class="nav-link" data-scroll-target="#methods">Methods</a></li>
+  <li><a href="#results" id="toc-results" class="nav-link" data-scroll-target="#results">Results</a>
   <ul class="collapse">
-  <li><a href="#explanatory-variables" id="toc-explanatory-variables" class="nav-link" data-scroll-target="#explanatory-variables">Explanatory Variables</a></li>
+  <li><a href="#multiple-linear-regression-of-relationship-between-percentage-of-officer-residency-and-number-of-fatal-shootings-per-agency-fit" id="toc-multiple-linear-regression-of-relationship-between-percentage-of-officer-residency-and-number-of-fatal-shootings-per-agency-fit" class="nav-link" data-scroll-target="#multiple-linear-regression-of-relationship-between-percentage-of-officer-residency-and-number-of-fatal-shootings-per-agency-fit">Multiple Linear Regression of relationship between percentage of officer residency and number of fatal shootings per agency <code>fit</code></a></li>
+  <li><a href="#multiple-linear-regression-of-relationship-between-percentage-of-officer-residencyvictim-armament-and-number-of-fatal-shootings-per-agency-fit_multi" id="toc-multiple-linear-regression-of-relationship-between-percentage-of-officer-residencyvictim-armament-and-number-of-fatal-shootings-per-agency-fit_multi" class="nav-link" data-scroll-target="#multiple-linear-regression-of-relationship-between-percentage-of-officer-residencyvictim-armament-and-number-of-fatal-shootings-per-agency-fit_multi">Multiple Linear Regression of relationship between percentage of officer residency/victim armament and number of fatal shootings per agency <code>fit_multi</code></a></li>
   </ul></li>
-  <li><a href="#project-thoughts" id="toc-project-thoughts" class="nav-link" data-scroll-target="#project-thoughts">Project thoughts</a></li>
+  <li><a href="#conclusion" id="toc-conclusion" class="nav-link" data-scroll-target="#conclusion">Conclusion</a>
+  <ul class="collapse">
+  <li><a href="#general-conclusions" id="toc-general-conclusions" class="nav-link" data-scroll-target="#general-conclusions">General Conclusions</a></li>
+  <li><a href="#study-limitations" id="toc-study-limitations" class="nav-link" data-scroll-target="#study-limitations">Study Limitations</a></li>
+  <li><a href="#improvements-for-future-study" id="toc-improvements-for-future-study" class="nav-link" data-scroll-target="#improvements-for-future-study">Improvements for Future Study</a></li>
+  </ul></li>
+  <li><a href="#citations" id="toc-citations" class="nav-link" data-scroll-target="#citations">Citations</a></li>
   </ul>
 </nav>
     </div>
@@ -142,138 +192,484 @@ <h1 class="title">A Case Study on the Relationship between Police Residence and
 
 </header>
 
-<section id="proposal" class="level2">
-<h2 class="anchored" data-anchor-id="proposal">Proposal</h2>
+<blockquote class="blockquote">
+<h1 id="on-average-police-in-the-united-states-shoot-and-kill-more-than-1000-people-every-yearand-then-they-go-home-to-their-families">On average, police in the United States shoot and kill more than 1,000 people every year…and then they go home to their families</h1>
+</blockquote>
+<section id="sec-abstract" class="level2">
+<h2 class="anchored" data-anchor-id="sec-abstract">Abstract</h2>
 <p>This case study investigates the intricate relationship between police residence and fatal police shootings, employing a data science approach to uncover insights and patterns within the context of law enforcement agencies. Focused on police officers residing in the cities they serve, the study examines whether this residency factor correlates with the incidence of fatal police shootings. The data set, spanning the years 2015 to 2023, is composed of information on police agencies involved in at least one fatal shooting, and is subjected to rigorous analysis using advanced statistical methods and machine learning techniques.</p>
 <p>This study aims to discern patterns, trends, and potential biases associated with the geographical proximity of police officers to the communities they police. A comprehensive exploration of demographic, socioeconomic, and policing variables contributes to a nuanced understanding of the factors influencing fatal police shootings. Furthermore, the study seeks to identify any disparities in incident rates based on officers’ residency status, considering variables such as race, community demographics, and departmental policies.</p>
 <p>The insights derived from this case study bear substantial implications for informing public policy, refining police training protocols, and strengthening community relations. By unraveling the nuanced dynamics surrounding police residence and fatal police shootings, this case study aims to provide evidence-based recommendations to enhance transparency, accountability, and trust between law enforcement agencies and the communities they serve. In doing so, it contributes to the broader discourse on police reform, fostering a data-driven approach to address critical issues and promote safer, more resilient communities.</p>
-<section id="explanatory-variables" class="level3">
-<h3 class="anchored" data-anchor-id="explanatory-variables">Explanatory Variables</h3>
-<table class="table">
-<colgroup>
-<col style="width: 33%">
-<col style="width: 66%">
-</colgroup>
+</section>
+<section id="hypotheses" class="level2">
+<h2 class="anchored" data-anchor-id="hypotheses">Hypotheses</h2>
+<p>We will conduct two hypothesis tests to analyze:</p>
+<ol type="1">
+<li><p>The categorical difference in fatal police shootings between cities where a majority of police officers live in the city and cities where a minority do.</p>
+<ul>
+<li><p><span class="math inline">\(H_0\)</span>: The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.</p></li>
+<li><p><span class="math inline">\(H_A\)</span>: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.</p>
+<ul>
+<li><span class="math inline">\(H_0 : \mu_{maj} - \mu_{min} = 0\)</span>, or equivalently <span class="math inline">\(H_0 : \mu_{maj} = \mu_{min}\)</span></li>
+<li><span class="math inline">\(H_A : \mu_{maj} - \mu_{min} &lt; 0\)</span>, or equivalently <span class="math inline">\(H_A : \mu_{maj} &lt; \mu_{min}\)</span></li>
+</ul></li>
+</ul></li>
+<li><p>The linear relationship between the proportion of officers who live in the city they serve and the number of fatal police shootings.</p>
+<ul>
+<li><p><span class="math inline">\(H_0\)</span>: There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p></li>
+<li><p><span class="math inline">\(H_A\)</span>: There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.</p>
+<ul>
+<li><p><span class="math inline">\(H_0 : \rho = 0\)</span></p></li>
+<li><p><span class="math inline">\(H_A : \rho \neq 0\)</span></p></li>
+</ul></li>
+</ul></li>
+</ol>
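+<p>One common way to formalize these two tests is sketched below. The source does not state which test statistics are used, so the Welch two-sample <span class="math inline">\(t\)</span> statistic, the correlation <span class="math inline">\(t\)</span> statistic, and the symbols <span class="math inline">\(\bar{x}\)</span>, <span class="math inline">\(s^2\)</span>, <span class="math inline">\(N\)</span>, and <span class="math inline">\(r\)</span> are assumptions introduced here for illustration:</p>
+<p><span class="math display">\[
+t_1 = \frac{\bar{x}_{maj} - \bar{x}_{min}}{\sqrt{\dfrac{s_{maj}^2}{N_{maj}} + \dfrac{s_{min}^2}{N_{min}}}},
+\qquad
+t_2 = \frac{r\sqrt{N - 2}}{\sqrt{1 - r^2}}
+\]</span></p>
+<p>Here <span class="math inline">\(\bar{x}\)</span>, <span class="math inline">\(s^2\)</span>, and <span class="math inline">\(N\)</span> with a subscript are the sample mean, sample variance, and number of agencies in the majority and minority groups, <span class="math inline">\(r\)</span> is the sample correlation between the residency percentage and the number of fatal shootings per agency, and <span class="math inline">\(N\)</span> without a subscript is the total number of agencies. Under this sketch, the first test is one-sided (only a sufficiently negative <span class="math inline">\(t_1\)</span> counts as evidence against <span class="math inline">\(H_0\)</span>), while the second is two-sided.</p>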
+</section>
+<section id="methods" class="level2">
+<h2 class="anchored" data-anchor-id="methods">Methods</h2>
+<section id="tidying-data" class="level4">
+<h4 class="anchored" data-anchor-id="tidying-data">Tidying Data</h4>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb1"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb1-1"><a href="#cb1-1" aria-hidden="true" tabindex="-1"></a><span class="do">##Tidying Data</span></span>
+<span id="cb1-2"><a href="#cb1-2" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-3"><a href="#cb1-3" aria-hidden="true" tabindex="-1"></a><span class="co">#creating dfs from .csv files</span></span>
+<span id="cb1-4"><a href="#cb1-4" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/police-locals.csv"</span>)</span>
+<span id="cb1-5"><a href="#cb1-5" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-agencies.csv"</span>)</span>
+<span id="cb1-6"><a href="#cb1-6" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> <span class="fu">read_csv</span>(<span class="st">"data/fatal-police-shootings-data.csv"</span>)</span>
+<span id="cb1-7"><a href="#cb1-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-8"><a href="#cb1-8" aria-hidden="true" tabindex="-1"></a><span class="co">#removing the old `city_old` column, left over from when we split the concatenated city names</span></span>
+<span id="cb1-9"><a href="#cb1-9" aria-hidden="true" tabindex="-1"></a>police_locals <span class="ot">&lt;-</span> police_locals <span class="sc">|&gt;</span></span>
+<span id="cb1-10"><a href="#cb1-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>city_old)</span>
+<span id="cb1-11"><a href="#cb1-11" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-12"><a href="#cb1-12" aria-hidden="true" tabindex="-1"></a><span class="co"># creating `agencies` df with just police departments</span></span>
+<span id="cb1-13"><a href="#cb1-13" aria-hidden="true" tabindex="-1"></a>agencies <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
+<span id="cb1-14"><a href="#cb1-14" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="fu">grepl</span>(<span class="st">"department"</span>, <span class="fu">tolower</span>(name))) <span class="sc">|&gt;</span></span>
+<span id="cb1-15"><a href="#cb1-15" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(<span class="sc">!</span><span class="fu">grepl</span>(<span class="st">"county"</span>, <span class="fu">tolower</span>(name)))</span>
+<span id="cb1-16"><a href="#cb1-16" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-17"><a href="#cb1-17" aria-hidden="true" tabindex="-1"></a><span class="co">#creating binned categorical account of if shooting victim was `armed`</span></span>
+<span id="cb1-18"><a href="#cb1-18" aria-hidden="true" tabindex="-1"></a>shootings <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb1-19"><a href="#cb1-19" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">armed =</span> <span class="fu">case_when</span>(<span class="fu">is.na</span>(armed_with) <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb1-20"><a href="#cb1-20" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unarmed"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb1-21"><a href="#cb1-21" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"unknown"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb1-22"><a href="#cb1-22" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"undetermined"</span> <span class="sc">~</span> <span class="st">"NO"</span>,</span>
+<span id="cb1-23"><a href="#cb1-23" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"gun"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb1-24"><a href="#cb1-24" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"knife"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb1-25"><a href="#cb1-25" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"blunt_object"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb1-26"><a href="#cb1-26" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"other"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb1-27"><a href="#cb1-27" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"replica"</span> <span class="sc">~</span> <span class="st">"YES"</span>,</span>
+<span id="cb1-28"><a href="#cb1-28" aria-hidden="true" tabindex="-1"></a>                           armed_with <span class="sc">==</span> <span class="st">"vehicle"</span> <span class="sc">~</span> <span class="st">"YES"</span>))</span>
+<span id="cb1-29"><a href="#cb1-29" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-30"><a href="#cb1-30" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with only agency `names`, `id`, and `state`</span></span>
+<span id="cb1-31"><a href="#cb1-31" aria-hidden="true" tabindex="-1"></a>agencies_ids <span class="ot">&lt;-</span> agencies <span class="sc">|&gt;</span></span>
+<span id="cb1-32"><a href="#cb1-32" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(name, id, state)</span>
+<span id="cb1-33"><a href="#cb1-33" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-34"><a href="#cb1-34" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city`, `agency`, and `state` info for each shooting</span></span>
+<span id="cb1-35"><a href="#cb1-35" aria-hidden="true" tabindex="-1"></a>shooting_agencies <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb1-36"><a href="#cb1-36" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(city, agency_ids, state)</span>
+<span id="cb1-37"><a href="#cb1-37" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-38"><a href="#cb1-38" aria-hidden="true" tabindex="-1"></a><span class="co">#changing `agency_ids` var in `shooting_agencies` df to numeric</span></span>
+<span id="cb1-39"><a href="#cb1-39" aria-hidden="true" tabindex="-1"></a>shooting_agencies<span class="sc">$</span>agency_ids <span class="ot">&lt;-</span> <span class="fu">as.numeric</span>(shootings<span class="sc">$</span>agency_ids)</span>
+<span id="cb1-40"><a href="#cb1-40" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-41"><a href="#cb1-41" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`</span></span>
+<span id="cb1-42"><a href="#cb1-42" aria-hidden="true" tabindex="-1"></a>agencies_w_cities <span class="ot">&lt;-</span> agencies_ids <span class="sc">|&gt;</span></span>
+<span id="cb1-43"><a href="#cb1-43" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shooting_agencies, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"id"</span> <span class="ot">=</span> <span class="st">"agency_ids"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb1-44"><a href="#cb1-44" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(city) <span class="sc">|&gt;</span></span>
+<span id="cb1-45"><a href="#cb1-45" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span>
+<span id="cb1-46"><a href="#cb1-46" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-47"><a href="#cb1-47" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`</span></span>
+<span id="cb1-48"><a href="#cb1-48" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_w_cities <span class="sc">|&gt;</span></span>
+<span id="cb1-49"><a href="#cb1-49" aria-hidden="true" tabindex="-1"></a>  <span class="fu">full_join</span>(police_locals, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb1-50"><a href="#cb1-50" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(police_force_size) <span class="sc">|&gt;</span></span>
+<span id="cb1-51"><a href="#cb1-51" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(id, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb1-52"><a href="#cb1-52" aria-hidden="true" tabindex="-1"></a>  <span class="fu">mutate</span>(<span class="at">majority =</span> <span class="fu">if_else</span>(all <span class="sc">&gt;=</span> <span class="fl">0.5</span>, <span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb1-53"><a href="#cb1-53" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb1-54"><a href="#cb1-54" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df of only shootings involving agencies within `agencies` df</span></span>
+<span id="cb1-55"><a href="#cb1-55" aria-hidden="true" tabindex="-1"></a>shootings_case <span class="ot">&lt;-</span> shootings <span class="sc">|&gt;</span></span>
+<span id="cb1-56"><a href="#cb1-56" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"city"</span> <span class="ot">=</span> <span class="st">"city"</span>, <span class="st">"state"</span> <span class="ot">=</span> <span class="st">"state"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb1-57"><a href="#cb1-57" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>agency_ids) <span class="sc">|&gt;</span></span>
+<span id="cb1-58"><a href="#cb1-58" aria-hidden="true" tabindex="-1"></a>  <span class="fu">rename</span>(<span class="at">agency_ids =</span> id.y, <span class="at">id =</span> id.x, <span class="at">agency =</span> name.y, <span class="at">victim =</span> name.x) <span class="sc">|&gt;</span></span>
+<span id="cb1-59"><a href="#cb1-59" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(<span class="sc">-</span>location_precision, <span class="sc">-</span>race_source)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+</div>
+</section>
+<section id="counting-shootings" class="level4">
+<h4 class="anchored" data-anchor-id="counting-shootings">Counting Shootings</h4>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb2"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb2-1"><a href="#cb2-1" aria-hidden="true" tabindex="-1"></a><span class="co">#count shootings by agency</span></span>
+<span id="cb2-2"><a href="#cb2-2" aria-hidden="true" tabindex="-1"></a>shootings_by_agency <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb2-3"><a href="#cb2-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency)</span>
+<span id="cb2-4"><a href="#cb2-4" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb2-5"><a href="#cb2-5" aria-hidden="true" tabindex="-1"></a><span class="co">#find top 25 agencies with the most shootings</span></span>
+<span id="cb2-6"><a href="#cb2-6" aria-hidden="true" tabindex="-1"></a>top_25_agencies <span class="ot">&lt;-</span> shootings_by_agency <span class="sc">|&gt;</span></span>
+<span id="cb2-7"><a href="#cb2-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">slice_max</span>(n, <span class="at">n =</span> <span class="dv">25</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+</div>
+</section>
+<section id="mapping-locations-of-police-involved-shootings-between-2015-and-2023" class="level4">
+<h4 class="anchored" data-anchor-id="mapping-locations-of-police-involved-shootings-between-2015-and-2023">Mapping Locations of Police-Involved Shootings between 2015 and 2023</h4>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb3"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb3-1"><a href="#cb3-1" aria-hidden="true" tabindex="-1"></a>shot_map</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/unnamed-chunk-1-1.png" class="img-fluid" width="672"></p>
+</div>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb4"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb4-1"><a href="#cb4-1" aria-hidden="true" tabindex="-1"></a><span class="co">#creating df with total shootings per agency and census data</span></span>
+<span id="cb4-2"><a href="#cb4-2" aria-hidden="true" tabindex="-1"></a>agencies_census <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb4-3"><a href="#cb4-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">left_join</span>(shootings_by_agency, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"name"</span> <span class="ot">=</span> <span class="st">"agency"</span>))</span>
+<span id="cb4-4"><a href="#cb4-4" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb4-5"><a href="#cb4-5" aria-hidden="true" tabindex="-1"></a><span class="co">#creating visualization comparing shootings in cities where a majority vs. a minority of officers reside</span></span>
+<span id="cb4-6"><a href="#cb4-6" aria-hidden="true" tabindex="-1"></a>p0 <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb4-7"><a href="#cb4-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">ggplot</span>(<span class="fu">aes</span>(<span class="at">x =</span> majority, <span class="at">fill =</span> armed)) <span class="sc">+</span></span>
+<span id="cb4-8"><a href="#cb4-8" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_bar</span>() <span class="sc">+</span> </span>
+<span id="cb4-9"><a href="#cb4-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Shootings in Cities where a Majority of Officers Reside"</span>,</span>
+<span id="cb4-10"><a href="#cb4-10" aria-hidden="true" tabindex="-1"></a>       <span class="at">caption =</span> <span class="st">"This only includes shootings where we have agency census data."</span>,</span>
+<span id="cb4-11"><a href="#cb4-11" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Does a majority of the total police force live in the city?"</span>,</span>
+<span id="cb4-12"><a href="#cb4-12" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings"</span>,</span>
+<span id="cb4-13"><a href="#cb4-13" aria-hidden="true" tabindex="-1"></a>       <span class="at">fill =</span> <span class="st">"Victim Armed?"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb5"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb5-1"><a href="#cb5-1" aria-hidden="true" tabindex="-1"></a>p0</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/unnamed-chunk-2-1.png" class="img-fluid" width="672"></p>
+</div>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb6"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb6-1"><a href="#cb6-1" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate mean number of shootings per agency in cities where a majority of officers reside in the city</span></span>
+<span id="cb6-2"><a href="#cb6-2" aria-hidden="true" tabindex="-1"></a>majority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb6-3"><a href="#cb6-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">TRUE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb6-4"><a href="#cb6-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb6-5"><a href="#cb6-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">maj_mean =</span> <span class="fu">mean</span>(n))</span>
+<span id="cb6-6"><a href="#cb6-6" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb6-7"><a href="#cb6-7" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate mean number of shootings per agency in cities where a minority of officers reside in the city</span></span>
+<span id="cb6-8"><a href="#cb6-8" aria-hidden="true" tabindex="-1"></a>minority_mean <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb6-9"><a href="#cb6-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">filter</span>(majority <span class="sc">==</span> <span class="cn">FALSE</span>) <span class="sc">|&gt;</span></span>
+<span id="cb6-10"><a href="#cb6-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb6-11"><a href="#cb6-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">summarize</span>(<span class="at">min_mean =</span> <span class="fu">mean</span>(n))</span>
+<span id="cb6-12"><a href="#cb6-12" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb6-13"><a href="#cb6-13" aria-hidden="true" tabindex="-1"></a><span class="co">#calculate a difference in means between the `majority` and `minority`</span></span>
+<span id="cb6-14"><a href="#cb6-14" aria-hidden="true" tabindex="-1"></a>diff_in_means <span class="ot">&lt;-</span> majority_mean <span class="sc">-</span> minority_mean</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb7"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb7-1"><a href="#cb7-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy table</span></span>
+<span id="cb7-2"><a href="#cb7-2" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(diff_in_means))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<table class="table table-sm table-striped small">
 <thead>
 <tr class="header">
-<th>Name</th>
-<th>Description</th>
+<th style="text-align: right;">maj_mean</th>
 </tr>
 </thead>
 <tbody>
 <tr class="odd">
-<td><code>city</code></td>
-<td>U.S. city</td>
-</tr>
-<tr class="even">
-<td><code>police_force_size</code></td>
-<td>Number of police officers serving that city</td>
-</tr>
-<tr class="odd">
-<td><code>all</code></td>
-<td>Percentage of the total police force that lives in the city</td>
-</tr>
-<tr class="even">
-<td><code>white</code></td>
-<td>Percentage of white (non-Hispanic) police officers who live in the city</td>
-</tr>
-<tr class="odd">
-<td><code>non-white</code></td>
-<td>Percentage of non-white police officers who live in the city</td>
-</tr>
-<tr class="even">
-<td><code>black</code></td>
-<td>Percentage of black police officers who live in the city</td>
-</tr>
-<tr class="odd">
-<td><code>hispanic</code></td>
-<td>Percentage of Hispanic police officers who live in the city</td>
-</tr>
-<tr class="even">
-<td><code>asian</code></td>
-<td>Percentage of Asian police officers who live in the city</td>
+<td style="text-align: right;">-2.575</td>
 </tr>
 </tbody>
 </table>
-<p><strong>Incident Information</strong></p>
-<table class="table">
+</div>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb8"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb8-1"><a href="#cb8-1" aria-hidden="true" tabindex="-1"></a><span class="co">#fit simple linear regression model of number of fatal shootings per agency on percentage of officer residency</span></span>
+<span id="cb8-2"><a href="#cb8-2" aria-hidden="true" tabindex="-1"></a>fit <span class="ot">&lt;-</span> <span class="fu">lm</span>(n <span class="sc">~</span> all, <span class="at">data =</span> agencies_census)</span>
+<span id="cb8-3"><a href="#cb8-3" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb8-4"><a href="#cb8-4" aria-hidden="true" tabindex="-1"></a><span class="co">#add `armed` and `majority` to `shootings_by_agency` df</span></span>
+<span id="cb8-5"><a href="#cb8-5" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_case <span class="sc">|&gt;</span></span>
+<span id="cb8-6"><a href="#cb8-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">group_by</span>(agency) <span class="sc">|&gt;</span></span>
+<span id="cb8-7"><a href="#cb8-7" aria-hidden="true" tabindex="-1"></a>  <span class="fu">count</span>(armed) <span class="sc">|&gt;</span></span>
+<span id="cb8-8"><a href="#cb8-8" aria-hidden="true" tabindex="-1"></a>  <span class="fu">drop_na</span>(n, armed) <span class="sc">|&gt;</span></span>
+<span id="cb8-9"><a href="#cb8-9" aria-hidden="true" tabindex="-1"></a>  <span class="fu">right_join</span>(agencies_census, <span class="at">by =</span> <span class="fu">c</span>(<span class="st">"agency"</span> <span class="ot">=</span> <span class="st">"name"</span>)) <span class="sc">|&gt;</span></span>
+<span id="cb8-10"><a href="#cb8-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">distinct</span>(armed, <span class="at">.keep_all =</span> <span class="cn">TRUE</span>)</span>
+<span id="cb8-11"><a href="#cb8-11" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb8-12"><a href="#cb8-12" aria-hidden="true" tabindex="-1"></a>shootings_by_agency_census <span class="ot">&lt;-</span> shootings_by_agency_census <span class="sc">|&gt;</span></span>
+<span id="cb8-13"><a href="#cb8-13" aria-hidden="true" tabindex="-1"></a>  <span class="fu">select</span>(n.x, armed, all) </span>
+<span id="cb8-14"><a href="#cb8-14" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb8-15"><a href="#cb8-15" aria-hidden="true" tabindex="-1"></a><span class="co">#fit multiple linear regression model for correlation between percentage of officer residency and victim armament and number of fatal shootings per agency</span></span>
+<span id="cb8-16"><a href="#cb8-16" aria-hidden="true" tabindex="-1"></a>fit_multi <span class="ot">&lt;-</span> <span class="fu">lm</span>(n.x <span class="sc">~</span> all <span class="sc">+</span> armed, <span class="at">data =</span> shootings_by_agency_census)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+</div>
+</section>
+</section>
+<section id="results" class="level2">
+<h2 class="anchored" data-anchor-id="results">Results</h2>
+<section id="multiple-linear-regression-of-relationship-between-percentage-of-officer-residency-and-number-of-fatal-shootings-per-agency-fit" class="level3">
+<h3 class="anchored" data-anchor-id="multiple-linear-regression-of-relationship-between-percentage-of-officer-residency-and-number-of-fatal-shootings-per-agency-fit">Simple Linear Regression of the relationship between percentage of officer residency and number of fatal shootings per agency (<code>fit</code>)</h3>
+<p>The model equation for <code>fit</code> is:</p>
+<p><span class="math display">\[
+\text{Number of Fatal Shootings (n)} = 35.7782 - 0.5874 \times \text{Percentage of Officer Residency (all)}
+\]</span></p>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb9"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb9-1"><a href="#cb9-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy `fit`</span></span>
+<span id="cb9-2"><a href="#cb9-2" aria-hidden="true" tabindex="-1"></a>p1 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit)</span>
+<span id="cb9-3"><a href="#cb9-3" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p1))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<table class="table table-sm table-striped small">
 <colgroup>
-<col style="width: 32%">
-<col style="width: 67%">
+<col style="width: 15%">
+<col style="width: 13%">
+<col style="width: 15%">
+<col style="width: 15%">
+<col style="width: 12%">
+<col style="width: 13%">
+<col style="width: 13%">
 </colgroup>
 <thead>
 <tr class="header">
-<th>Name</th>
-<th>Description</th>
+<th style="text-align: left;">term</th>
+<th style="text-align: right;">estimate</th>
+<th style="text-align: right;">std_error</th>
+<th style="text-align: right;">statistic</th>
+<th style="text-align: right;">p_value</th>
+<th style="text-align: right;">lower_ci</th>
+<th style="text-align: right;">upper_ci</th>
 </tr>
 </thead>
 <tbody>
 <tr class="odd">
-<td><code>id</code></td>
-<td>A unique identifier for each fatal police shooting incident.</td>
-</tr>
-<tr class="even">
-<td><code>date</code></td>
-<td>The date of the fatal shooting.</td>
-</tr>
-<tr class="odd">
-<td><code>body_camera</code></td>
-<td>Whether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.</td>
-</tr>
-<tr class="even">
-<td><code>city</code></td>
-<td>The municipality where the fatal shooting took place</td>
-</tr>
-<tr class="odd">
-<td><code>county</code></td>
-<td>County where the fatal shooting took place.</td>
+<td style="text-align: left;">intercept</td>
+<td style="text-align: right;">35.778</td>
+<td style="text-align: right;">6.887</td>
+<td style="text-align: right;">5.195</td>
+<td style="text-align: right;">0.000</td>
+<td style="text-align: right;">22.126</td>
+<td style="text-align: right;">49.430</td>
 </tr>
 <tr class="even">
-<td><code>state</code></td>
-<td>The two-letter postal code abbreviation for the state in which the fatal shooting took place.</td>
-</tr>
-<tr class="odd">
-<td><code>latitude</code></td>
-<td>The latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level.</td>
-</tr>
-<tr class="even">
-<td><code>longitude</code></td>
-<td>The longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.</td>
+<td style="text-align: left;">all</td>
+<td style="text-align: right;">-0.587</td>
+<td style="text-align: right;">14.902</td>
+<td style="text-align: right;">-0.039</td>
+<td style="text-align: right;">0.969</td>
+<td style="text-align: right;">-30.130</td>
+<td style="text-align: right;">28.955</td>
 </tr>
 </tbody>
 </table>
-<p><strong>Agency Information</strong></p>
-<table class="table">
+</div>
+</div>
+<p>Interpretation:</p>
+<ul>
+<li>The intercept, <span class="math inline">\(35.7782\)</span>, is the estimated number of fatal shootings when the percentage of officer in-city residency (<code>all</code>) is <span class="math inline">\(0\)</span>. For each one-unit increase in the percentage of officer residency, the number of fatal shootings is expected to decrease by <span class="math inline">\(0.5874\)</span> (<span class="math inline">\(-0.5874\)</span>) units, assuming all other factors remain constant.</li>
+</ul>
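+<p>The point estimate suggests a negative association between the percentage of officer residency and the number of fatal shootings, but the estimate is not statistically significant: the p-value is <span class="math inline">\(0.969\)</span> and the confidence interval for the slope (<span class="math inline">\(-30.130\)</span> to <span class="math inline">\(28.955\)</span>) includes <span class="math inline">\(0\)</span>. This model also omits potential confounding factors, such as whether or not the victim was armed.</p>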
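+<p>As a rough check on the table above, the <code>statistic</code> and confidence-interval columns for <code>all</code> can be reproduced from the estimate and standard error. This is a sketch that assumes the interval has the usual form <span class="math inline">\(\text{estimate} \pm t^{*} \times \text{std\_error}\)</span>; the critical value <span class="math inline">\(t^{*}\)</span> is not reported, so it is backed out from the table rather than taken from the model:</p>
+<p><span class="math display">\[
+\begin{gathered}
+t = \frac{-0.587}{14.902} \approx -0.039 \\
+t^{*} \approx \frac{28.955 - (-0.587)}{14.902} = \frac{29.542}{14.902} \approx 1.982 \\
+-0.587 \pm 1.982 \times 14.902 \approx (-30.12,\ 28.95)
+\end{gathered}
+\]</span></p>
+<p>Up to rounding, these agree with the reported values, and the interval comfortably contains <span class="math inline">\(0\)</span>.</p>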
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb10"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb10-1"><a href="#cb10-1" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize polynomial relationship between percentage of officer residency and number of fatal shootings per agency</span></span>
+<span id="cb10-2"><a href="#cb10-2" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x)) <span class="sc">+</span></span>
+<span id="cb10-3"><a href="#cb10-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">width =</span> <span class="fl">0.10</span>, <span class="at">height =</span> <span class="dv">0</span>, <span class="at">alpha =</span> <span class="fl">0.45</span>) <span class="sc">+</span></span>
+<span id="cb10-4"><a href="#cb10-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">TRUE</span>) <span class="sc">+</span></span>
+<span id="cb10-5"><a href="#cb10-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
+<span id="cb10-6"><a href="#cb10-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
+<span id="cb10-7"><a href="#cb10-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/unnamed-chunk-7-1.png" class="img-fluid" width="672"></p>
+</div>
+</div>
+</section>
+<section id="multiple-linear-regression-of-relationship-between-percentage-of-officer-residencyvictim-armament-and-number-of-fatal-shootings-per-agency-fit_multi" class="level3">
+<h3 class="anchored" data-anchor-id="multiple-linear-regression-of-relationship-between-percentage-of-officer-residencyvictim-armament-and-number-of-fatal-shootings-per-agency-fit_multi">Multiple Linear Regression of relationship between percentage of officer residency/victim armament and number of fatal shootings per agency <code>fit_multi</code></h3>
+<p>The model equation for <code>fit_multi</code> considering victim armament (<code>armed</code>) is:</p>
+<p><span class="math display">\[
+\text{Number of Fatal Shootings (n.x)} = 4.117 + 1.211 \times \text{Percentage of Officer Residency (all)} + 24.921 \times \text{Armed (YES)}
+\]</span></p>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb11"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb11-1"><a href="#cb11-1" aria-hidden="true" tabindex="-1"></a><span class="co">#tidy `fit_multi`</span></span>
+<span id="cb11-2"><a href="#cb11-2" aria-hidden="true" tabindex="-1"></a>p2 <span class="ot">&lt;-</span> <span class="fu">get_regression_table</span>(fit_multi)</span>
+<span id="cb11-3"><a href="#cb11-3" aria-hidden="true" tabindex="-1"></a>knitr<span class="sc">::</span><span class="fu">kable</span>(<span class="fu">head</span>(p2))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<table class="table table-sm table-striped small">
+<colgroup>
+<col style="width: 16%">
+<col style="width: 13%">
+<col style="width: 15%">
+<col style="width: 15%">
+<col style="width: 12%">
+<col style="width: 13%">
+<col style="width: 13%">
+</colgroup>
 <thead>
 <tr class="header">
-<th></th>
-<th>Description</th>
+<th style="text-align: left;">term</th>
+<th style="text-align: right;">estimate</th>
+<th style="text-align: right;">std_error</th>
+<th style="text-align: right;">statistic</th>
+<th style="text-align: right;">p_value</th>
+<th style="text-align: right;">lower_ci</th>
+<th style="text-align: right;">upper_ci</th>
 </tr>
 </thead>
 <tbody>
 <tr class="odd">
-<td><code>id</code></td>
-<td>Department Database Id</td>
+<td style="text-align: left;">intercept</td>
+<td style="text-align: right;">4.117</td>
+<td style="text-align: right;">3.362</td>
+<td style="text-align: right;">1.225</td>
+<td style="text-align: right;">0.222</td>
+<td style="text-align: right;">-2.512</td>
+<td style="text-align: right;">10.747</td>
 </tr>
 <tr class="even">
-<td><code>name</code></td>
-<td>Department Name</td>
+<td style="text-align: left;">all</td>
+<td style="text-align: right;">1.211</td>
+<td style="text-align: right;">6.335</td>
+<td style="text-align: right;">0.191</td>
+<td style="text-align: right;">0.849</td>
+<td style="text-align: right;">-11.281</td>
+<td style="text-align: right;">13.702</td>
 </tr>
 <tr class="odd">
-<td><code>state</code></td>
-<td>State in which the agency is located.</td>
+<td style="text-align: left;">armed: YES</td>
+<td style="text-align: right;">24.921</td>
+<td style="text-align: right;">2.891</td>
+<td style="text-align: right;">8.619</td>
+<td style="text-align: right;">0.000</td>
+<td style="text-align: right;">19.219</td>
+<td style="text-align: right;">30.622</td>
 </tr>
 </tbody>
 </table>
-</section>
-</section>
-<section id="project-thoughts" class="level2">
-<h2 class="anchored" data-anchor-id="project-thoughts">Project thoughts</h2>
-<p>I am interested in exploring data related to…</p>
+</div>
+</div>
 <ul>
-<li>Political Extremism</li>
-<li>Black American Opinion</li>
+<li><p>The intercept, <span class="math inline">\(4.117\)</span>, is the estimated number of fatal shootings when the percentage of officer in-city residency (<code>all</code>) is <span class="math inline">\(0\)</span> and the victim was unarmed. <strong>For each one-unit increase in the percentage of in-city officer residency relative to the total force (<code>all</code>), we expect an increase of</strong> <span class="math inline">\(1.211\)</span> fatal shootings, assuming the victim’s armament status (<code>armedYES</code>) remains constant.</p></li>
+<li><p>The coefficient for <code>armedYES</code>, <span class="math inline">\(24.921\)</span>, indicates that when the victim is armed (<code>armed</code> is <code>YES</code>), <strong>we expect an increase of</strong> <span class="math inline">\(24.921\)</span> fatal shootings compared to when the victim is not armed (<code>armed</code> is <code>No</code>), assuming the percentage of officer residency (<code>all</code>) remains constant.</p></li>
 </ul>
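+<p>In summary, the model estimates a positive association between the number of fatal shootings per agency and both the percentage of officer residency and victim armament. Only the armament coefficient is statistically distinguishable from zero (reported p-value of <span class="math inline">\(0.000\)</span>); the coefficient for <code>all</code> has a p-value of <span class="math inline">\(0.849\)</span> and a confidence interval (<span class="math inline">\(-11.281\)</span> to <span class="math inline">\(13.702\)</span>) that includes <span class="math inline">\(0\)</span>. Because correlation does not imply causation, other factors not included in the model may also influence the outcomes.</p>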
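+<p>To make the fitted equation concrete, here is a worked example for a hypothetical agency with <code>all</code> <span class="math inline">\(= 0.5\)</span>. The value is chosen purely for illustration, is not taken from the data, and is expressed in whatever units <code>all</code> is recorded in:</p>
+<p><span class="math display">\[
+\begin{gathered}
+\hat{n}_{\text{armed}} = 4.117 + 1.211 \times 0.5 + 24.921 \times 1 = 29.6435 \\
+\hat{n}_{\text{unarmed}} = 4.117 + 1.211 \times 0.5 + 24.921 \times 0 = 4.7225 \\
+\hat{n}_{\text{armed}} - \hat{n}_{\text{unarmed}} = 24.921
+\end{gathered}
+\]</span></p>
+<p>The gap between the two predictions equals the <code>armedYES</code> coefficient regardless of the value chosen for <code>all</code>, which is what holding residency constant means in this additive model.</p>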
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb12"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb12-1"><a href="#cb12-1" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize polynomial relationship between percentage of officer residency and victim armament and number of fatal shootings per agency</span></span>
+<span id="cb12-2"><a href="#cb12-2" aria-hidden="true" tabindex="-1"></a><span class="fu">ggplot</span>(<span class="at">data =</span> shootings_by_agency_census, <span class="fu">aes</span>(<span class="at">x =</span> all, <span class="at">y =</span> n.x, <span class="at">color =</span> armed)) <span class="sc">+</span></span>
+<span id="cb12-3"><a href="#cb12-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_jitter</span>(<span class="at">width =</span> <span class="fl">0.10</span>, <span class="at">height =</span> <span class="dv">0</span>, <span class="at">alpha =</span> <span class="fl">0.45</span>) <span class="sc">+</span></span>
+<span id="cb12-4"><a href="#cb12-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">geom_smooth</span>(<span class="at">method =</span> <span class="st">"lm"</span>, <span class="at">formula =</span> y <span class="sc">~</span> <span class="fu">poly</span>(x, <span class="dv">2</span>), <span class="at">se =</span> <span class="cn">TRUE</span>) <span class="sc">+</span></span>
+<span id="cb12-5"><a href="#cb12-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">labs</span>(<span class="at">title =</span> <span class="st">"Number of Shootings on a Scale of Police Force Residency"</span>,</span>
+<span id="cb12-6"><a href="#cb12-6" aria-hidden="true" tabindex="-1"></a>       <span class="at">x =</span> <span class="st">"Percentage of the total police force that lives in the city"</span>,</span>
+<span id="cb12-7"><a href="#cb12-7" aria-hidden="true" tabindex="-1"></a>       <span class="at">y =</span> <span class="st">"Number of fatal shootings in that city"</span>,</span>
+<span id="cb12-8"><a href="#cb12-8" aria-hidden="true" tabindex="-1"></a>       <span class="at">color =</span> <span class="st">"Victim Armed?"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/unnamed-chunk-9-1.png" class="img-fluid" width="672"></p>
+</div>
+</div>
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb13"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb13-1"><a href="#cb13-1" aria-hidden="true" tabindex="-1"></a><span class="co">#generate null distribution</span></span>
+<span id="cb13-2"><a href="#cb13-2" aria-hidden="true" tabindex="-1"></a>null_dist <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb13-3"><a href="#cb13-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> majority) <span class="sc">|&gt;</span></span>
+<span id="cb13-4"><a href="#cb13-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">hypothesize</span>(<span class="at">null =</span> <span class="st">"independence"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb13-5"><a href="#cb13-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">generate</span>(<span class="at">reps =</span> <span class="dv">1000</span>, <span class="at">type =</span> <span class="st">"permute"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb13-6"><a href="#cb13-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"diff in means"</span>, <span class="at">order =</span> <span class="fu">c</span>(<span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb13-7"><a href="#cb13-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb13-8"><a href="#cb13-8" aria-hidden="true" tabindex="-1"></a><span class="co">#compute observed test statistic</span></span>
+<span id="cb13-9"><a href="#cb13-9" aria-hidden="true" tabindex="-1"></a>test_stat <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb13-10"><a href="#cb13-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> majority) <span class="sc">|&gt;</span></span>
+<span id="cb13-11"><a href="#cb13-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"diff in means"</span>, <span class="at">order =</span> <span class="fu">c</span>(<span class="st">"TRUE"</span>, <span class="st">"FALSE"</span>))</span>
+<span id="cb13-12"><a href="#cb13-12" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb13-13"><a href="#cb13-13" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize p-value</span></span>
+<span id="cb13-14"><a href="#cb13-14" aria-hidden="true" tabindex="-1"></a>null_dist <span class="sc">|&gt;</span></span>
+<span id="cb13-15"><a href="#cb13-15" aria-hidden="true" tabindex="-1"></a>  <span class="fu">visualize</span>() <span class="sc">+</span></span>
+<span id="cb13-16"><a href="#cb13-16" aria-hidden="true" tabindex="-1"></a>  <span class="fu">shade_p_value</span>(<span class="at">obs_stat =</span> test_stat, <span class="at">direction =</span> <span class="st">"less"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png" class="img-fluid" width="672"></p>
+</div>
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb14"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb14-1"><a href="#cb14-1" aria-hidden="true" tabindex="-1"></a><span class="co">#compute p-value</span></span>
+<span id="cb14-2"><a href="#cb14-2" aria-hidden="true" tabindex="-1"></a>  null_dist <span class="sc">|&gt;</span></span>
+<span id="cb14-3"><a href="#cb14-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">get_p_value</span>(<span class="at">obs_stat =</span> test_stat, <span class="at">direction =</span> <span class="st">"less"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output cell-output-stdout">
+<pre><code># A tibble: 1 × 1
+  p_value
+    &lt;dbl&gt;
+1   0.252</code></pre>
+</div>
+</div>
+<p>At a significance level of <span class="math inline">\(\alpha = 0.05\)</span>, the p-value of <span class="math inline">\(0.252\)</span> suggests that <strong>there is insufficient evidence to reject the null hypothesis</strong>. In this context, since our null hypothesis asserts that the mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city, our p-value indicates that, assuming our null is true, the probability of observing a difference in means (<span class="math inline">\(\mu_{maj} - \mu_{min}\)</span>) at least as low as our observed test statistic of <span class="math inline">\(-4.92\)</span> is around <span class="math inline">\(25\%\)</span> (<span class="math inline">\(0.252\)</span>). This means our observed difference in means between the groups could plausibly have occurred by random chance.</p>
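+<p>The permutation p-value printed in the output above can be written out explicitly. As a sketch, assume the one-sided (<code>direction = "less"</code>) p-value is the share of permuted statistics at or below the observed one, with <span class="math inline">\(R = 1000\)</span> permutations, <span class="math inline">\(\delta^{*}_{i}\)</span> the difference in means from the <span class="math inline">\(i\)</span>-th permutation, and <span class="math inline">\(\delta_{obs}\)</span> the observed difference (these symbols are introduced here for illustration only):</p>
+<p><span class="math display">\[
+\hat{p} = \frac{1}{R}\sum_{i=1}^{R} \mathbf{1}\{\delta^{*}_{i} \le \delta_{obs}\} = \frac{252}{1000} = 0.252
+\]</span></p>
+<p>Under that assumption, a printed p-value of <span class="math inline">\(0.252\)</span> corresponds to 252 of the 1,000 shuffled datasets producing a difference in means at least as negative as the observed one. Because the shuffles are random, the value can shift slightly from run to run unless a seed is set.</p>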
+<div class="cell">
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb16"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb16-1"><a href="#cb16-1" aria-hidden="true" tabindex="-1"></a><span class="co">#generate null distribution</span></span>
+<span id="cb16-2"><a href="#cb16-2" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb16-3"><a href="#cb16-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> white) <span class="sc">|&gt;</span></span>
+<span id="cb16-4"><a href="#cb16-4" aria-hidden="true" tabindex="-1"></a>  <span class="fu">hypothesize</span>(<span class="at">null =</span> <span class="st">"independence"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb16-5"><a href="#cb16-5" aria-hidden="true" tabindex="-1"></a>  <span class="fu">generate</span>(<span class="at">reps =</span> <span class="dv">1000</span>, <span class="at">type =</span> <span class="st">"permute"</span>) <span class="sc">|&gt;</span></span>
+<span id="cb16-6"><a href="#cb16-6" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"correlation"</span>)</span>
+<span id="cb16-7"><a href="#cb16-7" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb16-8"><a href="#cb16-8" aria-hidden="true" tabindex="-1"></a><span class="co">#compute observed test statistic</span></span>
+<span id="cb16-9"><a href="#cb16-9" aria-hidden="true" tabindex="-1"></a>test_stat_cor <span class="ot">&lt;-</span> agencies_census <span class="sc">|&gt;</span></span>
+<span id="cb16-10"><a href="#cb16-10" aria-hidden="true" tabindex="-1"></a>  <span class="fu">specify</span>(n <span class="sc">~</span> white) <span class="sc">|&gt;</span></span>
+<span id="cb16-11"><a href="#cb16-11" aria-hidden="true" tabindex="-1"></a>  <span class="fu">calculate</span>(<span class="at">stat =</span> <span class="st">"correlation"</span>)</span>
+<span id="cb16-12"><a href="#cb16-12" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb16-13"><a href="#cb16-13" aria-hidden="true" tabindex="-1"></a></span>
+<span id="cb16-14"><a href="#cb16-14" aria-hidden="true" tabindex="-1"></a><span class="co">#visualize p-value</span></span>
+<span id="cb16-15"><a href="#cb16-15" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="sc">|&gt;</span></span>
+<span id="cb16-16"><a href="#cb16-16" aria-hidden="true" tabindex="-1"></a>  <span class="fu">visualize</span>() <span class="sc">+</span></span>
+<span id="cb16-17"><a href="#cb16-17" aria-hidden="true" tabindex="-1"></a>  <span class="fu">shade_p_value</span>(<span class="at">obs_stat =</span> test_stat_cor, <span class="at">direction =</span> <span class="st">"two.sided"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output-display">
+<p><img src="index_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png" class="img-fluid" width="672"></p>
+</div>
+<details>
+<summary>Show the code</summary>
+<div class="sourceCode cell-code" id="cb17"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb17-1"><a href="#cb17-1" aria-hidden="true" tabindex="-1"></a><span class="co">#compute p-value</span></span>
+<span id="cb17-2"><a href="#cb17-2" aria-hidden="true" tabindex="-1"></a>null_dist_cor <span class="sc">|&gt;</span></span>
+<span id="cb17-3"><a href="#cb17-3" aria-hidden="true" tabindex="-1"></a>  <span class="fu">get_p_value</span>(<span class="at">obs_stat =</span> test_stat_cor, <span class="at">direction =</span> <span class="st">"two.sided"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+</details>
+<div class="cell-output cell-output-stdout">
+<pre><code># A tibble: 1 × 1
+  p_value
+    &lt;dbl&gt;
+1       0</code></pre>
+</div>
+</div>
+<p>At a significance level of <span class="math inline">\(\alpha = 0.05\)</span>, the reported p-value of <span class="math inline">\(0\)</span> suggests that there is sufficient evidence to reject the null hypothesis. In this context, since our null hypothesis asserts that there is no relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings (<span class="math inline">\(\rho = 0\)</span>), our p-value indicates that, assuming our null is true, the probability of observing a correlation coefficient at least as extreme as our observed value of <span class="math inline">\(-0.0470\)</span> is around <span class="math inline">\(0\%\)</span>. Because the p-value is estimated from 1,000 permutations, a value of <span class="math inline">\(0\)</span> is best read as <span class="math inline">\(p &lt; 0.001\)</span> rather than exactly zero. This means our observed correlation coefficient likely would not happen if there were no relationship between percentage of officer residency and number of fatal shootings for a given agency.</p>
+</section>
+</section>
+<section id="conclusion" class="level2">
+<h2 class="anchored" data-anchor-id="conclusion">Conclusion</h2>
+<section id="general-conclusions" class="level3">
+<h3 class="anchored" data-anchor-id="general-conclusions">General Conclusions</h3>
+</section>
+<section id="study-limitations" class="level3">
+<h3 class="anchored" data-anchor-id="study-limitations">Study Limitations</h3>
+</section>
+<section id="improvements-for-future-study" class="level3">
+<h3 class="anchored" data-anchor-id="improvements-for-future-study">Improvements for Future Study</h3>
+</section>
+</section>
+<section id="citations" class="level2">
+<h2 class="anchored" data-anchor-id="citations">Citations</h2>
 
 
 </section>
diff --git a/_site/index_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png b/_site/index_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png
new file mode 100644
index 0000000..fe27455
Binary files /dev/null and b/_site/index_files/figure-html/Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop-1.png differ
diff --git a/_site/index_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png b/_site/index_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png
new file mode 100644
index 0000000..5fe78ce
Binary files /dev/null and b/_site/index_files/figure-html/Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop-1.png differ
diff --git a/_site/data_files/figure-html/unnamed-chunk-4-1.png b/_site/index_files/figure-html/unnamed-chunk-1-1.png
similarity index 100%
rename from _site/data_files/figure-html/unnamed-chunk-4-1.png
rename to _site/index_files/figure-html/unnamed-chunk-1-1.png
diff --git a/_site/index_files/figure-html/unnamed-chunk-2-1.png b/_site/index_files/figure-html/unnamed-chunk-2-1.png
new file mode 100644
index 0000000..a1e7dba
Binary files /dev/null and b/_site/index_files/figure-html/unnamed-chunk-2-1.png differ
diff --git a/_site/index_files/figure-html/unnamed-chunk-7-1.png b/_site/index_files/figure-html/unnamed-chunk-7-1.png
new file mode 100644
index 0000000..fa72337
Binary files /dev/null and b/_site/index_files/figure-html/unnamed-chunk-7-1.png differ
diff --git a/_site/index_files/figure-html/unnamed-chunk-9-1.png b/_site/index_files/figure-html/unnamed-chunk-9-1.png
new file mode 100644
index 0000000..46fc51f
Binary files /dev/null and b/_site/index_files/figure-html/unnamed-chunk-9-1.png differ
diff --git a/_site/search.json b/_site/search.json
index 67f3d6e..652322a 100644
--- a/_site/search.json
+++ b/_site/search.json
@@ -4,42 +4,70 @@
     "href": "background.html",
     "title": "Background",
     "section": "",
-    "text": "#On average, police in the United States shoot and kill more than 1,000 people every year, according to an ongoing analysis by The Washington Post."
+    "text": "On average, police in the United States shoot and kill more than 1,000 people every year, according to an ongoing analysis by The Washington Post.\n\n\nProposal\nWe propose a case study to explore the relationship between police residence and fatal police shootings, employing advanced data science methodologies. Focusing on officers residing in the cities they serve, our project aims to uncover insights and patterns that contribute to a nuanced understanding of this complex issue.\n\nObjectives:\n\nInvestigate the correlation between police residence and fatal police shootings.\nUtilize a comprehensive dataset spanning 2015 to 2023, focusing on police agencies involved in at least one fatal shooting.\nApply advanced statistical methods and machine learning techniques to identify patterns and potential biases.\nExamine disparities in incident rates based on officers’ residency status, considering demographic, socioeconomic, and policing variables.\n\nMethodology:\n\nData Collection:\n\nCompile a dataset comprising information on police agencies involved in fatal police shootings.\nCompile a dataset of census variables such as officer residency, race, community demographics, and departmental policies.\n\nAnalysis:\n\nEmploy advanced statistical methods and machine learning techniques to discern patterns and correlations.\nConduct a comprehensive exploration of variables influencing fatal police shootings.\n\n\nHypothesis and Expected Outcomes:\nWe will conduct two hypothesis tests to analyze both:\n\nthe nominal relationship between an increasing proportion of in-city officer residency and number of fatal police shooting deaths and\nthe categorical difference in fatal police shooting deaths between cities where a majority or a minority of police officers live in the city.\n\n\nInference for a Difference in Means\n\n\\(H_0\\): The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the 
city or not.\n\\(H_A\\): The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.\n\n\\(H_0 : \\mu_{maj} − \\mu_{min} = 0\\), or equivalently \\(H_0 : \\mu_{maj} = \\mu_{min}\\)\n\\(H_A : \\mu_{maj} − \\mu_{min} &lt; 0\\), or equivalently \\(H_A : \\mu_{maj} &lt; \\mu_{min}\\)\n\n\nInference for a Correlation\n\n\\(H_0\\): There is no relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.\n\\(H_A\\): There is a relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.\n\n\\(H_0 : \\rho = 0\\)\n\\(H_A : \\rho \\neq 0\\)\n\n\n\n\n\n\nThe Washington Post Fatal Force Database\nIn 2015, The Washington Post began tracking details about each police-involved killing in the United States — the race of the deceased, the circumstances of the shooting, whether the person was armed and whether the person was experiencing a mental-health crisis — by manually culling local news reports, collecting information from law enforcement websites and social media, and monitoring independent databases such as Fatal Encounters and the now-defunct Killed by Police project. In many cases, The Post conducts additional reporting.\nIn 2022, The Post updated its database to standardize and publish the names of the police agencies involved in each shooting to better measure accountability at the department level.\nThe 2014 killing of Michael Brown in Ferguson, Mo. began a protest movement culminating in the Black Lives Matter movement and an increased focus on police accountability nationwide. In this data set, The Post tracks only shootings with circumstances closely paralleling those like the killing of Brown — incidents in which a police officer, in the line of duty, shoots and kills a civilian. 
The Post is not tracking deaths of people in police custody, fatal shootings by off-duty officers or non-shooting deaths in this data set.\nThe FBI and the Centers for Disease Control and Prevention log fatal shootings by police, but officials acknowledge that their data is incomplete. Since 2015, The Post has documented more than twice as many fatal shootings by police as recorded by federal officials on average annually. That gap has widened in recent years, as the FBI in 2021 tracked only a third of departments’ fatal shootings.\n\n\nMost Police Don’t Live In The Cities They Serve\nIn Ferguson, Missouri, where protests lamented for months following the shooting of a teenager by a police officer this month, more than two-thirds of the civilian population is black. Only 11 percent of the police force is. The racial disparity is troubling enough on its own, but it’s also suggestive of another type of misrepresentation. Given Ferguson’s racial gap, it’s likely that many of its police officers live outside city limits.\nIf so, Ferguson would have something in common with most major American cities. In about two-thirds of the U.S. cities with the largest police forces, the majority of police officers commute to work from another town.\nOn average among the 75 American cities with the largest police forces, 49 percent of black police officers and 47 percent of Hispanic officers live within the city limits. But just 35 percent of white police officers do. The disparity is starkest in cities with largely black populations. In Detroit, for example, 57 percent of black police officers live in the city but just 8 percent of white ones do. Memphis, Tennessee; Baltimore; Birmingham, Alabama; and Jackson, Mississippi — also majority black — likewise have large racial gaps in where their police officers live."
   },
   {
-    "objectID": "background.html#wapo-fatal-force-database",
-    "href": "background.html#wapo-fatal-force-database",
-    "title": "Background",
-    "section": "WaPo Fatal Force Database",
-    "text": "WaPo Fatal Force Database\nIn 2015, The Washington Post began tracking details about each police-involved killing in the United States — the race of the deceased, the circumstances of the shooting, whether the person was armed and whether the person was experiencing a mental-health crisis — by manually culling local news reports, collecting information from law enforcement websites and social media, and monitoring independent databases such as Fatal Encounters and the now-defunct Killed by Police project. In many cases, The Post conducts additional reporting.\nIn 2022, The Post updated its database to standardize and publish the names of the police agencies involved in each shooting to better measure accountability at the department level.\nThe 2014 killing of Michael Brown in Ferguson, Mo. began a protest movement culminating in the Black Lives Matter movement and an increased focus on police accountability nationwide. In this data set, The Post tracks only shootings with circumstances closely paralleling those like the killing of Brown — incidents in which a police officer, in the line of duty, shoots and kills a civilian. The Post is not tracking deaths of people in police custody, fatal shootings by off-duty officers or non-shooting deaths in this data set.\nThe FBI and the Centers for Disease Control and Prevention log fatal shootings by police, but officials acknowledge that their data is incomplete. Since 2015, The Post has documented more than twice as many fatal shootings by police as recorded by federal officials on average annually. That gap has widened in recent years, as the FBI in 2021 tracked only a third of departments’ fatal shootings.\n#Most Police Don’t Live In The Cities They Serve\nIn Ferguson, Missouri, where protests lamented for months following the shooting of a teenager by a police officer this month, more than two-thirds of the civilian population is black. Only 11 percent of the police force is. 
The racial disparity is troubling enough on its own, but it’s also suggestive of another type of misrepresentation. Given Ferguson’s racial gap, it’s likely that many of its police officers live outside city limits.\nIf so, Ferguson would have something in common with most major American cities. In about two-thirds of the U.S. cities with the largest police forces, the majority of police officers commute to work from another town.\nOn average among the 75 American cities with the largest police forces, 49 percent of black police officers and 47 percent of Hispanic officers live within the city limits. But just 35 percent of white police officers do. The disparity is starkest in cities with largely black populations. In Detroit, for example, 57 percent of black police officers live in the city but just 8 percent of white ones do. Memphis, Tennessee; Baltimore; Birmingham, Alabama; and Jackson, Mississippi — also majority black — likewise have large racial gaps in where their police officers live."
+    "objectID": "codebook.html",
+    "href": "codebook.html",
+    "title": "Codebook",
+    "section": "",
+    "text": "Name\nDescription\n\n\n\n\ncity\nU.S. city\n\n\npolice_force_size\nNumber of police officers serving that city\n\n\nall\nPercentage of the total police force that lives in the city\n\n\nwhite\nPercentage of white (non-Hispanic) police officers who live in the city\n\n\nnon-white\nPercentage of non-white police officers who live in the city\n\n\nblack\nPercentage of black police officers who live in the city\n\n\nhispanic\nPercentage of Hispanic police officers who live in the city\n\n\nasian\nPercentage of Asian police officers who live in the city\n\n\n\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nid\nA unique identifier for each fatal police shooting incident.\n\n\ndate\nThe date of the fatal shooting.\n\n\nbody_camera\nWhether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.\n\n\ncity\nThe municipality where the fatal shooting took place\n\n\ncounty\nCounty where the fatal shooting took place.\n\n\nstate\nThe two-letter postal code abbreviation for the state in which the fatal shooting took place.\n\n\nlatitude\nThe latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level.\n\n\nlongitude\nThe longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.\n\n\n\n\n\n\n\n\n\n\nDescription\n\n\n\n\nid\nDepartment Database Id\n\n\nname\nDepartment Name\n\n\nstate\nState in which the agency is located."
   },
   {
-    "objectID": "index.html",
-    "href": "index.html",
-    "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
+    "objectID": "codebook.html#explanatory-variables",
+    "href": "codebook.html#explanatory-variables",
+    "title": "Codebook",
     "section": "",
-    "text": "This case study investigates the intricate relationship between police residence and fatal police shootings, employing a data science approach to uncover insights and patterns within the context of law enforcement agencies. Focused on police officers residing in the cities they serve, the study examines whether this residency factor correlates with the incidence of fatal police shootings. The data set, spanning the years 2015 to 2023, is composed of information on police agencies involved in at least one fatal shooting, and is subjected to rigorous analysis using advanced statistical methods and machine learning techniques.\nThis study aims to discern patterns, trends, and potential biases associated with the geographical proximity of police officers to the communities they police. A comprehensive exploration of demographic, socioeconomic, and policing variables contributes to a nuanced understanding of the factors influencing fatal police shootings. Furthermore, the study seeks to identify any disparities in incident rates based on officers’ residency status, considering variables such as race, community demographics, and departmental policies.\nThe insights derived from this case study bear substantial implications for informing public policy, refining police training protocols, and strengthening community relations. By unraveling the nuanced dynamics surrounding police residence and fatal police shootings, this case study aims to provide evidence-based recommendations to enhance transparency, accountability, and trust between law enforcement agencies and the communities they serve. In doing so, it contributes to the broader discourse on police reform, fostering a data-driven approach to address critical issues and promote safer, more resilient communities.\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\ncity\nU.S. 
city\n\n\npolice_force_size\nNumber of police officers serving that city\n\n\nall\nPercentage of the total police force that lives in the city\n\n\nwhite\nPercentage of white (non-Hispanic) police officers who live in the city\n\n\nnon-white\nPercentage of non-white police officers who live in the city\n\n\nblack\nPercentage of black police officers who live in the city\n\n\nhispanic\nPercentage of Hispanic police officers who live in the city\n\n\nasian\nPercentage of Asian police officers who live in the city\n\n\n\nIncident Information\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nid\nA unique identifier for each fatal police shooting incident.\n\n\ndate\nThe date of the fatal shooting.\n\n\nbody_camera\nWhether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.\n\n\ncity\nThe municipality where the fatal shooting took place\n\n\ncounty\nCounty where the fatal shooting took place.\n\n\nstate\nThe two-letter postal code abbreviation for the state in which the fatal shooting took place.\n\n\nlatitude\nThe latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level.\n\n\nlongitude\nThe longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.\n\n\n\nAgency Information\n\n\n\n\nDescription\n\n\n\n\nid\nDepartment Database Id\n\n\nname\nDepartment Name\n\n\nstate\nState in which the agency is located."
+    "text": "Name\nDescription\n\n\n\n\ncity\nU.S. city\n\n\npolice_force_size\nNumber of police officers serving that city\n\n\nall\nPercentage of the total police force that lives in the city\n\n\nwhite\nPercentage of white (non-Hispanic) police officers who live in the city\n\n\nnon-white\nPercentage of non-white police officers who live in the city\n\n\nblack\nPercentage of black police officers who live in the city\n\n\nhispanic\nPercentage of Hispanic police officers who live in the city\n\n\nasian\nPercentage of Asian police officers who live in the city\n\n\n\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nid\nA unique identifier for each fatal police shooting incident.\n\n\ndate\nThe date of the fatal shooting.\n\n\nbody_camera\nWhether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.\n\n\ncity\nThe municipality where the fatal shooting took place\n\n\ncounty\nCounty where the fatal shooting took place.\n\n\nstate\nThe two-letter postal code abbreviation for the state in which the fatal shooting took place.\n\n\nlatitude\nThe latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level.\n\n\nlongitude\nThe longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.\n\n\n\n\n\n\n\n\n\n\nDescription\n\n\n\n\nid\nDepartment Database Id\n\n\nname\nDepartment Name\n\n\nstate\nState in which the agency is located."
+  },
+  {
+    "objectID": "codebook.html#project-thoughts",
+    "href": "codebook.html#project-thoughts",
+    "title": "Codebook",
+    "section": "Project thoughts",
+    "text": "Project thoughts\nI am interested in exploring data related to…\n\nPolitical Extremism\nBlack American Opinion"
   },
   {
-    "objectID": "index.html#proposal",
-    "href": "index.html#proposal",
+    "objectID": "index.html#sec-abstract",
+    "href": "index.html#sec-abstract",
     "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
-    "section": "",
-    "text": "This case study investigates the intricate relationship between police residence and fatal police shootings, employing a data science approach to uncover insights and patterns within the context of law enforcement agencies. Focused on police officers residing in the cities they serve, the study examines whether this residency factor correlates with the incidence of fatal police shootings. The data set, spanning the years 2015 to 2023, is composed of information on police agencies involved in at least one fatal shooting, and is subjected to rigorous analysis using advanced statistical methods and machine learning techniques.\nThis study aims to discern patterns, trends, and potential biases associated with the geographical proximity of police officers to the communities they police. A comprehensive exploration of demographic, socioeconomic, and policing variables contributes to a nuanced understanding of the factors influencing fatal police shootings. Furthermore, the study seeks to identify any disparities in incident rates based on officers’ residency status, considering variables such as race, community demographics, and departmental policies.\nThe insights derived from this case study bear substantial implications for informing public policy, refining police training protocols, and strengthening community relations. By unraveling the nuanced dynamics surrounding police residence and fatal police shootings, this case study aims to provide evidence-based recommendations to enhance transparency, accountability, and trust between law enforcement agencies and the communities they serve. In doing so, it contributes to the broader discourse on police reform, fostering a data-driven approach to address critical issues and promote safer, more resilient communities.\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\ncity\nU.S. 
city\n\n\npolice_force_size\nNumber of police officers serving that city\n\n\nall\nPercentage of the total police force that lives in the city\n\n\nwhite\nPercentage of white (non-Hispanic) police officers who live in the city\n\n\nnon-white\nPercentage of non-white police officers who live in the city\n\n\nblack\nPercentage of black police officers who live in the city\n\n\nhispanic\nPercentage of Hispanic police officers who live in the city\n\n\nasian\nPercentage of Asian police officers who live in the city\n\n\n\nIncident Information\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nid\nA unique identifier for each fatal police shooting incident.\n\n\ndate\nThe date of the fatal shooting.\n\n\nbody_camera\nWhether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.\n\n\ncity\nThe municipality where the fatal shooting took place\n\n\ncounty\nCounty where the fatal shooting took place.\n\n\nstate\nThe two-letter postal code abbreviation for the state in which the fatal shooting took place.\n\n\nlatitude\nThe latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level.\n\n\nlongitude\nThe longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.\n\n\n\nAgency Information\n\n\n\n\nDescription\n\n\n\n\nid\nDepartment Database Id\n\n\nname\nDepartment Name\n\n\nstate\nState in which the agency is located."
+    "section": "Abstract",
+    "text": "Abstract\nThis case study investigates the intricate relationship between police residence and fatal police shootings, employing a data science approach to uncover insights and patterns within the context of law enforcement agencies. Focused on police officers residing in the cities they serve, the study examines whether this residency factor correlates with the incidence of fatal police shootings. The data set, spanning the years 2015 to 2023, is composed of information on police agencies involved in at least one fatal shooting, and is subjected to rigorous analysis using advanced statistical methods and machine learning techniques.\nThis study aims to discern patterns, trends, and potential biases associated with the geographical proximity of police officers to the communities they police. A comprehensive exploration of demographic, socioeconomic, and policing variables contributes to a nuanced understanding of the factors influencing fatal police shootings. Furthermore, the study seeks to identify any disparities in incident rates based on officers’ residency status, considering variables such as race, community demographics, and departmental policies.\nThe insights derived from this case study bear substantial implications for informing public policy, refining police training protocols, and strengthening community relations. By unraveling the nuanced dynamics surrounding police residence and fatal police shootings, this case study aims to provide evidence-based recommendations to enhance transparency, accountability, and trust between law enforcement agencies and the communities they serve. In doing so, it contributes to the broader discourse on police reform, fostering a data-driven approach to address critical issues and promote safer, more resilient communities."
   },
   {
-    "objectID": "index.html#project-thoughts",
-    "href": "index.html#project-thoughts",
+    "objectID": "index.html#hypotheses",
+    "href": "index.html#hypotheses",
     "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
-    "section": "Project thoughts",
-    "text": "Project thoughts\nI am interested in exploring data related to…\n\nPolitical Extremism\nBlack American Opinion"
+    "section": "Hypotheses",
+    "text": "Hypotheses\nWe will conduct two hypothesis tests to analyze both:\n\nThe categorical difference in fatal police shooting deaths between cities where a majority or a minority of police officers live in the city.\n\n\\(H_0\\): The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.\n\\(H_A\\): The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.\n\n\\(H_0 : \\mu_{maj} - \\mu_{min} = 0\\), or equivalently \\(H_0 : \\mu_{maj} = \\mu_{min}\\)\n\\(H_A : \\mu_{maj} - \\mu_{min} &lt; 0\\), or equivalently \\(H_A : \\mu_{maj} &lt; \\mu_{min}\\)\n\n\nThe numerical relationship between an increasing proportion of in-city officer residency and the number of fatal police shooting deaths.\n\n\\(H_0\\): There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.\n\\(H_A\\): There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.\n\n\\(H_0 : \\rho = 0\\)\n\\(H_A : \\rho \\neq 0\\)"
   },
   {
-    "objectID": "about.html",
-    "href": "about.html",
-    "title": "About",
-    "section": "",
-    "text": "About this site\n\n1 + 1\n\n[1] 2"
+    "objectID": "index.html#methods",
+    "href": "index.html#methods",
+    "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
+    "section": "Methods",
+    "text": "Methods\n\nTidying Data\n\n\nShow the code\n##Tidying Data\n\n#creating dfs from .csv files\npolice_locals &lt;- read_csv(\"data/police-locals.csv\")\nagencies &lt;- read_csv(\"data/fatal-police-shootings-agencies.csv\")\nshootings &lt;- read_csv(\"data/fatal-police-shootings-data.csv\")\n\n#removing old `city` tag from data set that we created when decatenated the city names\npolice_locals &lt;- police_locals |&gt;\n  select(-city_old)\n\n# creating `agencies` df with just police departments\nagencies &lt;- agencies |&gt;\n  filter(grepl(\"department\", tolower(name))) |&gt;\n  filter(!grepl(\"county\", tolower(name)))\n\n#creating binned categorical account of if shooting victim was `armed`\nshootings &lt;- shootings |&gt;\n  mutate(armed = case_when(is.na(armed_with) ~ \"NO\",\n                           armed_with == \"unarmed\" ~ \"NO\",\n                           armed_with == \"unknown\" ~ \"NO\",\n                           armed_with == \"undetermined\" ~ \"NO\",\n                           armed_with == \"gun\" ~ \"YES\",\n                           armed_with == \"knife\" ~ \"YES\",\n                           armed_with == \"blunt_object\" ~ \"YES\",\n                           armed_with == \"other\" ~ \"YES\",\n                           armed_with == \"replica\" ~ \"YES\",\n                           armed_with == \"vehicle\" ~ \"YES\"))\n\n#creating df with only agency `names`, `id`, and `state`\nagencies_ids &lt;- agencies |&gt;\n  select(name, id, state)\n\n#creating df with `city`, `agency`, and `state` info for each shooting\nshooting_agencies &lt;- shootings |&gt;\n  select(city, agency_ids, state)\n\n#changing `shooting` var in `shooting_agencies` df to numeric\nshooting_agencies$agency_ids &lt;- as.numeric(shootings$agency_ids)\n\n#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`\nagencies_w_cities &lt;- agencies_ids |&gt;\n  left_join(shooting_agencies, by = c(\"id\" = 
\"agency_ids\", \"state\" = \"state\")) |&gt;\n  drop_na(city) |&gt;\n  distinct(id, .keep_all = TRUE)\n\n#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`\nagencies_census &lt;- agencies_w_cities |&gt;\n  full_join(police_locals, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  drop_na(police_force_size) |&gt;\n  distinct(id, .keep_all = TRUE) |&gt;\n  mutate(majority = if_else(all &gt;= 0.5, \"TRUE\", \"FALSE\"))\n\n#creating df of only shootings involving agencies within `agencies` df\nshootings_case &lt;- shootings |&gt;\n  right_join(agencies_census, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  select(-agency_ids) |&gt;\n  rename(agency_ids = id.y, id = id.x, agency = name.y, victim = name.x) |&gt;\n  select(-location_precision, -race_source)\n\n\n\n\nCounting Shootings\n\n\nShow the code\n#count shootings by agency\nshootings_by_agency &lt;- shootings_case |&gt;\n  count(agency)\n\n#find top 25 agencies with the most shootings\ntop_25_agencies &lt;- shootings_by_agency |&gt;\n  slice_max(n, n = 25)\n\n\n\n\nMapping Locations of Police-Involved Shootings between 2015 and 2023\n\n\nShow the code\nshot_map\n\n\n\n\n\n\n\nShow the code\n#creating df with total shootings per agency and census data\nagencies_census &lt;- agencies_census |&gt;\n  left_join(shootings_by_agency, by = c(\"name\" = \"agency\"))\n\n#creating visualization of comparison Shootings in Cities where a Majority/Minority of Officers Reside\np0 &lt;- shootings_case |&gt;\n  ggplot(aes(x = majority, fill = armed)) +\n  geom_bar() + \n  labs(title = \"Shootings in Cities where a Majority of Officers Reside\",\n       caption = \"This is only includes shootings where we have agency census data.\",\n       x = \"Does a majority a of the total police force live in the city?\",\n       y = \"Number of fatal shootings\",\n       fill = \"Victim Armed?\")\n\n\n\n\nShow the code\np0\n\n\n\n\n\n\n\nShow the code\n#calculate mean 
number of shootings per agency in cities where a majority of officers reside in the city\nmajority_mean &lt;- shootings_case |&gt;\n  filter(majority == TRUE) |&gt;\n  count(agency) |&gt;\n  summarize(maj_mean = mean(n))\n\n#calculate mean number of shootings per agency in cities where a minority of officers reside in the city\nminority_mean &lt;- shootings_case |&gt;\n  filter(majority == FALSE) |&gt;\n  count(agency) |&gt;\n  summarize(min_mean = mean(n))\n\n#calculate a difference in means between the `majority` and `minority`\ndiff_in_means &lt;- majority_mean - minority_mean\n\n\n\n\nShow the code\n#tidy table\nknitr::kable(head(diff_in_means))\n\n\n\n\n\nmaj_mean\n\n\n\n\n-2.575\n\n\n\n\n\n\n\nShow the code\n#fit single linear regression model for correlation between percentage of officer residency and number of fatal shootings per agency\nfit &lt;- lm(n ~ all, data = agencies_census)\n\n#add `armed` and `majority` to `shootings_by_agency` df\nshootings_by_agency_census &lt;- shootings_case |&gt;\n  group_by(agency) |&gt;\n  count(armed) |&gt;\n  drop_na(n, armed) |&gt;\n  right_join(agencies_census, by = c(\"agency\" = \"name\")) |&gt;\n  distinct(armed, .keep_all = TRUE)\n\nshootings_by_agency_census &lt;- shootings_by_agency_census |&gt;\n  select(n.x, armed, all) \n\n#fit multiple linear regression model for correlation between percentage of officer residency and victim armament and number of fatal shootings per agency\nfit_multi &lt;- lm(n.x ~ all + armed, data = shootings_by_agency_census)"
+  },
+  {
+    "objectID": "index.html#results",
+    "href": "index.html#results",
+    "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
+    "section": "Results",
+    "text": "Results\n\nSimple Linear Regression of the relationship between percentage of officer residency and number of fatal shootings per agency fit\nThe model equation for fit is:\n\\[\n\\text{Number of Fatal Shootings (n)} = 35.7782 - 0.5874 \\times \\text{Percentage of Officer Residency (all)}\n\\]\n\n\nShow the code\n#tidy `fit`\np1 &lt;- get_regression_table(fit)\nknitr::kable(head(p1))\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n35.778\n6.887\n5.195\n0.000\n22.126\n49.430\n\n\nall\n-0.587\n14.902\n-0.039\n0.969\n-30.130\n28.955\n\n\n\n\n\nInterpretation:\n\nThe intercept, \\(35.7782\\), is the estimated number of fatal shootings when the percentage of officer in-city residency (all) is \\(0\\). For each one-unit increase in the percentage of officer residency, the number of fatal shootings is expected to decrease by \\(0.5874\\).\n\nThe estimated slope is negative, but its p-value of \\(0.969\\) and a confidence interval running from \\(-30.130\\) to \\(28.955\\) mean this model provides no evidence of an association between the percentage of officer residency and the number of fatal shootings. 
It’s also important to interpret the results in the context of the data and to consider potential confounding factors, such as whether the victim was armed.\n\n\nShow the code\n#visualize polynomial relationship between percentage of officer residency and number of fatal shootings per agency\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x)) +\n  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = TRUE) +\n  labs(title = \"Number of Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\")\n\n\n\n\n\n\n\nMultiple Linear Regression of the relationship between percentage of officer residency/victim armament and number of fatal shootings per agency fit_multi\nThe model equation for fit_multi considering victim armament (armed) is:\n\\[\n\\text{Number of Fatal Shootings (n.x)} = 4.117 + 1.211 \\times \\text{Percentage of Officer Residency (all)} + 24.921 \\times \\text{Armed (YES)}\n\\]\n\n\nShow the code\n#tidy `fit_multi`\np2 &lt;- get_regression_table(fit_multi)\nknitr::kable(head(p2))\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n4.117\n3.362\n1.225\n0.222\n-2.512\n10.747\n\n\nall\n1.211\n6.335\n0.191\n0.849\n-11.281\n13.702\n\n\narmed: YES\n24.921\n2.891\n8.619\n0.000\n19.219\n30.622\n\n\n\n\n\n\nThe intercept, \\(4.117\\), is the estimated number of fatal shootings when the percentage of officer in-city residency (all) is \\(0\\) and the victim was unarmed. 
For each one-unit increase in the percentage of in-city officer residency compared to the total force (all), we expect an increase of \\(1.211\\) fatal shootings, assuming the victim’s armament status (armedYES) remains constant.\nThe coefficient for ‘armedYES’, \\(24.921\\), indicates that when the victim is armed (armed is YES), we expect \\(24.921\\) more fatal shootings than when the victim is not armed (armed is NO), assuming the percentage of officer residency (all) remains constant.\n\nIn summary, the model suggests that whether the victim is armed is associated with the number of fatal shootings per agency (reported p-value of \\(0.000\\)), while the percentage of officer residency is not (p-value of \\(0.849\\)) once we control for victim armament. However, correlation does not imply causation, and other factors not included in the model may influence the outcomes.\n\n\nShow the code\n#visualize polynomial relationship between percentage of officer residency and victim armament and number of fatal shootings per agency\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x, color = armed)) +\n  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = TRUE) +\n  labs(title = \"Number of Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\",\n       color = \"Victim Armed?\")\n\n\n\n\n\n\n\nShow the code\n#generate null distribution\nnull_dist &lt;- agencies_census |&gt;\n  specify(n ~ majority) |&gt;\n  hypothesize(null = \"independence\") |&gt;\n  generate(reps = 1000, type = \"permute\") |&gt;\n  calculate(stat = \"diff in means\", order = c(\"TRUE\", \"FALSE\"))\n\n#compute observed test statistic\ntest_stat &lt;- agencies_census |&gt;\n  specify(n ~ majority) |&gt;\n  calculate(stat = \"diff in means\", order = c(\"TRUE\", \"FALSE\"))\n\n#visualize p-value\nnull_dist |&gt;\n  visualize() +\n  
shade_p_value(obs_stat = test_stat, direction = \"less\")\n\n\n\n\n\nShow the code\n#compute p-value\n  null_dist |&gt;\n  get_p_value(obs_stat = test_stat, direction = \"less\")\n\n\n# A tibble: 1 × 1\n  p_value\n    &lt;dbl&gt;\n1   0.252\n\n\nAt a significance level of \\(\\alpha = 0.05\\), the p-value of \\(0.252\\) suggests that there is insufficient evidence to reject the null hypothesis. In this context, since our null hypothesis asserts that the mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city, our p-value indicates that, assuming our null is true, the probability of observing a test statistic (difference in means; \\(\\mu_{maj} - \\mu_{min}\\)) at least as extreme as our observed \\(-4.92\\) is around \\(25\\%\\) (\\(0.252\\)). This means our observed difference in means between the groups could plausibly have occurred by random chance.\n\n\nShow the code\n#generate null distribution\nnull_dist_cor &lt;- agencies_census |&gt;\n  specify(n ~ white) |&gt;\n  hypothesize(null = \"independence\") |&gt;\n  generate(reps = 1000, type = \"permute\") |&gt;\n  calculate(stat = \"correlation\")\n\n#compute observed test statistic\ntest_stat_cor &lt;- agencies_census |&gt;\n  specify(n ~ white) |&gt;\n  calculate(stat = \"correlation\")\n\n\n#visualize p-value\nnull_dist_cor |&gt;\n  visualize() +\n  shade_p_value(obs_stat = test_stat, direction = \"two.sided\")\n\n\n\n\n\nShow the code\n#compute p-value\nnull_dist_cor |&gt;\n  get_p_value(obs_stat = test_stat, direction = \"two.sided\")\n\n\n# A tibble: 1 × 1\n  p_value\n    &lt;dbl&gt;\n1       0\n\n\nAt a significance level of \\(\\alpha = 0.05\\), the p-value of \\(0\\) suggests that there is sufficient evidence to reject the null hypothesis. 
In this context, since our null hypothesis asserts that there is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings, our p-value indicates that, assuming our null is true, the probability of observing a test statistic (correlation coefficient; \\(\\rho\\)) at least as extreme as our observed \\(-0.0470\\) is around \\(0\\%\\) (\\(0\\)). This means our observed correlation coefficient would be unlikely to occur if there were no relationship between the percentage of officer residency and the number of fatal shootings for a given agency."
+  },
+  {
+    "objectID": "index.html#conclusion",
+    "href": "index.html#conclusion",
+    "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
+    "section": "Conclusion",
+    "text": "Conclusion\n\nGeneral Conclusions\n\n\nStudy Limitations\n\n\nImprovements for Future Study"
+  },
+  {
+    "objectID": "index.html#citations",
+    "href": "index.html#citations",
+    "title": "A Case Study on the Relationship between Police Residence and Fatal Police Shootings",
+    "section": "Citations",
+    "text": "Citations"
   },
   {
     "objectID": "data/README-fatal-police-shoot.html",
@@ -67,13 +95,6 @@
     "href": "data.html",
     "title": "Data",
     "section": "",
-    "text": "library(tidyverse)\n\nWarning: package 'lubridate' was built under R version 4.3.1\n\n\n── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──\n✔ dplyr     1.1.3     ✔ readr     2.1.4\n✔ forcats   1.0.0     ✔ stringr   1.5.0\n✔ ggplot2   3.4.2     ✔ tibble    3.2.1\n✔ lubridate 1.9.3     ✔ tidyr     1.3.0\n✔ purrr     1.0.2     \n── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──\n✖ dplyr::filter() masks stats::filter()\n✖ dplyr::lag()    masks stats::lag()\nℹ Use the conflicted package (&lt;http://conflicted.r-lib.org/&gt;) to force all conflicts to become errors\n\nlibrary(usmap)\nlibrary(sf)\n\nLinking to GEOS 3.11.0, GDAL 3.5.3, PROJ 9.1.0; sf_use_s2() is TRUE\n\nlibrary(infer)\nlibrary(moderndive)\n\n\n##Tidying Data\n\n#creating dfs from .csv files\npolice_locals &lt;- read_csv(\"data/police-locals.csv\")\n\nRows: 75 Columns: 10\n── Column specification ────────────────────────────────────────────────────────\nDelimiter: \",\"\nchr (6): city_old, city, state, black, hispanic, asian\ndbl (4): police_force_size, all, white, non-white\n\nℹ Use `spec()` to retrieve the full column specification for this data.\nℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.\n\nagencies &lt;- read_csv(\"data/fatal-police-shootings-agencies.csv\")\n\nRows: 3422 Columns: 6\n── Column specification ────────────────────────────────────────────────────────\nDelimiter: \",\"\nchr (4): name, type, state, oricodes\ndbl (2): id, total_shootings\n\nℹ Use `spec()` to retrieve the full column specification for this data.\nℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.\n\nshootings &lt;- read_csv(\"data/fatal-police-shootings-data.csv\")\n\nRows: 9129 Columns: 19\n── Column specification ────────────────────────────────────────────────────────\nDelimiter: \",\"\nchr  (12): threat_type, flee_status, armed_with, city, county, state, locati...\ndbl   (4): id, 
latitude, longitude, age\nlgl   (2): was_mental_illness_related, body_camera\ndate  (1): date\n\nℹ Use `spec()` to retrieve the full column specification for this data.\nℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.\n\n#removing old `city` tag from data set that we created when decatenated the city names\npolice_locals &lt;- police_locals |&gt;\n  select(-city_old)\n\n# creating `agencies` df with just police departments\nagencies &lt;- agencies |&gt;\n  filter(grepl(\"department\", tolower(name))) |&gt;\n  filter(!grepl(\"county\", tolower(name)))\n\n#creating binned categorical account of if shooting victim was `armed`\nshootings &lt;- shootings |&gt;\n  mutate(armed = case_when(is.na(armed_with) ~ \"NO\",\n                           armed_with == \"unarmed\" ~ \"NO\",\n                           armed_with == \"unknown\" ~ \"NO\",\n                           armed_with == \"undetermined\" ~ \"NO\",\n                           armed_with == \"gun\" ~ \"YES\",\n                           armed_with == \"knife\" ~ \"YES\",\n                           armed_with == \"blunt_object\" ~ \"YES\",\n                           armed_with == \"other\" ~ \"YES\",\n                           armed_with == \"replica\" ~ \"YES\",\n                           armed_with == \"vehicle\" ~ \"YES\"))\n\n#creating df with only agency `names`, `id`, and `state`\nagencies_ids &lt;- agencies |&gt;\n  select(name, id, state)\nagencies_ids\n\n# A tibble: 2,057 × 3\n   name                                   id state\n   &lt;chr&gt;                               &lt;dbl&gt; &lt;chr&gt;\n 1 Aberdeen Police Department           2576 WA   \n 2 Abilene Police Department            2114 TX   \n 3 Abington Township Police Department  2088 PA   \n 4 Acworth Police Department            3375 GA   \n 5 Ada Police Department                2579 OK   \n 6 Adel Police Department               3107 GA   \n 7 Akron Police Department               815 OH   \n 8 
Alamogordo Police Department         1434 NM   \n 9 Alamosa Police Department            2354 CO   \n10 Albany Police Department             1443 GA   \n# ℹ 2,047 more rows\n\n#creating df with `city`, `agency`, and `state` info for each shooting\nshooting_agencies &lt;- shootings |&gt;\n  select(city, agency_ids, state)\nshooting_agencies\n\n# A tibble: 9,129 × 3\n   city          agency_ids state\n   &lt;chr&gt;         &lt;chr&gt;      &lt;chr&gt;\n 1 Shelton       73         WA   \n 2 Aloha         70         OR   \n 3 Wichita       238        KS   \n 4 San Francisco 196        CA   \n 5 Evans         473        CO   \n 6 Guthrie       101        OK   \n 7 Chandler      195        AZ   \n 8 Assaria       490        KS   \n 9 Burlington    287        IA   \n10 Knoxville     26254      PA   \n# ℹ 9,119 more rows\n\n#changing `shooting` var in `shooting_agencies` df to numeric\nshooting_agencies$agency_ids &lt;- as.numeric(shootings$agency_ids)\n\nWarning: NAs introduced by coercion\n\n#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`\nagencies_w_cities &lt;- agencies_ids |&gt;\n  left_join(shooting_agencies, by = c(\"id\" = \"agency_ids\", \"state\" = \"state\")) |&gt;\n  drop_na(city) |&gt;\n  distinct(id, .keep_all = TRUE)\nagencies_w_cities\n\n# A tibble: 1,781 × 4\n   name                                   id state city             \n   &lt;chr&gt;                               &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;            \n 1 Aberdeen Police Department           2576 WA    Aberdeen         \n 2 Abilene Police Department            2114 TX    Abilene          \n 3 Abington Township Police Department  2088 PA    Abington Township\n 4 Acworth Police Department            3375 GA    Acworth          \n 5 Ada Police Department                2579 OK    Ada              \n 6 Adel Police Department               3107 GA    Adel             \n 7 Akron Police Department               815 OH    Akron          
  \n 8 Alamogordo Police Department         1434 NM    Alamogordo       \n 9 Alamosa Police Department            2354 CO    Alamosa          \n10 Albany Police Department             1443 GA    Albany           \n# ℹ 1,771 more rows\n\n#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`\nagencies_census &lt;- agencies_w_cities |&gt;\n  full_join(police_locals, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  drop_na(police_force_size) |&gt;\n  distinct(id, .keep_all = TRUE) |&gt;\n  mutate(majority = if_else(all &gt;= 0.5, \"TRUE\", \"FALSE\"))\nagencies_census\n\n# A tibble: 109 × 12\n   name         id state city  police_force_size    all  white `non-white` black\n   &lt;chr&gt;     &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             &lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;       &lt;dbl&gt; &lt;chr&gt;\n 1 Albany P…  2237 NY    Alba…               890 0.185  0.160        0.364 **   \n 2 Albuquer…   508 NM    Albu…              1340 0.616  0.630        0.602 **   \n 3 Amtrak P…  1657 IL    Chic…             12120 0.875  0.872        0.877 0.89…\n 4 Atlanta …   447 GA    Atla…              2950 0.137  0.186        0.111 0.10…\n 5 Austin P…   141 TX    Aust…              1985 0.295  0.195        0.427 0.25 \n 6 Baltimor…  4784 MD    Balt…              2800 0.257  0.133        0.362 0.39…\n 7 Baltimor…   149 MD    Balt…              2800 0.257  0.133        0.362 0.39…\n 8 BART Pol…  2015 CA    Oakl…              1530 0.0948 0.0267       0.160 0.06…\n 9 Baton Ro…  1098 LA    Bato…               980 0.214  0.144        0.321 0.34…\n10 Boston P…     3 MA    Bost…              2560 0.477  0.442        0.583 0.68…\n# ℹ 99 more rows\n# ℹ 3 more variables: hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;\n\n#creating df of only shootings involving agencies within `agencies` df\nshootings_case &lt;- shootings |&gt;\n  right_join(agencies_census, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  
select(-agency_ids) |&gt;\n  rename(agency_ids = id.y, id = id.x, agency = name.y, victim = name.x) |&gt;\n  select(-location_precision, -race_source)\n\nWarning in right_join(shootings, agencies_census, by = c(city = \"city\", : Detected an unexpected many-to-many relationship between `x` and `y`.\nℹ Row 4 of `x` matches multiple rows in `y`.\nℹ Row 29 of `y` matches multiple rows in `x`.\nℹ If a many-to-many relationship is expected, set `relationship =\n  \"many-to-many\"` to silence this warning.\n\nshootings_case\n\n# A tibble: 3,677 × 27\n      id date       threat_type flee_status armed_with city         county state\n   &lt;dbl&gt; &lt;date&gt;     &lt;chr&gt;       &lt;chr&gt;       &lt;chr&gt;      &lt;chr&gt;        &lt;chr&gt;  &lt;chr&gt;\n 1     5 2015-01-03 move        not         unarmed    Wichita      Sedgw… KS   \n 2     8 2015-01-04 point       not         replica    San Francis… San F… CA   \n 3     8 2015-01-04 point       not         replica    San Francis… San F… CA   \n 4    22 2015-01-07 threat      not         knife      Columbus     Frank… OH   \n 5    22 2015-01-07 threat      not         knife      Columbus     Frank… OH   \n 6    27 2015-01-07 shoot       foot        gun        New Orleans  Orlea… LA   \n 7   325 2015-01-09 point       not         gun        El Paso      El Pa… TX   \n 8    46 2015-01-13 shoot       foot        gun        Albuquerque  Berna… NM   \n 9    46 2015-01-13 shoot       foot        gun        Albuquerque  Berna… NM   \n10    56 2015-01-15 shoot       foot        gun        Indianapolis Marion IN   \n# ℹ 3,667 more rows\n# ℹ 19 more variables: latitude &lt;dbl&gt;, longitude &lt;dbl&gt;, victim &lt;chr&gt;,\n#   age &lt;dbl&gt;, gender &lt;chr&gt;, race &lt;chr&gt;, was_mental_illness_related &lt;lgl&gt;,\n#   body_camera &lt;lgl&gt;, armed &lt;chr&gt;, agency &lt;chr&gt;, agency_ids &lt;dbl&gt;,\n#   police_force_size &lt;dbl&gt;, all &lt;dbl&gt;, white &lt;dbl&gt;, `non-white` &lt;dbl&gt;,\n#   black 
&lt;chr&gt;, hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;\n\n\n\nshootings_by_agency &lt;- shootings_case |&gt;\n  count(agency)\nshootings_by_agency\n\n# A tibble: 108 × 2\n   agency                               n\n   &lt;chr&gt;                            &lt;int&gt;\n 1 Albany Police Department             1\n 2 Albuquerque Police Department       66\n 3 Amtrak Police Department            54\n 4 Atlanta Police Department           36\n 5 Austin Police Department            40\n 6 BART Police Department              13\n 7 Baltimore City Police Department    30\n 8 Baltimore Police Department         30\n 9 Baton Rouge Police Department       15\n10 Boston Police Department            10\n# ℹ 98 more rows\n\nggplot(data = shootings_case,\n       mapping = aes(x = agency)) +\n  geom_bar() +\n  theme(axis.text.x = element_text(angle = 90,\n                                    vjust = 1,\n                                    hjust = 1,\n                                    margin = margin(t = 5, b = 5)))\n\n\n\n\n\n#mapping Locations of Police-Involved Shootings between 2015 and 2023\n\nlibrary(ggmap)\n\nWarning: package 'ggmap' was built under R version 4.3.1\n\n\nℹ Google's Terms of Service: &lt;https://mapsplatform.google.com&gt;\n  Stadia Maps' Terms of Service: &lt;https://stadiamaps.com/terms-of-service/&gt;\n  OpenStreetMap's Tile Usage Policy: &lt;https://operations.osmfoundation.org/policies/tiles/&gt;\nℹ Please cite ggmap if you use it! 
Use `citation(\"ggmap\")` for details.\n\nlibrary(maps)\n\nWarning: package 'maps' was built under R version 4.3.1\n\n\n\nAttaching package: 'maps'\n\nThe following object is masked from 'package:purrr':\n\n    map\n\nlibrary(mapdata)\n\nusa &lt;- map_data(\"usa\")\nstates &lt;- map_data(\"state\")\n\nggplot(data = states) + \n  geom_polygon(aes(x = long, y = lat, fill = group, group = group), color = \"white\") + \n  coord_fixed(1.3) +\n  guides(fill=FALSE) +  # do this to leave off the color legend\n  geom_point(data = shootings_case, aes(x = longitude, y = latitude), color = \"black\", size = .2) +\n  geom_point(data = shootings_case, aes(x = longitude, y = latitude), color = \"red\", size = .1) +\n  labs(title = \"Locations of Police-Involved Shootings between 2015 and 2023\",\n       captions = \"This is only includes cities where we have agency census data.\",\n       x = \"Longitude\",\n       y = \"Latitude\")\n\nWarning: The `&lt;scale&gt;` argument of `guides()` cannot be `FALSE`. 
Use \"none\" instead as\nof ggplot2 3.3.4.\n\n\nWarning: Removed 309 rows containing missing values (`geom_point()`).\nRemoved 309 rows containing missing values (`geom_point()`).\n\n\n\n\n\n\nagencies_census &lt;- agencies_census |&gt;\n  left_join(shootings_by_agency, by = c(\"name\" = \"agency\"))\nagencies_census\n\n# A tibble: 109 × 13\n   name         id state city  police_force_size    all  white `non-white` black\n   &lt;chr&gt;     &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             &lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;       &lt;dbl&gt; &lt;chr&gt;\n 1 Albany P…  2237 NY    Alba…               890 0.185  0.160        0.364 **   \n 2 Albuquer…   508 NM    Albu…              1340 0.616  0.630        0.602 **   \n 3 Amtrak P…  1657 IL    Chic…             12120 0.875  0.872        0.877 0.89…\n 4 Atlanta …   447 GA    Atla…              2950 0.137  0.186        0.111 0.10…\n 5 Austin P…   141 TX    Aust…              1985 0.295  0.195        0.427 0.25 \n 6 Baltimor…  4784 MD    Balt…              2800 0.257  0.133        0.362 0.39…\n 7 Baltimor…   149 MD    Balt…              2800 0.257  0.133        0.362 0.39…\n 8 BART Pol…  2015 CA    Oakl…              1530 0.0948 0.0267       0.160 0.06…\n 9 Baton Ro…  1098 LA    Bato…               980 0.214  0.144        0.321 0.34…\n10 Boston P…     3 MA    Bost…              2560 0.477  0.442        0.583 0.68…\n# ℹ 99 more rows\n# ℹ 4 more variables: hispanic &lt;chr&gt;, asian &lt;chr&gt;, majority &lt;chr&gt;, n &lt;int&gt;\n\nagencies_census |&gt;\n  ggplot(mapping = aes(x = all, y = n, fill=)) +\n  geom_point() +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = FALSE)\n\n\n\nshootings_case |&gt;\n  ggplot(aes(x = majority, fill = armed)) +\n  geom_bar() + \n  labs(title = \"Shootings in Cities where a Majority of Officers Reside\",\n       captions = \"This is only includes shootings where we have agency census data.\",\n       x = \"Does a majority a of the total police force live in the city?\",\n 
      y = \"Number of fatal shootings\",\n       fill = \"Victim Armed?\")\n\n\n\nmajority_mean &lt;- shootings_case |&gt;\n  filter(majority == TRUE) |&gt;\n  count(agency) |&gt;\n  summarize(maj_mean = mean(n))\nmajority_mean\n\n# A tibble: 1 × 1\n  maj_mean\n     &lt;dbl&gt;\n1     32.4\n\nminority_mean &lt;- shootings_case |&gt;\n  filter(majority == FALSE) |&gt;\n  count(agency) |&gt;\n  summarize(min_mean = mean(n))\nminority_mean\n\n# A tibble: 1 × 1\n  min_mean\n     &lt;dbl&gt;\n1       35\n\ndiff_in_means &lt;- majority_mean - minority_mean\ndiff_in_means\n\n  maj_mean\n1   -2.575\n\nknitr::kable(head(diff_in_means))\n\n\n\n\nmaj_mean\n\n\n\n\n-2.575\n\n\n\n\n\n\nfit &lt;- lm(n ~ all, data = agencies_census)\nfit\n\n\nCall:\nlm(formula = n ~ all, data = agencies_census)\n\nCoefficients:\n(Intercept)          all  \n    35.7782      -0.5874  \n\np1 &lt;- get_regression_table(fit)\nknitr::kable(head(p1))\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n35.778\n6.887\n5.195\n0.000\n22.126\n49.430\n\n\nall\n-0.587\n14.902\n-0.039\n0.969\n-30.130\n28.955\n\n\n\n\nshootings_by_agency_census &lt;- shootings_case %&gt;%\n  group_by(agency) %&gt;%\n  count(armed) %&gt;%\n  drop_na(n, armed) %&gt;%\n  right_join(agencies_census, by = c(\"agency\" = \"name\")) |&gt;\n  distinct(armed, .keep_all = TRUE)\n\nWarning in right_join(., agencies_census, by = c(agency = \"name\")): Detected an unexpected many-to-many relationship between `x` and `y`.\nℹ Row 63 of `x` matches multiple rows in `y`.\nℹ Row 2 of `y` matches multiple rows in `x`.\nℹ If a many-to-many relationship is expected, set `relationship =\n  \"many-to-many\"` to silence this warning.\n\nshootings_by_agency_census\n\n# A tibble: 202 × 15\n# Groups:   agency [108]\n   agency          armed   n.x    id state city  police_force_size    all  white\n   &lt;chr&gt;           &lt;chr&gt; &lt;int&gt; &lt;dbl&gt; &lt;chr&gt; &lt;chr&gt;             
&lt;dbl&gt;  &lt;dbl&gt;  &lt;dbl&gt;\n 1 Albany Police … YES       1  2237 NY    Alba…               890 0.185  0.160 \n 2 Albuquerque Po… NO       10   508 NM    Albu…              1340 0.616  0.630 \n 3 Albuquerque Po… YES      54   508 NM    Albu…              1340 0.616  0.630 \n 4 Amtrak Police … NO        8  1657 IL    Chic…             12120 0.875  0.872 \n 5 Amtrak Police … YES      46  1657 IL    Chic…             12120 0.875  0.872 \n 6 Atlanta Police… NO        5   447 GA    Atla…              2950 0.137  0.186 \n 7 Atlanta Police… YES      31   447 GA    Atla…              2950 0.137  0.186 \n 8 Austin Police … NO        3   141 TX    Aust…              1985 0.295  0.195 \n 9 Austin Police … YES      37   141 TX    Aust…              1985 0.295  0.195 \n10 BART Police De… YES      12  2015 CA    Oakl…              1530 0.0948 0.0267\n# ℹ 192 more rows\n# ℹ 6 more variables: `non-white` &lt;dbl&gt;, black &lt;chr&gt;, hispanic &lt;chr&gt;,\n#   asian &lt;chr&gt;, majority &lt;chr&gt;, n.y &lt;int&gt;\n\nshootings_by_agency_census &lt;- shootings_by_agency_census |&gt;\n  select(n.x, armed, all) \n\nAdding missing grouping variables: `agency`\n\nfit_multi &lt;- lm(n.x ~ all + armed, data = shootings_by_agency_census)\nfit_multi\n\n\nCall:\nlm(formula = n.x ~ all + armed, data = shootings_by_agency_census)\n\nCoefficients:\n(Intercept)          all     armedYES  \n      4.117        1.211       24.921  \n\np2 &lt;- get_regression_table(fit_multi)\nknitr::kable(head(p2))\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n4.117\n3.362\n1.225\n0.222\n-2.512\n10.747\n\n\nall\n1.211\n6.335\n0.191\n0.849\n-11.281\n13.702\n\n\narmed: YES\n24.921\n2.891\n8.619\n0.000\n19.219\n30.622\n\n\n\n\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x)) +\n  geom_jitter(jitter = 15, alpha = 0.5) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = FALSE) +\n  labs(title = \"Number of 
Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\")\n\nWarning in geom_jitter(jitter = 15, alpha = 0.5): Ignoring unknown parameters:\n`jitter`\n\n\n\n\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x, color = armed)) +\n  geom_jitter(jitter = 15, alpha = 0.5) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = FALSE) +\n  labs(title = \"Number of Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\")\n\nWarning in geom_jitter(jitter = 15, alpha = 0.5): Ignoring unknown parameters:\n`jitter`"
-  },
-  {
-    "objectID": "background.html#the-washington-post-fatal-force-database",
-    "href": "background.html#the-washington-post-fatal-force-database",
-    "title": "Background",
-    "section": "The Washington Post Fatal Force Database",
-    "text": "The Washington Post Fatal Force Database\nIn 2015, The Washington Post began tracking details about each police-involved killing in the United States — the race of the deceased, the circumstances of the shooting, whether the person was armed and whether the person was experiencing a mental-health crisis — by manually culling local news reports, collecting information from law enforcement websites and social media, and monitoring independent databases such as Fatal Encounters and the now-defunct Killed by Police project. In many cases, The Post conducts additional reporting.\nIn 2022, The Post updated its database to standardize and publish the names of the police agencies involved in each shooting to better measure accountability at the department level.\nThe 2014 killing of Michael Brown in Ferguson, Mo. began a protest movement culminating in the Black Lives Matter movement and an increased focus on police accountability nationwide. In this data set, The Post tracks only shootings with circumstances closely paralleling those like the killing of Brown — incidents in which a police officer, in the line of duty, shoots and kills a civilian. The Post is not tracking deaths of people in police custody, fatal shootings by off-duty officers or non-shooting deaths in this data set.\nThe FBI and the Centers for Disease Control and Prevention log fatal shootings by police, but officials acknowledge that their data is incomplete. Since 2015, The Post has documented more than twice as many fatal shootings by police as recorded by federal officials on average annually. That gap has widened in recent years, as the FBI in 2021 tracked only a third of departments’ fatal shootings."
+    "text": "library(tidyverse)\nlibrary(usmap)\nlibrary(sf)\nlibrary(infer)\nlibrary(moderndive)\n\n\n##Tidying Data\n\n#creating dfs from .csv files\npolice_locals &lt;- read_csv(\"data/police-locals.csv\")\nagencies &lt;- read_csv(\"data/fatal-police-shootings-agencies.csv\")\nshootings &lt;- read_csv(\"data/fatal-police-shootings-data.csv\")\n\n#removing old `city` tag from data set that we created when decatenated the city names\npolice_locals &lt;- police_locals |&gt;\n  select(-city_old)\n\n# creating `agencies` df with just police departments\nagencies &lt;- agencies |&gt;\n  filter(grepl(\"department\", tolower(name))) |&gt;\n  filter(!grepl(\"county\", tolower(name)))\n\n#creating binned categorical account of if shooting victim was `armed`\nshootings &lt;- shootings |&gt;\n  mutate(armed = case_when(is.na(armed_with) ~ \"NO\",\n                           armed_with == \"unarmed\" ~ \"NO\",\n                           armed_with == \"unknown\" ~ \"NO\",\n                           armed_with == \"undetermined\" ~ \"NO\",\n                           armed_with == \"gun\" ~ \"YES\",\n                           armed_with == \"knife\" ~ \"YES\",\n                           armed_with == \"blunt_object\" ~ \"YES\",\n                           armed_with == \"other\" ~ \"YES\",\n                           armed_with == \"replica\" ~ \"YES\",\n                           armed_with == \"vehicle\" ~ \"YES\"))\n\n#creating df with only agency `names`, `id`, and `state`\nagencies_ids &lt;- agencies |&gt;\n  select(name, id, state)\n\n#creating df with `city`, `agency`, and `state` info for each shooting\nshooting_agencies &lt;- shootings |&gt;\n  select(city, agency_ids, state)\n\n#changing `shooting` var in `shooting_agencies` df to numeric\nshooting_agencies$agency_ids &lt;- as.numeric(shootings$agency_ids)\n\n#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`\nagencies_w_cities &lt;- agencies_ids |&gt;\n  
left_join(shooting_agencies, by = c(\"id\" = \"agency_ids\", \"state\" = \"state\")) |&gt;\n  drop_na(city) |&gt;\n  distinct(id, .keep_all = TRUE)\n\n#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`\nagencies_census &lt;- agencies_w_cities |&gt;\n  full_join(police_locals, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  drop_na(police_force_size) |&gt;\n  distinct(id, .keep_all = TRUE) |&gt;\n  mutate(majority = if_else(all &gt;= 0.5, \"TRUE\", \"FALSE\"))\n\n#creating df of only shootings involving agencies within `agencies` df\nshootings_case &lt;- shootings |&gt;\n  right_join(agencies_census, by = c(\"city\" = \"city\", \"state\" = \"state\")) |&gt;\n  select(-agency_ids) |&gt;\n  rename(agency_ids = id.y, id = id.x, agency = name.y, victim = name.x) |&gt;\n  select(-location_precision, -race_source)\n\n\n#count shootings by agency\nshootings_by_agency &lt;- shootings_case |&gt;\n  count(agency)\n\n#find top 25 agencies with the most shootings\ntop_25_agencies &lt;- shootings_by_agency |&gt;\n  slice_max(n, n = 25)\n\n# visulize top 25 agencies with the most shootings\nggplot(data = top_25_agencies,\n       mapping = aes(x = agency, y = n)) +\n  geom_col() +\n  theme(axis.text.x = element_text(angle = 75,\n                                    vjust = 1,\n                                    hjust = 1,\n                                    margin = margin(t = 5, b = 5)))\n\n\n\n\n\n#mapping Locations of Police-Involved Shootings between 2015 and 2023\n\n#load geo-viz libraries\nlibrary(ggmap)\nlibrary(maps)\nlibrary(mapdata)\n\n#create blank map\nusa &lt;- map_data(\"usa\")\nstates &lt;- map_data(\"state\")\n\n#add locations of shootings to maps\nshot_map &lt;- ggplot(data = states) + \n  geom_polygon(aes(x = long, y = lat, fill = group, group = group), color = \"white\") + \n  coord_fixed(1.3) +\n  guides(fill=FALSE) +  # do this to leave off the color legend\n  geom_point(data = shootings_case, aes(x = 
longitude, y = latitude), color = \"black\", size = .2) +\n  geom_point(data = shootings_case, aes(x = longitude, y = latitude), color = \"red\", size = .1) +\n  labs(title = \"Locations of Police-Involved Shootings between 2015 and 2023\",\n       captions = \"This is only includes cities where we have agency census data.\",\n       x = \"Longitude\",\n       y = \"Latitude\")\n\n\n#creating df with total shootings per agency and census data\nagencies_census &lt;- agencies_census |&gt;\n  left_join(shootings_by_agency, by = c(\"name\" = \"agency\"))\n\n#prelim visualization of relationship between percentage of officer residency and number of fatal shootings per agency\nagencies_census |&gt;\n  ggplot(mapping = aes(x = all, y = n)) +\n  geom_point() +\n  geom_smooth(method = \"lm\", se = TRUE)\n\n\n\n#creating visualization of comparison Shootings in Cities where a Majority/Minority of Officers Reside\np0 &lt;- shootings_case |&gt;\n  ggplot(aes(x = majority, fill = armed)) +\n  geom_bar() + \n  labs(title = \"Shootings in Cities where a Majority of Officers Reside\",\n       caption = \"This is only includes shootings where we have agency census data.\",\n       x = \"Does a majority a of the total police force live in the city?\",\n       y = \"Number of fatal shootings\",\n       fill = \"Victim Armed?\")\np0\n\n\n\n#calculate mean number of shootings per agency in cities where a majority of officers reside in the city\nmajority_mean &lt;- shootings_case |&gt;\n  filter(majority == TRUE) |&gt;\n  count(agency) |&gt;\n  summarize(maj_mean = mean(n))\n\n#calculate mean number of shootings per agency in cities where a minority of officers reside in the city\nminority_mean &lt;- shootings_case |&gt;\n  filter(majority == FALSE) |&gt;\n  count(agency) |&gt;\n  summarize(min_mean = mean(n))\n\n#calculate a difference in means between the `majority` and `minority`\ndiff_in_means &lt;- majority_mean - minority_mean\ndiff_in_means\n\n  maj_mean\n1   -2.575\n\n#tidy 
table\nknitr::kable(head(diff_in_means))\n\n\n\n\nmaj_mean\n\n\n\n\n-2.575\n\n\n\n\n\n\n#fit single linear regression model for correlation between percentage of officer residency and number of fatal shootings per agency\nfit &lt;- lm(n ~ all, data = agencies_census)\nfit\n\n\nCall:\nlm(formula = n ~ all, data = agencies_census)\n\nCoefficients:\n(Intercept)          all  \n    35.7782      -0.5874  \n\n#tidy `fit`\np1 &lt;- get_regression_table(fit)\nknitr::kable(head(p1))\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n35.778\n6.887\n5.195\n0.000\n22.126\n49.430\n\n\nall\n-0.587\n14.902\n-0.039\n0.969\n-30.130\n28.955\n\n\n\n\n#add `armed` and `majority` to `shootings_by_agency` df\nshootings_by_agency_census &lt;- shootings_case |&gt;\n  group_by(agency) |&gt;\n  count(armed) |&gt;\n  drop_na(n, armed) |&gt;\n  right_join(agencies_census, by = c(\"agency\" = \"name\")) |&gt;\n  distinct(armed, .keep_all = TRUE)\n\nshootings_by_agency_census &lt;- shootings_by_agency_census |&gt;\n  select(n.x, armed, all) \n\n#fit multiple linear regression model for correlation between percentage of officer residency and victim armament and number of fatal shootings per agency\nfit_multi &lt;- lm(n.x ~ all + armed, data = shootings_by_agency_census)\nfit_multi\n\n\nCall:\nlm(formula = n.x ~ all + armed, data = shootings_by_agency_census)\n\nCoefficients:\n(Intercept)          all     armedYES  \n      4.117        1.211       24.921  \n\n#tidy `fit_multi`\np2 &lt;- get_regression_table(fit_multi)\nknitr::kable(head(p2))\n\n\n\n\n\n\n\n\n\n\n\n\n\nterm\nestimate\nstd_error\nstatistic\np_value\nlower_ci\nupper_ci\n\n\n\n\nintercept\n4.117\n3.362\n1.225\n0.222\n-2.512\n10.747\n\n\nall\n1.211\n6.335\n0.191\n0.849\n-11.281\n13.702\n\n\narmed: YES\n24.921\n2.891\n8.619\n0.000\n19.219\n30.622\n\n\n\n\n#visualize polynomial relationship between percentage of officer residency and number of fatal shootings per 
agency\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x)) +\n  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = TRUE) +\n  labs(title = \"Number of Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\")\n\n\n\n#visualize polynomial relationship between percentage of officer residency and victim armament and number of fatal shootings per agency\nggplot(data = shootings_by_agency_census, aes(x = all, y = n.x, color = armed)) +\n  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +\n  geom_smooth(method = \"lm\", formula = y ~ poly(x, 2), se = TRUE) +\n  labs(title = \"Number of Shootings on a Scale of Police Force Residency\",\n       x = \"Percentage of the total police force that lives in the city\",\n       y = \"Number of fatal shootings in that city\",\n       color = \"Victim Armed?\")\n\n\n\n\nThe model equation for fit is:\n[ = 35.7782 - 0.5874 ]\nInterpretation:\n\nThe intercept, \\(35.7782\\), is the estimated number of fatal shootings when the percentage of officer in-city residency (all) is \\(0\\). For each one-unit increase in the percentage of officer residency, the number of fatal shootings is expected to decrease by \\(0.5874\\) (\\(-0.5874\\)) units, assuming all other factors remain constant.\n\nThis model suggests that there is a negative association between the percentage of officer residency and the number of fatal shootings. 
However, it’s important to interpret the results in the context of your data and consider potential confounding factors, like whether or not the victim was armed.\nThe model equation for fit_multi considering victim armament (armed) is:\n[ = 4.117 + 1.211 + 24.921 ]\n\nThe intercept, \\(4.117\\), is the estimated number of fatal shootings where the percentage of officer in-city residency (all) is \\(0\\) and the victim was un-armed. For each one-unit increase in the percentage of in-city officer residency compared to the total force (all), we expect an increase of \\(1.211\\) fatal shootings, assuming the victim’s armament status (armedYES) remains constant.\nThe coefficient for ‘armedYES’, \\(24.921\\), indicates that the victim is armed (armed is YES), we expect an increase of \\(24.921\\) fatal shootings compared to when the victim is not armed (armed is No), assuming the percentage of officer residency (all) remains constant.\n\nIn summary, the model suggests that the percentage of officer residency and whether the victim is armed are associated with the number of fatal shootings per agency even as we control for victim armament. 
However, as correlation does not imply causation, and other factors not included in the model may influence the outcomes.\n\n#generate null distribution\nnull_dist &lt;- agencies_census |&gt;\n  specify(n ~ majority) |&gt;\n  hypothesize(null = \"independence\") |&gt;\n  generate(reps = 1000, type = \"permute\") |&gt;\n  calculate(stat = \"diff in means\", order = c(\"TRUE\", \"FALSE\"))\n\n#compute observed test statistic\ntest_stat &lt;- agencies_census |&gt;\n  specify(n ~ majority) |&gt;\n  calculate(stat = \"diff in means\", order = c(\"TRUE\", \"FALSE\"))\n\n#visualize p-value\nnull_dist |&gt;\n  visualize() +\n  shade_p_value(obs_stat = test_stat, direction = \"less\")\n\n\n\n#compute p-value\n  null_dist |&gt;\n  get_p_value(obs_stat = test_stat, direction = \"less\")\n\n# A tibble: 1 × 1\n  p_value\n    &lt;dbl&gt;\n1   0.263\n\n\nInference for a Difference in Means\n\n\\(H_0\\): The mean total number of fatal shootings per agencies does not differ based on if a majority of the officers live in the city or not.\n\\(H_A\\): The mean total number of fatal shootings per agencies is fewer in cities where a majority of the officers live in the city then cities where they do not.\n\n– \\(H_0 : \\mu_{maj} − \\mu_{min} = 0\\), or equivalently \\(H_0 : \\mu_{maj} = \\mu_{min}\\) – \\(H_A : \\mu_{maj} − \\mu_{min} &lt; 0\\), or equivalently \\(H_A : \\mu_{maj} &lt; \\mu_{min}\\)\n\n#generate null distribution\nnull_dist_cor &lt;- agencies_census |&gt;\n  specify(n ~ white) |&gt;\n  hypothesize(null = \"independence\") |&gt;\n  generate(reps = 1000, type = \"permute\") |&gt;\n  calculate(stat = \"correlation\")\n\n#compute observed test statistic\ntest_stat_cor &lt;- agencies_census |&gt;\n  specify(n ~ white) |&gt;\n  calculate(stat = \"correlation\")\ntest_stat_cor\n\nResponse: n (numeric)\nExplanatory: white (numeric)\n# A tibble: 1 × 1\n     stat\n    &lt;dbl&gt;\n1 -0.0470\n\n#visualize p-value\nnull_dist_cor |&gt;\n  visualize() +\n  shade_p_value(obs_stat = 
test_stat, direction = \"two.sided\")\n\n\n\n#compute p-value\nnull_dist_cor |&gt;\n  get_p_value(obs_stat = test_stat, direction = \"two.sided\")\n\n# A tibble: 1 × 1\n  p_value\n    &lt;dbl&gt;\n1       0\n\n\nInference for a Correlation\n\n\\(H_O\\): There is no relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.\n\\(H_A\\): There is a relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.\n\n\\(H_0 : \\rho = 0\\)\n\\(H_0 : \\rho \\neq 0\\)"
   }
 ]
\ No newline at end of file
diff --git a/about.qmd b/about.qmd
deleted file mode 100644
index 0c6f4a2..0000000
--- a/about.qmd
+++ /dev/null
@@ -1,9 +0,0 @@
----
-title: "About"
----
-
-About this site
-
-```{r}
-1 + 1
-```
diff --git a/background.qmd b/background.qmd
index 1b759fb..3a977ec 100644
--- a/background.qmd
+++ b/background.qmd
@@ -4,7 +4,54 @@ title: "Background"
 
 > # On average, police in the United States shoot and kill more than 1,000 people every year, according to an ongoing analysis by The Washington Post. {style="red"}
 
-## The Washington Post Fatal Force Database
+# Proposal
+
+We propose a case study of the relationship between where police officers live and fatal police shootings, using data science methods. Focusing on the share of officers who reside in the cities they serve, the project looks for patterns that contribute to a more nuanced understanding of this complex issue.
+
+### **Objectives:**
+
+1.  Investigate the correlation between police residence and fatal police shootings.
+2.  Utilize a comprehensive dataset spanning 2015 to 2023, focusing on police agencies involved in at least one fatal shooting.
+3.  Apply advanced statistical methods and machine learning techniques to identify patterns and potential biases.
+4.  Examine disparities in incident rates based on officers' residency status, considering demographic, socioeconomic, and policing variables.
+
+**Methodology:**
+
+a.  Data Collection:
+    -   Compile a dataset comprising information on police agencies involved in fatal police shootings.
+    -   Compile a data set of census variables such as officer residency, race, community demographics, and departmental policies.
+b.  Analysis:
+    -   Employ advanced statistical methods and machine learning techniques to discern patterns and correlations.
+    -   Conduct a comprehensive exploration of variables influencing fatal police shootings.
+
+**Hypothesis and Expected Outcomes:**
+
+We will conduct two hypothesis tests to analyze both:
+
+-   the numerical relationship between the proportion of officers who live in the city they serve and the number of fatal police shooting deaths, and
+
+-   the categorical difference in fatal police shooting deaths between cities where a majority or a minority of police officers live in the city.
+
+1.  Inference for a Difference in Means
+
+    -   $H_0$: The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.
+
+    -   $H_A$: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.
+
+        -   $H_0 : \mu_{maj} - \mu_{min} = 0$, or equivalently $H_0 : \mu_{maj} = \mu_{min}$
+        -   $H_A : \mu_{maj} - \mu_{min} < 0$, or equivalently $H_A : \mu_{maj} < \mu_{min}$
+
+2.  Inference for a Correlation
+
+    -   $H_0$: There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.
+
+    -   $H_A$: There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.
+
+        -   $H_0 : \rho = 0$
+
+        -   $H_A : \rho \neq 0$
+
+# The Washington Post Fatal Force Database
 
 In 2015, The Washington Post [began tracking](https://www.washingtonpost.com/graphics/investigations/police-shootings-database/) details about each police-involved killing in the United States --- the race of the deceased, the circumstances of the shooting, whether the person was armed and whether the person was experiencing a mental-health crisis --- by manually culling local news reports, collecting information from law enforcement websites and social media, and monitoring independent databases such as [Fatal Encounters](https://fatalencounters.org/) and the now-defunct Killed by Police project. In many cases, The Post conducts additional reporting.
 
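Reviewer note on the `background.qmd` hunk above: the two proposed tests can be written in terms of sample statistics. This is only a sketch, assuming the permutation approach used in `data.qmd` is the intended method. The symbols $\hat{\delta}$, $\bar{n}_{maj}$, $\bar{n}_{min}$, $r$, $x_i$, and $y_i$ are introduced here for illustration and do not appear in the proposal; $x_i$ stands for an agency's `all` value and $y_i$ for its count of fatal shootings.

$$
\hat{\delta} = \bar{n}_{maj} - \bar{n}_{min},
\qquad
r = \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_i (x_i - \bar{x})^2}\,\sqrt{\sum_i (y_i - \bar{y})^2}}
$$

Under this reading, the one-sided test of $H_A : \mu_{maj} < \mu_{min}$ would report the share of permuted differences that are at most $\hat{\delta}$, and the two-sided test of $H_A : \rho \neq 0$ would report how often a permuted correlation is at least as extreme as $r$ in either direction.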
diff --git a/codebook.qmd b/codebook.qmd
new file mode 100644
index 0000000..370941e
--- /dev/null
+++ b/codebook.qmd
@@ -0,0 +1,44 @@
+---
+title: "Codebook"
+---
+
+## Explanatory Variables
+
+| Name                | Description                                                             |
+|-------------------------|-----------------------------------------------|
+| `city`              | U.S. city                                                               |
+| `police_force_size` | Number of police officers serving that city                             |
+| `all`               | Percentage of the total police force that lives in the city             |
+| `white`             | Percentage of white (non-Hispanic) police officers who live in the city |
+| `non-white`         | Percentage of non-white police officers who live in the city            |
+| `black`             | Percentage of black police officers who live in the city                |
+| `hispanic`          | Percentage of Hispanic police officers who live in the city             |
+| `asian`             | Percentage of Asian police officers who live in the city                |
+
+### Incident Information
+
+| Name          | Description                                                                                                                                                                                                                                                                 |
+|-------------------------|------------------------------------------------|
+| `id`          | A unique identifier for each fatal police shooting incident.                                                                                                                                                                                                                |
+| `date`        | The date of the fatal shooting.                                                                                                                                                                                                                                             |
+| `body_camera` | Whether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.                                                                                                                                             |
+| `city`        | The municipality where the fatal shooting took place                                                                                                                                                                                                                        |
+| `county`      | County where the fatal shooting took place.                                                                                                                                                                                                                                 |
+| `state`       | The two-letter postal code abbreviation for the state in which the fatal shooting took place.                                                                                                                                                                               |
+| `latitude`    | The latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level. |
+| `longitude`   | The longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.                                                                                                                                                                             |
+
+### Agency Information
+
+| Name    | Description                           |
+|---------|---------------------------------------|
+| `id`    | Department Database Id                |
+| `name`  | Department Name                       |
+| `state` | State in which the agency is located. |
+
+## Project thoughts
+
+I am interested in exploring data related to...
+
+-   Political Extremism
+-   Black American Opinion
diff --git a/data.qmd b/data.qmd
index 2323c87..a4a5619 100644
--- a/data.qmd
+++ b/data.qmd
@@ -2,7 +2,11 @@
 title: "Data"
 ---
 
-```{r}
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(warning = FALSE, message = FALSE) 
+```
+
+```{r packages}
 library(tidyverse)
 library(usmap)
 library(sf)
@@ -10,8 +14,7 @@ library(infer)
 library(moderndive)
 ```
 
-
-```{r Tidyin}
+```{r Tidying and Wrangling Data}
 
 ##Tidying Data
 
@@ -45,12 +48,10 @@ shootings <- shootings |>
 #creating df with only agency `names`, `id`, and `state`
 agencies_ids <- agencies |>
   select(name, id, state)
-agencies_ids
 
 #creating df with `city`, `agency`, and `state` info for each shooting
 shooting_agencies <- shootings |>
   select(city, agency_ids, state)
-shooting_agencies
 
 #changing `shooting` var in `shooting_agencies` df to numeric
 shooting_agencies$agency_ids <- as.numeric(shootings$agency_ids)
@@ -60,7 +61,6 @@ agencies_w_cities <- agencies_ids |>
   left_join(shooting_agencies, by = c("id" = "agency_ids", "state" = "state")) |>
   drop_na(city) |>
   distinct(id, .keep_all = TRUE)
-agencies_w_cities
 
 #creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`
 agencies_census <- agencies_w_cities |>
@@ -68,7 +68,6 @@ agencies_census <- agencies_w_cities |>
   drop_na(police_force_size) |>
   distinct(id, .keep_all = TRUE) |>
   mutate(majority = if_else(all >= 0.5, "TRUE", "FALSE"))
-agencies_census
 
 #creating df of only shootings involving agencies within `agencies` df
 shootings_case <- shootings |>
@@ -76,21 +75,18 @@ shootings_case <- shootings |>
   select(-agency_ids) |>
   rename(agency_ids = id.y, id = id.x, agency = name.y, victim = name.x) |>
   select(-location_precision, -race_source)
-shootings_case
 
 ```
 
-```{r}
+```{r Counting Shootings}
 
 #count shootings by agency
 shootings_by_agency <- shootings_case |>
   count(agency)
-shootings_by_agency
 
 #find top 25 agencies with the most shootings
 top_25_agencies <- shootings_by_agency |>
   slice_max(n, n = 25)
-top_25_agencies
 
 # visulize top 25 agencies with the most shootings
 ggplot(data = top_25_agencies,
@@ -102,18 +98,22 @@ ggplot(data = top_25_agencies,
                                     margin = margin(t = 5, b = 5)))
 
 ```
-```{r}
+
+```{r shot_map}
 
 #mapping Locations of Police-Involved Shootings between 2015 and 2023
 
+#load geo-viz libraries
 library(ggmap)
 library(maps)
 library(mapdata)
 
+#create blank map
 usa <- map_data("usa")
 states <- map_data("state")
 
-ggplot(data = states) + 
+#add locations of shootings to maps
+shot_map <- ggplot(data = states) + 
   geom_polygon(aes(x = long, y = lat, fill = group, group = group), color = "white") + 
   coord_fixed(1.3) +
   guides(fill=FALSE) +  # do this to leave off the color legend
@@ -131,7 +131,6 @@ ggplot(data = states) +
 #creating df with total shootings per agency and census data
 agencies_census <- agencies_census |>
   left_join(shootings_by_agency, by = c("name" = "agency"))
-agencies_census
 
 #prelim visualization of relationship between percentage of officer residency and number of fatal shootings per agency
 agencies_census |>
@@ -155,14 +154,12 @@ majority_mean <- shootings_case |>
   filter(majority == TRUE) |>
   count(agency) |>
   summarize(maj_mean = mean(n))
-majority_mean
 
 #calculate mean number of shootings per agency in cities where a minority of officers reside in the city
 minority_mean <- shootings_case |>
   filter(majority == FALSE) |>
   count(agency) |>
   summarize(min_mean = mean(n))
-minority_mean
 
 #calculate a difference in means between the `majority` and `minority`
 diff_in_means <- majority_mean - minority_mean
@@ -174,7 +171,6 @@ knitr::kable(head(diff_in_means))
 
 ```
 
-
 ```{r}
 
 #fit single linear regression model for correlation between percentage of officer residency and number of fatal shootings per agency
@@ -186,13 +182,12 @@ p1 <- get_regression_table(fit)
 knitr::kable(head(p1))
 
 #add `armed` and `majority` to `shootings_by_agency` df
-shootings_by_agency_census <- shootings_case %>%
-  group_by(agency) %>%
-  count(armed) %>%
-  drop_na(n, armed) %>%
+shootings_by_agency_census <- shootings_case |>
+  group_by(agency) |>
+  count(armed) |>
+  drop_na(n, armed) |>
   right_join(agencies_census, by = c("agency" = "name")) |>
   distinct(armed, .keep_all = TRUE)
-shootings_by_agency_census
 
 shootings_by_agency_census <- shootings_by_agency_census |>
   select(n.x, armed, all) 
@@ -225,69 +220,90 @@ ggplot(data = shootings_by_agency_census, aes(x = all, y = n.x, color = armed))
 
 ```
 
+The model equation for `fit` is:
+
+$$ \text{Number of Fatal Shootings (n)} = 35.7782 - 0.5874 \times \text{Percentage of Officer Residency (all)} $$
+
+Interpretation:
+
+-   The intercept, $35.7782$, is the estimated number of fatal shootings when officer in-city residency (`all`) is $0$. For each one-unit increase in `all`, the number of fatal shootings per agency is expected to decrease by $0.5874$.
+
+This model suggests a negative association between officer residency and the number of fatal shootings. However, the result should be interpreted in the context of the data, and the model does not account for potential confounding factors such as whether the victim was armed.
+
+The model equation for `fit_multi` considering victim armament (`armed`) is:
+
+$$ \text{Number of Fatal Shootings (n.x)} = 4.117 + 1.211 \times \text{Percentage of Officer Residency (all)} + 24.921 \times \text{Armed (YES)} $$
+
+-   The intercept, $4.117$, is the estimated number of fatal shootings when officer in-city residency (`all`) is $0$ and the victim was unarmed. For each one-unit increase in `all`, we expect an increase of $1.211$ fatal shootings, holding the victim's armament status (`armed`) constant.
+
+-   The coefficient for `armedYES`, $24.921$, indicates that an agency's expected count of fatal shootings of armed victims (`armed` is `YES`) is $24.921$ higher than its count for unarmed victims (`armed` is `NO`), holding officer residency (`all`) constant.
+
+In summary, the model suggests that both officer residency and victim armament are associated with the number of fatal shootings per agency. However, correlation does not imply causation, and other factors not included in the model may influence the outcomes.
+
 ```{r Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop}
 
 #generate null distribution
-null_dist <- agencies_census %>%
-  specify(n ~ majority) %>%
-  hypothesize(null = "independence") %>%
-  generate(reps = 1000, type = "permute") %>%
+null_dist <- agencies_census |>
+  specify(n ~ majority) |>
+  hypothesize(null = "independence") |>
+  generate(reps = 1000, type = "permute") |>
   calculate(stat = "diff in means", order = c("TRUE", "FALSE"))
 
 #compute observed test statistic
-test_stat <- agencies_census %>%
-  specify(n ~ majority) %>%
+test_stat <- agencies_census |>
+  specify(n ~ majority) |>
   calculate(stat = "diff in means", order = c("TRUE", "FALSE"))
 
 #visualize p-value
-null_dist %>%
+null_dist |>
   visualize() +
   shade_p_value(obs_stat = test_stat, direction = "less")
 
 #compute p-value
-  null_dist %>%
+null_dist |>
   get_p_value(obs_stat = test_stat, direction = "less")
 
 ```
-Inference for a Difference in Proportions
 
-- $H_0$: The mean total number of fatal shootings per agencies does not differ based on if a majority of the officers live in the city or not.
-- $H_A$: The mean total number of fatal shootings per agencies is fewer in cities where a majority of the officers live in the city then cities where they do not.
+Inference for a Difference in Means
 
-– $H_0 : p_{maj} − p_{min} = 0$, or equivalently $H_0 : p_{maj} = p_{min}$
-– $H_A : p_{maj} − p_{min} < 0$, or equivalently $H_A : p_{maj} < p_{min}$
+-   $H_0$: The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.
+-   $H_A$: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.
 
+Formally, $H_0 : \mu_{maj} - \mu_{min} = 0$ (equivalently $\mu_{maj} = \mu_{min}$) versus $H_A : \mu_{maj} - \mu_{min} < 0$ (equivalently $\mu_{maj} < \mu_{min}$).
 
 ```{r Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop}
 
 #generate null distribution
-null_dist_cor <- agencies_census %>%
-  specify(n ~ white) %>%
-  hypothesize(null = "independence") %>%
-  generate(reps = 1000, type = "permute") %>%
+null_dist_cor <- agencies_census |>
+  specify(n ~ white) |>
+  hypothesize(null = "independence") |>
+  generate(reps = 1000, type = "permute") |>
   calculate(stat = "correlation")
 
 #compute observed test statistic
-test_stat_cor <- agencies_census %>%
-  specify(n ~ white) %>%
+test_stat_cor <- agencies_census |>
+  specify(n ~ white) |>
   calculate(stat = "correlation")
 test_stat_cor
 
 #visualize p-value
-null_dist_cor %>%
+null_dist_cor |>
   visualize() +
-  shade_p_value(obs_stat = test_stat, direction = "greater")
+  shade_p_value(obs_stat = test_stat_cor, direction = "two.sided")
 
 #compute p-value
-null_dist_cor %>%
-  get_p_value(obs_stat = test_stat, direction = "greater")
+null_dist_cor |>
+  get_p_value(obs_stat = test_stat_cor, direction = "two.sided")
 
 ```
 
 Inference for a Correlation
 
-- $H_O$: There is no relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.
-- $H_O$: There is a relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.
+-   $H_0$: There is no relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.
+
+-   $H_A$: There is a relationship between percentage of the total police force that lives in the city they serve and number of fatal shootings.
+
+    -   $H_0 : \rho = 0$
 
-– $H_0 : \rho = 0$
-– $H_0 : \rho \neq 0$
+    -   $H_A : \rho \neq 0$
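Reviewer note on the `data.qmd` hunks above: the two permutation tests can be sketched end to end on made-up data. This is a minimal illustration, not the project's analysis. `toy_census` and every value in it are hypothetical stand-ins for `agencies_census`, and the sketch correlates `n` with `all` to match the stated hypothesis, whereas the chunk in the diff specifies `n ~ white`. Each observed statistic is paired with the null distribution built from the same `specify()` formula.

```r
# Sketch only: synthetic stand-in for `agencies_census` (all values are made up).
library(infer)

set.seed(2023)
toy_census <- data.frame(
  n   = as.numeric(rpois(60, lambda = 20)),  # hypothetical shootings per agency
  all = runif(60, min = 0.05, max = 0.95)    # hypothetical share of officers living in the city
)
# Same character coding of the majority flag as in the diff
toy_census$majority <- ifelse(toy_census$all >= 0.5, "TRUE", "FALSE")

# Test 1: one-sided difference in means (majority minus minority)
obs_diff <- toy_census |>
  specify(n ~ majority) |>
  calculate(stat = "diff in means", order = c("TRUE", "FALSE"))

null_diff <- toy_census |>
  specify(n ~ majority) |>
  hypothesize(null = "independence") |>
  generate(reps = 1000, type = "permute") |>
  calculate(stat = "diff in means", order = c("TRUE", "FALSE"))

p_diff <- get_p_value(null_diff, obs_stat = obs_diff, direction = "less")

# Test 2: two-sided correlation between residency share and shooting count
obs_cor <- toy_census |>
  specify(n ~ all) |>
  calculate(stat = "correlation")

null_cor <- toy_census |>
  specify(n ~ all) |>
  hypothesize(null = "independence") |>
  generate(reps = 1000, type = "permute") |>
  calculate(stat = "correlation")

p_cor <- get_p_value(null_cor, obs_stat = obs_cor, direction = "two-sided")
```

Keeping the observed statistic and the null distribution in matching pairs (`obs_diff` with `null_diff`, `obs_cor` with `null_cor`) avoids comparing a difference in means against a distribution of correlations, which would give a meaningless p-value.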
diff --git a/index.qmd b/index.qmd
index 27acd93..1698490 100644
--- a/index.qmd
+++ b/index.qmd
@@ -14,9 +14,26 @@ output:
     toc: true
     toc_float: true
     code_folding: true
+    
+format:
+  html:
+    code-fold: true
+    code-summary: "Show the code"
 ---
 
-## Proposal
+```{r setup, include=FALSE}
+knitr::opts_chunk$set(warning = FALSE, message = FALSE) 
+
+library(tidyverse)
+library(usmap)
+library(sf)
+library(infer)
+library(moderndive)
+```
+
+> # On average, police in the United States shoot and kill more than 1,000 people every year...and then they go home to their families
+
+## Abstract {#sec-abstract}
 
 This case study investigates the intricate relationship between police residence and fatal police shootings, employing a data science approach to uncover insights and patterns within the context of law enforcement agencies. Focused on police officers residing in the cities they serve, the study examines whether this residency factor correlates with the incidence of fatal police shootings. The data set, spanning the years 2015 to 2023, is composed of information on police agencies involved in at least one fatal shooting, and is subjected to rigorous analysis using advanced statistical methods and machine learning techniques.
 
@@ -24,43 +41,324 @@ This study aims to discern patterns, trends, and potential biases associated wit
 
 The insights derived from this case study bear substantial implications for informing public policy, refining police training protocols, and strengthening community relations. By unraveling the nuanced dynamics surrounding police residence and fatal police shootings, this case study aims to provide evidence-based recommendations to enhance transparency, accountability, and trust between law enforcement agencies and the communities they serve. In doing so, it contributes to the broader discourse on police reform, fostering a data-driven approach to address critical issues and promote safer, more resilient communities.
 
-### Explanatory Variables
-
-| Name                | Description                                                             |
-|------------------------|------------------------------------------------|
-| `city`              | U.S. city                                                               |
-| `police_force_size` | Number of police officers serving that city                             |
-| `all`               | Percentage of the total police force that lives in the city             |
-| `white`             | Percentage of white (non-Hispanic) police officers who live in the city |
-| `non-white`         | Percentage of non-white police officers who live in the city            |
-| `black`             | Percentage of black police officers who live in the city                |
-| `hispanic`          | Percentage of Hispanic police officers who live in the city             |
-| `asian`             | Percentage of Asian police officers who live in the city                |
-
-**Incident Information**
-
-| Name          | Description                                                                                                                                                                                                                                                                 |
-|-----------------------|------------------------------------------------|
-| `id`          | A unique identifier for each fatal police shooting incident.                                                                                                                                                                                                                |
-| `date`        | The date of the fatal shooting.                                                                                                                                                                                                                                             |
-| `body_camera` | Whether news reports have indicated an officer was wearing a body camera and it may have recorded some portion of the incident.                                                                                                                                             |
-| `city`        | The municipality where the fatal shooting took place                                                                                                                                                                                                                        |
-| `county`      | County where the fatal shooting took place.                                                                                                                                                                                                                                 |
-| `state`       | The two-letter postal code abbreviation for the state in which the fatal shooting took place.                                                                                                                                                                               |
-| `latitude`    | The latitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses. Please note that the precision and accuracy of incident coordinates varies depending on the precision of the input address which is often only available at the block level. |
-| `longitude`   | The longitude location of the shooting expressed as WGS84 coordinates, geocoded from addresses.                                                                                                                                                                             |
-
-**Agency Information**
-
-|         | Description                           |
-|---------|---------------------------------------|
-| `id`    | Department Database Id                |
-| `name`  | Department Name                       |
-| `state` | State in which the agency is located. |
-
-## Project thoughts
-
-I am interested in exploring data related to...
-
--   Political Extremism
--   Black American Opinion
+## Hypotheses
+
+We will conduct two hypothesis tests to analyze both:
+
+1.  The categorical difference in fatal police shooting deaths between cities where a majority or a minority of police officers live in the city.
+
+    -   $H_0$: The mean total number of fatal shootings per agency does not differ based on whether a majority of the officers live in the city.
+
+    -   $H_A$: The mean total number of fatal shootings per agency is lower in cities where a majority of the officers live in the city than in cities where they do not.
+
+        -   $H_0 : \mu_{maj} - \mu_{min} = 0$, or equivalently $H_0 : \mu_{maj} = \mu_{min}$
+        -   $H_A : \mu_{maj} - \mu_{min} < 0$, or equivalently $H_A : \mu_{maj} < \mu_{min}$
+
+2.  The numerical relationship between the proportion of officers who live in the city they serve and the number of fatal police shooting deaths.
+
+    -   $H_0$: There is no relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.
+
+    -   $H_A$: There is a relationship between the percentage of the total police force that lives in the city they serve and the number of fatal shootings.
+
+        -   $H_0 : \rho = 0$
+
+        -   $H_A : \rho \neq 0$
+
+## Methods
+
+#### Tidying Data
+
+```{r Tidying and Wrangling Data}
+##Tidying Data
+
+#creating dfs from .csv files
+police_locals <- read_csv("data/police-locals.csv")
+agencies <- read_csv("data/fatal-police-shootings-agencies.csv")
+shootings <- read_csv("data/fatal-police-shootings-data.csv")
+
+#removing the old `city_old` column left over from splitting the city names
+police_locals <- police_locals |>
+  select(-city_old)
+
+# creating `agencies` df with just police departments
+agencies <- agencies |>
+  filter(grepl("department", tolower(name))) |>
+  filter(!grepl("county", tolower(name)))
+
+#creating binned categorical account of if shooting victim was `armed`
+shootings <- shootings |>
+  mutate(armed = case_when(is.na(armed_with) ~ "NO",
+                           armed_with == "unarmed" ~ "NO",
+                           armed_with == "unknown" ~ "NO",
+                           armed_with == "undetermined" ~ "NO",
+                           armed_with == "gun" ~ "YES",
+                           armed_with == "knife" ~ "YES",
+                           armed_with == "blunt_object" ~ "YES",
+                           armed_with == "other" ~ "YES",
+                           armed_with == "replica" ~ "YES",
+                           armed_with == "vehicle" ~ "YES"))
+
+#creating df with only agency `names`, `id`, and `state`
+agencies_ids <- agencies |>
+  select(name, id, state)
+
+#creating df with `city`, `agency`, and `state` info for each shooting
+shooting_agencies <- shootings |>
+  select(city, agency_ids, state)
+
+#changing `agency_ids` var in `shooting_agencies` df to numeric
+shooting_agencies$agency_ids <- as.numeric(shootings$agency_ids)
+
+#creating df with `city` and `state` info for each agency by joining `agencies_ids` and `shooting_agencies`
+agencies_w_cities <- agencies_ids |>
+  left_join(shooting_agencies, by = c("id" = "agency_ids", "state" = "state")) |>
+  drop_na(city) |>
+  distinct(id, .keep_all = TRUE)
+
+#creating df with census data for each agency by joining `agencies_w_cities` and `police_locals`
+agencies_census <- agencies_w_cities |>
+  full_join(police_locals, by = c("city" = "city", "state" = "state")) |>
+  drop_na(police_force_size) |>
+  distinct(id, .keep_all = TRUE) |>
+  mutate(majority = if_else(all >= 0.5, "TRUE", "FALSE"))
+
+#creating df of only shootings involving agencies within `agencies` df
+shootings_case <- shootings |>
+  right_join(agencies_census, by = c("city" = "city", "state" = "state")) |>
+  select(-agency_ids) |>
+  rename(agency_ids = id.y, id = id.x, agency = name.y, victim = name.x) |>
+  select(-location_precision, -race_source)
+```
+
+#### Counting Shootings
+
+```{r Counting Shootings}
+#count shootings by agency
+shootings_by_agency <- shootings_case |>
+  count(agency)
+
+#find top 25 agencies with the most shootings
+top_25_agencies <- shootings_by_agency |>
+  slice_max(n, n = 25)
+```
+
+#### Mapping Locations of Police-Involved Shootings between 2015 and 2023
+
+```{r shot_map, include=FALSE}
+#mapping Locations of Police-Involved Shootings between 2015 and 2023
+
+#load geo-viz libraries
+library(ggmap)
+library(maps)
+library(mapdata)
+
+#create blank map
+usa <- map_data("usa")
+states <- map_data("state")
+
+#add locations of shootings to maps
+shot_map <- ggplot(data = states) + 
+  geom_polygon(aes(x = long, y = lat, fill = group, group = group), color = "white") + 
+  coord_fixed(1.3) +
+  guides(fill = "none") +  # do this to leave off the color legend
+  geom_point(data = shootings_case, aes(x = longitude, y = latitude), color = "black", size = .2) +
+  geom_point(data = shootings_case, aes(x = longitude, y = latitude), color = "red", size = .1) +
+  labs(title = "Locations of Police-Involved Shootings between 2015 and 2023",
+       caption = "This only includes cities where we have agency census data.",
+       x = "Longitude",
+       y = "Latitude")
+
+```
+
+```{r}
+shot_map
+```
+
+```{r bolstering dfs}
+#creating df with total shootings per agency and census data
+agencies_census <- agencies_census |>
+  left_join(shootings_by_agency, by = c("name" = "agency"))
+
+#creating visualization of comparison Shootings in Cities where a Majority/Minority of Officers Reside
+p0 <- shootings_case |>
+  ggplot(aes(x = majority, fill = armed)) +
+  geom_bar() + 
+  labs(title = "Shootings in Cities where a Majority of Officers Reside",
+       caption = "This only includes shootings where we have agency census data.",
+       x = "Does a majority of the total police force live in the city?",
+       y = "Number of fatal shootings",
+       fill = "Victim Armed?")
+```
+
+```{r}
+p0
+```
+
+```{r}
+#calculate mean number of shootings per agency in cities where a majority of officers reside in the city
+majority_mean <- shootings_case |>
+  filter(majority == TRUE) |>
+  count(agency) |>
+  summarize(maj_mean = mean(n))
+
+#calculate mean number of shootings per agency in cities where a minority of officers reside in the city
+minority_mean <- shootings_case |>
+  filter(majority == FALSE) |>
+  count(agency) |>
+  summarize(min_mean = mean(n))
+
+#calculate a difference in means between the `majority` and `minority`
+diff_in_means <- majority_mean - minority_mean
+```
+
+```{r}
+#tidy table
+knitr::kable(head(diff_in_means))
+```
+
+```{r}
+#fit single linear regression model for correlation between percentage of officer residency and number of fatal shootings per agency
+fit <- lm(n ~ all, data = agencies_census)
+
+#add `armed` and `majority` to `shootings_by_agency` df
+shootings_by_agency_census <- shootings_case |>
+  group_by(agency) |>
+  count(armed) |>
+  drop_na(n, armed) |>
+  right_join(agencies_census, by = c("agency" = "name")) |>
+  distinct(armed, .keep_all = TRUE)
+
+shootings_by_agency_census <- shootings_by_agency_census |>
+  select(n.x, armed, all) 
+
+#fit multiple linear regression model for correlation between percentage of officer residency and victim armament and number of fatal shootings per agency
+fit_multi <- lm(n.x ~ all + armed, data = shootings_by_agency_census)
+```
+
+## Results
+
+### Simple Linear Regression of the relationship between percentage of officer residency and number of fatal shootings per agency `fit`
+
+The model equation for `fit` is:
+
+$$
+\text{Number of Fatal Shootings (n)} = 35.7782 - 0.5874 \times \text{Percentage of Officer Residency (all)}
+$$
+
+```{r}
+#tidy `fit`
+p1 <- get_regression_table(fit)
+knitr::kable(head(p1))
+```
+
+Interpretation:
+
+-   The intercept, $35.7782$, is the estimated number of fatal shootings when officer in-city residency (`all`) is $0$. For each one-unit increase in `all`, the number of fatal shootings per agency is expected to decrease by $0.5874$.
+
+This model suggests a negative association between officer residency and the number of fatal shootings. However, the result should be interpreted in the context of the data, and the model does not account for potential confounding factors such as whether the victim was armed.
+
+```{r}
+#visualize polynomial relationship between percentage of officer residency and number of fatal shootings per agency
+ggplot(data = shootings_by_agency_census, aes(x = all, y = n.x)) +
+  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +
+  geom_smooth(method = "lm", formula = y ~ poly(x, 2), se = TRUE) +
+  labs(title = "Number of Shootings on a Scale of Police Force Residency",
+       x = "Percentage of the total police force that lives in the city",
+       y = "Number of fatal shootings in that city")
+```
+
+### Multiple Linear Regression of relationship between percentage of officer residency/victim armament and number of fatal shootings per agency `fit_multi`
+
+The model equation for `fit_multi` considering victim armament (`armed`) is:
+
+$$
+\text{Number of Fatal Shootings (n.x)} = 4.117 + 1.211 \times \text{Percentage of Officer Residency (all)} + 24.921 \times \text{Armed (YES)}
+$$
+
+```{r}
+#tidy `fit_multi`
+p2 <- get_regression_table(fit_multi)
+knitr::kable(head(p2))
+```
+
+-   The intercept, $4.117$, is the estimated number of fatal shootings when the percentage of officer in-city residency (`all`) is $0$ and the victim was unarmed. **For each one-unit increase in the percentage of in-city officer residency compared to the total force (`all`), we expect an increase of** $1.211$ fatal shootings, assuming the victim's armament status (`armedYES`) remains constant.
+
+-   The coefficient for `armedYES`, $24.921$, indicates that when the victim is armed (`armed` is `YES`), **we expect an increase of** $24.921$ fatal shootings compared to when the victim is not armed (`armed` is `No`), assuming the percentage of officer residency (`all`) remains constant.
+
+In summary, the model suggests that the percentage of officer residency is associated with the number of fatal shootings per agency even after controlling for victim armament, and that victim armament is itself associated with the count. However, correlation does not imply causation, and other factors not included in the model may influence the outcomes.
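+
+To illustrate where a coefficient such as `armedYES` comes from, the sketch below shows how `lm()` dummy-codes a two-level factor. The data frame `toy_armed` and its values are hypothetical and chosen only to keep the example small; they are not the project's data.
+
+```{r}
+# hypothetical toy data, NOT the `shootings_by_agency_census` data used above
+toy_armed <- data.frame(
+  n.x   = c(2, 30, 4, 28, 6, 33),
+  all   = c(10, 10, 30, 30, 50, 50),
+  armed = factor(c("NO", "YES", "NO", "YES", "NO", "YES"))
+)
+
+# design matrix built by lm(): `armedYES` is 1 for armed rows and 0 otherwise,
+# so "NO" (the first factor level) acts as the baseline group
+model.matrix(n.x ~ all + armed, data = toy_armed)
+
+# additive model: two parallel lines in `all`, one per level of `armed`
+toy_fit_multi <- lm(n.x ~ all + armed, data = toy_armed)
+
+# `armedYES` is the vertical gap between those two lines at any fixed `all`
+coef(toy_fit_multi)
+```
+
+Because the model is additive, the slope on `all` is shared by both groups; letting the slope differ by armament status would require an interaction term (`all * armed`).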
+
+```{r}
+#visualize polynomial relationship between percentage of officer residency and victim armament and number of fatal shootings per agency
+ggplot(data = shootings_by_agency_census, aes(x = all, y = n.x, color = armed)) +
+  geom_jitter(width = 0.10, height = 0, alpha = 0.45) +
+  geom_smooth(method = "lm", formula = y ~ poly(x, 2), se = TRUE) +
+  labs(title = "Number of Shootings on a Scale of Police Force Residency",
+       x = "Percentage of the total police force that lives in the city",
+       y = "Number of fatal shootings in that city",
+       color = "Victim Armed?")
+```
+
+```{r Hypothesis Testing for Diff in Mean Total Fatal Shootings between Residency Prop}
+
+#generate null distribution
+null_dist <- agencies_census |>
+  specify(n ~ majority) |>
+  hypothesize(null = "independence") |>
+  generate(reps = 1000, type = "permute") |>
+  calculate(stat = "diff in means", order = c("TRUE", "FALSE"))
+
+#compute observed test statistic
+test_stat <- agencies_census |>
+  specify(n ~ majority) |>
+  calculate(stat = "diff in means", order = c("TRUE", "FALSE"))
+
+#visualize p-value
+null_dist |>
+  visualize() +
+  shade_p_value(obs_stat = test_stat, direction = "less")
+
+#compute p-value
+null_dist |>
+  get_p_value(obs_stat = test_stat, direction = "less")
+
+```
+
+At a significance level of $\alpha = 0.05$, the p-value of $0.248$ means **there is insufficient evidence to reject the null hypothesis**. Our null hypothesis asserts that the mean total number of fatal shootings per agency does not differ based on whether or not a majority of the officers live in the city. Assuming the null is true, the probability of observing a difference in means ($\mu_{maj} - \mu_{min}$) at least as far below zero as our observed $-4.92$ is around $25\%$ ($0.248$), so the observed difference between the groups could plausibly have occurred by random chance.
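+
+The permutation logic behind this test can be sketched in base R. This is an illustration under stated assumptions rather than the code that produced the result above: `toy_perm`, `diff_in_means()`, and all values are hypothetical.
+
+```{r}
+# hypothetical toy data, NOT the `agencies_census` data used above
+toy_perm <- data.frame(
+  n        = c(12, 15, 9, 20, 7, 30, 25, 18, 22, 27),
+  majority = rep(c(TRUE, FALSE), each = 5)
+)
+
+# difference in means, TRUE group minus FALSE group
+diff_in_means <- function(n, majority) {
+  mean(n[majority]) - mean(n[!majority])
+}
+
+# observed statistic on the unshuffled data
+obs_diff <- diff_in_means(toy_perm$n, toy_perm$majority)
+
+# shuffle the group labels to simulate the null hypothesis of independence
+set.seed(1)
+perm_diffs <- replicate(
+  1000,
+  diff_in_means(toy_perm$n, sample(toy_perm$majority))
+)
+
+# one-sided ("less") p-value: share of shuffled differences
+# that are at or below the observed difference
+p_less <- mean(perm_diffs <= obs_diff)
+```
+
+Shuffling `majority` breaks any real link between residency and shootings while keeping the group sizes fixed, which is what `generate(type = "permute")` does under the independence null.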
+
+```{r Hypothesis Testing for Correlation between Total Fatal Shootings and Residency Prop}
+
+#generate null distribution
+null_dist_cor <- agencies_census |>
+  specify(n ~ white) |>
+  hypothesize(null = "independence") |>
+  generate(reps = 1000, type = "permute") |>
+  calculate(stat = "correlation")
+
+#compute observed test statistic
+test_stat_cor <- agencies_census |>
+  specify(n ~ white) |>
+  calculate(stat = "correlation")
+
+
+#visualize p-value (use the observed correlation, not the diff-in-means statistic)
+null_dist_cor |>
+  visualize() +
+  shade_p_value(obs_stat = test_stat_cor, direction = "two.sided")
+
+#compute p-value
+null_dist_cor |>
+  get_p_value(obs_stat = test_stat_cor, direction = "two.sided")
+
+```
+
+At a significance level of $\alpha = 0.05$, the p-value of approximately $0$ suggests that there is sufficient evidence to reject the null hypothesis. Our null hypothesis asserts that there is no relationship ($\rho = 0$) between the percentage of the total police force that lives in the city they serve and the number of fatal shootings. Assuming the null is true, the probability of observing a correlation coefficient at least as extreme as our observed $-0.0470$ is around $0\%$ ($0$), meaning the observed correlation would be unlikely if there were no relationship between percentage of officer residency and number of fatal shootings for a given agency.
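+
+For a correlation statistic the same shuffling idea applies, with a two-sided comparison. The sketch below uses one common two-sided convention (comparing absolute values), which is an assumption here and may differ slightly from how `get_p_value()` computes it; `toy_cor` and its values are hypothetical.
+
+```{r}
+# hypothetical toy data, NOT the `agencies_census` data used above
+toy_cor <- data.frame(
+  white = c(0.20, 0.35, 0.40, 0.55, 0.60, 0.70, 0.80, 0.90),
+  n     = c(14, 9, 22, 11, 18, 7, 16, 10)
+)
+
+# observed Pearson correlation
+obs_cor <- cor(toy_cor$white, toy_cor$n)
+
+# shuffle the response to simulate the null hypothesis of no relationship
+set.seed(1)
+perm_cors <- replicate(1000, cor(toy_cor$white, sample(toy_cor$n)))
+
+# two-sided p-value: share of shuffled correlations at least as far
+# from zero as the observed correlation
+p_two_sided <- mean(abs(perm_cors) >= abs(obs_cor))
+```
+
+The shuffled correlations always lie between $-1$ and $1$, so the observed statistic passed to the p-value step needs to be a correlation on that same scale.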
+
+## Conclusion
+
+### General Conclusions
+
+### Study Limitations
+
+### Improvements for Future Study
+
+## Citations