essay.tex

\documentclass[a4paper,12pt]{article}
\usepackage{partiiiessay}

\title{\vspace{-40pt}Serre's conjecture}
\author{\vspace{-10pt}Alex J. Best}
\date{June 2015}

% used https://www.overleaf.com/2460029rpwpbr#/6413425/

\begin{document}
\maketitle
\vspace{-45pt}
\tableofcontents
\clearpage


% ************************************************************************
% Introduction
% ************************************************************************
\section{Introduction}
In 1987 Jean-Pierre Serre published a paper \cite{Serre87}, ``Sur les repr\'esentations modulaires de degr\'e 2 de $\Gal(\Qb/\QQ)$'', in the Duke Mathematical Journal.
In this paper Serre outlined a conjecture detailing a precise relationship between certain mod $p$ Galois representations and specific mod $p$ modular forms.
This conjecture and its variants have become known as Serre's conjecture, or sometimes \emph{Serre's modularity conjecture} in order to distinguish it from the many other conjectures Serre has made.
The conjecture has since been proven correct by the work of numerous people, culminating with that of Khare--Wintenberger and Kisin, published in 2009 \cite{KWI,KWII,Kisin}.

Here we provide a motivated account of the original form of the conjecture before going on to compute some explicit examples and examining some interesting consequences.

Beyond Serre's paper there are many very good accounts of his statement, including articles by Darmon \cite{Darmon} and by Ribet and Stein \cite{RibetStein} who also have a book chapter on the topic \cite{RibetSteinBook}.
In 1992 Bas Edixhoven wrote a paper \cite{EdixhovenWeight} in which he gave a slight alteration of Serre's conjecture.
There are additional articles covering this version of the conjecture, such as Cais \cite{Cais} and Edixhoven \cite{Edixhoven}.
Finally Alex Ghitza has prepared a translation of part of Serre's paper \cite{Ghitza} which has been helpful.
These articles were of great help when preparing the current essay and many of the ideas used here are contained in at least one of them.


% ************************************************************************
% Background
% ************************************************************************
\section{Background}
We begin by fixing several definitions and key results that will be relevant when discussing Serre's conjecture.

\subsection{Modular forms}
In the interests of space we assume material relating to classical modular forms.
Here we only look at the passage to \emph{mod $p$ modular forms}, as there is some amount of choice in how these forms are defined and they form a key part of Serre's conjecture.
We use Serre and Swinnerton-Dyer's original approach to define mod $p$ modular forms, this is detailed in \cite{SD,Serre73,Serre73Formes}.
This approach is a simple one and it allows us to talk about congruences involving $q$-expansions and their coefficients without getting too bogged down with notation.

\begin{defn}
Given a subring $R$ of $\CC$ we let $S_k(N,\,\varepsilon;\,R)$ be the space of cusp forms of weight $k$, level $N$ and character $\varepsilon\colon (\ZZ/N\ZZ)^* \to R$, whose $q$-expansion coefficients lie in $R$.

Now fix a prime $p\nmid N$.
Given a character
\[
\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*
\]
we may lift to a character
\[
\hat{\varepsilon}\colon (\ZZ/N\ZZ)^* \to \Zb^*,
\]
with values in the prime to $p$ roots of unity.

Additionally fix a place $v$ of $\Qb$ above $p$, this place gives us a reduction map $\Zb \to \Fb_p$.
We let the space of \emph{cuspidal mod $p$ modular forms} of weight $k$, level $N$ and character $\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*$ be the subspace of $\Fb_p[[q]]$ obtained by reducing the $q$-expansions of forms in $S_k(N,\,\hat{\varepsilon};\,\Zb)$ using our place $v$.
We denote this space by
\[
S_k(N,\,\varepsilon;\,\Fb_p).
\]
Taking the union over all characters $\varepsilon$ as above gives us the space of all mod $p$ cusp forms of weight $k$ and level $N$,
\[
S_k(N;\,\Fb_p).
\]

We can in the same way define the full (non-cuspidal) space of mod $p$ modular forms, along with mod $p$ modular forms for more general congruence subgroups.
But we don't need to consider such forms in this essay so we restrict to cusp forms for $\Gamma_1$ to keep things concise.
\end{defn}

Many notions defined for normal modular forms descend to mod $p$ modular forms in the natural way.

\begin{defn}
The standard Hecke operators act on $q$-expansions in a way that preserves each space $S_k(N,\,\varepsilon;\,\Zb)$ (this action is recalled in the proof of \cref{prop:thetaeigen}).
So we may define the action of the Hecke operators on a mod $p$ modular form $f$ by letting them act on a lift of $f$ and then reducing the $q$-expansion mod $p$ again.
Equivalently we may just define the action on $q$-expansions to be given by the same formulae as for normal modular forms.
\end{defn}

\begin{defn}
As for standard modular forms, we say a mod $p$ cusp form $f = \sum_{n\ge 1} a_n q^n$
is \emph{normalised} if $a_1 = 1$.
\end{defn}

An interesting difference between the world of standard modular forms and that of mod $p$ modular forms comes from the fact that distinct modular forms may reduce mod $p$ to have exactly the same $q$-expansion.
\begin{ex}
Using Sage \cite{Sage} we find the following example, let
\begin{align*}
f &= q - q^{2} - 2q^{3} - 7q^{4} + 16q^{5} + 2q^{6} - 7q^{7} + O(q^{8}) \in S_{4}(7,\,\Id;\,\ZZ),\\
g &= q - 6q^{2} - 42q^{3} - 92q^{4} - 84q^{5} + 252q^{6} + 343q^{7} + O(q^{8}) \in S_{8}(7,\,\Id;\,\ZZ),
\end{align*}
then if we reduce mod $5$ we see that
\begin{align*}
\overline{f} &= q + 4q^{2} + 3q^{3} + 3q^{4} + q^{5} + 2q^{6} + 3q^{7} + O(q^{9}) \in S_{4}(7,\,\Id;\,\Fb_5),\\
\overline{g} &= q + 4q^{2} + 3q^{3} + 3q^{4} + q^{5} + 2q^{6} + 3q^{7} + O(q^{9}) \in S_{8}(7,\,\Id;\,\Fb_5),
\end{align*}
which are indeed equal up to this precision.
\end{ex}

This means that we need to be careful if we are just given a $q$-expansion corresponding to some mod $p$ modular form as there is no longer necessarily a single weight attached that form.
There is a useful necessary condition controlling when this sort of behaviour can happen however.

\begin{prop}\label{prop:pm1}
If $f$ and $g$ are two non-zero mod $p$ modular forms of weights $k$ and $k'$ respectively, whose $q$-expansions are equal, then
\[
k \equiv k' \pmod{p-1}.
\]
\end{prop}
\begin{proof}
See \cite{Serre73}.
\end{proof}

In fact for $p\ge 5$ it is always the case that $S_{k}(N;\,\Fb_p)\subset S_{k + p - 1}(N;\,\Fb_p)$ \cite{Serre73Formes}.
Due to this behaviour the concept of weight is not particularly well defined for mod $p$ modular forms, so we introduce the notion of a \emph{filtration} to help us deal with the ambiguity.

\begin{defn}\label{def:filtration}
The \emph{filtration} of a mod $p$ cusp form $f$ of level $N$ is the minimal $k\ge 0$ for which $f\in S_k(N;\,\Fb_p)$.
We denote this by $w(f)$.
\end{defn}

Now we look at an important operator on the space of mod $p$ modular forms, which we shall study more in \cref{sec:galassoc}.

\begin{defn}\label{def:theta}
The $\Theta$ operator is defined on (cuspidal) mod $p$ modular forms via its action on $q$-expansions, which is
\[
\Theta\left(\sum_{n\ge 0} a_nq^n\right) = q\ddq \left(\sum_{n\ge 0} a_nq^n\right) = \sum_{n\ge 0} na_nq^n.
\]
\end{defn}

It is not clear from this definition that the resulting $q$-expansion should  always correspond to another modular form, but this is indeed the case.
In fact many of the properties of the cusp form are preserved by $\Theta$.

\begin{prop}\label{prop:thetafilt}
If $f$ is a mod $p$ cusp form of filtration $w(f) = k$, then $\Theta(f)$ is also a mod $p$ cusp form of the same level and character and has filtration
\[
w(\Theta(f)) = \begin{cases}
k + p + 1 &\text{ if } p\nmid k,\\
k + p + 1 - n(p-1),\, n\ge 1&\text{ if } p\mid k.
\end{cases}
\]
\end{prop}
\begin{proof}
See \cite{Serre73} and also \cite{Jochnowitz} for more detail about how the filtration lowers in the $p\mid k$ case.
\end{proof}

We see straight away from the definition of the action that $\Theta$ preserves the set of normalised mod $p$ cusp forms.

\begin{prop}\label{prop:thetaeigen}
$\Theta$ semicommutes with the Hecke operators $T_{\ell}$ (specifically we have $T_{\ell}\Theta = \ell\Theta T_{\ell}$), and hence $\Theta$ preserves eigenforms.
\end{prop}
\begin{proof}
The Hecke operators $T_{\ell}$ on $S_k(N,\,\varepsilon;\,\Fb_p)$ act on $q$-expansions by
\[
T_{\ell}\left(\sum_{n\ge 1} a_nq^n\right) = \begin{cases}
\sum_{n\ge 1} a_{\ell n}q^n + \ell^{k-1}\varepsilon(\ell)\sum_{n\ge 1} a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} a_{\ell n}q^n &\text{ if }\ell \mid N.
\end{cases}
\]
We let $f = \sum_{n\ge 1} a_nq^n\in S_k(N,\,\varepsilon;\,\Fb_p)$ and calculate
\[
\Theta T_{\ell} f = \begin{cases}
\sum_{n\ge 1} na_{\ell n}q^n + \ell^{k-1}\varepsilon(\ell)\sum_{n\ge 1} \ell n a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} na_{\ell n}q^n &\text{ if }\ell \mid N,
\end{cases}
\]
and
\[
T_{\ell}\Theta f = \begin{cases}
\sum_{n\ge 1} \ell n a_{\ell n}q^n + \ell^{k+p+1-1} \varepsilon(\ell)\sum_{n\ge 1} n a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} \ell n a_{\ell n}q^n &\text{ if }\ell \mid N.
\end{cases}
\]
As we are working in characteristic $p$ here $\ell^{k+p} = \ell^{k+1}$, so
\[
 T_{\ell} \Theta = \ell \Theta T_{\ell},
\]
thus if $f$ is an eigenform for the $T_\ell$ then $\Theta f$ is an eigenform too.
However the eigenvalue for each $T_\ell$ is $\ell$ times the original.
\end{proof}


\subsection{Galois representations}\label{sec:gals}
We now move on to the next major component of Serre's conjecture, Galois representations.
As above we mostly concern ourselves with fixing definitions and recalling important results that shall be needed later.
There are many good references for this type of material, for example \cite{DDT,WieseGal}.

\begin{defn}
An \emph{$n$-dimensional mod $p$ Galois representation} is a homomorphism
\[
\rho\colon \GQ \to \GL_n(\Fb_p).
\]
Similarly, an \emph{$n$-dimensional $p$-adic Galois representation} is a homomorphism
\[
\rho\colon \GQ \to \GL_n(\Qpb).
\]
Unless stated otherwise the term Galois representation will refer to a mod~$p$ Galois representation.
\end{defn}

Recall that $\GQ$ is defined as the inverse limit of $\Gal(K/\QQ)$ as $K$ ranges over all number fields.
So the group $\GQ$ naturally has the profinite topology, where the open subgroups are the subgroups of finite index.
We demand that all of our mod $p$ representations be continuous with respect to this topology and the discrete topology on $\GL_n(\Fb_p)$.

\begin{rmk}\label{rmk:ctsfin}
The continuity condition for mod $p$ Galois representations reduces to having an open kernel, so continuous mod $p$ Galois representations always have finite image.
\end{rmk}

We deal mostly with 1 and 2 dimensional mod $p$ Galois representations.
Those of dimension 1 (i.e. maps $\phi\colon\GQ \to \Fb_p^*$) are called \emph{characters}.
Given a 2-dimensional mod $p$ representation $\rho\colon G\to \GL_2(\Fb_p)$ we often use the notation
\[
\rho \sim \begin{pmatrix}
\alpha & \beta \\
\gamma & \delta
\end{pmatrix},
\]
where $\alpha,\,\beta,\,\gamma$ and $\delta$ are functions $G \to \Fb_p$, to indicate that there is some $A\in\GL_2(\Fb_p)$ such that for every $\sigma\in G$
\[
\rho(\sigma) = A
 \begin{pmatrix}
\alpha(\sigma) & \beta(\sigma) \\
\gamma(\sigma) & \delta(\sigma)
\end{pmatrix}A^{-1}.
\]

\begin{defn}
Let $\rho$ be a mod $p$ Galois representation and $\phi$ be a mod $p$ Galois character.
We can form a new mod $p$ Galois representation of the same dimension as $\rho$ by taking the product of the images for each element of $\GQ$.
This is called the \emph{twist} of $\rho$ by $\phi$, and is denoted $\phi\otimes \rho$.
\end{defn}

\begin{defn}
Let $\phi\colon\GQ \to K^*$ be a character for some field $K$ and fix an embedding $\Qb\hookrightarrow \CC$.
We may then view complex conjugation as an element $c\in \GQ$, looking at its image $\phi(c)$ we see it is an element of order 2 in $K^*$, so $\phi(c)$ must be $\pm 1$.
If $\phi(c) = -1$ we say $\phi$ is \emph{odd}, otherwise we say $\phi$ is \emph{even} (though we shall mostly be concerned with distinguishing odd representations here).

Now given any Galois representation
\[
\rho\colon\GQ \to \GL_n(K),
\]
we define the parity of $\rho$ to be that of the character $\det\rho$.
\end{defn}


Due to \cref{rmk:ctsfin} all of our Galois representations will factor through finite extensions, and so our understanding of finite Galois groups will be essential in what follows.
The next few definitions and results will allow us to talk about the properties of finite Galois extensions of $\QQ$ and $\QQ_p$.

\begin{defn}\label{def:higher}
Given a finite Galois extension $K/\QQ$ we may define a series of subgroups of $G = \Gal(K/\QQ)$ that measure the ramification of $K/\QQ$ at a prime $\ell$.
Let $\mathfrak{L}$ be a prime of $K$ above $\ell$ and let
\[
D_\ell =\{\sigma \in G : \sigma(\mathfrak{L}) \subset \mathfrak{L}\}
\]
be the \emph{decomposition group} at $\ell$ of $G$.
Now take $i$ to be a non-negative integer, we define the $i$th \emph{higher ramification group} to be
\[
G_{\ell,i} = \{\sigma \in D_\ell : \sigma(x) - x \in \mathfrak{L}^{i + 1}\ \forall x \in \cO_{K} \}.
\]
These groups tell us how much of the whole of $G$ we are seeing if we just look modulo powers of $\mathfrak{L}$.
They form a descending chain as $i$ increases
\[
D_\ell\supseteq G_{\ell,0} \supseteq G_{\ell,1} \supseteq G_{\ell, 2}\supseteq\cdots.
\]
\end{defn}

Now we recall a result of Kummer theory, which tells us how to obtain abelian Galois extensions of certain fields, see \cite{Birch} for the full story.

\begin{prop}\label{prop:kummer}
Suppose $K$ is a field containing all $n$th roots of unity, then there is a bijection
\[
\{\text{subgroups }H \subset K^*/(K^*)^n\} \leftrightarrow \{\text{abelian extensions } L| K : \Gal(L/K)^n = 1\},
\]
defined by sending $H\mapsto K(\sqrt[n]{H})$ and $K^* \cap (L^*)^n\mapsfrom L$.
The notation $K(\sqrt[n]{H})$ simply means we adjoin $n$th roots of all elements of $H$ to $K$.
\end{prop}

Although our main objects of study are representations $\GQ$ it will be very useful for us to take a prime $\ell$ and also consider representations of
\[
\Gl = \Gal(\Qb_\ell/\QQ_\ell).
\]
Such representations can be obtained from those of $\GQ$ using an inclusion
\[
\Qb \hookrightarrow \Qlb
\]
to define a restriction map
\[
\Gl \to \GQ.
\]
In fact due to Krasner's lemma \cite[p. 238]{Cohen} the map $\Gl \to \GQ$ is injective and so we may view $\Gl$ as a subgroup of $\GQ$.
The way this subgroup sits inside $\GQ$ depends on the choice of embedding $\Qb \hookrightarrow \Qlb$ and varies by conjugation as this embedding changes.

The group $\Gl$ has several subquotients which will be helpful for us to study restrictions of representations to.
\begin{defn}\label{def:inert}
The ring of integers of $\Qlb$ is stable under the action of $\Gl$, as is its unique maximal ideal.
So we get an action of $\Gl$ on the residue field, this field may be identified with $\Fb_p$.
We therefore obtain a map
\[
\Gl \to \Gal(\Fb_\ell/\FF_\ell).
\]
The \emph{inertia subgroup} $I_\ell$ is defined to be the kernel of this map.

The group $\Gal(\Fb_\ell/\FF_\ell)$ is topologically cyclic, generated by the Frobenius morphism $x \mapsto x^\ell$.
In fact the above map $\Gl \to \Gal(\Fb_\ell/\FF_\ell)$ is surjective and so we may let $\Frob_\ell\in \Gl$ be a preimage of this morphism under the restriction map, we call such elements \emph{Frobenius elements}.


Next the \emph{wild} inertia group $I_{\ell,w}$ is the maximal pro-$\ell$-subgroup of $I_\ell$ and the \emph{tame} inertia group is the quotient
\[
I_{\ell,t} = I_\ell / I_{\ell,w}.
\]
\end{defn}

The groups we have been looking at give rise to Galois extensions of $\Ql$.
The fixed field of $I_\ell$ is the maximal non-ramified extension of $\Ql$, denoted $\Ql^\nr$.
Similarly, the fixed field of $I_{\ell,w}$ is the maximal tamely-ramified extension of $\Ql$, denoted $\Ql^\tamr$.
So the full set-up looks as follows
\[
\xymatrix{
\Qlb      \ar@{-}[d]^{I_{\ell,w}} \ar@{-}@/_3pc/[ddd]_{\Gl} \ar@{-}@/^2.5pc/[dd]^{I_{\ell}}\\
\Ql^\tamr \ar@{-}[d]^{I_{\ell,t}} \\
\Ql^\nr   \ar@{-}[d] \\
\Ql
}
\]

The field $\QQ_{\ell}^{\tamr}$ is generated by the fields $\Ql^\nr(\sqrt[n]{\ell})$ \cite{Frohlich} and so, using \cref{prop:kummer} the tame inertia $I_{\ell,t}$ may be identified with
\[
\lim_{\longleftarrow} \Gal(\Ql^\nr(\sqrt[n]{\ell})/\Ql^\nr)= \lim_{\longleftarrow} \FF_{\ell^n}^*,
\]
where the maps in the last inverse limit are the norm maps.

\begin{defn}
We say a Galois representation $\rho$ is \emph{unramified} at $\ell$ if $\rho|_{I_\ell}$ is trivial.
Otherwise, we say $\rho$ is \emph{ramified} at $\ell$.
Similarly we say $\rho$ is \emph{tamely ramified} at $\ell$ if $\rho|_{I_{\ell,w}}$ is trivial.
\end{defn}

The usefulness of the Frobenius elements for us stems in part from the following theorem.

\begin{thm}\label{thm:cheb}
Let $S$ be a finite set of primes, then:
\begin{enumerate}
\item A semi-simple mod $p$ representation
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
is determined by the values of $\tr\rho(\Frob_\ell)$ and $\det\rho(\Frob_\ell)$ for all $\ell\not\in S$ at which $\rho$ is unramified.

\item A mod $p$ character
\[
\phi\colon \GQ \to \Fb_p^*
\]
is determined by the values of $\phi(\Frob_\ell)$ for all $\ell\not\in S$ at which $\phi$ is unramified.
\end{enumerate}

\end{thm}
\begin{proof}
See \cite[prop. 25]{DDT}.
\end{proof}

\begin{defn}
Each character
\[
\phi \colon \GQ \to \Fb_p^*
\]
has finite image and so factors through some $\FF_{p^n}^*$, the smallest $n$ for which this can happen is called the \emph{level} of the character.
\end{defn}


For each $n \ge 1$ we now distinguish $n$ special mod $p$ characters of $I_{p,t}$ of level $n$, these will allow us to describe all such characters of a particular level.

\begin{defn}\label{def:fund}
The identification
\[
I_{p,t}  =  \lim_{\longleftarrow} \FF_{p^n}^*
\]
gives us a natural map
\[
\psi_n\colon I_{p,t} \to \FF_{p^n}^*
\]
for each $n$.
The \emph{fundamental characters} of level $n$ are defined by extending $\psi_n$ to an $\Fb_p$-character of $I_{p,t}$ via the $n$ embeddings $\FF_{p^n} \hookrightarrow \Fb_p$.

While any individual fundamental character is not canonically defined, the set of all of them of a particular level is.
\end{defn}

\begin{rmk}\label{rmk:prodchar}
The embeddings are all obtained from any chosen one by applying Frobenius and as such the product of all fundamental characters of level $n$ is the same as the composition of the norm map $\FF_{p^n}^* \to \FF_p^*$ with any one.
So this product will always be the unique fundamental character of level 1.
\end{rmk}


\begin{prop}
The fundamental characters of level $n$ generate the set of all characters of level $n$.
\end{prop}
\begin{proof}
See \cite[prop. 5]{Serre72}.
\end{proof}

As any character $\phi\colon\GQ \to \Fb_p^*$ factors through an abelian quotient, the Kronecker--Weber theorem tells us that any such character factors as
\[
\phi\colon\GQ \to \Gal(\QQ(\zeta_N)/\QQ)\cong (\ZZ/N\ZZ)^*\xrightarrow{\phi'}\Fb_p^*,
\]
where $\zeta_N$ is a primitive $N$th root of unity.
We can also use this factorisation to extend any Dirichlet character to a character of the absolute Galois group.
Thus characters of the absolute Galois group are in bijection with Dirichlet characters
\[
(\ZZ/N\ZZ)^* \xrightarrow{\phi}\Fb_p^*.
\]

\begin{defn}
The identity map
\[
(\ZZ/p\ZZ)^* \to \FF_p^* \hookrightarrow \Fb_p^*
\]
is a Dirichlet character and thus gives us a character of $\GQ$.
This character is called the \emph{mod $p$ cyclotomic character}, and denoted $\chi_p$.
\end{defn}

\begin{rmk}
Taking any $\sigma\in\GQ$ the definition above is saying that $\sigma$ sends
\[
\zeta_p\mapsto \zeta_p^{\chi_p(\sigma)},
\]
where $\zeta_p$ is a primitive $p$th root of unity.
If we take some $\ell \ne p$ prime and denote reduction mod $\ell$ of $\ZZ[\zeta_p]$ by $\overline{\ \cdot\ }$ we have that
\[
\overline{\Frob_\ell(\zeta_p)} = \overline{\zeta_p}^\ell,
\]
and so as $\chi_p$ takes values mod $p$ we find that
\[
\chi_p(\Frob_\ell) = \ell.
\]

Now if we fix an embedding $\Qb\hookrightarrow\CC$ and consider complex conjugation as some element $c\in\GQ$ we see that it takes $\zeta_p \mapsto\zeta_p^{-1}$ and hence
\[
\chi_p(c) = -1,
\]
so $\chi_p$ is an example of an odd character.

Finally, $\chi_p$ was defined to factor through $\Gal(\QQ(\zeta_p)/\QQ)$, this extension is tamely ramified and hence so is $\chi_p$.
Therefore $\chi_p$ passes to a character on the group $I_{p}/I_{p,w} = I_{p,t}$.
Straight from the definition we see that $\chi_p$, when viewed in this way, is an example of a level 1 character of $I_{p,t}$, but in fact more is true.
\end{rmk}

\begin{prop}
The fundamental character of level 1 is the cyclotomic character $\chi_p$ (or rather its induced character on $I_{p,t}$).
\end{prop}
\begin{proof}
See \cite[prop. 8]{Serre72}.
\end{proof}


\begin{defn}
We now classify Galois representations
\[
\rho\colon\Gp\to\GL_2(\Fb_p)
\]
of the form
\[
\rho\sim
\begin{pmatrix}
\chi_p\varepsilon_1 & * \\
0 & \varepsilon_2\end{pmatrix},
\]
for some unramified characters $\varepsilon_1$ and $\varepsilon_2$.

Let $K$ be the fixed field of the kernel of $\rho$.
There is a unique maximal tamely ramified extension $K_t$ of $\Qp^{\nr}$ that is contained inside of $K$.
These fields fit together as follows
\[
\xymatrix{
K      \ar@{-}[d]|{\rho(I_{p,w})} \ar@{-}@/_2.5pc/[dd]_{\rho(I_{p})}\\
K_t \ar@{-}[d] \\
\Qp^\nr \ar@{-}[d]\\
\Qp \\
}
\]
As
\[
\Gal(K_t/\QQ_p^{\nr}) = (\ZZ/p\ZZ)^*
\]
we may write
\[
K_t = \QQ_p^{\nr}(z),
\]
where $z$ is a primitive $p$th root of unity.
If we look at $\Gal(K/K_t)$ we see that
\[
\Gal(K/K_t) = \rho_p(I_{p,w})
\]
consists only of elements of the form
\[
\begin{pmatrix}
1 & * \\
0 & 1 \end{pmatrix}.
\]

So $\Gal(K/K_t)$ is a finite elementary abelian $p$-group and hence isomorphic to $(\ZZ/p\ZZ)^m$ for some $m$.
Applying \cref{prop:kummer} we see that $K$ is in fact given by
\[
K = K_t(x_1^{1/p},\ldots,x_m^{1/p}),
\]
where in fact we may take the $x_i\in\Qp^\nr$.
The valuations of these $x_i$ will determine our classification.
If
\[
\nu_p(x_i) \equiv 0 \pmod{p}
\]
for all $i$ then we say that $\rho$ is \emph{peu ramif\'e}, otherwise if any of the $\nu_p(x_i)$ is coprime to $p$ then we say it is \emph{tr\`es ramif\'e}.
\end{defn}

Finally we introduce a notion that will allow us to obtain semisimple representations from arbitrary Galois representations.

\begin{defn}\label{def:semisimp}
The \emph{semisimplification} of a 2-dimensional representation $\rho$ is another representation, denoted $\rss$, that is obtained as follows.
If $\rho$ is irreducible (and hence semisimple) we leave it as it is and set $\rss = \rho$.
Otherwise if $\rho$ is reducible we know that there is some subspace on which $\rho$ acts via a character and so we may write
\[
\rho \sim
\begin{pmatrix}
\phi_1 & * \\
0      & \phi_2
\end{pmatrix}.
\]
The semisimplification $\rss$ is then the representation given by
\[
\begin{pmatrix}
\phi_1 & 0 \\
0      & \phi_2
\end{pmatrix},
\]
conjugated in the same way $\rho$ was.
This is indeed semisimple.

In general the process of semisimplification is analogous, it is obtained by taking the direct sum of the Jordan--H\"older constituents of a representation, though for us the above description suffices.
\end{defn}


% ************************************************************************
% Obtaining Galois representations from modular forms
% ************************************************************************
\section{Obtaining Galois representations from modular forms}
The two concepts just introduced, modular forms and Galois representation, appear at first glance not to be particularly related to each other.
However in reality they are inextricably linked, and exploring some of the links between them will be the goal of the rest of this essay.

\subsection{Congruences for Ramanujan's $\tau$ function}
We will start with a historically important example that provides the first glimpse of the sort of connection we will be looking at in this essay.

\begin{ex}\label{ex:delt}
Let
\[
\Delta = \sum_{n \ge 1} \tau(n) q^n
\]
be the unique normalised cusp form of weight 12 for $\Gamma_1(1) = \SL_2(\ZZ)$.
The coefficients of this $q$-expansion were studied in detail by Ramanujan, who made many influential conjectures concerning them.
The function $n \mapsto \tau(n)$ is now known as the Ramanujan $\tau$ function.
The properties of this function provide the first glimpses of behaviours occurring for more general systems of Hecke eigenvalues.

Various people, including Ramanujan in the mod 691 case, found congruences involving the coefficients $\tau(\ell)$ modulo powers of primes for prime $\ell$.
Below are a few examples, though others do exist for higher powers of these primes.
\begin{align}
\tau(\ell) &\equiv 1 + \ell^{11} \pmod{2^8}\text{, if } \ell \ne 2,\label{eq:tau2}\\
\tau(\ell) &\equiv \ell^2 + \ell^9 \pmod{3^3}\text{, if } \ell \ne 3,\label{eq:tau3}\\
\tau(\ell) &\equiv \ell + \ell^{10} \pmod{5^2},\label{eq:tau5}\\
\tau(\ell) &\equiv \ell + \ell^4 \pmod{7},\label{eq:tau7}\\
\tau(\ell) &\equiv\left.\begin{cases}
0\pmod{23} & \text{ if } \left(\frac{\ell}{23}\right) = -1,\\
2\pmod{23} & \text{ if }\ell\text{ is of the form } u^2 + 23v^2,\\
-1\pmod{23} & \text{ otherwise},\\
\end{cases}\right\}\text{ if } \ell \ne 23,\label{eq:tau23}\\
\tau(\ell) &\equiv 1 + \ell^{11} \pmod{691}.\label{eq:tau691}
\end{align}

The original proofs of these congruences were in many cases quite involved and did not all work in the same manner.
Indeed Peter Swinnerton-Dyer writes in \cite{SD} that these proofs ``do little to explain why such congruences occur''.
So in order to try to explain all of the congruences in a unified manner, Serre predicted \cite{Serre67} for each prime $p$ the existence of a $p$-adic Galois representation
\[
\rho_p\colon \GQ \to \GL_2(\Qp)
\]
such that
\begin{enumerate}
\item $\tr(\rho_p (\Frob_\ell)) = \tau(\ell)$ for all $\ell \ne p$,\label{item:trace}
\item $\det(\rho_p (\Frob_\ell)) = \ell^{11}$ for all $\ell \ne p$.\label{item:det}
\end{enumerate}

The congruences would then follow from these Galois representations being of specific forms.
For example \cref{eq:tau2,eq:tau3,eq:tau5,eq:tau7,eq:tau691} can all be obtained from these Galois representations if the $\rho_p$ satisfy
\[
\rho_p \equiv \begin{pmatrix}
\chi_p^a & * \\
0        & \chi_p^{11-a}
\end{pmatrix}\pmod{p^b},
\]
where $a$ is $0,\,2,\,1,\,1$ or $0$ respectively and $b$ is as in the original congruences in each case.
Here in each case we can see that $\det\rho_p \equiv \chi_p^{11}$, which is consistent with \cref{item:det} above, and knowing \cref{item:trace} would give us the desired congruences.

Serre's prediction for the representation $\rho_{23}$ has a more interesting form, but nevertheless the images of Frobenius elements can be described explicitly.
Following Serre we take $K$ to be the splitting field of $x^3 - x - 1$, this is ramified only at 23 and has Galois group $S_3$.
We then let $r$ be the unique irreducible degree 2 representation of $S_3$ taken with coefficients in $\QQ_{23}$, this satisfies
\[
\tr(r(\sigma)) = \begin{cases}
0 &\text{ if } |\sigma| = 2,\\
2 &\text{ if } |\sigma| = 1,\\
-1 &\text{ if } |\sigma| = 3,
\end{cases}
\]
for each $\sigma \in S_3$.
As $\Gal(K/\QQ)$ is a quotient of $\GQ$ the representation $r$ extends to a representation of $\GQ$.
If some $\rho_{23}$ exists satisfying the conditions above it being isomorphic to $r$ then gives rise to \cref{eq:tau23} in the same way as before.

Shortly after Serre hypothesised their existence Pierre Deligne constructed the representations $\rho_p$ for all primes $p$  \cite{Deligne}.
In doing so Deligne also reduced another conjecture of Ramanujan's, that $|\tau(p)| \le 2p^{11/2}$ for all $p$, to the Weil conjectures.
This provided an indication that relating questions regarding $\tau$ to Galois representations is a natural and effective way of working on these questions.
Being able to compute these associated representations makes it possible to read off many more congruences for $\tau(n)$ (see, for example, \cite{Mascot}).
\end{ex}

\subsection{Attaching Galois representations to general eigenforms}
Given the above example it is natural to wonder whether such a relationship holds more generally.
Indeed Serre also asked if one could associate to each normalised cuspidal eigenform a Galois representation whose traces of Frobenius elements match the $q$-expansion coefficients.
Serre's conjectures on this led to the following more general theorem, which we state for mod $p$ modular forms now.

\begin{thm}[Deligne]\label{thm:assoc}
Let $p$ be a prime, $k \ge 2$, $N \ge 1$ an integer coprime to $p$ and $\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*$. Given a normalised eigenform $f\in S_k(N,\,\varepsilon;\,\Fb_p)$ which has $q$-expansion coefficients $a_i$, there exists a two-dimensional mod $p$ Galois representation $\rho_f$ such that
\begin{enumerate}[(i)]
\item $\rho_f$ is semi-simple,
\item $\rho_f$ is unramified outside $Np$,
\item $\tr(\rho_f (\Frob_\ell)) = a_\ell$ for all $\ell \nmid Np$,
\item $\det(\rho_f (\Frob_\ell)) = \varepsilon(\ell)\ell^{k-1}$ for all $\ell \nmid Np$.
\end{enumerate}
We often refer to the representation $\rho_f$ as arising from, or being attached to, $f$.
\end{thm}

The construction of these representations in this generality is due to Deligne \cite{Deligne}, building on work of Shimura and others.
There is also a similar statement for weight 1 due to both Deligne and Serre \cite{DeligneSerre}.
However, as we will discuss Serre's original statement of his conjecture here, we will ignore weight 1 forms.
(There is a translation of Deligne's paper available from the IAS \cite{DeligneEng}, it has nicer typesetting too.)

In fact the representations obtained in these constructions are $p$-adic Galois representations $\rho_f\colon \GQ \to \GL_2(\Qb_p)$, as they were in \cref{ex:delt}.
The representations of the theorem are then obtained from the $p$-adic ones via a process of mod $p$ reduction and semisimplification.
The mod $p$ representations are the ones that we will be most interested in from here on though.

%TOD do we want to make some remark about forms giving same rep?

\begin{rmk}\label{rmk:detrho}
Let $\rho_f$ be a Galois representation attached to a normalised eigenform $f\in S_k(N,\,\varepsilon;\,\Fb_p)$ as in the theorem.
As $\rho_f$ is semisimple and
\[
\det(\rho_f(\Frob_\ell)) = \chi_p^{k-1}(\Frob_\ell)\varepsilon(\Frob_\ell)
\]
for all $\ell \nmid Np$ (here viewing $\varepsilon$ as character of $\GQ$ now) we may apply \cref{thm:cheb} to see that condition (iv) of the theorem is equivalent to the statement that
\[
\det\rho_f = \varepsilon\chi_p^{k-1}.
\]
By looking at the action of $\langle-1\rangle$ on $f$ we find $\varepsilon(-1)f = \langle -1 \rangle f = (-1)^k f$, and so
\[
\varepsilon(c)\chi_p^{k-1}(c) = (-1)^k(-1)^{k-1} = -1,
\]
hence $\det\rho_f$ must be odd (i.e $\rho_f$ is odd).
\end{rmk}

We will look at some more properties of this construction in \cref{sec:galassoc}, but first we move on to the conjecture itself.


% ************************************************************************
% Serre's Conjecture
% ************************************************************************
\section{Serre's conjecture}
\subsection{The qualitative form}
Given the above result one might wonder about a converse statement, given a mod $p$ Galois representation satisfying some necessary conditions, does it arise from an eigenform?
Serre's conjecture was that the answer to this question is yes, all Galois representations that could possibly arise from an eigenform, based on \cref{thm:assoc} and the remark following it, do.

The conjecture naturally comes into two parts, a weaker existence statement, and another refined form that makes exact predictions about an extremal eigenform from which the Galois representation arises.
We look at the existence statement, or \emph{qualitative form} first.

\begin{conjecture}[Serre's conjecture, qualitative form]\label{conj:qual}
Let
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
be a continuous, odd, irreducible Galois representation.
Then there exists a normalised cuspidal mod $p$ eigenform $f$, such that $\rho$ is isomorphic to $\rho_f$, the Galois representation associated to $f$.
\end{conjecture}

This is already a very useful thing to know, any statement one could prove about Galois representations attached to mod $p$ modular forms, by using the theory of these forms for example, would hold for all odd 2-dimensional mod $p$ Galois representations.
One interesting consequence of this type stems from the fact that Deligne's construction of Galois representations from modular forms is actually of $p$-adic representations.
If we were to assume \cref{conj:qual} and then apply this construction, we would be able to lift all irreducible odd 2-dimensional mod $p$ Galois representations to $p$-adic Galois representations.

A similar statement to the one above also holds for reducible representations, which correspond to Eisenstein series instead.
We will not consider this more general setup here as it is not what the refined form of Serre's conjecture deals with.

This conjecture (at least for Galois representations unramified outside $p$) appeared much earlier than the Duke paper and is mentioned by Serre in a 1975 paper \cite[sec. 3]{Serre75}.
It was computations performed by J.-F. Mestre that convinced Serre that strengthening this conjecture was plausible, and this led to the form we are about to see.


\subsection{The refined form}
After seeing the qualitative form of Serre's conjecture it is natural to also ask about the properties of the form $f$ whose existence is claimed.
Can anything be said about the weight and level of $f$, based only on the properties of $\rho$?
Serre also conjectured that the answer to this question is yes.
He defined a weight, level and character for each $\rho$, such that there should be a form $f$ of that weight, level and character that $\rho$ is attached to.
In a slightly backwards manner we will first state this refined form of the conjecture more precisely, before moving on to motivate and define the integers $N(\rho)$, $k(\rho)$ and character
\[
\varepsilon(\rho)\colon (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*
\]
used in the statement.

\begin{conjecture}[Serre's conjecture, refined form]\label{conj:ref}
Let
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
be a continuous, odd, irreducible Galois representation.
Then there exists a normalised eigenform
\[
f\in S_{k(\rho)}(N(\rho),\,\varepsilon(\rho);\,\Fb_p)
\]
whose associated Galois representation $\rho_f$ is isomorphic to $\rho$.

Moreover $N(\rho)$ and $k(\rho)$ are the minimal weight and level for which there exists such a form $f$.
\end{conjecture}

From now on we refer to a Galois representation $\rho$ satisfying the hypotheses of this conjecture as being of \emph{Serre-type}.

This conjecture is very bold, even given the existence statement of \cref{conj:qual} it is not clear that there should be a simultaneously minimal weight and level, let alone that they should be given by the relatively straightforward (though intricate) description that we will soon see.

If correct however, the refined form is eminently more useful than the qualitative form for specific applications.
The precise definition of the level, weight and character often allow one to actually find an associated eigenform and work explicitly with it to study a particular Galois representation.
Doing this would be far more difficult in general if no knowledge of the modular forms side could be obtained from the representation.

We will soon give Serre's explicit recipe for the weight, level and character.
First however we will look at some results that might lead one to make these definitions in the first place.

\subsection{Results on Galois representations associated to modular forms}\label{sec:galassoc}
In order to try and understand which eigenforms can give rise to a particular Galois representation, it is useful to take an arbitrary eigenform and study the properties of the representation attached to it, in an attempt to see what information about the eigenform may be recovered.
Several people have obtained interesting results of this type which will be helpful for our definition of the weight and level.

We fix a prime $p$ and a normalised eigenform $f \in S_k(N,\,\varepsilon;\,\Fb_p)$ with $q$-expansion
\[
f = \sum_{n\ge 1} a_nq^n.
\]
Let $\rho_f$ be the mod $p$ Galois representation attached to $f$ by \cref{thm:assoc}.
Concerning the conductor of $\rho_f$ there is the following result due to Carayol and Livn\'e \cite{Carayol, Livne}.

\begin{thm}\label{thm:level}
Let $N(\rho_f)$ be the level associated to $\rho_f$ (which we will define explicitly in \cref{subsec:level}), then
\[
N(\rho_f)|N.
\]
\end{thm}

Given this it is natural to hope that any Galois representation $\rho$ of Serre-type arises from a form of level exactly $N(\rho)$ as this is the minimal possible, of course we still have yet to define this quantity!
\\[12pt] \noindent
We can also make useful observations concerning the restriction of $\rho_f$ to $\Gp$, and its subgroups, these have implications for our definition of the weight.
There are two main cases here depending on whether $a_p \ne 0$ (the \emph{ordinary} case) or otherwise (the \emph{supersingular} case).
We start with the supersingular case.

\begin{thm}[Fontaine]\label{thm:super}
Suppose that $2\le k \le p +1$ and that $a_p = 0$ then $\rho_{f}|_{\Gp}$ is irreducible, moreover, letting $\psi_1$ and $\psi_2$ be the two fundamental characters of level 2, we have
\[
\rho_{f}|_{I_p} \sim \begin{pmatrix} \psi_1^{k-1} & 0 \\ 0 & \psi_2^{k-1}\end{pmatrix}.
\]
\end{thm}

This was originally proved by Fontaine in letters to Serre in 1979.
There is a published proof in \cite[sec. 6]{EdixhovenWeight}.
So in this case we can certainly recover some information about the weight by looking at the restriction to inertia.

In the ordinary case we have slightly different behaviour.

\begin{thm}[Deligne]\label{thm:ordinary}
Suppose $2\le k\le p+1$ and $a_p \ne 0$ then $\rho_{f}|_{\Gp}$ is reducible.
Moreover, letting $\lambda(a)\colon \Gp \to \Fb_p^*$ be the unramified character of $\Gp$ that takes each $\Frob_p \in \Gp /I_p$ to some $a\in\Fb_p^*$, we have
\[
\rho_{f}|_{\Gp} \sim \begin{pmatrix} \chi_p^{k-1}\lambda(\varepsilon(p)/a_p) & * \\ 0 & \lambda(a_p)\end{pmatrix}.
\]

In particular when we look at the restriction to inertia we get
\[
\rho_{f}|_{I_{p}} \sim \begin{pmatrix} \chi_p^{k-1} & * \\ 0 & 1\end{pmatrix}.
\]
\end{thm}

A proof of this result when $k \le p$ is given in \cite{Gross} and the general case was originally proved in an unpublished letter from Deligne to Serre.

Once again we can recover some information about the weight by looking at the restriction to $I_p$.
As $\chi_p^{2} = \chi_p^{p+1}$ eigenforms of weight $2$ and $p+1$ give us similar looking representations when restricted to inertia.
This makes it more difficult to tell exactly what $k$ was just by looking at restriction to inertia in this case.
This will cause us some issues later, fortunately the following result will allow us to distinguish these two cases.

\begin{thm}[Mazur]\label{thm:mazur}
Let $f$ have \emph{filtration} $p+1$ and be such that $\rho_{f}$ is irreducible, then $\rho_{f}|_{\Gp}$ tr\`es ramif\'e.
\end{thm}

For $p> 2$ and trivial character this is due to Mazur \cite[sec. 6]{Ribet90}.
In \cite[sec. 2]{EdixhovenWeight} Edixhoven gives a modification to the general case.
\\[12pt] \noindent
In all the theorems we have just introduced the weight $k$ is at most $p+1$, of course we can also make Galois representation from higher weight eigenforms.
So we need some way of finding out about the representations attached to higher weight forms, using what we know about low weight ones.
To do this we use the $\Theta$ operator.
Which preserves the set of mod $p$ normalised cuspidal eigenforms of a particular level, but in most cases  increases the filtration of the eigenform.

We then need to know how the action of $\Theta$ affects the associated Galois representation, it turns out that $\Theta$ changes these representations in a very simple way.

\begin{prop}\label{prop:theta}
Let
\[
\Theta\colon S_k(N,\,\epsilon;\,\Fb_p)\to S_{k+p+1}(N,\,\epsilon;\,\Fb_p)
\]
be the operator defined in \cref{def:theta}.
Then if $f \in S_k(N,\,\epsilon;\,\Fb_p)$ is a normalised eigenform the Galois representation associated to $\Theta(f)$ is
\[
\rho_{\Theta(f)} \cong \chi_p\otimes\rho_{f}.
\]
\end{prop}
\begin{proof}
In \cref{prop:thetaeigen} we saw that $\Theta$ took eigenforms to eigenforms, but with the eigenvalue for each $T_{\ell}$ being $\ell$ times the original.
So
\[
\tr(\rho_{\Theta(f)}(\Frob_\ell)) = \ell a_{\ell} = \tr((\chi_p \otimes \rho_{f})(\Frob_{\ell}))
\]
and
\begin{align*}
\det(\rho_{\Theta(f)}(\Frob_\ell)) &= \ell^{k+p+1}\varepsilon(\ell)\\
                                   &= \ell^{k+1}\varepsilon(\ell)\\
                                   &= \ell^2\ell^{k-1}\varepsilon(\ell)\\
                                   &= \det((\chi_p \otimes \rho_{f})(\Frob_{\ell})).
\end{align*}
By \cref{thm:cheb} the representations $\rho_{\Theta(f)}$ and $\chi_p \otimes \rho_{f}$ are isomorphic.
\end{proof}

So applying $\Theta$ twists the associated representation by $\chi_p$ and if $p\nmid w(f)$ then $\Theta$ increases the filtration by $p+1$.
Hence we can use the $\Theta$ operator to transfer our understanding of Galois representations attached to modular forms of weight at most $p+1$ to modular forms of higher weight.

It is worth noting that the proofs of some of the above theorems actually came after Serre's paper.
However it seems likely that observations of these results in specific examples informed the recipe below.


\subsection{The level}\label{subsec:level}
Assume that we have a Galois representation $\rho\colon \GQ \to \GL_2(\Fb_p)$ of Serre-type.
We now define the integer $N(\rho) \ge 1$ which plays the role of the level in the refined conjecture.

We can view our representation $\rho$ as a homomorphism
\[
\GQ \to \Aut(V),
\]
where $V$ is a two-dimensional $\Fb_p$ vector space.
Let $G\cong \im \rho$ be the finite Galois group obtained by quotienting out by the kernel of $\rho$.
Then for each prime $\ell$ we let $G_{\ell,i}\subset G$ be the $i$th ramification group at $\ell$ of $G$, as defined in \cref{def:higher}.
Now, for each $\ell$ and $i$, consider the fixed subspace of $V$
\[
V^{\ell,i} = \{\mathbf{v}\in V : \rho(\sigma) \mathbf{v} = \mathbf{v}\ \forall \sigma \in G_{\ell,i}\}.
\]
For each $\ell$ we define
\[
\nu_\ell(\rho) = \sum_{i = 0}^{\infty} \frac{1}{[G_{\ell,0} : G_{\ell,i}]} \dim(V/V^{\ell,i}),
\]
this quantity is (non-trivially) an integer \cite[p. 99]{SerreLF}.
We then set the level to be
\[
N(\rho) = \prod_{\substack{\ell \ne p\\ \ell\text{ prime}}} \ell^{\nu_\ell(\rho)}.
\]
This is indeed a positive integer, and by construction it is coprime to $p$, this is necessary for it to be the level of a mod $p$ cusp form.
This number serves to quantify the higher ramification of a Galois representation at all primes other than $p$, the larger the ramification groups for a particular prime, the larger the power of that prime in $N(\rho)$.
The definition here is almost that of the \emph{Artin conductor} of a representation, but here the $p$-part is ignored.

\begin{rmk}\label{rmk:unram}
Unwinding this definition when $\rho$ is unramified at some $\ell$, we see that each $V^{\ell,i}$ is in fact the whole of $V$, as all the ramification groups involved are trivial.
Hence in this case $\nu_\ell(\rho) = 0$ and so $N(\rho)$ is only divisible by the primes $\ell \ne p$ at which $\rho$ is ramified.
\end{rmk}

\cref{thm:level} stated that when $\rho$ comes from an eigenform $f$ the integer $N(\rho)$ defined here divides the level of $f$.
With that in mind conjecturing that any Serre-type representation comes from an eigenform of level exactly $N(\rho)$ is fairly logical, though perhaps optimistic without more evidence.


\subsection{The character and the weight mod $p-1$}\label{subsec:char}
Beginning with a Galois representation of Serre-type, as before, we now define the character
\[
\varepsilon(\rho)\colon  (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*.
\]
We also state the class of $k(\rho)$~mod~$p-1$, though the full definition of $k(\rho)$ will be given in the following section.

Given a Serre-type Galois representation $\rho$ we can compose with the determinant map to obtain a continuous character
\[
\det \rho\colon \GQ \to \Fb_p^*.
\]
The Artin conductor of a 1-dimensional Galois representation (i.e. the level $N(\rho)$ introduced in \cref{subsec:level}, but including the $p$-part now) is actually equal to the conductor of the associated Dirichlet character \cite[p. 228]{SerreLF}.
So we now partially compute the Artin conductor of $\det\rho$ in order to find a number field that $\det\rho$ factors through.

Let $V_1$ be the 2-dimensional vector space realising $\rho$ and $V_2$ be the 1-dimensional vector space realising $\det\rho$.
If $\rho|_{G_{\ell,i}}$ is trivial then $\det\rho|_{G_{\ell, i}}$ is also trivial.
So having $\dim(V_2/V_2^{\ell,i}) > 0$ implies that $\dim(V_1/V_1^{\ell,i}) > 0$.
As $0 \le \dim(V_2/V_2^{\ell,i}) \le 1$ we get that
\[
\dim(V_2/V_2^{\ell,i})\le \dim(V_1/V_1^{\ell,i})
\]
for all $\ell$ and $i$ and so
\[
\nu_\ell(\det\rho) \le \nu_\ell(\rho).
\]
This gives us that
\[
N(\det\rho) \mid N(\rho).
\]

As the restriction of $\det\rho$ to $I_{p,w}$ is trivial (see the proof of \cref{prop:wildtriv}, using that characters are simple) we find that $\nu_p(\det\rho)\le 1$.
So the full Artin conductor of $\det\rho$ divides $pN(\rho)$.

We can therefore identify $\det\rho$ with a homomorphism
\[
(\ZZ/pN(\rho)\ZZ)^* \to \Fb_p^*,
\]
or equivalently with a pair of homomorphisms
\begin{align*}
\phi\colon& (\ZZ/p\ZZ)^* \to \Fb_p^*,\\
\varepsilon\colon& (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*.
\end{align*}

The group $(\ZZ/p\ZZ)^*$ is cyclic of order $p-1$ and so the image of $\phi$ lies inside $\FF_p^*$.
So $\phi$ is an endomorphism of the cyclic group $(\ZZ/p\ZZ)^*$ and hence is of the form
\[
x \mapsto x^h,
\]
for some $h \in \ZZ/(p-1)\ZZ$.
We have expressed the fact that $\phi = \chi_p^h$, where $\chi_p$ is the mod $p$ cyclotomic character.

This gives us the factorisation
\[
\det\rho = \varepsilon \chi_p^h
\]
and so, comparing with \cref{rmk:detrho}, we set $\varepsilon(\rho)$ to be the $\varepsilon$ obtained here.
We also see that $h$ had better be the same as $k(\rho)-1$ modulo $p-1$.


\subsection{The weight}
We now come to the final ingredient in Serre's recipe, the weight $k(\rho)$.
The general strategy of our approach here is to express a representation of Serre-type as a twist of another representation, one that looks like it comes from a cusp form of weight at most $p+1$.
We then read off the minimal weight of a cusp form that could give this twisted representation.
Then we can apply the results above regarding the $\Theta$ operator (\cref{prop:thetafilt} and \cref{prop:theta}) to define the weight of the original representation.

Given our Galois representation
\[
\rho \colon \GQ \to \GL_2(\Fb_p)
\]
we restrict to the subgroup $\Gp$ to form
\[
\rho_p\colon \Gp \to \GL_2(\Fb_p).
\]
The definition of $k(\rho)$ will only depend on this $\rho_p$ (in fact it will only depend on $\rho_p|_{I_p}$).
We will from here on refer to $k(\rho)$ as $k(\rho_p)$ to emphasise this fact.
As such the weight will only reflect the behaviour at $p$ of the representation, whereas the level reflected the behaviour away from $p$.

\begin{prop}(Serre \cite[prop. 4]{Serre72})\label{prop:wildtriv}
The semisimplification $\rss_p$ of $\rho_p$ is trivial when restricted to $I_{p,w}$.
\end{prop}
\begin{proof}
It suffices to prove this for simple representations $\rho_p$, as a sum of trivial representations is trivial.

The wild inertia $I_{p,w}$ is a pro-$p$-group, and so the image is also a pro-$p$-group.
This group is finite, so it is simply a $p$-group, and defined over some finite field $\FF_q$.
Consider an $\FF_q$-vector space $V$ that realises $\rss_p|_{I_{p,w}}$.
Additively $V$ is a $p$-group too, of order $q^n$, where $n$ is the dimension of $V$.
Looking at the action of $\rss_p|_{I_{p,w}}$ on $V$ we see that there is a singleton orbit $\{0\}$.
As all orbits are of $p$-power order there must be an additional $p-1$ singleton orbits at least, else the orbits could not partition $V$.
Therefore the fixed subspace $W$ of $V$ is non-trivial.
However as $I_{p,w}$ is normal in $\Gp$ the subspace $W$ is stable under $\Gp$, hence $W$ must equal $V$ by simplicity.
\end{proof}

We may therefore view $\rss_p$ as a representation of $I_{p,t}$, we shall write $\rss_t$ for this new representation.
The tame inertia group is abelian, and so this representation is diagonalisable.
The representation $\rss_t$ is therefore given by a pair of characters
\[
\phi_1,\,\phi_2\colon I_{p,t} \to \Fb_p^*.
\]

\begin{prop}
Both of the characters $\phi_1$ and $\phi_2$ are of the same level, and that level is either 1 or 2.

Moreover if they are both of level 2 then they are $p$th powers of each other.
\end{prop}
\begin{proof}
Letting a Frobenius element at $p$ act by conjugation on $\sigma\in I_{p}$ we have that \cite[p. 167]{RibetStein}
\[
\Frob_p \sigma \Frob_p^{-1} \equiv \sigma^p \pmod{I_{p,w}},
\]
and so
\[
\rss_t( \Frob_p\sigma \Frob_p^{-1}) = \rss_t(\sigma^p) = \rss_t(\sigma)^p,
\]
so we have an equivalence of representations
\[
\rss_t \cong (\rss_t)^p.
\]
Hence the set $\{\phi_1,\,\phi_2\}$ must be fixed by $p$th powering.

We then have two possibilities, either taking the $p$th power fixes each of $\phi_1$ and $\phi_2$ or it swaps them.
If they are both fixed then their images lie in the prime field, so they are of level 1.
Otherwise, if they swap under $p$th powering, each of them is fixed under powering by $p^2$, and hence they are of level 2.
\end{proof}

We now treat three different cases separately, based on the levels of the characters just obtained and whether or not $\rho|_{I_{p,w}}$ is trivial.


\subsubsection{The level 2 case}\label{sec:l2}
If the two characters $\phi_1$ and $\phi_2$ are of level 2 then $\rho_p$ is irreducible.

To see this, assume otherwise and consider a stable 1-dimensional subspace of the vector space realising $\rho_p$.
The representation $\rho_p$ acts by a character on this subspace, which is tamely ramified by the argument in \cref{prop:wildtriv}.
So this gives a character $\phi$ of $I_{p,t}$ which extends to all of $\Gp$, so we have
\[
\phi(\sigma) = \phi(\Frob_p\sigma\Frob_p^{-1}) = \phi^p(\sigma).
\]
Hence $\phi$ must factor through $\FF_p^*$, i.e. $\phi$ is of level 1.
This is a contradiction as $\phi$ must be one of $\phi_1$ or $\phi_2$.

So $\rho_p$ is irreducible and hence $\rho_p = \rss_p$, therefore the characters $\phi_1$ and $\phi_2$ above define the representation $\rho_p|_{I_{p}}$.
We can write them in terms the fundamental characters of level 2, $\psi_1$ and $\psi_2$ (as defined in \cref{def:fund}) and use this description to define $k(\rho_p)$.
Specifically we can write $\phi_1$ as
\[
\phi_1 = \psi_1^a\psi_2^b
\]
with $0\le a,\,b\le p-1$.
If $a = b$ then $\phi_1 = (\psi_1 \psi_2)^a=\chi_p^a$ (recall \cref{rmk:prodchar}), which contradicts $\phi_1$ being of level 2 as the mod $p$ cyclotomic character is of level 1.
Now we observe that
\[
\phi_2 = \phi_1^p = (\psi_1^a\psi_2^b)^p = \psi_2^a\psi_1^b,
\]
so by switching the places of $\phi_1$ and $\phi_2$ if necessary we may assume that in fact $0\le a < b\le p-1$.

Now when restricting to inertia we have
\[
\rho_p|_{I_{p}} \sim
\begin{pmatrix}
\psi_1^b\psi_2^a & 0 \\
0                & \psi_1^a\psi_2^b
\end{pmatrix}.
\]
This looks a bit like the supersingular case of \cref{thm:super}.
So we massage our representation into the form seen in the theorem by factoring out a character to get
\[
\rho_p|_{I_{p}} \sim
\begin{pmatrix}
\psi_1^b\psi_2^a & 0 \\
0                & \psi_1^a\psi_2^b
\end{pmatrix} =
\psi_2^a\psi_1^a\begin{pmatrix}
\psi_1^{b-a} & 0 \\
0            & \psi_2^{b-a}
\end{pmatrix} =
\chi_p^a\begin{pmatrix}
\psi_1^{b-a} & 0 \\
0            & \psi_2^{b-a}
\end{pmatrix}
\]
If we were just considering the rightmost matrix we would like to set $k(\rho_p) - 1 = b - a$ to be consistent with the theorem, however we have twisted by $\chi_p^a$.
Recalling \cref{prop:thetafilt} and \cref{prop:theta} we make the definition
\[
k(\rho_p) - 1 = b - a + a(p+1),
\]
or equivalently
\begin{equation}\label{eq:l2}
k(\rho_p) = 1 + pa + b.
\end{equation}

As we have $0 \le a< b \le p-1$ we see that
\[
2\le k(\rho_p) \le 1 + p(p-2) + p-1 = p^2-p.
\]


\subsubsection{The level 1 tame case}\label{sec:l1t}
Assuming $\phi_1$ and $\phi_2$ are of level 1 and that $\rho_p|_{I_{p,w}}$ is trivial we can write
\[
\rho_p |_{I_p} \sim \begin{pmatrix}
\phi_1 & 0 \\
0      & \phi_2 \end{pmatrix} = \begin{pmatrix}
\chi_p^b & 0 \\
0      & \chi_p^a \end{pmatrix}.
\]
For some integers $a$ and $b$ defined modulo $p-1$, we can assume that $0\le a \le b \le p-2$ by switching $\phi_1$ and $\phi_2$ if necessary.
This looks similar to the ordinary case we covered in \cref{thm:ordinary}.
So we factor out a character again to get something that looks exactly like that theorem,
\[
\rho_p|_{I_p} \sim
\begin{pmatrix}
\chi_p^b & 0 \\
0        & \chi_p^a \end{pmatrix}=
\chi_p^a\begin{pmatrix}
\chi_p^{b-a} & 0 \\
0            & 1 \end{pmatrix}.
\]
If we just had the right hand matrix we would want to set $k(\rho_p) - 1 = b-a$, but once again we have a twist.
Taking this into account we try to set
\[
k(\rho_p) - 1 = b - a + a(p+1),
\]
or equivalently
\[
k(\rho_p) = 1 + pa + b
\]
as above, but there is a small issue this time.
It is possible here that $a = b = 0$, in which case this definition would give us $k(\rho_p) = 1$.
However we do not want to consider weight 1 modular forms at all (indeed we only attached Galois representations to eigenforms of weight at least 2), so our formula needs modifying in this case.
Looking at \cref{subsec:char,prop:pm1} we see that it is only permissible to change the weight by multiples of $p-1$.
So to remedy the situation we add $p-1$ when we are in the problem case.
The definition in this case is then
\begin{equation}\label{eq:l1t}
k(\rho_p) = \begin{cases}
1 + pa + b & \text{if }(a,\,b) \ne (0,\,0), \\
         p & \text{if }(a,\,b) = (0,\,0).
\end{cases}
\end{equation}

With this definition we have
\[
2\le k(\rho_p) \le 1 + p(p-2) + p-2 = p^2 - p - 1,
\]
unless $p = 2$, where the above inequality makes no sense, in which case $k(\rho_p) = 2$ is the only possibility.


\subsubsection{The level 1 non-tame case}\label{sec:l1nt}
The final case is where $\phi_1$ and $\phi_2$ are of level 1 but the action of $I_{p,w}$ on $V$ is non-trivial.

If we consider the subspace of $V$ fixed by $I_{p,w}$ the same argument we used in \cref{prop:wildtriv} shows that this subspace is always non-trivial.
However $I_{p,w}$ is assumed to act non-trivially and so $V^{I_{p,w}}$ must in fact be a 1-dimensional subspace.
This subspace is stable under the action of $\Gp$, as is the space $V/V^{I_{p,w}}$, so we may write
\[
\rho_p \sim \begin{pmatrix}
\theta_2 & * \\
0        & \theta_1 \end{pmatrix},
\]
where $\theta_1$ and $\theta_2$ are characters of $\Gp$.

Using this decomposition we see that upon restricting to $I_p$ we have
\[
\rho_p|_{I_p} \sim \begin{pmatrix}
\chi_p^\beta & * \\
0          & \chi_p^\alpha \end{pmatrix},
\]
for some $\alpha,\,\beta \in \ZZ/(p-1)\ZZ$ as the restrictions of $\theta_1$ and $\theta_2$ to $I_p$ must be of level 1.

We fix representatives $\alpha$ and $\beta$ such that
\begin{align*}
0&\le \alpha \le p - 2,\\
1&\le \beta \le p - 1.
\end{align*}
We can then proceed in the same way as we did before,  factoring out a twist by $\chi_p^a$ to get
\[
\rho_p|_{I_{p}} \sim \chi_p^\alpha \begin{pmatrix}
\chi_p^{\beta - \alpha}& * \\
0        & 1 \end{pmatrix}.
\]
As before this looks like \cref{thm:ordinary}, and at first glance it seems as if the representation given by the right hand matrix comes from an eigenform of weight $\beta - \alpha + 1$.
However if $\beta - \alpha = 1$ this representation could have come from a form of weight $\beta - \alpha + p+1$ instead, because $\chi_p = \chi_p^{p}$ and we cannot determine what the correct power of $\chi_p$ is.

This is a real problem as it is incorrect to simply use the smallest weight here (or the largest for that matter).
For example, let $\rho$ be the mod 11 representation arising from the mod 11 reduction of the eigenform $\Delta$ of weight 12.
This representation is unramified outside of 11 and so $N(\rho) =1$.
Then, as the 11th Fourier coefficient of $\Delta$ is $534612\equiv 1 \pmod{11}$, \cref{thm:ordinary} tells us that
\[
\rho|_{I_{11}} \sim \begin{pmatrix} \chi_{11}^{11} & * \\ 0 & 1\end{pmatrix} = \begin{pmatrix} \chi_{11} & * \\ 0 & 1\end{pmatrix}.
\]
So if we use $\beta - \alpha + 1$ for our definition here this would predict the existence of a mod 11 eigenform of weight 2 and level 1 from which $\rho$ arises, but there are no such forms.

If $\beta \ne \alpha + 1$ we do not have this problem, so Serre lets
\begin{align*}
a &= \min(\alpha,\, \beta),\\
b &= \max(\alpha,\, \beta).
\end{align*}
and defines, as we did in \cref{sec:l2},
\begin{equation}\label{eq:l1nt}
k(\rho_p) = 1 + pa + b.
\end{equation}

When $\beta = \alpha + 1$ we have to deal with the ambiguity by finding some way of distinguishing representations that come from eigenforms of weight $2$ and $p + 1$.
In order to decide case we are in we can make use of \cref{thm:mazur}.
This theorem stated that if a Galois representation arises from a filtration $p+1$ form, then the representation at $p$ is tr\`es ramif\'e.

So if $\rho_p$ is peu ramif\'e this cannot be the case, and the twist of $\rho_p$ looks like it came from an eigenform of weight $2$ rather than $p+1$.
We then define $k(\rho_p)$ as we did earlier via
\begin{equation}\label{eq:peu}
k(\rho_p) = 1 + pa + b = 2 + \alpha(p+1).
\end{equation}

If $\rho_p$ is tr\`es ramif\'e then it looks as if the twist comes from a form of weight $p+1$.
So we make an analogous definition to what we have done before, simply accounting for the twist starting from a weight $p+1$ form.
We have one final adjustment to make, if $p=2$ this definition would give $k(\rho_p) = 3$, in Serre's definition this is set to be $4$ instead.
In the end we obtain the following definition
\begin{equation}\label{eq:tres}
k(\rho_p) = \begin{cases}
1 + pa + b + p - 1 = p + 1 + \alpha(p+1) & \text{ if }p\ne 2,\\
4 & \text{ if } p = 2.
\end{cases}
\end{equation}


Looking at the bounds for $k(\rho_p)$ now we see that if $\beta \ne \alpha + 1$ or if $\rho_p$ is peu ramif\'e then for all $p$
\[
2\le k(\rho_p) \le 1 + p(p-2) + p - 1 = p^2 - p.
\]
Otherwise for the tr\`es ramif\'e case we get
\[
2\le k(\rho_p) \le p^2 - p + p -1 = p^2 - 1,
\]
unless $p =2$ where $k(\rho_p) = 4$ is the only possibility.

Considering all cases together we see that $k(\rho_p)$ has range of
\[
2\le k(\rho_p) \le p^2 - 1
\]
for odd $p$, and $k(\rho_p) \in\{2,\,4\}$ for $p=2$.
\\[12pt] \noindent
To see why this approach might be expected to produce the minimal possible weight when twisting is involved depends on analysis of the sequences of filtrations
\[
w(\Theta^i f) \text{ for } 0 \le i \le p + 1,
\]
for mod $p$ eigenforms $f$.
These sequences are known as $\Theta$-cycles and they are studied in detail in \cite{Jochnowitz}.


\subsection{A counterexample}\label{subsec:counter}
In fact the conjecture exactly as stated above is in fact \emph{incorrect}, this was noted by Serre in a letter to Ken Ribet in 1987.
The following counterexample is due to Serre and is given in \cite[sec. 2]{Ribet95} and also in \cite[sec. 21.6.1]{RibetSteinBook} which we are following here (see also the notes for Serre's paper introducing the conjecture in his collected works).

\begin{ex}
Let $\alpha$ be a root of $x^2 + 3x+ 3$, so that $\QQ(\alpha) = \QQ (\sqrt{-3})$.
The space $S_2(13;\,\Zb)$ is spanned by the normalised eigenform
\[
f= q + (-\alpha - 3)q^{2} + (2 \alpha + 2)q^{3} + (\alpha + 2)q^{4} + (-2 \alpha - 3)q^{5} + O(q^{6})
\]
and its $\Gal(\QQ(\alpha)/\QQ)$ conjugate form
\[
g= q + \alpha q^{2} + (-2 \alpha - 4)q^{3} + (-\alpha - 1)q^{4} + (2 \alpha + 3)q^{5} + O(q^{6}),
\]
which is the other normalised eigenform in $S_2(13;\,\Zb)$.

The mod 3 Galois representation attached to $f$ has determinant $\chi_3 \phi$, where $\phi$ is the non-trivial Galois character coming from the extension $\QQ(\sqrt{13})/\QQ$.
We can view $\phi$ as a Dirichlet character $(\ZZ/13\ZZ)^* \to \Fb_3$ using the arguments of \cref{subsec:level}.
Serre's conjecture tells us that this character is our $\varepsilon(\rho)$ and so $\rho$ should arise from some eigenform $f$ in $S_2(13,\,\phi;\,\Fb_3)$.

We claim that no such non-zero $f$ can exist.
First, let $H$ be the group of squares in $(\ZZ/13\ZZ)^*$, i.e. the kernel of $\phi$.
Doing this we may view $f$ as a mod $3$ cusp form of weight 2 for the group
\[
\Gamma_H(13) = \left\{ \begin{pmatrix} a & b\\ c & d\\ \end{pmatrix} \in \SL_2(\ZZ) : c \equiv 0 \pmod{13},\,d \in H\right\}.
\]
Although we did not define mod $p$ modular forms for more general congruence subgroups the definition is the same.
We can see that the order of the character of such a form must divide 2.
However the space of weight 2 cusp forms on $\Gamma_H(13)$ is a subspace of $S_2(13;\,\Fb_3)$.
This larger space is spanned by $\overline{f}$ and $\overline{g}$, both of which have a character of order 6.
So the space of mod $3$ weight 2 cusp forms on $\Gamma_H(13)$ is zero, and $f$ cannot exist.
\end{ex}


This problem is fairly isolated and only arises when we work with mod~2 Galois representations, or mod~3 Galois representations that have abelian restriction to $\Gal(\Qb/\QQ(\sqrt{-3}))$.
In order to fix this issue whilst sticking with our definition of modular forms it is necessary to change the definition of the character in these cases.
This problem is not a serious issue with the validity of the conjecture in general however, if one simply ignores the character completely or is a bit careful in the situations mentioned this can be completely resolved.


\subsection{The proof}\label{subsec:proof}
As mentioned at the start, this conjecture is in fact now a theorem, due to Khare and Wintenberger using results of Kisin and others.

For $p \ne 2$ it was known that the qualitative and refined forms were equivalent before either was known in general.
This reduction of the refined form to the qualitative form was incremental and is due to a large number of people, for example Ribet \cite{Ribet94} lists N. Boston, H. Carayol, F. Diamond, B. Edixhoven, G. Faltings, B. H. Gross, B. Jordan, K. Ribet, H. W. Lenstra, Jr., R. Livn\'e, B. Mazur and J-P. Serre.
The reduction for $p\ne 2$ was completed by Diamond around 1993 \cite{Diamond}.
The missing case of $p=2$ was completed by Khare and Wintenberger as part of their proof of the conjecture.

Many special cases of Serre's conjecture were also known long before the general case.
Indeed in 1973, 14 years before the Duke paper, John Tate proved the conjecture when $p = 2$ for Serre-type representations unramified outside 2 \cite{Tate}.
Serre himself used similar techniques to prove the same thing for $p = 3$ (published as a note on page 710 of volume III of his collected works).
As we will see in \cref{subsec:small}, for these cases what needs to be shown in these proofs is that no such Galois representations exist, to match the lack of cusp forms of level 1 for weights less than 12.
It is interesting to note that these proofs were not completely subsumed or rendered obsolete by the proof of Khare and Wintenberger.
Rather they help form the base case for an induction type argument which gives the full conjecture.


% ************************************************************************
% Examples
% ************************************************************************
\section{Examples}
One of the great things about Serre's conjecture, even if it were not yet known to be correct, is the fact that it can be used in concrete cases easily.
Specifically, given a Galois representation of Serre-type we can calculate the weight and level along with the character as detailed above, then in many instances we can compute the associated space of eigenforms and look for a form from which our Galois representation arises.

\subsection{A Galois representation arising from $\Delta$}
Let's return first to \cref{ex:delt}, which concerned the cusp form $\Delta$, and check that everything we have just done is consistent with what we saw there.

\begin{ex}
We consider the 23-adic Galois representation $\rho_{23}$, as out of the representations we considered there it is the only irreducible one and so the only case in which Serre's conjecture may have something to say.
Recall that this was defined by taking $K$ to be the splitting field of $x^3 - x - 1$, this is ramified only at 23 and has Galois group $S_3$.
We then took $r$ be the unique irreducible degree 2 representation of $S_3$ taken with coefficients in $\QQ_{23}$, this representation satisfies
\[
\tr(r(\sigma)) = \begin{cases}
0 &\text{ if } |\sigma| = 2,\\
2 &\text{ if } |\sigma| = 1,\\
-1 &\text{ if } |\sigma| = 3,
\end{cases}
\]
for each $\sigma \in S_3$.
The Galois representation $\rho_{23}$ was then the composition
\[
\GQ \to \Gal(K/\QQ) \xrightarrow{\sim} S_3 \to \GL_2(\QQ_{23}).
\]
In order to use Serre's conjecture we need a mod $p$ representation so we reduce mod $23$ to obtain $\rho = \overline{\rho}_{23}$.

Straight away we see that $N(\rho) = 1$ due to the fact $K$ is unramified outside 23 (see \cref{rmk:unram}).
The determinant is the sign character $\phi$ of $S_3$, taken with values mod 23 and viewed as a character of $\GQ$.
As $\phi$ is of order exactly 2 it must be the 11th power of $\chi_{23}$.

The inertia group is of order 2, and the wild inertia is trivial.
Additionally, as our whole representation lands in the prime field we are in the level 1 tame case.
The two diagonal characters must be either 1 or $\phi|_{I_{23}}$, we know their product is $\phi|_{I_{23}}$ so we have
\[
\rho_{23}|_{I_{23}} \sim
\begin{pmatrix}
\chi_{23}^{11} & 0 \\
0    & 1 \end{pmatrix}
\]
hence our weight is $k(\rho_{23}) =11 + 1 = 12$.

So $\rho$ should have arisen from a normalised eigenform in
\[
S_{12}(1,\,\Id;\,\Fb_p) = \Fb_p\cdot \overline{\Delta},
\]
as we would expect.
\end{ex}


\subsection{A Galois representation arising from a $D_4$ extension}
Now we move to a new example, once again arising from the Galois group of a number field.

\begin{ex}
Take the $K$ to be the splitting field of
\[
f = x^4 - 3,
\]
so $K = \QQ(\sqrt[4]{3},i)$.
This extension has Galois group $D_4$, generated by some $\sigma$ and $\tau$, where
\[
\sigma(\sqrt[4]{3}) = \sqrt[4]{3},\,\sigma(i) = -i,\,
\tau(\sqrt[4]{3}) = i\sqrt[4]{3},\,\tau(i) = i.
\]

We can turn this into a mod 5 Galois representation using the fact that $D_4$ embeds into $\GL_2(\FF_5)$ via its natural 2-dimensional faithful representation
\[
\sigma \mapsto \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix},\,
\tau \mapsto \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.
\]
So we obtain an irreducible a mod $5$ Galois representation
\[
\rho\colon \Gal(\Qb/\QQ) \to \Gal(K/\QQ) \xrightarrow{\sim} D_4  \to \GL_2(\FF_5).
\]
What does Serre have to say about $\rho$?
After fixing an embedding $K \hookrightarrow \CC$, complex conjugation corresponds to a reflection which maps to a matrix of determinant $-1$, so we see that this representation is indeed odd, and Serre's conjecture will apply.

The determinant of $\rho$ takes $\tau$ to 1 and $\sigma$ to $-1$, so it factors through $\Gal(\QQ(i)/\QQ)$ and corresponds to the non-trivial Dirichlet character of conductor 4.

The conductor $N(\rho)$ will only be divisible by 2 and 3, as these are the primes dividing $\disc K$ and so $\rho$ is unramified outside of these two primes.
We first note that any order 4 subgroup of $D_4$ contains $\tau^2$.
The image of such a subgroup will therefore contain $-I_2$ and hence will have trivial fixed subspace.

For $\ell = 2$ we find the following ramification groups
\begin{align*}
G_{2,0}&= G_{2,1}= V_4,\\
G_{2,2}&= G_{2,3} = \langle \tau^2\rangle,\\
G_{2,4}&= 1.
\end{align*}
and so
\begin{align*}
\dim V/V^{2,0} &= \dim V/V^{2,1} = 2,\\
\dim V/V^{2,2} &= \dim V/V^{2,3} = 2,\\
\dim V/V^{2,4} &= 0.
\end{align*}
Giving $\nu_2(\rho) = 2 \cdot 2 + 2\cdot \frac{1}{2} \cdot 2 = 6$.

For $\ell = 3$ we find the following ramification groups
\begin{align*}
G_{3,0}&= C_4,\\
G_{3,1}&= 1.
\end{align*}
and so
\begin{align*}
\dim V/V^{3,0} &= 2,\\
\dim V/V^{3,1} &= 0.
\end{align*}
Giving $\nu_3(\rho) = 2$.
Together we get that $N(\rho) = 2^{6}3^{2} = 576$.

As $K$ is unramified at 5 the restriction to inertia is trivial.
So we are in the level 1 tame case, with trivial characters, and hence the weight $k(\rho_p)$ is 5.
Finally we let $\varepsilon$ be the extension of the non-trivial Dirichlet character of conductor 4 to a Dirichlet character for $N(\rho)$.

So $\rho$ should come from some eigenform $f \in S_5(576,\,\varepsilon;\,\Fb_5)$.
We can explicitly compute this space using, for example, Sage \cite{Sage}.
Doing this gives us a number of eigenforms, including
\begin{multline*}
q + 2q^{5} + 2q^{13} + 4q^{25} + 3q^{37} + q^{49} + 3q^{61} + 4q^{65} + 3q^{73} + 3q^{97} + O(q^{109}),
\end{multline*}
which we verify has the expected $q$-expansion coefficients $a_p$ for prime $p$ up to $q^{1000}$ (i.e. $2$ if $\Frob_p= 1$, $-2$ if $\Frob_p = \tau^2$ and $0$ otherwise).

\end{ex}


% ************************************************************************
% Consequences
% ************************************************************************
\section{Consequences}
Serre's conjecture is a strong statement that implies many other results within number theory.
We now mention briefly a few of these.
The first two are little more than examples but interesting ones nonetheless, whereas the third is a very deep statement within arithmetic geometry.

\subsection{Finiteness of classes of Galois representations}
First let us examine a very direct consequence.
Fix a prime $p$ and an integer $N$ and consider Serre-type Galois representations
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
whose associated levels $N(\rho)$ divide $N$.
Serre's conjecture states that each corresponds to some normalised mod $p$ eigenform of level $N(\rho)|N$ and weight $k(\rho)$ in the range $[2,p^2-1]$ (or $\{2,\,4\}$ for $p=2$).
However there are only finitely many spaces of forms satisfying these requirements and only finitely many normalised eigenforms in each.
Therefore for each prime $p$ and integer $N$ there are only finitely many isomorphism classes of mod~$p$ Serre-type Galois representations of conductor dividing $N$.
Apparently there are no alternative methods of proving this result currently known \cite{WieseMod}.


\subsection{Unramified mod $p$ Galois representations for small $p$}\label{subsec:small}
We can specialise the previous type of direct argument further to get more control over the number of representations with particular properties.
In fact we can get enough control to prove the following non-existence result.

Let $\rho$ be a Serre-type mod $p$ Galois representation for some $p \le 7$ that is unramified outside of $p$.
In this case, due to the absence of ramification, $N(\rho)$ is simply 1 (recall \cref{rmk:unram}).
The idea of our definition of the weight was that each Galois representation $\rho$ should be the twist by a power of the cyclotomic character of another form $\rho'$, such that $ 2\le k(\rho')\le p + 1$.

So Serre's conjecture predicts there is some mod $p$ cusp form of level 1 and weight $\le 8$ from which some twist of $\rho$ arises.
But there are no cusp forms of level 1 of weight $< 12$ and so such a twisted representation cannot exist, hence the original $\rho$ cannot exist either.
As mentioned in \cref{subsec:proof} this sort of statement was shown via other methods prior to the proof of the full conjecture and provided important theoretical evidence in support of Serre's conjecture.

\iffalse
\subsection{The Artin conjecture}\label{sec:artin}
\begin{defn}
An \emph{Artin representation} is a complex Galois representation
\[
\rho\colon \GQ \to \GL_2(\CC).
\]
\end{defn}

We may consider the $L$-function
\[
L(s, \rho) = \prod_{p} L_p(s,\rho) = \prod_{p} \frac{1}{\det(I_n - p^{-s}\rho(\Frob_p) |_{V^{p,0}})}.
\]

Given any $L(s,\rho)$ we introduce a related function which has a nice functional equation.
We define
\[
\Lambda(s, \rho) = N^{s/2} (2\pi)^{-s} \Gamma(s)L(s,\rho),
\]
where $N$ is the Artin conductor, recalling the notation of \cref{subsec:level} this is given by
\[
N = \prod_{p}p^{\nu_p(\rho)},
\]
the product running over all $p$ now.
This function satisfies
\[
\Lambda(1-s, \rho) = W(\rho)\Lambda(s,\rho),
\]
where $W(\rho)$ is a constant of absolute value 1, called the \emph{Artin root number}.

The following conjecture is a major open question concerning this function that dates back to ???.

\begin{conjecture}[Weak Artin conjecture]
Let
\[
\rho\colon \GQ \to \GL_n(\CC)
\]
be an Artin representation, then the meromorphic continuation of
\[
\Lambda(s,\rho)
\]
to the complex plane is holomorphic on the whole of $\CC$.
\end{conjecture}

In fact this follows from another related conjecture.

\begin{conjecture}[Strong Artin conjecture]
Any Artin representation
\[
\rho\colon \GQ \to \GL_n(\CC),
\]
is modular, in the sense that it
\end{conjecture}

As the $L$-function of a ??

\begin{prop}
Serre's conjecture implies the strong Artin conjecture for odd 2-dimensional Artin representations.
\end{prop}
\begin{proof}
Given an odd Artin representation
\[
\rho\colon \GQ\to \GL_2(\CC)
\]

\end{proof}

\fi

\subsection{Modularity of abelian varieties}
In this section we look briefly at another strong result that follows from Serre's conjecture, despite it not obviously concerning the objects related in the conjecture.
This was a genuinely new result that was not known before the proof of Serre's conjecture and so serves as a good example of the usefulness of the conjecture outside of its immediate domain.
Going into detail would take us too far afield so this section is necessarily sketch-like and without background material.
For more details see \cite{Ribet04} or \cite[chap. 15]{RibetSteinBook}.

\begin{defn}
An abelian variety $A$ over $\QQ$ is \emph{modular} if there exists an $N$ such that there is a surjective map defined over $\QQ$
\[
J_1(N) \twoheadrightarrow A.
\]
\end{defn}

\begin{defn}
An abelian variety $A$ defined over $\QQ$ is said to be of \emph{$\GL_2$-type} if it is simple and its endomorphism algebra
\[
\QQ\otimes \End_{\QQ}(A)
\]
contains a number field $E$ whose degree is equal to the dimension of $A$.
\end{defn}

\begin{ex}
Elliptic curves are of $\GL_2$-type because all endomorphism rings over characteristic 0 fields of elliptic curves contain $\ZZ$ and hence
\[
\QQ \subset \QQ \otimes \End_\QQ(E).
\]
\end{ex}

Ken Ribet has shown \cite{Ribet04} that Serre's conjecture implies the following nice classification of modular abelian varieties.

\begin{thm}
Every abelian variety of $\GL_2$-type is modular.
\end{thm}

This theorem is a generalisation of the Taniyama--Shimura--Weil conjecture, or modularity theorem, first proved by Breuil, Conrad, Diamond and Taylor in 2001 \cite{BCDT}.
However this modularity statement is significantly stronger and uses the full power of Serre's conjecture.
Indeed this theorem was not known prior to the proof of the conjecture.


\bibliographystyle{alpha}
\bibliography{essay}

\end{document}