-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathessay.tex
1552 lines (1322 loc) · 74.3 KB
/
essay.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\documentclass[a4paper,12pt]{article}
\usepackage{partiiiessay}
\title{\vspace{-40pt}Serre's conjecture}
\author{\vspace{-10pt}Alex J. Best}
\date{June 2015}
% used https://www.overleaf.com/2460029rpwpbr#/6413425/
\begin{document}
\maketitle
\vspace{-45pt}
\tableofcontents
\clearpage
% ************************************************************************
% Introduction
% ************************************************************************
\section{Introduction}
In 1987 Jean-Pierre Serre published a paper \cite{Serre87}, ``Sur les repr\'esentations modulaires de degr\'e 2 de $\Gal(\Qb/\QQ)$'', in the Duke Mathematical Journal.
In this paper Serre outlined a conjecture detailing a precise relationship between certain mod $p$ Galois representations and specific mod $p$ modular forms.
This conjecture and its variants have become known as Serre's conjecture, or sometimes \emph{Serre's modularity conjecture} in order to distinguish it from the many other conjectures Serre has made.
The conjecture has since been proven correct by the work of numerous people, culminating with that of Khare--Wintenberger and Kisin, published in 2009 \cite{KWI,KWII,Kisin}.
Here we provide a motivated account of the original form of the conjecture before going on to compute some explicit examples and examining some interesting consequences.
Beyond Serre's paper there are many very good accounts of his statement, including articles by Darmon \cite{Darmon} and by Ribet and Stein \cite{RibetStein} who also have a book chapter on the topic \cite{RibetSteinBook}.
In 1992 Bas Edixhoven wrote a paper \cite{EdixhovenWeight} in which he gave a slight alteration of Serre's conjecture.
There are additional articles covering this version of the conjecture, such as Cais \cite{Cais} and Edixhoven \cite{Edixhoven}.
Finally Alex Ghitza has prepared a translation of part of Serre's paper \cite{Ghitza} which has been helpful.
These articles were of great help when preparing the current essay and many of the ideas used here are contained in at least one of them.
% ************************************************************************
% Background
% ************************************************************************
\section{Background}
We begin by fixing several definitions and key results that will be relevant when discussing Serre's conjecture.
\subsection{Modular forms}
In the interests of space we assume material relating to classical modular forms.
Here we only look at the passage to \emph{mod $p$ modular forms}, as there is some amount of choice in how these forms are defined and they form a key part of Serre's conjecture.
We use Serre and Swinnerton-Dyer's original approach to define mod $p$ modular forms, this is detailed in \cite{SD,Serre73,Serre73Formes}.
This approach is a simple one and it allows us to talk about congruences involving $q$-expansions and their coefficients without getting too bogged down with notation.
\begin{defn}
Given a subring $R$ of $\CC$ we let $S_k(N,\,\varepsilon;\,R)$ be the space of cusp forms of weight $k$, level $N$ and character $\varepsilon\colon (\ZZ/N\ZZ)^* \to R$, whose $q$-expansion coefficients lie in $R$.
Now fix a prime $p\nmid N$.
Given a character
\[
\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*
\]
we may lift to a character
\[
\hat{\varepsilon}\colon (\ZZ/N\ZZ)^* \to \Zb^*,
\]
with values in the prime to $p$ roots of unity.
Additionally fix a place $v$ of $\Qb$ above $p$, this place gives us a reduction map $\Zb \to \Fb_p$.
We let the space of \emph{cuspidal mod $p$ modular forms} of weight $k$, level $N$ and character $\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*$ be the subspace of $\Fb_p[[q]]$ obtained by reducing the $q$-expansions of forms in $S_k(N,\,\hat{\varepsilon};\,\Zb)$ using our place $v$.
We denote this space by
\[
S_k(N,\,\varepsilon;\,\Fb_p).
\]
Taking the union over all characters $\varepsilon$ as above gives us the space of all mod $p$ cusp forms of weight $k$ and level $N$,
\[
S_k(N;\,\Fb_p).
\]
We can in the same way define the full (non-cuspidal) space of mod $p$ modular forms, along with mod $p$ modular forms for more general congruence subgroups.
But we don't need to consider such forms in this essay so we restrict to cusp forms for $\Gamma_1$ to keep things concise.
\end{defn}
Many notions defined for normal modular forms descend to mod $p$ modular forms in the natural way.
\begin{defn}
The standard Hecke operators act on $q$-expansions in a way that preserves each space $S_k(N,\,\varepsilon;\,\Zb)$ (this action is recalled in the proof of \cref{prop:thetaeigen}).
So we may define the action of the Hecke operators on a mod $p$ modular form $f$ by letting them act on a lift of $f$ and then reducing the $q$-expansion mod $p$ again.
Equivalently we may just define the action on $q$-expansions to be given by the same formulae as for normal modular forms.
\end{defn}
\begin{defn}
As for standard modular forms, we say a mod $p$ cusp form $f = \sum_{n\ge 1} a_n q^n$
is \emph{normalised} if $a_1 = 1$.
\end{defn}
An interesting difference between the world of standard modular forms and that of mod $p$ modular forms comes from the fact that distinct modular forms may reduce mod $p$ to have exactly the same $q$-expansion.
\begin{ex}
Using Sage \cite{Sage} we find the following example, let
\begin{align*}
f &= q - q^{2} - 2q^{3} - 7q^{4} + 16q^{5} + 2q^{6} - 7q^{7} + O(q^{8}) \in S_{4}(7,\,\Id;\,\ZZ),\\
g &= q - 6q^{2} - 42q^{3} - 92q^{4} - 84q^{5} + 252q^{6} + 343q^{7} + O(q^{8}) \in S_{8}(7,\,\Id;\,\ZZ),
\end{align*}
then if we reduce mod $5$ we see that
\begin{align*}
\overline{f} &= q + 4q^{2} + 3q^{3} + 3q^{4} + q^{5} + 2q^{6} + 3q^{7} + O(q^{9}) \in S_{4}(7,\,\Id;\,\Fb_5),\\
\overline{g} &= q + 4q^{2} + 3q^{3} + 3q^{4} + q^{5} + 2q^{6} + 3q^{7} + O(q^{9}) \in S_{8}(7,\,\Id;\,\Fb_5),
\end{align*}
which are indeed equal up to this precision.
\end{ex}
This means that we need to be careful if we are just given a $q$-expansion corresponding to some mod $p$ modular form as there is no longer necessarily a single weight attached that form.
There is a useful necessary condition controlling when this sort of behaviour can happen however.
\begin{prop}\label{prop:pm1}
If $f$ and $g$ are two non-zero mod $p$ modular forms of weights $k$ and $k'$ respectively, whose $q$-expansions are equal, then
\[
k \equiv k' \pmod{p-1}.
\]
\end{prop}
\begin{proof}
See \cite{Serre73}.
\end{proof}
In fact for $p\ge 5$ it is always the case that $S_{k}(N;\,\Fb_p)\subset S_{k + p - 1}(N;\,\Fb_p)$ \cite{Serre73Formes}.
Due to this behaviour the concept of weight is not particularly well defined for mod $p$ modular forms, so we introduce the notion of a \emph{filtration} to help us deal with the ambiguity.
\begin{defn}\label{def:filtration}
The \emph{filtration} of a mod $p$ cusp form $f$ of level $N$ is the minimal $k\ge 0$ for which $f\in S_k(N;\,\Fb_p)$.
We denote this by $w(f)$.
\end{defn}
Now we look at an important operator on the space of mod $p$ modular forms, which we shall study more in \cref{sec:galassoc}.
\begin{defn}\label{def:theta}
The $\Theta$ operator is defined on (cuspidal) mod $p$ modular forms via its action on $q$-expansions, which is
\[
\Theta\left(\sum_{n\ge 0} a_nq^n\right) = q\ddq \left(\sum_{n\ge 0} a_nq^n\right) = \sum_{n\ge 0} na_nq^n.
\]
\end{defn}
It is not clear from this definition that the resulting $q$-expansion should always correspond to another modular form, but this is indeed the case.
In fact many of the properties of the cusp form are preserved by $\Theta$.
\begin{prop}\label{prop:thetafilt}
If $f$ is a mod $p$ cusp form of filtration $w(f) = k$, then $\Theta(f)$ is also a mod $p$ cusp form of the same level and character and has filtration
\[
w(\Theta(f)) = \begin{cases}
k + p + 1 &\text{ if } p\nmid k,\\
k + p + 1 - n(p-1),\, n\ge 1&\text{ if } p\mid k.
\end{cases}
\]
\end{prop}
\begin{proof}
See \cite{Serre73} and also \cite{Jochnowitz} for more detail about how the filtration lowers in the $p\mid k$ case.
\end{proof}
We see straight away from the definition of the action that $\Theta$ preserves the set of normalised mod $p$ cusp forms.
\begin{prop}\label{prop:thetaeigen}
$\Theta$ semicommutes with the Hecke operators $T_{\ell}$ (specifically we have $T_{\ell}\Theta = \ell\Theta T_{\ell}$), and hence $\Theta$ preserves eigenforms.
\end{prop}
\begin{proof}
The Hecke operators $T_{\ell}$ on $S_k(N,\,\varepsilon;\,\Fb_p)$ act on $q$-expansions by
\[
T_{\ell}\left(\sum_{n\ge 1} a_nq^n\right) = \begin{cases}
\sum_{n\ge 1} a_{\ell n}q^n + \ell^{k-1}\varepsilon(\ell)\sum_{n\ge 1} a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} a_{\ell n}q^n &\text{ if }\ell \mid N.
\end{cases}
\]
We let $f = \sum_{n\ge 1} a_nq^n\in S_k(N,\,\varepsilon;\,\Fb_p)$ and calculate
\[
\Theta T_{\ell} f = \begin{cases}
\sum_{n\ge 1} na_{\ell n}q^n + \ell^{k-1}\varepsilon(\ell)\sum_{n\ge 1} \ell n a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} na_{\ell n}q^n &\text{ if }\ell \mid N,
\end{cases}
\]
and
\[
T_{\ell}\Theta f = \begin{cases}
\sum_{n\ge 1} \ell n a_{\ell n}q^n + \ell^{k+p+1-1} \varepsilon(\ell)\sum_{n\ge 1} n a_{n}q^{\ell n} &\text{ if }\ell \nmid N,\\
\sum_{n\ge 1} \ell n a_{\ell n}q^n &\text{ if }\ell \mid N.
\end{cases}
\]
As we are working in characteristic $p$ here $\ell^{k+p} = \ell^{k+1}$, so
\[
T_{\ell} \Theta = \ell \Theta T_{\ell},
\]
thus if $f$ is an eigenform for the $T_\ell$ then $\Theta f$ is an eigenform too.
However the eigenvalue for each $T_\ell$ is $\ell$ times the original.
\end{proof}
\subsection{Galois representations}\label{sec:gals}
We now move on to the next major component of Serre's conjecture, Galois representations.
As above we mostly concern ourselves with fixing definitions and recalling important results that shall be needed later.
There are many good references for this type of material, for example \cite{DDT,WieseGal}.
\begin{defn}
An \emph{$n$-dimensional mod $p$ Galois representation} is a homomorphism
\[
\rho\colon \GQ \to \GL_n(\Fb_p).
\]
Similarly, an \emph{$n$-dimensional $p$-adic Galois representation} is a homomorphism
\[
\rho\colon \GQ \to \GL_n(\Qpb).
\]
Unless stated otherwise the term Galois representation will refer to a mod~$p$ Galois representation.
\end{defn}
Recall that $\GQ$ is defined as the inverse limit of $\Gal(K/\QQ)$ as $K$ ranges over all number fields.
So the group $\GQ$ naturally has the profinite topology, where the open subgroups are the subgroups of finite index.
We demand that all of our mod $p$ representations be continuous with respect to this topology and the discrete topology on $\GL_n(\Fb_p)$.
\begin{rmk}\label{rmk:ctsfin}
The continuity condition for mod $p$ Galois representations reduces to having an open kernel, so continuous mod $p$ Galois representations always have finite image.
\end{rmk}
We deal mostly with 1 and 2 dimensional mod $p$ Galois representations.
Those of dimension 1 (i.e. maps $\phi\colon\GQ \to \Fb_p^*$) are called \emph{characters}.
Given a 2-dimensional mod $p$ representation $\rho\colon G\to \GL_2(\Fb_p)$ we often use the notation
\[
\rho \sim \begin{pmatrix}
\alpha & \beta \\
\gamma & \delta
\end{pmatrix},
\]
where $\alpha,\,\beta,\,\gamma$ and $\delta$ are functions $G \to \Fb_p$, to indicate that there is some $A\in\GL_2(\Fb_p)$ such that for every $\sigma\in G$
\[
\rho(\sigma) = A
\begin{pmatrix}
\alpha(\sigma) & \beta(\sigma) \\
\gamma(\sigma) & \delta(\sigma)
\end{pmatrix}A^{-1}.
\]
\begin{defn}
Let $\rho$ be a mod $p$ Galois representation and $\phi$ be a mod $p$ Galois character.
We can form a new mod $p$ Galois representation of the same dimension as $\rho$ by taking the product of the images for each element of $\GQ$.
This is called the \emph{twist} of $\rho$ by $\phi$, and is denoted $\phi\otimes \rho$.
\end{defn}
\begin{defn}
Let $\phi\colon\GQ \to K^*$ be a character for some field $K$ and fix an embedding $\Qb\hookrightarrow \CC$.
We may then view complex conjugation as an element $c\in \GQ$, looking at its image $\phi(c)$ we see it is an element of order 2 in $K^*$, so $\phi(c)$ must be $\pm 1$.
If $\phi(c) = -1$ we say $\phi$ is \emph{odd}, otherwise we say $\phi$ is \emph{even} (though we shall mostly be concerned with distinguishing odd representations here).
Now given any Galois representation
\[
\rho\colon\GQ \to \GL_n(K),
\]
we define the parity of $\rho$ to be that of the character $\det\rho$.
\end{defn}
Due to \cref{rmk:ctsfin} all of our Galois representations will factor through finite extensions, and so our understanding of finite Galois groups will be essential in what follows.
The next few definitions and results will allow us to talk about the properties of finite Galois extensions of $\QQ$ and $\QQ_p$.
\begin{defn}\label{def:higher}
Given a finite Galois extension $K/\QQ$ we may define a series of subgroups of $G = \Gal(K/\QQ)$ that measure the ramification of $K/\QQ$ at a prime $\ell$.
Let $\mathfrak{L}$ be a prime of $K$ above $\ell$ and let
\[
D_\ell =\{\sigma \in G : \sigma(\mathfrak{L}) \subset \mathfrak{L}\}
\]
be the \emph{decomposition group} at $\ell$ of $G$.
Now take $i$ to be a non-negative integer, we define the $i$th \emph{higher ramification group} to be
\[
G_{\ell,i} = \{\sigma \in D_\ell : \sigma(x) - x \in \mathfrak{L}^{i + 1}\ \forall x \in \cO_{K} \}.
\]
These groups tell us how much of the whole of $G$ we are seeing if we just look modulo powers of $\mathfrak{L}$.
They form a descending chain as $i$ increases
\[
D_\ell\supseteq G_{\ell,0} \supseteq G_{\ell,1} \supseteq G_{\ell, 2}\supseteq\cdots.
\]
\end{defn}
Now we recall a result of Kummer theory, which tells us how to obtain abelian Galois extensions of certain fields, see \cite{Birch} for the full story.
\begin{prop}\label{prop:kummer}
Suppose $K$ is a field containing all $n$th roots of unity, then there is a bijection
\[
\{\text{subgroups }H \subset K^*/(K^*)^n\} \leftrightarrow \{\text{abelian extensions } L| K : \Gal(L/K)^n = 1\},
\]
defined by sending $H\mapsto K(\sqrt[n]{H})$ and $K^* \cap (L^*)^n\mapsfrom L$.
The notation $K(\sqrt[n]{H})$ simply means we adjoin $n$th roots of all elements of $H$ to $K$.
\end{prop}
Although our main objects of study are representations $\GQ$ it will be very useful for us to take a prime $\ell$ and also consider representations of
\[
\Gl = \Gal(\Qb_\ell/\QQ_\ell).
\]
Such representations can be obtained from those of $\GQ$ using an inclusion
\[
\Qb \hookrightarrow \Qlb
\]
to define a restriction map
\[
\Gl \to \GQ.
\]
In fact due to Krasner's lemma \cite[p. 238]{Cohen} the map $\Gl \to \GQ$ is injective and so we may view $\Gl$ as a subgroup of $\GQ$.
The way this subgroup sits inside $\GQ$ depends on the choice of embedding $\Qb \hookrightarrow \Qlb$ and varies by conjugation as this embedding changes.
The group $\Gl$ has several subquotients which will be helpful for us to study restrictions of representations to.
\begin{defn}\label{def:inert}
The ring of integers of $\Qlb$ is stable under the action of $\Gl$, as is its unique maximal ideal.
So we get an action of $\Gl$ on the residue field, this field may be identified with $\Fb_p$.
We therefore obtain a map
\[
\Gl \to \Gal(\Fb_\ell/\FF_\ell).
\]
The \emph{inertia subgroup} $I_\ell$ is defined to be the kernel of this map.
The group $\Gal(\Fb_\ell/\FF_\ell)$ is topologically cyclic, generated by the Frobenius morphism $x \mapsto x^\ell$.
In fact the above map $\Gl \to \Gal(\Fb_\ell/\FF_\ell)$ is surjective and so we may let $\Frob_\ell\in \Gl$ be a preimage of this morphism under the restriction map, we call such elements \emph{Frobenius elements}.
Next the \emph{wild} inertia group $I_{\ell,w}$ is the maximal pro-$\ell$-subgroup of $I_\ell$ and the \emph{tame} inertia group is the quotient
\[
I_{\ell,t} = I_\ell / I_{\ell,w}.
\]
\end{defn}
The groups we have been looking at give rise to Galois extensions of $\Ql$.
The fixed field of $I_\ell$ is the maximal non-ramified extension of $\Ql$, denoted $\Ql^\nr$.
Similarly, the fixed field of $I_{\ell,w}$ is the maximal tamely-ramified extension of $\Ql$, denoted $\Ql^\tamr$.
So the full set-up looks as follows
\[
\xymatrix{
\Qlb \ar@{-}[d]^{I_{\ell,w}} \ar@{-}@/_3pc/[ddd]_{\Gl} \ar@{-}@/^2.5pc/[dd]^{I_{\ell}}\\
\Ql^\tamr \ar@{-}[d]^{I_{\ell,t}} \\
\Ql^\nr \ar@{-}[d] \\
\Ql
}
\]
The field $\QQ_{\ell}^{\tamr}$ is generated by the fields $\Ql^\nr(\sqrt[n]{\ell})$ \cite{Frohlich} and so, using \cref{prop:kummer} the tame inertia $I_{\ell,t}$ may be identified with
\[
\lim_{\longleftarrow} \Gal(\Ql^\nr(\sqrt[n]{\ell})/\Ql^\nr)= \lim_{\longleftarrow} \FF_{\ell^n}^*,
\]
where the maps in the last inverse limit are the norm maps.
\begin{defn}
We say a Galois representation $\rho$ is \emph{unramified} at $\ell$ if $\rho|_{I_\ell}$ is trivial.
Otherwise, we say $\rho$ is \emph{ramified} at $\ell$.
Similarly we say $\rho$ is \emph{tamely ramified} at $\ell$ if $\rho|_{I_{\ell,w}}$ is trivial.
\end{defn}
The usefulness of the Frobenius elements for us stems in part from the following theorem.
\begin{thm}\label{thm:cheb}
Let $S$ be a finite set of primes, then:
\begin{enumerate}
\item A semi-simple mod $p$ representation
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
is determined by the values of $\tr\rho(\Frob_\ell)$ and $\det\rho(\Frob_\ell)$ for all $\ell\not\in S$ at which $\rho$ is unramified.
\item A mod $p$ character
\[
\phi\colon \GQ \to \Fb_p^*
\]
is determined by the values of $\phi(\Frob_\ell)$ for all $\ell\not\in S$ at which $\phi$ is unramified.
\end{enumerate}
\end{thm}
\begin{proof}
See \cite[prop. 25]{DDT}.
\end{proof}
\begin{defn}
Each character
\[
\phi \colon \GQ \to \Fb_p^*
\]
has finite image and so factors through some $\FF_{p^n}^*$, the smallest $n$ for which this can happen is called the \emph{level} of the character.
\end{defn}
For each $n \ge 1$ we now distinguish $n$ special mod $p$ characters of $I_{p,t}$ of level $n$, these will allow us to describe all such characters of a particular level.
\begin{defn}\label{def:fund}
The identification
\[
I_{p,t} = \lim_{\longleftarrow} \FF_{p^n}^*
\]
gives us a natural map
\[
\psi_n\colon I_{p,t} \to \FF_{p^n}^*
\]
for each $n$.
The \emph{fundamental characters} of level $n$ are defined by extending $\psi_n$ to an $\Fb_p$-character of $I_{p,t}$ via the $n$ embeddings $\FF_{p^n} \hookrightarrow \Fb_p$.
While any individual fundamental character is not canonically defined, the set of all of them of a particular level is.
\end{defn}
\begin{rmk}\label{rmk:prodchar}
The embeddings are all obtained from any chosen one by applying Frobenius and as such the product of all fundamental characters of level $n$ is the same as the composition of the norm map $\FF_{p^n}^* \to \FF_p^*$ with any one.
So this product will always be the unique fundamental character of level 1.
\end{rmk}
\begin{prop}
The fundamental characters of level $n$ generate the set of all characters of level $n$.
\end{prop}
\begin{proof}
See \cite[prop. 5]{Serre72}.
\end{proof}
As any character $\phi\colon\GQ \to \Fb_p^*$ factors through an abelian quotient, the Kronecker--Weber theorem tells us that any such character factors as
\[
\phi\colon\GQ \to \Gal(\QQ(\zeta_N)/\QQ)\cong (\ZZ/N\ZZ)^*\xrightarrow{\phi'}\Fb_p^*,
\]
where $\zeta_N$ is a primitive $N$th root of unity.
We can also use this factorisation to extend any Dirichlet character to a character of the absolute Galois group.
Thus characters of the absolute Galois group are in bijection with Dirichlet characters
\[
(\ZZ/N\ZZ)^* \xrightarrow{\phi}\Fb_p^*.
\]
\begin{defn}
The identity map
\[
(\ZZ/p\ZZ)^* \to \FF_p^* \hookrightarrow \Fb_p^*
\]
is a Dirichlet character and thus gives us a character of $\GQ$.
This character is called the \emph{mod $p$ cyclotomic character}, and denoted $\chi_p$.
\end{defn}
\begin{rmk}
Taking any $\sigma\in\GQ$ the definition above is saying that $\sigma$ sends
\[
\zeta_p\mapsto \zeta_p^{\chi_p(\sigma)},
\]
where $\zeta_p$ is a primitive $p$th root of unity.
If we take some $\ell \ne p$ prime and denote reduction mod $\ell$ of $\ZZ[\zeta_p]$ by $\overline{\ \cdot\ }$ we have that
\[
\overline{\Frob_\ell(\zeta_p)} = \overline{\zeta_p}^\ell,
\]
and so as $\chi_p$ takes values mod $p$ we find that
\[
\chi_p(\Frob_\ell) = \ell.
\]
Now if we fix an embedding $\Qb\hookrightarrow\CC$ and consider complex conjugation as some element $c\in\GQ$ we see that it takes $\zeta_p \mapsto\zeta_p^{-1}$ and hence
\[
\chi_p(c) = -1,
\]
so $\chi_p$ is an example of an odd character.
Finally, $\chi_p$ was defined to factor through $\Gal(\QQ(\zeta_p)/\QQ)$, this extension is tamely ramified and hence so is $\chi_p$.
Therefore $\chi_p$ passes to a character on the group $I_{p}/I_{p,w} = I_{p,t}$.
Straight from the definition we see that $\chi_p$, when viewed in this way, is an example of a level 1 character of $I_{p,t}$, but in fact more is true.
\end{rmk}
\begin{prop}
The fundamental character of level 1 is the cyclotomic character $\chi_p$ (or rather its induced character on $I_{p,t}$).
\end{prop}
\begin{proof}
See \cite[prop. 8]{Serre72}.
\end{proof}
\begin{defn}
We now classify Galois representations
\[
\rho\colon\Gp\to\GL_2(\Fb_p)
\]
of the form
\[
\rho\sim
\begin{pmatrix}
\chi_p\varepsilon_1 & * \\
0 & \varepsilon_2\end{pmatrix},
\]
for some unramified characters $\varepsilon_1$ and $\varepsilon_2$.
Let $K$ be the fixed field of the kernel of $\rho$.
There is a unique maximal tamely ramified extension $K_t$ of $\Qp^{\nr}$ that is contained inside of $K$.
These fields fit together as follows
\[
\xymatrix{
K \ar@{-}[d]|{\rho(I_{p,w})} \ar@{-}@/_2.5pc/[dd]_{\rho(I_{p})}\\
K_t \ar@{-}[d] \\
\Qp^\nr \ar@{-}[d]\\
\Qp \\
}
\]
As
\[
\Gal(K_t/\QQ_p^{\nr}) = (\ZZ/p\ZZ)^*
\]
we may write
\[
K_t = \QQ_p^{\nr}(z),
\]
where $z$ is a primitive $p$th root of unity.
If we look at $\Gal(K/K_t)$ we see that
\[
\Gal(K/K_t) = \rho_p(I_{p,w})
\]
consists only of elements of the form
\[
\begin{pmatrix}
1 & * \\
0 & 1 \end{pmatrix}.
\]
So $\Gal(K/K_t)$ is a finite elementary abelian $p$-group and hence isomorphic to $(\ZZ/p\ZZ)^m$ for some $m$.
Applying \cref{prop:kummer} we see that $K$ is in fact given by
\[
K = K_t(x_1^{1/p},\ldots,x_m^{1/p}),
\]
where in fact we may take the $x_i\in\Qp^\nr$.
The valuations of these $x_i$ will determine our classification.
If
\[
\nu_p(x_i) \equiv 0 \pmod{p}
\]
for all $i$ then we say that $\rho$ is \emph{peu ramif\'e}, otherwise if any of the $\nu_p(x_i)$ is coprime to $p$ then we say it is \emph{tr\`es ramif\'e}.
\end{defn}
Finally we introduce a notion that will allow us to obtain semisimple representations from arbitrary Galois representations.
\begin{defn}\label{def:semisimp}
The \emph{semisimplification} of a 2-dimensional representation $\rho$ is another representation, denoted $\rss$, that is obtained as follows.
If $\rho$ is irreducible (and hence semisimple) we leave it as it is and set $\rss = \rho$.
Otherwise if $\rho$ is reducible we know that there is some subspace on which $\rho$ acts via a character and so we may write
\[
\rho \sim
\begin{pmatrix}
\phi_1 & * \\
0 & \phi_2
\end{pmatrix}.
\]
The semisimplification $\rss$ is then the representation given by
\[
\begin{pmatrix}
\phi_1 & 0 \\
0 & \phi_2
\end{pmatrix},
\]
conjugated in the same way $\rho$ was.
This is indeed semisimple.
In general the process of semisimplification is analogous, it is obtained by taking the direct sum of the Jordan--H\"older constituents of a representation, though for us the above description suffices.
\end{defn}
% ************************************************************************
% Obtaining Galois representations from modular forms
% ************************************************************************
\section{Obtaining Galois representations from modular forms}
The two concepts just introduced, modular forms and Galois representation, appear at first glance not to be particularly related to each other.
However in reality they are inextricably linked, and exploring some of the links between them will be the goal of the rest of this essay.
\subsection{Congruences for Ramanujan's $\tau$ function}
We will start with a historically important example that provides the first glimpse of the sort of connection we will be looking at in this essay.
\begin{ex}\label{ex:delt}
Let
\[
\Delta = \sum_{n \ge 1} \tau(n) q^n
\]
be the unique normalised cusp form of weight 12 for $\Gamma_1(1) = \SL_2(\ZZ)$.
The coefficients of this $q$-expansion were studied in detail by Ramanujan, who made many influential conjectures concerning them.
The function $n \mapsto \tau(n)$ is now known as the Ramanujan $\tau$ function.
The properties of this function provide the first glimpses of behaviours occurring for more general systems of Hecke eigenvalues.
Various people, including Ramanujan in the mod 691 case, found congruences involving the coefficients $\tau(\ell)$ modulo powers of primes for prime $\ell$.
Below are a few examples, though others do exist for higher powers of these primes.
\begin{align}
\tau(\ell) &\equiv 1 + \ell^{11} \pmod{2^8}\text{, if } \ell \ne 2,\label{eq:tau2}\\
\tau(\ell) &\equiv \ell^2 + \ell^9 \pmod{3^3}\text{, if } \ell \ne 3,\label{eq:tau3}\\
\tau(\ell) &\equiv \ell + \ell^{10} \pmod{5^2},\label{eq:tau5}\\
\tau(\ell) &\equiv \ell + \ell^4 \pmod{7},\label{eq:tau7}\\
\tau(\ell) &\equiv\left.\begin{cases}
0\pmod{23} & \text{ if } \left(\frac{\ell}{23}\right) = -1,\\
2\pmod{23} & \text{ if }\ell\text{ is of the form } u^2 + 23v^2,\\
-1\pmod{23} & \text{ otherwise},\\
\end{cases}\right\}\text{ if } \ell \ne 23,\label{eq:tau23}\\
\tau(\ell) &\equiv 1 + \ell^{11} \pmod{691}.\label{eq:tau691}
\end{align}
The original proofs of these congruences were in many cases quite involved and did not all work in the same manner.
Indeed Peter Swinnerton-Dyer writes in \cite{SD} that these proofs ``do little to explain why such congruences occur''.
So in order to try to explain all of the congruences in a unified manner, Serre predicted \cite{Serre67} for each prime $p$ the existence of a $p$-adic Galois representation
\[
\rho_p\colon \GQ \to \GL_2(\Qp)
\]
such that
\begin{enumerate}
\item $\tr(\rho_p (\Frob_\ell)) = \tau(\ell)$ for all $\ell \ne p$,\label{item:trace}
\item $\det(\rho_p (\Frob_\ell)) = \ell^{11}$ for all $\ell \ne p$.\label{item:det}
\end{enumerate}
The congruences would then follow from these Galois representations being of specific forms.
For example \cref{eq:tau2,eq:tau3,eq:tau5,eq:tau7,eq:tau691} can all be obtained from these Galois representations if the $\rho_p$ satisfy
\[
\rho_p \equiv \begin{pmatrix}
\chi_p^a & * \\
0 & \chi_p^{11-a}
\end{pmatrix}\pmod{p^b},
\]
where $a$ is $0,\,2,\,1,\,1$ or $0$ respectively and $b$ is as in the original congruences in each case.
Here in each case we can see that $\det\rho_p \equiv \chi_p^{11}$, which is consistent with \cref{item:det} above, and knowing \cref{item:trace} would give us the desired congruences.
Serre's prediction for the representation $\rho_{23}$ has a more interesting form, but nevertheless the images of Frobenius elements can be described explicitly.
Following Serre we take $K$ to be the splitting field of $x^3 - x - 1$, this is ramified only at 23 and has Galois group $S_3$.
We then let $r$ be the unique irreducible degree 2 representation of $S_3$ taken with coefficients in $\QQ_{23}$, this satisfies
\[
\tr(r(\sigma)) = \begin{cases}
0 &\text{ if } |\sigma| = 2,\\
2 &\text{ if } |\sigma| = 1,\\
-1 &\text{ if } |\sigma| = 3,
\end{cases}
\]
for each $\sigma \in S_3$.
As $\Gal(K/\QQ)$ is a quotient of $\GQ$ the representation $r$ extends to a representation of $\GQ$.
If some $\rho_{23}$ exists satisfying the conditions above it being isomorphic to $r$ then gives rise to \cref{eq:tau23} in the same way as before.
Shortly after Serre hypothesised their existence Pierre Deligne constructed the representations $\rho_p$ for all primes $p$ \cite{Deligne}.
In doing so Deligne also reduced another conjecture of Ramanujan's, that $|\tau(p)| \le 2p^{11/2}$ for all $p$, to the Weil conjectures.
This provided an indication that relating questions regarding $\tau$ to Galois representations is a natural and effective way of working on these questions.
Being able to compute these associated representations makes it possible to read off many more congruences for $\tau(n)$ (see, for example, \cite{Mascot}).
\end{ex}
\subsection{Attaching Galois representations to general eigenforms}
Given the above example it is natural to wonder whether such a relationship holds more generally.
Indeed Serre also asked if one could associate to each normalised cuspidal eigenform a Galois representation whose traces of Frobenius elements match the $q$-expansion coefficients.
Serre's conjectures on this led to the following more general theorem, which we state for mod $p$ modular forms now.
\begin{thm}[Deligne]\label{thm:assoc}
Let $p$ be a prime, $k \ge 2$, $N \ge 1$ an integer coprime to $p$ and $\varepsilon\colon (\ZZ/N\ZZ)^* \to \Fb_p^*$. Given a normalised eigenform $f\in S_k(N,\,\varepsilon;\,\Fb_p)$ which has $q$-expansion coefficients $a_i$, there exists a two-dimensional mod $p$ Galois representation $\rho_f$ such that
\begin{enumerate}[(i)]
\item $\rho_f$ is semi-simple,
\item $\rho_f$ is unramified outside $Np$,
\item $\tr(\rho_f (\Frob_\ell)) = a_\ell$ for all $\ell \nmid Np$,
\item $\det(\rho_f (\Frob_\ell)) = \varepsilon(\ell)\ell^{k-1}$ for all $\ell \nmid Np$.
\end{enumerate}
We often refer to the representation $\rho_f$ as arising from, or being attached to, $f$.
\end{thm}
The construction of these representations in this generality is due to Deligne \cite{Deligne}, building on work of Shimura and others.
There is also a similar statement for weight 1 due to both Deligne and Serre \cite{DeligneSerre}.
However, as we will discuss Serre's original statement of his conjecture here, we will ignore weight 1 forms.
(There is a translation of Deligne's paper available from the IAS \cite{DeligneEng}, it has nicer typesetting too.)
In fact the representations obtained in these constructions are $p$-adic Galois representations $\rho_f\colon \GQ \to \GL_2(\Qb_p)$, as they were in \cref{ex:delt}.
The representations of the theorem are then obtained from the $p$-adic ones via a process of mod $p$ reduction and semisimplification.
The mod $p$ representations are the ones that we will be most interested in from here on though.
%TOD do we want to make some remark about forms giving same rep?
\begin{rmk}\label{rmk:detrho}
Let $\rho_f$ be a Galois representation attached to a normalised eigenform $f\in S_k(N,\,\varepsilon;\,\Fb_p)$ as in the theorem.
As $\rho_f$ is semisimple and
\[
\det(\rho_f(\Frob_\ell)) = \chi_p^{k-1}(\Frob_\ell)\varepsilon(\Frob_\ell)
\]
for all $\ell \nmid Np$ (here viewing $\varepsilon$ as character of $\GQ$ now) we may apply \cref{thm:cheb} to see that condition (iv) of the theorem is equivalent to the statement that
\[
\det\rho_f = \varepsilon\chi_p^{k-1}.
\]
By looking at the action of $\langle-1\rangle$ on $f$ we find $\varepsilon(-1)f = \langle -1 \rangle f = (-1)^k f$, and so
\[
\varepsilon(c)\chi_p^{k-1}(c) = (-1)^k(-1)^{k-1} = -1,
\]
hence $\det\rho_f$ must be odd (i.e $\rho_f$ is odd).
\end{rmk}
We will look at some more properties of this construction in \cref{sec:galassoc}, but first we move on to the conjecture itself.
% ************************************************************************
% Serre's Conjecture
% ************************************************************************
\section{Serre's conjecture}
\subsection{The qualitative form}
Given the above result one might wonder about a converse statement, given a mod $p$ Galois representation satisfying some necessary conditions, does it arise from an eigenform?
Serre's conjecture was that the answer to this question is yes, all Galois representations that could possibly arise from an eigenform, based on \cref{thm:assoc} and the remark following it, do.
The conjecture naturally comes into two parts, a weaker existence statement, and another refined form that makes exact predictions about an extremal eigenform from which the Galois representation arises.
We look at the existence statement, or \emph{qualitative form} first.
\begin{conjecture}[Serre's conjecture, qualitative form]\label{conj:qual}
Let
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
be a continuous, odd, irreducible Galois representation.
Then there exists a normalised cuspidal mod $p$ eigenform $f$, such that $\rho$ is isomorphic to $\rho_f$, the Galois representation associated to $f$.
\end{conjecture}
This is already a very useful thing to know, any statement one could prove about Galois representations attached to mod $p$ modular forms, by using the theory of these forms for example, would hold for all odd 2-dimensional mod $p$ Galois representations.
One interesting consequence of this type stems from the fact that Deligne's construction of Galois representations from modular forms is actually of $p$-adic representations.
If we were to assume \cref{conj:qual} and then apply this construction, we would be able to lift all irreducible odd 2-dimensional mod $p$ Galois representations to $p$-adic Galois representations.
A similar statement to the one above also holds for reducible representations, which correspond to Eisenstein series instead.
We will not consider this more general setup here as it is not what the refined form of Serre's conjecture deals with.
This conjecture (at least for Galois representations unramified outside $p$) appeared much earlier than the Duke paper and is mentioned by Serre in a 1975 paper \cite[sec. 3]{Serre75}.
It was computations performed by J.-F. Mestre that convinced Serre that strengthening this conjecture was plausible, and this led to the form we are about to see.
\subsection{The refined form}
After seeing the qualitative form of Serre's conjecture it is natural to also ask about the properties of the form $f$ whose existence is claimed.
Can anything be said about the weight and level of $f$, based only on the properties of $\rho$?
Serre also conjectured that the answer to this question is yes.
He defined a weight, level and character for each $\rho$, such that there should be a form $f$ of that weight, level and character that $\rho$ is attached to.
In a slightly backwards manner we will first state this refined form of the conjecture more precisely, before moving on to motivate and define the integers $N(\rho)$, $k(\rho)$ and character
\[
\varepsilon(\rho)\colon (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*
\]
used in the statement.
\begin{conjecture}[Serre's conjecture, refined form]\label{conj:ref}
Let
\[
\rho\colon \GQ \to \GL_2(\Fb_p)
\]
be a continuous, odd, irreducible Galois representation.
Then there exists a normalised eigenform
\[
f\in S_{k(\rho)}(N(\rho),\,\varepsilon(\rho);\,\Fb_p)
\]
whose associated Galois representation $\rho_f$ is isomorphic to $\rho$.
Moreover $N(\rho)$ and $k(\rho)$ are the minimal weight and level for which there exists such a form $f$.
\end{conjecture}
From now on we refer to a Galois representation $\rho$ satisfying the hypotheses of this conjecture as being of \emph{Serre-type}.
This conjecture is very bold, even given the existence statement of \cref{conj:qual} it is not clear that there should be a simultaneously minimal weight and level, let alone that they should be given by the relatively straightforward (though intricate) description that we will soon see.
If correct however, the refined form is eminently more useful than the qualitative form for specific applications.
The precise definition of the level, weight and character often allow one to actually find an associated eigenform and work explicitly with it to study a particular Galois representation.
Doing this would be far more difficult in general if no knowledge of the modular forms side could be obtained from the representation.
We will soon give Serre's explicit recipe for the weight, level and character.
First however we will look at some results that might lead one to make these definitions in the first place.
\subsection{Results on Galois representations associated to modular forms}\label{sec:galassoc}
In order to try and understand which eigenforms can give rise to a particular Galois representation, it is useful to take an arbitrary eigenform and study the properties of the representation attached to it, in an attempt to see what information about the eigenform may be recovered.
Several people have obtained interesting results of this type which will be helpful for our definition of the weight and level.
We fix a prime $p$ and a normalised eigenform $f \in S_k(N,\,\varepsilon;\,\Fb_p)$ with $q$-expansion
\[
f = \sum_{n\ge 1} a_nq^n.
\]
Let $\rho_f$ be the mod $p$ Galois representation attached to $f$ by \cref{thm:assoc}.
Concerning the conductor of $\rho_f$ there is the following result due to Carayol and Livn\'e \cite{Carayol, Livne}.
\begin{thm}\label{thm:level}
Let $N(\rho_f)$ be the level associated to $\rho_f$ (which we will define explicitly in \cref{subsec:level}), then
\[
N(\rho_f)|N.
\]
\end{thm}
Given this it is natural to hope that any Galois representation $\rho$ of Serre-type arises from a form of level exactly $N(\rho)$ as this is the minimal possible, of course we still have yet to define this quantity!
\\[12pt] \noindent
We can also make useful observations concerning the restriction of $\rho_f$ to $\Gp$, and its subgroups, these have implications for our definition of the weight.
There are two main cases here depending on whether $a_p \ne 0$ (the \emph{ordinary} case) or otherwise (the \emph{supersingular} case).
We start with the supersingular case.
\begin{thm}[Fontaine]\label{thm:super}
Suppose that $2\le k \le p +1$ and that $a_p = 0$ then $\rho_{f}|_{\Gp}$ is irreducible, moreover, letting $\psi_1$ and $\psi_2$ be the two fundamental characters of level 2, we have
\[
\rho_{f}|_{I_p} \sim \begin{pmatrix} \psi_1^{k-1} & 0 \\ 0 & \psi_2^{k-1}\end{pmatrix}.
\]
\end{thm}
This was originally proved by Fontaine in letters to Serre in 1979.
There is a published proof in \cite[sec. 6]{EdixhovenWeight}.
So in this case we can certainly recover some information about the weight by looking at the restriction to inertia.
In the ordinary case we have slightly different behaviour.
\begin{thm}[Deligne]\label{thm:ordinary}
Suppose $2\le k\le p+1$ and $a_p \ne 0$ then $\rho_{f}|_{\Gp}$ is reducible.
Moreover, letting $\lambda(a)\colon \Gp \to \Fb_p^*$ be the unramified character of $\Gp$ that takes each $\Frob_p \in \Gp /I_p$ to some $a\in\Fb_p^*$, we have
\[
\rho_{f}|_{\Gp} \sim \begin{pmatrix} \chi_p^{k-1}\lambda(\varepsilon(p)/a_p) & * \\ 0 & \lambda(a_p)\end{pmatrix}.
\]
In particular when we look at the restriction to inertia we get
\[
\rho_{f}|_{I_{p}} \sim \begin{pmatrix} \chi_p^{k-1} & * \\ 0 & 1\end{pmatrix}.
\]
\end{thm}
A proof of this result when $k \le p$ is given in \cite{Gross} and the general case was originally proved in an unpublished letter from Deligne to Serre.
Once again we can recover some information about the weight by looking at the restriction to $I_p$.
As $\chi_p^{2} = \chi_p^{p+1}$ eigenforms of weight $2$ and $p+1$ give us similar looking representations when restricted to inertia.
This makes it more difficult to tell exactly what $k$ was just by looking at restriction to inertia in this case.
This will cause us some issues later, fortunately the following result will allow us to distinguish these two cases.
\begin{thm}[Mazur]\label{thm:mazur}
Let $f$ have \emph{filtration} $p+1$ and be such that $\rho_{f}$ is irreducible, then $\rho_{f}|_{\Gp}$ tr\`es ramif\'e.
\end{thm}
For $p> 2$ and trivial character this is due to Mazur \cite[sec. 6]{Ribet90}.
In \cite[sec. 2]{EdixhovenWeight} Edixhoven gives a modification to the general case.
\\[12pt] \noindent
In all the theorems we have just introduced the weight $k$ is at most $p+1$, of course we can also make Galois representation from higher weight eigenforms.
So we need some way of finding out about the representations attached to higher weight forms, using what we know about low weight ones.
To do this we use the $\Theta$ operator.
Which preserves the set of mod $p$ normalised cuspidal eigenforms of a particular level, but in most cases increases the filtration of the eigenform.
We then need to know how the action of $\Theta$ affects the associated Galois representation, it turns out that $\Theta$ changes these representations in a very simple way.
\begin{prop}\label{prop:theta}
Let
\[
\Theta\colon S_k(N,\,\epsilon;\,\Fb_p)\to S_{k+p+1}(N,\,\epsilon;\,\Fb_p)
\]
be the operator defined in \cref{def:theta}.
Then if $f \in S_k(N,\,\epsilon;\,\Fb_p)$ is a normalised eigenform the Galois representation associated to $\Theta(f)$ is
\[
\rho_{\Theta(f)} \cong \chi_p\otimes\rho_{f}.
\]
\end{prop}
\begin{proof}
In \cref{prop:thetaeigen} we saw that $\Theta$ took eigenforms to eigenforms, but with the eigenvalue for each $T_{\ell}$ being $\ell$ times the original.
So
\[
\tr(\rho_{\Theta(f)}(\Frob_\ell)) = \ell a_{\ell} = \tr((\chi_p \otimes \rho_{f})(\Frob_{\ell}))
\]
and
\begin{align*}
\det(\rho_{\Theta(f)}(\Frob_\ell)) &= \ell^{k+p+1}\varepsilon(\ell)\\
&= \ell^{k+1}\varepsilon(\ell)\\
&= \ell^2\ell^{k-1}\varepsilon(\ell)\\
&= \det((\chi_p \otimes \rho_{f})(\Frob_{\ell})).
\end{align*}
By \cref{thm:cheb} the representations $\rho_{\Theta(f)}$ and $\chi_p \otimes \rho_{f}$ are isomorphic.
\end{proof}
So applying $\Theta$ twists the associated representation by $\chi_p$ and if $p\nmid w(f)$ then $\Theta$ increases the filtration by $p+1$.
Hence we can use the $\Theta$ operator to transfer our understanding of Galois representations attached to modular forms of weight at most $p+1$ to modular forms of higher weight.
It is worth noting that the proofs of some of the above theorems actually came after Serre's paper.
However it seems likely that observations of these results in specific examples informed the recipe below.
\subsection{The level}\label{subsec:level}
Assume that we have a Galois representation $\rho\colon \GQ \to \GL_2(\Fb_p)$ of Serre-type.
We now define the integer $N(\rho) \ge 1$ which plays the role of the level in the refined conjecture.
We can view our representation $\rho$ as a homomorphism
\[
\GQ \to \Aut(V),
\]
where $V$ is a two-dimensional $\Fb_p$ vector space.
Let $G\cong \im \rho$ be the finite Galois group obtained by quotienting out by the kernel of $\rho$.
Then for each prime $\ell$ we let $G_{\ell,i}\subset G$ be the $i$th ramification group at $\ell$ of $G$, as defined in \cref{def:higher}.
Now, for each $\ell$ and $i$, consider the fixed subspace of $V$
\[
V^{\ell,i} = \{\mathbf{v}\in V : \rho(\sigma) \mathbf{v} = \mathbf{v}\ \forall \sigma \in G_{\ell,i}\}.
\]
For each $\ell$ we define
\[
\nu_\ell(\rho) = \sum_{i = 0}^{\infty} \frac{1}{[G_{\ell,0} : G_{\ell,i}]} \dim(V/V^{\ell,i}),
\]
this quantity is (non-trivially) an integer \cite[p. 99]{SerreLF}.
We then set the level to be
\[
N(\rho) = \prod_{\substack{\ell \ne p\\ \ell\text{ prime}}} \ell^{\nu_\ell(\rho)}.
\]
This is indeed a positive integer, and by construction it is coprime to $p$, this is necessary for it to be the level of a mod $p$ cusp form.
This number serves to quantify the higher ramification of a Galois representation at all primes other than $p$, the larger the ramification groups for a particular prime, the larger the power of that prime in $N(\rho)$.
The definition here is almost that of the \emph{Artin conductor} of a representation, but here the $p$-part is ignored.
\begin{rmk}\label{rmk:unram}
Unwinding this definition when $\rho$ is unramified at some $\ell$, we see that each $V^{\ell,i}$ is in fact the whole of $V$, as all the ramification groups involved are trivial.
Hence in this case $\nu_\ell(\rho) = 0$ and so $N(\rho)$ is only divisible by the primes $\ell \ne p$ at which $\rho$ is ramified.
\end{rmk}
\cref{thm:level} stated that when $\rho$ comes from an eigenform $f$ the integer $N(\rho)$ defined here divides the level of $f$.
With that in mind conjecturing that any Serre-type representation comes from an eigenform of level exactly $N(\rho)$ is fairly logical, though perhaps optimistic without more evidence.
\subsection{The character and the weight mod $p-1$}\label{subsec:char}
Beginning with a Galois representation of Serre-type, as before, we now define the character
\[
\varepsilon(\rho)\colon (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*.
\]
We also state the class of $k(\rho)$~mod~$p-1$, though the full definition of $k(\rho)$ will be given in the following section.
Given a Serre-type Galois representation $\rho$ we can compose with the determinant map to obtain a continuous character
\[
\det \rho\colon \GQ \to \Fb_p^*.
\]
The Artin conductor of a 1-dimensional Galois representation (i.e. the level $N(\rho)$ introduced in \cref{subsec:level}, but including the $p$-part now) is actually equal to the conductor of the associated Dirichlet character \cite[p. 228]{SerreLF}.
So we now partially compute the Artin conductor of $\det\rho$ in order to find a number field that $\det\rho$ factors through.
Let $V_1$ be the 2-dimensional vector space realising $\rho$ and $V_2$ be the 1-dimensional vector space realising $\det\rho$.
If $\rho|_{G_{\ell,i}}$ is trivial then $\det\rho|_{G_{\ell, i}}$ is also trivial.
So having $\dim(V_2/V_2^{\ell,i}) > 0$ implies that $\dim(V_1/V_1^{\ell,i}) > 0$.
As $0 \le \dim(V_2/V_2^{\ell,i}) \le 1$ we get that
\[
\dim(V_2/V_2^{\ell,i})\le \dim(V_1/V_1^{\ell,i})
\]
for all $\ell$ and $i$ and so
\[
\nu_\ell(\det\rho) \le \nu_\ell(\rho).
\]
This gives us that
\[
N(\det\rho) \mid N(\rho).
\]
As the restriction of $\det\rho$ to $I_{p,w}$ is trivial (see the proof of \cref{prop:wildtriv}, using that characters are simple) we find that $\nu_p(\det\rho)\le 1$.
So the full Artin conductor of $\det\rho$ divides $pN(\rho)$.
We can therefore identify $\det\rho$ with a homomorphism
\[
(\ZZ/pN(\rho)\ZZ)^* \to \Fb_p^*,
\]
or equivalently with a pair of homomorphisms
\begin{align*}
\phi\colon& (\ZZ/p\ZZ)^* \to \Fb_p^*,\\
\varepsilon\colon& (\ZZ/N(\rho)\ZZ)^* \to \Fb_p^*.
\end{align*}
The group $(\ZZ/p\ZZ)^*$ is cyclic of order $p-1$ and so the image of $\phi$ lies inside $\FF_p^*$.
So $\phi$ is an endomorphism of the cyclic group $(\ZZ/p\ZZ)^*$ and hence is of the form
\[
x \mapsto x^h,
\]
for some $h \in \ZZ/(p-1)\ZZ$.
We have expressed the fact that $\phi = \chi_p^h$, where $\chi_p$ is the mod $p$ cyclotomic character.
This gives us the factorisation
\[
\det\rho = \varepsilon \chi_p^h
\]
and so, comparing with \cref{rmk:detrho}, we set $\varepsilon(\rho)$ to be the $\varepsilon$ obtained here.
We also see that $h$ had better be the same as $k(\rho)-1$ modulo $p-1$.
\subsection{The weight}
We now come to the final ingredient in Serre's recipe, the weight $k(\rho)$.
The general strategy of our approach here is to express a representation of Serre-type as a twist of another representation, one that looks like it comes from a cusp form of weight at most $p+1$.
We then read off the minimal weight of a cusp form that could give this twisted representation.
Then we can apply the results above regarding the $\Theta$ operator (\cref{prop:thetafilt} and \cref{prop:theta}) to define the weight of the original representation.
Given our Galois representation
\[
\rho \colon \GQ \to \GL_2(\Fb_p)
\]
we restrict to the subgroup $\Gp$ to form
\[
\rho_p\colon \Gp \to \GL_2(\Fb_p).
\]
The definition of $k(\rho)$ will only depend on this $\rho_p$ (in fact it will only depend on $\rho_p|_{I_p}$).
We will from here on refer to $k(\rho)$ as $k(\rho_p)$ to emphasise this fact.
As such the weight will only reflect the behaviour at $p$ of the representation, whereas the level reflected the behaviour away from $p$.
\begin{prop}(Serre \cite[prop. 4]{Serre72})\label{prop:wildtriv}
The semisimplification $\rss_p$ of $\rho_p$ is trivial when restricted to $I_{p,w}$.
\end{prop}
\begin{proof}
It suffices to prove this for simple representations $\rho_p$, as a sum of trivial representations is trivial.
The wild inertia $I_{p,w}$ is a pro-$p$-group, and so the image is also a pro-$p$-group.
This group is finite, so it is simply a $p$-group, and defined over some finite field $\FF_q$.
Consider an $\FF_q$-vector space $V$ that realises $\rss_p|_{I_{p,w}}$.
Additively $V$ is a $p$-group too, of order $q^n$, where $n$ is the dimension of $V$.
Looking at the action of $\rss_p|_{I_{p,w}}$ on $V$ we see that there is a singleton orbit $\{0\}$.
As all orbits are of $p$-power order there must be an additional $p-1$ singleton orbits at least, else the orbits could not partition $V$.
Therefore the fixed subspace $W$ of $V$ is non-trivial.
However as $I_{p,w}$ is normal in $\Gp$ the subspace $W$ is stable under $\Gp$, hence $W$ must equal $V$ by simplicity.
\end{proof}
We may therefore view $\rss_p$ as a representation of $I_{p,t}$, we shall write $\rss_t$ for this new representation.
The tame inertia group is abelian, and so this representation is diagonalisable.
The representation $\rss_t$ is therefore given by a pair of characters
\[
\phi_1,\,\phi_2\colon I_{p,t} \to \Fb_p^*.
\]
\begin{prop}
Both of the characters $\phi_1$ and $\phi_2$ are of the same level, and that level is either 1 or 2.
Moreover if they are both of level 2 then they are $p$th powers of each other.
\end{prop}
\begin{proof}
Letting a Frobenius element at $p$ act by conjugation on $\sigma\in I_{p}$ we have that \cite[p. 167]{RibetStein}
\[
\Frob_p \sigma \Frob_p^{-1} \equiv \sigma^p \pmod{I_{p,w}},
\]
and so
\[
\rss_t( \Frob_p\sigma \Frob_p^{-1}) = \rss_t(\sigma^p) = \rss_t(\sigma)^p,
\]
so we have an equivalence of representations
\[
\rss_t \cong (\rss_t)^p.
\]
Hence the set $\{\phi_1,\,\phi_2\}$ must be fixed by $p$th powering.