Scripts and HMM profiles
HMM profiles for CCMs

Scripts for looking for CCMs in MMETSP data
MDHDZ91 authored May 3, 2017
1 parent 39e7cf5 commit 6927049
Showing 34 changed files with 4,708 additions and 0 deletions.
42 changes: 42 additions & 0 deletions ALAT_GGAT.txt
@@ -0,0 +1,42 @@
>XP_002289904.1|alanine_aminotransferase|Thalassiosira_pseudonana_CCMP1335
MQYAVRGEVVIRADAMAAEGRKIIYTNIGNPHAVGQKPITYYRQVLSLCDLPAECGVDNTQVAAAFPSDV
IERAIEMRDAIGPAGTGAYTNSQGIGKFRDDVAHFITARDEHVALPSNIFLSNGASAAIENVLTGLIGSN
RDAIMIPIPQYPIYSAIISRLGARQVGYFLERRTAAVERDGLDIRALTLINPGNPTGQVLGREDLEIICT
FCAKHNIVLLADEVYQRNIYDDKKEFVSAKKVAVETPGCENLQLISFHSTSKGLIGECGRRGGYMELHNI
DPYVQTQLYKLASSGLCSGVDGQMMTSLMVRPPLPGEESHELFSRQEFEIFSSLKRRAVSLVRGLNDIDG
MTCTPAEGAMYAFPRVELPPKALDAAAINDQTPDNLYALSLLEETGICVVPASGFGQKEGRIGFRTTFLP
PEDELNQAVVEFKRHHEWFCEKYA
>OEU21541.1|alanine_aminotransferase|Fragilariopsis_cylindrus_CCMP1102
MEYAVRGTVVIAADRINDELKAEQSMGAESKYKFQKIIYTNIGNPQSVGQQPLTWPRQVLALIDLPDEEG
INHPNIQNIFPSDAIARARTIKIGLGGNGSGAYSHSKGIKMFREDVCTFLQNRDGIDVPTDVENIFLSNG
ASAAIFNLLTSLIADNKCGIMIPIPQYPIYSASVEQLGGQKVGYYLDEKNKWNLSIDELERSLKEALENG
TNVVAFVLINPGNPTGAVLTKQTVQDVVKFCSKHNLVLLADEVYQENVYNEQDKFYSCKRAAYDCGLLET
NSIELASFHSTSKGVFGECGRRGGYMELTGFDENIKNQLYKLASASLCSTVNGQCMTSLMCRGPSPDDVS
YESHEKEKLDIFNSLKKRSKIVNDGLNSIDGFSCQPAQGAMYCFPSIDNMPMKAINEAAEQNITPDTLYA
LSLLERTGICVVPASGFGQRPGRYGFRTTFLPSEDDMAYSVNAMKDHHKEFCQKYA
>NP_005300.1|alanine_aminotransferase1|Homo_sapiens
MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRALELEQELRQGVKKPFTEVIRANIGD
AQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKKRAERILQACGGHSLGAYSVSSGIQLIREDVARYI
ERRDGGIPADPNNVFLSTGASDAIVTVLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDE
ERAWALDVAELHRALGQARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVY
AAGSQFHSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSVRLCP
PVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCNPVQGAMYSFPRVQL
PPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHFRMTILPPLEKLRLLLEKLSRFHAK
FTLEYS
>AAC62456.1|alanine_aminotransferase|Zea_mays
MAASVTVENLNPKVLKCEYAVRGEIVIHAQRRQQQLQTQPGSLPFDEILYCNIGNPQSLGQQPVTFFREV
LALCDHPCLLEKEETKSLFSADAISRAKQILATIPGRATGAYSHSQGIKGLRDAIAAGIMSRDGFPANAD
DIFITDGASPGVHMMMQLLIRNEKDGILCPIPQYPLYSASIALHGGTLVPYYLNEKNGWGLEISDFKTRL
EDVRSKGIDVRALVVINPGNPTGQVLAEDNQYDIVKFCKNEGLVLLADEVYQENIYVDNKKFNSFKKIVR
SMGYGEDDLPLVSLQSVSKGYYGECGKRGGYMEITGFSAPVREQIYKIASVNLCSNITGQILASLVMNPP
KAGDESYASYKAEKDGILESLARRAKALEDAFNKLEGFSCNKAEGAMYLFPQIHLPQKAIEAAKAAKKAP
DAFYALRLLESTGIVVVPGSGFGQVPGTWHIRCTILPQEDKIPAVISRFRAFHEAFLAEYRD
>XP_009315638.1|alanine_aminotransferase|Trypanosoma_grayi
MSTSRKAIHINPRVVEAQYAVRGLIPMRADEIKNALATPEGKGKYPFSSLVYCNIGNPQALEQKPLTFNR
QVMSLVDAPFLLDNAAIKAQYPADAVARAQEYLSHIGNRTGAYTDSAGYAFVREIVARHINERDHGAKPL
MDASSIMLTDGASTGVRLLLQILIGDASDGVMIPIPQYPLYTAQIALLGGTPAMYYLDENKGWALNVADL
ASAYDECVAQRKATPRVLVVINPGNPTGGVLERGVMEAVAKFCCDRGMVLMADEVYQENIYAEGKRFVSF
REVVLGLPAPYNTDTVLASLHSTSKGIIGECGRRGGYFSLTNAPAALTEQVVKMSSINLCSNVNGQLMTA
LMCAPPRAGDASYDAYWAEYNAIFGSLKKRALMLAKELNSIRGFACQPVEGAMYAFPTIQLPEKYAQHNA
ELNAREGRKLAPDARWALELLESSGIVVVPGSGFGQQPNTLHFRTTILPPEAQMERMVKALRGFQEDVWA
KYA
54 changes: 54 additions & 0 deletions Bestrophin.txt
@@ -0,0 +1,54 @@
>gi|224000585|ref|XP_002289965.1|T.pse|bestrophin1_TPS
MGPPIDPSVPVTDQVGEGSRKYRRTVYTHDDWVRHRSPDRFGNNLSTLFNSGIYKQVANEVFATTAVATF
VFLWNMIAGGYTDLAGVQHGPIIDSPLAQMVGLPMTAFTILTPSLGLLLVFRTNTSYGRWDEARKMWGLN
INHTRDLNRMATAWYGNEGNMDSVAFMGGDIPYSQPIDPVQRAYDLGQVSLFTWAFVRSMKRHLSPPEED
EEDFKAELRARLTPEQAENIINAAHRPNRALFDLSVAIENLPMHFLRKNAINTNLSIFEDTLGGCERLLS
SPVPLFYSRHTARFLSTWLLLLPFGLYEQFKDSWNHIAMIPATAFISVCLFGIEELATQLEEPFTILPMQ
GFCDKIGGWCDEIVSWAGQGQQEYTEENAMSNEQEMTYWR
>gi|223999673|ref|XP_002289509.1|T.pse|bestrophin2_TPS
MPSFTSLSTLLLLALSSPQISAFAPLSSTSTPINVAPSTTTSTTNLQMGPPKTDIVLSETYGEGSRKYRR
TVYTHNEWVKHRSSDRFAKNLFSMVNSGVYKSLAKEVFATTAVASAIVAWNGIAGGYTDFNGVEHGAIMS
FLPQLVLPLTPFTLLSPSLGLLLVFRTNSSYGRWDEARKMWGLNINHTRDLNRMATAWYGHDNQIIDPAK
RAEDLRQVSLYTWAFVRSMKRHLSPPSEDEEAFVEELYARMAPEQAEAIISAAHRPNRALYDLSVVIDKL
PMHFMRKNEINKNLSIFEDTLGGCERLLSSPVPLFYTRHTARFLSTWLLLLPLAMYQPFSGSWNHVAMIP
ATALTSVFLFGIDELSTQLEEPFTILPMQGFCDKIGGWCDEIVSWRGQGLDKEEQQYY
>gi|WP_077172616.1|bestrophin|Pseudomonas_psychrotolerans
MITRPQNPSLRELLFTVRGSIVQAIWPKLLYVVLLSLAVTLSHDVFLRFDFGLTTTPLTLWGLTLAIFLG
FRNTTAYQRFWEARGLWGELLIAGRNLARQVETLVPGLTAPERRQLLTPLLAFGYALRDHLRREAPSADL
QRVLVGEDALLAAPHRPSALIRRLGTRLVARAREEGLGDPLIANLDHQLDRLTAVLSGCERIRQTPIPYP
YILMLHRVVHVYCFLLPFCLVDSLGWFTPLAVLVLAYTFFGLDALGDQIADPFGTQPNHLPLDALSRGLE
IAVLDLLGEPTPEPIRAEAGLLR
>gi|NP_004174.1|bestrophin-1_isoform1|Homo_sapiens
MTITYTSQVANARLGSFSRLLLCWRGSIYKLLYGEFLIFLLCYYIIRFIYRLALTEEQQLMFEKLTLYCD
SYIQLIPISFVLGFYVTLVVTRWWNQYENLPWPDRLMSLVSGFVEGKDEQGRLLRRTLIRYANLGNVLIL
RSVSTAVYKRFPSAQHLVQAGFMTPAEHKQLEKLSLPHNMFWVPWVWFANLSMKAWLGGRIRDPILLQSL
LNEMNTLRTQCGHLYAYDWISIPLVYTQVVTVAVYSFFLTCLVGRQFLNPAKAYPGHELDLVVPVFTFLQ
FFFYVGWLKVAEQLINPFGEDDDDFETNWIVDRNLQVSLLAVDEMHQDLPRMEPDMYWNKPEPQPPYTAA
SAQFRRASFMGSTFNISLNKEEMEFQPNQEDEEDAHAGIIGRFLGLQSHDHHPPRANSRTKLLWPKRESL
LHEGLPKNHKAAKQNVRGQEDNKAWKLKAVDAFKSAPLYQRPGYYSAPQTPLSPTPMFFPLEPSAPSKLH
SVTGIDTKDKSLKTVSSGAKKSFELLSESDGALMEHPEVSQVRRKTVEFNLTDMPEIPENHLKEPLEQSP
TNIHTTLKDHMDPYWALENRDEAHS
>gi|AAR99655.1|bestrophin2|Homo_sapiens
MTVTYTARVANARFGGFSQLLLLWRGSIYKLLWRELLCFLGFYMALSAAYRFVLTEGQKRYFEKLVIYCD
QYASLIPVSFVLGFYVTLVVNRWWSQYLCMPLPDALMCVVAGTVHGRDDRGRLYRRTLMRYAGLSAVLIL
RSVSTAVFKRFPTIDHVVEAGFMTREERKKFENLNSSYNKYWVPCVWFSNLAAQARREGRIRDNSALKLL
LEELNVFRGKCGMLFHYDWISVPLVYTQVVTIALYSYFLACLIGRQFLDPAQGYKDHDLDLCVPIFTLLQ
FFFYAGWLKVAEQLINPFGEDDDDFETNFLIDRNFQVSMLAVDEMYDDLAVLEKDLYWDAAEARAPYTAA
TVFQLRQPSFQGSTFDITLAKEDMQFQRLDGLDGPMGGAPGDFLQRLLPAGAGMVAGGPLGRRLSFLLRK
NSCVSEASTGASCSCAVVPEGAAPECSCGDPLLDPGLPEPEAPPPAGPEPLTLIPGPVEPFSIVTMPGPR
GPAPPWLPSPIGEEEENLA
>gi|NP_988974.1|bestrophin-2|Xenopus_tropicalis
MTVTYTARVANARFGGFYKLLLLWRGSIYKLLYKEFLAFFLMYLALSIIYRFFLNEEQKLYFDKVAIYCN
NYANLIPVSFVLGFYVNLVVNRWWNQYLSLPFPDRVMCAISGTVHGSDETGRLYRRTLMRYCSLSGLLIL
RSVSTAAFKRFPTIDHVVEAGFMTRLERKKFENLQSSYNKYWVPCVWFCNLASQARSEGRIRDDHSFKML
MEELNTFRGNCGMLFHYDWISVPLVYTQVVTIAVYSFFLTCLIGRQFLDPARGYPGHELDLYVPVFTLLQ
FFFYAGWLKVAEQLINPFGEDDDDFEINFLIDRNFQVSMLAVDEMYSDVPPMEKDRYWNHSDPRPPYTAA
TLFQKHMPSFQGSTFNMAIPKEDMQFQPLSDIEEMNEDTLTHPPPLLSRFLPGVGPSPLSSSAALASHFA
APGSRLTLLRRSTSSFSSSSEFQCQEPVQDPPYSLVDSLGPGLNVQEGHTEELCNMGSQASLFLPPKTMD
GGENVQPVEEGEDAASLVAT
>gi|WP_068888990.1|bestrophin|Acinetobacter_celticus
MIVRDQPNIFKVLFSWRGTILPKILPPLGVVMLISAIIGVLSYIGYFKFPELPFVGFTVIGVVLSIFLGF
KNSACYERWWDARKLWGILIANSRHFDRDCRMLSQGRRERVIQHVIVFANVLRDRLRHQTANPTELVKTS
GMSQQALTQLYQQANAPQYTLSLIQWELMQALKDGEISDIIYTQMNDHVMDLSMVQTGCDRIATTPLPFA
YSVLLNRTVYFFCLILPFSLGSTLGIFTPLLVGVLAYTFLGLDALSSELEEPFGTQSNDLPLDSMVRTIE
IELLGTLGKPTPPPIQAQDNNLL
76 changes: 76 additions & 0 deletions CA_alpha.txt
@@ -0,0 +1,76 @@
>jgi|Emihu1|456048|estExtDG_Genemark1.C_1660056|CA2_alpha_EHUX
MSPNKHSWRYARPGPNHVADEWVQRETWGASFPTCINGIEQSPINIVTGEAIPMKSLPEISTDIDAAPHY
VSNTGSGFQLFETTPTESMIANGTFIDTIEGSSKGESWVGGQKFLFYQMHWHTPSENTIDGRSFPLEAHF
VHQLDDPMLVGTLHRLAVISLLYEPGPCNAFLDQFWEEFPMVPGFRQHFADGVNDFERLADEVINIDEGE
GYFYWHGSLTTPPCTEGVGWYMLKHRETVSDRQIDALRYALAVS
>XP_005764209.1 carbonic anhydrase [Emiliania huxleyi CCMP1516]
MGCTQSKHDASEDASNGTMLQAVLGHLGQLDGDAKLDHTTMSIVYDIFKDMDKDSDGTVDKSEFEKFLST
HPAAKTLWEGEGKASMSRSLKDAIADERLSFYELVAAFAPEAPHHAGDASGIGGLESLISDAAWGYRGYN
GPENWALLSPKNKLAATGKEQCPVDILPSTCVPCPAVDGDASLAYGVGPGTILNNGHTIQVNWKGGSMSV
GGTTFEAAQFHFHTASETTIRGMQYPLEMHVVHVTPGANERVSEPMRIAVLAVLFETRTDVEEVFLSQFF
DQLPSHVAHDQDDAETLTRPVDLSSISLDGGYYRLRGSLTTPPCTEGLEWSVLASPLPILPAQLETFRKA
LGKTVRNFRPTQPLNGRSITWVCACQA
>jgi|Thaps3|22391|estExt_fgenesh1_pg.C_chr_40655|CA1_alpha_TPS
MILLQPMTKRMSTSSHLVILVVLLRLQSSNSRSWLDCINTSKLENDGMPRVGKRHNATTSSLSSDAAITI
AQMSGNIGTSTTSEDTEIVIHGATTTLFEEVDPFRVTDSPSTVPSYSSSPPTLSPSASPTITPLPTTEKP
TRLPTLPPTFQTGKNEPLNPKPGYFNYDMNSDYGPHRWKRVDVEDDFFHTFDLKAEDTNNCGSGDHQSPI
DVCTKPRGNCKETHEMRPKSGDYKMDGELITKQILPSKLRLVMAPRTGDEPDPPQVDFSSNGRGIIDMTN
IDFKFPSEHTVCGSKFDGEMQYYMYHPGRERFVAVSFFLEASPTNPTNEHLQEVIDAFRTVFIKDKSLCA
EKQRLENYAQGFVSPANRKLHGEENKTLDSIEDDGELWNTTTIESNEDREYQRRLALKWHPFHPDIQKTI
HFWGYHGSFTEPPCTDDIVDWKIMDVPTPISTKQLAQLKQLLFNHVDKNCERTSVHNSDGSVARPTQETS
KYYKCTRDDYVSDEERGVCGDLGCINPFGEGLNPYYPPIVDVTGPPTRAPST
>jgi|Thaps3|262006|thaps1_ua_kg.chr_4000016|CA2_alpha_TPS
SEHRLCGKQYDAEMQLFHLHNEGNLEALAILIDADDGTSENPHFQKLLDFFQKKFNADKSMSRDWVWDPL
EPGYILRSIHFWAYSGSTTEPPCFEGVNWRIIDVPMKISPGQYQQLQRLMFDHSNARPVQP
>jgi|Thaps3|22257|estExt_fgenesh1_pg.C_chr_40398|CA11_alpha_TPS
MTRVSNIDSMMDGFGKLSRRAKILYLSSLAVSLAMVVFGACVLTLDYTTRTTSKVENSIGGVVNADDSDE
AKIQIETQTPTLSPSSSPIYTEKLSLVSSPAPSTSNLRATSAPTNSPVDIGTLQPVTRKPVQPKPTPRPA
SPKPSSPPSTRYPSISPSQHPTNSPSLSPVTPSPPPTLTQSILPSITNMPSLESLFQSHEVPKDPKPTYF
NYNGNSDYGPRSWENVTLLNSTENYWHEFGFNDNQCGVGAQSPIDVCTTPMRHCQEHHEFRSKLRVLMHR
REGDEPDPPHVDFAGVGAKSLDLLNIDIKIPSEHTVCGRRYDGEMQYYFYHPVKGSLIVIAWLFDAQNEF
ASNEHLQLVIDEFQALYDDTEGACLVNMTLNETGVTAPPHQRLSSRSDRELEKENHGCSGSNLNGPAPSN
AEYPIQQP
>jgi|Phatr2|35370|fgenesh1_pg.C_chr_7000291|CA1_PTRI
MRLIAISLCCLMPCTVRCRSWRNIEPLHGWNENDTSGTIWRMEFNPLFTSAPTSMPTTATPSDIPSSRPS
SFPSAPPSASPSVAPSPSPSTAPSESDPYRPNDPPKNPEQWYFNYDTSANALYGPGHAGIIQQQNNQFNV
GYKNNRWGSVGNPPNNYWTEFMDNGFGPWRGILANRNPTRNMCDRVGMQSPIDLRPSGAVCDEHHEVRSR
RGDFQIFEDEVTKEIQPNKLRLRYKRRPCRNLNELACQEPDPPNADFPNNWGGYADVTHIDFKVPGEHLI
RGEKFDGEMQIFHIHRGRRRMVVQSVTIRATSTGFNSYFQEAIDVFRAVYDINIARCSALRRKERRLVSN
AHIILGKNMTSKFHDYSSWGDFSTGLEDVELESKRSLRKSNWDPYHELLIPSIHFYRYDGSLTEPPCGEF
VSWFVSDTPMRISLSQLEEVKTILFKNVDENCQPTSVQFGHSVARPIQETAGRPVWQCTPREFGPDP*
>jgi|Phatr2|44526|estExt_fgenesh1_pg.C_chr_40337|CA2_PTRI
MVGLPSVLLCTLIAFTTAQTGRDLDRFNYRGTDGTDYGPEDWDQVSCTDTETCLGWPDGFETARGWDLGE
NHCRWCPLGTRQCGIHHQSPIDLQRNRAVPGDPEEKECIDVHWMAYYDSTCDWENLKALNAFSIERHALK
VNQPIEQLASGDYRLACRNASGRRFGRIDFSKGFSEWWLMSHMDIHVPSEHTQEGKRYDGEIHLYHFYSI
PGSQSSTNNEMASVTIFLEAYDDVPDYPMLNRLICQWRQVEDKTREECGLPSVETEYPGCFYYQRGHTID
GFNTIALTQDGTQRNLRQKSRNLRPKSMSVHDLILYNYAQSQTNSSYTPKRLLHSEEDHAEADPNFDWEK
FVTRQDGNANITQGNRQLLNYDHVGPWFNYFPMLGVRTEYYYRYSGTQTVPPCYGRFFEGNNRRQTNHWR
VLKDPIRVTQRQVDEMHRLLKERIASVDDPLASCEPDTAAKVDENDPTKISVARPVMETRSTHYKVFCEC
EDWRSKFPEDVEWCKKGLQDRLFNHPYNFETDGF*
>jgi|Phatr2|55029|estExt_Phatr1_ua_kg.C_chr_210026|CA3_PTRI
MSLSGIVCSRAKWFLLSIALPTLGLGLNKTAFSYNKKDEYSPDNWYRLDIAGNVCRGPRNSPIALESTPC
DAYEGYGLYSGTCTLNDLDFQLTELGVKIKYPKDGSCDINTLTVPGVSGNFRLLEVTIHGGSEHSIDGNF
SGAEIQLVHEKINSQEGHLAVLAILVEPEGPKDNLFFGTLLDEWRAVRADSTASCAKAGYDVPTLYWLAS
GTPVNTRHSYVRSYFTSPRFNAYSLLPTNTSFYRYYGGLTTPPCSEIVWWSVADTVMRISTGQYAELMTM
ITTGYVNVTDEAGCEPWSVASPSGSTSRPLQARNGRPVDRICPV*
>jgi|Phatr2|54251|estExt_Phatr1_ua_kg.C_chr_40037|CA7_PTRI
MRSFLLWSLVASFATAQEGSNLDRFNYRGTEGTDYGPEDWDQVSCSDTENCLGWPDAFEASRGWSLKDNF
CRWCPAGSSSCGTHHQSPIDLQRNRAVPGDPDENECIDVHWMAYYDSTCTWDTLKELNAFSVERHALKVV
QPITETTSGEWEIACRDDSGKRFGRIDFSKGFSQWWFLSHMDFHVPSEHTQEGKRYDGELHMYHFYSVTG
AEAGIDNEMASVAFFLEAYDDIPDYPMLNRLICQWREAEEKTREECGLPSILTEYPGCFFYNRGHTDSAV
TTQSISNGQRKLRTTSRNLRPKVKSVHDIILQNHEQMQSNATFKPHKLILSEDDHAEADPDFDWGAFVAE
QVAKSTSSQEHRELMNYDHVGPWFNYFPLVDVRTEYYYRYSGSQTVPPCYGRHIGGSRKQTNHWRFMKDP
LRVTQRQIDEMHRLLKERIAPLDDPLASCQPDTAAKVNEDDPTKISVARPLMETRDTHYKVFCECIDWPS
KWPEDRAWCEQGFMDRLYTHPYNFQTDGF*
>gbi|AQL05019.1|Alpha_carbonic_anhydrase_7|Zea_mays
MHALVRPWDTLPVLLLSRLCMVLLDALRAGWLGSVDEDEEDFSYRRNAGNGPARWGLIRREWATCNVGLL
QSPIGLSDTLAGLADRSGRLGRSYRPAAASLVNRGHSIMVRFNSNPGGVVIDGVAYRLRQMHWHAPSEHA
INGRRYALELQMVHQSDTNRYAVVSQLYRISRRRPDRTIHRLERYIRRIARRKNHEELIDEEVDPRRPGT
RSNRRPLQEANGRAITFYYTSPAHGRGANGD
>gi|OMO73707.1|Alpha_carbonic_anhydrase|Corchorus_olitorius
MKHQSKPIFVSAFLIIFAVLFLSHSASVSAQEVEDEREFDYLEKSGKGPKHWGDLKQEWAACKNGDLQSP
IDMSSLRVKVIKKSGEMKKRYKPCHAVVKNRGHDISLQWLDNDAGSIKINGTEYFLQQAHWHSPSEHTIN
GRRYALELHMVHQSKDPNLKNNLAVVGLLYKFGAPDSFISKLISNITSMNDHVQERYMGVIDPSAIKMGG
KKYYRYMGSLTVPPCTEGVIWTMNKKVRTVSRDQVRALRIAVHDYAEANARPVQPLNRREVELYGPNPGD
VSN

24 changes: 24 additions & 0 deletions CA_beta.txt
@@ -0,0 +1,24 @@
>jgi|Phatr2|45443|estExt_fgenesh1_pg.C_chr_70069|CA5_beta
MKFATAATVTLLALSTVDALNVKKLFRFGKTSLPKDSSPKPAAKGGYDLDVSELFDGNNKFIADKLAGDP
AYFDTLGTVHSPKYLYIGCVDARAPPNMIMGTEAGTMLTVRNIANMVVNNDLAVMSAIQFGINVLKIPNV
ILCGHYECGGVRASVANVDHAPPLSIWLRNIRDVYRLHAKELDAIKDPEERHRRLVDLNVIEQCVNLFKT
GVIQAKRIESYKDGGVAIPQVHPVVFDPKTGEVKKLKVDFDKYMAEINGIYDLYDLENAKVPM
>jgi|Phatr2|51305|estExt_fgenesh1_kg.C_chr_10001|CA4_beta
MKFLSASIALLACATSVEAFNANKAFRFGAKAMPEVSSESATSALSAGGAEKKSYDLDITEIFDGNKKFI
ETKKAQDAAYFDTLGTVHSPKYLYIGCVDARAPPNMIMGTEAGTMLTVRNIANMVVNNDLAVMSAIQFGI
NVLKIPHVIVCGHYECGGVRASVANVDHAPPLSIWLRNIRDVYRLHARELDAIKDPEDRHRRLVDLNVIE
QCVNLYKTGVIQAKRIESYQEGAPAAIPRVHPIVFDPKTGAIRKLQVDFDKYMSELDAIYDLYELENAKI
PA*
>gi|ONM39907.1|BetaCA_4|Zea_mays
MAVERLKTGFEQFKADVYDKKPELFEPLKAHQSPKYMVFACSDSRVCPSVTLGLHPGEAFAVRNIASMVP
PYDKTKYAGVGSAIEYAVCALKVEVIVVIGHSRCGGIKALLSLEDGAPDKFHFVEEWVRVGAPAKSKVLA
DHASAPFEDQCSILEKEAVNVSLENLKSYPFVKEGLEKGTLKLVGGHYDFVNGKFETWEP
>gi|SIT99918.1|beta-carbonic_anhydrase|Mycobacterium_bovis_AF2122/97
MTVTDDYLANNVDYASGFKGPLPMPPSKHIAIVACMDARLDVYRMLGIKEGEAHVIRNAGCVVTDDVIRS
LAISQRLLGTREIILLHHTDCGMLTFTDDDFKRAIQDETGIRPTWSPESYPDAVEDVRQSLRRIEVNPFV
TKHTSLRGFVFDVATGKLNEVTP
>gi|XP_014177286.1|betaCA|Trichosporon_asahii_var.asahii_CBS_2479
MSNYLQETHDRVFAQNKEWAAKQRAKDPEFFTRLAAGQSPEYLWIGCSDSRMPAEMITGLEPGEAFIHRN
IANMVNNLDLSAMAVINYAVRHLKVKHIIVCGHYGCGGVQAAMTPKDLGILNPWLRNIRDVYRLHEKELD
AIADDEKRYERLVELNVVEQCRNVIKTAAVQQSYAENEYPIVHGWVFDFRTGLLKDLEIDYAKVLKDIQK
IYNLTE
61 changes: 61 additions & 0 deletions CA_delta.txt
@@ -0,0 +1,61 @@
>jgi|Thaps3|262009|thaps1_ua_kg.chr_4000019|CA7_delta_TPS
LTKKTARDWVWDPLEPGYILHFWAYSGSTTEPPCFEGVNWRIFDVPMKISPGQYQQLQRLMFDHVDPDTC
KLTSTHYNESNARPVQPYRGGANYRCRRSGYVSDKERKASGLRRGFKDPADWRGVDLLPWIEGEFPNV*
>jgi|Thaps3|233|fgenesh1_pm.C_chr_2000003|CA4_delta_TPS
MGDITPNTKPYFQSSMCPVNVHWHLGSEHYSYGEFDENGNGPHGNVARPSWANRDLATDGAAVADGFRCH
HYDENDPKFTTKYDWKHCHGMEVGETYEVHWPHSAAGACGTVNQYQTPFYDGVFCNLPMESFTTLGGQDI
ANAVGVHGQVFTIVNDESYFYPDMIRGMIVEPEMNMGQDIAMYTGSTTGDSRSNEMCSQYAPITWQVDRK
CHMISASSFDKLCYDMKMQRDDMSDDLHAHGSRELVKDEYVANNQANRNLRA
>jgi|Thaps3|814|fgenesh1_pm.C_chr_19a_19000002|CA5_delta_TPS
MVNNVDCVHTPGPQAGANVTKGYKGGMEVDYVPNTKPYFQSSMCPVNVHWHLGTEHYSAGEYDEFGTGPN
SVNNNLPQNQQVRPGYRCRHFDKSQPMFTNEYRWEFCVGMQVGETYEVHWPHSAAGACGTPDQYQTPFYD
GVFCNLDEEKFSTLSAQDVADAVGVQAQVFTVVNDERYFYPDLMRGFIKDGEYGKDIAMYTGSTTGTTRS
NEVCSSYAPITWQVDRKCHLISASSFDRLCETMRLQRDNMTLDMHAHGSRELVKDSLVANNQANRRLGGH
DHHHHHHGHDHADHLWADGHGHLHEEWF
>jgi|Thaps3|34125|e_gw1.5.359.1|CA6_delta_TPS
VPGPQAGGNVTKGYVGELDVGDLTPNTKQYFQSSMCPVNVHWHLGSEHYSYGEFDENGDGPHGNIPRPDW
ANRDLAGAGESVPDGFRCHHFDETDAKFTTKYEWKHCEGMEVGETYEVHWPHSAAGACGTVNQYQTPFYD
GVFCNLPMETFVTLGAQDIASAVGVHGQVFTVVNDESYFYPDMIRGMIVDPDMNMGQDVAMYTGSTTGDS
RSNEMCSQYAPITWQVDRKCHMISASSFDKLCYDMKMQRDDMSDDLHAHGSRELVMDSLVANNQAN*
>gi|OEU09193.1|delta_carbonic_anhydrase|Fragilariopsis_cylindrus_CCMP1102
MTFYQAAVVALLASTVNNAVNAEEDCTSIVDLACGTEGFSTLCSVLTDVAPALDPDVVSSLKTVFAPTDD
AFAAVKFDLVTEEALLDILGYHLSTFELTGECGSLIEMADGKDTRTLCNKDKEPVFQKGWANSRAVMPQF
DPTAGIAVCGDATVYVIDSVLIPKDYFVDEEGEVVEDNVQEVIDAPDPNDGKDYFKELLIAKGTVTEGSN
TCANTNPQFPNINCLGEDGTVDVGPQAAANVTKGYVGGMEVDIVPITKSYYQAGLCPVNVHWHLGSEHFS
AGEFDCEDPKKCGPYHAADDAAHDDDGHTDDAGEGDSRRQLAGDARKGYQCNYYDEDDSKFTAPYDWQFC
DKTMEVGQTYEIHWPHSSAGACGTPNQYQTPFYDGVFCNLPLDVFQTLSAQDIASNVGVQAQVFTIVNDE
AYYYPNLFGGMIVDGDFGADMAIYTGSTTGTSRDNEVCSQYAPITWQVDRKCHMISASSFDKMCADMMAQ
RDDMTDDLYAHGAREVTADIITADNQQTRGRGLRLRKNNKN
>gi|XP_005772538.1|delta_CA|Emiliania_huxleyi_CCMP1516
MSQADWLEQNVERISKDDLTETPTTAEALEGQPNEKAVIIGAAKATGSDIAYKLSHLLALVVAGVIALLA
SAALADGRSVIKLKDTSNLPRLTALTATLDGETINLKDHGLDYRADELLGPQYGVGLHHDSSGYGWGKAG
ARETLQEYIDELGLLQVIAAVPSVIATDGLKHPAHFLECTELKKAGLSAMSLAIIAEVASAVMIIFHGLA
LVGLLPLSAKLAKGFAGLVWFTLTAGFLIVVCLPIGVYETEWTCNKDFVPAIRLWDHFVYNWAFPVGYLG
YACSLLVFSVVLCFPSLEEGAQEFDKKKTKLGLVKVVAGLFVGLVIAASVSVGIAASQDAFKDPEVDPSV
NPCKAQKPYHAAPGDNYFRNIECMKDNLVQHLSEGQYDYHGTGPAYNSTNSTKDLYANHVHGYHDRDAED
YVSKEEYYESKKNKGDPYADDGKKKKEKKEWTERLGLRCHHYDDEHEMFKTVATGAKKPYEWKHCVEMMV
GETYEVPWPHSAAGACGTEWQYPDALLRRRLLQEGVVNILTPLNTYEKIGVQGQVFTIVNSDEEQYQYEN
LIDGAWMDGKDKWVDVAKYTGSTTGTTRNNEMCSRYAPITWQVDRTCHMISAKSFDKLCYDMKQKKDDMG
GDLYPHGAREIVADYLVANNQQSRK
>gb|ABS87870.1|delta_carbonic_anhydrase2|Lingulodinium_polyedrum
MVARLMLAASVLLVRAWGTGCPDDPEVDLCSETTTDESGTGTGTEEVNVNGAMRTRTSLMPMLXLAGVFR
SKNALFALPLLGXPLAAEAAAAAGTSGPSTCGAVKDMYKEQGCCGRPDKELDVVIVPKPTKRLFGANICE
GKQPVHATPGDNYFKNVDCLNGTTLQVLEQAGANVTLGYRGRLDASSRTPILTPYWQNGLCPVNVHWHLG
TEHYSKGQFDEHGTGPDIAAEEDAEGEADSRRLAVARRGYRCSKYDAKDAKFTTEYNWQHCEGMHVGETY
EVHWPHSAAGACGTPYQYQTPFYDGVFCVDGIVSLSPLNTYMKIGVQSQVYTIVNDETYYYPEMIKGMIV
DGHYGQDIAKYTGSTTGTSRDNEVCSRYTPITWQVDRKCHLISASSFDKMCADMKNQHDDMSSDLHAHGS
RVLVDRNFTGNNFHRRM
>ABG37687.1 delta-carbonic anhydrase [Emiliania huxleyi]
MSQADWLEHNVERISKDDLTETPTTAEALEGQPNAKAVTIGAAKATGSDIAYKLSHLLALVVAGVIALLA
SAALADGRSVIKLKDTSNLPRLTALTATLDGKTINLKDHGLDYRADELLGPQYGVGLHHDSSGYGWGKAG
ARETLQEYIDELGLLQVIAAVPSVIATDGLKHPAHFLECTELKKAGLSAMSLAIIAEVASAVMIIFHGLA
LVGLLPLSAKLAKGFAGLVWFTLTAGFLIVVCLAIGVYETEWTCNNDFVPAIRLSDHFVYNWAFPVGYLG
YACSLLVFSVVLCFTSLEEGAQEFDKKKTKLGLVTVVAGLFVGLVIAASVSVGIAASQDAFKEVEVDPSV
NPCKAQKPYHAAPGDNYFRNIECMKDNLVQVLEQAGANVTRGYVGGLDAGNWRTPILDHYDDTDLCTVNV
HWHLGAEHLSEGQYDYHGTGPAYNSTNSTKDLYANHVHGYHDRDAEDYVSKEEYYESKKNKGDPYADDGK
KKKEKKEWTERLGLRCHHYDDEHEMFKTAATGAKKPYEWKHCVEMMVGETYEVHWPHSAAGACGTEWQYQ
TPFYDGVFCKEGVVNILTPLNTYEKIGVQGQVFTIVNSDEEQYQYENLIDGAWMDGKDKWVDVAKYTGST
TGTTRNNEMCSRYAPITWQVDRTCHMISAKSFDKLCYDMKQKKDDMGGDLYPHGAREIVADYLVANNQQS
RK


10 changes: 10 additions & 0 deletions CA_zeta.txt
@@ -0,0 +1,10 @@
>gi|XP_002295227.1|TPSE|CA3_zeta
MCMHVDLQVAMSSILSKLTGKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVD
GRGSDNTRMAGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGE
FDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFNL
DVVKFLVAAAATVEMLGGPRIAKIVVA
>pdb|3BOH|Tweisflo|CA_zeta
SHMSLTPDQIVAALQERGWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGGIYAIAHN
RGVTTLEGLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYPRPQFDADQGAKAVENAG
GVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGKFGLDVPKFLIAAAATVEMLGGPKKAKI
VIP