O-GlcNAc site prediction with Deep Learning
>sp|P37198|NUP62_HUMAN Nuclear pore glycoprotein p62 OS=Homo sapiens OX=9606 GN=NUP62 PE=1 SV=3
MSGFNFGGTGAPTGGFTFGTAKTATTTPATGFSFSTSGTGGFNFGAPFQPATSTPSTGLF
SLATQTPATQTTGFTFGTATLASGGTGFSLGIGASKLNLSNTAATPAMANPSGFGLGSSN
LTNAISSTVTSSQGTAPTGFVFGPSTTSVAPATTSGGFSFTGGSTAQPSGFNIGSAGNSA
QPTAPATLPFTPATPAATTAGATQPAAPTPTATITSTGPSLFASIATAPTSSATTGLSLC
TPVTTAGAPTAGTQGFSLKAPGAASGTSTTTSTAATATATTTSSSSTTGFALNLKPLAPA
GIPSNTAAAVTAPPGPGAAAGAAASSAMTYAQLESLINKWSLELEDQERHFLQQATQVNA
WDRTLIENGEKITSLHREVEKVKLDQKRLDQELDFILSQQKELEDLLSPLEELVKEQSGT
IYLQHADEEREKTYKLAENIDAQLKRMAQDLKDIIEHLNTSGAPADTSDPLQQICKILNA
HMDSLQWIDQNSALLQRKVEEVTKVCEGRRKEQERSFRITFD
>sp|Q63850|NUP62_MOUSE Nuclear pore glycoprotein p62 OS=Mus musculus OX=10090 GN=Nup62 PE=1 SV=2
MSGFNFGGTGAPAGGFTFGTAKTATTTPATGFSFSASGTGTGGFNFGTPSQPAATTPSTS
LFSLTTQTPTTQTPGFNFGTTPASGGTGFSLGISTPKLSLSNAAATPATANTGSFGLGSS
TLTNAISSGSTSNQGTAPTGFVFGSSTTSAPSTGSTGFSFTSGSASQPGASGFSLGSVGS
SAQPTALSGSPFTPATLVTTTAGATQPAAAAPTAATTSAGSTLFASIAAAPASSSATGLS
LPAPVTTAATPSAGTLGFSLKAPGAAPGASTTSTTTTTTTTTTTAAAAAASTTTTGFALS
LKPLVSAGPSSVAATALPASSTAAGTATGPAMTYAQLESLINKWSLELEDQERHFLQQAT
QVNAWDRTLIENGEKITSLHREVEKVKLDQKRLDQELDFILSQQKELEDLLSPLEESVKE
QSGTIYLQHADEEREKTYKLAENIDAQLKRMAQDLKDIIEHLNMAGGPADTSDPLQQICK
ILNAHMDSLQWVDQSSALLQRRVEEASRVCEGRRKEQERSLRIAFD
Upload a file with multiple protein sequences in fasta format ( example for human proteins / example for mouse proteins; see Tutorial if help is needed)
>sp|P37198|NUP62_HUMAN Nuclear pore glycoprotein p62 OS=Homo sapiens OX=9606 GN=NUP62 PE=1 SV=3
MSGFNFGGTGAPTGGFTFGTAKTATTTPATGFSFSTSGTGGFNFGAPFQPATSTPSTGLF
SLATQTPATQTTGFTFGTATLASGGTGFSLGIGASKLNLSNTAATPAMANPSGFGLGSSN
LTNAISSTVTSSQGTAPTGFVFGPSTTSVAPATTSGGFSFTGGSTAQPSGFNIGSAGNSA
QPTAPATLPFTPATPAATTAGATQPAAPTPTATITSTGPSLFASIATAPTSSATTGLSLC
TPVTTAGAPTAGTQGFSLKAPGAASGTSTTTSTAATATATTTSSSSTTGFALNLKPLAPA
GIPSNTAAAVTAPPGPGAAAGAAASSAMTYAQLESLINKWSLELEDQERHFLQQATQVNA
WDRTLIENGEKITSLHREVEKVKLDQKRLDQELDFILSQQKELEDLLSPLEELVKEQSGT
IYLQHADEEREKTYKLAENIDAQLKRMAQDLKDIIEHLNTSGAPADTSDPLQQICKILNA
HMDSLQWIDQNSALLQRKVEEVTKVCEGRRKEQERSFRITFD
>sp|P51610|HCFC1_HUMAN Host cell factor 1 OS=Homo sapiens OX=9606 GN=HCFC1 PE=1 SV=2
MASAVSPANLPAVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGGNEGIVDELH
VYNTATNQWFIPAVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKYSNDLYELQASRWEW
KRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRP
GSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTL
TWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLN
LDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLE
TEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSV
PANPPKSPAPAAAAPAVQPLTQVGITLLPQAAPAPPTTTTIQVLPTVPGSSISVPTAART
QGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSS
PQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTMAVTPGTTTLPATVKVASSPV
MVSNPATRMLKTAAAQVGTSVSSATNTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKT
ITLVKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGT
ILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIITQ
AGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPG
QPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGH
STSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQPVSQPTQV
TLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTLVCSNPPCET
HETGTTNTATTTVVANLGGHPQPTQVQFVCDRQEAAASLVTSTVGQQNGSVVRVCSNPPC
ETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTNTATTAMSSVGANHQRDARRACA
AGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLGPSMARE
PGGRSPAFVQLAPLSSKVRLSSPSIKDLPAGRHSHAVSTAAMTRSSVGAGEPRMAPVCES
LQGGSPSTTVTVTALEALLCPSATVTQVCSNPPCETHETGTTNTATTSNAGSAQRVCSNP
PCETHETGTTHTATTATSNGGTGQPEGGQQPPAGRPCETHQTTSTGTTMSVSVGALLPDA
TSSHRTVESGLEVAAAPSVTPQAGTALLAPFPTQRVCSNPPCETHETGTTHTATTVTSNM
SSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPP
PEELQVSPGPRQQLPPRQLLQSASTALMGESAEVLSASQTPELPAAVDLSSTGEPSSGQE
SAGSAVVATVVVQPPPPTQSEVDQLSLPQELMAEAQAGTTTLMVTGLTPEELAVTAAAEA
AAQAAATEEAQALAIQAVLQAAQQAVMGTGEPMDTSEAAATVTQAELGHLSAEGQEGQAT
TIPIVLTQQELAALVQQQQLQEAQAQQQHHHLPTEALAPADSLNDPAIESNCLNELAGTV
PSTVALLPSTATESLAPSNTFVAPQPVVVASPAKLQAAATLTEVANGIESLGVKPDLPPP
PSKAPMKKENQWFDVGVIKGTNVMVTHYFLPPDDAVPSDDDLGTVPDYNQLKKQELQPGT
AYKFRVAGINACGRGPFSEISAFKTCLPGFPGAPCAIKISKSPDGAHLTWEPPSVTSGKI
IEYSVYLAIQSSQAGGELKSSTPAQLAFMRVYCGPSPSCLVQSSSLSNAHIDYTTKPAII
FRIAARNEKGYGPATQVRWLQETSKDSSGTKPANKRPMSSPEMKSAPKKSKADGQ
>sp|Q9UPA5|BSN_HUMAN Protein bassoon OS=Homo sapiens OX=9606 GN=BSN PE=1 SV=4
MGNEVSLEGGAGDGPLPPGGAGPGPGPGPGPGAGKPPSAPAGGGQLPAAGAARSTAVPPV
PGPGPGPGPGPGPGSTSRRLDPKEPLGNQRAASPTPKQASATTPGHESPRETRAQGPAGQ
EADGPRRTLQVDSRTQRSGRSPSVSPDRGSTPTSPYSVPQIAPLPSSTLCPICKTSDLTS
TPSQPNFNTCTQCHNKVCNQCGFNPNPHLTQVKEWLCLNCQMQRALGMDMTTAPRSKSQQ
QLHSPALSPAHSPAKQPLGKPDQERSRGPGGPQPGSRQAETARATSVPGPAQAAAPPEVG
RVSPQPPQPTKPSTAEPRPPAGEAPAKSATAVPAGLGATEQTQEGLTGKLFGLGASLLTQ
ASTLMSVQPEADTQGQPAPSKGTPKIVFNDASKEAGPKPLGSGPGPGPAPGAKTEPGARM
GPGSGPGALPKTGGTTSPKHGRAEHQAASKAAAKPKTMPKERAICPLCQAELNVGSKSPA
NYNTCTTCRLQVCNLCGFNPTPHLVEKTEWLCLNCQTKRLLEGSLGEPTPLPPPTSQQPP
VGAPHRASGTSPLKQKGPQGLGQPSGPLPAKASPLSTKASPLPSKASPQAKPLRASEPSK
TPSSVQEKKTRVPTKAEPMPKPPPETTPTPATPKVKSGVRRAEPATPVVKAVPEAPKGGE
AEDLVGKPYSQDASRSPQSLSDTGYSSDGISSSQSEITGVVQQEVEQLDSAGVTGPHPPS
PSEIHKVGSSMRPLLQAQGLAPSERSKPLSSGTGEEQKQRPHSLSITPEAFDSDEELEDI
LEEDEDSAEWRRRREQQDTAESSDDFGSQLRHDYVEDSSEGGLSPLPPQPPARAAELTDE
DFMRRQILEMSAEEDNLEEDDTATSGRGLAKHGTQKGGPRPRPEPSQEPAALPKRRLPHN
ATTGYEELLPEGGSAEATDGSGTLQGGLRRFKTIELNSTGSYGHELDLGQGPDPSLDREP
ELEMESLTGSPEDRSRGEHSSTLPASTPSYTSGTSPTSLSSLEEDSDSSPSRRQRLEEAK
QQRKARHRSHGPLLPTIEDSSEEEELREEEELLREQEKMREVEQQRIRSTARKTRRDKEE
LRAQRRRERSKTPPSNLSPIEDASPTEELRQAAEMEELHRSSCSEYSPSPSLDSEAEALD
GGPSRLYKSGSEYNLPTFMSLYSPTETPSGSSTTPSSGRPLKSAEEAYEEMMRKAELLQR
QQGQAAGARGPHGGPSQPTGPRGLGSFEYQDTTDREYGQAAQPAAEGTPASLGAAVYEEI
LQTSQSIVRMRQASSRDLAFAEDKKKEKQFLNAESAYMDPMKQNGGPLTPGTSPTQLAAP
VSFSTPTSSDSSGGRVIPDVRVTQHFAKETQDPLKLHSSPASPSSASKEIGMPFSQGPGT
PATTAVAPCPAGLPRGYMTPASPAGSERSPSPSSTAHSYGHSPTTANYGSQTEDLPQAPS
GLAAAGRAAREKPLSASDGEGGTPQPSRAYSYFASSSPPLSPSSPSESPTFSPGKMGPRA
TAEFSTQTPSPAPASDMPRSPGAPTPSPMVAQGTQTPHRPSTPRLVWQESSQEAPFMVIT
LASDASSQTRMVHASASTSPLCSPTETQPTTHGYSQTTPPSVSQLPPEPPGPPGFPRVPS
AGADGPLALYGWGALPAENISLCRISSVPGTSRVEPGPRTPGTAVVDLRTAVKPTPIILT
DQGMDLTSLAVEARKYGLALDPIPGRQSTAVQPLVINLNAQEHTFLATATTVSITMASSV
FMAQQKQPVVYGDPYQSRLDFGQGGGSPVCLAQVKQVEQAVQTAPYRSGPRGRPREAKFA
RYNLPNQVAPLARRDVLITQMGTAQSIGLKPGPVPEPGAEPHRATPAELRSHALPGARKP
HTVVVQMGEGTAGTVTTLLPEEPAGALDLTGMRPESQLACCDMVYKLPFGSSCTGTFHPA
PSVPEKSMADAAPPGQSSSPFYGPRDPEPPEPPTYRAQGVVGPGPHEEQRPYPQGLPGRL
YSSMSDTNLAEAGLNYHAQRIGQLFQGPGRDSAMDLSSLKHSYSLGFADGRYLGQGLQYG
SVTDLRHPTDLLAHPLPMRRYSSVSNIYSDHRYGPRGDAVGFQEASLAQYSATTAREISR
MCAALNSMDQYGGRHGSGGGGPDLVQYQPQHGPGLSAPQSLVPLRPGLLGNPTFPEGHPS
PGNLAQYGPAAGQGTAVRQLLPSTATVRAADGMIYSTINTPIAATLPITTQPASVLRPMV
RGGMYRPYASGGITAVPLTSLTRVPMIAPRVPLGPTGLYRYPAPSRFPIASSVPPAEGPV
YLGKPAAAKAPGAGGPSRPEMPVGAAREEPLPTTTPAAIKEAAGAPAPAPLAGQKPPADA
APGGGSGALSRPGFEKEEASQEERQRKQQEQLLQLERERVELEKLRQLRLQEELERERVE
LQRHREEEQLLVQRELQELQTIKHHVLQQQQEERQAQFALQREQLAQQRLQLEQIQQLQQ
QLQQQLEEQKQRQKAPFPAACEAPGRGPPLAAAELAQNGQYWPPLTHAAFIAMAGPEGLG
QPREPVLHRGLPSSASDMSLQTEEQWEASRSGIKKRHSMPRLRDACELESGTEPCVVRRI
ADSSVQTDDEDGESRYLLSRRRRARRSADCSVQTDDEDSAEWEQPVRRRRSRLPRHSDSG
SDSKHDATASSSSAAATVRAMSSVGIQTISDCSVQTEPDQLPRVSPAIHITAATDPKVEI
VRYISAPEKTGRGESLACQTEPDGQAQGVAGPQLVGPTAISPYLPGIQIVTPGPLGRFEK
KKPDPLEIGYQAHLPPESLSQLVSRQPPKSPQVLYSPVSPLSPHRLLDTSFASSERLNKA
HVSPQKHFTADSALRQQTLPRPMKTLQRSLSDPKPLSPTAEESAKERFSLYQHQGGLGSQ
VSALPPNSLVRKVKRTLPSPPPEEAHLPLAGQASPQLYAASLLQRGLTGPTTVPATKASL
LRELDRDLRLVEHESTKLRKKQAELDEEEKEIDAKLKYLELGITQRKESLAKDRGGRDYP
PLRGLGEHRDYLSDSELNQLRLQGCTTPAGQFVDFPATAAAPATPSGPTAFQQPRFQPPA
PQYSAGSGGPTQNGFPAHQAPTYPGPSTYPAPAFPPGASYPAEPGLPNQQAFRPTGHYAG
QTPMPTTQSTLFPVPADSRAPLQKPRQTSLADLEQKVPTNYEVIASPVVPMSSAPSETSY
SGPAVSSGYEQGKVPEVPRAGDRGSVSQSPAPTYPSDSHYTSLEQNVPRNYVMIDDISEL
TKDSTSTAPDSQRLEPLGPGSSGRPGKEPGEPGVLDGPTLPCCYARGEEESEEDSYDPRG
KGGHLRSMESNGRPASTHYYGDSDYRHGARVEKYGPGPMGPKHPSKSLAPAAISSKRSKH
RKQGMEQKISKFSPIEEAKDVESDLASYPPPAVSSSLVSRGRKFQDEITYGLKKNVYEQQ
KYYGMSSRDAVEDDRIYGGSSRSRAPSAYSGEKLSSHDFSGWGKGYEREREAVERLQKAG
PKPSSLSMAHSRVRPPMRSQASEEESPVSPLGRPRPAGGPLPPGGDTCPQFCSSHSMPDV
QEHVKDGPRAHAYKREEGYILDDSHCVVSDSEAYHLGQEETDWFDKPRDARSDRFRHHGG
HAVSSSSQKRGPARHSYHDYDEPPEEGLWPHDEGGPGRHASAKEHRHGDHGRHSGRHTGE
EPGRRAAKPHARDLGRHEARPHSQPSSAPAMPKKGQPGYPSSAEYSQPSRASSAYHHASD
SKKGSRQAHSGPAALQSKAEPQAQPQLQGRQAAPGPQQSQSPSSRQIPSGAASRQPQTQQ
QQQGLGLQPPQQALTQARLQQQSQPTTRGSAPAASQPAGKPQPGPSTATGPQPAGPPRAE
QTNGSKGTAKAPQQGRAPQAQPAPGPGPAGVKAGARPGGTPGAPAGQPGADGESVFSKIL
PGGAAEQAGKLTEAVSAFGKKFSSFW
>sp|Q63850|NUP62_MOUSE Nuclear pore glycoprotein p62 OS=Mus musculus OX=10090 GN=Nup62 PE=1 SV=2
MSGFNFGGTGAPAGGFTFGTAKTATTTPATGFSFSASGTGTGGFNFGTPSQPAATTPSTS
LFSLTTQTPTTQTPGFNFGTTPASGGTGFSLGISTPKLSLSNAAATPATANTGSFGLGSS
TLTNAISSGSTSNQGTAPTGFVFGSSTTSAPSTGSTGFSFTSGSASQPGASGFSLGSVGS
SAQPTALSGSPFTPATLVTTTAGATQPAAAAPTAATTSAGSTLFASIAAAPASSSATGLS
LPAPVTTAATPSAGTLGFSLKAPGAAPGASTTSTTTTTTTTTTTAAAAAASTTTTGFALS
LKPLVSAGPSSVAATALPASSTAAGTATGPAMTYAQLESLINKWSLELEDQERHFLQQAT
QVNAWDRTLIENGEKITSLHREVEKVKLDQKRLDQELDFILSQQKELEDLLSPLEESVKE
QSGTIYLQHADEEREKTYKLAENIDAQLKRMAQDLKDIIEHLNMAGGPADTSDPLQQICK
ILNAHMDSLQWVDQSSALLQRRVEEASRVCEGRRKEQERSLRIAFD
>sp|Q61191|HCFC1_MOUSE Host cell factor 1 OS=Mus musculus OX=10090 GN=Hcfc1 PE=1 SV=2
MASAVSPANLPAVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGGNEGIVDELH
VYNTATNQWFIPAVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKYSNDLYELQASRWEW
KRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRP
GSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETL
TWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLN
LDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLE
TEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSV
PANPPKSPAPAAAAPAVQPLTQVGITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAART
QGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPASVRMVVPTQSAQGTVIGSN
PQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPV
MVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKT
ITLVKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGT
ILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIITQ
AGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPG
QPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGAH
STSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQPVSQPTQV
TLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTLVCSNPPCET
HETGTTNTATTTVVANLGGHPQPTQVQFVCDRQETAASLVTSAVGQQNGNVVRVCSNPPC
ETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTSTATTAMSSMGTGQQRDTRRTTN
TPTVVRITVAPGALERVQGTVKPQCQTQQTNMTTTTMTVQATGAPCSAGPLLRPSVALES
GSHSPAFVQLALPSVRVGLSGPSSKDMPTGRQPETYHTYTTNTPTTTRSIMVAGELGAAR
VVPTSTYESLQASSPSSTMTMTALEALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG
SAQRVCSNPPCETHETGTTHTATTATSNGGAGQPEGGQQPASGHPCETHQTTSTGTTMSV
SVGTLIPDATSSHGTLESGLEVVAVPTVTSQAGSTLLASFPTQRVCSNPPCETHETGTTH
TATTVTSNMSSNQDPPPAASDQGEVASTQGDSTNITSASAITTSVSSTLPRAVTTVTQST
PVPGPSVPPPEELQVSPGPRQQLPPRQLLQSASTPLMGESTEVLSASQTPELQAAVDLSS
TGDPSSGQEPTTSAVVATVVVQPPPPTQSEVDQLSLPQELMAEAQAGTTTLMVTGLTPEE
LAVTAAAEAAAQAAATEEAQALAIQAVLQAAQQAVMGTGEPMDTSEAAAAVTQAELGHLS
AEGQEGQATTIPIVLTQQELAALVQQQQQLQEAQAQAQQQHHLPTEALAPADSLNDPSIE
SNCLNELASAVPSTVALLPSTATESLAPSNTFVAPQPVVASPAKMQAAATLTEVANGIES
LGVKPDLPPPPSKAPVKKENQWFDVGVIKGTSVMVTHYFLPPDDAVQSDDDSGTVPDYNQ
LKKQELQPGTAYKFRVAGINACGRGPFSEISAFKTCLPGFPGAPCAIKISKSPDGAHLTW
EPPSVTSGKIIEYSVYLAIQSSQASGEPKSSTPAQLAFMRVYCGPSPSCLVQSSSLSNAH
IDYTTKPAIIFRIAARNEKGYGPATQVRWLQETSKDSSGTKPASKRPMSSPEMKSAPKKS
KADGQ
>sp|O88737|BSN_MOUSE Protein bassoon OS=Mus musculus OX=10090 GN=Bsn PE=1 SV=4
MGNEASLEGGAGEGPLPPGGSGLGPGPGAGKPPSALAGGGQLPVAGAARAAGPPTPGLGP
VPGPGPGPGPGSVPRRLDPKEPLGSQRTTSPTPKQASATAPGRESPRETRAQGPSGQEAE
SPRRTLQVDSRTQRSGRSPSVSPDRGSTPTSPYSVPQIAPLPSSTLCPICKTSDLTSTPS
QPNFNTCTQCHNKVCNQCGFNPNPHLTQVKEWLCLNCQMQRALGMDMTTAPRSKSQQQLH
SPALSPAHSPAKQPLGKPEQERSPRGPGATQSGPRQAEAARATSVPGPTQATAPPEVGRV
SPQPPLSTKPSTAEPRPPAGEAQGKSATTVPSGLGAGEQTQEGLTGKLFGLGASLLTQAS
TLMSVQPEADTQGQPSPSKGQPKIVFSDASKEAGPRPPGSGPGPGPTPGAKTEPGARMGP
GSGPGALAKTGGTASPKHGRAEHQAASKAAAKPKTMPKERASACPLCQAELNMGSRGPAN
YNTCTACKLQVCNLCGFNPTPHLVEKTEWLCLNCQTKRLLEGSLGEPAPLPLPTPQQPPA
GVPHRAAGAAPLKQKGPQGLGQPSGSLPAKASPQATKASPQATKASPQATKASPQTTKAS
PQAKPLRATEPSKTSSSAQEKKTVTSAKAEPVPKPPPETTVPPGTPKAKSGVKRTDPATP
VVKPVPEAPKGGEAEEPVPKPYSQDLSRSPQSLSDTGYSSDGVSSSQSEITGVVQQEVEQ
LDSAGVTGPRPPSPSELHKVGSSLRPSLEAQAVAPSAEWSKPPRSSSSAVEDQKRRPHSL
SITPEAFDSDEELGDILEEDDSLAWGRQREQQDTAESSDDFGSQLRHDYVEDSSEGGLSP
LPPQPPARADMTDEEFMRRQILEMSAEEDNLEEDDTAVSGRGLAKHSAQKASARPRPESS
QEPKRRLPHNATTGYEELLSEAGPAEPTDSSGALQGGLRRFKTIELNSTGSYGHELDLGQ
GPDPNLDREPELEMESLTGSPEDRSRGEHSSTLPASTPSYTSGTSPTSLSSLEEDSDSSP
SRRQRLEEAKQQRKARHRSHGPLLPTIEDSSEEEELREEEELLREQEKMREVEQQRIRST
ARKTRRDKEELRAQRRRERSKTPPSNLSPIEDASPTEELRQAAEMEELHRSSCSEYSPSP
SLDSEAETLDGGPTRLYKSGSEYNLPAFMSLYSPTETPSGSSTTPSSGRPLKSAEEAYED
MMRKAEMLQRQQGQVAGARGPHGGPSQPTGPRSQGSFEYQDTQDHDYGGRASQPVAESTP
AGLGAAVYEEILQTSQSIARMRQASSRDLGFTEDKKKEKQFLNAESAYMDPMKQNGGPLT
PGTSPTQLAAPVSFSTSTSSDSSGGRVIPDVRVTQHFAKEPQDPLKLHSSPVSSTLTSKE
VGMTFSQGPGSPATTASPTRGYMTPTSPAGSERSPSTSSTIHSYGQPPTTANYGSQTEEL
PHAPSGPPGSGRAPREKPLSGGDSEVGAPQPSRGYSYFTGSSPPLSPSTPSESPTFSPGK
LGPRATAEFSTQTPSLTLSSDIPRSPGPPSPMVAQGTQTPHRPSTPRLVWQQSSQEAPIM
VITLASDASSQTRMVHASASTSPLCSPTDSQPTSHSYSQTTPPSASQMPSEPAGPPGFPR
APSAGTDGPLALYGWGALPAENISLCRISSVPGTSRVEPGPRPPGTAVVDLRTAVKPTPI
ILTDQGMDLTSLAVEARKYGLALDPVSGRQSTAVQPLVINLNAQEQTHTFLATATTVSIT
MASSVLMAQQKQPVVYGDPFQSRLDFGQGSGSPVCLAQVKQVEQAVQTAPYRGGPRGRPR
EAKFARYNLPNQVTPLARRDILITQMGTAQGVGLKPGPVPEPGAEPHRATPAELRSHAPP
GTRKPHTVVVQMGEGTAGTVTTLLPEEPAGALDLTGMRPESQLACCDMVYKFPFGSSCTG
TFHPAPSAPDKSVTDTALPGQSSGPFYSPRDPEPPEPLTFRTQGVVGPGPHEEQRPYPQG
LPGRLYSSMSDTNLAEAGLNYHAQRLGQLFQGPGRDSAVDLSSLKHSYSLGFADGRYLGQ
GLQYGSFTDLRHPTDLLSHPLPLRRYSSVSNIYSDHRYGPRGDAVGFQEASLAQYSATTA
REISRMCAALNSMDQYGGRHGSGSGGPDLVQYQPQHGPGLSAPQGLAPLRSGLLGNPTYP
EGQPSPGNLAQYGPAASQATAVRQLLPSTATVRAADGMIYSTINTPIAATLPITTQPASV
LRPMVRGGMYRPYVSGGVTAVPLTSLTRVPMIAPRVPLGPAGLYRYPAPRFPIASSVPPA
EGPVYLGKPAAAKASGAGGPPRPELPAGVAREEPFSTTAPAVIKEAPVAPAPGPAPAPPP
GQKPAGEAVAGSGSGVLSRPASEKEEASQEDRQRKQQEQLLQLERERVELEKLRQLRLQE
ELERERVELQRHREEEQLLVQRELQELQTIKQHVLQQQQEERQAQFALQREQLAQQRLQL
EQIQQLQQQLQLQLEEQKQRQKAPFPATCEAPSRGPPPAATELAQNGQYWPPLTHAAFIA
VAGTEGPGQPREPVLHRGLPSSASDMSLQTEEQWEAGRSGIKKRHSMPRLRDACEPESGP
DPSTVRRIADSSVQTDDEEGEGRYLVTRRRRTRRSADCSVQTDDEDNADWEQPVRRRRSR
LSRHSDSGSDSKHDATASSSTTAAATARAMSSVGIQTISDCSVQTEPEQLPRVSPAIHIT
AATDPKVEIVRYISAPEKTGRGESLACQTEPDGQAQGVAGPQLIGPTAISPYLPGIQIVT
PGALGRFEKKKPDPLEIGYQAHLPPESLSQLVSRQPPKSPQVLYSPVSPLSPHRLLDTSF
ASSERLNKAHVSPQKQFIADSTLRQQTLPRPMKTLQRSLSDPKPLSPTAEESAKERFSLY
QHQGGLGSQVSALPPNGLVRKVKRTLPSPPPEEAHLPLAGQVPSQLYAASLLQRGLAGPT
TVPATKASLLRELDRDLRLVEHESTKLRKKQAELDEEEKEIDAKLKYLELGITQRKESLA
KDRGGRDYPPLRGLGEHRDYLSDSELNQLRLQGCTTPAGQYVDYPASAAVPATPSGPTAF
QQPRFPPAAPQYTAGSSGPTQNGFPAHQAPTYTGPSTYPAPTYPPGTGYPAEPGLPSQPA
FHPTGHYAAPTPMPTTQSAPFPVQADSRAAHQKPRQTSLADLEQKVPTNYEVIGSPAVTM
SSAPPETGYSGPAVSGSYEQGKAPEHPRGSDRSSVSQSPAPTYPSDSHYTSLEQNVPRNY
VMIDDISELTKDSTPTASESQRLEPLGPGGVSGRPGKDPGEPAVLEGPTLPCCYGRGEEE
SEEDSYDPRGKSGHHRSMESNGRPSTHYYGDSDYRHGARADKYGPGPMGPKHPSKSLAPA
AISSKRSKHRKQGMEQKISKFSPIEEAKDVESDLASYPPPTVSSSLTSRGRKFQDEITYG
LKKNVYEQQRYYGVSSRDAAEEDERMYGSSSRSRMASAYSGEKLSSHDYSSRGKGYERER
DTAERLQKAGSKPSSLSMAHGRARPPMRSQASEEESPVSPLGRPRPAGGALPPGDTCPQF
CSSHSMPDVQEHVKDGPRAHAYKREEGYMLDDSHCVVSDSEAYHLGQEETDWFDKPRDAR
SDRFRHHGGHTVSSSQKRGPARHSYHDYDEPPEEGLWPHDEGGPGRHTSAKEHRHHSDHG
RHSGRHAGEEPGRRAAKPHARDMGRHEARPHPQASPAPAMQKKGQPGYPSSADYSQSSRA
PSAYHHASESKKGSRQAHTGPSALQPKADTQAQPQMQGRQAAPGPQQSQPPSSRQTPSGT
ASRQPQTQQQQQQQQQQQGLGQQAPQQAPSQARLQPQSQPTTRGTAPAASQPAGKPQPGP
TTAPGPQPAGPPRAEQASSSKPPAAKAPQQGRAPQAQTTPGPGPAGAKPGARPGGTPGAP
ASQPGAEGESVFSKILPGGAAEQAGKLTEAVSAFGKKFSSFW