; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00710 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00710
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSART-1 family protein DOT2
Genome locationClcChr08:1275633..1297884
RNA-Seq ExpressionClc08G00710
SyntenyClc08G00710
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.74Show/hide
Query:  DGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDEH
        DG  +   DERNG +   ARDRGEG DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDDASKEKEKEVKDS+RDRV  RE+RKEDRDEH
Subjt:  DGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDEH

Query:  EKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQE
        +KER+R  KVKDKDYDRE+YKEKEYERERDRKDRGKD+ER RERELEKDNVRGQDKERGKEKD+DR+R+RDRDRKKK+KDKDRSNENEREKGREK RDQE
Subjt:  EKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQE

Query:  DKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLE
        +KESYRNI K+RGKE+ L DD+K DQNKEK +DKEG G KNDEERI WIA  AKDYMLE+DGE+NRDR VDQGN VQHLGGEEN  GLKVGA +SS MLE
Subjt:  DKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLE

Query:  ERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQSIL
        ERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHGIDKVL GGAVVLTLKDQ+IL
Subjt:  ERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQSIL

Query:  ADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDL
        ADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAAADEGLTLDG G F+NDAEKKLEELR+RLQGASSVKHFEDL
Subjt:  ADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDL

Query:  NASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTS
        NAS KV+HDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR NAYQSAYAKADEASRSLQLVQ S
Subjt:  NASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTS

Query:  SARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EEDVFMD
        S RL+DN+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TTS QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE HKP EEDVFMD
Subjt:  SARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EEDVFMD

Query:  DDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSK
        DDE PKEEY ED KDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHE PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+KSK
Subjt:  DDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSK

Query:  DSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQ
        DSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQ
Subjt:  DSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQ

Query:  TSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        TSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ S+TGTKK K+
Subjt:  TSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

XP_008459644.1 PREDICTED: SART-1 family protein DOT2 isoform X2 [Cucumis melo]0.0e+0091.41Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDERNG             DD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDS+RDRVRSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR
        HEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDRERERERELEKDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKHR
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR

Query:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST
        DQEDKESYRN+ K+RGKERILEDDRKTDQ K+K QDKEG GSKNDEER GWIADE KDYMLE+DGENNRDRDV+QGNMVQHLGGEENF GLKVG+H SST
Subjt:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST

Query:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK
        MLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVLTLK
Subjt:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK
        DQSILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVK
Subjt:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK

Query:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ
        HFEDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQ
Subjt:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ

Query:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE
        LVQTSS RLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG IALLATATTS+Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKP EE
Subjt:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE

Query:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEVPKEEY EDVKDKDGGWTEVKDTA+EE  P++N+A+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
Subjt:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

XP_011656109.1 SART-1 family protein DOT2 [Cucumis sativus]0.0e+0091.82Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDER+G             DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKE KDS+RDR+RSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ
        HEKERSRG KVKDKDYDR+IYK+KEYERERDRKDRGKDRERERERELEKD VRG DKERGKEKD+DRD+DRDRDRKKKDKDKDRSNE EREKGR+KHRDQ
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ

Query:  EDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTML
        EDKESYRNI KDRGKERILEDDRKTDQNK+K QDKEG GSKNDEERIG I DE KDYMLE+DGENNRDRDV+QGNMVQHLG EENF GLKVG+HASSTML
Subjt:  EDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTML

Query:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLKDQ
        EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIA EDTT+N  LAGVKVLHG+DKVLEGGAVVLTLKDQ
Subjt:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLKDQ

Query:  SILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF
        SILADG++NE++DVLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF
Subjt:  SILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF

Query:  EDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLV
        EDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQLV
Subjt:  EDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLV

Query:  QTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE-DV
        Q SSARLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG +ALLATATTS+Q TDDQ+TKAGELQENKVVFTEMEEFVWGLQLDE+ HKPEE DV
Subjt:  QTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE-DV

Query:  FMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
        FMDDDE+PKEEY EDVKDKDGGWTEVKDTA EE TPE+NEA+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
Subjt:  FMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES

Query:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
Subjt:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Query:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

XP_016902433.1 PREDICTED: SART-1 family protein DOT2 isoform X1 [Cucumis melo]0.0e+0091.41Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDERNG             DD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDS+RDRVRSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR
        HEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDRERERERELEKDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKHR
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR

Query:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST
        DQEDKESYRN+ K+RGKERILEDDRKTDQ K+K QDKEG GSKNDEER GWIADE KDYMLE+DGENNRDRDV+QGNMVQHLGGEENF GLKVG+H SST
Subjt:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST

Query:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK
        MLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVLTLK
Subjt:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK
        DQSILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVK
Subjt:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK

Query:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ
        HFEDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQ
Subjt:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ

Query:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE
        LVQTSS RLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG IALLATATTS+Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKP EE
Subjt:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE

Query:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEVPKEEY EDVKDKDGGWTEVKDTA+EE  P++N+A+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
Subjt:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

XP_038890025.1 SART-1 family protein DOT2 [Benincasa hispida]0.0e+0093.95Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGE-GHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRD
        MDGER SAPDERNGPDI+WAR+RGE GHDDFGYSG EKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKE KDS+RDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDISWARDRGE-GHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRD

Query:  EHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKE----KDKDRDRDRDRDRKKKDKDKDRSNENEREKGRE
        EHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKE+GKE    +D+DRDRDRDRDRKKKDKDKDRSNENEREKGRE
Subjt:  EHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKE----KDKDRDRDRDRDRKKKDKDKDRSNENEREKGRE

Query:  KHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHA
        KHRDQEDKESYRNI KDRGKERILEDDRKTDQ+KEKSQDKEG GSK DEERIGWIADE KDYM+E+DG+NNR+RDVDQGNMVQ LGGEENF GLKVGAHA
Subjt:  KHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHA

Query:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVL
        SSTMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIA EDTT+  NLAGVKVLHGIDKVLEGGAVVL
Subjt:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVL

Query:  TLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS
        TLKDQ+ILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQG++
Subjt:  TLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS

Query:  SVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASR
        SV HFEDLNASTKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLD+DALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASR
Subjt:  SVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASR

Query:  SLQLVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP
        SL+LVQTSS RLEDNDDTLI DDDEDFYKSL+RARKLALK+QEAASGPG IALLATATTS+QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDE+ HKP
Subjt:  SLQLVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP

Query:  -EEDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED
         EEDVFMDDDEVPKEEY E++KDKDGGWTEVKDTA+EE TPE+NEAIAPDETIHE PVGKGLSSALKLLK+RGTLKESIEWGGRNMDKRKSKLVGIVDED
Subjt:  -EEDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED

Query:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

TrEMBL top hitse value%identityAlignment
A0A0A0KXY6 Uncharacterized protein0.0e+0091.82Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDER+G             DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKE KDS+RDR+RSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ
        HEKERSRG KVKDKDYDR+IYK+KEYERERDRKDRGKDRERERERELEKD VRG DKERGKEKD+DRD+DRDRDRKKKDKDKDRSNE EREKGR+KHRDQ
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ

Query:  EDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTML
        EDKESYRNI KDRGKERILEDDRKTDQNK+K QDKEG GSKNDEERIG I DE KDYMLE+DGENNRDRDV+QGNMVQHLG EENF GLKVG+HASSTML
Subjt:  EDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTML

Query:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLKDQ
        EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIA EDTT+N  LAGVKVLHG+DKVLEGGAVVLTLKDQ
Subjt:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLKDQ

Query:  SILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF
        SILADG++NE++DVLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF
Subjt:  SILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF

Query:  EDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLV
        EDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQLV
Subjt:  EDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLV

Query:  QTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE-DV
        Q SSARLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG +ALLATATTS+Q TDDQ+TKAGELQENKVVFTEMEEFVWGLQLDE+ HKPEE DV
Subjt:  QTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE-DV

Query:  FMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
        FMDDDE+PKEEY EDVKDKDGGWTEVKDTA EE TPE+NEA+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
Subjt:  FMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES

Query:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
Subjt:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Query:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.0e+0091.41Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDERNG             DD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDS+RDRVRSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR
        HEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDRERERERELEKDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKHR
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR

Query:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST
        DQEDKESYRN+ K+RGKERILEDDRKTDQ K+K QDKEG GSKNDEER GWIADE KDYMLE+DGENNRDRDV+QGNMVQHLGGEENF GLKVG+H SST
Subjt:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST

Query:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK
        MLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVLTLK
Subjt:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK
        DQSILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVK
Subjt:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK

Query:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ
        HFEDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQ
Subjt:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ

Query:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE
        LVQTSS RLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG IALLATATTS+Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKP EE
Subjt:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE

Query:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEVPKEEY EDVKDKDGGWTEVKDTA+EE  P++N+A+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
Subjt:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.0e+0091.41Show/hide
Query:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE
        MD ER SAPDERNG             DD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDS+RDRVRSREKRKEDRDE
Subjt:  MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDE

Query:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR
        HEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDRERERERELEKDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKHR
Subjt:  HEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKHR

Query:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST
        DQEDKESYRN+ K+RGKERILEDDRKTDQ K+K QDKEG GSKNDEER GWIADE KDYMLE+DGENNRDRDV+QGNMVQHLGGEENF GLKVG+H SST
Subjt:  DQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASST

Query:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK
        MLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVLTLK
Subjt:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSN--LAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK
        DQSILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVK
Subjt:  DQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVK

Query:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ
        HFEDLN STKV+HDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSLQ
Subjt:  HFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQ

Query:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE
        LVQTSS RLEDNDD LI DDDEDFYKSLERARKLALKKQ+AASGPG IALLATATTS+Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKP EE
Subjt:  LVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EE

Query:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEVPKEEY EDVKDKDGGWTEVKDTA+EE  P++N+A+APDETIHE PVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
Subjt:  DVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  VKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

A0A6J1FR42 SART-1 family protein DOT20.0e+0088.01Show/hide
Query:  MDGERLSAP--DERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDR
        MD +  S P  DERNG +   ARDRGEG DDFG SGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDDASKEKEKEVKDS+RDRV  RE+RKEDR
Subjt:  MDGERLSAP--DERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKD------RDRDRDRDRKKKDKDKDRSNENEREK
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKDRGKD+ER RERELEKDNVRGQDKERGKEKD+D      R+R+RDRDRKKK+KDKDRSNENEREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKD------RDRDRDRDRKKKDKDKDRSNENEREK

Query:  GREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVG
        GREK RDQE+KESYRNI KDRGKE+ L DD+K DQNKEK +DKEGTG KN+EERI WIA  AKDYMLE+DGE+NRDR VDQGN VQ LGGEEN  GLKVG
Subjt:  GREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVG

Query:  AHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVV
        A +SS MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHGIDKVL GGAVV
Subjt:  AHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVV

Query:  LTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGA
        LTLKDQ+ILADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAAADEGLTLDG G F+NDAEKKLEELR+RLQGA
Subjt:  LTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGA

Query:  SSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEAS
        SSVKHFEDLNAS KV+HDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR NAYQSAYAKADEAS
Subjt:  SSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEAS

Query:  RSLQLVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHK
        RSLQLVQ SS RL+DN+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TTS QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE HK
Subjt:  RSLQLVQTSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHK

Query:  P-EEDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
        P EEDVFMDDDE PKEEY ED KDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHE PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DE
Subjt:  P-EEDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE

Query:  DEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        DEPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
Subjt:  DEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ S+TGTKK K+
Subjt:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

A0A6J1IPE4 SART-1 family protein DOT20.0e+0087.39Show/hide
Query:  DGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDEH
        DG  +   DERNG +   ARDRGEG DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDDASKEKEKEVKDS+RDRV  RE+RKEDRDEH
Subjt:  DGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDEH

Query:  EKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKD--RDRDRDRDRKKKDKDKDRSNENEREKGREKHRD
        +KER+R  KVKDKDYDRE+YKEKEY+RERDRKDRGKD+ER RERELEKDNVRGQDKERGKEKD+D  R+R+RDRDRKKK+KDKDRSNENEREKGREK RD
Subjt:  EKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKD--RDRDRDRDRKKKDKDKDRSNENEREKGREKHRD

Query:  QEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTM
        QE+KESYRNI KDRGKE+ L DD+K DQNKEK +DKEG G KNDEERI W+A        E+DGE+NRDR VDQGN VQHLGGE+N  GLKVGA +SS M
Subjt:  QEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTM

Query:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQS
        LEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHGIDKVL GGAVVLTLKDQ+
Subjt:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQS

Query:  ILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFE
        ILADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAAADEGLTLDG G F+NDAEKKLEELR+RLQGASSVKHFE
Subjt:  ILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFE

Query:  DLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQ
        DLNAS KV+HDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR NAYQSAYAKADEASRSLQLVQ
Subjt:  DLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQ

Query:  TSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EEDVF
         SS RL+ N+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TTS QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE HKP EEDVF
Subjt:  TSSARLEDNDDTLI-DDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP-EEDVF

Query:  MDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK
        MDDDE PKEEY ED KDKDGGWTEVKDTA+EEP PEDNE IAPDETIHE PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+K
Subjt:  MDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK

Query:  SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
        SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
Subjt:  SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP

Query:  GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV
        GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+  +TGTKK K+
Subjt:  GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 11.2e-1826.67Show/hide
Query:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT
        R RE +  K    G     G+ + + R+R  +R   ++  + + RS+ + RE+ + +  ++  K   R+ G +     +    D  +   +E ++ +   
Subjt:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT

Query:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK
        G K  E  +  I  EA        G        D  N +     EE    L+    A+    E+R+ N K  ++K   E+     +  +W++RSR+L++ 
Subjt:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK

Query:  KLSEKEKALQLSKIFEEQDNIDQGVS---DDDIALED----TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA
           EK+ A + +K+ EE D  + GVS   +++         +  +L G+ V H ID   EG  ++LTLKD+ +L      E+ DVL NV + ++++ +  
Subjt:  KLSEKEKALQLSKIFEEQDNIDQGVS---DDDIALED----TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA

Query:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNASTKVAHDYYTQDEML
         +  KKK   Y     DE+  +      + +L +YD+    +      L+  G  +   E++LEE+R +L+  A S+          ++A +Y T +EM+
Subjt:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNASTKVAHDYYTQDEML

Query:  KFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLID
         FKK K++ K +RKKEK + + A +   +      GD GS  R   RR+  + E+EK        +  +     D +       +            +++
Subjt:  KFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLID

Query:  DDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVFMDD
        +DE   +  K LE   R R+L   +Q   SG  V+ ++    +  +  ++        ++  +VF    EF         +GL  + E+ +   D F  D
Subjt:  DDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVFMDD

Query:  DEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---EPKE
        +E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++    + K 
Subjt:  DEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---EPKE

Query:  SKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   KTPY+V
Subjt:  SKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVK
        LSG  K
Subjt:  LSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 11.1e-1927.23Show/hide
Query:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT
        R RE +  K    G     G+ + + R+R  +R   ++  + + RS  + RE+ + +  ++  K   R+ G +     +    D  +   +E ++ +   
Subjt:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT

Query:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK
        G K  E  +  +  EA        G        D  N +     EE    L+    A+    E+R+ N K  ++K   E+     +  +W++RSR+L++ 
Subjt:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK

Query:  KLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA
           EK+ A + +K+ EE D  + GVS     + +   +D  S  +L G+ V H ID   EG  VVLTLKD+ +L +G+     DVL NV + ++++ D  
Subjt:  KLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA

Query:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-ASTKVAHDYYTQDEML
         +  KKK   Y     DE+  +      + +L +YD+    +      L+  G  +   E++LEE+R +L+        + LN    ++A +Y + +EM+
Subjt:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-ASTKVAHDYYTQDEML

Query:  KFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDTL
         FKK K++ K +RKKEK ++     + +  G      GD GS  R   RR+  + E+E  E E +    Q      D    ++ +          +    
Subjt:  KFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDTL

Query:  IDDDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVFM
        +++DE   +  K LE   R R+L   +Q   SG  V+ ++    +  +  +++       ++  +VF    EF         +GL  + E+ +   D F 
Subjt:  IDDDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVFM

Query:  DDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---EP
         D+E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++    + 
Subjt:  DDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---EP

Query:  KESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPY
        K S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   KTPY
Subjt:  KESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPY

Query:  LVLSGHVK
        +VLSG  K
Subjt:  LVLSGHVK

Q9LFE0 SART-1 family protein DOT21.3e-21956.07Show/hide
Query:  DSDRDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDR
        +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  ++K++ R+   K++ +DR+R R+ + EK+  RG+DKER K+K +DR +++D+
Subjt:  DSDRDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDR

Query:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQ
        + K++++ KDR NE + EK ++K R +          K+R  ++  EDD +T +  E+ +  +  G                     N+G +N D     
Subjt:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQ

Query:  GNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNL
                          G  AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D     +   +L
Subjt:  GNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNL

Query:  AGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG
        +GVKVLHG++KV+EGGAV+LTLKDQS+L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ AA DEG+ LD +G 
Subjt:  AGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG

Query:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE
        F  +AEKKLEELR+R+QG  +   FEDLN+S KV+ DY++Q+EMLKFKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E
Subjt:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE

Query:  AEMRHNAYQSAYAKADEASRSLQLVQTSS-ARLEDNDDTLIDDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKV
         E R NAYQ A AKADEASR L+  Q     R ED    L DD ED YKSLE+AR+LAL KK+EA SGP  +A L  A+++NQTTDD  T   E QEN V
Subjt:  AEMRHNAYQSAYAKADEASRSLQLVQTSS-ARLEDNDDTLIDDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKV

Query:  VFTEMEEFVWGLQLDEEDHKPE-EDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKES
        VFTEM +FVWGLQ + +  KPE EDVFM++D  PK   +E  ++   G TEV DT  +      D + I PDE IHE  VGKGLS ALKLLKDRGTLKE 
Subjt:  VFTEMEEFVWGLQLDEEDHKPE-EDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKES

Query:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT
        +EWGGRNMDK+KSKLVGIVD+D  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DT
Subjt:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT

Query:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEA--SDTGTKKAK
        PS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E   SDT  K+ K
Subjt:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEA--SDTGTKKAK

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 17.4e-2127.56Show/hide
Query:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT
        R RE +  K    G     G+ + + R+R  +R   ++  + + RS  + RE+ + +  ++  K   R+ G +     +    D  +   +E ++ +   
Subjt:  RERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIGKD-RGKERILEDDRKTDQNKEKSQDKEGT

Query:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK
        G K  E  +  +  EA        G        D  N +     EE    L+    A+    E+R+ N K  ++K   E+     +  +W++RSR+L++ 
Subjt:  GSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEK

Query:  KLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA
           EK+ A + +K+ EE D  + GVS     + +   +D  S  +L G+ V H ID   EG  VVLTLKD+ +L DG+     DVL NV + ++++ D  
Subjt:  KLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIALEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMA

Query:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRL----QGASSVKHFEDLNASTKVAHDYYTQD
         +  KKK   Y     DE+  +      + +L +YD+    +      L+  G  +   E++LEE+R +L    Q  SSV          ++A +Y + +
Subjt:  YKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRRRL----QGASSVKHFEDLNASTKVAHDYYTQD

Query:  EMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDT
        EM+ FKK K++ K +RKKEK + + A +   +      GD GS  R   RR+  + E+E  E E +    Q   +            +     L      
Subjt:  EMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDT

Query:  LIDDDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVF
         +++DE   +  K LE   R R+L   +Q   SG  V+ ++    +  +  +++       ++  +VF    EF         +GL  + E+ +   D F
Subjt:  LIDDDE---DFYKSLE---RARKLALKKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEDVF

Query:  MDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---E
          D+E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++    +
Subjt:  MDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---E

Query:  PKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP
         K S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   KTP
Subjt:  PKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP

Query:  YLVLSGHVK
        Y+VLSG  K
Subjt:  YLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT3G14700.1 SART-1 family1.9e-1940.46Show/hide
Query:  EVKDTAEE--EPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERT
        EV   AE+  +   +++  +  D  + EA VG GLS AL  L+++GT KE            + K+VG+      K++  +D R     D  K+I I+R 
Subjt:  EVKDTAEE--EPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERT

Query:  DEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        +++GRIMT KE++R L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  KTPY+VL
Subjt:  DEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family9.3e-22156.07Show/hide
Query:  DSDRDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDR
        +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  ++K++ R+   K++ +DR+R R+ + EK+  RG+DKER K+K +DR +++D+
Subjt:  DSDRDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDR

Query:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQ
        + K++++ KDR NE + EK ++K R +          K+R  ++  EDD +T +  E+ +  +  G                     N+G +N D     
Subjt:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIGKDRGKERILEDDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQ

Query:  GNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNL
                          G  AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D     +   +L
Subjt:  GNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNL

Query:  AGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG
        +GVKVLHG++KV+EGGAV+LTLKDQS+L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ AA DEG+ LD +G 
Subjt:  AGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG

Query:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE
        F  +AEKKLEELR+R+QG  +   FEDLN+S KV+ DY++Q+EMLKFKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E
Subjt:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE

Query:  AEMRHNAYQSAYAKADEASRSLQLVQTSS-ARLEDNDDTLIDDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKV
         E R NAYQ A AKADEASR L+  Q     R ED    L DD ED YKSLE+AR+LAL KK+EA SGP  +A L  A+++NQTTDD  T   E QEN V
Subjt:  AEMRHNAYQSAYAKADEASRSLQLVQTSS-ARLEDNDDTLIDDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTSNQTTDDQNTKAGELQENKV

Query:  VFTEMEEFVWGLQLDEEDHKPE-EDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKES
        VFTEM +FVWGLQ + +  KPE EDVFM++D  PK   +E  ++   G TEV DT  +      D + I PDE IHE  VGKGLS ALKLLKDRGTLKE 
Subjt:  VFTEMEEFVWGLQLDEEDHKPE-EDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKES

Query:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT
        +EWGGRNMDK+KSKLVGIVD+D  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DT
Subjt:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT

Query:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEA--SDTGTKKAK
        PS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E   SDT  K+ K
Subjt:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEA--SDTGTKKAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGGGAACGGTTATCCGCACCTGATGAGAGAAATGGTCCAGATATCTCTTGGGCAAGAGATCGTGGGGAAGGACATGATGACTTTGGTTATAGTGGAGCAGAAAA
GTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGCATCGA
AAGAAAAGGAGAAAGAGGTAAAAGATTCAGATAGGGATCGAGTTCGTAGTCGGGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGCAGCAAA
GTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGACAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCT
GGAGAAGGACAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAAAGACAGGGACAGAGATAGGGATAGGGATAGGAAGAAGAAGGACAAGGACAAGGACC
GATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGGCAAGGACAGAGGAAAAGAGAGAATTTTGGAA
GATGATAGGAAAACAGATCAAAACAAGGAGAAATCACAAGATAAAGAAGGAACTGGCAGCAAAAATGATGAGGAAAGAATTGGTTGGATTGCGGATGAGGCTAAGGATTA
TATGCTAGAAAATGATGGTGAGAATAACAGGGACAGAGATGTTGATCAAGGGAACATGGTCCAGCATTTGGGAGGTGAAGAAAATTTTGGTGGTTTGAAAGTTGGAGCTC
ATGCTTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAA
CTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGACAATATTGATCAAGGTGTAAGCGATGATGATATTGCACTGGA
AGATACAACTAGTAATCTAGCTGGAGTTAAAGTACTTCATGGCATAGACAAAGTATTAGAGGGTGGCGCGGTTGTCTTAACCCTTAAAGATCAGAGTATCTTAGCTGATG
GCGACATTAATGAAGACGTAGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGAC
AAGTTTAATGATGAAAATTATGGTGAGAAGAAGATGCTTCCCCAGTATGATGATCCAGCAGCTGCAGATGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTTAATAATGA
TGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCGTCAGTCAAGCACTTTGAAGATCTTAATGCATCAACGAAAGTCGCACATGATTATTACACTC
AAGATGAGATGCTTAAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCAGAAGCAATTTCTGCTGGATTGGGT
GTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCGCAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGGCATAATGCATACCAGTCAGCCTATGCTAA
AGCAGATGAGGCATCGAGATCTCTACAATTAGTTCAAACTAGCTCCGCCAGATTAGAGGACAACGATGATACTCTCATAGATGATGATGAAGATTTCTATAAGTCATTGG
AGAGGGCAAGAAAATTAGCTCTTAAGAAACAAGAGGCAGCATCCGGACCAGGAGTAATTGCTCTTCTTGCTACAGCAACAACTAGCAATCAGACAACTGATGATCAAAAC
ACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATTTACGGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAGGATCATAAACCAGAAGAAGATGTCTT
TATGGATGACGATGAAGTACCAAAAGAAGAATATCTTGAAGATGTTAAGGATAAAGATGGTGGGTGGACTGAAGTCAAAGATACTGCTGAAGAAGAACCCACTCCTGAGG
ATAATGAGGCAATAGCTCCAGACGAAACAATCCACGAAGCTCCCGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATC
GAATGGGGCGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCGAAGGAATCTAAGTCGAAGGATTCCCGTTTATCTTCTTTGGT
GGATTACAAGAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCGTTTCGTCGACTCTCTCACAAGTTCCATGGGAAGGGACCTG
GAAAAATGAAACAAGAAAAGCGCATGAAACAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCTGATACTCCTTCGTTATCAGTGGAGAGAATGAGGGAAGCT
CAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCAACAGTTGAAAAGGATCTCCCTGGTGG
CTTGACACCCATGCTTGGTGATAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCGGATACAGGAACAAAGAAGGCAAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGGGAACGGTTATCCGCACCTGATGAGAGAAATGGTCCAGATATCTCTTGGGCAAGAGATCGTGGGGAAGGACATGATGACTTTGGTTATAGTGGAGCAGAAAA
GTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGCATCGA
AAGAAAAGGAGAAAGAGGTAAAAGATTCAGATAGGGATCGAGTTCGTAGTCGGGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGCAGCAAA
GTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGACAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCT
GGAGAAGGACAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAAAGACAGGGACAGAGATAGGGATAGGGATAGGAAGAAGAAGGACAAGGACAAGGACC
GATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGGCAAGGACAGAGGAAAAGAGAGAATTTTGGAA
GATGATAGGAAAACAGATCAAAACAAGGAGAAATCACAAGATAAAGAAGGAACTGGCAGCAAAAATGATGAGGAAAGAATTGGTTGGATTGCGGATGAGGCTAAGGATTA
TATGCTAGAAAATGATGGTGAGAATAACAGGGACAGAGATGTTGATCAAGGGAACATGGTCCAGCATTTGGGAGGTGAAGAAAATTTTGGTGGTTTGAAAGTTGGAGCTC
ATGCTTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAA
CTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGACAATATTGATCAAGGTGTAAGCGATGATGATATTGCACTGGA
AGATACAACTAGTAATCTAGCTGGAGTTAAAGTACTTCATGGCATAGACAAAGTATTAGAGGGTGGCGCGGTTGTCTTAACCCTTAAAGATCAGAGTATCTTAGCTGATG
GCGACATTAATGAAGACGTAGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGATGAC
AAGTTTAATGATGAAAATTATGGTGAGAAGAAGATGCTTCCCCAGTATGATGATCCAGCAGCTGCAGATGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTTAATAATGA
TGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCGTCAGTCAAGCACTTTGAAGATCTTAATGCATCAACGAAAGTCGCACATGATTATTACACTC
AAGATGAGATGCTTAAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCAGAAGCAATTTCTGCTGGATTGGGT
GTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCGCAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGGCATAATGCATACCAGTCAGCCTATGCTAA
AGCAGATGAGGCATCGAGATCTCTACAATTAGTTCAAACTAGCTCCGCCAGATTAGAGGACAACGATGATACTCTCATAGATGATGATGAAGATTTCTATAAGTCATTGG
AGAGGGCAAGAAAATTAGCTCTTAAGAAACAAGAGGCAGCATCCGGACCAGGAGTAATTGCTCTTCTTGCTACAGCAACAACTAGCAATCAGACAACTGATGATCAAAAC
ACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATTTACGGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAGGATCATAAACCAGAAGAAGATGTCTT
TATGGATGACGATGAAGTACCAAAAGAAGAATATCTTGAAGATGTTAAGGATAAAGATGGTGGGTGGACTGAAGTCAAAGATACTGCTGAAGAAGAACCCACTCCTGAGG
ATAATGAGGCAATAGCTCCAGACGAAACAATCCACGAAGCTCCCGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATC
GAATGGGGCGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCGAAGGAATCTAAGTCGAAGGATTCCCGTTTATCTTCTTTGGT
GGATTACAAGAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCGTTTCGTCGACTCTCTCACAAGTTCCATGGGAAGGGACCTG
GAAAAATGAAACAAGAAAAGCGCATGAAACAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCTGATACTCCTTCGTTATCAGTGGAGAGAATGAGGGAAGCT
CAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCAACAGTTGAAAAGGATCTCCCTGGTGG
CTTGACACCCATGCTTGGTGATAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCGGATACAGGAACAAAGAAGGCAAAAGTTTGAAGCCTTAGCT
TTATACTAGAATTAGTAGAATTTTCAGTTTATTTTAGTCTACCAACTAGATGCAACGTCGTATATTCAATTTCATATCCCAATCAATTCTTTGACATTTGAAGTGTTTGT
ATTATAGTCAATATATTCTTCTTTTCATGAGTTTACATTTTAGATTTTGTACAAATTTTTCTCAGCTTTTCGGGAAAAGATATTGTACGAGTTACTGAGTTAACAAATAT
TTTAGGGTCAACTAGAATAATAGCTATTCGAATTCTATTGCGATGAAAATCCCATTGAAGTTGTTGACATCAATGTTGAGGATATAGCAAGTCAAGCCTATTGTTGTGGA
TGTCTTCACTATCACAATTCCAAGTTTAAAAGTGTCGATGATGGTTGAGATTAACGACTAGTTTTCCTCTTCACTTACCATTGATATCCCGAAAGAACCTCATAATTAGG
GGAAAATCGTTGCAGAAAATCAGTAGAACCAACAGGGAAATTGTATGATTTGGCTAAGGCTTAAAAAATGAGAAAGAAATTAG
Protein sequenceShow/hide protein sequence
MDGERLSAPDERNGPDISWARDRGEGHDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEVKDSDRDRVRSREKRKEDRDEHEKERSRGSK
VKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIGKDRGKERILE
DDRKTDQNKEKSQDKEGTGSKNDEERIGWIADEAKDYMLENDGENNRDRDVDQGNMVQHLGGEENFGGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRK
LEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIALEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQSILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDD
KFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVAHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLG
VGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIDDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTSNQTTDDQN
TKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEDVFMDDDEVPKEEYLEDVKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEAPVGKGLSSALKLLKDRGTLKESI
EWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREA
QAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASDTGTKKAKV