; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006850 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006850
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSART-1 family protein DOT2
Genome locationchr09:21587228..21595112
RNA-Seq ExpressionPI0006850
SyntenyPI0006850
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.97Show/hide
Query:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH
        MD + SS P  DERNG          DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDD  KEKEKE KDSERDRV  RE+RKEDRDEH
Subjt:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH

Query:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQE
        +KER R  KVKDKDYDRE+YK+KEYERERDRKDRGKD+ER RERELEKDNVRG DKERGKEKDRDRE+ERDRDRKK  KDKDRSNE EREKGR+K RDQE
Subjt:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQE

Query:  DKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLE
        +KESYRNIDKERGKE+ L DD+K DQNK+K +DKEGIG KNDEERI W A   KDYMLESDGE+NRDR V QGN VQHLGGEEN DGLKVG+ +SS MLE
Subjt:  DKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLE

Query:  ERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQS
        ERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA E+ T+N  LAGVKVLHG+DKVL GGAVVLTLKDQ+
Subjt:  ERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQS

Query:  ILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFE
        ILADGDVNE++DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELR+RLQGASSVKHFE
Subjt:  ILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFE

Query:  DLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQ
        DLN S KVS DYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQA+K EQE+SEAEMR +AYQSAYAKADEASRSLQLVQ
Subjt:  DLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQ

Query:  TSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVF
         SS RL+DN DTLI DDDED YKSLERARKLALKKQ+AASGP A+ALLAT TTS Q TDDQNTKAGELQENKVVFTEMEEFVWGLQLDE++HKPEEEDVF
Subjt:  TSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVF

Query:  MDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESK
        MDDDE PKEEYHED KDKDGGWTEVKDTAKEE TPE+NE +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+D+DEPKE+K
Subjt:  MDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESK

Query:  SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
        SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
Subjt:  SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP

Query:  GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ SNTGTKK K+
Subjt:  GQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

XP_008459644.1 PREDICTED: SART-1 family protein DOT2 isoform X2 [Cucumis melo]0.0e+0096.07Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDERNGDD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKE KDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK
        KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDR+K+RDRDR    K KDKDRSNEIEREKGR+KHRDQEDKESYRN+DK
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK

Query:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR
        ERGKERILEDDRKTDQ KQK QDKEGIGSKNDEER GW ADEGKDYMLESDGENNRDRDV QGNMVQHLGGEENFDGLKVGSH SSTMLEERIRNMKEDR
Subjt:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR

Query:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
        LKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPENTTNNHDL GVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
Subjt:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE

Query:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR
        LD+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVKHFEDLNVSTKVS 
Subjt:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR

Query:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG
        DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQTSS+RLEDN 
Subjt:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG

Query:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
        D LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
Subjt:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE

Query:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV
        YHEDVKDKDGGWTEVKDTAKEES P+EN+AVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLV
Subjt:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV

Query:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
        DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
Subjt:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF

Query:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

XP_011656109.1 SART-1 family protein DOT2 [Cucumis sativus]0.0e+0096.28Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDER+GDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKEAKDSERDR+RSREKRKEDRDEHEKER RG KVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKER
        KDYDR+IYKDKEYERERDRKDRGKDRERERERELEKD VRGHDKERGKEKDRDR+K+RDRDRKK  KDKDRSNEIEREKGRDKHRDQEDKESYRNIDK+R
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKER

Query:  GKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLK
        GKERILEDDRKTDQNKQK QDKEGIGSKNDEERIG   DEGKDYMLESDGENNRDRDV QGNMVQHLG EENFDGLKVGSHASSTMLEERIRNMKEDRLK
Subjt:  GKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLK

Query:  KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELD
        KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPE+TTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADG+VNEELD
Subjt:  KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELD

Query:  VLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDY
        VLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVS DY
Subjt:  VLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDY

Query:  YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDT
        YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQ SS+RLEDN D 
Subjt:  YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDT

Query:  LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYH
        LIADDDEDFYKSLERARKLALKKQDAASGPGA+ALLATATTSSQATDDQ+TKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEE+DVFMDDDE+PKEEYH
Subjt:  LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYH

Query:  EDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDY
        EDVKDKDGGWTEVKDTA EESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLVDY
Subjt:  EDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDY

Query:  KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT
        KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT
Subjt:  KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT

Query:  VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

XP_016902433.1 PREDICTED: SART-1 family protein DOT2 isoform X1 [Cucumis melo]0.0e+0096.07Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDERNGDD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKE KDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK
        KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDR+K+RDRDR    K KDKDRSNEIEREKGR+KHRDQEDKESYRN+DK
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK

Query:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR
        ERGKERILEDDRKTDQ KQK QDKEGIGSKNDEER GW ADEGKDYMLESDGENNRDRDV QGNMVQHLGGEENFDGLKVGSH SSTMLEERIRNMKEDR
Subjt:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR

Query:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
        LKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPENTTNNHDL GVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
Subjt:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE

Query:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR
        LD+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVKHFEDLNVSTKVS 
Subjt:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR

Query:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG
        DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQTSS+RLEDN 
Subjt:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG

Query:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
        D LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
Subjt:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE

Query:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV
        YHEDVKDKDGGWTEVKDTAKEES P+EN+AVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLV
Subjt:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV

Query:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
        DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
Subjt:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF

Query:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

XP_038890025.1 SART-1 family protein DOT2 [Benincasa hispida]0.0e+0091.86Show/hide
Query:  MDGERSSAPDERNG--------------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRD
        MDGERSSAPDERNG              DDFGYSG EKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKEAKDSERDRVRSREKRKEDRD
Subjt:  MDGERSSAPDERNG--------------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRD

Query:  EHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR------KKKDKDRSNEIEREKGRD
        EHEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDRERERERELEKDNVRG DKE+GKE+DRDR+++RDRDR      K KDKDRSNE EREKGR+
Subjt:  EHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR------KKKDKDRSNEIEREKGRD

Query:  KHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHA
        KHRDQEDKESYRNIDK+RGKERILEDDRKTDQ+K+KSQDKEGIGSK DEERIGW ADEGKDYM+ESDG+NNR+RDV QGNMVQ LGGEENFDGLKVG+HA
Subjt:  KHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHA

Query:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVL
        SSTMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPE+TTNNH+LAGVKVLHG+DKVLEGGAVVL
Subjt:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVL

Query:  TLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS
        TLKDQ+ILADGD+NE++DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA ADEGLTLDGRGGFNNDAEKKLEELRRRLQG++
Subjt:  TLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS

Query:  SVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASR
        SV HFEDLN STKVS DYYTQDEMLKFKKPRKKKSLRKKEKLD+DALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR +AYQSAYAKADEASR
Subjt:  SVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASR

Query:  SLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKP
        SL+LVQTSS RLEDN DTLIADDDEDFYKSL+RARKLALK+Q+AASGPGAIALLATATTSSQ TDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKP
Subjt:  SLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKP

Query:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDD
        EEEDVFMDDDEVPKEEYHE++KDKDGGWTEVKDTAKEE+TPEENEA+APDETIHEVPVGKGLSSALKLLK+RGTLKESIEWGGRNMDKRKSKLVGIVD+D
Subjt:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDD

Query:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

TrEMBL top hitse value%identityAlignment
A0A0A0KXY6 Uncharacterized protein0.0e+0096.28Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDER+GDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKEAKDSERDR+RSREKRKEDRDEHEKER RG KVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKER
        KDYDR+IYKDKEYERERDRKDRGKDRERERERELEKD VRGHDKERGKEKDRDR+K+RDRDRKK  KDKDRSNEIEREKGRDKHRDQEDKESYRNIDK+R
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKK--KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKER

Query:  GKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLK
        GKERILEDDRKTDQNKQK QDKEGIGSKNDEERIG   DEGKDYMLESDGENNRDRDV QGNMVQHLG EENFDGLKVGSHASSTMLEERIRNMKEDRLK
Subjt:  GKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLK

Query:  KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELD
        KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPE+TTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADG+VNEELD
Subjt:  KQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELD

Query:  VLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDY
        VLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVS DY
Subjt:  VLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDY

Query:  YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDT
        YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQ SS+RLEDN D 
Subjt:  YTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDT

Query:  LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYH
        LIADDDEDFYKSLERARKLALKKQDAASGPGA+ALLATATTSSQATDDQ+TKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEE+DVFMDDDE+PKEEYH
Subjt:  LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYH

Query:  EDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDY
        EDVKDKDGGWTEVKDTA EESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLVDY
Subjt:  EDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDY

Query:  KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT
        KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT
Subjt:  KKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFAT

Query:  VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  VEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.0e+0096.07Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDERNGDD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKE KDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK
        KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDR+K+RDRDR    K KDKDRSNEIEREKGR+KHRDQEDKESYRN+DK
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK

Query:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR
        ERGKERILEDDRKTDQ KQK QDKEGIGSKNDEER GW ADEGKDYMLESDGENNRDRDV QGNMVQHLGGEENFDGLKVGSH SSTMLEERIRNMKEDR
Subjt:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR

Query:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
        LKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPENTTNNHDL GVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
Subjt:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE

Query:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR
        LD+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVKHFEDLNVSTKVS 
Subjt:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR

Query:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG
        DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQTSS+RLEDN 
Subjt:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG

Query:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
        D LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
Subjt:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE

Query:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV
        YHEDVKDKDGGWTEVKDTAKEES P+EN+AVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLV
Subjt:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV

Query:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
        DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
Subjt:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF

Query:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.0e+0096.07Show/hide
Query:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
        MD ERSSAPDERNGDD GYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDD  KEKEKE KDSERDRVRSREKRKEDRDEHEKERGRGSKVKD
Subjt:  MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKD

Query:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK
        KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDR+K+RDRDR    K KDKDRSNEIEREKGR+KHRDQEDKESYRN+DK
Subjt:  KDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDK

Query:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR
        ERGKERILEDDRKTDQ KQK QDKEGIGSKNDEER GW ADEGKDYMLESDGENNRDRDV QGNMVQHLGGEENFDGLKVGSH SSTMLEERIRNMKEDR
Subjt:  ERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDR

Query:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
        LKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPENTTNNHDL GVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE
Subjt:  LKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEE

Query:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR
        LD+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGRGGFNNDAEKKLEELRRRLQG SSVKHFEDLNVSTKVS 
Subjt:  LDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSR

Query:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG
        DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRL+AYQSAYAKADEASRSLQLVQTSS+RLEDN 
Subjt:  DYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNG

Query:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
        D LIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE
Subjt:  DTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE

Query:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV
        YHEDVKDKDGGWTEVKDTAKEES P+EN+AVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD+DEPKESKSKDSRLSSLV
Subjt:  YHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLV

Query:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
        DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF
Subjt:  DYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF

Query:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Subjt:  ATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

A0A6J1FR42 SART-1 family protein DOT20.0e+0085.91Show/hide
Query:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH
        MD + SS P  DERNG          DDFG SGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDD  KEKEKE KDSERDRV  RE+RKEDRDEH
Subjt:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH

Query:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRD------REKERDRDRKK--KDKDRSNEIEREKGRD
        +KER R  KVKDKDYDRE+YK+KEYERERDRKDRGKD+ER RERELEKDNVRG DKERGKEKDRD      RE+ERDRDRKK  KDKDRSNE EREKGR+
Subjt:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRD------REKERDRDRKK--KDKDRSNEIEREKGRD

Query:  KHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHA
        K RDQE+KESYRNIDK+RGKE+ L DD+K DQNK+K +DKEG G KN+EERI W A   KDYMLESDGE+NRDR V QGN VQ LGGEEN DGLKVG+ +
Subjt:  KHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHA

Query:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVL
        SS MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA E+ T+N  LAGVKVLHG+DKVL GGAVVL
Subjt:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVL

Query:  TLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS
        TLKDQ+ILADGDVNE++DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELR+RLQGAS
Subjt:  TLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGAS

Query:  SVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASR
        SVKHFEDLN S KVS DYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQA+K EQE+SEAEMR +AYQSAYAKADEASR
Subjt:  SVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASR

Query:  SLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKP
        SLQLVQ SS RL+DN DTLI DDDED YKSLERARKLALKKQ+AASGP A+ALLAT TTS Q TDDQNTKAGELQENKVVFTEMEEFVWGLQLDE++HKP
Subjt:  SLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKP

Query:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDD
        EEEDVFMDDDE PKEEYHED KDKDGGWTEVKDTAKEE TPE+NE +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+D+D
Subjt:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDD

Query:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ SNTGTKK K+
Subjt:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

A0A6J1IPE4 SART-1 family protein DOT20.0e+0085.43Show/hide
Query:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH
        MD + SS P  DERNG          DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+RSKR SDD  KEKEKE KDSERDRV  RE+RKEDRDEH
Subjt:  MDGERSSAP--DERNG----------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEH

Query:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRD
        +KER R  KVKDKDYDRE+YK+KEY+RERDRKDRGKD+ER RERELEKDNVRG DKERGKEKDRDRE+ER+RDR    K+KDKDRSNE EREKGR+K RD
Subjt:  EKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDR----KKKDKDRSNEIEREKGRDKHRD

Query:  QEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTM
        QE+KESYRNIDK+RGKE+ L DD+K DQNK+K +DKEGIG KNDEERI W A        ESDGE+NRDR V QGN VQHLGGE+N DGLKVG+ +SS M
Subjt:  QEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTM

Query:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKD
        LEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA E+ T+N  LAGVKVLHG+DKVL GGAVVLTLKD
Subjt:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKD

Query:  QSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH
        Q+ILADGDVNE++DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELR+RLQGASSVKH
Subjt:  QSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH

Query:  FEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQL
        FEDLN S KVS DYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQA+K EQE+SEAEMR +AYQSAYAKADEASRSLQL
Subjt:  FEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQL

Query:  VQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEED
        VQ SS RL+ N DTLI DDDED YKSLERARKLALKKQ+AASGP A+ALLAT TTS Q TDDQNTKAGELQENKVVFTEMEEFVWGLQLDE++HKPEEED
Subjt:  VQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEED

Query:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKE
        VFMDDDE PKEEYHED KDKDGGWTEVKDTAKEE  PE+NE +APDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+D+DEPKE
Subjt:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKE

Query:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        +KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
        KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+  NTGTKK K+
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 12.3e-3529.08Show/hide
Query:  EERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVS---DDDIAP--ENTTNNHDLAGVKVLHGVDKVLEGG
        E+R+ N K  ++K   E+     +  +W++RSR+L++    EK+ A + +K+ EE D  + GVS   +++     ++  +  DL G+ V H +D   EG 
Subjt:  EERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVS---DDDIAP--ENTTNNHDLAGVKVLHGVDKVLEGG

Query:  AVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKK
         ++LTLKD+ +L      EE DVL NV + ++++ +   +  KKK   Y     DE+  +      + +L +YD+  + +      L+  G  +   E++
Subjt:  AVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKK

Query:  LEELRRRLQ-GASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSRRQAKKEEQEKSEAEM
        LEE+R +L+  A S+       V  +++ +Y T +EM+ FKK +++ K +RKKEK + + A +   +      GD GS  R   RR+  + E+EK     
Subjt:  LEELRRRLQ-GASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSRRQAKKEEQEKSEAEM

Query:  RLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDE---DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQEN
         L +        D    ++ +           G   + ++DE   +  K LE   R R+L   +Q   SG   + ++    +  +  ++        ++ 
Subjt:  RLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDE---DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQEN

Query:  KVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFMD---DDEVPKEEYHEDVKDKDGGWTEVK-DTAKEESTPEENEAVAPDETIHEVPVGKGLSSA
         +VF    EF         +GL     A   EE++  MD   D+E       E   +++ GW+ V  D  K++     +     DE   E  V +GL++A
Subjt:  KVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFMD---DDEVPKEEYHEDVKDKDGGWTEVK-DTAKEESTPEENEAVAPDETIHEVPVGKGLSSA

Query:  LKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD---EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKM
        L L +++G L+ +++   R     KS    V  ++D    + K S+ ++ R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKM
Subjt:  LKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD---EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKM

Query:  KQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        K E+RMK+  EE  LK+M ++DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  KQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 18.6e-2226.88Show/hide
Query:  RERERELEKDNVRGHDKERGKEKDRDREKERDRDRKKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKN
        R RE +  K    G     G+ + R RE+  +R   +    R  E E   G       + + S R + +E+  E           +K  S D   +  + 
Subjt:  RERERELEKDNVRGHDKERGKEKDRDREKERDRDRKKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKN

Query:  DEE---RIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEKK
          +   ++G    E      E+ G           N +     EE  + L           E+R+ N K  ++K   E+     +  +W++RSR+L++  
Subjt:  DEE---RIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEEKK

Query:  LSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAY
          EK+ A + +K+ EE D  + GVS     + +   ++  +  DL G+ V H +D   EG  VVLTLKD+ +L +G+     DVL NV + ++++ D   
Subjt:  LSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAY

Query:  KAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-VSTKVSRDYYTQDEMLK
        +  KKK   Y     DE+  +      + +L +YD+  + +      L+  G  +   E++LEE+R +L+        + LN V  +++ +Y + +EM+ 
Subjt:  KAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-VSTKVSRDYYTQDEMLK

Query:  FKKPRKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLI
        FKK +++ K +RKKEK ++     + +  G      GD GS  R   RR+  + E+E  E E +    Q      D    ++ +          +G   +
Subjt:  FKKPRKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLI

Query:  ADDDE--DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFM
         +D+   +  K LE   R R+L   +Q   SG   + ++    +  +  +++       ++  +VF    EF         +GL     A   EE++  M
Subjt:  ADDDE--DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFM

Query:  D---DDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD--
        D   D+E       E   +++ GW+ V     EE   ++  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  ++D   
Subjt:  D---DDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD--

Query:  -EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLK
         + K S+ ++ R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   K
Subjt:  -EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLK

Query:  TPYLVLSGHVK
        TPY+VLSG  K
Subjt:  TPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT24.6e-22556.26Show/hide
Query:  DSERDRVRSREKR--------KEDRDEHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDR
        +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  +DK++ R+   K++ +DR+R R+ + EK+  RG DKER K+K RDR KE+D+
Subjt:  DSERDRVRSREKR--------KEDRDEHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDR

Query:  DRKK-KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQG
        ++++ + KDR NE + EK +DK R +          KER  ++  EDD +T +  ++ +  +  G                                   
Subjt:  DRKK-KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQG

Query:  NMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHD
               G +N D    G  AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D        +   
Subjt:  NMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHD

Query:  LAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRG
        L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN E+D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A  DEG+ LD +G
Subjt:  LAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRG

Query:  GFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKS
         F  +AEKKLEELR+R+QG  +   FEDLN S KVS DY++Q+EMLKFKKP+KKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+E+ 
Subjt:  GFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKS

Query:  EAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLAL-KKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENK
        E E R +AYQ A AKADEASR L+  Q    + +++   ++ADD ED YKSLE+AR+LAL KK++A SGP A+A L  A++++Q TDD  T   E QEN 
Subjt:  EAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLAL-KKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENK

Query:  VVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT---AKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTL
        VVFTEM +FVWGLQ + D  KPE EDVFM++D  PK    E  ++   G TEV DT   A E+S+  + + + PDE IHEV VGKGLS ALKLLKDRGTL
Subjt:  VVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT---AKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTL

Query:  KESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN
        KE +EWGGRNMDK+KSKLVGIVDDD  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN
Subjt:  KESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN

Query:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGT
        +DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E  N+ T
Subjt:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 15.0e-2227.22Show/hide
Query:  RERERELEKDNVRGHDKERGKEKDRDREKERDRDRKKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDD--RKTDQNKQKSQDKEGIGS
        R RE +  K    G     G+ + R RE+  +R   +    R  E E   G       + + S R + +E+       DD       +K  S D   +  
Subjt:  RERERELEKDNVRGHDKERGKEKDRDREKERDRDRKKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDD--RKTDQNKQKSQDKEGIGS

Query:  KNDEE---RIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEE
        +   +   ++G    E      E+ G           N +     EE  + L           E+R+ N K  ++K   E+     +  +W++RSR+L++
Subjt:  KNDEE---RIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKLEE

Query:  KKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDM
            EK+ A + +K+ EE D  + GVS     + +   ++  +  DL G+ V H +D   EG  VVLTLKD+ +L DG+     DVL NV + ++++ D 
Subjt:  KKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDM

Query:  AYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNVSTKVSRDYYTQDEM
          +  KKK   Y     DE+  +      + +L +YD+  + +      L+  G  +   E++LEE+R +L+  A S+      +V  +++ +Y + +EM
Subjt:  AYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPADAD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNVSTKVSRDYYTQDEM

Query:  LKFKKPRKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLI
        + FKK +++ K +RKKEK + + A +   +      GD GS  R   RR+  + E+E  E E +    Q      D    ++ +           G    
Subjt:  LKFKKPRKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSRRQAKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLI

Query:  ADDDE---DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVF
         ++DE   +  K LE   R R+L   +Q   SG   + ++    +  +  +++       ++  +VF    EF         +GL     A   EE++  
Subjt:  ADDDE---DFYKSLE---RARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVF

Query:  MD---DDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD-
        MD   D+E       E   +++ GW+ V     EE   ++  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  ++D  
Subjt:  MD---DDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDDD-

Query:  --EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQL
          + K S+ ++ R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   
Subjt:  --EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQL

Query:  KTPYLVLSGHVK
        KTPY+VLSG  K
Subjt:  KTPYLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT3G14700.1 SART-1 family6.3e-2041.51Show/hide
Query:  EENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR
        + +  V  D  + E  VG GLS AL  L+++GT KE            + K+VG+      K++  +D R     D  K+I I+R +++GRIMT KE++R
Subjt:  EENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR

Query:  QLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
         L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  KTPY+VL
Subjt:  QLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family3.3e-22656.26Show/hide
Query:  DSERDRVRSREKR--------KEDRDEHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDR
        +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  +DK++ R+   K++ +DR+R R+ + EK+  RG DKER K+K RDR KE+D+
Subjt:  DSERDRVRSREKR--------KEDRDEHEKERGRGSKVKDKDYDREIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDR

Query:  DRKK-KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQG
        ++++ + KDR NE + EK +DK R +          KER  ++  EDD +T +  ++ +  +  G                                   
Subjt:  DRKK-KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDKEGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQG

Query:  NMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHD
               G +N D    G  AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D        +   
Subjt:  NMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPENTTNNHD

Query:  LAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRG
        L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN E+D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A  DEG+ LD +G
Subjt:  LAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRG

Query:  GFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKS
         F  +AEKKLEELR+R+QG  +   FEDLN S KVS DY++Q+EMLKFKKP+KKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+E+ 
Subjt:  GFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKS

Query:  EAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLAL-KKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENK
        E E R +AYQ A AKADEASR L+  Q    + +++   ++ADD ED YKSLE+AR+LAL KK++A SGP A+A L  A++++Q TDD  T   E QEN 
Subjt:  EAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLAL-KKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENK

Query:  VVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT---AKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTL
        VVFTEM +FVWGLQ + D  KPE EDVFM++D  PK    E  ++   G TEV DT   A E+S+  + + + PDE IHEV VGKGLS ALKLLKDRGTL
Subjt:  VVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT---AKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTL

Query:  KESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN
        KE +EWGGRNMDK+KSKLVGIVDDD  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN
Subjt:  KESIEWGGRNMDKRKSKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN

Query:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGT
        +DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E  N+ T
Subjt:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGGGAACGGTCATCCGCACCTGATGAGAGAAATGGTGATGACTTTGGTTATAGTGGAGCAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAG
TCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGTATTGAAAGAAAAGGAGAAAGAGGCAAAAGATTCAGAAAGGGATC
GGGTTCGTAGTCGGGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGGGCAGGGGTAGCAAAGTTAAAGATAAAGATTATGACAGAGAGATTTATAAGGAT
AAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCTGGAGAAGGACAATGTTCGAGGACATGACAAAGAGAGAGG
AAAGGAGAAAGACAGAGACAGGGAGAAAGAAAGGGATAGGGATAGGAAGAAGAAGGACAAGGACCGATCAAATGAAATTGAAAGGGAGAAGGGAAGAGACAAACACAGAG
ATCAAGAGGACAAGGAAAGCTATAGGAACATTGACAAGGAGAGGGGAAAAGAGAGAATTTTGGAAGATGATAGGAAAACAGATCAAAACAAGCAGAAATCACAAGATAAA
GAAGGAATTGGCAGCAAAAATGATGAGGAACGAATTGGTTGGACTGCAGATGAGGGTAAGGATTATATGCTAGAAAGTGATGGCGAGAATAACAGGGACAGAGATGTTAA
GCAAGGGAACATGGTCCAGCATTTGGGAGGTGAAGAAAATTTTGATGGGTTGAAAGTTGGAAGTCATGCTTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAG
AAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAACTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTT
TCAAAGATTTTTGAGGAACAAGACAATATAGATCAAGGTGTAAGTGATGATGATATTGCACCAGAAAATACAACTAACAATCATGATCTAGCTGGAGTTAAAGTACTCCA
TGGCGTAGACAAAGTACTAGAAGGTGGTGCGGTTGTCTTAACCCTCAAGGATCAGAGCATCTTAGCTGATGGCGATGTTAATGAAGAGTTAGATGTGCTTGAGAATGTGG
AAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCTGCAAAAAAGAAAACAGGGATTTATGATGACAAGTTTAATGATGAAAATTATGGCGAGAAGAAGATGCTT
CCACAGTATGATGATCCAGCAGATGCAGACGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTTAATAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACA
GGGAGCTTCGTCAGTTAAGCACTTTGAAGATCTTAATGTATCAACGAAAGTCTCACGTGATTATTACACTCAAGATGAGATGCTTAAATTTAAGAAGCCCAGGAAAAAGA
AATCCCTTCGAAAGAAAGAAAAGCTAGATATTGATGCCCTTGAAGCGGAAGCGATTTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCTCGAAATGATTCTAGAAGGCAA
GCGAAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACTTAGTGCATACCAGTCAGCCTATGCTAAAGCAGATGAAGCATCAAGATCTCTACAATTAGTTCAAAC
TAGCTCCTCCAGATTAGAGGACAATGGTGATACTCTCATTGCAGATGATGATGAAGATTTCTATAAGTCATTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGATG
CAGCATCCGGACCAGGAGCAATTGCTCTTCTTGCTACGGCAACAACTAGCAGTCAGGCAACTGATGATCAAAACACAAAAGCTGGAGAGTTGCAGGAAAATAAGGTTGTA
TTTACAGAAATGGAAGAATTTGTCTGGGGTCTTCAGCTTGATGAAGATGCTCATAAACCAGAGGAAGAAGATGTCTTTATGGATGATGATGAAGTACCAAAAGAAGAATA
TCATGAAGATGTTAAGGATAAAGATGGTGGATGGACGGAAGTCAAAGATACTGCCAAGGAAGAATCCACTCCTGAGGAAAACGAGGCAGTAGCTCCAGATGAAACAATCC
ACGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGACCGAGGAACTCTGAAGGAAAGCATTGAATGGGGCGGAAGAAACATGGACAAGAGAAAG
AGCAAACTTGTTGGTATAGTTGATGATGATGAACCAAAGGAATCTAAGTCGAAGGATTCTCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGAC
TGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGTCAACTCTCTCACAAGTTCCATGGGAAGGGACCTGGAAAAATGAAGCAAGAAAAGCGCATGAAACAAT
ACCAAGAAGAGTTGAAGTTGAAGCAGATGAAAAATGCTGATACTCCTTCCTTATCAGTGGAGAGAATGAGGGAAGCTCAAGCACAACTAAAGACCCCTTATCTTGTTCTC
AGCGGCCACGTTAAACCTGGCCAAACAAGCGATCCAAGAAGTGGTTTTGCTACAGTTGAAAAGGATCTCCCAGGTGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGA
GCATTTCTTGGGGATAAAGCGTAAAGGTGAAGCTTCAAATACAGGCACAAAGAAAGCAAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTTGGTTAACCCTTCGCGCCCTCGTCTTCTCCCTCACTCTCAGGTTTTCTTTTCCCATTCACGCGCCCGGGTTTCTATCTTTTTACTCACGCCCCATCTCCGATCGC
CGCCCTCCTCTTCTTTGTTTTAATTCGCTCCGGTTCTGCCCATCTCCGCCTGCTACCGCAAGTTACGCCGGTCTCCTTCGTTGCCCGAGCGCCATTTCGCCGCCGCTGCC
GACGGTGATTAGATAATAGTGGGTTTTTTCGGGCTTGGTTTTCCATTGATAGTGCTGTGTGGAAATCGGCTGTTTATCCTTGGTTGGCTCCATTTCGACTTCATGTTATT
TATATTTTAGATTCAGGTTTTGAAGGATTTTAGCGTGCTGTCCAGCAAAAGATAAGTACGTTTTGGGGAATTTCCAGTAAGTATCAAAGTTTTGTTTGTCCTTTCGAAAT
TCACTGCAAATGCAGAAAATTGGGCTTTAATATCTGAAATTTCATTATGCCTAGGTTTGGATCGTGTTAAAGGTTTTAGAGAAGTTTTGGATCAAAGACAGCTTGGGCAA
AATGTTTAAGGTTTTTTAGAGTGAAATTACAGTCTATATTTAGGTCCTAAATCTTGAAACTAATTTAAGTTTATGCTTTAGGATGAGATTGACGGTTCGGTAAGCCAAAG
GAGAATTGTTCTGTTCATAGTCGGGATTTAATTGTGAGTTTTTGCAGATGGATGGGGAACGGTCATCCGCACCTGATGAGAGAAATGGTGATGACTTTGGTTATAGTGGA
GCAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGA
TGTATTGAAAGAAAAGGAGAAAGAGGCAAAAGATTCAGAAAGGGATCGGGTTCGTAGTCGGGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGGGCAGGG
GTAGCAAAGTTAAAGATAAAGATTATGACAGAGAGATTTATAAGGATAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAA
AGAGAGCTGGAGAAGGACAATGTTCGAGGACATGACAAAGAGAGAGGAAAGGAGAAAGACAGAGACAGGGAGAAAGAAAGGGATAGGGATAGGAAGAAGAAGGACAAGGA
CCGATCAAATGAAATTGAAAGGGAGAAGGGAAGAGACAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGACAAGGAGAGGGGAAAAGAGAGAATTTTGG
AAGATGATAGGAAAACAGATCAAAACAAGCAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAAATGATGAGGAACGAATTGGTTGGACTGCAGATGAGGGTAAGGAT
TATATGCTAGAAAGTGATGGCGAGAATAACAGGGACAGAGATGTTAAGCAAGGGAACATGGTCCAGCATTTGGGAGGTGAAGAAAATTTTGATGGGTTGAAAGTTGGAAG
TCATGCTTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTA
AACTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTTTCAAAGATTTTTGAGGAACAAGACAATATAGATCAAGGTGTAAGTGATGATGATATTGCACCA
GAAAATACAACTAACAATCATGATCTAGCTGGAGTTAAAGTACTCCATGGCGTAGACAAAGTACTAGAAGGTGGTGCGGTTGTCTTAACCCTCAAGGATCAGAGCATCTT
AGCTGATGGCGATGTTAATGAAGAGTTAGATGTGCTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCTGCAAAAAAGAAAACAGGGATTT
ATGATGACAAGTTTAATGATGAAAATTATGGCGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCAGATGCAGACGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTT
AATAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCGTCAGTTAAGCACTTTGAAGATCTTAATGTATCAACGAAAGTCTCACGTGATTA
TTACACTCAAGATGAGATGCTTAAATTTAAGAAGCCCAGGAAAAAGAAATCCCTTCGAAAGAAAGAAAAGCTAGATATTGATGCCCTTGAAGCGGAAGCGATTTCTGCTG
GATTGGGTGTTGGAGACCTTGGTTCTCGAAATGATTCTAGAAGGCAAGCGAAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACTTAGTGCATACCAGTCAGCC
TATGCTAAAGCAGATGAAGCATCAAGATCTCTACAATTAGTTCAAACTAGCTCCTCCAGATTAGAGGACAATGGTGATACTCTCATTGCAGATGATGATGAAGATTTCTA
TAAGTCATTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGATGCAGCATCCGGACCAGGAGCAATTGCTCTTCTTGCTACGGCAACAACTAGCAGTCAGGCAACTG
ATGATCAAAACACAAAAGCTGGAGAGTTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAAGAATTTGTCTGGGGTCTTCAGCTTGATGAAGATGCTCATAAACCAGAG
GAAGAAGATGTCTTTATGGATGATGATGAAGTACCAAAAGAAGAATATCATGAAGATGTTAAGGATAAAGATGGTGGATGGACGGAAGTCAAAGATACTGCCAAGGAAGA
ATCCACTCCTGAGGAAAACGAGGCAGTAGCTCCAGATGAAACAATCCACGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGACCGAGGAACTC
TGAAGGAAAGCATTGAATGGGGCGGAAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTTGATGATGATGAACCAAAGGAATCTAAGTCGAAGGATTCTCGT
TTATCTTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGTCAACTCTCTCACAAGTTCCA
TGGGAAGGGACCTGGAAAAATGAAGCAAGAAAAGCGCATGAAACAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAAAATGCTGATACTCCTTCCTTATCAGTGGAGA
GAATGAGGGAAGCTCAAGCACAACTAAAGACCCCTTATCTTGTTCTCAGCGGCCACGTTAAACCTGGCCAAACAAGCGATCCAAGAAGTGGTTTTGCTACAGTTGAAAAG
GATCTCCCAGGTGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGTGAAGCTTCAAATACAGGCACAAAGAAAGCAAAAGT
TTGAAGTCTTAGCTTTAGACCAGAATTTGTAGCAAGAAACCACGTTTAGTTTATTTCAGTCTTCCAACTAGATGCAGCATCGTATATTCAATTTCATATCCTCATCAATT
TTTTTTAGATATTTTGAAGTGTTTGTATGATAATCAATAATATTCTTTCTTGTCATAAGTTTAGAATGTGTACATTTTTTTTCAGGAATTTTTTTGTACAAGTTACCAAA
TTACACGACTAGAAGAATAGCTGAAAAAACTTTATGCAAGCTGGATACATGTATCATCTAGTGAAAGTATCAAAAACTTTAAACTAAATTAAAAATGTAGTTAAGTGAAT
CA
Protein sequenceShow/hide protein sequence
MDGERSSAPDERNGDDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDVLKEKEKEAKDSERDRVRSREKRKEDRDEHEKERGRGSKVKDKDYDREIYKD
KEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDREKERDRDRKKKDKDRSNEIEREKGRDKHRDQEDKESYRNIDKERGKERILEDDRKTDQNKQKSQDK
EGIGSKNDEERIGWTADEGKDYMLESDGENNRDRDVKQGNMVQHLGGEENFDGLKVGSHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQL
SKIFEEQDNIDQGVSDDDIAPENTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKML
PQYDDPADADEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNVSTKVSRDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQ
AKKEEQEKSEAEMRLSAYQSAYAKADEASRSLQLVQTSSSRLEDNGDTLIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQATDDQNTKAGELQENKVV
FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAKEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRK
SKLVGIVDDDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV