; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003418 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003418
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSART-1 family protein DOT2
Genome locationChr08:1025254..1033059
RNA-Seq ExpressionHG10003418
SyntenyHG10003418
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.43Show/hide
Query:  DGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRDE
        DG  +   DERNG +   ARDRG+ G DDFGYSG EKSSKHRSEDHRKSSRGEEK+HRSKDR+RSKR SDDASKEKEKEVKDSERDRV  RE+RKEDRDE
Subjt:  DGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRDE

Query:  HEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ
        H+KER R  KVKDKDYDRE+YKEKEYERERDRKDRGKD+ER REREL+KDNVRGQDKERGKEKD+DR+R+RDRDRKKK+KDKDRSNENEREKGREK RDQ
Subjt:  HEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQ

Query:  EDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTML
        E+KESYRNI+K+RGKE+ L DD+K DQNKEK +DKEGIG KNDEERIDWIA  AKDYMLESDGE+NRDR VDQGN  QHLGGEEN DGLKV A +SS ML
Subjt:  EDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTML

Query:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQNI
        EERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHG+DKVL GGAVVLTLKDQNI
Subjt:  EERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQNI

Query:  LADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFED
        LADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAA DEGLTLDG G F+NDAEKKLEELR+RLQGASSVKHFED
Subjt:  LADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFED

Query:  LNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQT
        LNAS KVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR  AYQSAYAKADEASRSLQLVQ 
Subjt:  LNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQT

Query:  SSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEEDVFM
        SS RL+DN+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TT+ QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE HKPEEEDVFM
Subjt:  SSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEEDVFM

Query:  DDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
        DDDE PKEEYHED KDKDGGWTEVKDTA+EEP PEDNE IAPDETIHEVPVGKGLSS L+LLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+KS
Subjt:  DDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS

Query:  KDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
        KDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
Subjt:  KDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG

Query:  QTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        QTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ S+TGTKK K+
Subjt:  QTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

XP_008459644.1 PREDICTED: SART-1 family protein DOT2 isoform X2 [Cucumis melo]0.0e+0091.32Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKEVKDSERDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH
        EHEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDREREREREL+KDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKH
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH

Query:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS
        RDQEDKESYRN++K+RGKERILEDDRKTDQ K+K QDKEGIGSKNDEER  WIADE KDYMLESDGENNRDRDV+QGNM QHLGGEENFDGLKV +H SS
Subjt:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS

Query:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL
        TMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPE+TT+N  L GVKVLHGVDKVLEGGAVVLTL
Subjt:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL

Query:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV
        KDQ+ILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQG SSV
Subjt:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV

Query:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL
        KHFEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSL
Subjt:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL

Query:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE
        QLVQTSS RLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG IALLATATT++Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKPEE
Subjt:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE

Query:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTA+EE IP++N+A+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
Subjt:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

XP_011656109.1 SART-1 family protein DOT2 [Cucumis sativus]0.0e+0091.3Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDER+G              DDFGYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKE KDSERDR+RSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRD
        EHEKER+RG KVKDKDYDR+IYK+KEYERERDRKDRGKDREREREREL+KD VRG DKERGKEKD+DRD+DRDRDRKKKDKDKDRSNE EREKGR+KHRD
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRD

Query:  QEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTM
        QEDKESYRNI+KDRGKERILEDDRKTDQNK+K QDKEGIGSKNDEERI  I DE KDYMLESDGENNRDRDV+QGNM QHLG EENFDGLKV +HASSTM
Subjt:  QEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTM

Query:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKD
        LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTT+N  LAGVKVLHGVDKVLEGGAVVLTLKD
Subjt:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKD

Query:  QNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH
        Q+ILADG++NE++DVLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH
Subjt:  QNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH

Query:  FEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQL
        FEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSLQL
Subjt:  FEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQL

Query:  VQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEED
        VQ SSARLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG +ALLATATT++Q TDDQ+TKAGELQENKVVFTEMEEFVWGLQLDE+ HKPEE+D
Subjt:  VQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEED

Query:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDE+PKEEYHEDVKDKDGGWTEVKDTA EE  PE+NEA+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
Subjt:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

XP_016902433.1 PREDICTED: SART-1 family protein DOT2 isoform X1 [Cucumis melo]0.0e+0091.32Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKEVKDSERDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH
        EHEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDREREREREL+KDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKH
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH

Query:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS
        RDQEDKESYRN++K+RGKERILEDDRKTDQ K+K QDKEGIGSKNDEER  WIADE KDYMLESDGENNRDRDV+QGNM QHLGGEENFDGLKV +H SS
Subjt:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS

Query:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL
        TMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPE+TT+N  L GVKVLHGVDKVLEGGAVVLTL
Subjt:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL

Query:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV
        KDQ+ILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQG SSV
Subjt:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV

Query:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL
        KHFEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSL
Subjt:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL

Query:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE
        QLVQTSS RLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG IALLATATT++Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKPEE
Subjt:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE

Query:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTA+EE IP++N+A+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
Subjt:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

XP_038890025.1 SART-1 family protein DOT2 [Benincasa hispida]0.0e+0093.84Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MDGER SAPDERNGPDIAWAR+RG+GGHDDFGYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKE KDSERDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKE----KDKDRDRDRDRDRKKKDKDKDRSNENEREKGRE
        EHEKER+RGSKVKDKDYDREIYKEKEYERERDRKDRGKDREREREREL+KDNVRGQDKE+GKE    +D+DRDRDRDRDRKKKDKDKDRSNENEREKGRE
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKE----KDKDRDRDRDRDRKKKDKDKDRSNENEREKGRE

Query:  KHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHA
        KHRDQEDKESYRNI+KDRGKERILEDDRKTDQ+KEKSQDKEGIGSK DEERI WIADE KDYM+ESDG+NNR+RDVDQGNM Q LGGEENFDGLKV AHA
Subjt:  KHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHA

Query:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVL
        SSTMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTT+  NLAGVKVLHG+DKVLEGGAVVL
Subjt:  SSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVL

Query:  TLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGAS
        TLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAA DEGLTLDGRGGFNNDAEKKLEELRRRLQG++
Subjt:  TLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGAS

Query:  SVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASR
        SV HFEDLNASTKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLD+DALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRH AYQSAYAKADEASR
Subjt:  SVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASR

Query:  SLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP
        SL+LVQTSS RLEDNDDTLIADDDEDFYKSL+RARKLALK+QEAASGPG IALLATATT++QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDE+ HKP
Subjt:  SLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKP

Query:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED
        EEEDVFMDDDEVPKEEYHE++KDKDGGWTEVKDTA+EE  PE+NEAIAPDETIHEVPVGKGLSSAL+LLK+RGTLKESIEWGGRNMDKRKSKLVGIVDED
Subjt:  EEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED

Query:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

TrEMBL top hitse value%identityAlignment
A0A0A0KXY6 Uncharacterized protein0.0e+0091.3Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDER+G              DDFGYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKE KDSERDR+RSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRD
        EHEKER+RG KVKDKDYDR+IYK+KEYERERDRKDRGKDREREREREL+KD VRG DKERGKEKD+DRD+DRDRDRKKKDKDKDRSNE EREKGR+KHRD
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRD

Query:  QEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTM
        QEDKESYRNI+KDRGKERILEDDRKTDQNK+K QDKEGIGSKNDEERI  I DE KDYMLESDGENNRDRDV+QGNM QHLG EENFDGLKV +HASSTM
Subjt:  QEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTM

Query:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKD
        LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTT+N  LAGVKVLHGVDKVLEGGAVVLTLKD
Subjt:  LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKD

Query:  QNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH
        Q+ILADG++NE++DVLENVEIGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH
Subjt:  QNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKH

Query:  FEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQL
        FEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSLQL
Subjt:  FEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQL

Query:  VQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEED
        VQ SSARLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG +ALLATATT++Q TDDQ+TKAGELQENKVVFTEMEEFVWGLQLDE+ HKPEE+D
Subjt:  VQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEED

Query:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDE+PKEEYHEDVKDKDGGWTEVKDTA EE  PE+NEA+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
Subjt:  VFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  SKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.0e+0091.32Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKEVKDSERDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH
        EHEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDREREREREL+KDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKH
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH

Query:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS
        RDQEDKESYRN++K+RGKERILEDDRKTDQ K+K QDKEGIGSKNDEER  WIADE KDYMLESDGENNRDRDV+QGNM QHLGGEENFDGLKV +H SS
Subjt:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS

Query:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL
        TMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPE+TT+N  L GVKVLHGVDKVLEGGAVVLTL
Subjt:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL

Query:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV
        KDQ+ILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQG SSV
Subjt:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV

Query:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL
        KHFEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSL
Subjt:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL

Query:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE
        QLVQTSS RLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG IALLATATT++Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKPEE
Subjt:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE

Query:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTA+EE IP++N+A+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
Subjt:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.0e+0091.32Show/hide
Query:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD
        MD ER SAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEK+HRSKDRERSKR SDDASKEKEKEVKDSERDRVRSREKRKEDRD
Subjt:  MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRD

Query:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH
        EHEKER RGSKVKDKDYDREIYK+KEYERERDRKDRGKDREREREREL+KDNVRG DKERGKEKD+DR  DRDRDRDRKKKDKDKDRSNE EREKGREKH
Subjt:  EHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDR--DRDRDRDRKKKDKDKDRSNENEREKGREKH

Query:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS
        RDQEDKESYRN++K+RGKERILEDDRKTDQ K+K QDKEGIGSKNDEER  WIADE KDYMLESDGENNRDRDV+QGNM QHLGGEENFDGLKV +H SS
Subjt:  RDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASS

Query:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL
        TMLEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ VSDDDIAPE+TT+N  L GVKVLHGVDKVLEGGAVVLTL
Subjt:  TMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSN--LAGVKVLHGVDKVLEGGAVVLTL

Query:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV
        KDQ+ILADGD+NE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA  DEGLTLDGRGGFNNDAEKKLEELRRRLQG SSV
Subjt:  KDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSV

Query:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL
        KHFEDLN STKVSHDYYTQDEMLKFKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR  AYQSAYAKADEASRSL
Subjt:  KHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSL

Query:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE
        QLVQTSS RLEDNDD LIADDDEDFYKSLERARKLALKKQ+AASGPG IALLATATT++Q TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+ HKPEE
Subjt:  QLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEE

Query:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTA+EE IP++N+A+APDETIHEVPVGKGLSSAL+LLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
Subjt:  EDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEAS+TGTKKAKV
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

A0A6J1FR42 SART-1 family protein DOT20.0e+0087.5Show/hide
Query:  MDGERLSAP--DERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKED
        MD +  S P  DERNG +   ARDRG+ G DDFG SG EKSSKHRSEDHRKSSRGEEK+HRSKDR+RSKR SDDASKEKEKEVKDSERDRV  RE+RKED
Subjt:  MDGERLSAP--DERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKED

Query:  RDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKD------RDRDRDRDRKKKDKDKDRSNENERE
        RDEH+KER R  KVKDKDYDRE+YKEKEYERERDRKDRGKD+ER REREL+KDNVRGQDKERGKEKD+D      R+R+RDRDRKKK+KDKDRSNENERE
Subjt:  RDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKD------RDRDRDRDRKKKDKDKDRSNENERE

Query:  KGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKV
        KGREK RDQE+KESYRNI+KDRGKE+ L DD+K DQNKEK +DKEG G KN+EERIDWIA  AKDYMLESDGE+NRDR VDQGN  Q LGGEEN DGLKV
Subjt:  KGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKV

Query:  RAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAV
         A +SS MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHG+DKVL GGAV
Subjt:  RAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAV

Query:  VLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQG
        VLTLKDQNILADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAA DEGLTLDG G F+NDAEKKLEELR+RLQG
Subjt:  VLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQG

Query:  ASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEA
        ASSVKHFEDLNAS KVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR  AYQSAYAKADEA
Subjt:  ASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEA

Query:  SRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDH
        SRSLQLVQ SS RL+DN+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TT+ QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE H
Subjt:  SRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDH

Query:  KPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD
        KPEEEDVFMDDDE PKEEYHED KDKDGGWTEVKDTA+EEP PEDNE IAPDETIHEVPVGKGLSS L+LLKDRGTLKESIEWGGRNMDKRKSKLVGI+D
Subjt:  KPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD

Query:  EDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
        EDEPKE+KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
Subjt:  EDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL

Query:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ S+TGTKK K+
Subjt:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

A0A6J1IPE4 SART-1 family protein DOT20.0e+0087.2Show/hide
Query:  DGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRDE
        DG  +   DERNG +   ARDRG+ G DDFGYSG EKSSKHRSEDHRKSSRGEEK+HRSKDR+RSKR SDDASKEKEKEVKDSERDRV  RE+RKEDRDE
Subjt:  DGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRDE

Query:  HEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKD--RDRDRDRDRKKKDKDKDRSNENEREKGREKHR
        H+KER R  KVKDKDYDRE+YKEKEY+RERDRKDRGKD+ER REREL+KDNVRGQDKERGKEKD+D  R+R+RDRDRKKK+KDKDRSNENEREKGREK R
Subjt:  HEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKD--RDRDRDRDRKKKDKDKDRSNENEREKGREKHR

Query:  DQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASST
        DQE+KESYRNI+KDRGKE+ L DD+K DQNKEK +DKEGIG KNDEERIDW+A        ESDGE+NRDR VDQGN  QHLGGE+N DGLKV A +SS 
Subjt:  DQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASST

Query:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ
        MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIA ED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ
Subjt:  MLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ

Query:  NILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF
        NILADGD+NED+DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAA DEGLTLDG G F+NDAEKKLEELR+RLQGASSVKHF
Subjt:  NILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHF

Query:  EDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLV
        EDLNAS KVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR  AYQSAYAKADEASRSLQLV
Subjt:  EDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLV

Query:  QTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEEDV
        Q SS RL+ N+DTLI DDDED YKSLERARKLALKKQEAASGP  +ALLAT TT+ QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE HKPEEEDV
Subjt:  QTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEEDV

Query:  FMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
        FMDDDE PKEEYHED KDKDGGWTEVKDTA+EEP PEDNE IAPDETIHEVPVGKGLSS L+LLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+
Subjt:  FMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES

Query:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR+LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
Subjt:  KSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Query:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV
        PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+  +TGTKK K+
Subjt:  PGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 13.4e-1826.7Show/hide
Query:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQDK
        R RE +  K    G     G+ + + R+R  +R   ++  + +  +      GRE+ + +  +   +  ++D G E     +    D  +   +E ++ +
Subjt:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQDK

Query:  EGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKL
          +G K  E  ++ I  EA        G        D  N       EE    L+ +  A+    E+R+ N K  ++K   E+     +  +W++RSR+L
Subjt:  EGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRKL

Query:  EEKKLSEKEKALQLSKIFEEQDNIDQGVS---DDDIAPED----TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQR
        ++    EK+ A + +K+ EE D  + GVS   +++         +  +L G+ V H +D   EG  ++LTLKD+ +L      E+ DVL NV + ++++ 
Subjt:  EEKKLSEKEKALQLSKIFEEQDNIDQGVS---DDDIAPED----TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQR

Query:  DMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNASTKVSHDYYTQD
        +   +  KKK   Y     DE+  +      + +L +YD+   G+      L+  G  +   E++LEE+R +L+  A S+          +++ +Y T +
Subjt:  DMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRLQ-GASSVKHFEDLNASTKVSHDYYTQD

Query:  EMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDT
        EM+ FKK K++ K +RKKEK + + A +   +      GD GS  R   RR+  + E+EK        +  +     D +       +   A    +   
Subjt:  EMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDT

Query:  LIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEEDVF
        L  D+ E +  K LE   R R+L   +Q   SG  V+ ++    +  +  ++        ++  +VF    EF         +GL  + E+   EE   F
Subjt:  LIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEEDVF

Query:  MDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---E
          D+E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++    +
Subjt:  MDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED---E

Query:  PKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP
         K S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   KTP
Subjt:  PKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP

Query:  YLVLSGHVK
        Y+VLSG  K
Subjt:  YLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 13.1e-1927.34Show/hide
Query:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQD
        R RE +  K    G     G+ + + R+R  +R   ++  + + RS  +    GRE+ + +  +   +  ++D G E     +    D  +   +E ++ 
Subjt:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQD

Query:  KEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRK
        +  +G K  E  ++ +  EA        G        D  N       EE    L+ +  A+    E+R+ N K  ++K   E+     +  +W++RSR+
Subjt:  KEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRK

Query:  LEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQ
        L++    EK+ A + +K+ EE D  + GVS     + +   +D  S  +L G+ V H +D   EG  VVLTLKD+ +L +G+     DVL NV + ++++
Subjt:  LEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQ

Query:  RDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-ASTKVSHDYYTQ
         D   +  KKK   Y     DE+  +      + +L +YD+   G+      L+  G  +   E++LEE+R +L+        + LN    +++ +Y + 
Subjt:  RDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLN-ASTKVSHDYYTQ

Query:  DEMLKFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDN
        +EM+ FKK K++ K +RKKEK ++     + +  G      GD GS  R   RR+  + E+E  E E +    Q   +            +     L   
Subjt:  DEMLKFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDN

Query:  DDTLIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEE
           L  D+ E +  K LE   R R+L   +Q   SG  V+ ++    +  +  +++       ++  +VF    EF         +GL  + E+   EE 
Subjt:  DDTLIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPEEE

Query:  DVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED-
          F  D+E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++  
Subjt:  DVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDED-

Query:  --EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQL
          + K S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q   
Subjt:  --EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQL

Query:  KTPYLVLSGHVK
        KTPY+VLSG  K
Subjt:  KTPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT22.9e-21955.31Show/hide
Query:  DSERDRVRSREKR--------KEDRDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDR
        +  + R   RE+R        +E RD   KE++  SK K+KDYDRE  ++K++ R+   K++ +DR+R R+ + +K+  RG+DKER K+K +DR +++D+
Subjt:  DSERDRVRSREKR--------KEDRDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDR

Query:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQ
        + K++++ KDR NE + EK ++K R +          K+R  ++  EDD +T +  E+                                E++ +R +++
Subjt:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQ

Query:  GNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNL
        G         +N D       AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D     +   +L
Subjt:  GNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNL

Query:  AGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGG
        +GVKVLHG++KV+EGGAV+LTLKDQ++L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ AA DEG+ LD +G 
Subjt:  AGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGG

Query:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE
        F  +AEKKLEELR+R+QG  +   FEDLN+S KVS DY++Q+EMLKFKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E
Subjt:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE

Query:  AEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKV
         E R  AYQ A AKADEASR L+  Q    + ++++  ++ADD ED YKSLE+AR+LAL KK+EA SGP  +A L  ++T NQTTDD  T   E QEN V
Subjt:  AEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKV

Query:  VFTEMEEFVWGLQLDEEDHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT-AEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKES
        VFTEM +FVWGLQ + +  KPE EDVFM++D  PK    E  ++   G TEV DT  +      D + I PDE IHEV VGKGLS AL+LLKDRGTLKE 
Subjt:  VFTEMEEFVWGLQLDEEDHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT-AEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKES

Query:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT
        +EWGGRNMDK+KSKLVGIVD+D  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DT
Subjt:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT

Query:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGT
        PS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E  ++ T
Subjt:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 16.2e-2027.64Show/hide
Query:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQD
        R RE +  K    G     G+ + + R+R  +R   ++  + + RS  +    GRE+ + +  +   +  ++D G E     +    D  +   +E ++ 
Subjt:  RERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKD-RSNENEREKGREKHRDQEDKESYRNIEKDRGKE-----RILEDDRKTDQNKEKSQD

Query:  KEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRK
        +  +G K  E  ++ +  EA        G        D  N       EE    L+ +  A+    E+R+ N K  ++K   E+     +  +W++RSR+
Subjt:  KEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEE----SEVLSWVKRSRK

Query:  LEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQ
        L++    EK+ A + +K+ EE D  + GVS     + +   +D  S  +L G+ V H +D   EG  VVLTLKD+ +L DG+     DVL NV + ++++
Subjt:  LEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTS--NLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQ

Query:  RDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRL----QGASSVKHFEDLNASTKVSHDY
         D   +  KKK   Y     DE+  +      + +L +YD+   G+      L+  G  +   E++LEE+R +L    Q  SSV          +++ +Y
Subjt:  RDMAYKAAKKKTGIYDDKFNDENYGE------KKMLPQYDDPAAGD--EGLTLDGRGGFNNDAEKKLEELRRRL----QGASSVKHFEDLNASTKVSHDY

Query:  YTQDEMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSR---NGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLE
         + +EM+ FKK K++ K +RKKEK + + A +   +      GD GSR    G RR  + EE+   + E    A           +  +   +   A   
Subjt:  YTQDEMLKFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSR---NGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLE

Query:  DNDDTLIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPE
         + + L  D+ E +  K LE   R R+L   +Q   SG  V+ ++    +  +  +++       ++  +VF    EF         +GL  + E+   E
Subjt:  DNDDTLIADDDE-DFYKSLE---RARKLALKKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEDHKPE

Query:  EEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDE
        E   F  D+E       E   +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  +++
Subjt:  EEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDE

Query:  D---EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQA
            + K S+ ++ R           YK ++ IE  DE GR +TPKE+FR+LSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q 
Subjt:  D---EPKESKSKDSR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQA

Query:  QLKTPYLVLSGHVK
          KTPY+VLSG  K
Subjt:  QLKTPYLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT5G16780.1 SART-1 family2.1e-22055.31Show/hide
Query:  DSERDRVRSREKR--------KEDRDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDR
        +  + R   RE+R        +E RD   KE++  SK K+KDYDRE  ++K++ R+   K++ +DR+R R+ + +K+  RG+DKER K+K +DR +++D+
Subjt:  DSERDRVRSREKR--------KEDRDEHEKERNRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDR

Query:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQ
        + K++++ KDR NE + EK ++K R +          K+R  ++  EDD +T +  E+                                E++ +R +++
Subjt:  DRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKERILEDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQ

Query:  GNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNL
        G         +N D       AS+  L+ RI  M+E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D     +   +L
Subjt:  GNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNL

Query:  AGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGG
        +GVKVLHG++KV+EGGAV+LTLKDQ++L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ AA DEG+ LD +G 
Subjt:  AGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGG

Query:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE
        F  +AEKKLEELR+R+QG  +   FEDLN+S KVS DY++Q+EMLKFKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E
Subjt:  FNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSE

Query:  AEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKV
         E R  AYQ A AKADEASR L+  Q    + ++++  ++ADD ED YKSLE+AR+LAL KK+EA SGP  +A L  ++T NQTTDD  T   E QEN V
Subjt:  AEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLAL-KKQEAASGPGVIALLATATTNNQTTDDQNTKAGELQENKV

Query:  VFTEMEEFVWGLQLDEEDHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT-AEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKES
        VFTEM +FVWGLQ + +  KPE EDVFM++D  PK    E  ++   G TEV DT  +      D + I PDE IHEV VGKGLS AL+LLKDRGTLKE 
Subjt:  VFTEMEEFVWGLQLDEEDHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDT-AEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLKES

Query:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT
        +EWGGRNMDK+KSKLVGIVD+D  KESK K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DT
Subjt:  IEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADT

Query:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGT
        PS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK E  ++ T
Subjt:  PSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGGGAACGACTATCCGCACCTGATGAGAGAAATGGTCCAGATATCGCTTGGGCAAGAGACCGTGGGGATGGAGGACATGATGACTTTGGTTATAGTGGAGTAGA
AAAGTCAAGTAAACATCGAAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGAGCATAGAAGTAAAGATCGAGAACGATCTAAGAGACATAGTGATGATGCAT
CAAAAGAAAAGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCGTAGTCGAGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGAACAGGGGTAGC
AAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGA
GCTGGATAAGGACAATGTTCGAGGACAAGACAAAGAGAGGGGAAAAGAGAAAGACAAAGACAGGGACAGAGATAGGGATAGGGATAGGAAGAAGAAGGACAAGGACAAGG
ACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGAAAAGGACAGAGGAAAAGAGAGAATTTTG
GAAGATGATAGGAAAACAGATCAAAACAAGGAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAAATGATGAGGAAAGAATTGATTGGATTGCAGATGAGGCTAAGGA
TTATATGCTAGAAAGTGACGGTGAGAATAACAGGGACAGAGATGTTGATCAAGGGAACATGGCCCAGCATTTGGGAGGTGAAGAAAATTTTGATGGGTTGAAAGTCAGAG
CTCATGCTTCTTCCACCATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAAGTTTTATCATGGGTTAAAAGGAGTCGT
AAACTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGATAATATTGATCAAGGTGTAAGCGATGATGATATTGCACC
AGAAGATACAACTAGTAATCTAGCTGGAGTTAAAGTACTTCATGGCGTAGACAAAGTACTAGAAGGTGGTGCGGTTGTCTTAACTCTTAAAGATCAGAATATCTTAGCTG
ATGGCGACATTAATGAAGACGTTGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGAT
GACAAGTTTAATGATGAAAATTATGGTGAGAAGAAGATGCTTCCCCAGTATGATGATCCAGCAGCTGGTGACGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTTAATAA
TGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCGTCAGTCAAGCACTTTGAAGATCTTAATGCATCAACGAAAGTCTCACATGATTATTACA
CTCAGGATGAGATGCTTAAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCGGAAGCAATTTCTGCTGGATTG
GGTGTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCGCAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACATAAGGCATACCAGTCAGCCTATGC
TAAAGCAGATGAAGCGTCGAGATCTCTACAATTAGTTCAAACTAGCTCTGCCAGATTAGAGGACAATGATGATACCCTCATTGCAGATGATGATGAAGATTTCTATAAGT
CATTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGCAGCATCTGGACCAGGAGTAATTGCTCTTCTTGCTACAGCGACAACTAACAATCAGACAACTGATGAT
CAAAACACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAGGATCATAAACCAGAGGAAGA
AGATGTCTTTATGGATGACGATGAAGTACCAAAAGAAGAATATCATGAAGATGTTAAGGATAAAGATGGTGGGTGGACTGAAGTCAAAGATACTGCCGAAGAAGAACCCA
TTCCTGAGGATAACGAGGCAATAGCTCCAGACGAAACAATCCACGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAGGCTGCTTAAAGATCGAGGAACTCTGAAG
GAAAGCATTGAATGGGGCGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCGAAGGAATCTAAGTCGAAGGATTCTCGTTTATC
TTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGACGACTGTCTCACAAGTTCCATGGGA
AGGGACCTGGAAAAATGAAGCAAGAAAAGCGCATGAAACAATACCAAGAAGAGTTGAAGCTGAAGCAGATGAAGAATGCCGATACTCCTTCGTTATCAGTGGAGAGAATG
AGGGAAGCTCAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCACGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCAACAGTTGAAAAGGATCT
CCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCGAGTACAGGCACAAAGAAGGCCAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGGGAACGACTATCCGCACCTGATGAGAGAAATGGTCCAGATATCGCTTGGGCAAGAGACCGTGGGGATGGAGGACATGATGACTTTGGTTATAGTGGAGTAGA
AAAGTCAAGTAAACATCGAAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGAGCATAGAAGTAAAGATCGAGAACGATCTAAGAGACATAGTGATGATGCAT
CAAAAGAAAAGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCGTAGTCGAGAAAAGAGGAAGGAAGACAGAGATGAGCATGAAAAAGAAAGGAACAGGGGTAGC
AAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGA
GCTGGATAAGGACAATGTTCGAGGACAAGACAAAGAGAGGGGAAAAGAGAAAGACAAAGACAGGGACAGAGATAGGGATAGGGATAGGAAGAAGAAGGACAAGGACAAGG
ACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGAAAAGGACAGAGGAAAAGAGAGAATTTTG
GAAGATGATAGGAAAACAGATCAAAACAAGGAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAAATGATGAGGAAAGAATTGATTGGATTGCAGATGAGGCTAAGGA
TTATATGCTAGAAAGTGACGGTGAGAATAACAGGGACAGAGATGTTGATCAAGGGAACATGGCCCAGCATTTGGGAGGTGAAGAAAATTTTGATGGGTTGAAAGTCAGAG
CTCATGCTTCTTCCACCATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAAGTTTTATCATGGGTTAAAAGGAGTCGT
AAACTTGAGGAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGATAATATTGATCAAGGTGTAAGCGATGATGATATTGCACC
AGAAGATACAACTAGTAATCTAGCTGGAGTTAAAGTACTTCATGGCGTAGACAAAGTACTAGAAGGTGGTGCGGTTGTCTTAACTCTTAAAGATCAGAATATCTTAGCTG
ATGGCGACATTAATGAAGACGTTGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGAT
GACAAGTTTAATGATGAAAATTATGGTGAGAAGAAGATGCTTCCCCAGTATGATGATCCAGCAGCTGGTGACGAGGGCCTAACCCTAGATGGAAGAGGAGGTTTTAATAA
TGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCGTCAGTCAAGCACTTTGAAGATCTTAATGCATCAACGAAAGTCTCACATGATTATTACA
CTCAGGATGAGATGCTTAAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATTGATGCCCTTGAAGCGGAAGCAATTTCTGCTGGATTG
GGTGTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCGCAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACATAAGGCATACCAGTCAGCCTATGC
TAAAGCAGATGAAGCGTCGAGATCTCTACAATTAGTTCAAACTAGCTCTGCCAGATTAGAGGACAATGATGATACCCTCATTGCAGATGATGATGAAGATTTCTATAAGT
CATTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGCAGCATCTGGACCAGGAGTAATTGCTCTTCTTGCTACAGCGACAACTAACAATCAGACAACTGATGAT
CAAAACACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAGGATCATAAACCAGAGGAAGA
AGATGTCTTTATGGATGACGATGAAGTACCAAAAGAAGAATATCATGAAGATGTTAAGGATAAAGATGGTGGGTGGACTGAAGTCAAAGATACTGCCGAAGAAGAACCCA
TTCCTGAGGATAACGAGGCAATAGCTCCAGACGAAACAATCCACGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAGGCTGCTTAAAGATCGAGGAACTCTGAAG
GAAAGCATTGAATGGGGCGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCGAAGGAATCTAAGTCGAAGGATTCTCGTTTATC
TTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGACGACTGTCTCACAAGTTCCATGGGA
AGGGACCTGGAAAAATGAAGCAAGAAAAGCGCATGAAACAATACCAAGAAGAGTTGAAGCTGAAGCAGATGAAGAATGCCGATACTCCTTCGTTATCAGTGGAGAGAATG
AGGGAAGCTCAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCACGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCAACAGTTGAAAAGGATCT
CCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCGAGTACAGGCACAAAGAAGGCCAAAGTTTGA
Protein sequenceShow/hide protein sequence
MDGERLSAPDERNGPDIAWARDRGDGGHDDFGYSGVEKSSKHRSEDHRKSSRGEEKEHRSKDRERSKRHSDDASKEKEKEVKDSERDRVRSREKRKEDRDEHEKERNRGS
KVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELDKDNVRGQDKERGKEKDKDRDRDRDRDRKKKDKDKDRSNENEREKGREKHRDQEDKESYRNIEKDRGKERIL
EDDRKTDQNKEKSQDKEGIGSKNDEERIDWIADEAKDYMLESDGENNRDRDVDQGNMAQHLGGEENFDGLKVRAHASSTMLEERIRNMKEDRLKKQTEESEVLSWVKRSR
KLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYD
DKFNDENYGEKKMLPQYDDPAAGDEGLTLDGRGGFNNDAEKKLEELRRRLQGASSVKHFEDLNASTKVSHDYYTQDEMLKFKKPKKKKSLRKKEKLDIDALEAEAISAGL
GVGDLGSRNGSRRQAQKEEQEKSEAEMRHKAYQSAYAKADEASRSLQLVQTSSARLEDNDDTLIADDDEDFYKSLERARKLALKKQEAASGPGVIALLATATTNNQTTDD
QNTKAGELQENKVVFTEMEEFVWGLQLDEEDHKPEEEDVFMDDDEVPKEEYHEDVKDKDGGWTEVKDTAEEEPIPEDNEAIAPDETIHEVPVGKGLSSALRLLKDRGTLK
ESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRRLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASSTGTKKAKV