CuGenDBv2

Moc10g04730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g04730
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: SART-1 family protein DOT2
Genome location: chr10:3288680..3295497
RNA-Seq Expression: Moc10g04730
Synteny: Moc10g04730
Gene Ontology terms:
GO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domains: IPR005011 - SNU66/SART1 family


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600325.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0e+00; % identity: 88.03
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GE G DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----RDRDRKKKEKEKDRSNENEREKGREKHRDQ
        H+KER+R  KV          KEKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD    RDRDRKKKEK+KDRSNENEREKGREK RDQ
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----RDRDRKKKEKEKDRSNENEREKGREKHRDQ

Query:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVML
        EEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG++     VDQGNAV HLGGEENSDGLKVGAQ SS ML
Subjt:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVML

Query:  EERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSI
        EERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ+I
Subjt:  EERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSI

Query:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFED
        LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HFED
Subjt:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFED

Query:  LNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQT
        LNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLVQ 
Subjt:  LNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQT

Query:  SSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVF
        SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLA TTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEEDVF
Subjt:  SSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVF

Query:  MDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK
        MDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+K
Subjt:  MDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK

Query:  SKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
        SK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
Subjt:  SKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP

Query:  GQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        GQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  GQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 88.13
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GE G DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----RDRDRKKKEKEKDRSNENEREKGREKHRDQ
        H+KER+R  KV          KEKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD    RDRDRKKKEK+KDRSNENEREKGREK RDQ
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----RDRDRKKKEKEKDRSNENEREKGREKHRDQ

Query:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVML
        EEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG++     VDQGNAV+HLGGEENSDGLKVGAQ SS ML
Subjt:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVML

Query:  EERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSI
        EERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ+I
Subjt:  EERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSI

Query:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFED
        LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HFED
Subjt:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFED

Query:  LNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQT
        LNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLVQ 
Subjt:  LNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQT

Query:  SSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVF
        SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEEDVF
Subjt:  SSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVF

Query:  MDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK
        MDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+K
Subjt:  MDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESK

Query:  SKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
        SK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP
Subjt:  SKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKP

Query:  GQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        GQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  GQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_022149190.1 SART-1 family protein DOT2 [Momordica charantia]; e value: 0.0e+00; % identity: 100
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
        SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
Subjt:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL

Query:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
        EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
Subjt:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV

Query:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
        LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
Subjt:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ

Query:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
        KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
Subjt:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK

Query:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
        KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
Subjt:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY

Query:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
        KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
Subjt:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW

Query:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
        TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
Subjt:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD

Query:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
        EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
Subjt:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT

Query:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
Subjt:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_022942374.1 SART-1 family protein DOT2 [Cucurbita moschata]; e value: 0.0e+00; % identity: 87.06
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SS P+HDERNGH     GE G DDFG SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----------RDRDRKKKEKEKDRSNENEREKGR
        H+KER+R  KV          KEKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD          RDRDRKKKEK+KDRSNENEREKGR
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----------RDRDRKKKEKEKDRSNENEREKGR

Query:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQ
        EK RDQEEKES RN DK+RGKEK L DD+K DQNKEK R  E  G KN+EERIDW     KDYML+SDG++     VDQGNAV+ LGGEENSDGLKVGAQ
Subjt:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQ

Query:  PSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLT
         SS MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLT
Subjt:  PSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLT

Query:  LKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASS
        LKDQ+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASS
Subjt:  LKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASS

Query:  VNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRS
        V HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRS
Subjt:  VNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRS

Query:  LQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKP
        LQLVQ SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKP
Subjt:  LQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKP

Query:  EEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED
        EEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DED
Subjt:  EEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED

Query:  EPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        SGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  SGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_023534590.1 SART-1 family protein DOT2 [Cucurbita pepo subsp. pepo]; e value: 0.0e+00; % identity: 87.84
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GE G DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD------RDRDRKKKEKEKDRSNENEREKGREKHR
        H+KER+R  KV          KEKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD      RDRDRKKKEK+KDRSNENEREKGREK R
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD------RDRDRKKKEKEKDRSNENEREKGREKHR

Query:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSV
        DQEEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG++     VDQGNAV+HLGGEENSDGLKVGAQ SS 
Subjt:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSV

Query:  MLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ
        MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ
Subjt:  MLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ

Query:  SILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHF
        +ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HF
Subjt:  SILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHF

Query:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLV
        EDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG RNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLV
Subjt:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLV

Query:  QTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEED
        Q SS+RL+DNEDTFI DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTT S Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEED
Subjt:  QTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEED

Query:  VFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE
Subjt:  VFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  SKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        +KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  SKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        KPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  KPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CAS5 SART-1 family protein DOT2 isoform X2; e value: 0.0e+00; % identity: 85.17
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MD ERSS P  DERNG      DD G+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDR+RSKR SDDA KE+EKEVKDSERDRV SR++RKE+RDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKE------KDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEK
         RGSKV          K+KEYERERDRKDRGKD+ER RER+LEKDNVRG DKERGKE      KDRDRDRDRKKK+K+KDRSNE EREKGREKHRDQE+K
Subjt:  SRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKE------KDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEK

Query:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER
        ES RN DKERGKE+ILEDDRK DQ K+K   +E IGSKNDEER  W  D GKDYML+SDG+N    DV+QGN V+HLGGEEN DGLKVG+ PSS MLEER
Subjt:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER

Query:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL
        IR MKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDD+A E+TT+N  L GVKVLHGVDKVLEGGAVVLTLKDQSIL
Subjt:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL

Query:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL
        ADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPA ADEGLTLD RG   NDAEKKLEELR+RLQG SSV HFEDL
Subjt:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL

Query:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS
        N S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQA+KEEQE+SEAEMR +AYQSAYAKADEASRSLQLVQTS
Subjt:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS

Query:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM
        S RLEDN+D  IADDDED YKSLERARKLALKKQ +AASGP AIALLAT TTSSQ+TDD NTKAGE+QENKV+FTEMEEFVWGLQLDE++HKPEEEDVFM
Subjt:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM

Query:  DDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
        DDDE PK EYHED KDKDGGWTEVKDTA+EE  P++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
Subjt:  DDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS

Query:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
        K+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
Subjt:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG

Query:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        QTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE +N+GTKK+KV
Subjt:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X1; e value: 0.0e+00; % identity: 85.17
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MD ERSS P  DERNG      DD G+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDR+RSKR SDDA KE+EKEVKDSERDRV SR++RKE+RDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKE------KDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEK
         RGSKV          K+KEYERERDRKDRGKD+ER RER+LEKDNVRG DKERGKE      KDRDRDRDRKKK+K+KDRSNE EREKGREKHRDQE+K
Subjt:  SRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKE------KDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEK

Query:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER
        ES RN DKERGKE+ILEDDRK DQ K+K   +E IGSKNDEER  W  D GKDYML+SDG+N    DV+QGN V+HLGGEEN DGLKVG+ PSS MLEER
Subjt:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER

Query:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL
        IR MKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDD+A E+TT+N  L GVKVLHGVDKVLEGGAVVLTLKDQSIL
Subjt:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL

Query:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL
        ADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPA ADEGLTLD RG   NDAEKKLEELR+RLQG SSV HFEDL
Subjt:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL

Query:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS
        N S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQA+KEEQE+SEAEMR +AYQSAYAKADEASRSLQLVQTS
Subjt:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS

Query:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM
        S RLEDN+D  IADDDED YKSLERARKLALKKQ +AASGP AIALLAT TTSSQ+TDD NTKAGE+QENKV+FTEMEEFVWGLQLDE++HKPEEEDVFM
Subjt:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM

Query:  DDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
        DDDE PK EYHED KDKDGGWTEVKDTA+EE  P++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
Subjt:  DDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS

Query:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
        K+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
Subjt:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG

Query:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        QTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE +N+GTKK+KV
Subjt:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1D793 SART-1 family protein DOT2; e value: 0.0e+00; % identity: 100
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
        SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
Subjt:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL

Query:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
        EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
Subjt:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV

Query:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
        LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
Subjt:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ

Query:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
        KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
Subjt:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK

Query:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
        KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
Subjt:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY

Query:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
        KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
Subjt:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW

Query:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
        TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
Subjt:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD

Query:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
        EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
Subjt:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT

Query:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
Subjt:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1FR42 SART-1 family protein DOT2; e value: 0.0e+00; % identity: 87.06
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SS P+HDERNGH     GE G DDFG SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----------RDRDRKKKEKEKDRSNENEREKGR
        H+KER+R  KV          KEKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD          RDRDRKKKEK+KDRSNENEREKGR
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD----------RDRDRKKKEKEKDRSNENEREKGR

Query:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQ
        EK RDQEEKES RN DK+RGKEK L DD+K DQNKEK R  E  G KN+EERIDW     KDYML+SDG++     VDQGNAV+ LGGEENSDGLKVGAQ
Subjt:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQ

Query:  PSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLT
         SS MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLT
Subjt:  PSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLT

Query:  LKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASS
        LKDQ+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASS
Subjt:  LKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASS

Query:  VNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRS
        V HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRS
Subjt:  VNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRS

Query:  LQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKP
        LQLVQ SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKP
Subjt:  LQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKP

Query:  EEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED
        EEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DED
Subjt:  EEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDED

Query:  EPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        EPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Subjt:  EPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

Query:  SGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        SGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  SGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1IPE4 SART-1 family protein DOT2; e value: 0.0e+00; % identity: 86.9
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GE G DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD------RDRDRKKKEKEKDRSNENEREKGREKHR
        H+KER+R  KV          KEKEY+RERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRD      RDRDRKKKEK+KDRSNENEREKGREK R
Subjt:  HEKERSRGSKV----------KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRD------RDRDRKKKEKEKDRSNENEREKGREKHR

Query:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSV
        DQEEKES RN DK+RGKEK L DD+K DQNKEK R  E IG KNDEERIDW          +SDG++     VDQGNAV+HLGGE+NSDGLKVGAQ SS 
Subjt:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSV

Query:  MLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ
        MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ
Subjt:  MLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQ

Query:  SILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHF
        +ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HF
Subjt:  SILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHF

Query:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLV
        EDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLV
Subjt:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLV

Query:  QTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEED
        Q SS+RL+ NEDT I DDDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEED
Subjt:  QTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEED

Query:  VFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDEAPK EYHEDEKDKDGGWTEVKDTA+EEP PEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE
Subjt:  VFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  SKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        +KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  SKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        KPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P N+GTKK K+
Subjt:  KPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

SwissProt top hits    e value    %identity    Alignment
O43290 U4/U6.U5 tri-snRNP-associated protein 1    3.6e-20    26.32
Query:  KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKA
        +E +  + R     G   ER R+R  E+   RG  +   + + R     R     E+ ++  +ER   REK  D  E  +   T         +E+  K 
Subjt:  KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKA

Query:  DQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLS
               R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +
Subjt:  DQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLS

Query:  WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLE
        W++RSR+L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  ++LTLKD+ +L      E+ DVL 
Subjt:  WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLE

Query:  NVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVK
        NV + ++++ +   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S++         +
Subjt:  NVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVK

Query:  VSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIR
        ++ +Y T +EM+ FKK K++ K +RKKEK + + A +   +      GD GSR   R + R      SE E             D    ++ +       
Subjt:  VSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIR

Query:  LEDNEDTFIADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDEES
                + ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  + E 
Subjt:  LEDNEDTFIADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDEES

Query:  HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGI
         + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  
Subjt:  HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGI

Query:  VDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMRE
        +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E
Subjt:  VDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMRE

Query:  AQAQLKTPYLVLSGHVK
         Q   KTPY+VLSG  K
Subjt:  AQAQLKTPYLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 1    1.6e-20    26.59
Query:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K
        ++  R ++ ++ + R     +  G+ ++R +E+  +R   R+  E E  RS  + RE+ + +  ++  K   R+   E     K    D  +   +E  K
Subjt:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K

Query:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR
         R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +W++RSR
Subjt:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR

Query:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE
        +L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  VVLTLKD+ +L +G+     DVL NV + +
Subjt:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE

Query:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY
        +++ D   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S+N         +++ +Y 
Subjt:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY

Query:  TQDEMLQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
        + +EM+ FKK K++ K +RKKEK ++     + +  G      GD GSR   R + R  E E    E       +    +D+        +  ++ + D 
Subjt:  TQDEMLQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFI-------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLD
        ED           ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  +
Subjt:  EDTFI-------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLD

Query:  EESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KL
         E  + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    
Subjt:  EESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KL

Query:  VGIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
        V  +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  
Subjt:  VGIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER

Query:  MREAQAQLKTPYLVLSGHVK
        ++E Q   KTPY+VLSG  K
Subjt:  MREAQAQLKTPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT2    5.4e-226    58.52
Query:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC
        +E RD   KE+   SK KEK+Y+RE+ R KD  +DKE+ R+R+  +D    ++  RG++K+R++D+ R  + KEKD+  E  R K RE  RD E EK+  
Subjt:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC

Query:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK
        R   KER  +K  EDD +  +  E+                               NR  ++           G +N D    G + S++ L+ RI  M+
Subjt:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK

Query:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE
        E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN 
Subjt:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE

Query:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS
        ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEELRKR+QG  + + FEDLN+S KVS
Subjt:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS

Query:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
         DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+ER E E R +AYQ A AKADEASR L+  Q    + +++
Subjt:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK
        E   +ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +Q+TDD+ T   E QEN VVFTEM +FVWGLQ + +  KPE EDVFM++D APK
Subjt:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK

Query:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS
           E +++   G TEV DT  +      D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+   
Subjt:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS

Query:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN
          D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP +
Subjt:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN

Query:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT
         FATVEKD+PG LTPMLGDRKVEHFLGIKRK EP NS T
Subjt:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 1    9.4e-21    26.62
Query:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K
        ++  R ++ ++ + R     +  G+ ++R +E+  +R   R+  E E  RS  + RE+ + +  ++  K   R+   E     K    D  +   +E  K
Subjt:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K

Query:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR
         R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +W++RSR
Subjt:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR

Query:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE
        +L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  VVLTLKD+ +L DG+     DVL NV + +
Subjt:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE

Query:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY
        +++ D   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S++     +   +++ +Y 
Subjt:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY

Query:  TQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNED
        + +EM+ FKK K++ K +RKKEK + + A +   +      GD GSR   R + R  E E    E       +    +D+        +  ++ + D ED
Subjt:  TQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNED

Query:  TFI--------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDE
                    ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  + 
Subjt:  TFI--------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDE

Query:  ESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLV
        E  + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V
Subjt:  ESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLV

Query:  GIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
          +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  +
Subjt:  GIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM

Query:  REAQAQLKTPYLVLSGHVK
        +E Q   KTPY+VLSG  K
Subjt:  REAQAQLKTPYLVLSGHVK

Arabidopsis top hits    e value    %identity    Alignment
AT3G14700.1 SART-1 family    1.5e-18    39.31
Query:  EVKDTAEE--EPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERT
        EV   AE+  +   +++  +  D  + E  VG GLS AL  L+++GT KE            + K+VG+      K++  ++ R     D  K+I I+R 
Subjt:  EVKDTAEE--EPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERT

Query:  DEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        +++GRIMT KE++R L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  KTPY+VL
Subjt:  DEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family    3.8e-227    58.52
Query:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC
        +E RD   KE+   SK KEK+Y+RE+ R KD  +DKE+ R+R+  +D    ++  RG++K+R++D+ R  + KEKD+  E  R K RE  RD E EK+  
Subjt:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC

Query:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK
        R   KER  +K  EDD +  +  E+                               NR  ++           G +N D    G + S++ L+ RI  M+
Subjt:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK

Query:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE
        E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN 
Subjt:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE

Query:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS
        ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEELRKR+QG  + + FEDLN+S KVS
Subjt:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS

Query:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
         DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+ER E E R +AYQ A AKADEASR L+  Q    + +++
Subjt:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK
        E   +ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +Q+TDD+ T   E QEN VVFTEM +FVWGLQ + +  KPE EDVFM++D APK
Subjt:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK

Query:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS
           E +++   G TEV DT  +      D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+   
Subjt:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS

Query:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN
          D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP +
Subjt:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN

Query:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT
         FATVEKD+PG LTPMLGDRKVEHFLGIKRK EP NS T
Subjt:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT


Sequences
CDS sequence
ATGGACACGGAACGGTCATCAGTGCCTGATCATGATGAGAGAAATGGTCATGGGGAAGGAGGACACGATGACTTTGGATGGAGTGGAGCAGAAAAGTCTAGTAAG
CATCGGAGTGAGGATCATCGGAAGAATAGTCGAGGGGAGGAAAAAGACCACAGAAGTAAAGATCGAGACCGATCCAAGAGACGTAGTGACGATGCACCAAAAGAA
AGGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATAGTCGGGACAGGAGGAAGGAAGAAAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGTAGCAAA
GTTAAAGAAAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAACGTGGGAGGGAGAGACAGTTGGAGAAGGATAATGTACGAGGACAA
GACAAAGAAAGGGGAAAGGAGAAAGACAGGGACAGGGATAGGGATAGGAAAAAAAAAGAAAAGGAAAAGGACAGGTCAAATGAAAATGAAAGAGAAAAAGGGAGA
GAGAAACACAGAGATCAAGAGGAGAAGGAAAGCTGTAGGAACACTGACAAGGAGAGAGGAAAAGAGAAAATTTTGGAAGATGACAGGAAAGCAGATCAAAACAAG
GAGAAATCAAGGGAATTGATTGGCAGTAAAAATGATGAGGAAAGAATTGATTGGGCTGGAGATGCGGGTAAGGATTATATGCTAAAAAGTGATGGTCAGAACCGG
GACAAAGATGATGTTGATCAAGGGAATGCTGTCCGGCATTTGGGAGGTGAAGAAAATTCTGATGGTTTGAAAGTTGGAGCTCAGCCTTCTTCAGTTATGCTTGAG
GAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAGCTTGAGGAGAAGAAACTT
TCTGAAAAAGAAAAAGCATTGCAGCTCTCAAAGATTTTTGAGGAACAGGACAATATTGATCAAGGTGGAAGTGATGATGATCTTGCAGCAGAAGATACAACTAGT
AATCTAGCAGGAGTTAAAGTACTTCATGGCGTAGATAAAGTACTGGAAGGTGGTGCAGTTGTCTTAACACTTAAAGATCAGAGTATCTTAGCTGATGGTGACGTT
AACGAAGACATGGATGTACTTGAGAATGTGGAAATTGGAGAGCAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATCTATGATGACAAG
TTTAATGATGAAAATCATGGTGAGAAAAAGATTCTTCCACAGTATGATGATCCCGCAGCTGCAGATGAGGGTCTAACTCTAGATGAAAGAGGACGTCTTACTAAT
GATGCAGAGAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAATCACTTTGAAGATCTGAATGCATCAGTGAAAGTCTCACATGATTAT
TACACTCAAGATGAGATGCTTCAGTTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATCGATGCCCTTGAAGCAGAAGCAATCTCT
GCTGGATTGGGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGAAGGCAAGCACGTAAAGAGGAACAGGAAAGGTCTGAAGCAGAAATGCGGCATAGTGCGTAC
CAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTCCAATTAGTTCAAACTAGTTCAATCAGATTAGAGGACAATGAAGATACTTTCATTGCAGACGAT
GATGAAGATCTCTATAAATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCGGACCAGAAGCTATTGCTCTTCTTGCTACAACA
ACGACCAGCAGTCAGTCAACTGATGATCATAACACAAAAGCTGGAGAGGTGCAGGAGAATAAGGTTGTGTTTACAGAAATGGAAGAATTTGTATGGGGTCTCCAG
CTTGATGAAGAATCTCATAAACCTGAGGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGAGAAGGATAAGGATGGTGGGTGG
ACTGAAGTTAAAGATACTGCTGAAGAAGAACCCACACCTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCCGTTGGAAAGGGATTATCC
AGTGCGCTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGGAAAAGCAAACTTGTTGGTATAGTAGAC
GAAGACGAACCAAAGGAATCGAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTTGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATT
ATGACTCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGGCAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTG
AAGTTGAAGCAGATGAAGAATGCAGATACACCTTCTTTGTCAGTGGAGAGAATGAGGGAGGCTCAGGCACAACTAAAGACCCCTTATCTTGTTCTCAGTGGTCAT
GTAAAACCTGGCCAAACGAGTGATCCAATAAATGCTTTTGCTACTGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAACAT
TTCTTGGGGATAAAGCGTAAAGGCGAACCTACGAATTCAGGCACAAAGAAGTCCAAAGTTTGA
mRNA sequence
ATGGACACGGAACGGTCATCAGTGCCTGATCATGATGAGAGAAATGGTCATGGGGAAGGAGGACACGATGACTTTGGATGGAGTGGAGCAGAAAAGTCTAGTAAG
CATCGGAGTGAGGATCATCGGAAGAATAGTCGAGGGGAGGAAAAAGACCACAGAAGTAAAGATCGAGACCGATCCAAGAGACGTAGTGACGATGCACCAAAAGAA
AGGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATAGTCGGGACAGGAGGAAGGAAGAAAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGTAGCAAA
GTTAAAGAAAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAACGTGGGAGGGAGAGACAGTTGGAGAAGGATAATGTACGAGGACAA
GACAAAGAAAGGGGAAAGGAGAAAGACAGGGACAGGGATAGGGATAGGAAAAAAAAAGAAAAGGAAAAGGACAGGTCAAATGAAAATGAAAGAGAAAAAGGGAGA
GAGAAACACAGAGATCAAGAGGAGAAGGAAAGCTGTAGGAACACTGACAAGGAGAGAGGAAAAGAGAAAATTTTGGAAGATGACAGGAAAGCAGATCAAAACAAG
GAGAAATCAAGGGAATTGATTGGCAGTAAAAATGATGAGGAAAGAATTGATTGGGCTGGAGATGCGGGTAAGGATTATATGCTAAAAAGTGATGGTCAGAACCGG
GACAAAGATGATGTTGATCAAGGGAATGCTGTCCGGCATTTGGGAGGTGAAGAAAATTCTGATGGTTTGAAAGTTGGAGCTCAGCCTTCTTCAGTTATGCTTGAG
GAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAGCTTGAGGAGAAGAAACTT
TCTGAAAAAGAAAAAGCATTGCAGCTCTCAAAGATTTTTGAGGAACAGGACAATATTGATCAAGGTGGAAGTGATGATGATCTTGCAGCAGAAGATACAACTAGT
AATCTAGCAGGAGTTAAAGTACTTCATGGCGTAGATAAAGTACTGGAAGGTGGTGCAGTTGTCTTAACACTTAAAGATCAGAGTATCTTAGCTGATGGTGACGTT
AACGAAGACATGGATGTACTTGAGAATGTGGAAATTGGAGAGCAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATCTATGATGACAAG
TTTAATGATGAAAATCATGGTGAGAAAAAGATTCTTCCACAGTATGATGATCCCGCAGCTGCAGATGAGGGTCTAACTCTAGATGAAAGAGGACGTCTTACTAAT
GATGCAGAGAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAATCACTTTGAAGATCTGAATGCATCAGTGAAAGTCTCACATGATTAT
TACACTCAAGATGAGATGCTTCAGTTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATCGATGCCCTTGAAGCAGAAGCAATCTCT
GCTGGATTGGGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGAAGGCAAGCACGTAAAGAGGAACAGGAAAGGTCTGAAGCAGAAATGCGGCATAGTGCGTAC
CAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTCCAATTAGTTCAAACTAGTTCAATCAGATTAGAGGACAATGAAGATACTTTCATTGCAGACGAT
GATGAAGATCTCTATAAATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCGGACCAGAAGCTATTGCTCTTCTTGCTACAACA
ACGACCAGCAGTCAGTCAACTGATGATCATAACACAAAAGCTGGAGAGGTGCAGGAGAATAAGGTTGTGTTTACAGAAATGGAAGAATTTGTATGGGGTCTCCAG
CTTGATGAAGAATCTCATAAACCTGAGGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGAGAAGGATAAGGATGGTGGGTGG
ACTGAAGTTAAAGATACTGCTGAAGAAGAACCCACACCTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCCGTTGGAAAGGGATTATCC
AGTGCGCTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGGAAAAGCAAACTTGTTGGTATAGTAGAC
GAAGACGAACCAAAGGAATCGAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTTGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATT
ATGACTCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGGCAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTG
AAGTTGAAGCAGATGAAGAATGCAGATACACCTTCTTTGTCAGTGGAGAGAATGAGGGAGGCTCAGGCACAACTAAAGACCCCTTATCTTGTTCTCAGTGGTCAT
GTAAAACCTGGCCAAACGAGTGATCCAATAAATGCTTTTGCTACTGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAACAT
TTCTTGGGGATAAAGCGTAAAGGCGAACCTACGAATTCAGGCACAAAGAAGTCCAAAGTTTGA
Protein sequence
MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKERSRGSK
VKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKADQNK
EKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKL
SEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDK
FNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAIS
AGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATT
TTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLS
SALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEEL
KLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV