; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0406 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0406
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSART-1 family protein DOT2
Genome locationMC10:3257289..3265668
RNA-Seq ExpressionMC10g0406
SyntenyMC10g0406
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600325.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. sororia]0.088.25Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GEG  DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----DRDRKKKEKEKDRSNENEREKGREKHRDQ
        H+KER+R  KVK          EKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR    DRDRKKKEK+KDRSNENEREKGREK RDQ
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----DRDRKKKEKEKDRSNENEREKGREKHRDQ

Query:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVM
        EEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG+ NRD+  VDQGNAV HLGGEENSDGLKVGAQ SS M
Subjt:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVM

Query:  LEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQS
        LEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ+
Subjt:  LEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQS

Query:  ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFE
        ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HFE
Subjt:  ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFE

Query:  DLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQ
        DLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLVQ
Subjt:  DLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQ

Query:  TSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDV
         SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTT S Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEEDV
Subjt:  TSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDV

Query:  FMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
        FMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+
Subjt:  FMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES

Query:  KSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
Subjt:  KSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Query:  PGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  PGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.088.35Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GEG  DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----DRDRKKKEKEKDRSNENEREKGREKHRDQ
        H+KER+R  KVK          EKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR    DRDRKKKEK+KDRSNENEREKGREK RDQ
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----DRDRKKKEKEKDRSNENEREKGREKHRDQ

Query:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVM
        EEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG+ NRD+  VDQGNAV+HLGGEENSDGLKVGAQ SS M
Subjt:  EEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVM

Query:  LEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQS
        LEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKDQ+
Subjt:  LEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQS

Query:  ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFE
        ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV HFE
Subjt:  ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFE

Query:  DLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQ
        DLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQLVQ
Subjt:  DLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQ

Query:  TSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDV
         SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEEDV
Subjt:  TSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDV

Query:  FMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES
        FMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE+
Subjt:  FMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKES

Query:  KSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
Subjt:  KSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Query:  PGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  PGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_022149190.1 SART-1 family protein DOT2 [Momordica charantia]0.0100Show/hide
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
        SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
Subjt:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL

Query:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
        EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
Subjt:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV

Query:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
        LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
Subjt:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ

Query:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
        KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
Subjt:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK

Query:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
        KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
Subjt:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY

Query:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
        KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
Subjt:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW

Query:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
        TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
Subjt:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD

Query:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
        EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
Subjt:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT

Query:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
Subjt:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_022942374.1 SART-1 family protein DOT2 [Cucurbita moschata]0.087.28Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SS P+HDERNGH     GEG  DDFG SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----------DRDRKKKEKEKDRSNENEREKGR
        H+KER+R  KVK          EKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR          DRDRKKKEK+KDRSNENEREKGR
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----------DRDRKKKEKEKDRSNENEREKGR

Query:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGA
        EK RDQEEKES RN DK+RGKEK L DD+K DQNKEK R  E  G KN+EERIDW     KDYML+SDG+ NRD+  VDQGNAV+ LGGEENSDGLKVGA
Subjt:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGA

Query:  QPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVL
        Q SS MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVL
Subjt:  QPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVL

Query:  TLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGAS
        TLKDQ+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGAS
Subjt:  TLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGAS

Query:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASR
        SV HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASR
Subjt:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASR

Query:  SLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHK
        SLQLVQ SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHK
Subjt:  SLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHK

Query:  PEEEDVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
        PEEEDVFMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DE
Subjt:  PEEEDVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE

Query:  DEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        DEPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
Subjt:  DEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        LSGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  LSGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

XP_023534590.1 SART-1 family protein DOT2 [Cucurbita pepo subsp. pepo]0.088.06Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GEG  DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR------DRDRKKKEKEKDRSNENEREKGREKHR
        H+KER+R  KVK          EKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR      DRDRKKKEK+KDRSNENEREKGREK R
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR------DRDRKKKEKEKDRSNENEREKGREKHR

Query:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSS
        DQEEKES RN DKERGKEK L DD+K DQNKEK R  E IG KNDEERIDW     KDYML+SDG+ NRD+  VDQGNAV+HLGGEENSDGLKVGAQ SS
Subjt:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSS

Query:  VMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKD
         MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKD
Subjt:  VMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKD

Query:  QSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNH
        Q+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV H
Subjt:  QSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNH

Query:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQL
        FEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG RNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQL
Subjt:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQL

Query:  VQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEE
        VQ SS+RL+DNEDTFI DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTT S Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEE
Subjt:  VQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEE

Query:  DVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPK
Subjt:  DVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        E+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        VKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  VKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

TrEMBL top hitse value%identityAlignment
A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.085.17Show/hide
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MD ERSS PD  ERNG      DD G+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDR+RSKR SDDA KE+EKEVKDSERDRV SR++RKE+RDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEK----------EYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRD------RDRKKKEKEKDRSNENEREKGREKHRDQEEK
         RGSKVK+K          EYERERDRKDRGKD+ER RER+LEKDNVRG DKERGKEKDRDRD      RDRKKK+K+KDRSNE EREKGREKHRDQE+K
Subjt:  SRGSKVKEK----------EYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRD------RDRKKKEKEKDRSNENEREKGREKHRDQEEK

Query:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER
        ES RN DKERGKE+ILEDDRK DQ K+K   +E IGSKNDEER  W  D GKDYML+SDG+N    DV+QGN V+HLGGEEN DGLKVG+ PSS MLEER
Subjt:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER

Query:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL
        IR MKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDD+A E+TT+N  L GVKVLHGVDKVLEGGAVVLTLKDQSIL
Subjt:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL

Query:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL
        ADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPA ADEGLTLD RG   NDAEKKLEELR+RLQG SSV HFEDL
Subjt:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL

Query:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS
        N S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQA+KEEQE+SEAEMR +AYQSAYAKADEASRSLQLVQTS
Subjt:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS

Query:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM
        S RLEDN+D  IADDDED YKSLERARKLALKKQ+ AASGP AIALLAT TTSSQ+TDD NTKAGE+QENKV+FTEMEEFVWGLQLDE++HKPEEEDVFM
Subjt:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM

Query:  DDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
        DDDE PKE YHED KDKDGGWTEVKDTA+EE  P++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
Subjt:  DDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS

Query:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
        K+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
Subjt:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG

Query:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        QTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE +N+GTKK+KV
Subjt:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.085.17Show/hide
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MD ERSS PD  ERNG      DD G+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDR+RSKR SDDA KE+EKEVKDSERDRV SR++RKE+RDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEK----------EYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRD------RDRKKKEKEKDRSNENEREKGREKHRDQEEK
         RGSKVK+K          EYERERDRKDRGKD+ER RER+LEKDNVRG DKERGKEKDRDRD      RDRKKK+K+KDRSNE EREKGREKHRDQE+K
Subjt:  SRGSKVKEK----------EYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRD------RDRKKKEKEKDRSNENEREKGREKHRDQEEK

Query:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER
        ES RN DKERGKE+ILEDDRK DQ K+K   +E IGSKNDEER  W  D GKDYML+SDG+N    DV+QGN V+HLGGEEN DGLKVG+ PSS MLEER
Subjt:  ESCRNTDKERGKEKILEDDRKADQNKEK--SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEER

Query:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL
        IR MKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDD+A E+TT+N  L GVKVLHGVDKVLEGGAVVLTLKDQSIL
Subjt:  IRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSN--LAGVKVLHGVDKVLEGGAVVLTLKDQSIL

Query:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL
        ADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPA ADEGLTLD RG   NDAEKKLEELR+RLQG SSV HFEDL
Subjt:  ADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDL

Query:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS
        N S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQA+KEEQE+SEAEMR +AYQSAYAKADEASRSLQLVQTS
Subjt:  NASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTS

Query:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM
        S RLEDN+D  IADDDED YKSLERARKLALKKQ+ AASGP AIALLAT TTSSQ+TDD NTKAGE+QENKV+FTEMEEFVWGLQLDE++HKPEEEDVFM
Subjt:  SIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFM

Query:  DDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
        DDDE PKE YHED KDKDGGWTEVKDTA+EE  P++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS
Subjt:  DDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKS

Query:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
        K+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG
Subjt:  KESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPG

Query:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        QTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE +N+GTKK+KV
Subjt:  QTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1D793 SART-1 family protein DOT20.0100Show/hide
Query:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
        MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER
Subjt:  MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKER

Query:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
        SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL
Subjt:  SRGSKVKEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKIL

Query:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
        EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV
Subjt:  EDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEV

Query:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
        LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ
Subjt:  LSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQ

Query:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
        KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK
Subjt:  KQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFK

Query:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
        KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY
Subjt:  KPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLY

Query:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
        KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW
Subjt:  KSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGW

Query:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
        TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD
Subjt:  TEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTD

Query:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
        EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT
Subjt:  EFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPINAFATVEKDLPGGLT

Query:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
Subjt:  PMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1FR42 SART-1 family protein DOT20.087.28Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SS P+HDERNGH     GEG  DDFG SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----------DRDRKKKEKEKDRSNENEREKGR
        H+KER+R  KVK          EKEYERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR          DRDRKKKEK+KDRSNENEREKGR
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR----------DRDRKKKEKEKDRSNENEREKGR

Query:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGA
        EK RDQEEKES RN DK+RGKEK L DD+K DQNKEK R  E  G KN+EERIDW     KDYML+SDG+ NRD+  VDQGNAV+ LGGEENSDGLKVGA
Subjt:  EKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGA

Query:  QPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVL
        Q SS MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVL
Subjt:  QPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVL

Query:  TLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGAS
        TLKDQ+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGAS
Subjt:  TLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGAS

Query:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASR
        SV HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASR
Subjt:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASR

Query:  SLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHK
        SLQLVQ SS+RL+DNEDT I DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHK
Subjt:  SLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHK

Query:  PEEEDVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
        PEEEDVFMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEPTPEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DE
Subjt:  PEEEDVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE

Query:  DEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        DEPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
Subjt:  DEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        LSGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P+N+GTKK K+
Subjt:  LSGHVKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

A0A6J1IPE4 SART-1 family protein DOT20.087.12Show/hide
Query:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE
        MD + SSVP+HDERNGH     GEG  DDFG+SGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDRSKRRSDDA KE+EKEVKDSERDRVH R+RRKE+RDE
Subjt:  MDTERSSVPDHDERNGH-----GEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDE

Query:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR------DRDRKKKEKEKDRSNENEREKGREKHR
        H+KER+R  KVK          EKEY+RERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEKDRDR      DRDRKKKEK+KDRSNENEREKGREK R
Subjt:  HEKERSRGSKVK----------EKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDR------DRDRKKKEKEKDRSNENEREKGREKHR

Query:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSS
        DQEEKES RN DK+RGKEK L DD+K DQNKEK R  E IG KNDEERIDW          +SDG+ NRD+  VDQGNAV+HLGGE+NSDGLKVGAQ SS
Subjt:  DQEEKESCRNTDKERGKEKILEDDRKADQNKEKSR--ELIGSKNDEERIDWAGDAGKDYMLKSDGQ-NRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSS

Query:  VMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKD
         MLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDD+AAED TSNLAGVKVLHG+DKVL GGAVVLTLKD
Subjt:  VMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKD

Query:  QSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNH
        Q+ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKK+LPQYDDPAAADEGLTLD  GR +NDAEKKLEELRKRLQGASSV H
Subjt:  QSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNH

Query:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQL
        FEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDS RQARK EQERSEAEMR +AYQSAYAKADEASRSLQL
Subjt:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQL

Query:  VQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEE
        VQ SS+RL+ NEDT I DDDEDLYKSLERARKLALKKQE AASGPEA+ALLATTTTS Q+TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEESHKPEEE
Subjt:  VQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEE

Query:  DVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK
        DVFMDDDEAPKE YHEDEKDKDGGWTEVKDTA+EEP PEDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPK
Subjt:  DVFMDDDEAPKE-YHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPK

Query:  ESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
        E+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH
Subjt:  ESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGH

Query:  VKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV
        VKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P N+GTKK K+
Subjt:  VKPGQTSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 13.6e-2026.32Show/hide
Query:  KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKA
        +E +  + R     G   ER R+R  E+   RG  +   + + R     R     E+ ++  +ER   REK  D  E  +   T         +E+  K 
Subjt:  KEKEYERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKA

Query:  DQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLS
               R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +
Subjt:  DQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLS

Query:  WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLE
        W++RSR+L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  ++LTLKD+ +L      E+ DVL 
Subjt:  WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLE

Query:  NVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVK
        NV + ++++ +   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S++         +
Subjt:  NVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVK

Query:  VSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIR
        ++ +Y T +EM+ FKK K++ K +RKKEK + + A +   +      GD GSR   R + R      SE E             D    ++ +       
Subjt:  VSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIR

Query:  LEDNEDTFIADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDEES
                + ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  + E 
Subjt:  LEDNEDTFIADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDEES

Query:  HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGI
         + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V  
Subjt:  HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLVGI

Query:  VDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMRE
        +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E
Subjt:  VDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMRE

Query:  AQAQLKTPYLVLSGHVK
         Q   KTPY+VLSG  K
Subjt:  AQAQLKTPYLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 11.6e-2026.59Show/hide
Query:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K
        ++  R ++ ++ + R     +  G+ ++R +E+  +R   R+  E E  RS  + RE+ + +  ++  K   R+   E     K    D  +   +E  K
Subjt:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K

Query:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR
         R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +W++RSR
Subjt:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR

Query:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE
        +L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  VVLTLKD+ +L +G+     DVL NV + +
Subjt:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE

Query:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY
        +++ D   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S+N         +++ +Y 
Subjt:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY

Query:  TQDEMLQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
        + +EM+ FKK K++ K +RKKEK ++     + +  G      GD GSR   R + R  E E    E       +    +D+        +  ++ + D 
Subjt:  TQDEMLQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFI-------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLD
        ED           ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  +
Subjt:  EDTFI-------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLD

Query:  EESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KL
         E  + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    
Subjt:  EESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KL

Query:  VGIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
        V  +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  
Subjt:  VGIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER

Query:  MREAQAQLKTPYLVLSGHVK
        ++E Q   KTPY+VLSG  K
Subjt:  MREAQAQLKTPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT25.4e-22658.52Show/hide
Query:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC
        +E RD   KE+   SK KEK+Y+RE+ R KD  +DKE+ R+R+  +D    ++  RG++K+R++D+ R  + KEKD+  E  R K RE  RD E EK+  
Subjt:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC

Query:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK
        R   KER  +K  EDD +  +  E+                               NR  ++           G +N D    G + S++ L+ RI  M+
Subjt:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK

Query:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE
        E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN 
Subjt:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE

Query:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS
        ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEELRKR+QG  + + FEDLN+S KVS
Subjt:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS

Query:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
         DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+ER E E R +AYQ A AKADEASR L+  Q    + +++
Subjt:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK
        E   +ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +Q+TDD+ T   E QEN VVFTEM +FVWGLQ + +  KPE EDVFM++D APK
Subjt:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK

Query:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS
           E +++   G TEV DT  +      D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+   
Subjt:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS

Query:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN
          D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP +
Subjt:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN

Query:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT
         FATVEKD+PG LTPMLGDRKVEHFLGIKRK EP NS T
Subjt:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 19.4e-2126.62Show/hide
Query:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K
        ++  R ++ ++ + R     +  G+ ++R +E+  +R   R+  E E  RS  + RE+ + +  ++  K   R+   E     K    D  +   +E  K
Subjt:  DRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKE-RGKEKILEDDRKADQNKE--K

Query:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR
         R  +G K  E           + + K  G   +    D  N +     EE  + L       +   E+R+   K  ++K   E+     +  +W++RSR
Subjt:  SRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEE----SEVLSWVKRSR

Query:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE
        +L++    EK+ A + +K+ EE   +DQ      L  E+         +  +L G+ V H +D   EG  VVLTLKD+ +L DG+     DVL NV + +
Subjt:  KLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAED---------TTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGE

Query:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY
        +++ D   +  KKK        D+  +D      + IL +YD+    +      L++ G      E++LEE+R +L+  A S++     +   +++ +Y 
Subjt:  QKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKILPQYDDPAAAD--EGLTLDERGRLTNDAEKKLEELRKRLQ-GASSVNHFEDLNASVKVSHDYY

Query:  TQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNED
        + +EM+ FKK K++ K +RKKEK + + A +   +      GD GSR   R + R  E E    E       +    +D+        +  ++ + D ED
Subjt:  TQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNED

Query:  TFI--------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDE
                    ++DE   +L K LE+ R+L   +Q +    SG + + ++    +  +  ++        ++  +VF    EF         +GL  + 
Subjt:  TFI--------ADDDE---DLYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF--------VWGLQLDE

Query:  ESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLV
        E  + E  D   D++ +     E + +++ GW+ V    +EE   +D  A +      E  V +GL++AL L +++G L+ +++   R     KS    V
Subjt:  ESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKS--KLV

Query:  GIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
          +++    + K S+ +E R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  +
Subjt:  GIVDED---EPKESKSKESR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM

Query:  REAQAQLKTPYLVLSGHVK
        +E Q   KTPY+VLSG  K
Subjt:  REAQAQLKTPYLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT3G14700.1 SART-1 family1.5e-1839.31Show/hide
Query:  EVKDTAEE--EPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERT
        EV   AE+  +   +++  +  D  + E  VG GLS AL  L+++GT KE            + K+VG+      K++  ++ R     D  K+I I+R 
Subjt:  EVKDTAEE--EPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERT

Query:  DEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
        +++GRIMT KE++R L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  KTPY+VL
Subjt:  DEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family3.8e-22758.52Show/hide
Query:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC
        +E RD   KE+   SK KEK+Y+RE+ R KD  +DKE+ R+R+  +D    ++  RG++K+R++D+ R  + KEKD+  E  R K RE  RD E EK+  
Subjt:  KEERDEHEKERSRGSKVKEKEYERERDR-KDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQE-EKESC

Query:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK
        R   KER  +K  EDD +  +  E+                               NR  ++           G +N D    G + S++ L+ RI  M+
Subjt:  RNTDKERGKEKILEDDRKADQNKEKSRELIGSKNDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMK

Query:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE
        E+R KK  + S+ LSWV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQS+L DGDVN 
Subjt:  EDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNE

Query:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS
        ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEELRKR+QG  + + FEDLN+S KVS
Subjt:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAAADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVS

Query:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN
         DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D RRQA KEE+ER E E R +AYQ A AKADEASR L+  Q    + +++
Subjt:  HDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQERSEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDN

Query:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK
        E   +ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +Q+TDD+ T   E QEN VVFTEM +FVWGLQ + +  KPE EDVFM++D APK
Subjt:  EDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPK

Query:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS
           E +++   G TEV DT  +      D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+   
Subjt:  EYHEDEKDKDGGWTEVKDT-AEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSS

Query:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN
          D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP +
Subjt:  LVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPIN

Query:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT
         FATVEKD+PG LTPMLGDRKVEHFLGIKRK EP NS T
Subjt:  AFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACGGAACGGTCATCAGTGCCTGATCATGATGAGAGAAATGGTCATGGGGAAGGAGGACACGATGACTTTGGATGGAGTGGAGCAGAAAAGTCTAGTAAGCATCG
GAGTGAGGATCATCGGAAGAATAGTCGAGGGGAGGAAAAAGACCACAGAAGTAAAGATCGAGACCGATCCAAGAGACGTAGTGACGATGCACCAAAAGAAAGGGAGAAAG
AGGTAAAAGATTCAGAAAGGGATCGAGTTCATAGTCGGGACAGGAGGAAGGAAGAAAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGTAGCAAAGTTAAAGAAAAAGAA
TATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAACGTGGGAGGGAGAGACAGTTGGAGAAGGATAATGTACGAGGACAAGACAAAGAAAGGGGAAAGGA
GAAAGACAGGGACAGGGATAGGGATAGGAAAAAAAAAGAAAAGGAAAAGGACAGGTCAAATGAAAATGAAAGAGAAAAAGGGAGAGAGAAACACAGAGATCAAGAGGAGA
AGGAAAGCTGTAGGAACACTGACAAGGAGAGAGGAAAAGAGAAAATTTTGGAAGATGACAGGAAAGCAGATCAAAACAAGGAGAAATCAAGGGAATTGATTGGCAGTAAA
AATGATGAGGAAAGAATTGATTGGGCTGGAGATGCGGGTAAGGATTATATGCTAAAAAGTGATGGTCAGAACCGGGACAAAGATGATGTTGATCAAGGGAATGCTGTCCG
GCATTTGGGAGGTGAAGAAAATTCTGATGGTTTGAAAGTTGGAGCTCAGCCTTCTTCAGTTATGCTTGAGGAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGC
AAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTCGTAAGCTTGAGGAGAAGAAACTTTCTGAAAAAGAAAAAGCATTGCAGCTCTCAAAGATTTTTGAGGAA
CAGGACAATATTGATCAAGGTGGAAGTGATGATGATCTTGCAGCAGAAGATACAACTAGTAATCTAGCAGGAGTTAAAGTACTTCATGGCGTAGATAAAGTACTGGAAGG
TGGTGCAGTTGTCTTAACACTTAAAGATCAGAGTATCTTAGCTGATGGTGACGTTAACGAAGACATGGATGTACTTGAGAATGTGGAAATTGGAGAGCAGAAGCAGAGAG
ACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATCTATGATGACAAGTTTAATGATGAAAATCATGGTGAGAAAAAGATTCTTCCACAGTATGATGATCCCGCAGCT
GCAGATGAGGGTCTAACTCTAGATGAAAGAGGACGTCTTACTAATGATGCAGAGAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAATCACTT
TGAAGATCTGAATGCATCAGTGAAAGTCTCACATGATTATTACACTCAAGATGAGATGCTTCAGTTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGC
TAGATATCGATGCCCTTGAAGCAGAAGCAATCTCTGCTGGATTGGGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGAAGGCAAGCACGTAAAGAGGAACAGGAAAGG
TCTGAAGCAGAAATGCGGCATAGTGCGTACCAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTCCAATTAGTTCAAACTAGTTCAATCAGATTAGAGGACAA
TGAAGATACTTTCATTGCAGACGATGATGAAGATCTCTATAAATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCGGACCAGAAGCTA
TTGCTCTTCTTGCTACAACAACGACCAGCAGTCAGTCAACTGATGATCATAACACAAAAGCTGGAGAGGTGCAGGAGAATAAGGTTGTGTTTACAGAAATGGAAGAATTT
GTATGGGGTCTCCAGCTTGATGAAGAATCTCATAAACCTGAGGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGAGAAGGATAAGGA
TGGTGGGTGGACTGAAGTTAAAGATACTGCCGAAGAAGAACCCACACCTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCCGTTGGAAAGGGAT
TATCCAGTGCGCTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGGAAAAGCAAACTTGTTGGTATAGTAGAC
GAAGACGAACCAAAGGAATCGAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTTGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGAC
TCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGGCAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGC
AGATGAAGAATGCAGATACACCTTCTTTGTCAGTGGAGAGAATGAGGGAGGCTCAGGCACAACTAAAGACCCCTTATCTTGTTCTCAGTGGTCATGTAAAACCTGGCCAA
ACGAGTGATCCAATAAATGCTTTTGCTACTGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAACATTTCTTGGGGATAAAGCGTAA
AGGCGAACCTACGAATTCAGGCACAAAGAAGTCCAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
GTGAATAAATTTGCAAAAATTTCGTTACAGTAAAAATACTTTACCATTTTATTCTCTCCTCATTTACCCATTACCCTAAAACCCAGACTTTCTATTTTTCTTCATAACAT
TCTCCATCTAAACAATTAGCCGCCACCACCCTCTCTCTTATCTCTTGACAAACTCCGGCGACACGAGAAGGGGCGGCGCGTCGACGAAATTCGGCGTATGCGTCTAAGGC
GGCGGCGGAACCCTACGGTATAACTACAACCACGAACGGCGGCGGTGTAGTAGCAAATCCGGCGACCTGCAGCAACAAATCGCGACTATCCACGGACGTTCGACGGCGGC
AGATTCCAGCGGTGGGGCGGTGACCGTGCAGACCCGGCAACCCCCTTCCAGCGAGTCTTCTCCCGGAGTGACAGATCGTAGACACCCAGATCCGGCGACCCTCCTATGAA
CCATATCTGATATAATTTCAAACTCACGACAACTCGATTAGCGATTACCATCACTCCGGTGATTAAGCAACGTAGGTAAGTAAGGGTTTCGGCAATGAAAATCCTCAAGT
AATGTGCTGCTAGCGCACAAAAATTCTCAAGGAGTGTGCAAGCGCAAGCTACTATATATTCTGGGCCAATGGACACGGAACGGTCATCAGTGCCTGATCATGATGAGAGA
AATGGTCATGGGGAAGGAGGACACGATGACTTTGGATGGAGTGGAGCAGAAAAGTCTAGTAAGCATCGGAGTGAGGATCATCGGAAGAATAGTCGAGGGGAGGAAAAAGA
CCACAGAAGTAAAGATCGAGACCGATCCAAGAGACGTAGTGACGATGCACCAAAAGAAAGGGAGAAAGAGGTAAAAGATTCAGAAAGGGATCGAGTTCATAGTCGGGACA
GGAGGAAGGAAGAAAGAGATGAGCATGAAAAAGAAAGGAGCAGGGGTAGCAAAGTTAAAGAAAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAA
GAACGTGGGAGGGAGAGACAGTTGGAGAAGGATAATGTACGAGGACAAGACAAAGAAAGGGGAAAGGAGAAAGACAGGGACAGGGATAGGGATAGGAAAAAAAAAGAAAA
GGAAAAGGACAGGTCAAATGAAAATGAAAGAGAAAAAGGGAGAGAGAAACACAGAGATCAAGAGGAGAAGGAAAGCTGTAGGAACACTGACAAGGAGAGAGGAAAAGAGA
AAATTTTGGAAGATGACAGGAAAGCAGATCAAAACAAGGAGAAATCAAGGGAATTGATTGGCAGTAAAAATGATGAGGAAAGAATTGATTGGGCTGGAGATGCGGGTAAG
GATTATATGCTAAAAAGTGATGGTCAGAACCGGGACAAAGATGATGTTGATCAAGGGAATGCTGTCCGGCATTTGGGAGGTGAAGAAAATTCTGATGGTTTGAAAGTTGG
AGCTCAGCCTTCTTCAGTTATGCTTGAGGAGCGCATTCGGACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAGGTTTTATCATGGGTTAAAAGGAGTC
GTAAGCTTGAGGAGAAGAAACTTTCTGAAAAAGAAAAAGCATTGCAGCTCTCAAAGATTTTTGAGGAACAGGACAATATTGATCAAGGTGGAAGTGATGATGATCTTGCA
GCAGAAGATACAACTAGTAATCTAGCAGGAGTTAAAGTACTTCATGGCGTAGATAAAGTACTGGAAGGTGGTGCAGTTGTCTTAACACTTAAAGATCAGAGTATCTTAGC
TGATGGTGACGTTAACGAAGACATGGATGTACTTGAGAATGTGGAAATTGGAGAGCAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATCTATG
ATGACAAGTTTAATGATGAAAATCATGGTGAGAAAAAGATTCTTCCACAGTATGATGATCCCGCAGCTGCAGATGAGGGTCTAACTCTAGATGAAAGAGGACGTCTTACT
AATGATGCAGAGAAGAAGCTTGAGGAGCTTCGGAAAAGATTACAGGGAGCTTCTTCAGTCAATCACTTTGAAGATCTGAATGCATCAGTGAAAGTCTCACATGATTATTA
CACTCAAGATGAGATGCTTCAGTTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAGCTAGATATCGATGCCCTTGAAGCAGAAGCAATCTCTGCTGGAT
TGGGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGAAGGCAAGCACGTAAAGAGGAACAGGAAAGGTCTGAAGCAGAAATGCGGCATAGTGCGTACCAGTCAGCCTAT
GCTAAAGCAGATGAGGCATCAAGATCTCTCCAATTAGTTCAAACTAGTTCAATCAGATTAGAGGACAATGAAGATACTTTCATTGCAGACGATGATGAAGATCTCTATAA
ATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCGGACCAGAAGCTATTGCTCTTCTTGCTACAACAACGACCAGCAGTCAGTCAACTG
ATGATCATAACACAAAAGCTGGAGAGGTGCAGGAGAATAAGGTTGTGTTTACAGAAATGGAAGAATTTGTATGGGGTCTCCAGCTTGATGAAGAATCTCATAAACCTGAG
GAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGAGAAGGATAAGGATGGTGGGTGGACTGAAGTTAAAGATACTGCCGAAGAAGAACC
CACACCTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCCGTTGGAAAGGGATTATCCAGTGCGCTGAAGCTGCTTAAAGATCGAGGAACTCTGA
AGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGGAAAAGCAAACTTGTTGGTATAGTAGACGAAGACGAACCAAAGGAATCGAAGTCAAAGGAATCCCGTTTA
TCTTCTTTGGTTGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGG
CAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCAGATACACCTTCTTTGTCAGTGGAGAGAA
TGAGGGAGGCTCAGGCACAACTAAAGACCCCTTATCTTGTTCTCAGTGGTCATGTAAAACCTGGCCAAACGAGTGATCCAATAAATGCTTTTGCTACTGTTGAAAAGGAT
CTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAACATTTCTTGGGGATAAAGCGTAAAGGCGAACCTACGAATTCAGGCACAAAGAAGTCCAAAGTTTG
AAGTTTCCACATCATTTCACGAATTCAGGCACATCCTTTCACATTGACAAGGCAATTTTATGGTCTCCAAGGCCTTATTGATCCAACTGTGCCGAGTGGGATGGCCATTG
CTCAGCAGATAAAAGTTACCTTCCTTTTTTTTAAGGAAAAAAAAAGAGGTGTTTATGGAAGTGAAGGACAATAGAACTTTTACTGTAACTTTTTCACTAGTGTCCTGTTT
ATGGCCCTCTGGTGTAAATGTCACTTTGTTACATTTTAAATGACCTTTTGATGAATTGGAGGGCTTTTTTGTAACCCATCATCATTGGGTCTTCCCTTTTTGTACATTTT
ATATTATCAATGAAATATGTCTCTTACCAAAAAAAAAAAAAAAGTTTGAAGTTTGCAACTTTAACAACAATT
Protein sequenceShow/hide protein sequence
MDTERSSVPDHDERNGHGEGGHDDFGWSGAEKSSKHRSEDHRKNSRGEEKDHRSKDRDRSKRRSDDAPKEREKEVKDSERDRVHSRDRRKEERDEHEKERSRGSKVKEKE
YERERDRKDRGKDKERGRERQLEKDNVRGQDKERGKEKDRDRDRDRKKKEKEKDRSNENEREKGREKHRDQEEKESCRNTDKERGKEKILEDDRKADQNKEKSRELIGSK
NDEERIDWAGDAGKDYMLKSDGQNRDKDDVDQGNAVRHLGGEENSDGLKVGAQPSSVMLEERIRTMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEE
QDNIDQGGSDDDLAAEDTTSNLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKILPQYDDPAA
ADEGLTLDERGRLTNDAEKKLEELRKRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSRRQARKEEQER
SEAEMRHSAYQSAYAKADEASRSLQLVQTSSIRLEDNEDTFIADDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQSTDDHNTKAGEVQENKVVFTEMEEF
VWGLQLDEESHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAEEEPTPEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD
EDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQ
TSDPINAFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPTNSGTKKSKV