; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020724 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020724
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSART-1 family protein DOT2
Genome locationtig00153554:498549..505949
RNA-Seq ExpressionSgr020724
SyntenySgr020724
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600325.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.14Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SSVPEHDER+GH+   ARD GE G DDFG+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------RDRDRKKKEKEKDRLNENGREKGREKY
        RDEH+KER+R  KVKDKDYDRE YKEKE+ERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD      RDRDRKKKEK+KDR NEN REKGREK 
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------RDRDRKKKEKEKDRLNENGREKGREKY

Query:  RDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPS
        RDQEEKESYRNIDKERGKEK L DD+K DQNKEK RDK+GIG KNDEERIDWIA G KDYML+SD E+NRD+  VDQG  V HLGGEENSDGLKVGAQ S
Subjt:  RDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPS

Query:  SAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLK
        SAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTLK
Subjt:  SAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAV
        DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE            
Subjt:  DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAV

Query:  IFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDLKQ
              LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R++ +++Q
Subjt:  IFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDLKQ

Query:  KC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME
                                      D +  +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLA TTTS QTTDDQNTKAG+LQENKVVFTEME
Subjt:  KC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME

Query:  EFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRN
        EFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEPT EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRN
Subjt:  EFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRN

Query:  MDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
        MDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
Subjt:  MDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER

Query:  MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK + +NTGTKK K+
Subjt:  MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0083.35Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SSVPEHDER+GH+   ARD GE G DDFG+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------RDRDRKKKEKEKDRLNENGREKGREKY
        RDEH+KER+R  KVKDKDYDRE YKEKE+ERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD      RDRDRKKKEK+KDR NEN REKGREK 
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------RDRDRKKKEKEKDRLNENGREKGREKY

Query:  RDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPS
        RDQEEKESYRNIDKERGKEK L DD+K DQNKEK RDK+GIG KNDEERIDWIA G KDYML+SD E+NRD+  VDQG  VQHLGGEENSDGLKVGAQ S
Subjt:  RDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPS

Query:  SAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLK
        SAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTLK
Subjt:  SAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLK

Query:  DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAV
        DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE            
Subjt:  DQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAV

Query:  IFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDLKQ
              LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R++ +++Q
Subjt:  IFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDLKQ

Query:  KC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME
                                      D +  +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS QTTDDQNTKAG+LQENKVVFTEME
Subjt:  KC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME

Query:  EFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRN
        EFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEPT EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGGRN
Subjt:  EFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRN

Query:  MDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
        MDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER
Subjt:  MDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVER

Query:  MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK + +NTGTKK K+
Subjt:  MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

XP_022149190.1 SART-1 family protein DOT2 [Momordica charantia]0.0e+0083.52Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MD ERSSVP+HDER+G        HGEGGHDDFGWSGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDR+KRRSDDA KEREKEVKD ERDRVHSR+RRKE+
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--DRDRDRKKKEKEKDRLNENGREKGREKYRDQE
        RDEHEKERSRGSKV          KEKE+ERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEK  DRDRDRKKKEKEKDR NEN REKGREK+RDQE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--DRDRDRKKKEKEKDRLNENGREKGREKYRDQE

Query:  EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAML
        EKES RN DKERGKEK LEDDRKADQNKEKSR+   IGSKNDEERIDW  D GKDYMLKSD + NRDKD VDQG  V+HLGGEENSDGLKVGAQPSS ML
Subjt:  EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAML

Query:  EDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNI
        E+RIRTMK+DR+KKQT+ESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDD+AAEDTTSNLAGVKVLHG+DKVLEGGAVVLTLKDQ+I
Subjt:  EDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNI

Query:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFAL
        LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE  GEKKILPQYDD AAADEGLTLDERGRLTNDAEKKLEE                
Subjt:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFAL

Query:  CQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQKCDIMHT-
          LR+RLQGASSVNHFEDL+ASVKVSHDYYTQDE+LQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS    + + + R ++  + + ++ H+ 
Subjt:  CQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQKCDIMHT-

Query:  ----------------------------------NDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE
                                          +DDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQ+TDD NTKAG++QENKVVFTEMEE
Subjt:  ----------------------------------NDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE

Query:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD
        FVWGLQLDEE+HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTA+EEPT EDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD
Subjt:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD

Query:  KRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR
        KRKSKLVGI+DEDEPKES SK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR
Subjt:  KRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR

Query:  EAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        EAQAQLKTPYLVLSGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRK E TN+GTKK+KV
Subjt:  EAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

XP_022942374.1 SART-1 family protein DOT2 [Cucurbita moschata]0.0e+0082.33Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SS PEHDER+GH+   ARD GE G DDFG SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------------RDRDRKKKEKEKDRLNENGRE
        RDEH+KER+R  KVKDKDYDRE YKEKE+ERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD            RDRDRKKKEK+KDR NEN RE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------------RDRDRKKKEKEKDRLNENGRE

Query:  KGREKYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLK
        KGREK RDQEEKESYRNIDK+RGKEK L DD+K DQNKEK RDK+G G KN+EERIDWIA G KDYML+SD E+NRD+  VDQG  VQ LGGEENSDGLK
Subjt:  KGREKYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLK

Query:  VGAQPSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGA
        VGAQ SSAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGA
Subjt:  VGAQPSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGA

Query:  VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLL
        VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE      
Subjt:  VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLL

Query:  FELPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRN
                    LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R+
Subjt:  FELPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRN

Query:  KKDLKQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKV
        + +++Q                              D +  +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS QTTDDQNTKAG+LQENKV
Subjt:  KKDLKQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKV

Query:  VFTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI
        VFTEMEEFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEPT EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESI
Subjt:  VFTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI

Query:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP
        EWGGRNMDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP
Subjt:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP

Query:  SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK + +NTGTKK K+
Subjt:  SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

XP_023534590.1 SART-1 family protein DOT2 [Cucurbita pepo subsp. pepo]0.0e+0082.97Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SSVPEHDER+GH+   ARD GE G DDFG+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD--------RDRDRKKKEKEKDRLNENGREKGRE
        RDEH+KER+R  KVKDKDYDRE YKEKE+ERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD        RDRDRKKKEK+KDR NEN REKGRE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD--------RDRDRKKKEKEKDRLNENGREKGRE

Query:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ
        K RDQEEKESYRNIDKERGKEK L DD+K DQNKEK RDK+GIG KNDEERIDWIA G KDYML+SD E+NRD+  VDQG  VQHLGGEENSDGLKVGAQ
Subjt:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ

Query:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLT
         SSAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLT
Subjt:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLT

Query:  LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELP
        LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE  GEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE          
Subjt:  LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELP

Query:  AVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDL
                LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R++ ++
Subjt:  AVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDL

Query:  KQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTE
        +Q                              D    +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTT S QTTDDQNTKAG+LQENKVVFTE
Subjt:  KQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTE

Query:  MEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGG
        MEEFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEPT EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGG
Subjt:  MEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGG

Query:  RNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV
        RNMDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV
Subjt:  RNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV

Query:  ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK + +NTGTKK K+
Subjt:  ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

TrEMBL top hitse value%identityAlignment
A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.0e+0079.04Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MD ERSS P  DER+G              DD G+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+R+KR SDDA KE+EKEVKD ERDRV SRE+RKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--------DRDRDRKKKEKEKDRLNENGREKGRE
        RDEHEKER RGSKVKDKDYDRE YK+KE+ERERDRKDRGKD+ER RERELEKDNVRG DKERGKEK        DRDRDRKKK+K+KDR NE  REKGRE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--------DRDRDRKKKEKEKDRLNENGREKGRE

Query:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ
        K+RDQE+KESYRN+DKERGKE+ LEDDRK DQ K+K +DK+GIGSKNDEER  WIAD GKDYML+SD ENNRD+D V+QG +VQHLGGEEN DGLKVG+ 
Subjt:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ

Query:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVV
        PSS MLE+RIR MK+DR+KKQT+ESEVL WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVV
Subjt:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVV

Query:  LTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFE
        LTLKDQ+ILADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD A ADEGLTLD RG   NDAEKKLEE        
Subjt:  LTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFE

Query:  LPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS--EMILEGKHEKRNKKD-
                  LRRRLQG SSV HFEDL+ S KVSHDYYTQDE+L+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGS  +   + K E++ K + 
Subjt:  LPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS--EMILEGKHEKRNKKD-

Query:  ------------------------------LKQKCDIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVV
                                      L+   D +  +DDED YKSLERARKLALKKQ +AASGP AIALLAT TTSSQ TDDQNTKAG+LQENKV+
Subjt:  ------------------------------LKQKCDIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVV

Query:  FTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIE
        FTEMEEFVWGLQLDE+AHKPEEEDVFMDDDE PK EYHED KDKDGGWTEVKDTAKEE   ++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIE
Subjt:  FTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIE

Query:  WGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS
        WGGRNMDKRKSKLVGI+DEDEPKES SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS
Subjt:  WGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS

Query:  LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK E++NTGTKKAKV
Subjt:  LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.0e+0079.04Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MD ERSS P  DER+G              DD G+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDR+R+KR SDDA KE+EKEVKD ERDRV SRE+RKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--------DRDRDRKKKEKEKDRLNENGREKGRE
        RDEHEKER RGSKVKDKDYDRE YK+KE+ERERDRKDRGKD+ER RERELEKDNVRG DKERGKEK        DRDRDRKKK+K+KDR NE  REKGRE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--------DRDRDRKKKEKEKDRLNENGREKGRE

Query:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ
        K+RDQE+KESYRN+DKERGKE+ LEDDRK DQ K+K +DK+GIGSKNDEER  WIAD GKDYML+SD ENNRD+D V+QG +VQHLGGEEN DGLKVG+ 
Subjt:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ

Query:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVV
        PSS MLE+RIR MK+DR+KKQT+ESEVL WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ  SDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVV
Subjt:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVV

Query:  LTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFE
        LTLKDQ+ILADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD A ADEGLTLD RG   NDAEKKLEE        
Subjt:  LTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFE

Query:  LPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS--EMILEGKHEKRNKKD-
                  LRRRLQG SSV HFEDL+ S KVSHDYYTQDE+L+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGS  +   + K E++ K + 
Subjt:  LPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS--EMILEGKHEKRNKKD-

Query:  ------------------------------LKQKCDIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVV
                                      L+   D +  +DDED YKSLERARKLALKKQ +AASGP AIALLAT TTSSQ TDDQNTKAG+LQENKV+
Subjt:  ------------------------------LKQKCDIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVV

Query:  FTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIE
        FTEMEEFVWGLQLDE+AHKPEEEDVFMDDDE PK EYHED KDKDGGWTEVKDTAKEE   ++N+A+APDETIHEVPVGKGLSSALKLLKDRGTLKESIE
Subjt:  FTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIE

Query:  WGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS
        WGGRNMDKRKSKLVGI+DEDEPKES SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS
Subjt:  WGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPS

Query:  LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK E++NTGTKKAKV
Subjt:  LSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

A0A6J1D793 SART-1 family protein DOT20.0e+0083.52Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MD ERSSVP+HDER+G        HGEGGHDDFGWSGAEKSSKHRSEDHRK+SRGEEKDHRSKDRDR+KRRSDDA KEREKEVKD ERDRVHSR+RRKE+
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--DRDRDRKKKEKEKDRLNENGREKGREKYRDQE
        RDEHEKERSRGSKV          KEKE+ERERDRKDRGKDKERGRER+LEKDNVRGQDKERGKEK  DRDRDRKKKEKEKDR NEN REKGREK+RDQE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEK--DRDRDRKKKEKEKDRLNENGREKGREKYRDQE

Query:  EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAML
        EKES RN DKERGKEK LEDDRKADQNKEKSR+   IGSKNDEERIDW  D GKDYMLKSD + NRDKD VDQG  V+HLGGEENSDGLKVGAQPSS ML
Subjt:  EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAML

Query:  EDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNI
        E+RIRTMK+DR+KKQT+ESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDD+AAEDTTSNLAGVKVLHG+DKVLEGGAVVLTLKDQ+I
Subjt:  EDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNI

Query:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFAL
        LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE  GEKKILPQYDD AAADEGLTLDERGRLTNDAEKKLEE                
Subjt:  LADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFAL

Query:  CQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQKCDIMHT-
          LR+RLQGASSVNHFEDL+ASVKVSHDYYTQDE+LQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS    + + + R ++  + + ++ H+ 
Subjt:  CQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQKCDIMHT-

Query:  ----------------------------------NDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE
                                          +DDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQ+TDD NTKAG++QENKVVFTEMEE
Subjt:  ----------------------------------NDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE

Query:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD
        FVWGLQLDEE+HKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTA+EEPT EDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD
Subjt:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMD

Query:  KRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR
        KRKSKLVGI+DEDEPKES SK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR
Subjt:  KRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMR

Query:  EAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        EAQAQLKTPYLVLSGHVKPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRK E TN+GTKK+KV
Subjt:  EAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

A0A6J1FR42 SART-1 family protein DOT20.0e+0082.33Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SS PEHDER+GH+   ARD GE G DDFG SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------------RDRDRKKKEKEKDRLNENGRE
        RDEH+KER+R  KVKDKDYDRE YKEKE+ERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD            RDRDRKKKEK+KDR NEN RE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD------------RDRDRKKKEKEKDRLNENGRE

Query:  KGREKYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLK
        KGREK RDQEEKESYRNIDK+RGKEK L DD+K DQNKEK RDK+G G KN+EERIDWIA G KDYML+SD E+NRD+  VDQG  VQ LGGEENSDGLK
Subjt:  KGREKYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLK

Query:  VGAQPSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGA
        VGAQ SSAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGA
Subjt:  VGAQPSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGA

Query:  VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLL
        VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE      
Subjt:  VVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLL

Query:  FELPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRN
                    LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R+
Subjt:  FELPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRN

Query:  KKDLKQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKV
        + +++Q                              D +  +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS QTTDDQNTKAG+LQENKV
Subjt:  KKDLKQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKV

Query:  VFTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI
        VFTEMEEFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEPT EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESI
Subjt:  VFTEMEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI

Query:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP
        EWGGRNMDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP
Subjt:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTP

Query:  SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK + +NTGTKK K+
Subjt:  SLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

A0A6J1IPE4 SART-1 family protein DOT20.0e+0082.15Show/hide
Query:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED
        MDA+ SSVPEHDER+GH+   ARD GE G DDFG+SGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDR+KRRSDDA KE+EKEVKD ERDRVH RERRKED
Subjt:  MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKED

Query:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD--------RDRDRKKKEKEKDRLNENGREKGRE
        RDEH+KER+R  KVKDKDYDRE YKEKE++RERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD        RDRDRKKKEK+KDR NEN REKGRE
Subjt:  RDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKD--------RDRDRKKKEKEKDRLNENGREKGRE

Query:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ
        K RDQEEKESYRNIDK+RGKEK L DD+K DQNKEK RDK+GIG KNDEERIDW+A G      +SD E+NRD+  VDQG  VQHLGGE+NSDGLKVGAQ
Subjt:  KYRDQEEKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQ

Query:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLT
         SSAMLE+RIRTMK+DR+KKQT+ESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNIDQG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLT
Subjt:  PSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLT

Query:  LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELP
        LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDE DGEKK+LPQYDD AAADEGLTLD  GR +NDAEKKLEE          
Subjt:  LKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELP

Query:  AVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDL
                LR+RLQGASSV HFEDL+ASVKVSHDYYTQDE+L+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG    S        ++R++ ++
Subjt:  AVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLG----SEMILEGKHEKRNKKDL

Query:  KQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTE
        +Q                              D +  +DDEDLYKSLERARKLALKKQ EAASGPEA+ALLATTTTS QTTDDQNTKAG+LQENKVVFTE
Subjt:  KQKC----------------------------DIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTE

Query:  MEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGG
        MEEFVWGLQLDEE+HKPEEEDVFMDDDEAPK EYHEDEKDKDGGWTEVKDTAKEEP  EDNE IAPDETIHEVPVGKGLSS LKLLKDRGTLKESIEWGG
Subjt:  MEEFVWGLQLDEEAHKPEEEDVFMDDDEAPK-EYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGG

Query:  RNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV
        RNMDKRKSKLVGIIDEDEPKE+ SKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV
Subjt:  RNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV

Query:  ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV
        ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRK +  NTGTKK K+
Subjt:  ERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 11.2e-2025Show/hide
Query:  KEKEHERERDRK-----DRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKERGKEKTLEDD
        + +EH++ + R        G+ ++R RER  E+ + R   +   +     R+R + E  + R+    R+ G E                     KT   D
Subjt:  KEKEHERERDRK-----DRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKERGKEKTLEDD

Query:  RKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE---
          +   +E ++ +  +G K  E        G K+  + +DV N      + Q + ++               +  +A  E R+   K  +IK   ++   
Subjt:  RKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE---

Query:  -SEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNE
          +   W++RSR+L++    EK+ A + +K+ EE   +DQ      +  E+         +  +L G+ V H ID   EG  ++LTLKD+ +L      E
Subjt:  -SEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNE

Query:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALC
        + DVL NV + ++++ +   +  KKK       +D+  D++  +K   IL +YD+    +      L++ G      E++LEE                 
Subjt:  DMDVLENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALC

Query:  QLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQ-------
        +LR + Q  S+V          +++ +Y T +E++ FKK K++ K +RKKEK + + A +   +      GD GS +   G+      ++ K+       
Subjt:  QLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQ-------

Query:  ----KCDIMHTNDDE--------------------DLYKSLERARKL-ALKKQEEAASGPEAIALLATTTTSSQT--TDDQNTKAGDLQENKVVFTEMEE
            + + M  +D+E                    +L K LE+ R+L  L++ ++     E +  +     S Q    +D++ +    ++  +VF    E
Subjt:  ----KCDIMHTNDDE--------------------DLYKSLERARKL-ALKKQEEAASGPEAIALLATTTTSSQT--TDDQNTKAGDLQENKVVFTEMEE

Query:  F--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI
        F         +GL  + E  + E  D   D++ +     E + +++ GW+ V     EE   +D  A +      E  V +GL++AL L +++G L+ ++
Subjt:  F--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESI

Query:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELK
        +   R     KS    +   ++      K SR                YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  
Subjt:  EWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELK

Query:  LKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
        LK+M ++DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  LKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 16.1e-2025.18Show/hide
Query:  YKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKERGKEKTLEDDRKAD
        +K+ +H         G+ ++R RER  E+ + R   +   +     R+R + E  + R+    R++G E                     K    D  + 
Subjt:  YKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKERGKEKTLEDDRKAD

Query:  QNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE----SEV
          +E ++ +  +G K  E        G K+  + +DV N      + Q + ++               +  +A  E R+   K  +IK   ++     + 
Subjt:  QNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE----SEV

Query:  LTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDV
          W++RSR+L++    EK+ A + +K+ EE   +DQ      +  E+         +  +L G+ V H ID   EG  VVLTLKD+ +L +G+     DV
Subjt:  LTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDV

Query:  LENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALCQLRR
        L NV + ++++ D   +  KKK       +D+  D++  +K   IL +YD+    +      L++ G      E++LEE                 + + 
Subjt:  LENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALCQLRR

Query:  RLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSEMILEGKH----------EKRNKKDLK
        RLQ A S+N         +++ +Y + +E++ FKK K++ K +RKKEK ++     + +  G      GD GS +   G+           E   K  + 
Subjt:  RLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGSEMILEGKH----------EKRNKKDLK

Query:  Q-------KCDIMHTNDDED-------------------LYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEM
        Q       + + M  +D+ED                   L K LE+ R+L   +Q +    SG + + ++    +  +  +++       ++  +VF   
Subjt:  Q-------KCDIMHTNDDED-------------------LYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEM

Query:  EEF--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKE
         EF         +GL  + E  + E  D   D++ +     E + +++ GW+ V     EE   +D  A +      E  V +GL++AL L +++G L+ 
Subjt:  EEF--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKE

Query:  SIEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEE
        +++   R     KS    +   ++      K SR                YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE
Subjt:  SIEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEE

Query:  LKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
          LK+M ++DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  LKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT27.8e-20953.75Show/hide
Query:  KEDRDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQ
        +E RD   KE+   SK K+KDYDRE  ++K+H R+   K++ +D++R R+ + EK+  RG+DKER K+K RDR      KEKD+  E  R K RE  RD 
Subjt:  KEDRDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQ

Query:  E-EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSA
        E EK+  R   KER  +K+ EDD +  +  E+    D  G          + +GG                              +N D    G + S+ 
Subjt:  E-EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSA

Query:  MLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ
         L++RI  M+++R KK  D S+ L+WV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQ
Subjt:  MLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ

Query:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIF
        ++L DGDVN ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEE              
Subjt:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIF

Query:  ALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS------EMILEGK----HEKRNK-
            LR+R+QG  + + FEDL++S KVS DY++Q+E+L+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGS      + + E K    +EKR+  
Subjt:  ALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS------EMILEGK----HEKRNK-

Query:  -KDLKQKCD---------------------IMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE
         ++   K D                     ++  +D EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +QTTDD  T   + QEN VVFTEM +
Subjt:  -KDLKQKCD---------------------IMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE

Query:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKE-EPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNM
        FVWGLQ + +  KPE EDVFM++D APK   E +++   G TEV DT  +    + D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNM
Subjt:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKE-EPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNM

Query:  DKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
        DK+KSKLVGI+D+D  KES  K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RM
Subjt:  DKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM

Query:  REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGT
        REAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK+E  N+ T
Subjt:  REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 17.2e-2126.16Show/hide
Query:  KEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKE-RGKEKTLEDDRKAD
        + +EH++ + R   G     G ER       R + +ERG E  R   R+  E E  R   +GRE+ + +  ++  K   R+   E     K    D  + 
Subjt:  KEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKE-RGKEKTLEDDRKAD

Query:  QNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE----SEV
          +E ++ +  +G K  E        G K+  + +DV N      + Q + ++               +  +A  E R+   K  +IK   ++     + 
Subjt:  QNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDE----SEV

Query:  LTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDV
          W++RSR+L++    EK+ A + +K+ EE   +DQ      +  E+         +  +L G+ V H ID   EG  VVLTLKD+ +L DG+     DV
Subjt:  LTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAED---------TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDV

Query:  LENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALCQLRR
        L NV + ++++ D   +  KKK       +D+  D++  +K   IL +YD+    +      L++ G      E++LEE                 +LR 
Subjt:  LENVEIGEQKQRDMAYKAAKKKTGIY---DDKFNDEIDGEK--KILPQYDDSAAAD--EGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALCQLRR

Query:  RLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSEMILEGKH----------EKRNKKDLKQ-
        + Q  SSV          +++ +Y + +E++ FKK K++ K +RKKEK + + A +   +      GD GS +   G+           E   K  + Q 
Subjt:  RLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGSEMILEGKH----------EKRNKKDLKQ-

Query:  ------KCDIMHTNDDED--------------------LYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME
              + + M  +D+ED                    L K LE+ R+L   +Q +    SG + + ++    +  +  +++       ++  +VF    
Subjt:  ------KCDIMHTNDDED--------------------LYKSLERARKLALKKQEEAA--SGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEME

Query:  EF--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKES
        EF         +GL  + E  + E  D   D++ +     E + +++ GW+ V     EE   +D  A +      E  V +GL++AL L +++G L+ +
Subjt:  EF--------VWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKES

Query:  IEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEEL
        ++   R     KS    +   ++      K SR                YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE 
Subjt:  IEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSR----------LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEEL

Query:  KLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK
         LK+M ++DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  KLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT3G14700.1 SART-1 family2.8e-2040.88Show/hide
Query:  EDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR
        +++  +  D  + E  VG GLS AL  L+++GT KE            + K+VG+      K++  +D R     D  K+I I+R +++GRIMT KE++R
Subjt:  EDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFR

Query:  QLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
         L H FHGKGPGK KQEK+ K++++  K KQM++++    SVER+RE  A  KTPY+VL
Subjt:  QLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family5.6e-21053.75Show/hide
Query:  KEDRDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQ
        +E RD   KE+   SK K+KDYDRE  ++K+H R+   K++ +D++R R+ + EK+  RG+DKER K+K RDR      KEKD+  E  R K RE  RD 
Subjt:  KEDRDEHEKERSRGSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQ

Query:  E-EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSA
        E EK+  R   KER  +K+ EDD +  +  E+    D  G          + +GG                              +N D    G + S+ 
Subjt:  E-EKESYRNIDKERGKEKTLEDDRKADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSA

Query:  MLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ
         L++RI  M+++R KK  D S+ L+WV RSRK+EEK+ +EK++A QLS+IFEEQDN++QG ++D    E    +L+GVKVLHG++KV+EGGAV+LTLKDQ
Subjt:  MLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ

Query:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIF
        ++L DGDVN ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKK+LPQYD+ AA DEG+ LD +GR T +AEKKLEE              
Subjt:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIF

Query:  ALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS------EMILEGK----HEKRNK-
            LR+R+QG  + + FEDL++S KVS DY++Q+E+L+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGS      + + E K    +EKR+  
Subjt:  ALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGS------EMILEGK----HEKRNK-

Query:  -KDLKQKCD---------------------IMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE
         ++   K D                     ++  +D EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T +QTTDD  T   + QEN VVFTEM +
Subjt:  -KDLKQKCD---------------------IMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTEMEE

Query:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKE-EPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNM
        FVWGLQ + +  KPE EDVFM++D APK   E +++   G TEV DT  +    + D + I PDE IHEV VGKGLS ALKLLKDRGTLKE +EWGGRNM
Subjt:  FVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKE-EPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNM

Query:  DKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
        DK+KSKLVGI+D+D  KES  K+S+     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS SV+RM
Subjt:  DKRKSKLVGIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM

Query:  REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGT
        REAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK+E  N+ T
Subjt:  REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCGGAACGGTCATCAGTACCTGAACATGATGAGAGAAGTGGTCATGATATCTTTCGGGCAAGAGATCATGGGGAAGGAGGACATGATGACTTTGGATGGAGTGG
AGCAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATAGGAAGAGTAGTCGAGGGGAGGAAAAAGATCATAGAAGTAAAGATCGAGACCGAGCAAAGAGACGTAGTGATG
ATGCACTGAAAGAAAGGGAGAAAGAGGTAAAAGATCCAGAAAGGGATCGAGTTCATAGTCGGGAGAGGAGGAAGGAAGATAGAGATGAGCATGAAAAAGAAAGGAGTAGG
GGTAGCAAAGTTAAAGACAAAGATTATGATAGAGAGAATTACAAGGAAAAAGAACATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAACGTGGAAGGGA
GAGAGAGTTGGAGAAAGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGGGACAGGGATAGAAAGAAAAAAGAAAAGGAAAAAGACAGATTAAATG
AAAATGGAAGGGAGAAGGGGAGAGAGAAATACAGAGATCAGGAGGAGAAGGAAAGCTATAGGAACATTGACAAGGAGAGAGGAAAAGAGAAAACTTTGGAAGATGACAGG
AAAGCAGATCAAAACAAGGAGAAATCACGAGATAAAGATGGAATTGGCAGCAAAAATGATGAGGAAAGAATTGATTGGATTGCAGATGGGGGTAAGGATTATATGCTAAA
AAGTGATGTTGAGAATAACAGAGACAAAGATGTTGTTGATCAAGGCAAGGTTGTCCAGCACTTGGGAGGTGAAGAAAATTCTGATGGCTTGAAAGTTGGAGCTCAGCCTT
CTTCAGCTATGCTTGAGGATCGCATTCGGACCATGAAAGACGACAGGATAAAGAAGCAAACTGACGAATCTGAGGTTTTAACATGGGTTAAAAGGAGTCGTAAGCTTGAG
GAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAGGATAATATTGATCAAGGTGGAAGCGATGATGATATTGCAGCAGAAGATAC
AACTAGTAATCTAGCAGGAGTTAAAGTACTTCATGGCATAGATAAAGTACTAGAAGGTGGTGCAGTTGTCTTAACCCTTAAAGATCAGAATATCTTAGCTGACGGTGACG
TTAATGAAGACATGGATGTACTTGAGAACGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACTGGGATCTATGATGACAAGTTT
AATGATGAAATAGATGGTGAGAAGAAGATACTTCCACAGTATGATGATTCGGCAGCTGCAGACGAGGGTTTAACTCTAGATGAAAGGGGACGTCTTACTAATGACGCAGA
GAAGAAGCTTGAGGAGTATATTATAATGCTTCTTTTTGAGCTTCCTGCTGTAATCTTTGCTCTTTGCCAGCTTCGGAGAAGATTACAGGGAGCTTCTTCAGTCAATCACT
TTGAAGATCTTGATGCATCAGTGAAAGTCTCACATGATTATTACACTCAAGATGAGATACTTCAATTTAAGAAGCCCAAGAAGAAGAAATCCCTTCGAAAGAAGGAAAAG
CTAGATATTGATGCCCTTGAAGCGGAAGCAATCTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCCGAAATGATTCTAGAAGGCAAGCACGAAAAGAGGAACAAGAAAGA
TCTGAAGCAGAAATGCGACATAATGCATACCAATGATGATGAAGATCTCTATAAATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCG
GACCAGAAGCAATTGCTCTTCTTGCTACAACAACAACCAGCAGTCAGACAACTGATGATCAAAACACAAAAGCTGGAGATTTGCAGGAAAATAAGGTTGTCTTTACAGAA
ATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAAGCTCATAAACCTGAGGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGA
GAAGGATAAGGATGGTGGGTGGACTGAAGTTAAAGATACTGCCAAAGAAGAACCCACCACTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCTG
TTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTT
GGTATAATAGATGAAGATGAACCAAAGGAATCTACGTCAAAGGATTCCCGTTTATCTTCTTTGGTTGATTACAAAAAGGAGATTCACATTGAGAGGACTGATGAGTTTGG
TCGAATTATGACTCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGGCAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGT
TGAAGTTGAAGCAGATGAAGAATGCAGATACACCTTCTTTATCGGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACACCTTACCTTGTTCTCAGTGGTCATGTA
AAACCTGGCCAAACGAGTGATCCAAGAAGTGGTTTTGCTACTGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGG
GATAAAGCGTAAAGCAGAATCTACAAATACAGGCACAAAGAAGGCAAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCGGAACGGTCATCAGTACCTGAACATGATGAGAGAAGTGGTCATGATATCTTTCGGGCAAGAGATCATGGGGAAGGAGGACATGATGACTTTGGATGGAGTGG
AGCAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATAGGAAGAGTAGTCGAGGGGAGGAAAAAGATCATAGAAGTAAAGATCGAGACCGAGCAAAGAGACGTAGTGATG
ATGCACTGAAAGAAAGGGAGAAAGAGGTAAAAGATCCAGAAAGGGATCGAGTTCATAGTCGGGAGAGGAGGAAGGAAGATAGAGATGAGCATGAAAAAGAAAGGAGTAGG
GGTAGCAAAGTTAAAGACAAAGATTATGATAGAGAGAATTACAAGGAAAAAGAACATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAAAGAACGTGGAAGGGA
GAGAGAGTTGGAGAAAGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGGGACAGGGATAGAAAGAAAAAAGAAAAGGAAAAAGACAGATTAAATG
AAAATGGAAGGGAGAAGGGGAGAGAGAAATACAGAGATCAGGAGGAGAAGGAAAGCTATAGGAACATTGACAAGGAGAGAGGAAAAGAGAAAACTTTGGAAGATGACAGG
AAAGCAGATCAAAACAAGGAGAAATCACGAGATAAAGATGGAATTGGCAGCAAAAATGATGAGGAAAGAATTGATTGGATTGCAGATGGGGGTAAGGATTATATGCTAAA
AAGTGATGTTGAGAATAACAGAGACAAAGATGTTGTTGATCAAGGCAAGGTTGTCCAGCACTTGGGAGGTGAAGAAAATTCTGATGGCTTGAAAGTTGGAGCTCAGCCTT
CTTCAGCTATGCTTGAGGATCGCATTCGGACCATGAAAGACGACAGGATAAAGAAGCAAACTGACGAATCTGAGGTTTTAACATGGGTTAAAAGGAGTCGTAAGCTTGAG
GAGAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAGGATAATATTGATCAAGGTGGAAGCGATGATGATATTGCAGCAGAAGATAC
AACTAGTAATCTAGCAGGAGTTAAAGTACTTCATGGCATAGATAAAGTACTAGAAGGTGGTGCAGTTGTCTTAACCCTTAAAGATCAGAATATCTTAGCTGACGGTGACG
TTAATGAAGACATGGATGTACTTGAGAACGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACTGGGATCTATGATGACAAGTTT
AATGATGAAATAGATGGTGAGAAGAAGATACTTCCACAGTATGATGATTCGGCAGCTGCAGACGAGGGTTTAACTCTAGATGAAAGGGGACGTCTTACTAATGACGCAGA
GAAGAAGCTTGAGGAGTATATTATAATGCTTCTTTTTGAGCTTCCTGCTGTAATCTTTGCTCTTTGCCAGCTTCGGAGAAGATTACAGGGAGCTTCTTCAGTCAATCACT
TTGAAGATCTTGATGCATCAGTGAAAGTCTCACATGATTATTACACTCAAGATGAGATACTTCAATTTAAGAAGCCCAAGAAGAAGAAATCCCTTCGAAAGAAGGAAAAG
CTAGATATTGATGCCCTTGAAGCGGAAGCAATCTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCCGAAATGATTCTAGAAGGCAAGCACGAAAAGAGGAACAAGAAAGA
TCTGAAGCAGAAATGCGACATAATGCATACCAATGATGATGAAGATCTCTATAAATCCTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCAGGAGGAGGCAGCATCCG
GACCAGAAGCAATTGCTCTTCTTGCTACAACAACAACCAGCAGTCAGACAACTGATGATCAAAACACAAAAGCTGGAGATTTGCAGGAAAATAAGGTTGTCTTTACAGAA
ATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAAGCTCATAAACCTGAGGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAATATCATGAAGATGA
GAAGGATAAGGATGGTGGGTGGACTGAAGTTAAAGATACTGCCAAAGAAGAACCCACCACTGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAAGTTCCTG
TTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGATCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTT
GGTATAATAGATGAAGATGAACCAAAGGAATCTACGTCAAAGGATTCCCGTTTATCTTCTTTGGTTGATTACAAAAAGGAGATTCACATTGAGAGGACTGATGAGTTTGG
TCGAATTATGACTCCAAAGGAGTCATTTCGACAGCTTTCTCACAAGTTCCATGGCAAGGGACCTGGCAAAATGAAGCAAGAAAAGCGCATGAAGCAATACCAAGAAGAGT
TGAAGTTGAAGCAGATGAAGAATGCAGATACACCTTCTTTATCGGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACACCTTACCTTGTTCTCAGTGGTCATGTA
AAACCTGGCCAAACGAGTGATCCAAGAAGTGGTTTTGCTACTGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGG
GATAAAGCGTAAAGCAGAATCTACAAATACAGGCACAAAGAAGGCAAAAGTTTGA
Protein sequenceShow/hide protein sequence
MDAERSSVPEHDERSGHDIFRARDHGEGGHDDFGWSGAEKSSKHRSEDHRKSSRGEEKDHRSKDRDRAKRRSDDALKEREKEVKDPERDRVHSRERRKEDRDEHEKERSR
GSKVKDKDYDRENYKEKEHERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRDRKKKEKEKDRLNENGREKGREKYRDQEEKESYRNIDKERGKEKTLEDDR
KADQNKEKSRDKDGIGSKNDEERIDWIADGGKDYMLKSDVENNRDKDVVDQGKVVQHLGGEENSDGLKVGAQPSSAMLEDRIRTMKDDRIKKQTDESEVLTWVKRSRKLE
EKKLSEKEKALQLSKIFEEQDNIDQGGSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKF
NDEIDGEKKILPQYDDSAAADEGLTLDERGRLTNDAEKKLEEYIIMLLFELPAVIFALCQLRRRLQGASSVNHFEDLDASVKVSHDYYTQDEILQFKKPKKKKSLRKKEK
LDIDALEAEAISAGLGVGDLGSEMILEGKHEKRNKKDLKQKCDIMHTNDDEDLYKSLERARKLALKKQEEAASGPEAIALLATTTTSSQTTDDQNTKAGDLQENKVVFTE
MEEFVWGLQLDEEAHKPEEEDVFMDDDEAPKEYHEDEKDKDGGWTEVKDTAKEEPTTEDNEAIAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDKRKSKLV
GIIDEDEPKESTSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAESTNTGTKKAKV