; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005915 (gene) of Snake gourd v1 genome

Gene IDTan0005915
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSART-1 family protein DOT2
Genome locationLG10:47562004..47573830
RNA-Seq ExpressionTan0005915
SyntenyTan0005915
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009908 - flower development (biological process)
GO:0009933 - meristem structural organization (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048528 - post-embryonic root development (biological process)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
InterPro domainsIPR005011 - SNU66/SART1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600325.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.82Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SSVPEHD RNGH   +ARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--RERDRDRKKKEKDKDRSNEIEREKGREKHR
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD  RERDRDRKKKEKDKDRSNE EREKGREK R
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--RERDRDRKKKEKDKDRSNEIEREKGREKHR

Query:  DQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSS
        DQEEKESYRNIDK+RGKE+ L DD+K DQNKEK RDKEGIG KN+ ER+DWIA GAKDYMLESDGE+NRDR  VDQGN + HLGGEENSDGLKVGAQSSS
Subjt:  DQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSS

Query:  AMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKD
        AMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTLKD
Subjt:  AMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKD

Query:  QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNH
        QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQGASSV H
Subjt:  QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNH

Query:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQS
        FEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEASRSLQ 
Subjt:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQS

Query:  VQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED
        VQSS+RL+DNEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+HKP+EED
Subjt:  VQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED

Query:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEPTPEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE
Subjt:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        AKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+PSNTGTKK KI
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

KAG7030981.1 SART-1 family protein DOT2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0091.93Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SSVPEHD RNGH   +ARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--RERDRDRKKKEKDKDRSNEIEREKGREKHR
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD  RERDRDRKKKEKDKDRSNE EREKGREK R
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--RERDRDRKKKEKDKDRSNEIEREKGREKHR

Query:  DQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSS
        DQEEKESYRNIDK+RGKE+ L DD+K DQNKEK RDKEGIG KN+ ER+DWIA GAKDYMLESDGE+NRDR  VDQGN +QHLGGEENSDGLKVGAQSSS
Subjt:  DQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSS

Query:  AMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKD
        AMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTLKD
Subjt:  AMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKD

Query:  QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNH
        QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQGASSV H
Subjt:  QNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNH

Query:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQS
        FEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEASRSLQ 
Subjt:  FEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQS

Query:  VQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED
        VQSS+RL+DNEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+HKP+EED
Subjt:  VQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED

Query:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEPTPEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEPKE
Subjt:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        AKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+PSNTGTKK KI
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

XP_022942374.1 SART-1 family protein DOT2 [Cucurbita moschata]0.0e+0091.04Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SS PEHD RNGH   +ARDRGEGQDDFG SGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--------RERDRDRKKKEKDKDRSNEIEREK
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD        RERDRDRKKKEKDKDRSNE EREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--------RERDRDRKKKEKDKDRSNEIEREK

Query:  GREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKV
        GREK RDQEEKESYRNIDKDRGKE+ L DD+K DQNKEK RDKEG G KN  ER+DWIA GAKDYMLESDGE+NRDR  VDQGN +Q LGGEENSDGLKV
Subjt:  GREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKV

Query:  GAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAV
        GAQSSSAMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAV
Subjt:  GAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAV

Query:  VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQG
        VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQG
Subjt:  VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQG

Query:  ASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEA
        ASSV HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEA
Subjt:  ASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEA

Query:  SRSLQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAH
        SRSLQ VQSS+RL+DNEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+H
Subjt:  SRSLQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAH

Query:  KPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD
        KP+EEDVFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEPTPEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+D
Subjt:  KPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD

Query:  EDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
        EDEPKEAKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
Subjt:  EDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL

Query:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+PSNTGTKK KI
Subjt:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

XP_022978951.1 SART-1 family protein DOT2 [Cucurbita maxima]0.0e+0090.59Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SSVPEHD RNGH   +ARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK
        DEH+KER+R  KVKDKDYDRE+YKEKEY+RERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD    RERDRDRKKKEKDKDRSNE EREKGREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK

Query:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS
         RDQEEKESYRNIDKDRGKE+ L DD+K DQNKEK RDKEGIG KN+ ER+DW+A G      ESDGE+NRDR  VDQGN +QHLGGE+NSDGLKVGAQS
Subjt:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS

Query:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL
        SSAMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTL
Subjt:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL

Query:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV
        KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQGASSV
Subjt:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV

Query:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL
         HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEASRSL
Subjt:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL

Query:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE
        Q VQSS+RL+ NEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+HKP+E
Subjt:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE

Query:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEP PEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEP
Subjt:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KEAKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P NTGTKK KI
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

XP_023534590.1 SART-1 family protein DOT2 [Cucurbita pepo subsp. pepo]0.0e+0091.42Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SSVPEHD RNGH   +ARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD    RERDRDRKKKEKDKDRSNE EREKGREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK

Query:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS
         RDQEEKESYRNIDK+RGKE+ L DD+K DQNKEK RDKEGIG KN+ ER+DWIA GAKDYMLESDGE+NRDR  VDQGN +QHLGGEENSDGLKVGAQS
Subjt:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS

Query:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL
        SSAMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTL
Subjt:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL

Query:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV
        KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQGASSV
Subjt:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV

Query:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL
         HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLG RNDSSRQARK EQERSEAEMR NAY  AYAKADEASRSL
Subjt:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL

Query:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE
        Q VQSS+RL+DNEDT + DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT T SG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+HKP+E
Subjt:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE

Query:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEPTPEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEP
Subjt:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KEAKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+PSNTGTKK KI
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

TrEMBL top hitse value%identityAlignment
A0A1S3CAS5 SART-1 family protein DOT2 isoform X20.0e+0086.24Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MD E SS P  D RNG             DD GYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDR+RSKR S+DASKEKEKEVK+SERDRV  RE+RKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDR----DRERDRDRKKKEKDKDRSNEIEREKGREK
        DEHEKER RGSKVKDKDYDREIYK+KEYERERDRKD GKD+ER RERELEKDNVRG DKERGKEKDR    DR+RDRDRKKK+KDKDRSNEIEREKGREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDR----DRERDRDRKKKEKDKDRSNEIEREKGREK

Query:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS
        HRDQE+KESYRN+DK+RGKER LEDDRK DQ K+K +DKEGIGSKN+ ER  WIAD  KDYMLESDGENNRDRD V+QGN +QHLGGEEN DGLKVG+  
Subjt:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS

Query:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVVL
        SS MLEERIR MKEDRLKKQTEESEVLAWVKRSRKLEE+KLSEKEKALQLSK+FEEQDNI Q VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVL
Subjt:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVVL

Query:  TLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGAS
        TLKDQ+ILADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELRRRLQG S
Subjt:  TLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGAS

Query:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR
        SV HFEDLN S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDS RQA+K EQE+SEAEMR NAY  AYAKADEASR
Subjt:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR

Query:  SLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHK
        SLQ VQ SS RLEDN+D L+ADDDED YKSLERARKLALKK ++AASGP AIALLATATTS   TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+AHK
Subjt:  SLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHK

Query:  PDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
        P+EEDVFMDDDE PKEE  +D KDKDGGWTEVKDTAKEE  P++N+A+APDETIHE+PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
Subjt:  PDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE

Query:  DEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        DEPKE+KSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
Subjt:  DEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE SNTGTKKAK+
Subjt:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

A0A1S4E2I4 SART-1 family protein DOT2 isoform X10.0e+0086.24Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MD E SS P  D RNG             DD GYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDR+RSKR S+DASKEKEKEVK+SERDRV  RE+RKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDR----DRERDRDRKKKEKDKDRSNEIEREKGREK
        DEHEKER RGSKVKDKDYDREIYK+KEYERERDRKD GKD+ER RERELEKDNVRG DKERGKEKDR    DR+RDRDRKKK+KDKDRSNEIEREKGREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDR----DRERDRDRKKKEKDKDRSNEIEREKGREK

Query:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS
        HRDQE+KESYRN+DK+RGKER LEDDRK DQ K+K +DKEGIGSKN+ ER  WIAD  KDYMLESDGENNRDRD V+QGN +QHLGGEEN DGLKVG+  
Subjt:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS

Query:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVVL
        SS MLEERIR MKEDRLKKQTEESEVLAWVKRSRKLEE+KLSEKEKALQLSK+FEEQDNI Q VSDDDIA E+TT+N  L GVKVLHG+DKVLEGGAVVL
Subjt:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSN--LAGVKVLHGIDKVLEGGAVVL

Query:  TLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGAS
        TLKDQ+ILADGDVNE++D+LENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDG G F+NDAEKKLEELRRRLQG S
Subjt:  TLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGAS

Query:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR
        SV HFEDLN S KVSHDYYTQDEML+FKKP+KKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDS RQA+K EQE+SEAEMR NAY  AYAKADEASR
Subjt:  SVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR

Query:  SLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHK
        SLQ VQ SS RLEDN+D L+ADDDED YKSLERARKLALKK ++AASGP AIALLATATTS   TDDQNTKAGELQENKV+FTEMEEFVWGLQLDE+AHK
Subjt:  SLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHK

Query:  PDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
        P+EEDVFMDDDE PKEE  +D KDKDGGWTEVKDTAKEE  P++N+A+APDETIHE+PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE
Subjt:  PDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDE

Query:  DEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
        DEPKE+KSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV
Subjt:  DEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLV

Query:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGE SNTGTKKAK+
Subjt:  LSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

A0A6J1D793 SART-1 family protein DOT20.0e+0087.84Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGE-GQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKED
        MD E SSVP+HD RNGH        GE G DDFG+SGAEKSSKHRSEDHRK+SRGE+KDHRSKDRDRSKRRS+DA KE+EKEVK+SERDRVH R+RRKE+
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGE-GQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKED

Query:  RDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRD
        RDEHEKERSRGSKV          KEKEYERERDRKD GKDKERGRER+LEKDNVRGQDKERGKEKDRD  RDRDRKKKEK+KDRSNE EREKGREKHRD
Subjt:  RDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRD

Query:  QEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSA
        QEEKES RN DK+RGKE+ LEDDRKADQNKEK R  E IGSKN+ ER+DW  D  KDYML+SDG+ NRD+D VDQGN ++HLGGEENSDGLKVGAQ SS 
Subjt:  QEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSA

Query:  MLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ
        MLEERIRTMKEDRLKKQTEESEVL+WVKRSRKLEE+KLSEKEKALQLSK+FEEQDNI QG SDDD+AAEDTTSNLAGVKVLHG+DKVLEGGAVVLTLKDQ
Subjt:  MLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQ

Query:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNHF
        +ILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKK+LPQYDDPA+ADEGLTLD  GR +NDAEKKLEELR+RLQGASSVNHF
Subjt:  NILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNHF

Query:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQSV
        EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDS RQARK EQERSEAEMRH+AY  AYAKADEASRSLQ V
Subjt:  EDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQSV

Query:  Q-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED
        Q SSIRLEDNEDT +ADDDEDLYKSLERARKLALKK+EEAASGPEAIALLAT TTS  +TDD NTKAGE+QENKVVFTEMEEFVWGLQLDEE+HKP+EED
Subjt:  Q-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEED

Query:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
        VFMDDDEAPK E  +DEKDKDGGWTEVKDTA+EEPTPEDNEAIAPDETIHE+PVGKGLSS LKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE
Subjt:  VFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKE

Query:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
        +KSK+ RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Subjt:  AKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV

Query:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        KPGQTSDP + FATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEP+N+GTKK+K+
Subjt:  KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

A0A6J1FR42 SART-1 family protein DOT20.0e+0091.04Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SS PEHD RNGH   +ARDRGEGQDDFG SGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--------RERDRDRKKKEKDKDRSNEIEREK
        DEH+KER+R  KVKDKDYDRE+YKEKEYERERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD        RERDRDRKKKEKDKDRSNE EREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD--------RERDRDRKKKEKDKDRSNEIEREK

Query:  GREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKV
        GREK RDQEEKESYRNIDKDRGKE+ L DD+K DQNKEK RDKEG G KN  ER+DWIA GAKDYMLESDGE+NRDR  VDQGN +Q LGGEENSDGLKV
Subjt:  GREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKV

Query:  GAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAV
        GAQSSSAMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAV
Subjt:  GAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAV

Query:  VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQG
        VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQG
Subjt:  VLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQG

Query:  ASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEA
        ASSV HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEA
Subjt:  ASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEA

Query:  SRSLQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAH
        SRSLQ VQSS+RL+DNEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+H
Subjt:  SRSLQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAH

Query:  KPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD
        KP+EEDVFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEPTPEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+D
Subjt:  KPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVD

Query:  EDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
        EDEPKEAKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL
Subjt:  EDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYL

Query:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+PSNTGTKK KI
Subjt:  VLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

A0A6J1IPE4 SART-1 family protein DOT20.0e+0090.59Show/hide
Query:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR
        MDA+ SSVPEHD RNGH   +ARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGE+KDHRSKDRDRSKRRS+DASKEKEKEVK+SERDRVH RERRKEDR
Subjt:  MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDR

Query:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK
        DEH+KER+R  KVKDKDYDRE+YKEKEY+RERDRKD GKDKERGRERELEKDNVRGQDKERGKEKDRD    RERDRDRKKKEKDKDRSNE EREKGREK
Subjt:  DEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRD----RERDRDRKKKEKDKDRSNEIEREKGREK

Query:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS
         RDQEEKESYRNIDKDRGKE+ L DD+K DQNKEK RDKEGIG KN+ ER+DW+A G      ESDGE+NRDR  VDQGN +QHLGGE+NSDGLKVGAQS
Subjt:  HRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQS

Query:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL
        SSAMLEERIRTMKEDRLKKQTEESEVL WVKRSRKLEE+KL+EKEKALQLSK+FEEQDNI QG SDDDIAAED TSNLAGVKVLHGIDKVL GGAVVLTL
Subjt:  SSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTL

Query:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV
        KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA+ADEGLTLDGTGRFSNDAEKKLEELR+RLQGASSV
Subjt:  KDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSV

Query:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL
         HFEDLNASVKVSHDYYTQDEML+FKKPKKKKSLRKKEKLDIDALEAEAIS+GLGVGDLGSRNDSSRQARK EQERSEAEMR NAY  AYAKADEASRSL
Subjt:  NHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSL

Query:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE
        Q VQSS+RL+ NEDTL+ DDDEDLYKSLERARKLALKK +EAASGPEA+ALLAT TTSG TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEE+HKP+E
Subjt:  QSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDE

Query:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP
        EDVFMDDDEAPKEE  +DEKDKDGGWTEVKDTAKEEP PEDNE IAPDETIHE+PVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGI+DEDEP
Subjt:  EDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKSKLVGIVDEDEP

Query:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
        KEAKSKD RLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Subjt:  KEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG

Query:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI
        HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+P NTGTKK KI
Subjt:  HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI

SwissProt top hitse value%identityAlignment
O43290 U4/U6.U5 tri-snRNP-associated protein 11.6e-2026.66Show/hide
Query:  YKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDD
        ++E +  + R    GG   ER R+R  E+   RG  +   + + R     R+R + E         ER   REK  D  E  +            ++E+ 
Subjt:  YKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDD

Query:  RKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE---
                K R K G+                +   ++ +     +    D  N +     EE  + L       +A  E+R+   K  ++K   E+   
Subjt:  RKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE---

Query:  -SEVLAWVKRSRKLEEEK-LSEKEKALQLSKVFEEQDNIAQGVS---DDDIAAED----TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNED
          +  AW++RSR+L++EK L+EK      +K+ EE D    GVS   +++         +  +L G+ V H ID   EG  ++LTLKD+ +L      E+
Subjt:  -SEVLAWVKRSRKLEEEK-LSEKEKALQLSKVFEEQDNIAQGVS---DDDIAAED----TTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNED

Query:  MDVLENVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDL
         DVL NV + ++++ +   +  KKK        D+  +D      + +L +YD+    +      L+  G      E++LEE+R +L+  A S++     
Subjt:  MDVLENVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDL

Query:  NASVKVSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAK---ADEASRS
            +++ +Y T +EM+ FKK K++ K +RKKEK + + A +   +      GD GS  R    R+  +VE+E+        +          +DE    
Subjt:  NASVKVSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAK---ADEASRS

Query:  LQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKL-ALKKREEAASGPEAIALLATATTS---GPTTDDQNTKAGELQENKVVFTEMEEF--------VW
             S   LE++E  L      +L K LE+ R+L  L++ ++     E +  +     S   G   D+   + G      +VF    EF         +
Subjt:  LQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKL-ALKKREEAASGPEAIALLATATTS---GPTTDDQNTKAGELQENKVVFTEMEEF--------VW

Query:  GLQLDEEAHKPDEEDVFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNM
        GL     A   +E++  MD   D+E       + + +++ GW+ V     EE   +D  A +      E  V +GL++ L L +++G L+ +++   R  
Subjt:  GLQLDEEAHKPDEEDVFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNM

Query:  DKRKS--KLVGIVDED---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNA
           KS    V  +++    + K ++ +++R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++
Subjt:  DKRKS--KLVGIVDED---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNA

Query:  DTPSLSVERMREAQAQLKTPYLVLSGHVK
        DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  DTPSLSVERMREAQAQLKTPYLVLSGHVK

Q5XIW8 U4/U6.U5 tri-snRNP-associated protein 12.3e-2226.78Show/hide
Query:  GGKDKERGRERELEKDNVR--GQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRD
        GG  ++  R RE +K   R  G     G+ + R RER  +R        R  E E   G       + + S R +     +E+  E    A  +K    D
Subjt:  GGKDKERGRERELEKDNVR--GQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRD

Query:  KEGIG-SKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE----SEVLAWVKRS
           +   + N  R        +   ++ +     +    D  N +     EE  + L       +A  E+R+   K  ++K   E+     +  AW++RS
Subjt:  KEGIG-SKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE----SEVLAWVKRS

Query:  RKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDD-DIAAEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRD
        R+L++EK    EK  +L +  +++  ++  V ++ +   +D  S  +L G+ V H ID   EG  VVLTLKD+ +L +G+     DVL NV + ++++ D
Subjt:  RKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDD-DIAAEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRD

Query:  MAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDLNASVKVSHDYYTQDEM
           +  KKK        D+  +D      + +L +YD+    +      L+  G      E++LEE+R +L+  A S+N         +++ +Y + +EM
Subjt:  MAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDLNASVKVSHDYYTQDEM

Query:  LQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR----SLQSVQSSIRLEDN
        + FKK K++ K +RKKEK ++     + +  G      GD GS  R    R+  +VE+E  E E +     +A     + +R     +   +    L   
Subjt:  LQFKKPKKK-KSLRKKEKLDIDALEAEAISAG---LGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASR----SLQSVQSSIRLEDN

Query:  EDTLVADDDE-DLYKSLERARKL-ALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEAHKPDEED
           L  D+ E +L K LE+ R+L  L++ ++     E +  +     S     ++  +    ++  +VF    EF         +GL     A   +E++
Subjt:  EDTLVADDDE-DLYKSLERARKL-ALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEEAHKPDEED

Query:  VFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDE
          MD   D+E       + + +++ GW+ V     EE   +D  A +      E  V +GL++ L L +++G L+ +++   R     KS    V  +++
Subjt:  VFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRNMDKRKS--KLVGIVDE

Query:  D---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQA
            + K ++ +++R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V  ++E Q 
Subjt:  D---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQA

Query:  QLKTPYLVLSGHVK
          KTPY+VLSG  K
Subjt:  QLKTPYLVLSGHVK

Q9LFE0 SART-1 family protein DOT22.8e-22255.01Show/hide
Query:  DASKEKEKEVKESERDRVHGRERRKEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERD
        +  K K +     ER    G   R E RD   KE+   SK K+KDYDRE  ++K++ R+++++   +D++R R+ + EK+  RG+DKER K+K RDR ++
Subjt:  DASKEKEKEVKESERDRVHGRERRKEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERD

Query:  RDR-KKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDV
        +D+ K++ + KDR NE + EK ++K R +          K+R  +++ EDD +  +  E++   +  G    G                           
Subjt:  RDR-KKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDV

Query:  VDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTT
                     +N D    G ++S+  L+ RI  M+E+R KK  + S+ L+WV RSRK+EE++ +EK++A QLS++FEEQDN+ QG ++D    E   
Subjt:  VDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTT

Query:  SNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDG
         +L+GVKVLHG++KV+EGGAV+LTLKDQ++L DGDVN ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A+ DEG+ LD 
Subjt:  SNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDG

Query:  TGRFSNDAEKKLEELRRRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQE
         GRF+ +AEKKLEELR+R+QG  + + FEDLN+S KVS DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D  RQA K E+E
Subjt:  TGRFSNDAEKKLEELRRRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQE

Query:  RSEAEMRHNAYHLAYAKADEASRSLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQE
        R E E R NAY  A AKADEASR L+  Q    + +++E  ++ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T+  TTDD  T   E QE
Subjt:  RSEAEMRHNAYHLAYAKADEASRSLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQE

Query:  NKVVFTEMEEFVWGLQLDEEAHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKE-EPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTL
        N VVFTEM +FVWGLQ + +  KP+ EDVFM++D APK      E+  D G TEV DT  +      D + I PDE IHE+ VGKGLS  LKLLKDRGTL
Subjt:  NKVVFTEMEEFVWGLQLDEEAHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKE-EPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTL

Query:  KESIEWGGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN
        KE +EWGGRNMDK+KSKLVGIVD+D  KE+K K+ +     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN
Subjt:  KESIEWGGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN

Query:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGT
        +DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK EP N+ T
Subjt:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGT

Q9Z315 U4/U6.U5 tri-snRNP-associated protein 19.6e-2126.27Show/hide
Query:  KEKEYERERDRKDGGKDK--ERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLED
        + +E+++ + R  GG     ER R+R  E+ + RG  +   + + R     R+R + E         ER   REK  D  E  +            ++E+
Subjt:  KEKEYERERDRKDGGKDK--ERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLED

Query:  DRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE--
          K          +     K  G + + +A    + M     E  R++                            +A  E+R+   K  ++K   E+  
Subjt:  DRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEE--

Query:  --SEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDD-DIAAEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVL
           +  AW++RSR+L++EK    EK  +L +  +++  ++  V ++ +   +D  S  +L G+ V H ID   EG  VVLTLKD+ +L DG+     DVL
Subjt:  --SEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDD-DIAAEDTTS--NLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVL

Query:  ENVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDLNASV
         NV + ++++ D   +  KKK        D+  +D      + +L +YD+    +      L+  G      E++LEE+R +L+  A S++     +   
Subjt:  ENVEIGEQKQRDMAYKAAKKKTGIY----DDKFND-ENHGEKKMLPQYDDPASAD--EGLTLDGTGRFSNDAEKKLEELRRRLQ-GASSVNHFEDLNASV

Query:  KVSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQSVQSS
        +++ +Y + +EM+ FKK K++ K +RKKEK + + A +   +      GD GS  R    R+  +VE+E  E E +     +A     + +R       +
Subjt:  KVSHDYYTQDEMLQFKKPKKK-KSLRKKEK-LDIDALEAEAISAGLGVGDLGS--RNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQSVQSS

Query:  IRLEDNED----------TLVADDDE-DLYKSLERARKL-ALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEF--------V
        + + D ED           L  D+ E +L K LE+ R+L  L++ ++     E +  +     S     ++  +    ++  +VF    EF         
Subjt:  IRLEDNED----------TLVADDDE-DLYKSLERARKL-ALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQENKVVFTEMEEF--------V

Query:  WGLQLDEEAHKPDEEDVFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRN
        +GL     A   +E++  MD   D+E       + + +++ GW+ V     EE   +D  A +      E  V +GL++ L L +++G L+ +++   R 
Subjt:  WGLQLDEEAHKPDEEDVFMD---DDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEWGGRN

Query:  MDKRKS--KLVGIVDED---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN
            KS    V  +++    + K ++ +++R           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M +
Subjt:  MDKRKS--KLVGIVDED---EPKEAKSKDFR-----LSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN

Query:  ADTPSLSVERMREAQAQLKTPYLVLSGHVK
        +DTP  +V  ++E Q   KTPY+VLSG  K
Subjt:  ADTPSLSVERMREAQAQLKTPYLVLSGHVK

Arabidopsis top hitse value%identityAlignment
AT3G14700.1 SART-1 family3.3e-1634.25Show/hide
Query:  EEFVWGLQLDEE---AHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEW
        ++ V  + LD+E   +    + D+    DE  + ES +  ++      ++ D A      +++  +  D  + E  VG GLS  L  L+++GT KE    
Subjt:  EEFVWGLQLDEE---AHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLKESIEW

Query:  GGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSL
                + K+VG+ D +       +D R     D  K+I I+R +++GRIMT KE++R L H FHGKGPGK KQEK+ K++++  K KQM++++    
Subjt:  GGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSL

Query:  SVERMREAQAQLKTPYLVL
        SVER+RE  A  KTPY+VL
Subjt:  SVERMREAQAQLKTPYLVL

AT5G16780.1 SART-1 family2.0e-22355.01Show/hide
Query:  DASKEKEKEVKESERDRVHGRERRKEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERD
        +  K K +     ER    G   R E RD   KE+   SK K+KDYDRE  ++K++ R+++++   +D++R R+ + EK+  RG+DKER K+K RDR ++
Subjt:  DASKEKEKEVKESERDRVHGRERRKEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERD

Query:  RDR-KKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDV
        +D+ K++ + KDR NE + EK ++K R +          K+R  +++ EDD +  +  E++   +  G    G                           
Subjt:  RDR-KKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLEDDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDV

Query:  VDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTT
                     +N D    G ++S+  L+ RI  M+E+R KK  + S+ L+WV RSRK+EE++ +EK++A QLS++FEEQDN+ QG ++D    E   
Subjt:  VDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSRKLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTT

Query:  SNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDG
         +L+GVKVLHG++KV+EGGAV+LTLKDQ++L DGDVN ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A+ DEG+ LD 
Subjt:  SNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENHGEKKMLPQYDDPASADEGLTLDG

Query:  TGRFSNDAEKKLEELRRRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQE
         GRF+ +AEKKLEELR+R+QG  + + FEDLN+S KVS DY++Q+EML+FKKPKKKK LRKK+KLD+  LEAEA+++GLG  DLGSR D  RQA K E+E
Subjt:  TGRFSNDAEKKLEELRRRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGLGVGDLGSRNDSSRQARKVEQE

Query:  RSEAEMRHNAYHLAYAKADEASRSLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQE
        R E E R NAY  A AKADEASR L+  Q    + +++E  ++ADD EDLYKSLE+AR+LAL K+EEA SGP+A+A L  ++T+  TTDD  T   E QE
Subjt:  RSEAEMRHNAYHLAYAKADEASRSLQSVQ-SSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDDQNTKAGELQE

Query:  NKVVFTEMEEFVWGLQLDEEAHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKE-EPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTL
        N VVFTEM +FVWGLQ + +  KP+ EDVFM++D APK      E+  D G TEV DT  +      D + I PDE IHE+ VGKGLS  LKLLKDRGTL
Subjt:  NKVVFTEMEEFVWGLQLDEEAHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKE-EPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTL

Query:  KESIEWGGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN
        KE +EWGGRNMDK+KSKLVGIVD+D  KE+K K+ +     D  K+I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN
Subjt:  KESIEWGGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKN

Query:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGT
        +DTPS SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLGIKRK EP N+ T
Subjt:  ADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCGGAGCCGTCATCAGTACCTGAACATGATGTGAGAAATGGTCATGATTTCTCTCAGGCAAGAGATCGTGGGGAAGGACAAGATGACTTTGGTTATAGTGGAGC
AGAAAAGTCAAGCAAACATCGGAGTGAGGATCATCGGAAGAGTAGTCGAGGGGAGGACAAAGACCATAGAAGTAAAGATCGAGACCGATCTAAGAGACGCAGTAATGATG
CATCGAAAGAAAAGGAGAAAGAGGTAAAAGAGTCAGAAAGGGATCGAGTTCATGGTCGGGAAAGGAGAAAGGAAGACAGAGATGAGCATGAAAAGGAAAGGAGCAGGGGT
AGCAAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATGGAGGAAAAGATAAAGAGCGTGGGAGGGAGAG
AGAGTTGGAGAAGGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGGGATAGGGAAAGGGATAGGGATAGGAAGAAGAAAGAGAAGGACAAGGACC
GATCGAATGAAATTGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGAGAAGGAAAGTTATAGGAACATTGACAAGGATAGAGGAAAAGAGAGAACATTGGAA
GATGATAGGAAAGCAGATCAAAACAAGGAAAAATTCCGAGATAAAGAAGGAATTGGCAGCAAAAACAATGGGGAAAGAGTTGATTGGATTGCAGATGGGGCTAAGGATTA
TATGCTAGAAAGTGATGGTGAGAATAACAGGGATAGAGATGTTGTTGATCAAGGGAACGGGATCCAACATTTGGGAGGTGAAGAAAATTCTGATGGATTGAAAGTTGGAG
CCCAATCTTCTTCAGCTATGCTTGAGGAGCGCATTCGAACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAGGTTTTAGCATGGGTTAAAAGGAGTCGC
AAACTTGAGGAGGAGAAACTTTCTGAAAAGGAGAAAGCCTTGCAGCTCTCAAAGGTTTTTGAGGAACAGGACAATATTGCTCAAGGTGTAAGTGATGATGATATTGCAGC
CGAAGATACAACTAGTAATTTAGCCGGAGTTAAAGTACTTCATGGCATAGACAAAGTACTAGAAGGTGGTGCAGTTGTCTTAACCCTTAAAGATCAGAATATCTTAGCTG
ATGGTGACGTTAATGAAGATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGGATTTATGAT
GACAAGTTTAATGATGAAAATCATGGTGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCATCTGCAGACGAGGGCCTAACTCTAGATGGAACCGGACGTTTTAGTAA
TGATGCAGAAAAGAAGCTGGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCTTCAGTCAATCACTTTGAAGATCTTAATGCATCAGTGAAAGTCTCACATGATTATTACA
CTCAAGATGAGATGCTCCAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAACTAGATATTGATGCCCTTGAAGCAGAAGCAATCTCTGCTGGATTG
GGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGCAGGCAAGCACGAAAAGTGGAACAAGAAAGATCTGAAGCAGAAATGCGACATAATGCATACCATTTAGCCTATGC
TAAAGCAGACGAGGCATCGAGATCTCTACAATCAGTTCAAAGCTCAATCAGATTAGAGGACAATGAAGATACTCTCGTTGCAGATGATGATGAAGACCTCTATAAGTCGT
TGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCGGGAGGAGGCAGCATCTGGACCAGAAGCAATTGCTCTTCTTGCGACAGCCACAACCAGCGGGCCGACAACTGATGAT
CAAAACACAAAAGCTGGAGAGTTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAGGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAAGCTCATAAACCCGATGAAGA
AGATGTCTTTATGGATGATGATGAAGCACCAAAAGAAGAATCTCGTCAAGATGAGAAGGATAAAGATGGTGGGTGGACCGAAGTCAAAGACACTGCCAAAGAAGAACCCA
CTCCCGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAACTTCCTGTTGGAAAGGGATTATCCAGTGTACTGAAGCTGCTTAAAGATCGAGGGACTCTGAAG
GAAAGCATCGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCAAAGGAAGCTAAGTCAAAGGATTTCCGTTTATC
TTCTTTGGTGGATTACAAAAAGGAGATTCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGCCAACTTTCTCATAAGTTCCATGGCA
AGGGACCTGGCAAAATGAAACAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCAGATACACCTTCGTTATCAGTGGAGAGAATG
AGGGAAGCTCAAGCACAATTAAAGACACCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACAAGTGATCCAAGAAGTGGTTTTGCTACTGTTGAAAAGGATCT
CCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGAATAAAGCGTAAAGGTGAACCTTCGAATACAGGCACAAAGAAGGCAAAAATTTGA
mRNA sequenceShow/hide mRNA sequence
TTCTTCCTTCGTAGCCCTTAGATTTTTCTCTCGCTTTGTTACTGAGAAAGTCGGAAGAAAGAAAAAGAGAACCCTAACTGGGAAGACGATCACGCTTGAGGTCGAGAGCA
GTGATACCATCGATAATGTCAAAGCCAAAATCCAGGACAAGGAGGGTATTCCACCTGACCAGCAGAGATTAATTTTTGCTGGTGTTAATGGAGATTTAGAGGCAAGCTAT
TTAATATTTGCAAATGGACGCGGAGCCGTCATCAGTACCTGAACATGATGTGAGAAATGGTCATGATTTCTCTCAGGCAAGAGATCGTGGGGAAGGACAAGATGACTTTG
GTTATAGTGGAGCAGAAAAGTCAAGCAAACATCGGAGTGAGGATCATCGGAAGAGTAGTCGAGGGGAGGACAAAGACCATAGAAGTAAAGATCGAGACCGATCTAAGAGA
CGCAGTAATGATGCATCGAAAGAAAAGGAGAAAGAGGTAAAAGAGTCAGAAAGGGATCGAGTTCATGGTCGGGAAAGGAGAAAGGAAGACAGAGATGAGCATGAAAAGGA
AAGGAGCAGGGGTAGCAAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATGGAGGAAAAGATAAAGAGC
GTGGGAGGGAGAGAGAGTTGGAGAAGGATAATGTTCGAGGACAAGACAAAGAGAGGGGAAAGGAGAAAGACAGGGATAGGGAAAGGGATAGGGATAGGAAGAAGAAAGAG
AAGGACAAGGACCGATCGAATGAAATTGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGAGAAGGAAAGTTATAGGAACATTGACAAGGATAGAGGAAAAGA
GAGAACATTGGAAGATGATAGGAAAGCAGATCAAAACAAGGAAAAATTCCGAGATAAAGAAGGAATTGGCAGCAAAAACAATGGGGAAAGAGTTGATTGGATTGCAGATG
GGGCTAAGGATTATATGCTAGAAAGTGATGGTGAGAATAACAGGGATAGAGATGTTGTTGATCAAGGGAACGGGATCCAACATTTGGGAGGTGAAGAAAATTCTGATGGA
TTGAAAGTTGGAGCCCAATCTTCTTCAGCTATGCTTGAGGAGCGCATTCGAACCATGAAAGAAGACAGGCTAAAGAAGCAAACTGAAGAATCTGAGGTTTTAGCATGGGT
TAAAAGGAGTCGCAAACTTGAGGAGGAGAAACTTTCTGAAAAGGAGAAAGCCTTGCAGCTCTCAAAGGTTTTTGAGGAACAGGACAATATTGCTCAAGGTGTAAGTGATG
ATGATATTGCAGCCGAAGATACAACTAGTAATTTAGCCGGAGTTAAAGTACTTCATGGCATAGACAAAGTACTAGAAGGTGGTGCAGTTGTCTTAACCCTTAAAGATCAG
AATATCTTAGCTGATGGTGACGTTAATGAAGATATGGATGTACTTGAGAATGTGGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAAC
CGGGATTTATGATGACAAGTTTAATGATGAAAATCATGGTGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCATCTGCAGACGAGGGCCTAACTCTAGATGGAACCG
GACGTTTTAGTAATGATGCAGAAAAGAAGCTGGAGGAGCTTCGGAGAAGATTACAGGGAGCTTCTTCAGTCAATCACTTTGAAGATCTTAATGCATCAGTGAAAGTCTCA
CATGATTATTACACTCAAGATGAGATGCTCCAATTTAAGAAGCCCAAGAAAAAGAAATCCCTTCGAAAGAAGGAAAAACTAGATATTGATGCCCTTGAAGCAGAAGCAAT
CTCTGCTGGATTGGGTGTTGGAGATCTTGGTTCTCGAAATGATTCTAGCAGGCAAGCACGAAAAGTGGAACAAGAAAGATCTGAAGCAGAAATGCGACATAATGCATACC
ATTTAGCCTATGCTAAAGCAGACGAGGCATCGAGATCTCTACAATCAGTTCAAAGCTCAATCAGATTAGAGGACAATGAAGATACTCTCGTTGCAGATGATGATGAAGAC
CTCTATAAGTCGTTGGAGAGAGCAAGAAAATTAGCTCTTAAGAAGCGGGAGGAGGCAGCATCTGGACCAGAAGCAATTGCTCTTCTTGCGACAGCCACAACCAGCGGGCC
GACAACTGATGATCAAAACACAAAAGCTGGAGAGTTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAGGAATTTGTCTGGGGTCTCCAGCTTGATGAAGAAGCTCATA
AACCCGATGAAGAAGATGTCTTTATGGATGATGATGAAGCACCAAAAGAAGAATCTCGTCAAGATGAGAAGGATAAAGATGGTGGGTGGACCGAAGTCAAAGACACTGCC
AAAGAAGAACCCACTCCCGAGGATAATGAGGCAATAGCTCCAGATGAAACAATCCATGAACTTCCTGTTGGAAAGGGATTATCCAGTGTACTGAAGCTGCTTAAAGATCG
AGGGACTCTGAAGGAAAGCATCGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCAAAGGAAGCTAAGTCAAAGG
ATTTCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATTCACATTGAGAGGACTGATGAATTTGGGCGAATTATGACTCCAAAGGAGTCATTTCGCCAACTTTCTCAT
AAGTTCCATGGCAAGGGACCTGGCAAAATGAAACAAGAAAAGCGCATGAAGCAATACCAAGAAGAGTTGAAGTTGAAGCAGATGAAGAATGCAGATACACCTTCGTTATC
AGTGGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACACCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACAAGTGATCCAAGAAGTGGTTTTGCTACTG
TTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGAATAAAGCGTAAAGGTGAACCTTCGAATACAGGCACAAAGAAG
GCAAAAATTTGAGATTTGTAATTTTTTTTTACCAGAATTTGAATTAGCAGCAAGAAATAGAAACCCATGTTTAGTTTAGAGTTTAATTTCACTCTTACCAAGAAGATGCA
ACGTCGTATAATCAATTTCATGTCCCCATCAGTTAACAGTGCTTCCATATTTCAAGTGTTTGTATGATAATCGATGTATTCTTTGTTTTCCTTTTTTCAAAAAATTTTAG
TAGTTTTTTTTTTTTTTGGTAAAGCTAAGTCGGAGATGATTGATTGCTCTTATATACTATTTCAAGTCA
Protein sequenceShow/hide protein sequence
MDAEPSSVPEHDVRNGHDFSQARDRGEGQDDFGYSGAEKSSKHRSEDHRKSSRGEDKDHRSKDRDRSKRRSNDASKEKEKEVKESERDRVHGRERRKEDRDEHEKERSRG
SKVKDKDYDREIYKEKEYERERDRKDGGKDKERGRERELEKDNVRGQDKERGKEKDRDRERDRDRKKKEKDKDRSNEIEREKGREKHRDQEEKESYRNIDKDRGKERTLE
DDRKADQNKEKFRDKEGIGSKNNGERVDWIADGAKDYMLESDGENNRDRDVVDQGNGIQHLGGEENSDGLKVGAQSSSAMLEERIRTMKEDRLKKQTEESEVLAWVKRSR
KLEEEKLSEKEKALQLSKVFEEQDNIAQGVSDDDIAAEDTTSNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDVNEDMDVLENVEIGEQKQRDMAYKAAKKKTGIYD
DKFNDENHGEKKMLPQYDDPASADEGLTLDGTGRFSNDAEKKLEELRRRLQGASSVNHFEDLNASVKVSHDYYTQDEMLQFKKPKKKKSLRKKEKLDIDALEAEAISAGL
GVGDLGSRNDSSRQARKVEQERSEAEMRHNAYHLAYAKADEASRSLQSVQSSIRLEDNEDTLVADDDEDLYKSLERARKLALKKREEAASGPEAIALLATATTSGPTTDD
QNTKAGELQENKVVFTEMEEFVWGLQLDEEAHKPDEEDVFMDDDEAPKEESRQDEKDKDGGWTEVKDTAKEEPTPEDNEAIAPDETIHELPVGKGLSSVLKLLKDRGTLK
ESIEWGGRNMDKRKSKLVGIVDEDEPKEAKSKDFRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERM
REAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEPSNTGTKKAKI