; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033965 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033965
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationchr3:3338555..3352075
RNA-Seq ExpressionLag0033965
SyntenyLag0033965
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0064.02Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DMEALRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        +PL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDKR SAM YGPAENSHVACY+ ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDL+ITPEK+REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S
          R+PVSRNEKSA GDAPKKR+SNNQRN+ D TA                                                          QLN KE +
Subjt:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S

Query:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDD-IIERNGESVADKDLDD--S
        ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DD IIE  G +VADKDLDD  S
Subjt:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDD-IIERNGESVADKDLDD--S

Query:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        FSLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

KAG7023314.1 Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0063.42Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DMEALRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        +PL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQI----------SGLQGFSADSDDLDNILASIKDL
        +KKDKR SAM YGPAENSHVACY+ ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQI          S +QGFSA+SDDLDNILASIKDL
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQI----------SGLQGFSADSDDLDNILASIKDL

Query:  EITPEKVREFLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKR
        +ITPEK+REFLPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK 
Subjt:  EITPEKVREFLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKR

Query:  EWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDP
        EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDP
Subjt:  EWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDP

Query:  SLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII-----
        SLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++     
Subjt:  SLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII-----

Query:  ------------RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA--------------------------------------------------------
                    R+PVSRNEKSA GDAPKKR+SNNQRN+ D TA                                                        
Subjt:  ------------RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA--------------------------------------------------------

Query:  --QLNSKE-SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDD-IIERNGESV
          QLN KE +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DD IIE  G +V
Subjt:  --QLNSKE-SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDD-IIERNGESV

Query:  ADKDLDD--SFSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        ADKDLDD  SFSLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  ADKDLDD--SFSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

XP_022921860.1 uncharacterized protein LOC111430000 isoform X1 [Cucurbita moschata]0.0e+0063.8Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFSTAADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
         R TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDKR SAM YGPAENSHVACY+ ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIK L+ITPEK+REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA +YL+ RSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S
          R+PVSRNEKSA GDAPKK +SNNQRNQ D TA                                                          QLN KE +
Subjt:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S

Query:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF
        ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLDD  SF
Subjt:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF

Query:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        SLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

XP_022987958.1 uncharacterized protein LOC111485355 isoform X1 [Cucurbita maxima]0.0e+0063.93Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ +N   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK++RSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CALSQM+DPRVQL+SAGKP QAKD
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD

Query:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE
        S+KKDKR SAM YGPAENSHVACY++ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDL+ITPEK+RE
Subjt:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE

Query:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK
        FLPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDK
Subjt:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK

Query:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE
        LRSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTE
Subjt:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE

Query:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------
        EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++              
Subjt:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------

Query:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-
           R+PVSRNEKSA GDAPKKR+SNNQRN+VD TA                                                          QLN KE 
Subjt:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-

Query:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS
        +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLD  +S
Subjt:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS

Query:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        FSLP SCLELR TDSEGV+SYSVDE TDK H VCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0063.61Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FE LR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDKR S+M YGPAENSHVACY+ A TKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIK L+ITPEK+REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVA+FG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S
          R+PVSRNEKSA GDAPK+R+SNNQRN+ D TA                                                          QLN KE +
Subjt:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S

Query:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF
        ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLDD  SF
Subjt:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF

Query:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        SLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

TrEMBL top hitse value%identityAlignment
A0A6J1C075 uncharacterized protein LOC111007172 isoform X20.0e+0062.56Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS ++GGD+E S GEEDDE DV DEDMEALRRACRLVG NPEE  NN   S  AG  SFGG KPGSDSDD+DDLELVRNIR RFS AAD++DQALS 
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
         PL+TLPPVSPDEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSN  DQSC+F GPLEMDSN T V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
         RQTSS RSSMLA EKG+LPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLR+RFKILKDFQ SCRR+T   LSQMIDPRVQL+SA KPQ KDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDK+ SAMYYGPAENSHVACY+MALTKFP SV+RKKW+NVERENLGKGIRQQFQEMVLQIS+DQISGLQGFSADSDDLDNI ASIKDL+ITPE++REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLAS YL  RSGAECEARWLNFEDPLINRNPW+TSEDKNLL TIQQKGLNNWI+IAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVA+ GVGDWQA+ASTLEGR GTQCSNRWKKSLDPARTKRG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+C+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPNEVPLLQEAR+IQKAALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  -RIPVSRNEKSATGDAPKKRRSNNQRNQVDAT----------------------------------------------AQLNSKESERTSSDCTETVDEN
         R  V R EK+ATGDAPK+R+SNN  NQ +                                                 QLNS  S RT+S+C E V EN
Subjt:  -RIPVSRNEKSATGDAPKKRRSNNQRNQVDAT----------------------------------------------AQLNSKESERTSSDCTETVDEN

Query:  GMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVLYSIAGQ-----------------PTG-DDIIERNGESVADKDLDD--SFSLPHSCLELRATD
        GME CENKVAE LSK D+ F EQEE QNSTGSSGVSVL  +                     TG DDI ER    VAD DL D  SFSLPH CL LR  D
Subjt:  GMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVLYSIAGQ-----------------PTG-DDIIERNGESVADKDLDD--SFSLPHSCLELRATD

Query:  SEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDS
        SEGV+S SV E TDK   V K+QGRR++ ++   + + S
Subjt:  SEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDS

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0063.71Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFSTAADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
         R TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDKR SAM YGPAENSHVACY+ ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIK L+ITPEK+REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA +YL+ RSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S
          R+PVSRNEKSA GDAPKK +SNNQRNQ D TA                                                          QLN KE +
Subjt:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S

Query:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF
        ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLDD  SF
Subjt:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF

Query:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        SLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0063.8Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ IN   S  AAGD +      GSDSDD+DDLEL+RNI+ RFSTAADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS
         R TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK+IRSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CAL+QM+DPRVQL+SAGKPQAKDS
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDS

Query:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF
        +KKDKR SAM YGPAENSHVACY+ ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIK L+ITPEK+REF
Subjt:  TKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREF

Query:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA +YL+ RSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++               
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII---------------

Query:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S
          R+PVSRNEKSA GDAPKK +SNNQRNQ D TA                                                          QLN KE +
Subjt:  --RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-S

Query:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF
        ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLDD  SF
Subjt:  ERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLDD--SF

Query:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        SLP SCLELR TDSEGV+SYSVDE TDK HGVCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0063.83Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ +N   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK++RSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CALSQM+DPRVQL+SAGKP QAKD
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD

Query:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE
        S+KKDKR SAM YGPAENSHVACY++ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDL+ITPEK+RE
Subjt:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE

Query:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK
        FLPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDK
Subjt:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK

Query:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE
        LRSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTE
Subjt:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE

Query:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------
        EEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++              
Subjt:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------

Query:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-
           R+PVSRNEKSA GDAPKKR+SNNQRN+VD TA                                                          QLN KE 
Subjt:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-

Query:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS
        +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLD  +S
Subjt:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS

Query:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        FSLP SCLELR TDSEGV+SYSVDE TDK H VCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0063.93Show/hide
Query:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST
        MSRRS  DGGD+E  A EEDDE D+ D+DME LRRACRL GVN E+ +N   S  AAGD +      GSDSDD+DDLEL+RNI+ RFS AADE       
Subjt:  MSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALST

Query:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK
        QPL+ LPPV+ DEEED+FETLR I+                                         R  A E+                           
Subjt:  QPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLK

Query:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV
                                                                                  D LSNKPDQSC+ DGPL+MDS+NT V
Subjt:  GRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVV

Query:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD
        ER TSSERSSM+AFEKGSLPKAALAFIDAIKKNRSQQK++RSKMIHLEARIEENKKLRKRFK+LK FQGSCRRKT+CALSQM+DPRVQL+SAGKP QAKD
Subjt:  ERQTSSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKP-QAKD

Query:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE
        S+KKDKR SAM YGPAENSHVACY++ALTKF P VDRK+W+N ERENLGKGIRQQFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDL+ITPEK+RE
Subjt:  STKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVRE

Query:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK
        FLPKVNWDKLAS+YLR RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDK
Subjt:  FLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDK

Query:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE
        LRSAVAIFG GDWQAVASTLEGRTG QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTE
Subjt:  LRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTE

Query:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------
        EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPAL    F       ++              
Subjt:  EEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFII--------------

Query:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-
           R+PVSRNEKSA GDAPKKR+SNNQRN+VD TA                                                          QLN KE 
Subjt:  ---RIPVSRNEKSATGDAPKKRRSNNQRNQVDATA----------------------------------------------------------QLNSKE-

Query:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS
        +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVL         Y+ +  P        T DDIIE  G +VADKDLD  +S
Subjt:  SERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEENQNSTGSSGVSVL---------YSIAGQP--------TGDDIIERNGESVADKDLD--DS

Query:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK
        FSLP SCLELR TDSEGV+SYSVDE TDK H VCK QGRRKKNSK SNKSQDSLVSC +
Subjt:  FSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNK

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb1.4e-3139.77Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog1.1e-3133.08Show/hide
Query:  VNWDKLASI-YLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+  +  SR+    +++W N  +P  N+  W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASI-YLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAIFGVG---DWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCQWT
           I  +     W  VA  + GRT  Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ +  RN  QCRERW N L+ S     ++T
Subjt:  AVAIFGVG---DWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCQWT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD
          ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD

Q54NA6 Myb-like protein L1.2e-5732.1Show/hide
Query:  PAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLEITPEKVREFLPK
        PA+N      ++     P     ++WT  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ +N + +  +     +K     P 
Subjt:  PAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLEITPEKVREFLPK

Query:  VNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +       +   +RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   
Subjt:  VNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAIFGVG---DWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        + +   G   DWQ +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L + +WT +
Subjt:  VAIFGVG---DWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIPVSRNEKSAT
        ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L         +  +I I    N KS  
Subjt:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIPVSRNEKSAT

Query:  GDAPKKRRSNNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRD
            KK ++             N+ +++   + C +  D +     ++   + L ++D
Subjt:  GDAPKKRRSNNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRD

Q5SXM2 snRNA-activating protein complex subunit 44.4e-4128.47Show/hide
Query:  IKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTKKDKRFSAMYYGPAENSHVACYKMALTK
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP  KD             GP  N           K
Subjt:  IKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTKKDKRFSAMYYGPAENSHVACYKMALTK

Query:  FPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLEITPEK--VREFLPKVNWDKLASI-YLRS
            +   KW N E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D+   PE+  +   L   +W+K+++I +  S
Subjt:  FPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLEITPEK--VREFLPKVNWDKLASI-YLRS

Query:  RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG    ++
Subjt:  RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQ

Query:  AVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   K+GY+ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++ +W  +E+ +L   I+++G
Subjt:  AVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 45.9e-3827.15Show/hide
Query:  QVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVVERQTSSERSSMLAFEKG-SLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEEN
        ++LYP   S   + F  +  +LS+  +     D  LE  +   ++E + SSE S+     K  +LP+        ++ N   Q+ IR K+  +   + +N
Subjt:  QVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVVERQTSSERSSMLAFEKG-SLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEEN

Query:  KKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQ
        ++ ++  +IL D  G+   K     S  +   + +    KP  KD             GP  N           K    +   KW + E+  L K +   
Subjt:  KKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTKKDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQ

Query:  FQEMVLQISV----------DQISGLQGFSADSDDLDNILASIKDLEITPEK--VREFLPKVNWDKLASI-YLRSRSGAECEARWLNFEDPLINRNPWTT
          + +LQ  +           ++S      A    +      I+D+   PE+  +   L   +W+K+++I +  +RS  E    W + E P I++  W+T
Subjt:  FQEMVLQISV----------DQISGLQGFSADSDDLDNILASIKDLEITPEK--VREFLPKVNWDKLASI-YLRSRSGAECEARWLNFEDPLINRNPWTT

Query:  SEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEGRTGTQCSNRWKKSLDPA
         E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG+   ++ +   +EGR   Q   RW KSLDP+
Subjt:  SEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEGRTGTQCSNRWKKSLDPA

Query:  RTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKK
          KRG++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++ +W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK 
Subjt:  RTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKK

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-35.2e-2941.5Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-35.2e-2941.5Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r11.3e-15738.1Show/hide
Query:  EPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALSTQPLNTLPPVSPD
        E    ++DDE D   ED+E LRRAC +  VN ++  +   S    G    GG +  SDS++ DD E++R I+   S  A  +D   S+ P   L  +S  
Subjt:  EPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIRERFSTAADEQDQALSTQPLNTLPPVSPD

Query:  EEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLKGRKCLQKIVGSV
        E ED+FE +R I+  L     + L     + + +D +    DA  ++  + +  R  A +    E K     H               G+K         
Subjt:  EEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQASISLKGRKCLQKIVGSV

Query:  VKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVVERQTSSERSSML
         K++       S   + +S+  E+                                 F  +G ++   PD     DG +   S+                
Subjt:  VKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVVERQTSSERSSML

Query:  AFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------
             S P+AA AF+DAI++NR+ QK++R K+  +EA IE+N+K +K  +I+KDFQ SC+R T  AL Q  DPRV+L+S  K    DS++          
Subjt:  AFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------

Query:  KDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLP
         DK+ S +  GPAEN  V  Y+MAL K+P SV R+KW+  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +LEITPE +R+FLP
Subjt:  KDKRFSAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLP

Query:  KVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        K+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+
Subjt:  KVNWDKLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEED
        AV +FG  DWQ+VA+ L+GRTGTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED
Subjt:  AVAIFGVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEED

Query:  LRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAP
         +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPALV           I+ +P +S   +  +    
Subjt:  LRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAP

Query:  KKRRSNNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKD
        KKR++  +++  +   +   K  +  S D     +E   E+  N   E       C  E ++N +        SV  ++      +   ER      D +
Subjt:  KKRRSNNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKD

Query:  LDDSFSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKK-NSKTSNKSQDS
        + D+           A +       SV E ++   G+ K   RR+K ++    K+QD+
Subjt:  LDDSFSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKK-NSKTSNKSQDS

AT3G18100.2 myb domain protein 4r12.6e-15346.09Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------KDKRF
        S P+AA AF+DAI++NR+ QK++R K+  +EA IE+N+K +K  +I+KDFQ SC+R T  AL Q  DPRV+L+S  K    DS++           DK+ 
Subjt:  SLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------KDKRF

Query:  SAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLPKVNWD
        S +  GPAEN  V  Y+MAL K+P SV R+KW+  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +LEITPE +R+FLPK+NWD
Subjt:  SAMYYGPAENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLPKVNWD

Query:  KLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF
           S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +F
Subjt:  KLASIYLRSRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF

Query:  GVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEI
        G  DWQ+VA+ L+GRTGTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  GVGDWQAVASTLEGRTGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAPKKRRS
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPALV           I+ +P +S   +  +    KKR++
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAPKKRRS

Query:  NNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKDLDDSF
          +++  +   +   K  +  S D     +E   E+  N   E       C  E ++N +        SV  ++      +   ER      D ++ D+ 
Subjt:  NNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKDLDDSF

Query:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKK-NSKTSNKSQDS
                  A +       SV E ++   G+ K   RR+K ++    K+QD+
Subjt:  SLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKK-NSKTSNKSQDS

AT3G18100.3 myb domain protein 4r17.7e-13444.25Show/hide
Query:  HLEARIEEN-KKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------KDKRFSAMYYGPAENSHVACYKMALTKFPPSV
        H+   +EEN +KL++R         S R + +    +M DPRV+L+S  K    DS++           DK+ S +  GPAEN  V  Y+MAL K+P SV
Subjt:  HLEARIEEN-KKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTK----------KDKRFSAMYYGPAENSHVACYKMALTKFPPSV

Query:  DRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLI
         R+KW+  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +LEITPE +R+FLPK+NWD   S+ ++ RS AECEARW++ EDPLI
Subjt:  DRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLPKVNWDKLASIYLRSRSGAECEARWLNFEDPLI

Query:  NRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKS
        N  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG  DWQ+VA+ L+GRTGTQCSNRWKKS
Subjt:  NRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKS

Query:  LDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRR
        L P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RR
Subjt:  LDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRR

Query:  WKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAPKKRRSNNQRNQVDATAQLNSKESERTSSDCTE
        WK+L+P++V LLQEAR++QK A + NFVDRESERPALV           I+ +P +S   +  +    KKR++  +++  +   +   K  +  S D   
Subjt:  WKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIP-VSRNEKSATGDAPKKRRSNNQRNQVDATAQLNSKESERTSSDCTE

Query:  TVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKDLDDSFSLPHSCLELRATDSEGVNSYSVDEVTD
          +E   E+  N   E       C  E ++N +        SV  ++      +   ER      D ++ D+           A +       SV E ++
Subjt:  TVDENGMESCENKVAETLSKRDVCFSEQEEN-QNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKDLDDSFSLPHSCLELRATDSEGVNSYSVDEVTD

Query:  KGHGVCKYQGRRKK-NSKTSNKSQDS
           G+ K   RR+K ++    K+QD+
Subjt:  KGHGVCKYQGRRKK-NSKTSNKSQDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGATGTGGGGCAAGCAAGAGTCTCGACGGCACGACGGAAGAAGATCGTCGGCACGGCGGATGTGGGTCGCCGGCGCGATGCGAGAGGTAGGGATTGGGATACACT
CTGTGTTGGGGGGTGGAAAATGGTGGGAAATGGAGGATTAAGTCTTTCAAGGGGGGGCAATGGTGGGAAGGGAGGGTTGTGGAAGATGAAGATGGACGGACGGGTAGTAA
ATATGCAGATGGACAGGGATGTTGTTCCTGATCAATGGAGTTCAGAGGTGTTTCTCTCTACTGACGGGGTGGTTGAACTTTGTACTCATTGTCCGACAAGAGGTTCTCAA
TGGAGTTCAGAGGTGTTTCTCTCCACTGACGGGGTGGTTGAACTTTGTACTCATTGTCCGACAAGCCTTGATTCATCCATGTCTCGCCGCAGCGACGACGACGGCGGTGA
CGAGGAGCCTTCCGCCGGCGAGGAAGACGATGAAGCTGATGTGTCTGATGAGGACATGGAAGCCCTCCGGCGAGCGTGTAGGCTTGTTGGAGTCAATCCTGAGGAGTGTA
TTAATAATCTGTCGTCGTCAAATGCCGCCGGAGATGACTCTTTCGGCGGCAAAAAACCGGGCTCTGATTCTGATGATATTGATGATCTTGAACTTGTTCGGAATATTCGG
GAACGGTTCTCGACTGCGGCCGATGAGCAGGACCAGGCTTTGTCTACGCAGCCGTTGAATACTCTCCCACCAGTGTCACCAGACGAGGAGGAAGACGAGTTCGAGACGCT
CCGTCGGATTCGGCACGCTTTGCGACAGCGATCAGTTTTGGGCCTAATGGCTAGAAAACAAGTTGCTATGGGCAATGACGCTGATCACCCCACCCAAGATGCGGAAACAT
CTATTCTCCCCCAAACCCTTGACACTAGACTACAAGCTGTGGAAGCTTCTATCAAAGAGATGAAGGGTGCTCTTCATGAGCATTGTGAGTTTTGTGCGCATGTTGATCAA
GCTAGCATATCTTTGAAAGGACGCAAATGTTTGCAAAAGATAGTTGGGAGTGTAGTAAAGGAGATCTTTGAAAGGTGTTTTGTGAGTTCTATGAGAGACGTCTTAAAGAG
CTCTTTGAATGAAACATTTGTGTTCTTCATCCCCAGGGAGGAGGGAGCTAGGAGGAGAGATGATCGAAGAGTTCAAGTTCTATATCCATCATTCCTTTCGGAGAAGAGAA
TGGTTTTTATGGCAAATGGGGATACTTTGAGCAATAAACCCGATCAGTCTTGTGAATTTGATGGACCTCTGGAGATGGATTCTAACAACACAGTTGTTGAGAGACAGACA
TCCTCAGAAAGGTCATCTATGCTAGCCTTTGAAAAGGGCAGCTTGCCAAAGGCTGCGTTGGCATTTATTGATGCCATCAAGAAGAATAGGTCTCAGCAGAAGTATATTCG
TAGTAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAGAAGAAAAACAAGTTGTG
CACTGTCTCAGATGATTGATCCTCGAGTCCAGTTAATGTCGGCTGGAAAACCACAGGCTAAGGATTCAACAAAGAAGGACAAACGATTTTCTGCAATGTATTATGGCCCA
GCTGAGAATTCTCATGTTGCATGCTACAAAATGGCATTGACAAAGTTTCCACCTTCTGTAGATCGAAAAAAATGGACCAATGTAGAAAGGGAGAATCTAGGGAAGGGAAT
AAGACAGCAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCAGCAGATTCAGATGATCTAGATAACATTCTTGCATCAATAA
AAGACCTTGAGATTACCCCTGAAAAGGTTAGGGAATTTTTGCCAAAAGTTAATTGGGACAAACTGGCTTCCATTTATCTTCGAAGTCGCTCAGGGGCAGAATGTGAAGCA
AGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGA
CATAGCAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATG
ATAAACTTCGATCTGCTGTTGCTATATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGTACACAGTGCTCTAATAGGTGGAAAAAATCC
CTTGACCCAGCTAGGACAAAAAGAGGCTATTTCACTCCAGATGAAGATAGTCGCTTGAAAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATT
TTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAG
CAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCT
TTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCGGCCTTGGTCCTACTGACTTTCGACCTCTACCGGAT
ACAATTTATTATAAGGATTCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAGATCAAATAACCAGAGGAATCAAGTTGATGCCACTGCTC
AGCTAAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAGAATGGTATGGAGTCGTGTGAGAACAAAGTTGCTGAGACGCTTTCTAAAAGA
GATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTATACAGCATTGCTGGCCAGCCTACTGGGGATGATATTATAGAAAG
AAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGTGAAGGTGTCAACAGCTATTCTGTGG
ATGAAGTCACAGATAAAGGCCATGGGGTTTGCAAGTACCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTTGTAACAAGTGG
AGCTGGAGAGGTCGAGATGAAGGAGCTTCATTGTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGATGTGGGGCAAGCAAGAGTCTCGACGGCACGACGGAAGAAGATCGTCGGCACGGCGGATGTGGGTCGCCGGCGCGATGCGAGAGGTAGGGATTGGGATACACT
CTGTGTTGGGGGGTGGAAAATGGTGGGAAATGGAGGATTAAGTCTTTCAAGGGGGGGCAATGGTGGGAAGGGAGGGTTGTGGAAGATGAAGATGGACGGACGGGTAGTAA
ATATGCAGATGGACAGGGATGTTGTTCCTGATCAATGGAGTTCAGAGGTGTTTCTCTCTACTGACGGGGTGGTTGAACTTTGTACTCATTGTCCGACAAGAGGTTCTCAA
TGGAGTTCAGAGGTGTTTCTCTCCACTGACGGGGTGGTTGAACTTTGTACTCATTGTCCGACAAGCCTTGATTCATCCATGTCTCGCCGCAGCGACGACGACGGCGGTGA
CGAGGAGCCTTCCGCCGGCGAGGAAGACGATGAAGCTGATGTGTCTGATGAGGACATGGAAGCCCTCCGGCGAGCGTGTAGGCTTGTTGGAGTCAATCCTGAGGAGTGTA
TTAATAATCTGTCGTCGTCAAATGCCGCCGGAGATGACTCTTTCGGCGGCAAAAAACCGGGCTCTGATTCTGATGATATTGATGATCTTGAACTTGTTCGGAATATTCGG
GAACGGTTCTCGACTGCGGCCGATGAGCAGGACCAGGCTTTGTCTACGCAGCCGTTGAATACTCTCCCACCAGTGTCACCAGACGAGGAGGAAGACGAGTTCGAGACGCT
CCGTCGGATTCGGCACGCTTTGCGACAGCGATCAGTTTTGGGCCTAATGGCTAGAAAACAAGTTGCTATGGGCAATGACGCTGATCACCCCACCCAAGATGCGGAAACAT
CTATTCTCCCCCAAACCCTTGACACTAGACTACAAGCTGTGGAAGCTTCTATCAAAGAGATGAAGGGTGCTCTTCATGAGCATTGTGAGTTTTGTGCGCATGTTGATCAA
GCTAGCATATCTTTGAAAGGACGCAAATGTTTGCAAAAGATAGTTGGGAGTGTAGTAAAGGAGATCTTTGAAAGGTGTTTTGTGAGTTCTATGAGAGACGTCTTAAAGAG
CTCTTTGAATGAAACATTTGTGTTCTTCATCCCCAGGGAGGAGGGAGCTAGGAGGAGAGATGATCGAAGAGTTCAAGTTCTATATCCATCATTCCTTTCGGAGAAGAGAA
TGGTTTTTATGGCAAATGGGGATACTTTGAGCAATAAACCCGATCAGTCTTGTGAATTTGATGGACCTCTGGAGATGGATTCTAACAACACAGTTGTTGAGAGACAGACA
TCCTCAGAAAGGTCATCTATGCTAGCCTTTGAAAAGGGCAGCTTGCCAAAGGCTGCGTTGGCATTTATTGATGCCATCAAGAAGAATAGGTCTCAGCAGAAGTATATTCG
TAGTAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAGAAGAAAAACAAGTTGTG
CACTGTCTCAGATGATTGATCCTCGAGTCCAGTTAATGTCGGCTGGAAAACCACAGGCTAAGGATTCAACAAAGAAGGACAAACGATTTTCTGCAATGTATTATGGCCCA
GCTGAGAATTCTCATGTTGCATGCTACAAAATGGCATTGACAAAGTTTCCACCTTCTGTAGATCGAAAAAAATGGACCAATGTAGAAAGGGAGAATCTAGGGAAGGGAAT
AAGACAGCAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCAGCAGATTCAGATGATCTAGATAACATTCTTGCATCAATAA
AAGACCTTGAGATTACCCCTGAAAAGGTTAGGGAATTTTTGCCAAAAGTTAATTGGGACAAACTGGCTTCCATTTATCTTCGAAGTCGCTCAGGGGCAGAATGTGAAGCA
AGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGA
CATAGCAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATG
ATAAACTTCGATCTGCTGTTGCTATATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGTACACAGTGCTCTAATAGGTGGAAAAAATCC
CTTGACCCAGCTAGGACAAAAAGAGGCTATTTCACTCCAGATGAAGATAGTCGCTTGAAAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATT
TTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAG
CAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCT
TTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCGGCCTTGGTCCTACTGACTTTCGACCTCTACCGGAT
ACAATTTATTATAAGGATTCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAGATCAAATAACCAGAGGAATCAAGTTGATGCCACTGCTC
AGCTAAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAGAATGGTATGGAGTCGTGTGAGAACAAAGTTGCTGAGACGCTTTCTAAAAGA
GATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTATACAGCATTGCTGGCCAGCCTACTGGGGATGATATTATAGAAAG
AAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGTGAAGGTGTCAACAGCTATTCTGTGG
ATGAAGTCACAGATAAAGGCCATGGGGTTTGCAAGTACCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTTGTAACAAGTGG
AGCTGGAGAGGTCGAGATGAAGGAGCTTCATTGTCATAA
Protein sequenceShow/hide protein sequence
MEDVGQARVSTARRKKIVGTADVGRRRDARGRDWDTLCVGGWKMVGNGGLSLSRGGNGGKGGLWKMKMDGRVVNMQMDRDVVPDQWSSEVFLSTDGVVELCTHCPTRGSQ
WSSEVFLSTDGVVELCTHCPTSLDSSMSRRSDDDGGDEEPSAGEEDDEADVSDEDMEALRRACRLVGVNPEECINNLSSSNAAGDDSFGGKKPGSDSDDIDDLELVRNIR
ERFSTAADEQDQALSTQPLNTLPPVSPDEEEDEFETLRRIRHALRQRSVLGLMARKQVAMGNDADHPTQDAETSILPQTLDTRLQAVEASIKEMKGALHEHCEFCAHVDQ
ASISLKGRKCLQKIVGSVVKEIFERCFVSSMRDVLKSSLNETFVFFIPREEGARRRDDRRVQVLYPSFLSEKRMVFMANGDTLSNKPDQSCEFDGPLEMDSNNTVVERQT
SSERSSMLAFEKGSLPKAALAFIDAIKKNRSQQKYIRSKMIHLEARIEENKKLRKRFKILKDFQGSCRRKTSCALSQMIDPRVQLMSAGKPQAKDSTKKDKRFSAMYYGP
AENSHVACYKMALTKFPPSVDRKKWTNVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLEITPEKVREFLPKVNWDKLASIYLRSRSGAECEA
RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGTQCSNRWKKS
LDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP
LLQEARKIQKAALISNFVDRESERPALVLLTFDLYRIQFIIRIPVSRNEKSATGDAPKKRRSNNQRNQVDATAQLNSKESERTSSDCTETVDENGMESCENKVAETLSKR
DVCFSEQEENQNSTGSSGVSVLYSIAGQPTGDDIIERNGESVADKDLDDSFSLPHSCLELRATDSEGVNSYSVDEVTDKGHGVCKYQGRRKKNSKTSNKSQDSLVSCNKW
SWRGRDEGASLS