; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017978 (gene) of Chayote v1 genome

Gene IDSed0017978
OrganismSechium edule (Chayote v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationLG12:31737511..31747834
RNA-Seq ExpressionSed0017978
SyntenySed0017978
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0074.95Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DMEALR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DE+PL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT +ER TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL AMCYGP ENSHVACYR ALTKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT
        IRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIKD D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRN WTTS+DKNLLFT
Subjt:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT

Query:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED
        IQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDED
Subjt:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED

Query:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR
        +RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFPN+VPLLQEAR
Subjt:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR

Query:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS
        KIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A  RNV+ RRMPVSRNE+SA GDAPKKRKS+NQR +AD  AQV  A       EVKS
Subjt:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS

Query:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG
        TKPQRKR    AYT  T+ KG  ++  NSERCAEQN DTQS  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSSG
Subjt:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG

Query:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS
        VS LSEMTNDMDE +NP   PDT  L   T DDI+      +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK S
Subjt:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS

Query:  NKSQDSLVGSSQ-QVELGTKTSPRHHNQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLKD
        NKSQDSLV   Q + E+         NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND NDPT+A LLKD
Subjt:  NKSQDSLVGSSQ-QVELGTKTSPRHHNQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLKD

Query:  KSKKKRHLGS
        K K+K+ L +
Subjt:  KSKKKRHLGS

KAG7023314.1 Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0074.53Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DMEALR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DE+PL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT +ER TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL AMCYGP ENSHVACYR ALTKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQL----------SWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWT
        IRQQFQEMVLQISVDQ+          S +QGFS +SD+LD ILASIKD D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRN WT
Subjt:  IRQQFQEMVLQISVDQL----------SWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWT

Query:  TSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKS
        TS+DKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++
Subjt:  TSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKS

Query:  KRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFP
        KRG+FTPDED+RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFP
Subjt:  KRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFP

Query:  NEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAP
        N+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+ RRMPVSRNE+SA GDAPKKRKS+NQR +AD  AQV  A 
Subjt:  NEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAP

Query:  E-----EVKSTKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-
              EVKSTKPQRKR    AYT  T+ KG  ++  NSERCAEQN DTQS  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q 
Subjt:  E-----EVKSTKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-

Query:  ENQNSTGSSGVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSE
        ENQNSTGSSGVS LSEMTNDMDE +NP   PDT  L   T DDI+      +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK +
Subjt:  ENQNSTGSSGVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSE

Query:  SRRKRNSKTSNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDH
         RRK+NSK SNKSQDSLV S QQ EL        H  NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND 
Subjt:  SRRKRNSKTSNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDH

Query:  NDPTIACLLKDKSKKKRHLGS
        NDPT+A LLKDK K+++ L +
Subjt:  NDPTIACLLKDKSKKKRHLGS

XP_022987958.1 uncharacterized protein LOC111485355 isoform X1 [Cucurbita maxima]0.0e+0075.42Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT +ER TSSERSSM+ F KGSLPKAAL+FIDAIKKNRSQQKF+R+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK
        +EENKKLRKRFKVLK FQGSCRRKTTCALSQM+DPRVQL+S  KP  AKDSSKKDKRL AMCYGP ENSHVACYR+ALTKF P VDRKRWSN ERE+L K
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK

Query:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF
        GIRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIKD D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTS+DKNLLF
Subjt:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF

Query:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE
        TIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDE
Subjt:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE

Query:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA
        D+RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFPN+VPLLQEA
Subjt:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA

Query:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK
        RKIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+TRRMPVSRNE+SA GDAPKKRKS+NQR + D  AQV  A       EVK
Subjt:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK

Query:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS
        STKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT++  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSS
Subjt:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS

Query:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT
        GVS LSEMTNDMDE +NP  LPDT  L   T DDI+E KG  +VADKDLD SNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK 
Subjt:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT

Query:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL
        SNKSQDSLV S QQ EL    +   H  NQ KKRK++STNTSP  T EAVEEVDDCTLLGFLQKRLKR  T+H  K D   ST  E+DND NDPT+A LL
Subjt:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL

Query:  KDKSKKKRH
        K+K K+K+H
Subjt:  KDKSKKKRH

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0075.3Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFE LR IQ+RF AYESD   NK D SCD +GPL+MDS NT +ER TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL +MCYGP ENSHVACYR A TKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT
        IRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIK  D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTS+DKNLLFT
Subjt:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT

Query:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED
        IQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVA+FG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDED
Subjt:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED

Query:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR
        +RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFPN+VPLLQEAR
Subjt:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR

Query:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGI-----APEEVKS
        KIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+TRRMPVSRNE+SA GDAPK+RKS+NQR +AD  AQV       +  EVKS
Subjt:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGI-----APEEVKS

Query:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG
        TKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT+S  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSSG
Subjt:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG

Query:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS
        VS LSEMTNDMDE +NP  LPDT  L   T DDI+E KG  +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK S
Subjt:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS

Query:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK
        NKSQDSLV S QQ EL    +   H  NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND NDPT+A LL 
Subjt:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK

Query:  DKSKKKRH
        DK K+K+H
Subjt:  DKSKKKRH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0075.2Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFE LR IQ+RF AYESD   NK D SCD +GPL+MDS NT +ER TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL +MCYGP ENSHVACYR A TKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT
        IRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIK  D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTS+DKNLLFT
Subjt:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT

Query:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED
        IQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVA+FG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDED
Subjt:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED

Query:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR
        +RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNEC RRWK LFPN+VPLLQEAR
Subjt:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR

Query:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGI-----APEEVKS
        KIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+TRRMPVSRNE+SA GDAPK+RKS+NQR +AD  AQV       +  EVKS
Subjt:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGI-----APEEVKS

Query:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG
        TKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT+S  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSSG
Subjt:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG

Query:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS
        VS LSEMTNDMDE +NP  LPDT  L   T DDI+E KG  +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK S
Subjt:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS

Query:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK
        NKSQDSLV S QQ EL    +   H  NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND NDPT+A LL 
Subjt:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK

Query:  DKSKKKRH
        DK K+K+H
Subjt:  DKSKKKRH

TrEMBL top hitse value%identityAlignment
A0A6J1C075 uncharacterized protein LOC111007172 isoform X20.0e+0070.88Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA----------GEDSDDIDDFELVRNIRKQFSISDDEQ-----
        MSRR+ ++GGD E LS GEED EDDV DEDMEALR ACRL G NPEEY NNP  S TA          G DSDD+DD ELVRNIR +FSI+ D++     
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA----------GEDSDDIDDFELVRNIRKQFSISDDEQ-----

Query:  --PLCVLPTASPDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKF
          PL  LP  SPD EEEDDFETLR IQ+RF AYESDA  N LD SCDF GPLEMDSN T + RQTSS RSSM+   KG+LPKAAL+FIDAIKKNRSQQKF
Subjt:  --PLCVLPTASPDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKF

Query:  IRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRW
        IR+KMIHLEAR+EENKKLR+RFK+LKDFQ SCRR+T   LSQMIDPRVQL+S  KP  KDSSKKDK+L AM YGP ENSHVACYRMALTKFP SV+RK+W
Subjt:  IRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRW

Query:  SNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPW
        SNVERE+L KGIRQQFQEMVLQIS+DQ+S LQGFS DSD+LD I ASIKD D TPE+IREFLPKVNW+KLAS YL GRSGAECEARWLNFEDPLINRNPW
Subjt:  SNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPW

Query:  TTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAK
        +TS+DKNLL TIQQKGLNNWI+IAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED+KLRSAVA+ GVGDWQA+ASTLEGRAGTQCSNRWKK+LDPA+
Subjt:  TTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAK

Query:  SKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLF
        +KRG FTPDEDNRLKIAVLLFGPKNWNKKA F+PGRNQVQCRERW NCLDPSLR+C WTEEEDLRLE AIQEHGYSW KVAA V SRTDNECRRRWK LF
Subjt:  SKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLF

Query:  PNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIA
        PNEVPLLQEAR+IQKAALISNFVDRESERPALGPTDFRP PNTNLLCNADDP PA KRNVK R   V R E++ATGDAPK+RKS+N   QA+ A      
Subjt:  PNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIA

Query:  PEEVKSTKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKESEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQEN-QNS
          E  S KPQR++    AYTA  K KGV E   N+E+CAEQNL+T+S  VQL+S  S  TN++C E V ENGMEEC NKV+E  SK D+ F  QE  QNS
Subjt:  PEEVKSTKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKESEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQEN-QNS

Query:  TGSSGVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKR
        TGSSGVS LSEM NDMDE +NP ILPDTA L  +TGDD +  + +  VAD DL DSNS+SLP  C   RT DSEGVDS SVGE TD+ D + K + RR+R
Subjt:  TGSSGVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKR

Query:  NSKTSNKSQDSLVGSSQQVELGTKTSPRHHNQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKRATSHDMKDD---STPLEIDNDHNDPTIACL
         ++             +++E         HNQ KKRK+NST TS   T E VEE DDCTL GFLQK+LK+ T+H  K D   S  LE++++ ND TIA  
Subjt:  NSKTSNKSQDSLVGSSQQVELGTKTSPRHHNQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKRATSHDMKDD---STPLEIDNDHNDPTIACL

Query:  LKDKSKKKRHLGS
        LK+ SKKK+H  S
Subjt:  LKDKSKKKRHLGS

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0075.1Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+ + NP  SL A      G DSDD+DD EL+RNI+ +FS + DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT + R TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL AMCYGP ENSHVACYR ALTKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT
        IRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIK  D TPEKIREFLPKVNW+KLA MYL+GRSGAECEARWLNFEDPLINRN WTTS+DKNLLFT
Subjt:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT

Query:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED
        IQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDED
Subjt:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED

Query:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR
        +RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNEC RRWK LFPN+VPLLQEAR
Subjt:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR

Query:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS
        KIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+ RRMPVSRNE+SA GDAPKK KS+NQR QAD  AQV  A       EVKS
Subjt:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS

Query:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG
        TKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT+S  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSSG
Subjt:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG

Query:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS
        VS LSEMTNDMDE +NP   PDT  L   T DDI+E KG  +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK S
Subjt:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS

Query:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK
        NKSQDSLV S QQ EL        H  NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND NDPT+A LLK
Subjt:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK

Query:  DKSKKKRH
        DK K+K+H
Subjt:  DKSKKKRH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0075.2Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+ + NP  SL A      G DSDD+DD EL+RNI+ +FS + DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT + R TSSERSSMI F KGSLPKAAL+FIDAIKKNRSQQKFIR+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG
        +EENKKLRKRFKVLK FQGSCRRKTTCAL+QM+DPRVQL+S  KP AKDSSKKDKRL AMCYGP ENSHVACYR ALTKF P VDRKRWSN ERE+L KG
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKG

Query:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT
        IRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIK  D TPEKIREFLPKVNW+KLA MYL+GRSGAECEARWLNFEDPLINRN WTTS+DKNLLFT
Subjt:  IRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFT

Query:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED
        IQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDED
Subjt:  IQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDED

Query:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR
        +RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFPN+VPLLQEAR
Subjt:  NRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEAR

Query:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS
        KIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+ RRMPVSRNE+SA GDAPKK KS+NQR QAD  AQV  A       EVKS
Subjt:  KIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVKS

Query:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG
        TKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT+S  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSSG
Subjt:  TKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSSG

Query:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS
        VS LSEMTNDMDE +NP   PDT  L   T DDI+E KG  +VADKDLDDSNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK S
Subjt:  VSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKTS

Query:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK
        NKSQDSLV S QQ EL        H  NQSKKRK++ TNTSP  T EAVEEVDDCTL GFLQKRLKR  T+HD K D   STP E+DND NDPT+A LLK
Subjt:  NKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLLK

Query:  DKSKKKRH
        DK K+K+H
Subjt:  DKSKKKRH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0075.32Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT +ER TSSERSSM+ F KGSLPKAAL+FIDAIKKNRSQQKF+R+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK
        +EENKKLRKRFKVLK FQGSCRRKTTCALSQM+DPRVQL+S  KP  AKDSSKKDKRL AMCYGP ENSHVACYR+ALTKF P VDRKRWSN ERE+L K
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK

Query:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF
        GIRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIKD D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTS+DKNLLF
Subjt:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF

Query:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE
        TIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDE
Subjt:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE

Query:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA
        D+RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNEC RRWK LFPN+VPLLQEA
Subjt:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA

Query:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK
        RKIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+TRRMPVSRNE+SA GDAPKKRKS+NQR + D  AQV  A       EVK
Subjt:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK

Query:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS
        STKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT++  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSS
Subjt:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS

Query:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT
        GVS LSEMTNDMDE +NP  LPDT  L   T DDI+E KG  +VADKDLD SNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK 
Subjt:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT

Query:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL
        SNKSQDSLV S QQ EL    +   H  NQ KKRK++STNTSP  T EAVEEVDDCTLLGFLQKRLKR  T+H  K D   ST  E+DND NDPT+A LL
Subjt:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL

Query:  KDKSKKKRH
        K+K K+K+H
Subjt:  KDKSKKKRH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0075.42Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS
        MSRR+  DGGD E L A EED EDD++D+DME LR ACRLAGVN E+Y+ NP  SL A      G DSDD+DD EL+RNI+ +FSI+ DEQPL +LP  +
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTA------GEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTAS

Query:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR
         D EEEDDFETLR IQ+RF AYESD   NK D SCD +GPL+MDS+NT +ER TSSERSSM+ F KGSLPKAAL+FIDAIKKNRSQQKF+R+KMIHLEAR
Subjt:  PDEEEEDDFETLRRIQQRFGAYESDAF-NKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEAR

Query:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK
        +EENKKLRKRFKVLK FQGSCRRKTTCALSQM+DPRVQL+S  KP  AKDSSKKDKRL AMCYGP ENSHVACYR+ALTKF P VDRKRWSN ERE+L K
Subjt:  MEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKP-HAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEK

Query:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF
        GIRQQFQEMVLQISVDQ+S +QGFS +SD+LD ILASIKD D TPEKIREFLPKVNW+KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTS+DKNLLF
Subjt:  GIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLF

Query:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE
        TIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KLRSAVAIFG GDWQAVASTLEGR G QCSNRWKK+LDPA++KRG+FTPDE
Subjt:  TIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDE

Query:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA
        D+RLKIAVLLFGPKNWNKKA FLPGRNQVQCRERW NCLDPSLRRC WTEEEDLRLE AIQEHGYSWAKVAA V SRTDNECRRRWK LFPN+VPLLQEA
Subjt:  DNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEA

Query:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK
        RKIQK ALISNFVDRESERPALGPTDFRP PN++LLCN DDP+ A KRNV+TRRMPVSRNE+SA GDAPKKRKS+NQR + D  AQV  A       EVK
Subjt:  RKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQVGIAPE-----EVK

Query:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS
        STKPQRKR    AYT  T+ KG  ++  NSERCAEQN DT++  VQL+ KE +E  N+DC ETVDENGME   NK +E  S+  VCFS Q ENQNSTGSS
Subjt:  STKPQRKR----AYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKE-SEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFSGQ-ENQNSTGSS

Query:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT
        GVS LSEMTNDMDE +NP  LPDT  L   T DDI+E KG  +VADKDLD SNS+SLP+ C E RT DSEGVDSYSV E TD+   +CK + RRK+NSK 
Subjt:  GVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSKT

Query:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL
        SNKSQDSLV S QQ EL    +   H  NQ KKRK++STNTSP  T EAVEEVDDCTLLGFLQKRLKR  T+H  K D   ST  E+DND NDPT+A LL
Subjt:  SNKSQDSLVGSSQQVELGTKTSPRHH--NQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKR-ATSHDMKDD---STPLEIDNDHNDPTIACLL

Query:  KDKSKKKRH
        K+K K+K+H
Subjt:  KDKSKKKRH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb2.4e-3039.2Show/hide
Query:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL
        K  WT++EDEKL+  V   G  DW+ +A+ L  R   QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A  L GR   QCRERW N L
Subjt:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL

Query:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +  RTDN  +  W +    +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVP---LLQEARKIQKAALISNF

P52550 Myb-related protein A9.1e-3037.57Show/hide
Query:  QRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQ
        QRSL     + +WT+DEDEKL+  V   G  DW  +AS L+ R+  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A  L GR   Q
Subjt:  QRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQ

Query:  CRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESER
        CRERW N L+P +++  WTE ED  +  A +  G  WA++A  +  RTDN  +  W +            RK+++   + +     SER
Subjt:  CRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESER

Q54NA6 Myb-like protein L1.6e-5836.24Show/hide
Query:  PTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQ-FQEMVLQISVDQLS---------WLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPK
        P +N      R+     P     +RW+  E E L KGI+++  Q+ + ++S D+LS          +Q  S +++N +  + +  + +   +K     P 
Subjt:  PTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQ-FQEMVLQISVDQLS---------WLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPK

Query:  VNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSA
        +       +    RS  E   RW N +DP IN+ P+T  +DK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+EDE L   
Subjt:  VNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSA

Query:  VAIFGVG---DWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEE
        + +   G   DWQ +   + GR G QC +RW KTLDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +  WT +
Subjt:  VAIFGVG---DWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEE

Query:  EDLRLETAIQEHGY-SWAKVAASVTSRTDNECRRRWKTL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTD
        ED RL     + G   W+ VA  + +RTDN+C RRWK L    N +   QE    +K   +SNF  R+ ER  L   D
Subjt:  EDLRLETAIQEHGY-SWAKVAASVTSRTDNECRRRWKTL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTD

Q5SXM2 snRNA-activating protein complex subunit 46.3e-3927.33Show/hide
Query:  IKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTK
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP+ KD             GP  N           K
Subjt:  IKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTK

Query:  FPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWL-QGFSEDSDNLD---------GILASIKDFDFTPEK--IREFLPKVNWEKLASMYLRG-
            +   +W N E+  L K +     + +LQ  + +L +L Q  S+ S  L+              I+D +  PE+  +   L   +WEK++++   G 
Subjt:  FPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWL-QGFSEDSDNLD---------GILASIKDFDFTPEK--IREFLPKVNWEKLASMYLRG-

Query:  RSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGD---WQ
        RS  E    W N E P IN+  W+  +++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG    ++
Subjt:  RSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGD---WQ

Query:  AVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHG
         +   +EGR   Q   RW K+LDP   K+G++ P+ED +L  AV  +G ++W K    +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHG

Query:  YS-WAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARK
           WAK+A+ +  R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 42.5e-3527.23Show/hide
Query:  IKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTK
        ++ N   Q+ IR K+  +   + +N++ ++  ++L D  G+   K     S  +   + +    KP+ KD             GP  N           K
Subjt:  IKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTK

Query:  FPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWL-QGFSEDSDNLD---------GILASIKDFDFTPEK--IREFLPKVNWEKLASMYLRG-
            +   +W + E+  L K +     + +LQ  + +L +L +  S  S  L+              I+D +  PE+  +   L   +WEK++++   G 
Subjt:  FPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWL-QGFSEDSDNLD---------GILASIKDFDFTPEK--IREFLPKVNWEKLASMYLRG-

Query:  RSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGD---WQ
        RS  E    W + E P I++  W+T + + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG+   ++
Subjt:  RSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIFGVGD---WQ

Query:  AVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHG
         +   +EGR   Q   RW K+LDP+  KRG + P+ED +L  AV  +G ++W K    +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHG

Query:  YS-WAKVAASVTSRTDNECRRRWKTL
           WA++A+ +  R+ ++C  +WK L
Subjt:  YS-WAKVAASVTSRTDNECRRRWKTL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-31.1e-2740.27Show/hide
Query:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL
        K  WT +EDE LR AV  F    W+ +A +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A  LPGR   QCRERW N L
Subjt:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL

Query:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKT
        +P + +  WT EE++ L  A + HG  WA++A  +  RTDN  +  W +
Subjt:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKT

AT3G09370.2 myb domain protein 3r-31.1e-2740.27Show/hide
Query:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL
        K  WT +EDE LR AV  F    W+ +A +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A  LPGR   QCRERW N L
Subjt:  KREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCL

Query:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKT
        +P + +  WT EE++ L  A + HG  WA++A  +  RTDN  +  W +
Subjt:  DPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKT

AT3G18100.1 myb domain protein 4r12.3e-16142.72Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEY---------------------------------------------MNNPSSS
        M+R +  +  DD      ++D E+D I ED+E LR AC ++ VN +++                                              + P   
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEY---------------------------------------------MNNPSSS

Query:  LTAGEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTASPDEEEEDDFETLRRIQQRFGAY-----------ESDAFNKLDHSCDFNGPLEMDSNNTAIE
        L+   DS+  DDFE++R+I+ Q S+S D   + + P    D+EE+D FETLR I++RF AY           +S    K  H+ D     E+ S +   E
Subjt:  LTAGEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTASPDEEEEDDFETLRRIQQRFGAY-----------ESDAFNKLDHSCDFNGPLEMDSNNTAIE

Query:  RQTSSERSSMIV------------FGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQL
              +S + V                S P+AA +F+DAI++NR+ QKF+R K+  +EA +E+N+K +K  +++KDFQ SC+R T  AL Q  DPRV+L
Subjt:  RQTSSERSSMIV------------FGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQL

Query:  LSTAKPHAKDSSK----------KDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDN
        +ST K    DSS+           DK++  +  GP EN  V  YRMAL K+P SV R++WS  E ++L KG++Q+ Q+++L  ++++ S L+G + D   
Subjt:  LSTAKPHAKDSSK----------KDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDN

Query:  LDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRY
        +D I  SI + + TPE IR+FLPK+NW+   S+ ++ RS AECEARW++ EDPLIN  PWT ++DKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RY
Subjt:  LDGILASIKDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRY

Query:  QRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQ
        QRSLN SILK+EWT +ED++LR+AV +FG  DWQ+VA+ L+GR GTQCSNRWKK+L P  +++G ++ +ED R+K+AV LFG +NW+K + F+PGR Q Q
Subjt:  QRSLNASILKREWTKDEDEKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQ

Query:  CRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPT
        CRERW NCLDP + R  WTEEED +L  AI EHGYSW+KVA +++ RTDN+C RRWK L+P++V LLQEAR++QK A + NFVDRESERPAL  +     
Subjt:  CRERWSNCLDPSLRRCPWTEEEDLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPT

Query:  PNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQ
        P+ +L    D      KR  K ++    R         PK+R+   +    D   Q
Subjt:  PNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQ

AT3G18100.2 myb domain protein 4r13.1e-15050.76Show/hide
Query:  SLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSK----------KDKRL
        S P+AA +F+DAI++NR+ QKF+R K+  +EA +E+N+K +K  +++KDFQ SC+R T  AL Q  DPRV+L+ST K    DSS+           DK++
Subjt:  SLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSK----------KDKRL

Query:  PAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWE
          +  GP EN  V  YRMAL K+P SV R++WS  E ++L KG++Q+ Q+++L  ++++ S L+G + D   +D I  SI + + TPE IR+FLPK+NW+
Subjt:  PAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFLPKVNWE

Query:  KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIF
           S+ ++ RS AECEARW++ EDPLIN  PWT ++DKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED++LR+AV +F
Subjt:  KLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLRSAVAIF

Query:  GVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLET
        G  DWQ+VA+ L+GR GTQCSNRWKK+L P  +++G ++ +ED R+K+AV LFG +NW+K + F+PGR Q QCRERW NCLDP + R  WTEEED +L  
Subjt:  GVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLET

Query:  AIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVS
        AI EHGYSW+KVA +++ RTDN+C RRWK L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+ +L    D      KR  K ++    
Subjt:  AIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVS

Query:  RNEESATGDAPKKRKSSNQRKQADAAAQ
        R         PK+R+   +    D   Q
Subjt:  RNEESATGDAPKKRKSSNQRKQADAAAQ

AT3G18100.3 myb domain protein 4r11.5e-14141.14Show/hide
Query:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEY---------------------------------------------MNNPSSS
        M+R +  +  DD      ++D E+D I ED+E LR AC ++ VN +++                                              + P   
Subjt:  MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEY---------------------------------------------MNNPSSS

Query:  LTAGEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTASPDEEEEDDFETLRRIQQRFGAYESDAFNKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMI
        L+   DS+  DDFE++R+I+ Q S+S D   + + P    D+EE+D FETLR I++RF AY++  F K  +  D +G  +  + N     QT      +I
Subjt:  LTAGEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTASPDEEEEDDFETLRRIQQRFGAYESDAFNKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMI

Query:  VFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEEN-KKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSK---------
           +    + +L   + +     + +       H+   +EEN +KL++R         S R + T    +M DPRV+L+ST K    DSS+         
Subjt:  VFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEEN-KKLRKRFKVLKDFQGSCRRKTTCALSQMIDPRVQLLSTAKPHAKDSSK---------

Query:  -KDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFL
          DK++  +  GP EN  V  YRMAL K+P SV R++WS  E ++L KG++Q+ Q+++L  ++++ S L+G + D   +D I  SI + + TPE IR+FL
Subjt:  -KDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASIKDFDFTPEKIREFL

Query:  PKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLR
        PK+NW+   S+ ++ RS AECEARW++ EDPLIN  PWT ++DKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED++LR
Subjt:  PKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLR

Query:  SAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEE
        +AV +FG  DWQ+VA+ L+GR GTQCSNRWKK+L P  +++G ++ +ED R+K+AV LFG +NW+K + F+PGR Q QCRERW NCLDP + R  WTEEE
Subjt:  SAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEE

Query:  DLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKT
        D +L  AI EHGYSW+KVA +++ RTDN+C RRWK L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+ +L    D      KR  K 
Subjt:  DLRLETAIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKT

Query:  RRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQ
        ++    R         PK+R+   +    D   Q
Subjt:  RRMPVSRNEESATGDAPKKRKSSNQRKQADAAAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCCGCACCGGTGACGACGGCGGCGACGACGAGAAGCTTTCCGCCGGCGAGGAAGACGGTGAAGACGATGTGATTGATGAGGACATGGAAGCCCTCAGGGGAGC
CTGTAGGCTTGCCGGAGTCAACCCTGAGGAGTATATGAACAATCCGTCCTCGTCGCTTACCGCCGGAGAAGATTCTGATGATATTGATGATTTCGAGCTTGTTCGAAATA
TTCGGAAACAGTTCTCGATTTCGGACGATGAGCAGCCGTTGTGTGTTCTCCCAACGGCGTCCCCCGACGAGGAGGAGGAAGACGATTTCGAGACGCTGCGTCGGATTCAG
CAGCGCTTTGGGGCGTATGAAAGTGATGCTTTCAATAAACTCGATCATTCTTGTGACTTTAATGGCCCTCTGGAGATGGATTCCAACAACACAGCTATTGAGCGACAAAC
ATCCTCAGAAAGGTCATCTATGATAGTCTTTGGAAAGGGAAGCTTGCCAAAGGCGGCATTGTCATTTATTGATGCCATCAAGAAGAATAGGTCTCAACAAAAGTTTATTC
GCACGAAAATGATTCACCTTGAAGCTAGAATGGAGGAGAACAAAAAGCTTAGAAAACGTTTCAAAGTTCTCAAAGATTTCCAGGGTTCATGTAGAAGAAAAACAACTTGT
GCACTATCTCAGATGATCGATCCTCGAGTCCAGTTATTGTCAACTGCAAAACCACATGCTAAGGATTCATCAAAGAAGGACAAGCGATTACCTGCAATGTGTTATGGCCC
AACTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCCCCTTCCGTAGATCGAAAAAGATGGTCCAATGTAGAAAGGGAGAGTCTAGAGAAGGGAA
TAAGGCAACAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAACTTAGTTGGCTACAAGGATTTTCAGAAGATTCAGATAATCTGGATGGCATTCTTGCATCGATA
AAAGACTTTGACTTTACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGAGAAATTGGCTTCTATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGC
AAGGTGGTTAAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGATGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATCG
ACATAGCAGTTTCATTGGGTACAAACAGAACGCCTTTTCAGTGCTTGTCTCGGTACCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACTAAAGATGAGGAT
GAGAAACTTCGATCCGCTGTTGCTATTTTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGTACACAGTGCTCTAATAGGTGGAAAAAAAC
CCTCGACCCAGCTAAGTCAAAAAGAGGCCATTTCACTCCAGATGAAGACAACCGCTTGAAAATTGCTGTACTGCTTTTTGGGCCTAAAAATTGGAACAAAAAAGCATATT
TTTTACCCGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTCCAATTGCTTAGATCCTTCCTTGAGAAGATGTCCATGGACAGAGGAGGAAGATTTAAGGCTGGAGACA
GCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATCTGTGACATCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGACGTTATTTCCCAATGAAGTTCC
TTTGCTCCAGGAAGCTAGAAAGATTCAAAAGGCCGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACCGACTTTCGACCTACACCAA
ATACAAATTTATTATGTAATGCTGATGATCCAAAACCTGCCGCGAAAAGAAATGTGAAGACGAGAAGGATGCCAGTGTCAAGGAATGAAGAGAGTGCTACTGGTGACGCT
CCAAAGAAGAGGAAGTCAAGTAACCAGAGGAAACAAGCTGATGCAGCTGCTCAGGTGGGTATTGCGCCTGAGGAGGTTAAATCTACAAAACCTCAAAGAAAACGAGCTTA
TACCGCAAAGACTAAGACGAAAGGGGTTTCGGAGCTACCTTCTAACAGCGAGAGATGTGCTGAACAGAATTTGGACACTCAAAGCCACGTGGTGCAGCTAGATAGTAAGG
AATCCGAGATGACCAATAATGACTGTGTCGAGACTGTTGATGAGAATGGTATGGAAGAATGTGGAAATAAAGTTTCAGAGAATCCTTCTAAAAGAGATGTATGTTTTTCA
GGACAAGAAAATCAGAACTCTACCGGATCTTCTGGAGTGTCGGCATTGTCGGAGATGACTAATGACATGGATGAGTTCTTCAATCCCTTTATCCTTCCAGATACAGCACA
GTTGGTCCATTCTACTGGGGATGATATTATGGAAAGGAAAGGGGAGGAGAGTGTTGCAGACAAAGATCTCGATGATAGTAACAGTTACTCATTACCACGCTGTTGCTCAG
AATTCAGGACAGCTGACAGCGAAGGCGTCGACAGTTATTCTGTTGGTGAACTTACAGACAGAGACGATAGGATTTGCAAGTCCGAAAGCAGAAGGAAAAGAAATAGTAAA
ACATCAAATAAGAGCCAGGATTCTTTGGTGGGTTCTAGTCAACAAGTCGAGCTAGGGACAAAAACGTCCCCACGTCATCATAATCAATCAAAGAAGAGAAAAAATAACAG
TACAAATACGAGTCCATTTGAAACATCAGAGGCAGTTGAAGAGGTCGACGACTGCACTCTCCTCGGTTTTCTGCAAAAGAGATTGAAGAGGGCAACTAGTCATGACATGA
AAGACGATAGCACACCTCTAGAAATTGATAATGATCATAATGATCCCACCATTGCCTGTCTACTAAAGGATAAATCGAAGAAGAAAAGGCATCTAGGGTCGCCCAGTGGT
GGAGAGCGTAAAACTAACTAG
mRNA sequenceShow/hide mRNA sequence
TTCCTTTCCATGTTTGTGTTCATAGAAGAAGACCGCCACAATTCCAAGACGCCGCTCAAAAGTTTCGTCTGAAATCTTCGAGCCATTGATTCGTACGTTTCGCTCATTGA
TCCATGTCTCGCCGCACCGGTGACGACGGCGGCGACGACGAGAAGCTTTCCGCCGGCGAGGAAGACGGTGAAGACGATGTGATTGATGAGGACATGGAAGCCCTCAGGGG
AGCCTGTAGGCTTGCCGGAGTCAACCCTGAGGAGTATATGAACAATCCGTCCTCGTCGCTTACCGCCGGAGAAGATTCTGATGATATTGATGATTTCGAGCTTGTTCGAA
ATATTCGGAAACAGTTCTCGATTTCGGACGATGAGCAGCCGTTGTGTGTTCTCCCAACGGCGTCCCCCGACGAGGAGGAGGAAGACGATTTCGAGACGCTGCGTCGGATT
CAGCAGCGCTTTGGGGCGTATGAAAGTGATGCTTTCAATAAACTCGATCATTCTTGTGACTTTAATGGCCCTCTGGAGATGGATTCCAACAACACAGCTATTGAGCGACA
AACATCCTCAGAAAGGTCATCTATGATAGTCTTTGGAAAGGGAAGCTTGCCAAAGGCGGCATTGTCATTTATTGATGCCATCAAGAAGAATAGGTCTCAACAAAAGTTTA
TTCGCACGAAAATGATTCACCTTGAAGCTAGAATGGAGGAGAACAAAAAGCTTAGAAAACGTTTCAAAGTTCTCAAAGATTTCCAGGGTTCATGTAGAAGAAAAACAACT
TGTGCACTATCTCAGATGATCGATCCTCGAGTCCAGTTATTGTCAACTGCAAAACCACATGCTAAGGATTCATCAAAGAAGGACAAGCGATTACCTGCAATGTGTTATGG
CCCAACTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCCCCTTCCGTAGATCGAAAAAGATGGTCCAATGTAGAAAGGGAGAGTCTAGAGAAGG
GAATAAGGCAACAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAACTTAGTTGGCTACAAGGATTTTCAGAAGATTCAGATAATCTGGATGGCATTCTTGCATCG
ATAAAAGACTTTGACTTTACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGAGAAATTGGCTTCTATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGA
AGCAAGGTGGTTAAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGATGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGA
TCGACATAGCAGTTTCATTGGGTACAAACAGAACGCCTTTTCAGTGCTTGTCTCGGTACCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACTAAAGATGAG
GATGAGAAACTTCGATCCGCTGTTGCTATTTTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGTACACAGTGCTCTAATAGGTGGAAAAA
AACCCTCGACCCAGCTAAGTCAAAAAGAGGCCATTTCACTCCAGATGAAGACAACCGCTTGAAAATTGCTGTACTGCTTTTTGGGCCTAAAAATTGGAACAAAAAAGCAT
ATTTTTTACCCGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTCCAATTGCTTAGATCCTTCCTTGAGAAGATGTCCATGGACAGAGGAGGAAGATTTAAGGCTGGAG
ACAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATCTGTGACATCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGACGTTATTTCCCAATGAAGT
TCCTTTGCTCCAGGAAGCTAGAAAGATTCAAAAGGCCGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACCGACTTTCGACCTACAC
CAAATACAAATTTATTATGTAATGCTGATGATCCAAAACCTGCCGCGAAAAGAAATGTGAAGACGAGAAGGATGCCAGTGTCAAGGAATGAAGAGAGTGCTACTGGTGAC
GCTCCAAAGAAGAGGAAGTCAAGTAACCAGAGGAAACAAGCTGATGCAGCTGCTCAGGTGGGTATTGCGCCTGAGGAGGTTAAATCTACAAAACCTCAAAGAAAACGAGC
TTATACCGCAAAGACTAAGACGAAAGGGGTTTCGGAGCTACCTTCTAACAGCGAGAGATGTGCTGAACAGAATTTGGACACTCAAAGCCACGTGGTGCAGCTAGATAGTA
AGGAATCCGAGATGACCAATAATGACTGTGTCGAGACTGTTGATGAGAATGGTATGGAAGAATGTGGAAATAAAGTTTCAGAGAATCCTTCTAAAAGAGATGTATGTTTT
TCAGGACAAGAAAATCAGAACTCTACCGGATCTTCTGGAGTGTCGGCATTGTCGGAGATGACTAATGACATGGATGAGTTCTTCAATCCCTTTATCCTTCCAGATACAGC
ACAGTTGGTCCATTCTACTGGGGATGATATTATGGAAAGGAAAGGGGAGGAGAGTGTTGCAGACAAAGATCTCGATGATAGTAACAGTTACTCATTACCACGCTGTTGCT
CAGAATTCAGGACAGCTGACAGCGAAGGCGTCGACAGTTATTCTGTTGGTGAACTTACAGACAGAGACGATAGGATTTGCAAGTCCGAAAGCAGAAGGAAAAGAAATAGT
AAAACATCAAATAAGAGCCAGGATTCTTTGGTGGGTTCTAGTCAACAAGTCGAGCTAGGGACAAAAACGTCCCCACGTCATCATAATCAATCAAAGAAGAGAAAAAATAA
CAGTACAAATACGAGTCCATTTGAAACATCAGAGGCAGTTGAAGAGGTCGACGACTGCACTCTCCTCGGTTTTCTGCAAAAGAGATTGAAGAGGGCAACTAGTCATGACA
TGAAAGACGATAGCACACCTCTAGAAATTGATAATGATCATAATGATCCCACCATTGCCTGTCTACTAAAGGATAAATCGAAGAAGAAAAGGCATCTAGGGTCGCCCAGT
GGTGGAGAGCGTAAAACTAACTAGAATAGGATGTCACAATTTTCGTTGCATTTTGAGATAGAGGAGGTATGAAGCAACAGTTTCAACCTTCAATTCACGATGTCGATGGA
GCTCGGAATAACGGTTGATGCTAGAAGAGGCTGTACAATTCTAGGCGATTGTGGGAGAGTATCAAATAAAGCTCCTCCAGTGGCCAAAATGCACAACTACAGGTTTGATA
TTAAAGTTAGGCAAATTTTTGTATTGTAAGAAAAGTTTTAGCAGCTTTCTTTTGTTATGATTGGGTCATTTTTTTGAGCTAATTCCTCTTTATGAACAGCAAGCAAATTT
TGATGAGTTGTTGCTTGCTTGGCATTGATGATCTTCTCTTTGATCATTTCTCTCCATCTTTGATTTAAATGAAATCATGGGCTGTGTATAGTTTTTTTTTTCTTCATAGT
TAATTGCCAG
Protein sequenceShow/hide protein sequence
MSRRTGDDGGDDEKLSAGEEDGEDDVIDEDMEALRGACRLAGVNPEEYMNNPSSSLTAGEDSDDIDDFELVRNIRKQFSISDDEQPLCVLPTASPDEEEEDDFETLRRIQ
QRFGAYESDAFNKLDHSCDFNGPLEMDSNNTAIERQTSSERSSMIVFGKGSLPKAALSFIDAIKKNRSQQKFIRTKMIHLEARMEENKKLRKRFKVLKDFQGSCRRKTTC
ALSQMIDPRVQLLSTAKPHAKDSSKKDKRLPAMCYGPTENSHVACYRMALTKFPPSVDRKRWSNVERESLEKGIRQQFQEMVLQISVDQLSWLQGFSEDSDNLDGILASI
KDFDFTPEKIREFLPKVNWEKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSDDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
EKLRSAVAIFGVGDWQAVASTLEGRAGTQCSNRWKKTLDPAKSKRGHFTPDEDNRLKIAVLLFGPKNWNKKAYFLPGRNQVQCRERWSNCLDPSLRRCPWTEEEDLRLET
AIQEHGYSWAKVAASVTSRTDNECRRRWKTLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPTPNTNLLCNADDPKPAAKRNVKTRRMPVSRNEESATGDA
PKKRKSSNQRKQADAAAQVGIAPEEVKSTKPQRKRAYTAKTKTKGVSELPSNSERCAEQNLDTQSHVVQLDSKESEMTNNDCVETVDENGMEECGNKVSENPSKRDVCFS
GQENQNSTGSSGVSALSEMTNDMDEFFNPFILPDTAQLVHSTGDDIMERKGEESVADKDLDDSNSYSLPRCCSEFRTADSEGVDSYSVGELTDRDDRICKSESRRKRNSK
TSNKSQDSLVGSSQQVELGTKTSPRHHNQSKKRKNNSTNTSPFETSEAVEEVDDCTLLGFLQKRLKRATSHDMKDDSTPLEIDNDHNDPTIACLLKDKSKKKRHLGSPSG
GERKTN