; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008721 (gene) of Snake gourd v1 genome

Gene IDTan0008721
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 1-like
Genome locationLG01:104089390..104095898
RNA-Seq ExpressionTan0008721
SyntenyTan0008721
Gene Ontology termsGO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038625.1 protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X1 [Cucumis melo var. makuwa]1.5e-8960.13Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR RQMFSGFTKGEIEKMEKLLE+SGEQ LNR+FCQKVTK FNRSSGRAGKPVIKW EV           V+  L+ +                 
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV
             +   K+++   + PK     + ++  + P++         + T+GEKSPDLSELEFEARSSKDGAW                             
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV

Query:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR
                YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCR
Subjt:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR

Query:  CLFLVRYDHDSTEANY
        CLFLVRYDHDSTEA++
Subjt:  CLFLVRYDHDSTEANY

KAG7024610.1 Protein SAWADEE HOMEODOMAIN-like 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-8760.33Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPRDRQMFSGFTKGEI KMEKL+E+SGEQLL+R+FCQKVTK FNRSSGRAGKPVIKWMEV+    +    F                         
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY
                 K+++   + PK     + ++  + P+     G+K PDLSELEFEARSSKDGAW                                     Y
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY

Query:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH
        DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQ+VRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCRCLFL+RYDH
Subjt:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH

Query:  DSTEA
        DSTEA
Subjt:  DSTEA

XP_008466073.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X1 [Cucumis melo]3.6e-8860.38Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR RQMFSGFTKGEIEKMEKLLE+SGEQ LNR+FCQKVTK FNRSSGRAGKPVIKW EV+         ++ S L                    
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV
             +   K+++   + PK     + ++  + P++         + T+GEKSPDLSELEFEARSSKDGAW                             
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV

Query:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR
                YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCR
Subjt:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR

Query:  CLFLVRYDHDSTE
        CLFLVRYDHDSTE
Subjt:  CLFLVRYDHDSTE

XP_022140174.1 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X1 [Momordica charantia]3.0e-8760.53Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR+RQ+FSGFTK EIEKMEKLLE+SGEQLLNREF QKVTKGFNRSSGRAGKP+IKW EV          F                         
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY
                 K+++   + PK     + ++  + P+     GEKSPDLSELEFEARSSKDGAW                                     Y
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY

Query:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH
        DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK+GDLVLCFQERRDQAIYYDA I+E+QRRMHDIRGCRCLFLVRYDH
Subjt:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH

Query:  DSTE
        DSTE
Subjt:  DSTE

XP_022936583.1 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X3 [Cucurbita moschata]9.8e-9462.15Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPRDRQMFSGFTKGEI KMEKL+E+SGEQLL+R+FCQKVTK F                                                +YTIG
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPK-------------IVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFAL
         K DCKTSQKLKRGCLKFPKLAL IRLRKVLKAPK             +    G+K PDLSELEFEARSSKDGAW                         
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPK-------------IVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFAL

Query:  CSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDI
                    YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQ+VRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDI
Subjt:  CSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDI

Query:  RGCRCLFLVRYDHDSTE
        RGCRCLFL+RYDHDSTE
Subjt:  RGCRCLFLVRYDHDSTE

TrEMBL top hitse value%identityAlignment
A0A1S3CQD5 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X11.7e-8860.38Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR RQMFSGFTKGEIEKMEKLLE+SGEQ LNR+FCQKVTK FNRSSGRAGKPVIKW EV+         ++ S L                    
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV
             +   K+++   + PK     + ++  + P++         + T+GEKSPDLSELEFEARSSKDGAW                             
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV

Query:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR
                YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCR
Subjt:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR

Query:  CLFLVRYDHDSTE
        CLFLVRYDHDSTE
Subjt:  CLFLVRYDHDSTE

A0A5A7TB77 Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X17.1e-9060.13Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR RQMFSGFTKGEIEKMEKLLE+SGEQ LNR+FCQKVTK FNRSSGRAGKPVIKW EV           V+  L+ +                 
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV
             +   K+++   + PK     + ++  + P++         + T+GEKSPDLSELEFEARSSKDGAW                             
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKI---------VATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLV

Query:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR
                YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCR
Subjt:  FLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCR

Query:  CLFLVRYDHDSTEANY
        CLFLVRYDHDSTEA++
Subjt:  CLFLVRYDHDSTEANY

A0A6J1CF02 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X11.5e-8760.53Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPR+RQ+FSGFTK EIEKMEKLLE+SGEQLLNREF QKVTKGFNRSSGRAGKP+IKW EV          F                         
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY
                 K+++   + PK     + ++  + P+     GEKSPDLSELEFEARSSKDGAW                                     Y
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY

Query:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH
        DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK+GDLVLCFQERRDQAIYYDA I+E+QRRMHDIRGCRCLFLVRYDH
Subjt:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH

Query:  DSTE
        DSTE
Subjt:  DSTE

A0A6J1FDM4 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X34.7e-9462.15Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPRDRQMFSGFTKGEI KMEKL+E+SGEQLL+R+FCQKVTK F                                                +YTIG
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPK-------------IVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFAL
         K DCKTSQKLKRGCLKFPKLAL IRLRKVLKAPK             +    G+K PDLSELEFEARSSKDGAW                         
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPK-------------IVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFAL

Query:  CSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDI
                    YDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQ+VRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDI
Subjt:  CSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDI

Query:  RGCRCLFLVRYDHDSTE
        RGCRCLFL+RYDHDSTE
Subjt:  RGCRCLFLVRYDHDSTE

A0A6J1IMF0 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like1.5e-8760.53Show/hide
Query:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG
        MERLRPRDRQMFSGFTKGEI KMEKL+E+SGEQLL+R+FCQKVTK FNRSSGRAGKPVIKWMEV+    +    F                         
Subjt:  MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIG

Query:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY
                 K+++   + PK     + ++  + P+     G+K PDLSELEFEARSSKDGAW                                     Y
Subjt:  SKVDCKTSQKLKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKY

Query:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH
        DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEW+NIKQAVRERSVPLEHSECQKVK GDLVLCFQERRDQAIYYDA IVEVQRRMHDIRGCRCLFL+RYDH
Subjt:  DVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDH

Query:  DSTE
        DSTE
Subjt:  DSTE

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 21.0e-3735.05Show/hide
Query:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG
        F   E+ +ME +L      +  R   + +   F+ S  R GK V++          FK              +I+ + Q  RY + ++ + K   KL   
Subjt:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG

Query:  CLKFPKLALQIRLRKV---LKAPKIVATTG------------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFL
         +  P++ L  ++R V   L  PK    TG                      D S LEFEA+S++DGAW                               
Subjt:  CLKFPKLALQIRLRKV---LKAPKIVATTG------------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFL

Query:  FLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCL
              YDV  FL HR L  G+ EV+VRF GF  EEDEWIN+K+ VR+RS+P E SEC  V  GDLVLCFQE +DQA+Y+DA +++ QRR HD+RGCRC 
Subjt:  FLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCL

Query:  FLVRYDHDSTE
        FLVRY HD +E
Subjt:  FLVRYDHDSTE

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 11.3e-4034.88Show/hide
Query:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL
        F+ FT  EI  ME L ++ G+Q L+++FCQ V   F+ S  R GK  I W +V +                       +F +  ++    K     S  L
Subjt:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL

Query:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF
        +   L  P   A        +     V T   K+ DL++L FEA+S++D AW                                     YDV+ FLT+R 
Subjt:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF

Query:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANYVAD
        L +GE EVRVRF GF    DEW+N+K +VRERS+P+E SEC +V +GDL+LCFQER DQA+Y D  ++ ++R +HD   C C+FLVRY+ D+TE +   +
Subjt:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANYVAD

Query:  R
        R
Subjt:  R

Arabidopsis top hitse value%identityAlignment
AT1G15215.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors4.3e-3934.62Show/hide
Query:  MEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRGCLKFP-KL
        ME L ++ G+Q L+++FCQ V   F+ S  R GK  I W +V +                       +F +  ++    K     S  L+   L  P   
Subjt:  MEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRGCLKFP-KL

Query:  ALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVR
        A        +     V T   K+ DL++L FEA+S++D AW                                     YDV+ FLT+R L +GE EVRVR
Subjt:  ALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRVR

Query:  FVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANY
        F GF    DEW+N+K +VRERS+P+E SEC +V +GDL+LCFQER DQA+Y D  ++ ++R +HD   C C+FLVRY+ D+TE  +
Subjt:  FVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANY

AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors9.2e-4234.88Show/hide
Query:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL
        F+ FT  EI  ME L ++ G+Q L+++FCQ V   F+ S  R GK  I W +V +                       +F +  ++    K     S  L
Subjt:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL

Query:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF
        +   L  P   A        +     V T   K+ DL++L FEA+S++D AW                                     YDV+ FLT+R 
Subjt:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF

Query:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANYVAD
        L +GE EVRVRF GF    DEW+N+K +VRERS+P+E SEC +V +GDL+LCFQER DQA+Y D  ++ ++R +HD   C C+FLVRY+ D+TE +   +
Subjt:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANYVAD

Query:  R
        R
Subjt:  R

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.6e-4135.02Show/hide
Query:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL
        F+ FT  EI  ME L ++ G+Q L+++FCQ V   F+ S  R GK  I W +V +                       +F +  ++    K     S  L
Subjt:  FSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKL

Query:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF
        +   L  P   A        +     V T   K+ DL++L FEA+S++D AW                                     YDV+ FLT+R 
Subjt:  KRGCLKFP-KLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRF

Query:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANY
        L +GE EVRVRF GF    DEW+N+K +VRERS+P+E SEC +V +GDL+LCFQER DQA+Y D  ++ ++R +HD   C C+FLVRY+ D+TE  +
Subjt:  LSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANY

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding5.1e-4034.85Show/hide
Query:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG
        F   E+ +ME +L      +  R   + +   F+ S  R GK V++          FK              +I+ + Q  RY + ++ + K   KL   
Subjt:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG

Query:  CLKFPKLALQIRLRKV---LKAPKIVATTG------------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFL
         +  P++ L  ++R V   L  PK    TG                      D S LEFEA+S++DGAW                               
Subjt:  CLKFPKLALQIRLRKV---LKAPKIVATTG------------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFL

Query:  FLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCL
              YDV  FL HR L  G+ EV+VRF GF  EEDEWIN+K+ VR+RS+P E SEC  V  GDLVLCFQE +DQA+Y+DA +++ QRR HD+RGCRC 
Subjt:  FLIINKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCL

Query:  FLVRYDHDSTEANYVADRHI----NSDFRL
        FLVRY HD +E   V  R I     +D+RL
Subjt:  FLVRYDHDSTEANYVADRHI----NSDFRL

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding2.3e-4035.17Show/hide
Query:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG
        F   E+ +ME +L      +  R   + +   F+ S  R GK V++          FK              +I+ + Q  RY + ++ + K   KL   
Subjt:  FTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQKLKRG

Query:  CLKFPKLALQIRLRKV---LKAPKIVATTG---------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLI
         +  P++ L  ++R V   L  PK    TG                   D S LEFEA+S++DGAW                                  
Subjt:  CLKFPKLALQIRLRKV---LKAPKIVATTG---------------EKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLI

Query:  INKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLV
           YDV  FL HR L  G+ EV+VRF GF  EEDEWIN+K+ VR+RS+P E SEC  V  GDLVLCFQE +DQA+Y+DA +++ QRR HD+RGCRC FLV
Subjt:  INKYDVAMFLTHRFLSSGEAEVRVRFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLV

Query:  RYDHDSTEANYVADRHI----NSDFRL
        RY HD +E   V  R I     +D+RL
Subjt:  RYDHDSTEANYVADRHI----NSDFRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCGTCTACGCCCGAGAGACAGGCAGATGTTTTCTGGGTTTACGAAGGGCGAGATAGAGAAAATGGAGAAATTGCTAGAGGATTCAGGAGAACAGTTGCTTAATCG
AGAATTTTGTCAAAAAGTTACAAAAGGTTTCAATCGGTCCTCTGGTCGTGCTGGGAAGCCTGTAATAAAGTGGATGGAGGTCTTTGTTGTGAGTTCTGCCTTTAAGCCAT
CTTTTGTATTTTCATGTTTGAAAAGAAAAGGTTCTGGTGAAATTTTTTTCTTCATACAATGTTACAGGTATACGATTGGCTCCAAAGTAGACTGCAAGACTTCCCAAAAA
TTGAAAAGAGGATGTCTGAAATTCCCAAAGCTTGCCCTTCAAATAAGACTCAGGAAAGTTCTCAAGGCCCCGAAGATAGTAGCTACTACAGGTGAAAAGAGCCCAGATTT
GTCAGAGTTGGAATTTGAAGCAAGGTCATCAAAGGATGGTGCATGGAATAGGAGTTTCGAAGGAAAATATAGAGGATTGATGGGAAGTGTGGGATTATTTGATGCCTGGT
CCTTTGCATTGTGTTCATTGGTATTTTTGTTTCTTATCATAAACAAGTATGATGTTGCTATGTTCCTTACGCATAGATTTCTTAGTTCCGGTGAAGCTGAAGTGCGTGTC
AGATTTGTCGGATTTGGAGCTGAGGAAGATGAGTGGATCAACATAAAACAGGCAGTACGAGAACGCTCTGTCCCTCTTGAACATTCAGAGTGCCAAAAGGTGAAGATTGG
GGATCTTGTACTCTGCTTCCAGGAGAGGAGAGATCAAGCAATCTACTACGATGCCCGTATTGTAGAAGTTCAGAGGAGAATGCATGATATTAGGGGCTGCAGGTGTCTTT
TCTTGGTTCGCTATGATCATGATAGCACCGAGGCAAATTATGTCGCAGACCGGCACATCAATTCTGATTTTCGACTTTGCATCAATTCATCACCTCTCTACTCGTCATTC
CCATATCAAAGCTGGAGATATGAATGGTTTGCCACAGAAATTACAATAGCTAATGCTAGTTTTGTCTGTTCTTGCAGAAAGTTTCTCCCAAAAGCCTTGGGAACAAGAAT
GGGACACCCACATGTAATTGTTGCATATTCTATGTCCATGCAAATAGGGCAAGGCGAGTTCAAATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCGTCTACGCCCGAGAGACAGGCAGATGTTTTCTGGGTTTACGAAGGGCGAGATAGAGAAAATGGAGAAATTGCTAGAGGATTCAGGAGAACAGTTGCTTAATCG
AGAATTTTGTCAAAAAGTTACAAAAGGTTTCAATCGGTCCTCTGGTCGTGCTGGGAAGCCTGTAATAAAGTGGATGGAGGTCTTTGTTGTGAGTTCTGCCTTTAAGCCAT
CTTTTGTATTTTCATGTTTGAAAAGAAAAGGTTCTGGTGAAATTTTTTTCTTCATACAATGTTACAGGTATACGATTGGCTCCAAAGTAGACTGCAAGACTTCCCAAAAA
TTGAAAAGAGGATGTCTGAAATTCCCAAAGCTTGCCCTTCAAATAAGACTCAGGAAAGTTCTCAAGGCCCCGAAGATAGTAGCTACTACAGGTGAAAAGAGCCCAGATTT
GTCAGAGTTGGAATTTGAAGCAAGGTCATCAAAGGATGGTGCATGGAATAGGAGTTTCGAAGGAAAATATAGAGGATTGATGGGAAGTGTGGGATTATTTGATGCCTGGT
CCTTTGCATTGTGTTCATTGGTATTTTTGTTTCTTATCATAAACAAGTATGATGTTGCTATGTTCCTTACGCATAGATTTCTTAGTTCCGGTGAAGCTGAAGTGCGTGTC
AGATTTGTCGGATTTGGAGCTGAGGAAGATGAGTGGATCAACATAAAACAGGCAGTACGAGAACGCTCTGTCCCTCTTGAACATTCAGAGTGCCAAAAGGTGAAGATTGG
GGATCTTGTACTCTGCTTCCAGGAGAGGAGAGATCAAGCAATCTACTACGATGCCCGTATTGTAGAAGTTCAGAGGAGAATGCATGATATTAGGGGCTGCAGGTGTCTTT
TCTTGGTTCGCTATGATCATGATAGCACCGAGGCAAATTATGTCGCAGACCGGCACATCAATTCTGATTTTCGACTTTGCATCAATTCATCACCTCTCTACTCGTCATTC
CCATATCAAAGCTGGAGATATGAATGGTTTGCCACAGAAATTACAATAGCTAATGCTAGTTTTGTCTGTTCTTGCAGAAAGTTTCTCCCAAAAGCCTTGGGAACAAGAAT
GGGACACCCACATGTAATTGTTGCATATTCTATGTCCATGCAAATAGGGCAAGGCGAGTTCAAATCTTAA
Protein sequenceShow/hide protein sequence
MERLRPRDRQMFSGFTKGEIEKMEKLLEDSGEQLLNREFCQKVTKGFNRSSGRAGKPVIKWMEVFVVSSAFKPSFVFSCLKRKGSGEIFFFIQCYRYTIGSKVDCKTSQK
LKRGCLKFPKLALQIRLRKVLKAPKIVATTGEKSPDLSELEFEARSSKDGAWNRSFEGKYRGLMGSVGLFDAWSFALCSLVFLFLIINKYDVAMFLTHRFLSSGEAEVRV
RFVGFGAEEDEWINIKQAVRERSVPLEHSECQKVKIGDLVLCFQERRDQAIYYDARIVEVQRRMHDIRGCRCLFLVRYDHDSTEANYVADRHINSDFRLCINSSPLYSSF
PYQSWRYEWFATEITIANASFVCSCRKFLPKALGTRMGHPHVIVAYSMSMQIGQGEFKS