; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G02100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G02100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionmyb family transcription factor PHL5
Genome locationClcChr04:6703867..6709645
RNA-Seq ExpressionClc04G02100
SyntenyClc04G02100
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146408.1 myb family transcription factor PHL5 isoform X1 [Cucumis sativus]1.0e-18385.29Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS+    SPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP PN  NV GY+D+PPIPT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

XP_008442127.1 PREDICTED: uncharacterized protein LOC103486080 isoform X1 [Cucumis melo]1.5e-18587.62Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNGQFP
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  +NVSGYLD+ PIPT     N QFP
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNGQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

XP_031739568.1 myb family transcription factor PHL5 isoform X2 [Cucumis sativus]1.7e-17883.82Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+        PSFCS+    SPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP PN  NV GY+D+PPIPT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida]6.9e-19689.75Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEIQQNHGLITD YSQNFRAQQPWRMG CVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        DSLS IFQSSGENF  D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESSL  +KHYSVPFKDQ   YNSIAQPSFCS    NSPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        S LS SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPT--PDSVRNGQFPSKIS
        ELD+KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+NKP PNN+SGYLD+PPIP+  PD+++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPT--PDSVRNGQFPSKIS

XP_038881144.1 myb family transcription factor PHL5-like isoform X2 [Benincasa hispida]5.2e-19188.25Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEIQQNHGLITD YSQNFRAQQPWRMG CVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        DSLS IFQSSGENF  D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESSL  +KHYSVPFKDQ         PSFCS    NSPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        S LS SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPT--PDSVRNGQFPSKIS
        ELD+KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+NKP PNN+SGYLD+PPIP+  PD+++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPT--PDSVRNGQFPSKIS

TrEMBL top hitse value%identityAlignment
A0A0A0L162 Uncharacterized protein5.0e-18485.29Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS+    SPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP PN  NV GY+D+PPIPT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIPN--NVSGYLDDPPIPTP----DSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

A0A1S3B500 uncharacterized protein LOC103486080 isoform X17.0e-18687.62Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNGQFP
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  +NVSGYLD+ PIPT     N QFP
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNGQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

A0A5A7TRR1 Protein PHR1-LIKE 1 isoform X22.6e-17283.29Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---SDRRNCMN
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE R   +++    N
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---SDRRNCMN

Query:  EVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNG
          T      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  +NVSGYLD+ PIPT     N 
Subjt:  EVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--NNVSGYLDDPPIPTPDSVRNG

Query:  QFPSKIS
        QFPSKIS
Subjt:  QFPSKIS

A0A6J1HTK3 myb family transcription factor PHL5-like5.2e-17380.65Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDS QEI+QNHG++ DC+ QNFRAQQPWRMG CV L AMDEVES EQ +   SKSSSTIINLFESP SAFFATEQCMGIPPI+F++GSSSFD  S
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF
        DS+S IFQSSGEN  LD  E SG DSEF NTLQSVVKSQLCKR F+GFPK+  ++HK+FD+ S ++ KHYSVPFKDQ   YN    PSFCSSQEK SPRF
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL AS+GSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAER+SDRRN M EV 
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPTPDSVRNGQFPSKIS
        +LD KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+     NN+SG LD+P  PTP+S++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPTPDSVRNGQFPSKIS

A0A6J1HW70 myb family transcription factor PHL51.8e-17380.35Show/hide
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEI QNHG+I DCYSQN RAQQPWRMGA VHLSAMDEVESSEQ NL  S SSSTIINLFESP SAFFATEQCMGIPPI+F +GSSSFD AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYN----SIAQPSFCSSQEKN
                        DSAEHSG DSEFSNTL SVV+SQLCKRSFNGFPKT F ++KVFD    +I KH+S+PFKDQ   Y+    SIAQPSFCSSQEKN
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYN----SIAQPSFCSSQEKN

Query:  SPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM
        SPRFSC S+S GSGSSSSSFNGNGF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+R+SDRRN M
Subjt:  SPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM

Query:  NEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPTPDSVRNGQFPSK
        NEV ELD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGK+LK+MFDQQQETNKCFF  NG+NKP PN+ SGYLDDPPIP  +++RN QF + 
Subjt:  NEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPTPDSVRNGQFPSK

Query:  IS
        IS
Subjt:  IS

SwissProt top hitse value%identityAlignment
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 14.0e-3752.05Show/hide
Query:  SIAQPS-FCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS   +SQ   +   S  S  +   +S    N N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPS-FCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
         P+ +E ++      +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q
Subjt:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q0WVU3 Myb family transcription factor PHL52.6e-5245.26Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +    
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

Query:  TNGYNKPIPN-NVSGYLDDPPIPTPDS
             K +P+   S  L DP I +P S
Subjt:  TNGYNKPIPN-NVSGYLDDPPIPTPDS

Q10LZ1 Protein PHOSPHATE STARVATION RESPONSE 14.0e-3752.05Show/hide
Query:  SIAQPS-FCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS   +SQ   +   S  S  +   +S    N N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPS-FCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
         P+ +E ++      +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q
Subjt:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q8GUN5 Protein PHR1-LIKE 11.8e-3755.33Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 14.0e-3746.01Show/hide
Query:  FAEHKVFDESSLAIRKHYSVPFKDQVVSYNS-----------IAQPSFCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCV
        +A+H + D+  L      S  + D ++  NS           I QP     Q   S     +S      +SS+S NG G   K R+RWT +LHE FV+ V
Subjt:  FAEHKVFDESSLAIRKHYSVPFKDQVVSYNS-----------IAQPSFCSSQEKNSPRFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCV

Query:  NRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIE
        N LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E  S  R    +  +T LD K  + I +AL+LQ++VQ++LH+QLEIQR LQL+IE
Subjt:  NRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIE

Query:  EQGKQLKMMFDQQ
        EQGK L+MMF++Q
Subjt:  EQGKQLKMMFDQQ

Arabidopsis top hitse value%identityAlignment
AT5G06800.1 myb-like HTH transcriptional regulator family protein1.8e-5345.26Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +    
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

Query:  TNGYNKPIPN-NVSGYLDDPPIPTPDS
             K +P+   S  L DP I +P S
Subjt:  TNGYNKPIPN-NVSGYLDDPPIPTPDS

AT5G06800.2 myb-like HTH transcriptional regulator family protein1.7e-4343.96Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPRFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein1.3e-3855.33Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein1.3e-3855.33Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.3 Homeodomain-like superfamily protein1.3e-3855.33Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATGGGGGC
TTGTGTTCATCTGTCCGCCATGGATGAAGTTGAATCCTCAGAACAGCTAAATTTGTGTCCGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCGCCTACTTCGG
CTTTCTTCGCAACGGAGCAATGTATGGGGATTCCTCCGATTCAATTTCAGTCTGGTTCTTCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACAATTTTTCAATCCTCC
GGCGAGAATTTCCCTCTCGATTCGGCGGAGCACAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGG
CTTCCCGAAGACTAGTTTTGCTGAGCACAAGGTGTTTGATGAAAGTTCCCTTGCAATCAGGAAGCATTATTCAGTTCCTTTCAAAGACCAAGTAGTGAGTTATAATTCAA
TTGCACAGCCAAGCTTTTGTTCTTCACAAGAGAAGAACTCCCCTAGATTCTCTTGCTTGAGTGCTTCTGTTGGGTCTGGAAGCTCTTCATCTTCCTTCAATGGGAATGGA
TTCACCACCAAAACAAGAATCAGATGGACACAAGATCTCCACGAGAAATTTGTTGACTGTGTTAATCGTCTTGGTGGTGCTGAGAAGGCAACGCCTAAAGCAATCTTGAA
GCTGATGGATTCGGAGGGATTGACCATATTCCATGTGAAGAGTCATTTACAGAAATATCGGATAGCAAAATACATGCCAGAATCTGCAGAAAGGAGGTCTGATAGAAGGA
ACTGCATGAATGAAGTTACTGAACTGGATGCCAAAACTGCCATGCAAATTAAAGACGCCTTGCAACTGCAGCTAGATGTTCAGAGGCGTCTTCATGATCAATTGGAGATA
CAAAGGAAGCTACAATTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATGTTTGATCAACAACAGGAAACTAACAAATGCTTCTTCAGAACTAATGGATACAACAA
ACCAATTCCTAATAACGTGTCAGGTTATCTCGACGATCCTCCGATCCCGACACCCGACAGCGTCCGAAATGGCCAATTCCCATCCAAGATAAGTTAG
mRNA sequenceShow/hide mRNA sequence
CAAAAACTAATAGTTTTATTACAACTTTCCATGAAAGTGCAACACAAAAAAATGGGTTTCGTATTTAAATGAAGATTCATTTCAATTTGTTCTTCATTTTGTGTTCTTGC
TCCTTCGTTATTTGCCCTTCTATTGATTCTTCATTCTTCACTCCCATCCCCTTTTTCCTCGAATAATCTGTCTTCCAATTCGCTGTTATTCATCCTATCTTTCGCATCAA
GTCTTCCAAGAATAATCACAAAGGAATTGTATTCTATGCGTTGATGTTATTTATGCTCTTTCTCTGTTTATCCCTTTTTACTCTCTCAATTCCATCCAAAAATCTTCCAT
ACTCTGTCTTCAAACGAAGATTTGGGGGTTGTTTCTTTCGAAATTTTGTAAATGAACGATTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACTGATTAC
TGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATGGGGGCTTGTGTTCATCTGTCCGCCATGGATGAAGTTGAATCCTCAGAACAGCTAAATTTGTGTC
CGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCGCCTACTTCGGCTTTCTTCGCAACGGAGCAATGTATGGGGATTCCTCCGATTCAATTTCAGTCTGGTTCT
TCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACAATTTTTCAATCCTCCGGCGAGAATTTCCCTCTCGATTCGGCGGAGCACAGTGGTGTAGACTCTGAATTCAGTAA
CACCTTGCAATCGGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGGCTTCCCGAAGACTAGTTTTGCTGAGCACAAGGTGTTTGATGAAAGTTCCCTTGCAATCA
GGAAGCATTATTCAGTTCCTTTCAAAGACCAAGTAGTGAGTTATAATTCAATTGCACAGCCAAGCTTTTGTTCTTCACAAGAGAAGAACTCCCCTAGATTCTCTTGCTTG
AGTGCTTCTGTTGGGTCTGGAAGCTCTTCATCTTCCTTCAATGGGAATGGATTCACCACCAAAACAAGAATCAGATGGACACAAGATCTCCACGAGAAATTTGTTGACTG
TGTTAATCGTCTTGGTGGTGCTGAGAAGGCAACGCCTAAAGCAATCTTGAAGCTGATGGATTCGGAGGGATTGACCATATTCCATGTGAAGAGTCATTTACAGAAATATC
GGATAGCAAAATACATGCCAGAATCTGCAGAAAGGAGGTCTGATAGAAGGAACTGCATGAATGAAGTTACTGAACTGGATGCCAAAACTGCCATGCAAATTAAAGACGCC
TTGCAACTGCAGCTAGATGTTCAGAGGCGTCTTCATGATCAATTGGAGATACAAAGGAAGCTACAATTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATGTTTGA
TCAACAACAGGAAACTAACAAATGCTTCTTCAGAACTAATGGATACAACAAACCAATTCCTAATAACGTGTCAGGTTATCTCGACGATCCTCCGATCCCGACACCCGACA
GCGTCCGAAATGGCCAATTCCCATCCAAGATAAGTTAGGCCATGAAAAACTACCTTTTTCTTCATCACCATTCACCAGTACATAACACACTCATTGGGGGGTTCCTGTCT
TCTTTCAAACTTCTGACAATGGAAACAAAAGAAAAAAAAAAAAAAAAAGAAAAAAAAAAGGGGTAAAGATTACAGAGTGTTGTTGAATATACAAACTTTTGAAAGTTGAA
TTAGCCATGGGTTTTGTTATGTTAATTGTCAGTTCTAAGTTGTGAATTAAACTATTTTACCAACACAAAATGATATTGTTTATCATATATGATGTACTTCTCCTTCCCCA
TATCTTATGCTTTTTAATAG
Protein sequenceShow/hide protein sequence
MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSS
GENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPRFSCLSASVGSGSSSSSFNGNG
FTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEI
QRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPNNVSGYLDDPPIPTPDSVRNGQFPSKIS