CuGenDBv2

CcUC04G061090 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC04G061090
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: myb family transcription factor PHL5
Genome location: CicolChr04:8493317..8498549
RNA-Seq Expression: CcUC04G061090
Synteny: CcUC04G061090
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR006447 - Myb domain, plants
  IPR009057 - Homeobox-like domain superfamily
  IPR017930 - Myb domain
  IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
  IPR044848 - PHR1-like
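
The genome location field above uses a `sequence:start..end` notation. As a minimal sketch, the following Python snippet splits such a string into its parts and derives the span length. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "seqid:start..end".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(match["seqid"], start, end)


loc = parse_location("CicolChr04:8493317..8498549")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 5233 bp on CicolChr04.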


Homology
GenBank top hits | e value | % identity
XP_004146408.1 myb family transcription factor PHL5 isoform X1 [Cucumis sativus] | 8.8e-183 | 84.8
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS+    SP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP P  SNV GY+D+PP+PT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

XP_008442127.1 PREDICTED: uncharacterized protein LOC103486080 isoform X1 [Cucumis melo] | 1.9e-185 | 87.38
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNGQFP
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  SNVSGYLD+ P+PT     N QFP
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNGQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

XP_031739568.1 myb family transcription factor PHL5 isoform X2 [Cucumis sativus] | 1.1e-177 | 83.33
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+        PSFCS+    SP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP P  SNV GY+D+PP+PT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida] | 8.5e-194 | 88.75
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEIQQNHGLITD YSQNFRAQQPWRMG CVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        DSLS IFQSSGENF  D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESSL  +KHYSVPFKDQ   YNSIAQPSFCS    NSP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        S LS SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTA--DSVRNGQFPSKIS
        ELD+KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+NKP P+N+SGYLD+PP+P+   D+++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTA--DSVRNGQFPSKIS

XP_038881144.1 myb family transcription factor PHL5-like isoform X2 [Benincasa hispida] | 8.2e-189 | 87.25
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEIQQNHGLITD YSQNFRAQQPWRMG CVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        DSLS IFQSSGENF  D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESSL  +KHYSVPFKDQ         PSFCS    NSP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        S LS SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTA--DSVRNGQFPSKIS
        ELD+KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+NKP P+N+SGYLD+PP+P+   D+++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTA--DSVRNGQFPSKIS

TrEMBL top hits | e value | % identity
A0A0A0L162 Uncharacterized protein | 4.3e-183 | 84.8
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMGAC HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSS ENF LDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS+    SP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL  S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      +NKP P  SNV GY+D+PP+PT     D++RN
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----YNKPIP--SNVSGYLDDPPVPT----ADSVRN

Query:  GQFPSKIS
         QFPSKIS
Subjt:  GQFPSKIS

A0A1S3B500 uncharacterized protein LOC103486080 isoform X1 | 9.2e-186 | 87.38
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVT
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNGQFP
        ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  SNVSGYLD+ P+PT     N QFP
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNGQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

A0A5A7TRR1 Protein PHR1-LIKE 1 isoform X2 | 3.4e-172 | 83.05
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMGACVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        +SLSTIFQSSGENF LDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS  I+KHYSVPFKDQ+  YNSIAQPSFCS    NSP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---SDRRNCMN
        SCLS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE R   +++    N
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---SDRRNCMN

Query:  EVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNG
          T      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG +NKP P  SNVSGYLD+ P+PT     N 
Subjt:  EVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-YNKPIP--SNVSGYLDDPPVPTADSVRNG

Query:  QFPSKIS
        QFPSKIS
Subjt:  QFPSKIS

A0A6J1HTK3 myb family transcription factor PHL5-like | 2.2e-171 | 80.15
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDS QEI+QNHG++ DC+ QNFRAQQPWRMG CV L AMDEVES EQ +   SKSSSTIINLFESP SAFFATEQCMGIPPI+F++GSSSFD  S
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF
        DS+S IFQSSGEN  LD  E SG DSEF NTLQSVVKSQLCKR F+GFPK+  ++HK+FD+ S ++ KHYSVPFKDQ   YN    PSFCSSQEK SP F
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIF

Query:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT
        SCL AS+GSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAER+SDRRN M EV 
Subjt:  SCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVT

Query:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTADSVRNGQFPSKIS
        +LD KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG+N     N+SG LD+P  PT +S++N QFPSKIS
Subjt:  ELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTADSVRNGQFPSKIS

A0A6J1HW70 myb family transcription factor PHL5 | 8.9e-173 | 79.85
Query:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MNDYGIDSKQEI QNHG+I DCYSQN RAQQPWRMGA VHLSAMDEVESSEQ NL  S SSSTIINLFESP SAFFATEQCMGIPPI+F +GSSSFD AS
Subjt:  MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYN----SIAQPSFCSSQEKN
                        DSAEHSG DSEFSNTL SVV+SQLCKRSFNGFPKT F ++KVFD    +I KH+S+PFKDQ   Y+    SIAQPSFCSSQEKN
Subjt:  DSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYN----SIAQPSFCSSQEKN

Query:  SPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM
        SP FSC S+S GSGSSSSSFNGNGF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+R+SDRRN M
Subjt:  SPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM

Query:  NEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTADSVRNGQFPSK
        NEV ELD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGK+LK+MFDQQQETNKCFF  NG+NKP P++ SGYLDDPP+P A+++RN QF + 
Subjt:  NEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTADSVRNGQFPSK

Query:  IS
        IS
Subjt:  IS

SwissProt top hits | e value | % identity
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 1 | 4.0e-37 | 52.05
Query:  SIAQPS-FCSSQEKNSPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS   +SQ   +   S  S  +   +S    N N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPS-FCSSQEKNSPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
         P+ +E ++      +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q
Subjt:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q0WVU3 Myb family transcription factor PHL5 | 7.5e-52 | 44.1
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ P FS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

Query:  TNGYNKPIPSNVSGYLDDPPVP
             +   S +  ++  PP P
Subjt:  TNGYNKPIPSNVSGYLDDPPVP

Q10LZ1 Protein PHOSPHATE STARVATION RESPONSE 1 | 4.0e-37 | 52.05
Query:  SIAQPS-FCSSQEKNSPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS   +SQ   +   S  S  +   +S    N N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPS-FCSSQEKNSPIFSCLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
         P+ +E ++      +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q
Subjt:  MPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q8GUN5 Protein PHR1-LIKE 1 | 1.8e-37 | 55.33
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 1 | 6.2e-38 | 58.04
Query:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQ
        S++S N N  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E  S  R    +  +T LD K  + 
Subjt:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQ

Query:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Arabidopsis top hits | e value | % identity
AT4G28610.1 phosphate starvation response 1 | 4.4e-39 | 58.04
Query:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQ
        S++S N N  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E  S  R    +  +T LD K  + 
Subjt:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNC--MNEVTELDAKTAMQ

Query:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

AT5G06800.1 myb-like HTH transcriptional regulator family protein | 5.3e-53 | 44.1
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ P FS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

Query:  TNGYNKPIPSNVSGYLDDPPVP
             +   S +  ++  PP P
Subjt:  TNGYNKPIPSNVSGYLDDPPVP

AT5G06800.2 myb-like HTH transcriptional regulator family protein | 1.5e-42 | 43.59
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA       S SL+ 
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDES-SLAI

Query:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS
           H       +  S +++   +F SSQ   +++ P FS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS
Subjt:  R-KHYSVPFKDQVVSYNSIAQPSFCSSQ---EKNSPIFSC-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDS

Query:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL
        +GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  EGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein | 1.3e-38 | 55.33
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein | 1.3e-38 | 55.33
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

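In the alignments above, the unlabeled line between each Query and Subjt pair is a match line. The sketch below counts matches from such a line, assuming the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The page does not state which aligner produced the output, so that convention is an assumption, and `midline_stats` is a hypothetical helper name.

```python
from typing import NamedTuple


class MidlineStats(NamedTuple):
    identities: int  # columns where the match line shows a letter
    positives: int   # identities plus '+' columns
    length: int      # number of alignment columns, gaps included


def midline_stats(query: str, midline: str) -> MidlineStats:
    """Count identities and positives from a BLAST-style match line."""
    # Trailing spaces are often stripped in copied text, so pad to the query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("match line is longer than the query line")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return MidlineStats(identities, positives, len(query))


# First 20 columns of the second block of the XP_004146408.1 alignment.
stats = midline_stats("DSLSTIFQSSGENFPLDSAE", "+SLSTIFQSS ENF LDSAE")
print(stats)
print(f"{100 * stats.identities / stats.length:.1f}% identity over this fragment")
```

Summing these counts over every block of one hit and dividing by the total number of columns should approximate the % identity reported for that hit, although the exact denominator used by the database is not documented here.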

Sequences
CDS sequence
ATGAACGATTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATG
GGGGCTTGTGTTCATCTGTCCGCCATGGATGAAGTTGAATCCTCAGAACAGCTAAATTTGTGTCCGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCG
CCTACTTCGGCTTTCTTCGCAACGGAGCAATGTATGGGGATTCCTCCGATTCAATTTCAGTCTGGTTCTTCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACA
ATTTTTCAATCCTCCGGCGAGAATTTCCCTCTCGATTCGGCGGAGCACAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTC
TGTAAGAGAAGCTTTAATGGCTTCCCGAAGACTAGTTTCGCTGAGCACAAGGTGTTTGATGAAAGTTCCCTTGCAATCAGGAAGCATTATTCAGTTCCTTTCAAA
GACCAAGTAGTGAGTTATAATTCAATTGCACAGCCAAGCTTTTGTTCTTCACAAGAGAAGAACTCCCCTATATTCTCTTGCTTGAGTGCTTCTGTTGGGTCTGGA
AGCTCTTCATCTTCCTTCAATGGGAATGGATTCACCACCAAAACAAGAATCAGATGGACACAAGATCTCCACGAGAAATTTGTTGACTGTGTTAATCGTCTTGGT
GGTGCTGAGAAGGCAACGCCTAAAGCAATCTTGAAGCTGATGGATTCGGAGGGATTGACCATATTCCATGTGAAGAGTCATTTACAGAAATATCGGATAGCAAAA
TACATGCCAGAATCTGCAGAAAGGAGGTCTGATAGAAGGAACTGCATGAATGAAGTTACCGAACTGGATGCCAAAACTGCCATGCAAATTAAAGACGCCTTGCAA
CTGCAGCTAGATGTTCAGAGGCGTCTTCATGATCAATTGGAGATACAAAGGAAGCTACAATTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATGTTTGAT
CAACAACAGGAAACTAACAAATGCTTCTTCAGAACTAATGGATACAACAAACCGATTCCAAGTAACGTGTCGGGTTATCTCGACGATCCTCCGGTCCCGACAGCC
GACAGCGTCCGAAATGGCCAATTCCCATCCAAGATAAGTTAG
mRNA sequence
ATTTGCCCTTCTATTGATTCTTCATTCTTCACTCCCATCCCCTTCTTCCTCGAATAATCTGTCTTCCAATTCGCTGTTATTCATCCTATCTTTCGCATCAAGTCT
TCCAAGAATAATCACAAAGGAATTGTATTCTATGCGTTGATGTTATTTATGCTCTTTCTCTGTTTATCCCTTTTTACTCTCTCAATTCCATCCAAAAATCTTCCA
TACTCTGTCTTCAAACGAAGATTTGGGGGTTGTTTCTTTCGAAATTTTGTAAATGAACGATTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACT
GATTACTGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATGGGGGCTTGTGTTCATCTGTCCGCCATGGATGAAGTTGAATCCTCAGAACAGCT
AAATTTGTGTCCGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCGCCTACTTCGGCTTTCTTCGCAACGGAGCAATGTATGGGGATTCCTCCGATTCA
ATTTCAGTCTGGTTCTTCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACAATTTTTCAATCCTCCGGCGAGAATTTCCCTCTCGATTCGGCGGAGCACAGTGG
TGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTTAATGGCTTCCCGAAGACTAGTTTCGCTGAGCACAAGGT
GTTTGATGAAAGTTCCCTTGCAATCAGGAAGCATTATTCAGTTCCTTTCAAAGACCAAGTAGTGAGTTATAATTCAATTGCACAGCCAAGCTTTTGTTCTTCACA
AGAGAAGAACTCCCCTATATTCTCTTGCTTGAGTGCTTCTGTTGGGTCTGGAAGCTCTTCATCTTCCTTCAATGGGAATGGATTCACCACCAAAACAAGAATCAG
ATGGACACAAGATCTCCACGAGAAATTTGTTGACTGTGTTAATCGTCTTGGTGGTGCTGAGAAGGCAACGCCTAAAGCAATCTTGAAGCTGATGGATTCGGAGGG
ATTGACCATATTCCATGTGAAGAGTCATTTACAGAAATATCGGATAGCAAAATACATGCCAGAATCTGCAGAAAGGAGGTCTGATAGAAGGAACTGCATGAATGA
AGTTACCGAACTGGATGCCAAAACTGCCATGCAAATTAAAGACGCCTTGCAACTGCAGCTAGATGTTCAGAGGCGTCTTCATGATCAATTGGAGATACAAAGGAA
GCTACAATTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATGTTTGATCAACAACAGGAAACTAACAAATGCTTCTTCAGAACTAATGGATACAACAAACC
GATTCCAAGTAACGTGTCGGGTTATCTCGACGATCCTCCGGTCCCGACAGCCGACAGCGTCCGAAATGGCCAATTCCCATCCAAGATAAGTTAGGCCATGAAAGA
CTACCTTTTTCTTCATCACCATTCACCAGTACATAAACACTCATTGGGGGTTTCCTGTCTTCTTTCAAACTTCTGACAATGGAAACAAAAGAAAAAAAAGAAGAA
AAAGAAAAAGGGTAAGGATTACAGAGTGTTGTTGAATATACAAACTTTTGAAAGTTGAATTAGCCATGGGTTTTGTTATGTTAATTGTCAGTTCTAAGCTGTGAA
TTAAACTATTTTACCAACACAAAATGATATTGTTTATCAGATATGATGTACTTCTCCCTCCCCATATCGAATGC
Protein sequence
MNDYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGACVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLST
IFQSSGENFPLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSLAIRKHYSVPFKDQVVSYNSIAQPSFCSSQEKNSPIFSCLSASVGSG
SSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTAMQIKDALQ
LQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGYNKPIPSNVSGYLDDPPVPTADSVRNGQFPSKIS
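
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code. The `translate` function and the `CODON_TABLE` constant are illustrative names, not part of any database tooling. Only the first and last codons of the CDS are used as inputs here. The full sequence can be passed in the same way once its lines are joined.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGAACGATTACGGAATTGATTCG"))  # MNDYGIDS
print(translate("AAGATAAGTTAG"))              # KIS
```

Both fragments agree with the start (`MNDYGIDS`) and end (`KIS`) of the listed protein sequence.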