; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017940 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017940
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionmyb family transcription factor PHL5
Genome locationChr03:26440278..26444384
RNA-Seq ExpressionHG10017940
SyntenyHG10017940
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146408.1 myb family transcription factor PHL5 isoform X1 [Cucumis sativus]3.3e-18585.85Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMG C HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSS ENFSLDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCS SPRFS L 
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDA
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKPTPN   V GY+D+PPIPT     D++
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV

Query:  RNAQFPSKIS
        RNAQFPSKIS
Subjt:  RNAQFPSKIS

XP_008442127.1 PREDICTED: uncharacterized protein LOC103486080 isoform X1 [Cucumis melo]4.5e-18788.18Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMG CVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSSGENFSLDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCSNSPRFS LS
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDA
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG FNKPTP  + VSGYLD+ PIPT     NAQ
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ

Query:  FPSKIS
        FPSKIS
Subjt:  FPSKIS

XP_031739568.1 myb family transcription factor PHL5 isoform X2 [Cucumis sativus]7.8e-17984.15Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMG C HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSS ENFSLDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+        PSFCS SPRFS L 
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDA
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKPTPN   V GY+D+PPIPT     D++
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV

Query:  RNAQFPSKIS
        RNAQFPSKIS
Subjt:  RNAQFPSKIS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida]4.8e-19790.3Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQPWRMGTCVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        DSLS IFQSSGENFS D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESS T KKHYSVPFKDQ  CYNSIAQPSFCSNSPRFS LS
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELD+
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTA--DSVRNAQFPSK
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPN +SGYLD+PPIP+   D+++NAQFPSK
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTA--DSVRNAQFPSK

Query:  IS
        IS
Subjt:  IS

XP_038881144.1 myb family transcription factor PHL5-like isoform X2 [Benincasa hispida]8.8e-19188.56Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQPWRMGTCVHLS MDEVESSEQLN CPSKS+STIINLFESPTSAFFATEQCMGIPPIQFQSGSS    AS
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        DSLS IFQSSGENFS D AEHSGVDSE SNTLQSVVKSQLCKRSFNGFPKT+FA+HKVFDESS T KKHYSVPFKDQ         PSFCSNSPRFS LS
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         SVGSGSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELD+
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTA--DSVRNAQFPSK
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPN +SGYLD+PPIP+   D+++NAQFPSK
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTA--DSVRNAQFPSK

Query:  IS
        IS
Subjt:  IS

TrEMBL top hitse value%identityAlignment
A0A0A0L162 Uncharacterized protein1.6e-18585.85Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRA+QP RMG C HLSAMDEVESS+ LN CPSK SSTIINLFESP SAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSS ENFSLDSAE SGVDSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCS SPRFS L 
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+G GSSSSSF+GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDA
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKPTPN   V GY+D+PPIPT     D++
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNG----FNKPTPN--IVSGYLDDPPIPT----ADSV

Query:  RNAQFPSKIS
        RNAQFPSKIS
Subjt:  RNAQFPSKIS

A0A1S3B500 uncharacterized protein LOC103486080 isoform X12.2e-18788.18Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMG CVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSSGENFSLDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCSNSPRFS LS
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDA
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ
        KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG FNKPTP  + VSGYLD+ PIPT     NAQ
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ

Query:  FPSKIS
        FPSKIS
Subjt:  FPSKIS

A0A1S3B5M5 protein PHR1-LIKE 1 isoform X27.9e-16987.9Show/hide
Query:  MGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQS
        MG CVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    +SLSTIFQSSGENFSLDSAE SG+DSEFSNTLQS
Subjt:  MGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQS

Query:  VVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFV
        VVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCSNSPRFS LS S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFV
Subjt:  VVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFV

Query:  DCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQR
        DCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELDAKT      AMQIKDALQLQLDVQRRLHDQLEIQR
Subjt:  DCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQR

Query:  KLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQFPSKIS
        KLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG FNKPTP  + VSGYLD+ PIPT     NAQFPSKIS
Subjt:  KLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQFPSKIS

A0A5A7TRR1 Protein PHR1-LIKE 1 isoform X26.6e-17684.73Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEIQQNHGLITD YSQNFRAQQP RMG CVHLSAMDEVESSE+LN CPSK +STIINLFESPTSAFFATEQCMGIPPIQFQSGSSSF    
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS
        +SLSTIFQSSGENFSLDSAE SG+DSEFSNTLQSVVKSQLCKRSFNG PK SF EHKVFD SS TIKKHYSVPFKDQ+ CYNSIAQPSFCSNSPRFS LS
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLS

Query:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA
         S+GSGSSSSSFNGNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE R          TE  A
Subjt:  ASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ
             +  AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR   TNG FNKPTP  + VSGYLD+ PIPT     NAQ
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR---TNG-FNKPTP--NIVSGYLDDPPIPTADSVRNAQ

Query:  FPSKIS
        FPSKIS
Subjt:  FPSKIS

A0A6J1HW70 myb family transcription factor PHL53.0e-16878.68Show/hide
Query:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS
        MN YGIDSKQEI QNHG+I DCYSQN RAQQPWRMG  VHLSAMDEVESSEQ NL  S SSSTIINLFESP SAFFATEQCMGIPPI+F +GSSSFD AS
Subjt:  MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMAS

Query:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYN----SIAQPSFCS----N
                        DSAEHSG DSEFSNTL SVV+SQLCKRSFNGFPKT F ++KVFD   P+I+KH+S+PFKDQ  CY+    SIAQPSFCS    N
Subjt:  DSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYN----SIAQPSFCS----N

Query:  SPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM
        SPRFS  S+S GSGSSSSSFNGNGF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+R+SDRRN M
Subjt:  SPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCM

Query:  NEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTADSVRN
        NEV ELD KT      AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGK+LK+MFDQQQETNKCFF  NGFNKP PN  SGYLDDPPIP A+++RN
Subjt:  NEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTADSVRN

Query:  AQFPSKIS
        AQF + IS
Subjt:  AQFPSKIS

SwissProt top hitse value%identityAlignment
B8B5N8 Protein PHOSPHATE STARVATION RESPONSE 24.5e-3649.44Show/hide
Query:  KDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRI
        +  V  + S AQ S  S S   S ++    SG+S++S       +KTR+RWT +LHE+FVD VN LGG+EKATPK +LKLM ++ LTI+HVKSHLQKYR 
Subjt:  KDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRI

Query:  AKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        A+Y PE +E  S+++    E    D  ++ +      + +AL+LQL++Q+RLH+QLEIQR LQL+IEEQGK L+MM +QQ
Subjt:  AKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q0WVU3 Myb family transcription factor PHL59.2e-5041.99Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA     + SS +  
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK

Query:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
          +      Q +C      +++   +F S+        PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK 
Subjt:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        MDS+GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T       +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQ
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Query:  QETNKCFFRTNGFNKPTPNIVSGYLDDPPIP
        Q+  +   +     + + +++  ++  PP P
Subjt:  QETNKCFFRTNGFNKPTPNIVSGYLDDPPIP

Q6Z156 Protein PHOSPHATE STARVATION RESPONSE 24.5e-3649.44Show/hide
Query:  KDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRI
        +  V  + S AQ S  S S   S ++    SG+S++S       +KTR+RWT +LHE+FVD VN LGG+EKATPK +LKLM ++ LTI+HVKSHLQKYR 
Subjt:  KDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRI

Query:  AKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        A+Y PE +E  S+++    E    D  ++ +      + +AL+LQL++Q+RLH+QLEIQR LQL+IEEQGK L+MM +QQ
Subjt:  AKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Q8GUN5 Protein PHR1-LIKE 12.9e-3553.21Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT      +++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 17.6e-3655.78Show/hide
Query:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYD
        S++S N N  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E  S  R    ++T L+  T +   
Subjt:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYD

Query:  SAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
          + I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  SAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Arabidopsis top hitse value%identityAlignment
AT4G28610.1 phosphate starvation response 15.4e-3755.78Show/hide
Query:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYD
        S++S N N  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E  S  R    ++T L+  T +   
Subjt:  SSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYD

Query:  SAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
          + I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  SAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

AT5G06800.1 myb-like HTH transcriptional regulator family protein6.6e-5141.99Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA     + SS +  
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK

Query:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
          +      Q +C      +++   +F S+        PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK 
Subjt:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        MDS+GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T       +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQ
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Query:  QETNKCFFRTNGFNKPTPNIVSGYLDDPPIP
        Q+  +   +     + + +++  ++  PP P
Subjt:  QETNKCFFRTNGFNKPTPNIVSGYLDDPPIP

AT5G06800.2 myb-like HTH transcriptional regulator family protein6.8e-4041.49Show/hide
Query:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK
        +  P+S  + TE   G+ P    + + SF +   S S  + SS   +   S++   +D   S      +  Q  K  +       FA     + SS +  
Subjt:  FESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLSTIFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIK

Query:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
          +      Q +C      +++   +F S+        PRFS   S S+  GS + +        KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK 
Subjt:  KHYSVPFKDQVVC-----YNSIAQPSFCSNS-------PRFSG-LSASVGSGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKL
        MDS+GLTIFHVKSHLQKYRIAKYMPES E + ++R C  E+++LD +T       +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein2.1e-3653.21Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT      +++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein2.1e-3653.21Show/hide
Query:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA
        SG +SSS   +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   + ++  LD 
Subjt:  SGSSSSSFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERRSDRRNCMNEVTELDA

Query:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        KT      +++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  KTMMMYDSAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGGTTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATG
GGGACTTGTGTTCATCTATCTGCCATGGATGAAGTTGAATCCTCAGAACAGCTAAATTTATGTCCGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCG
CCTACTTCGGCTTTCTTCGCAACGGAGCAATGTATGGGGATTCCACCGATTCAATTTCAGTCTGGTTCTTCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACG
ATTTTTCAATCCTCGGGGGAGAATTTCTCTCTCGATTCGGCGGAGCATAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTC
TGTAAGAGAAGCTTCAATGGCTTCCCGAAGACTAGTTTCGCCGAGCACAAGGTGTTTGATGAAAGTTCCCCTACAATCAAGAAGCATTATTCAGTTCCTTTCAAA
GACCAAGTAGTGTGTTATAATTCAATTGCACAGCCAAGTTTTTGTTCGAACTCTCCTAGATTCTCTGGCTTGAGTGCTTCTGTTGGCTCTGGAAGCTCTTCATCT
TCCTTCAATGGGAATGGATTCACCACCAAAACAAGAATCAGATGGACACAAGATCTCCATGAGAAATTTGTTGACTGTGTTAATCGTCTTGGTGGTGCTGAGAAG
GCAACGCCTAAAGCAATCTTGAAGCTGATGGATTCAGAGGGATTGACCATATTCCATGTGAAGAGTCATTTGCAGAAATATCGGATAGCAAAATACATGCCAGAA
TCTGCAGAAAGGAGGTCTGATAGAAGGAACTGCATGAATGAAGTTACCGAACTGGATGCCAAAACAATGATGATGTATGACAGTGCCATGCAAATTAAAGACGCC
TTGCAACTGCAGCTAGATGTACAGAGGCGTCTTCATGATCAATTGGAGATACAGAGGAAGCTACAGTTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATG
TTTGATCAACAACAGGAAACTAACAAATGTTTCTTCAGAACTAATGGATTCAACAAACCAACTCCTAATATCGTGTCGGGTTATCTGGACGATCCTCCGATCCCG
ACAGCCGACAGCGTCCGAAATGCCCAATTCCCGTCCAAGATAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGGTTACGGAATTGATTCGAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTGTTACTCTCAAAATTTTAGGGCGCAGCAGCCCTGGAGGATG
GGGACTTGTGTTCATCTATCTGCCATGGATGAAGTTGAATCCTCAGAACAGCTAAATTTATGTCCGTCTAAATCCAGTTCCACTATCATCAATCTCTTCGAATCG
CCTACTTCGGCTTTCTTCGCAACGGAGCAATGTATGGGGATTCCACCGATTCAATTTCAGTCTGGTTCTTCGTCTTTCGATATGGCTTCCGATTCGCTTTCCACG
ATTTTTCAATCCTCGGGGGAGAATTTCTCTCTCGATTCGGCGGAGCATAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTC
TGTAAGAGAAGCTTCAATGGCTTCCCGAAGACTAGTTTCGCCGAGCACAAGGTGTTTGATGAAAGTTCCCCTACAATCAAGAAGCATTATTCAGTTCCTTTCAAA
GACCAAGTAGTGTGTTATAATTCAATTGCACAGCCAAGTTTTTGTTCGAACTCTCCTAGATTCTCTGGCTTGAGTGCTTCTGTTGGCTCTGGAAGCTCTTCATCT
TCCTTCAATGGGAATGGATTCACCACCAAAACAAGAATCAGATGGACACAAGATCTCCATGAGAAATTTGTTGACTGTGTTAATCGTCTTGGTGGTGCTGAGAAG
GCAACGCCTAAAGCAATCTTGAAGCTGATGGATTCAGAGGGATTGACCATATTCCATGTGAAGAGTCATTTGCAGAAATATCGGATAGCAAAATACATGCCAGAA
TCTGCAGAAAGGAGGTCTGATAGAAGGAACTGCATGAATGAAGTTACCGAACTGGATGCCAAAACAATGATGATGTATGACAGTGCCATGCAAATTAAAGACGCC
TTGCAACTGCAGCTAGATGTACAGAGGCGTCTTCATGATCAATTGGAGATACAGAGGAAGCTACAGTTGCAAATTGAAGAACAAGGGAAACAACTTAAGATGATG
TTTGATCAACAACAGGAAACTAACAAATGTTTCTTCAGAACTAATGGATTCAACAAACCAACTCCTAATATCGTGTCGGGTTATCTGGACGATCCTCCGATCCCG
ACAGCCGACAGCGTCCGAAATGCCCAATTCCCGTCCAAGATAAGTTAG
Protein sequenceShow/hide protein sequence
MNGYGIDSKQEIQQNHGLITDCYSQNFRAQQPWRMGTCVHLSAMDEVESSEQLNLCPSKSSSTIINLFESPTSAFFATEQCMGIPPIQFQSGSSSFDMASDSLST
IFQSSGENFSLDSAEHSGVDSEFSNTLQSVVKSQLCKRSFNGFPKTSFAEHKVFDESSPTIKKHYSVPFKDQVVCYNSIAQPSFCSNSPRFSGLSASVGSGSSSS
SFNGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRSDRRNCMNEVTELDAKTMMMYDSAMQIKDA
LQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNGFNKPTPNIVSGYLDDPPIPTADSVRNAQFPSKIS