CuGenDBv2

Lag0011302 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011302
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: myb family transcription factor PHL5-like
Genome location: chr1:20627707..20630097
RNA-Seq Expression: Lag0011302
Synteny: Lag0011302
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like


Homology
GenBank top hits (e value, % identity, alignment)
XP_022924882.1 uncharacterized protein LOC111432301 [Cucurbita moschata] | e value: 4.5e-171 | % identity: 81.41
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID  QEIQQ+HG++AD + QNF A+QPW MG CV   AMDE+ES +QQ+FGSSKSSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDR S
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DS+SAIFQSSGEN + D  E+SGADSEFRNTLQSVVKSQLCKR F+GFPKS  +DHKVFDD SHS+ KHYS PFKDQ  C N     SFCSSQEK SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCLG+SVG GSSSSSF GNGFTTKTRIRWTQ LHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ERK DRRN + EVA
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS
        QLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN FN         N D+PP PT ESIRN QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS

XP_022966364.1 myb family transcription factor PHL5-like [Cucurbita maxima] | e value: 1.4e-175 | % identity: 82.66
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID  QEI+QNHG++AD + QNFRA+QPW MGTCV   AMDE+ES +QQ+FGSSKSSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDR S
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DS+SAIFQSSGEN + D  E+SGADSEFRNTLQSVVKSQLCKR F+GFPKS  +DHK+FDD SHS+ KHYS PFKDQ  C N     SFCSSQEK SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCLG+S+GSGSSSSSF GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ERK DRRN + EVA
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS
        QLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN F     NN SGNLDNP  PT ESI+N QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS

XP_023517343.1 myb family transcription factor PHL4-like [Cucurbita pepo subsp. pepo] | e value: 1.4e-172 | % identity: 82.16
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MND+GID  QEIQQNHG++AD + QNFRA+QPW MGTCV   AM+E+ES +QQ+FGSSKSSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDRAS
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DS+SAIFQSSGEN + D  E+SGADSEFRNTLQSVVKSQLCKR F+GFPKS  +DHKVFDD SHSI KHYS PFKDQ  C N     SFCSSQEK SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        S LG+SVG GSSSSSF GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ERK DRRN + EVA
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS
        QLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN FN         N D+PP PT ESIRN QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS

XP_023544661.1 myb family transcription factor PHL5 [Cucurbita pepo subsp. pepo] | e value: 1.0e-170 | % identity: 79.6
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID KQEI QNHG++ D YSQN RA+QPW MG CVH SAMDE+ES +QQN G S SSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDRAS
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCN----SIAQQSFCSSQEKN
                        DS E SGADSEF NTLQSVV+SQLCKRSFNGFPK+IFTD+KVFD    SIGKH+S PFKDQ  C +    SIAQ +FCSSQEKN
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCN----SIAQQSFCSSQEKN

Query:  SPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGI
        SPRFSCL SSVGSGSSSSSF GNGF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES++RK DRRN +
Subjt:  SPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGI

Query:  NEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSK
        NEVA+LDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGK+LK+MFDQQQETNKCFF  N FNKP PN+ SG LD+PPIPT E+IRN QFP+ 
Subjt:  NEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSK

Query:  IS
        IS
Subjt:  IS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida] | e value: 4.1e-172 | % identity: 81.25
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID KQEIQQNHGLI D YSQNFRA+QPW MGTCVH S MDE+ES +Q N   SKS+STIINLFESP  AFFATEQ MGIPPI+FQ+GSS    AS
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DSLS IFQSSGENF+ D  E SG DSE  NTLQSVVKSQLCKRSFNGFPK+ F DHKVFD+ S +  KHYS PFKDQ+ C NSIAQ SFCS    NSPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        S L  SVGSGSSSSSF GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ER+ DRRN +NEV 
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTL--ESIRNVQFPSKIS
        +LD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN FNKP PNN SG LDNPPIP+   ++I+N QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTL--ESIRNVQFPSKIS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L162 Uncharacterized protein | e value: 1.7e-168 | % identity: 79.41
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MN YGID KQEIQQNHGLI D YSQNFRAEQP  MG C H SAMDE+ES Q  N   SK SSTIINLFESPA AFFATEQ MGIPPI+FQ+GSSSF    
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        +SLS IFQSS ENF+ DS EQSG DSEF NTLQSVVKSQLCKRSFNG PK  F +HKVFD  S +I KHYS PFKDQ  C NSIAQ SFCS+    SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCLG S+G GSSSSSF GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ER+CDRRN +NEV 
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA----FNKPIPNNSS--GNLDNPPIPT----LESIRN
        +LD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKP PNNS+  G +DNPPIPT    +++IRN
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA----FNKPIPNNSS--GNLDNPPIPT----LESIRN

Query:  VQFPSKIS
         QFPSKIS
Subjt:  VQFPSKIS

A0A1S3B500 uncharacterized protein LOC103486080 isoform X1 | e value: 1.6e-166 | % identity: 79.21
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MN YGID KQEIQQNHGLI D YSQNFRA+QP  MG CVH SAMDE+ES ++ N   SK +STIINLFESP  AFFATEQ MGIPPI+FQ+GSSSF    
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        +SLS IFQSSGENF+ DS EQSG DSEF NTLQSVVKSQLCKRSFNG PK+ F +HKVFD  S++I KHYS PFKDQ  C NSIAQ SFCS    NSPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCL  S+GSGSSSSSF GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ER+CDRRN +NEV 
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA----FNKPIPNNS--SGNLDNPPIPTLESIRNVQFP
        +LD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKP P+NS  SG LDN PIPT+    N QFP
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA----FNKPIPNNS--SGNLDNPPIPTLESIRNVQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

A0A6J1EA93 uncharacterized protein LOC111432301 | e value: 2.2e-171 | % identity: 81.41
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID  QEIQQ+HG++AD + QNF A+QPW MG CV   AMDE+ES +QQ+FGSSKSSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDR S
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DS+SAIFQSSGEN + D  E+SGADSEFRNTLQSVVKSQLCKR F+GFPKS  +DHKVFDD SHS+ KHYS PFKDQ  C N     SFCSSQEK SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCLG+SVG GSSSSSF GNGFTTKTRIRWTQ LHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ERK DRRN + EVA
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS
        QLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN FN         N D+PP PT ESIRN QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS

A0A6J1HTK3 myb family transcription factor PHL5-like | e value: 6.6e-176 | % identity: 82.66
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID  QEI+QNHG++AD + QNFRA+QPW MGTCV   AMDE+ES +QQ+FGSSKSSSTIINLFESPA AFFATEQ MGIPPIEF+TGSSSFDR S
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF
        DS+SAIFQSSGEN + D  E+SGADSEFRNTLQSVVKSQLCKR F+GFPKS  +DHK+FDD SHS+ KHYS PFKDQ  C N     SFCSSQEK SPRF
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRF

Query:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA
        SCLG+S+GSGSSSSSF GNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES+ERK DRRN + EVA
Subjt:  SCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS
        QLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTN F     NN SGNLDNP  PT ESI+N QFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS

A0A6J1HW70 myb family transcription factor PHL5 | e value: 6.2e-166 | % identity: 78.36
Query:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS
        MNDYGID KQEI QNHG+I D YSQN RA+QPW MG  VH SAMDE+ES +QQN G S SSSTIINLFESPA AFFATEQ MGIPPIEF TGSSSFDRAS
Subjt:  MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRAS

Query:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCN----SIAQQSFCSSQEKN
                        DS E SGADSEF NTL SVV+SQLCKRSFNGFPK+IFTD+KVFD    SI KH+S PFKDQ  C +    SIAQ SFCSSQEKN
Subjt:  DSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCN----SIAQQSFCSSQEKN

Query:  SPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGI
        SPRFSC  SS GSGSSSSSF GNGF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPES++RK DRRN +
Subjt:  SPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGI

Query:  NEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSK
        NEVA+LDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGK+LK+MFDQQQETNKCFF  N FNKP PN+ SG LD+PPIP  E+IRN QF + 
Subjt:  NEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSK

Query:  IS
        IS
Subjt:  IS

SwissProt top hits (e value, % identity, alignment)
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 1 | e value: 1.5e-36 | % identity: 46.19
Query:  KDQAACCNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQ
        K  A   NS A Q   +    +     C  +S    +S++S       +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQ
Subjt:  KDQAACCNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQ

Query:  KYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSS
        KYR A+Y P+ SE K       +E++ LD+K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q     C   T +   P    SS
Subjt:  KYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSS

Query:  GNLDNPPIPT
        G+   P  P+
Subjt:  GNLDNPPIPT

F4J3P7 Myb family transcription factor PHL13 | e value: 1.4e-37 | % identity: 50.3
Query:  NGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESS----ERKCDRRNGINEVAQLDVKTAMQIKDAL
        +  T+K R+RWT +LHE FV+ +N+LGG+E+ATPKA+LKL++S GLT++HVKSHLQKYR A+Y PE S    E        I ++  LD+KT+++I +AL
Subjt:  NGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESS----ERKCDRRNGINEVAQLDVKTAMQIKDAL

Query:  QLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ---QETNKCFFRTNAFNKPIPNNSSGNLDNP
        +LQ+ VQ++LH+QLEIQR LQLQIEEQG+ L+MM ++Q   QE  K    +++  +  P+  S NL  P
Subjt:  QLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ---QETNKCFFRTNAFNKPIPNNSSGNLDNP

Q0WVU3 Myb family transcription factor PHL5 | e value: 9.8e-52 | % identity: 44.24
Query:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK
        + TE + G+ P +  T + SF     S S  + SS   +   S +    D          +  Q  K  +    +S   D    +  S S    +     
Subjt:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK

Query:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL
         Q  C      +++   +F SSQ   +++ PRFS       S  S S  GG+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GL
Subjt:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL

Query:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA
        TIFHVKSHLQKYRIAKYMPES E K ++R    E++QLD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +   
Subjt:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA

Query:  FNKPIPNNSSGNLDNPPIPTL
          +   +    ++ +PP P L
Subjt:  FNKPIPNNSSGNLDNPPIPTL

Q8GUN5 Protein PHR1-LIKE 1 | e value: 5.6e-39 | % identity: 48
Query:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
        D  SH+       PF D       +  QQ   SS+++ S R           +SSSS      T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL
Subjt:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        +++ GLTI+HVKSHLQKYR A+Y PE+SE   +    +   I ++  LD+KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 1 | e value: 8.9e-37 | % identity: 56.64
Query:  SSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRR--NGINEVAQLDVKTAMQ
        S++S   N  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE SE     R    +  +  LD+K  + 
Subjt:  SSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRR--NGINEVAQLDVKTAMQ

Query:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
        I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  IKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Arabidopsis top hits (e value, % identity, alignment)
AT5G06800.1 myb-like HTH transcriptional regulator family protein | e value: 7.0e-53 | % identity: 44.24
Query:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK
        + TE + G+ P +  T + SF     S S  + SS   +   S +    D          +  Q  K  +    +S   D    +  S S    +     
Subjt:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK

Query:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL
         Q  C      +++   +F SSQ   +++ PRFS       S  S S  GG+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GL
Subjt:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL

Query:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA
        TIFHVKSHLQKYRIAKYMPES E K ++R    E++QLD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +   
Subjt:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNA

Query:  FNKPIPNNSSGNLDNPPIPTL
          +   +    ++ +PP P L
Subjt:  FNKPIPNNSSGNLDNPPIPTL

AT5G06800.2 myb-like HTH transcriptional regulator family protein | e value: 2.3e-43 | % identity: 44.07
Query:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK
        + TE + G+ P +  T + SF     S S  + SS   +   S +    D          +  Q  K  +    +S   D    +  S S    +     
Subjt:  FATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSSGENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFK

Query:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL
         Q  C      +++   +F SSQ   +++ PRFS       S  S S  GG+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GL
Subjt:  DQAAC-----CNSIAQQSFCSSQ---EKNSPRFSCLGSSVGSGSSSSSFGGN---GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGL

Query:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKL
        TIFHVKSHLQKYRIAKYMPES E K ++R    E++QLD +T +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  TIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein | e value: 4.0e-40 | % identity: 48
Query:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
        D  SH+       PF D       +  QQ   SS+++ S R           +SSSS      T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL
Subjt:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        +++ GLTI+HVKSHLQKYR A+Y PE+SE   +    +   I ++  LD+KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein | e value: 4.0e-40 | % identity: 48
Query:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
        D  SH+       PF D       +  QQ   SS+++ S R           +SSSS      T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL
Subjt:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        +++ GLTI+HVKSHLQKYR A+Y PE+SE   +    +   I ++  LD+KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.3 Homeodomain-like superfamily protein | e value: 4.0e-40 | % identity: 48
Query:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL
        D  SH+       PF D       +  QQ   SS+++ S R           +SSSS      T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL
Subjt:  DDGSHSIGKHYSGPFKDQAAC-CNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKL

Query:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        +++ GLTI+HVKSHLQKYR A+Y PE+SE   +    +   I ++  LD+KT+++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MDSEGLTIFHVKSHLQKYRIAKYMPESSERKCD----RRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE


Sequences
CDS sequence
ATGAACGATTACGGCATCGATTTCAAGCAAGAAATTCAACAAAATCATGGACTGATTGCTGATTCTTACTCTCAAAATTTTAGGGCAGAGCAGCCCTGGACGATGGGAAC
TTGTGTTCATCAATCCGCCATGGATGAAATTGAATCGTTTCAACAACAAAATTTTGGTTCCTCTAAATCGAGTTCTACCATCATCAATCTGTTTGAATCTCCCGCTTTAG
CGTTCTTTGCTACGGAGCAATATATGGGGATTCCGCCGATTGAGTTTCAAACTGGTTCTTCGTCTTTCGATAGGGCTTCCGATTCACTTTCCGCGATCTTTCAATCCTCC
GGTGAGAATTTCGCTCGCGATTCGGTGGAGCAGAGCGGTGCAGATTCTGAATTCAGGAACACCTTGCAATCAGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGG
CTTCCCAAAGAGTATCTTCACTGACCACAAGGTGTTTGATGATGGTTCTCATTCAATCGGGAAGCACTATTCAGGTCCTTTCAAAGACCAAGCAGCGTGTTGTAATTCAA
TTGCACAACAAAGTTTCTGTTCTTCACAAGAGAAGAACTCACCAAGATTCTCTTGCTTGGGTTCTTCCGTTGGCTCTGGAAGCTCTTCTTCTTCCTTTGGTGGGAATGGA
TTCACCACCAAAACAAGAATAAGATGGACACAAGATCTCCATGAGAAGTTTGTTGATTGTGTTAATCGTCTTGGTGGTGCTGAGAAGGCGACGCCTAAAGCAATTTTGAA
GCTGATGGATTCAGAGGGATTGACCATATTCCACGTGAAGAGTCATTTGCAGAAATATCGGATAGCCAAATACATGCCAGAATCATCAGAAAGGAAATGTGATAGAAGGA
ACGGCATCAATGAAGTTGCCCAACTGGATGTGAAAACTGCCATGCAAATTAAAGACGCTCTTCAACTTCAGTTAGATGTTCAGAGGCGTCTTCATGATCAACTTGAGATT
CAGAGGAAGCTACAGTTGCAAATTGAAGAACAAGGGAAACAACTCAAGATGATGTTTGACCAACAACAAGAAACTAACAAATGCTTCTTCAGAACCAATGCCTTCAACAA
ACCAATCCCTAACAACTCGTCGGGAAATCTCGACAACCCTCCGATCCCGACACTCGAAAGCATCCGAAACGTCCAATTCCCATCCAAGATAAGTTAG
mRNA sequence
ATGAACGATTACGGCATCGATTTCAAGCAAGAAATTCAACAAAATCATGGACTGATTGCTGATTCTTACTCTCAAAATTTTAGGGCAGAGCAGCCCTGGACGATGGGAAC
TTGTGTTCATCAATCCGCCATGGATGAAATTGAATCGTTTCAACAACAAAATTTTGGTTCCTCTAAATCGAGTTCTACCATCATCAATCTGTTTGAATCTCCCGCTTTAG
CGTTCTTTGCTACGGAGCAATATATGGGGATTCCGCCGATTGAGTTTCAAACTGGTTCTTCGTCTTTCGATAGGGCTTCCGATTCACTTTCCGCGATCTTTCAATCCTCC
GGTGAGAATTTCGCTCGCGATTCGGTGGAGCAGAGCGGTGCAGATTCTGAATTCAGGAACACCTTGCAATCAGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGG
CTTCCCAAAGAGTATCTTCACTGACCACAAGGTGTTTGATGATGGTTCTCATTCAATCGGGAAGCACTATTCAGGTCCTTTCAAAGACCAAGCAGCGTGTTGTAATTCAA
TTGCACAACAAAGTTTCTGTTCTTCACAAGAGAAGAACTCACCAAGATTCTCTTGCTTGGGTTCTTCCGTTGGCTCTGGAAGCTCTTCTTCTTCCTTTGGTGGGAATGGA
TTCACCACCAAAACAAGAATAAGATGGACACAAGATCTCCATGAGAAGTTTGTTGATTGTGTTAATCGTCTTGGTGGTGCTGAGAAGGCGACGCCTAAAGCAATTTTGAA
GCTGATGGATTCAGAGGGATTGACCATATTCCACGTGAAGAGTCATTTGCAGAAATATCGGATAGCCAAATACATGCCAGAATCATCAGAAAGGAAATGTGATAGAAGGA
ACGGCATCAATGAAGTTGCCCAACTGGATGTGAAAACTGCCATGCAAATTAAAGACGCTCTTCAACTTCAGTTAGATGTTCAGAGGCGTCTTCATGATCAACTTGAGATT
CAGAGGAAGCTACAGTTGCAAATTGAAGAACAAGGGAAACAACTCAAGATGATGTTTGACCAACAACAAGAAACTAACAAATGCTTCTTCAGAACCAATGCCTTCAACAA
ACCAATCCCTAACAACTCGTCGGGAAATCTCGACAACCCTCCGATCCCGACACTCGAAAGCATCCGAAACGTCCAATTCCCATCCAAGATAAGTTAG
Protein sequence
MNDYGIDFKQEIQQNHGLIADSYSQNFRAEQPWTMGTCVHQSAMDEIESFQQQNFGSSKSSSTIINLFESPALAFFATEQYMGIPPIEFQTGSSSFDRASDSLSAIFQSS
GENFARDSVEQSGADSEFRNTLQSVVKSQLCKRSFNGFPKSIFTDHKVFDDGSHSIGKHYSGPFKDQAACCNSIAQQSFCSSQEKNSPRFSCLGSSVGSGSSSSSFGGNG
FTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESSERKCDRRNGINEVAQLDVKTAMQIKDALQLQLDVQRRLHDQLEI
QRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTNAFNKPIPNNSSGNLDNPPIPTLESIRNVQFPSKIS