CuGenDBv2

CSPI04G20460 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G20460
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: myb family transcription factor PHL5
Genome location: Chr4:18637325..18640840
RNA-Seq Expression: CSPI04G20460
Synteny: CSPI04G20460
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like
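
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the length assumes the coordinates are 1-based and inclusive at both ends, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("Chr4:18637325..18640840")
print(chrom, start, end, span_length(start, end))  # Chr4 18637325 18640840 3516
```

Under the inclusive-coordinate assumption the gene spans 3516 bp on Chr4; with half-open coordinates the figure would be 3515.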


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0044271.1 protein PHR1-LIKE 1 isoform X2 [Cucumis melo var. makuwa] (e value: 5.2e-191; % identity: 88.09)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRA+QPRRMGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKSQLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---CDRRNCMNEVTELDAK
         GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE R    ++    N  T     
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---CDRRNCMNEVTELDAK

Query:  TAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPS
         AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPS
Subjt:  TAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPS

Query:  KIS
        KIS
Subjt:  KIS

XP_004146408.1 myb family transcription factor PHL5 isoform X1 [Cucumis sativus] (e value: 1.2e-224; % identity: 100)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
        PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

XP_008442127.1 PREDICTED: uncharacterized protein LOC103486080 isoform X1 [Cucumis melo] (e value: 5.7e-206; % identity: 92.75)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRA+QPRRMGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKSQLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
         GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

XP_031739568.1 myb family transcription factor PHL5 isoform X2 [Cucumis sativus] (e value: 4.2e-217; % identity: 98)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQI        PSFCSTSPRFSCLGGSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
        PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida] (e value: 2.4e-188; % identity: 86.75)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MN YGIDSKQEIQQNHGLITD+YSQNFRA+QP RMG C HLS MDEVESS+ LNSCPSK +STIINLFESP SAFFATEQCMGIPPIQFQSGSS+ +SLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
         IFQSS ENFS D AE SGVDSE SNTLQSVVKSQLCKRSFNG PK +F +HKVFD SS T KKHYSVPFKDQ GCYNSIAQPSFCS SPRFS L GS+G
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
         GSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR DRRNCMNEVTELD+KTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRT      FNKPTPNN +  GY+DNPPIP+T P  DNI+NAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L162 Uncharacterized protein (e value: 5.9e-225; % identity: 100)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
        PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

A0A1S3B500 uncharacterized protein LOC103486080 isoform X1 (e value: 2.8e-206; % identity: 92.75)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRA+QPRRMGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKSQLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
         GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

A0A1S3B5M5 protein PHR1-LIKE 1 isoform X2 (e value: 7.1e-186; % identity: 92.35)
Query:  MGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKS
        MGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLSTIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKS
Subjt:  MGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKS

Query:  QLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVN
        QLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVN
Subjt:  QLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVN

Query:  RLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQG
        RLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQG
Subjt:  RLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQG

Query:  KQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
        KQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPSKIS
Subjt:  KQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

A0A5A7TRR1 Protein PHR1-LIKE 1 isoform X2 (e value: 2.5e-191; % identity: 88.09)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRA+QPRRMGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKSQLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---CDRRNCMNEVTELDAK
         GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE R    ++    N  T     
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERR---CDRRNCMNEVTELDAK

Query:  TAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPS
         AMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPS
Subjt:  TAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPS

Query:  KIS
        KIS
Subjt:  KIS

A0A5D3C537 Protein PHR1-LIKE 1 isoform X2 (e value: 7.3e-175; % identity: 82)
Query:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS
        MNAYGIDSKQEIQQNHGLITDYYSQNFRA+QPRRMGAC HLSAMDEVESS+ LNSCPSKP+STIINLFESP SAFFATEQCMGIPPIQFQSGSSSFNSLS
Subjt:  MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLS

Query:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG
        TIFQSS ENFSLDSAEQSG+DSEFSNTLQSVVKSQLCKRSFNGLPK SFVEHKVFDGSS+TIKKHYSVPFKDQIGCYNSIAQPSFCS SPRFSCL GSIG
Subjt:  TIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIG

Query:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM
         GSSSSSF+GNGFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSH                                    
Subjt:  PGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAM

Query:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
               LQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTT+GLFNKPTP+NSNV GY+DN PIPT         NAQFPSKIS
Subjt:  QIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS

SwissProt top hits (columns: e value, % identity, alignment)
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 1 (e value: 4.8e-38; % identity: 47.96)
Query:  SIAQPSFCSTS-PRF----SCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS  + S P F    S   G I P +S    + N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPSFCSTS-PRF----SCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNS
         P+ +E +       +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q +++    +  ++        P+NS
Subjt:  MPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNS

Q0WVU3 Myb family transcription factor PHL5 (e value: 1.8e-53; % identity: 47.27)
Query:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ
        P+Q  S     +S       S+++ SLD +          +     +  + C   F      S      F+ S D     ++ YS      +   +S  Q
Subjt:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ

Query:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA
        P       S+ P FS  GGS+ P              KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES 
Subjt:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA

Query:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +
Subjt:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

Q10LZ1 Protein PHOSPHATE STARVATION RESPONSE 1 (e value: 4.8e-38; % identity: 47.96)
Query:  SIAQPSFCSTS-PRF----SCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY
        S+AQPS  + S P F    S   G I P +S    + N   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A+Y
Subjt:  SIAQPSFCSTS-PRF----SCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKY

Query:  MPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNS
         P+ +E +       +E++ LD K +M + +AL+LQ++VQ+RLH+QLEIQRKLQL+IEEQGK L+ MF++Q +++    +  ++        P+NS
Subjt:  MPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNS

Q8GUN5 Protein PHR1-LIKE 1 (e value: 2.4e-37; % identity: 54.42)
Query:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA
        S  + S +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++E   +    +   + ++  LD KT+
Subjt:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA

Query:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        ++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 1 (e value: 1.1e-37; % identity: 53.89)
Query:  IAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE
        I QP      P  S     + P S++SS S NG T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E
Subjt:  IAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE

Query:  RRCDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
             R    +  +T LD K  + I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  RRCDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G28610.1 phosphate starvation response 1 (e value: 7.6e-39; % identity: 53.89)
Query:  IAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE
        I QP      P  S     + P S++SS S NG T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+Y PE +E
Subjt:  IAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAE

Query:  RRCDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ
             R    +  +T LD K  + I +AL+LQ++VQ++LH+QLEIQR LQL+IEEQGK L+MMF++Q
Subjt:  RRCDRRNC--MNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQ

AT5G06800.1 myb-like HTH transcriptional regulator family protein (e value: 1.3e-54; % identity: 47.27)
Query:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ
        P+Q  S     +S       S+++ SLD +          +     +  + C   F      S      F+ S D     ++ YS      +   +S  Q
Subjt:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ

Query:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA
        P       S+ P FS  GGS+ P              KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES 
Subjt:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA

Query:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR
        E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLEIQR LQL+IEEQGKQLKMM +QQQ+  +   +
Subjt:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQETNKCFFR

AT5G06800.2 myb-like HTH transcriptional regulator family protein (e value: 7.1e-45; % identity: 44.76)
Query:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ
        P+Q  S     +S       S+++ SLD +          +     +  + C   F      S      F+ S D     ++ YS      +   +S  Q
Subjt:  PIQFQSGSSSFNSLSTIFQSSAENFSLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSD---TIKKHYSVPFKDQIGCYNSIAQ

Query:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA
        P       S+ P FS  GGS+ P              KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES 
Subjt:  PSFC----STSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA

Query:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL
        E + ++R C  E+++LD +T +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  ERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein (e value: 1.7e-38; % identity: 54.42)
Query:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA
        S  + S +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++E   +    +   + ++  LD KT+
Subjt:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA

Query:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        ++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein (e value: 1.7e-38; % identity: 54.42)
Query:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA
        S  + S +  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++E   +    +   + ++  LD KT+
Subjt:  SSSSFSGNGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCD----RRNCMNEVTELDAKTA

Query:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
        ++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+
Subjt:  MQIKDALQLQLDVQRRLHDQLEIQRKLQLQIEEQGKQLKMMFDQQQE
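
In the alignments above, the unlabeled middle line carries the match information: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch of how identity could be counted from that line. The helper `midline_stats` is hypothetical, and the sketch assumes the middle line is padded to the full block width (trailing spaces may be lost in plain-text copies). The example string is the final block of the AT5G29000.2 alignment only, so its percentage is not expected to equal the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, conservative, block_length) for one midline.

    Assumption: letters mark identities, '+' marks conservative
    substitutions, and the string spans the full alignment block.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identical, conservative, len(midline)


# Middle line of the last block of the AT5G29000.2 alignment.
midline = "++I  AL+LQ++VQ+RLH+QLEIQR LQLQIE+QG+ L+MMF++QQ+"
identical, conservative, length = midline_stats(midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

Summing the identical counts and block lengths over every block of an alignment, then dividing, would give a figure comparable to the % identity column.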


Sequences
CDS sequence
ATGAACGCTTACGGAATTGATTCCAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTATTACTCTCAAAATTTCAGGGCAGAGCAGCCCAGGAGGATGGGAGC
TTGTGCTCATCTATCCGCCATGGATGAAGTTGAATCATCACAACACCTAAATTCATGTCCGTCTAAACCCAGTTCCACTATCATCAATCTCTTCGAATCACCCGCTTCCG
CTTTTTTCGCAACGGAGCAATGTATGGGGATTCCTCCGATTCAATTCCAGTCTGGTTCTTCGTCTTTCAATTCGCTTTCGACGATTTTTCAGTCCTCCGCCGAGAATTTC
TCTCTTGATTCCGCGGAGCAAAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGGGTTACCGAAGGG
TAGTTTCGTTGAACACAAGGTGTTTGATGGAAGTTCCGATACAATCAAGAAGCATTATTCAGTTCCTTTCAAAGACCAAATAGGTTGTTATAATTCAATTGCACAGCCAA
GTTTTTGTTCGACTTCTCCAAGATTCTCTTGCTTGGGTGGTTCTATTGGGCCAGGAAGTTCTTCATCTTCCTTCAGTGGGAATGGATTCACAACCAAAACGAGAATCAGA
TGGACACAAGATCTCCATGAGAAATTTGTTGATTGTGTTAATCGTCTTGGTGGTGCAGAGAAAGCAACGCCTAAAGCAATATTGAAGTTGATGGATTCTGAAGGATTGAC
CATATTCCATGTGAAGAGTCATTTGCAGAAATATCGGATAGCAAAATACATGCCAGAATCTGCAGAAAGGAGGTGTGATAGAAGGAACTGCATGAATGAAGTTACAGAAC
TCGATGCCAAAACTGCCATGCAAATTAAAGATGCCTTGCAACTGCAGTTAGATGTTCAGAGGCGTCTACATGATCAATTAGAGATACAAAGGAAGCTGCAGTTGCAAATT
GAAGAACAAGGGAAACAACTCAAGATGATGTTTGATCAACAACAAGAAACTAACAAATGCTTCTTCAGAACAACTACAACTGATGGGTTATTCAACAAACCAACTCCTAA
TAACAGTAACGTATTGGGCTATATCGACAATCCTCCGATCCCGACGACAGTGCCTGCAGTCGACAATATCCGAAATGCTCAATTCCCATCCAAGATAAGTTAG
mRNA sequence
AGAAGAACAAAATGTGTTTCGTATTTAAACGTAGATTCATTTCAATTTGTTCTTCATTTTCTGTTTTTCCTCCTTTCTATCGATTCTTAATTCTTCACTTCGAATAATCT
GTCTTCCAATTCCCTGTTATTCATCTTATCTTTCGCATCAAGTCTTCCAAAGAATAATCACAAAAGGAATTCTATGCTATGCGTTGATCGTATTTATGCTCTCTCTCTGT
TTCTCCCCTTGTTTCCTCTCTCTCAATTACATACTCTCTCTTCATCCTCACAAAAACGGAGATTTTGGGGTTGTTTCTTTCCAAATTCTGTAAATGAACGCTTACGGAAT
TGATTCCAAGCAAGAAATTCAACAAAATCATGGACTGATTACTGATTATTACTCTCAAAATTTCAGGGCAGAGCAGCCCAGGAGGATGGGAGCTTGTGCTCATCTATCCG
CCATGGATGAAGTTGAATCATCACAACACCTAAATTCATGTCCGTCTAAACCCAGTTCCACTATCATCAATCTCTTCGAATCACCCGCTTCCGCTTTTTTCGCAACGGAG
CAATGTATGGGGATTCCTCCGATTCAATTCCAGTCTGGTTCTTCGTCTTTCAATTCGCTTTCGACGATTTTTCAGTCCTCCGCCGAGAATTTCTCTCTTGATTCCGCGGA
GCAAAGTGGTGTAGACTCTGAATTCAGTAACACCTTGCAATCGGTTGTGAAATCTCAACTCTGTAAGAGAAGCTTCAATGGGTTACCGAAGGGTAGTTTCGTTGAACACA
AGGTGTTTGATGGAAGTTCCGATACAATCAAGAAGCATTATTCAGTTCCTTTCAAAGACCAAATAGGTTGTTATAATTCAATTGCACAGCCAAGTTTTTGTTCGACTTCT
CCAAGATTCTCTTGCTTGGGTGGTTCTATTGGGCCAGGAAGTTCTTCATCTTCCTTCAGTGGGAATGGATTCACAACCAAAACGAGAATCAGATGGACACAAGATCTCCA
TGAGAAATTTGTTGATTGTGTTAATCGTCTTGGTGGTGCAGAGAAAGCAACGCCTAAAGCAATATTGAAGTTGATGGATTCTGAAGGATTGACCATATTCCATGTGAAGA
GTCATTTGCAGAAATATCGGATAGCAAAATACATGCCAGAATCTGCAGAAAGGAGGTGTGATAGAAGGAACTGCATGAATGAAGTTACAGAACTCGATGCCAAAACTGCC
ATGCAAATTAAAGATGCCTTGCAACTGCAGTTAGATGTTCAGAGGCGTCTACATGATCAATTAGAGATACAAAGGAAGCTGCAGTTGCAAATTGAAGAACAAGGGAAACA
ACTCAAGATGATGTTTGATCAACAACAAGAAACTAACAAATGCTTCTTCAGAACAACTACAACTGATGGGTTATTCAACAAACCAACTCCTAATAACAGTAACGTATTGG
GCTATATCGACAATCCTCCGATCCCGACGACAGTGCCTGCAGTCGACAATATCCGAAATGCTCAATTCCCATCCAAGATAAGTTAGGCCATGAAAAGGCATCCTTTTCTT
CATCATCATTTTACCACAGTACATAACACTCATTGGGGGGAAGCTTTTTTATTGTGAAGATAAATCCAAACAAAGAAAACAAAAAGGGGTAAAGATTACAAAGTGTATTG
TTGAATTGAATATATATACAAACTTTTGAAAAGTTTGAATTGAAGAAACTTGCAATGGGGTCTTGTTATGTTAATAAAATCATGAGTTCTAAGTTGTGAATTAAACTATT
TTACCAGCAG
Protein sequence
MNAYGIDSKQEIQQNHGLITDYYSQNFRAEQPRRMGACAHLSAMDEVESSQHLNSCPSKPSSTIINLFESPASAFFATEQCMGIPPIQFQSGSSSFNSLSTIFQSSAENF
SLDSAEQSGVDSEFSNTLQSVVKSQLCKRSFNGLPKGSFVEHKVFDGSSDTIKKHYSVPFKDQIGCYNSIAQPSFCSTSPRFSCLGGSIGPGSSSSSFSGNGFTTKTRIR
WTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERRCDRRNCMNEVTELDAKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQLQI
EEQGKQLKMMFDQQQETNKCFFRTTTTDGLFNKPTPNNSNVLGYIDNPPIPTTVPAVDNIRNAQFPSKIS
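
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper and the table construction are illustrative and not part of the database. Only the first ten and last four codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGAACGCTTACGGAATTGATTCCAAGCAA"))  # MNAYGIDSKQ
print(translate("AAGATAAGTTAG"))                    # KIS*
```

Passing the full CDS to `translate` and stripping the trailing `*` should reproduce the listed protein sequence if the record is internally consistent.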