; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0116781 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0116781
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionBZIP transcription factor family protein
Genome locationCMiso1.1chr04:34196291..34200673
RNA-Seq ExpressionCmc04g0116781
SyntenyCmc04g0116781
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043898.1 bZIP transcription factor family protein [Cucumis melo var. makuwa]2.0e-13896.17Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQC+
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR

TYK25238.1 bZIP transcription factor family protein [Cucumis melo var. makuwa]2.0e-13896.17Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQC+
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR

XP_008442819.2 PREDICTED: uncharacterized protein LOC103486593 isoform X1 [Cucumis melo]2.1e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

XP_008442820.2 PREDICTED: uncharacterized protein LOC103486593 isoform X2 [Cucumis melo]2.1e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

XP_008442821.2 PREDICTED: uncharacterized protein LOC103486593 isoform X3 [Cucumis melo]2.1e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

TrEMBL top hitse value%identityAlignment
A0A1S3B649 uncharacterized protein LOC103486593 isoform X31.0e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X11.0e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X21.0e-14096.59Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQCRMQS
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS

A0A5A7TM13 BZIP transcription factor family protein9.5e-13996.17Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQC+
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR

A0A5D3DNR8 BZIP transcription factor family protein9.5e-13996.17Show/hide
Query:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA
        MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNV+VVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFC+LPPCSWLLPHHDFRNQQSPQI FPA
Subjt:  MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPA

Query:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
        GNDQEDIYSKSQ+SAITSKVVHAESRH    SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW
Subjt:  GNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNW

Query:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR
        NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRK+LTKVKNLYARQC+
Subjt:  NEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein1.6e-0826.12Show/hide
Query:  SNCPLFLFSRFPY---FWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSG-SSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRN------QQSPQ
        S+ P + +++ PY    WP V QS+       N ++ P  +  P +  AS    ++Q  EN  +  G +  F ++ PC W LP  D  N      Q + +
Subjt:  SNCPLFLFSRFPY---FWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSG-SSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRN------QQSPQ

Query:  ILFPAGNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA
          F  G+  +D      +SA    V      H      EE++ +P+      L+ES+         +    +GF    +A   K                
Subjt:  ILFPAGNDQEDIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA

Query:  KLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQ
                 H   S T + +       H    +P KK   ++AA EAR+RRK+LT++KNL+ RQCRMQ
Subjt:  KLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCTTTACCCTCCAACTGCCCTCTTTTCTTGTTTAGTCGCTTTCCATATTTCTGGCCATCTGTGGTTCAATCTACAAGTTCCTATCATGAACTACCCAAT
GTCGTCGTCGTTCCGTCAAGTATTAATCTGCCTGCTAACAATAATGCTTCTGTGTCTGGCTCTTCCCAGACACAAGAAAACTTTACGAATGTCACCGGCTCCAGA
GCACCATTTTGCATATTACCACCTTGTTCTTGGTTGTTACCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAATCTTGTTTCCCGCTGGAAATGATCAAGAG
GATATTTATTCGAAATCCCAAAATAGTGCTATTACTTCAAAGGTTGTCCATGCAGAAAGCAGACATTCTTCTTTGCCTTCAGCCGAAGAAGAAAACGATGCTCCT
GACTTGAATGAAGCTCCCAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAGAACACAGTTGGAGTAGCTGTGGAGGGATTCGATACCAATGCAAGAGCT
CCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTATTGAACCCAGTTCCGCTGCCAAACTAGATAACTGGAATGAAGATGATCATGGTGTGTCATCAAGA
ACGTGTGATGACTTGTGTTATTTTGCAGAAAGAAGGCATGAACCGGAGATAGTCCCTTGTAAGAAATCCATAGATGCAATGGCCGCAACTGAGGCCAGGAGGAGG
AGAAAAAAACTGACAAAGGTAAAGAACCTTTACGCCCGTCAATGCCGTATGCAATCCTGA
mRNA sequenceShow/hide mRNA sequence
GGATTGAGTGGTATTGGAAATTGAGTGGGGGTGTTCTTAACTGAGCCATCGGTAATTGAACGAGGAGGAATGACTCGTTTGGTTATGTTTTTTTTCTAAATTTGG
ATGGCTTCTAAGGCAATACCATGGAATGCTGAGCTCTTTGGTTTACATTCACTATGTAATATTCAAATGTGTTTGCCTGGCGGTTTGTTAAGTTCAAATTTCTGT
CCTTTGGTTTTAGATTCCAAGGACTAGGATTCGTATTGGTGATGCTTATCCCAGAAGTGTGGAGTGGAATTAAGAATGTGGAGATTTTTTTGTCTTGGATTCTAA
AACAAGTAGTGGGCCATTGCTATATTAAGTGAAAAACATTTGCTAACTTGTGAAAGCATATTAAAAGTGAGTTAGGTTTGGAAAAGAAATTTCCTCCTACAGTTG
TGGAAACATTAACCCCTTTTTTATCTGATCAACGCAAGCCTACCTTGATTAATCTAATTGGAAAACTCCTCTGTTTCTACAATATTTCGGTATATTTGGTTGTTA
AGAAAACTTGGTGGATATTAGACCCTTTATGTTTTTTCTAAAAAACATTAGTATACTCTTCAAATTCTCGGTGGCTGGACTAGTAAAGGGGACCTCTTGCCTTGT
TTTTTTTTTAACTTCTCCATTTGTTTTTCCCAATTAGGTCCCAGATTCTATGAAAACTATCCATGTCTTACTACAATCCCGCTTGTTCCAAATTACATTATGGTT
TCTTTTCTTAAGACTTTGGCTTGTGTCAACCTGCTAAACGAGTAAAGTTGAATTACTATGCTGCCCATGCACACTGTTTGATATTCAATGAGTTTTCTTGACAAC
CAAGTATGGTAGAGTCAAAACAAATGTCTCATGAGAATTGGCATGGTGTACACAGTATTACTATGCTGTTCATTTGTTTTACCAAATCAGGTTCTGGTTTATGTT
ATTATGACTTATGTCAAACTGCTAAACAAGTAAAGTTGGTATTACTATGCTGCCCATGCGTGGTGTTTGATACTCTACAAGATTTCTTTGCAACAAAATATAGTA
AGTCAAACAATTGTCTTGTGAGAACTGACGTTTGAACACTCGTTGATATTAAAAGAAACAAGAATTACACCACTTCTGCGGGCCTCGACAAACTACTTTTATTTG
TAGCAGCAGGATAGAGGGGTGGTAAAACATCAACCATCAGAGAAAGAGTGTACTAATCAGTCCCAGCATGAGCCGGAAACAACCGGAGAGGTGACAAAGATGGAC
AAGGAGGCTGAATCATCTAAAGTGAGTCCTGCAAGCACTACGAGCTACCAATTGTTTGGCTGCAGAAGGTCAAGGCGTAATCTAACTGAGGTTATGATTGCTAAC
ATCTTCCTACACAGTTTTACTTGCTTCATTTCAAGCTTTTCATAAGTGACTTTCTTCTTCAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTAGCGAACA
GAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGTGCAATGTAACTGGTGTCGGCTTCTTCACAGTCCAACTTTAATCCTCAGTAACAAGTTCAAGGGAGATTA
ACATCCTTACTTGGTCTAGGGAAAGAAAATACCAGGTGTTTTTATTCTTCGACACCATCAAGAAAGGGTTAACAATGCTCTATGCGAGGAATTAACCAGAAAGGC
TGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGGT
AGGATGGGGTTGCATTTAAGAAGTGTGTGTTTGGTCATGGTCTTGATATCTCCCAACAAATTCACTTGCTCGCTCGTTTGTGTCTAGTTGGCTGAAGCAGTAAAG
CCGAAGGTGGAGGAGATCCCAGGAAACCATATATCATCTCATGTTCAGATGCCTCCTTTACCCTCCAACTGCCCTCTTTTCTTGTTTAGTCGCTTTCCATATTTC
TGGCCATCTGTGGTTCAATCTACAAGTTCCTATCATGAACTACCCAATGTCGTCGTCGTTCCGTCAAGTATTAATCTGCCTGCTAACAATAATGCTTCTGTGTCT
GGCTCTTCCCAGACACAAGAAAACTTTACGAATGTCACCGGCTCCAGAGCACCATTTTGCATATTACCACCTTGTTCTTGGTTGTTACCTCATCATGATTTTAGG
AACCAACAGAGTCCTCAAATCTTGTTTCCCGCTGGAAATGATCAAGAGGATATTTATTCGAAATCCCAAAATAGTGCTATTACTTCAAAGGTTGTCCATGCAGAA
AGCAGACATTCTTCTTTGCCTTCAGCCGAAGAAGAAAACGATGCTCCTGACTTGAATGAAGCTCCCAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAG
AACACAGTTGGAGTAGCTGTGGAGGGATTCGATACCAATGCAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTATTGAACCCAGTTCCGCT
GCCAAACTAGATAACTGGAATGAAGATGATCATGGTGTGTCATCAAGAACGTGTGATGACTTGTGTTATTTTGCAGAAAGAAGGCATGAACCGGAGATAGTCCCT
TGTAAGAAATCCATAGATGCAATGGCCGCAACTGAGGCCAGGAGGAGGAGAAAAAAACTGACAAAGGTAAAGAACCTTTACGCCCGTCAATGCCGTATGCAATCC
TGATCCACATGGCCAGTTAGTTTGGCAACAGTGTTTGTTGTCGACACAGCAATCTTATGTGTTAAAGTTCTGTGTTGCATTGGCTTTTGTTGCCAGAGGCAGGCC
ACAGAGCTATGAGACGTAGCCAGAGTTCTGACTTAACTTCAGCTTTCTTGTCATGGCGTACGTTGTGCATTCCAGAGCTCAGAGCACATAAAGATCTTGGTGCTG
GAGATAGGTAGTGACGGGAAATAAGATTTAGGCACAATTTGCTAACAAATTAAATGAGAGAGGTACTAGGACTAAAATGCCGATTAGTTCAAAGAGCTTTGAATT
TGATTTGGTAGGGCAGCAAAGAGGAGGAGGTGAGTGTAAGAATTTTGTATTTAAATCTTTCTATGGTTACTTCGAAGACCCTTGTTATTGGGTTGGCTTACACTT
CTTCAATGAAGTGAAGCATAAGAATGGAGTGCATGGAGATCTGTTAGTTACTTTATCAAGAAGCAAGGAATTAACAGTTAGAAAAAATGTATTTCACTCCTTTTC
ATCAAGATGCAAACAAAATAGTCTACACTCACTTTTCTTAAACTTAACCAGAATTATAGAATTTCTTCCAAGGTTTTTAGTTTTGTACAAATTTGTATTCTATTA
TTGACTTGTAACATGAATTACATGTTTCTACAATGAGC
Protein sequenceShow/hide protein sequence
MPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVVVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCILPPCSWLLPHHDFRNQQSPQILFPAGNDQE
DIYSKSQNSAITSKVVHAESRHSSLPSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSR
TCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKKLTKVKNLYARQCRMQS