CuGenDBv2

IVF0020764 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0020764
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: BZIP domain-containing protein
Genome location: chr10:4100759..4103424
RNA-Seq Expression: IVF0020764
Synteny: IVF0020764
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains: IPR044797 - Uncharacterized protein At4g06598-like
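
The genome location above uses a `seqid:start..end` string. As a rough sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers and not part of CuGenDB, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, hence the +1 (assumption, see docstring).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr10:4100759..4103424'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("chr10:4100759..4103424")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported length is the full gene span, including any UTRs and introns, so it is expected to be larger than the CDS length.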


Homology
GenBank top hits
XP_004135341.1 uncharacterized protein At4g06598 isoform X2 [Cucumis sativus] (e-value: 7.94e-124; identity: 82.91%)
Query:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN
        M+C RRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLP KSP PSGSSTY D LPNPIIGSRAVQNPRVGNVNHHRTSSESL +         + +  
Subjt:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN

Query:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML
          PVQR G+RRSS DSFAYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQA LYMKPSWNKQKNRTRELP TTLTTNPGG PSAKNS+LLES R L
Subjt:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML

Query:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD
        STP EANEFS TTTEKLDSAETV PDRKLSERMD
Subjt:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD

XP_008445999.1 PREDICTED: uncharacterized protein At4g06598 isoform X1 [Cucumis melo] (e-value: 7.29e-127; identity: 84.19%)
Query:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN
        M+CRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLP KSPVPSGSSTY + LPNPI+GSRAVQNPRVGNVNHHRTSSESL +         + +  
Subjt:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN

Query:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML
          PVQR G+RRSS DSFAYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQASLYMKPSWNKQKNRTRELPPTTLTTNPGG PSAKNSILLES R L
Subjt:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML

Query:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD
        ST QEANEFSSTTTEKLDSAET  PDRKLSERMD
Subjt:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD

XP_008446000.1 PREDICTED: uncharacterized protein At4g06598 isoform X2 [Cucumis melo] (e-value: 2.35e-114; identity: 83.41%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF
        MENSKVLSNMRNMISSGKHALLP KSPVPSGSSTY + LPNPI+GSRAVQNPRVGNVNHHRTSSESL +         + +    PVQR G+RRSS DSF
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF

Query:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL
        AYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQASLYMKPSWNKQKNRTRELPPTTLTTNPGG PSAKNSILLES R LST QEANEFSSTTTEKL
Subjt:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL

Query:  DSAETVRPDRKLSERMD
        DSAET  PDRKLSERMD
Subjt:  DSAETVRPDRKLSERMD

XP_031741343.1 uncharacterized protein At4g06598 isoform X1 [Cucumis sativus] (e-value: 1.03e-123; identity: 82.91%)
Query:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN
        M+C RRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLP KSP PSGSSTY D LPNPIIGSRAVQNPRVGNVNHHRTSSESL +         + +  
Subjt:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN

Query:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML
          PVQR G+RRSS DSFAYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQA LYMKPSWNKQKNRTRELP TTLTTNPGG PSAKNS+LLES R L
Subjt:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML

Query:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD
        STP EANEFS TTTEKLDSAETV PDRKLSERMD
Subjt:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD

XP_031741344.1 uncharacterized protein At4g06598 isoform X3 [Cucumis sativus] (e-value: 2.87e-112; identity: 82.49%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF
        MENSKVLSNMRNMISSGKHALLP KSP PSGSSTY D LPNPIIGSRAVQNPRVGNVNHHRTSSESL +         + +    PVQR G+RRSS DSF
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF

Query:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL
        AYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQA LYMKPSWNKQKNRTRELP TTLTTNPGG PSAKNS+LLES R LSTP EANEFS TTTEKL
Subjt:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL

Query:  DSAETVRPDRKLSERMD
        DSAETV PDRKLSERMD
Subjt:  DSAETVRPDRKLSERMD

TrEMBL top hits
A0A0A0KPT7 BZIP domain-containing protein (e-value: 9.1e-98; identity: 82.91%)
Query:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN
        M+C RRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLP KSP PSGSSTY D LPNPIIGSRAVQNPRVGNVNHHRTSSESL +         + +  
Subjt:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN

Query:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML
          PVQR G+RRSS DSFAYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQA LYMKPSWNKQKNRTRELP TTLTTNPGG PSAKNS+LLES R L
Subjt:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML

Query:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD
        STP EANEFS TTTEKLDSAETV PDRKLSERMD
Subjt:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X1 (e-value: 4.4e-100; identity: 84.19%)
Query:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN
        M+CRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLP KSPVPSGSSTY + LPNPI+GSRAVQNPRVGNVNHHRTSSESL +         + +  
Subjt:  MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSAN

Query:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML
          PVQR G+RRSS DSFAYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQASLYMKPSWNKQKNRTRELPPTTLTTNPGG PSAKNSILLES R L
Subjt:  PKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRML

Query:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD
        ST QEANEFSSTTTEKLDSAET  PDRKLSERMD
Subjt:  STPQEANEFSSTTTEKLDSAETVRPDRKLSERMD

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X2 (e-value: 1.8e-90; identity: 83.41%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF
        MENSKVLSNMRNMISSGKHALLP KSPVPSGSSTY + LPNPI+GSRAVQNPRVGNVNHHRTSSESL +         + +    PVQR G+RRSS DSF
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF

Query:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL
        AYLDAGNV NENY QDDSQCKNMYLP WASQDF SHQASLYMKPSWNKQKNRTRELPPTTLTTNPGG PSAKNSILLES R LST QEANEFSSTTTEKL
Subjt:  AYLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKL

Query:  DSAETVRPDRKLSERMD
        DSAET  PDRKLSERMD
Subjt:  DSAETVRPDRKLSERMD

A0A5A7SW47 BZIP domain-containing protein (e-value: 1.0e-85; identity: 82.69%)
Query:  MRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSFAYLDAGNVL
        MRNMISSGKHALLP KSPVPSGSSTY + LPNPI+GSRAVQNPRVGNVNHHRTSSESL +         + +    PVQR G+RRSS DSFAYLDAGNV 
Subjt:  MRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSFAYLDAGNVL

Query:  NENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPD
        NENY QDDSQCKNMYLP WASQDF SHQASLYMKPSWNKQKNRTRELPPTTLTTNPGG PSAKNSILLES R LST QEANEFSSTTTEKLDSAET  PD
Subjt:  NENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPD

Query:  RKLSERMD
        RKLSERMD
Subjt:  RKLSERMD

A0A5A7SYH2 BZIP domain-containing protein (e-value: 2.6e-84; identity: 85.57%)
Query:  MRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLMISSANPKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDD
        MRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLMISSANPK                     ++ NE    DD
Subjt:  MRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLMISSANPKPVQRNGYRRSSRDSFAYLDAGNVLNENYAQDD

Query:  SQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPDRK
        SQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPDRK
Subjt:  SQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPDRK

SwissProt top hits
Q8W3M7 Uncharacterized protein At4g06598 (e-value: 3.9e-13; identity: 33.33%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF
        M +SK   N RN+  +GK ALLP KSP   G +   D +P+ +IGS+AVQ    GN NHHRTSSES  +         + +    PV++ G+RRSS DSF
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF

Query:  AYLDAGNVLNENYA-------QDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFS
        AY+D     + +Y         +++   N       S    S     Y     +KQK R  +  P +      G     +S  LESS   S  +  +  S
Subjt:  AYLDAGNVLNENYA-------QDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFS

Query:  STTTEKLDSAETVRPD
           TEK  SA   + D
Subjt:  STTTEKLDSAETVRPD

Arabidopsis top hits
AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein (e-value: 6.0e-17; identity: 35.59%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSES---------LGLMISSANPKPVQRNGYRRSSRDS
        M +SK   ++RN++  GKHALLP K P PS S++Y + +P  +IGSR  Q       +H RTSSES         L  +++     P ++ G+RRSS DS
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSES---------LGLMISSANPKPVQRNGYRRSSRDS

Query:  FAYLDAGNVLNENYA-QDDSQCKNMYLP-----YWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPS
        +AYLD  N  N +   Q+D   +N  L          ++  +  A+ Y   S+ KQK+R R+    T     G CPS
Subjt:  FAYLDAGNVLNENYA-QDDSQCKNMYLP-----YWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPS

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein (e-value: 6.0e-17; identity: 35.59%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSES---------LGLMISSANPKPVQRNGYRRSSRDS
        M +SK   ++RN++  GKHALLP K P PS S++Y + +P  +IGSR  Q       +H RTSSES         L  +++     P ++ G+RRSS DS
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSES---------LGLMISSANPKPVQRNGYRRSSRDS

Query:  FAYLDAGNVLNENYA-QDDSQCKNMYLP-----YWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPS
        +AYLD  N  N +   Q+D   +N  L          ++  +  A+ Y   S+ KQK+R R+    T     G CPS
Subjt:  FAYLDAGNVLNENYA-QDDSQCKNMYLP-----YWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPS

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2) (e-value: 2.8e-14; identity: 33.33%)
Query:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF
        M +SK   N RN+  +GK ALLP KSP   G +   D +P+ +IGS+AVQ    GN NHHRTSSES  +         + +    PV++ G+RRSS DSF
Subjt:  MENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLM--------ISSANPKPVQRNGYRRSSRDSF

Query:  AYLDAGNVLNENYA-------QDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFS
        AY+D     + +Y         +++   N       S    S     Y     +KQK R  +  P +      G     +S  LESS   S  +  +  S
Subjt:  AYLDAGNVLNENYA-------QDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFS

Query:  STTTEKLDSAETVRPD
           TEK  SA   + D
Subjt:  STTTEKLDSAETVRPD


Sequences
CDS sequence
ATGGAATGCCGGAGGAGAAATTCTAGCGTTGAAACTTGTGAAGTCAATGCCATGGAAAATTCCAAGGTGTTGTCAAACATGAGAAATATGATTTCCTCTGGAAAGCATGC
TCTACTTCCTCTTAAGAGTCCAGTTCCTAGTGGTTCCTCCACATATTTTGATTGTCTCCCTAATCCCATTATTGGGTCCAGAGCGGTGCAGAATCCCAGAGTGGGAAATG
TGAACCATCATAGAACATCATCTGAAAGTCTTGGCTTGATGATCTCCTCAGCGAACCCGAAACCTGTTCAACGAAATGGCTATCGACGTTCATCAAGGGACTCCTTTGCA
TACTTAGATGCAGGAAATGTTTTGAATGAAAATTATGCACAGGATGACTCCCAATGTAAAAACATGTATTTACCTTATTGGGCATCACAAGATTTCTATTCCCATCAAGC
TTCATTATATATGAAACCAAGCTGGAACAAACAGAAGAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCCAGGTGGCTGCCCTTCTGCCAAAAATAGCA
TTCTTCTTGAAAGCTCGAGGATGTTGAGTACACCACAGGAAGCAAATGAGTTTTCTTCAACAACTACTGAAAAGCTGGATTCAGCCGAAACTGTTCGGCCTGATCGAAAG
TTATCAGAGAGAATGGATTAG
mRNA sequence
AGTTGAGTCAGATTGTTTGTTTTTGCTCGATTGTTTATGGGCCGACTCGGGAATGGGCTACATTTTTCTTCGTAACATCCTTTTCTCTCTCCGGTATTCTTGTTTCAGTC
TTTGTCTCTCTCTCTCTCTTTGATTGTAGATGGAATGCCGGAGGAGAAATTCTAGCGTTGAAACTTGTGAAGTCAATGCCATGGAAAATTCCAAGGTGTTGTCAAACATG
AGAAATATGATTTCCTCTGGAAAGCATGCTCTACTTCCTCTTAAGAGTCCAGTTCCTAGTGGTTCCTCCACATATTTTGATTGTCTCCCTAATCCCATTATTGGGTCCAG
AGCGGTGCAGAATCCCAGAGTGGGAAATGTGAACCATCATAGAACATCATCTGAAAGTCTTGGCTTGATGATCTCCTCAGCGAACCCGAAACCTGTTCAACGAAATGGCT
ATCGACGTTCATCAAGGGACTCCTTTGCATACTTAGATGCAGGAAATGTTTTGAATGAAAATTATGCACAGGATGACTCCCAATGTAAAAACATGTATTTACCTTATTGG
GCATCACAAGATTTCTATTCCCATCAAGCTTCATTATATATGAAACCAAGCTGGAACAAACAGAAGAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCC
AGGTGGCTGCCCTTCTGCCAAAAATAGCATTCTTCTTGAAAGCTCGAGGATGTTGAGTACACCACAGGAAGCAAATGAGTTTTCTTCAACAACTACTGAAAAGCTGGATT
CAGCCGAAACTGTTCGGCCTGATCGAAAGTTATCAGAGAGAATGGATTAGTTCACATGTTAAGCCCGGTCCGACTGATACAGATAATAAGTTTTGCATTGCTTGTTATCT
ACATTATTGGCAATTTGCTCAACGATCGCGTGTAAGGAAACTTCAATACATTGCAGAGCTAGAAAGGAACGTACAAGCTTTACAAGCGAATGGTTCTGAAGTTTCTGCCG
AGCTCGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAACAAAGCACCCAAGCAACGGTAATTTTTACGATCATTTTAGAAAGTTTTCTATAGCTATTTTTGT
TAGACCATTTTCCCGAAACAAATTTTGTAACAAAAGAAATATTACCTTAGATGATCATGATTGATTCTCTCCTTTCTTTAAGAAAAAAAATTAATTATTACAATTACTTG
CAAGCAGAGAACACATATAAAGTCTATATATACCATGGTTTGTTTTCATCTTATTCTCTGTTTCCATTCATTTTTTTCTATTTCATTGTTATGTTCCTTCTTCTATCA
Protein sequence
MECRRRNSSVETCEVNAMENSKVLSNMRNMISSGKHALLPLKSPVPSGSSTYFDCLPNPIIGSRAVQNPRVGNVNHHRTSSESLGLMISSANPKPVQRNGYRRSSRDSFA
YLDAGNVLNENYAQDDSQCKNMYLPYWASQDFYSHQASLYMKPSWNKQKNRTRELPPTTLTTNPGGCPSAKNSILLESSRMLSTPQEANEFSSTTTEKLDSAETVRPDRK
LSERMD
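
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. It is exercised only on the first and last few codons of the CDS rather than on the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGAATGCCGGAGGAGA"))
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("GAGAGAATGGATTAG"))
```

Pasting the full wrapped CDS into `translate` works as well, because whitespace is removed before translation; the result can then be compared with the protein sequence shown above.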