; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003362 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003362
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionC2H2-type domain-containing protein
Genome locationChr08:50040..57174
RNA-Seq ExpressionHG10003362
SyntenyHG10003362
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637537.1 hypothetical protein CSA_017393 [Cucumis sativus]3.6e-9579.83Show/hide
Query:  MPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP
        MPQVD RTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP
Subjt:  MPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP

Query:  FQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRC-------------------RSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHP
        FQGTCARSAAEDASPDYNS+V+DARFSSWALPGSLAVTRGIL R                       +GG V+  TR+RGWINHRSVATT  EDSNLSHP
Subjt:  FQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRC-------------------RSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHP

Query:  RDGAHGRPACAPARATSPHGGLRGGAAMRDAQA
         DGAHGRP CA AR TSPHGG RG     D  A
Subjt:  RDGAHGRPACAPARATSPHGGLRGGAAMRDAQA

KAG6384451.1 hypothetical protein SASPL_155737 [Salvia splendens]3.9e-8963.46Show/hide
Query:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA
        SPFHIR GHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP+FSLG+NLPPDWGCIPKQPDS TAPR ATGS  +GALTLSGAPFQGT ARSAA
Subjt:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA

Query:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGA
        EDASPDYNSD   ARFSSWA PGSLAVT+GIL          V  AT  R  ++    + A  G     L   P  G   R      R   P G    GA
Subjt:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGA

Query:  AMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNP
         MRDAQADVPSA+ LRAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCRES ++     APPT    G AP             + SR   + V P
Subjt:  AMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNP

Query:  YYVWTCAGGTTR
         + WT   G  R
Subjt:  YYVWTCAGGTTR

KOM53365.1 hypothetical protein LR48_Vigan09g202400 [Vigna angularis]5.6e-8055Show/hide
Query:  RLNPHRSMPQVDRRTGSS----PFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSE
        RL P      V+  + S+    PFHIR  HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF IGLSP+FSLG+NLPPDW CIPKQPDS   PRG TGS 
Subjt:  RLNPHRSMPQVDRRTGSS----PFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSE

Query:  RNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALP-------GSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPR
         NGA TLSGAPFQGT ARSA EDASPDYNS+ +  RFS  A P          + TR   G C S Q     FAT +   I+ R  A     +S     R
Subjt:  RNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALP-------GSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPR

Query:  DGAHGRPACAPARATSPHGGLRGGAAM-RDAQADVPSARRLRAQLAFKDSVVRGILQFTP-SIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIES
        D + GR    P+ +T     + G   +  D  A +P+   LR  L   D V     Q+T   +         +S       + QIAPPTKNGHAPPPIES
Subjt:  DGAHGRPACAPARATSPHGGLRGGAAM-RDAQADVPSARRLRAQLAFKDSVVRGILQFTP-SIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIES

Query:  RKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQGPTTSFLTATT
        RK SQSVNPYYVWTC GGTTRPVKARSASPAEGTSRPV T  G IDPTQ      L ATT
Subjt:  RKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQGPTTSFLTATT

OIV89726.1 hypothetical protein TanjilG_03599 [Lupinus angustifolius]1.3e-7952.23Show/hide
Query:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA
        SPFHIR  HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP+FSLG+NLPPDWGCIPKQPDS TAPRGATGS  +GALTLSGAPFQGT ARSAA
Subjt:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA

Query:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPR---------------DGAHGRPACAPAR
        EDASPDYNSD E  RF SW                +S + GG     R R      S+      DS  + PR               DG+         R
Subjt:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPR---------------DGAHGRPACAPAR

Query:  AT------------------------SPH-----------------GGLRGGAAM---RDAQADVPSARRLRAQLAFKDSVVRGILQFTP-SIAFRYVLH
         +                         PH                 G  R G  +    D     P+   LR  L   D V     Q+T  ++A      
Subjt:  AT------------------------SPH-----------------GGLRGGAAM---RDAQADVPSARRLRAQLAFKDSVVRGILQFTP-SIAFRYVLH

Query:  RCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTC--AGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQ
          +S       + QIAPPTKNGHAPPPIESRKSSQSVNPYYVWTC  AGGTTRPVKARSASPAEGTSRPV T GGPIDPTQ
Subjt:  RCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTC--AGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQ

OIV89939.1 hypothetical protein TanjilG_27267 [Lupinus angustifolius]5.5e-8360.37Show/hide
Query:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA
        SPFHIR  HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP+FSLG+NLPPDWGCIPKQPDS TAPRGATGS  +GALTLSGAPFQGT ARSAA
Subjt:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA

Query:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILG----RCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPRDGAHGRPACAPARATSPHGGLRGG
        EDASPDYNSD E  RFS WA P  +  T          C  P G G    ++ R +        TG  +S+  H             +    P   L   
Subjt:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILG----RCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHPRDGAHGRPACAPARATSPHGGLRGG

Query:  AAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKAR
          +  +Q   P   RLRA         RG L       F + L R   R        QIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKAR
Subjt:  AAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKAR

Query:  SASPAEGTSRPVLTIGGPIDPTQ
        SASP EG SRPV T GGPIDPTQ
Subjt:  SASPAEGTSRPVLTIGGPIDPTQ

TrEMBL top hitse value%identityAlignment
A0A0A0KEK9 Uncharacterized protein6.7e-9579.83Show/hide
Query:  MPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP
        MPQVD RTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP
Subjt:  MPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAP

Query:  FQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRC-------------------RSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHP
        FQGTCARSAAEDASPDYNS+V+DARFSSWALPGSLAVTRGIL R                       +G  VK  TR+RGWINHRSVATT  EDSNLSHP
Subjt:  FQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRC-------------------RSPQGGGVKFATRYRGWINHRSVATTGAEDSNLSHP

Query:  RDGAHGRPACAPARATSPHGGLRGGAAMRDAQA
         DGAHGRP CA AR TSPHGG RG     D  A
Subjt:  RDGAHGRPACAPARATSPHGGLRGGAAMRDAQA

A0A6N2KB50 Uncharacterized protein (Fragment)4.2e-8955.09Show/hide
Query:  GGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCK--GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF
        G C  T  +G         H   A      A+   G +   R  + H      DR    SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF
Subjt:  GGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCK--GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF

Query:  AIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR-----
        AIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL R       
Subjt:  AIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR-----

Query:  ---SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGIL
             Q  GV+    AT  R  ++    + A  G     L   P  G   R      R   P G    GA MRD QADVPS RR RAQLAFKDS+V GIL
Subjt:  ---SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGIL

Query:  QFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTRP
        QFTPSIAFRYVLHRCESRDIRCRES ++     APPT    G AP             + SR   + V P + WT   G   P
Subjt:  QFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTRP

A0A6N2L797 Uncharacterized protein (Fragment)3.0e-8754.91Show/hide
Query:  GGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCK--GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF
        G C  T  +G         H   A      A+   G +   R  + H      DR    SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF
Subjt:  GGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCK--GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLF

Query:  AIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR-----
        AIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL R       
Subjt:  AIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR-----

Query:  ---SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGIL
             Q  GV+    AT  R  ++    + A  G     L   P  G   R      R   P G    GA MRD QADVPS RR RAQLAFKDS+V GIL
Subjt:  ---SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGIL

Query:  QFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGT
        QFTPSIAFRYVLHRCESRDIRCRES +++   K G  P P   R   ++ +  + W      +R V+ R+ S A+ +
Subjt:  QFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGT

A0A6N2MPB6 Uncharacterized protein1.5e-8666.78Show/hide
Query:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA
        SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAA
Subjt:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA

Query:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR--------SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARA
        EDASPDYNS+   ARFSSWA PGSLAVTRGIL R            Q  GV+    AT  R  ++    + A  G     L   P  G   R      R 
Subjt:  EDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCR--------SPQGGGVK---FATRYRGWINH--RSVATTGAEDSNLS-HPRDGAHGRPACAPARA

Query:  TSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAP
          P G    GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCRES ++     APPT    G AP
Subjt:  TSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAP

A0A6N2MTU4 Uncharacterized protein8.8e-8766.67Show/hide
Query:  GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERN
        GR  +PHRS P+ DRRTG        G  + PHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +
Subjt:  GRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERN

Query:  GALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLS-HPRDGAHGRPA
        GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL     P+  G +   R  G     + A  G     L   P  G   R  
Subjt:  GALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATTGAEDSNLS-HPRDGAHGRPA

Query:  CAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIA
            R   P G    GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCRES +++
Subjt:  CAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGC
ACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTCGAAATACTTCAGGGC
ACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAA
CGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGAT
CACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCCA
CCCATGCAGTCTACAAAAAAAATTTCAAAACAATCCAAGCATTGGAAGATAGATTTCACGTTTCATGGGTGTTGGGTGTCACGACACACAGGAGTTGGGTGTCACGACAC
ATAGGAGGTGGGTGTCACGACACACAGGAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGA
GGGGTGCAAGGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGGCACATCGCCGGCCCCCATC
CGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCTCTCGCCCATATTTAGCCTT
GGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCTGG
CGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGC
TCGCCGTTACTAGGGGAATCCTTGGTCGGTGTAGGAGTCCACAAGGAGGCGGCGTCAAATTCGCGACGCGGTACCGAGGTTGGATCAACCACCGTAGTGTCGCGACGACG
GGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGGGGGGGCAGC
GATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAAGTATCGCAT
TTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATAGAATCAAGA
AAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGACCCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAGCCGACCGGT
GCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAACTGCAACAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGC
ACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTCGAAATACTTCAGGGC
ACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAA
CGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGAT
CACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCCA
CCCATGCAGTCTACAAAAAAAATTTCAAAACAATCCAAGCATTGGAAGATAGATTTCACGTTTCATGGGTGTTGGGTGTCACGACACACAGGAGTTGGGTGTCACGACAC
ATAGGAGGTGGGTGTCACGACACACAGGAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGA
GGGGTGCAAGGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGGCACATCGCCGGCCCCCATC
CGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCTCTCGCCCATATTTAGCCTT
GGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCTGG
CGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGC
TCGCCGTTACTAGGGGAATCCTTGGTCGGTGTAGGAGTCCACAAGGAGGCGGCGTCAAATTCGCGACGCGGTACCGAGGTTGGATCAACCACCGTAGTGTCGCGACGACG
GGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGGGGGGGCAGC
GATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAAGTATCGCAT
TTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATAGAATCAAGA
AAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGACCCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAGCCGACCGGT
GCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAACTGCAACAACTTAA
Protein sequenceShow/hide protein sequence
MPPNAPHPRERAPTTRPATSLGKAEAERDAGFGVRRAPNEGCRQKRTKETRRFHHVRYRCKNRSIAAARNTSGHVNEEELTPTVRCPSTEPTDPNYRITTHAPYAFEPGQ
RSTISTEPTEKQGCRVVGRGRARNAVRGLSFPFPFSITLVLFPMVDPKHMPAHLHSIVGGHGGHASWAWGAPPTHAVYKKNFKTIQALEDRFHVSWVLGVTTHRSWVSRH
IGGGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCKGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSL
GQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILGRCRSPQGGGVKFATRYRGWINHRSVATT
GAEDSNLSHPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPIESR
KSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQGPTTSFLTATT