CuGenDBv2

HG10003962 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003962
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: BEST Arabidopsis thaliana protein match is: embryo defective 2170.
Genome location: Chr08:12273502..12274278
RNA-Seq Expression: HG10003962
Synteny: HG10003962
Gene Ontology terms: NA
InterPro domains: NA
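
The genome location above uses a `Chr:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and measured as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(loc):
    """Split a string such as 'Chr08:12273502..12274278' into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(loc):
    # Assumption: coordinates are 1-based and inclusive, so both ends are counted.
    _, start, end = parse_location(loc)
    return end - start + 1

location = "Chr08:12273502..12274278"
print(parse_location(location), span_length(location))
```

Under that assumption the span is 777 bp, the same length as the CDS listed under Sequences, which would be consistent with a gene model without introns.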


Homology
GenBank top hits
KAG6578929.1 TOM1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-88; % identity: 84.75)
Query:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG
        T+EDCIIFRGWDSAA  DDDSQSESGVCSPTLW SNSRT+ QFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL+NS+R QDG
Subjt:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG

Query:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG
        DA SLSRDDS+SETSFRRD SKTRSETRALVTRSRSVD+GGFYLKMF PLPFGQ+SAKKK NLRTDSGL+ SSRVSPKPPPVDR+WWRKRS      NEG
Subjt:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG

Query:  SISGGSMTSSGSSNSTSSERSNS
        S+SG      GSSNSTSSERS +
Subjt:  SISGGSMTSSGSSNSTSSERSNS

TYK23785.1 uncharacterized protein E5676_scaffold1607G001120 [Cucumis melo var. makuwa] (e value: 1.9e-114; % identity: 88.37)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE
        M DI TKFP      P PIP    SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQFHR RNRSLSPTSRTQAIARGQQE
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE

Query:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD
        LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG   SLSRDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRTD
Subjt:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD

Query:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        SGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

XP_004144253.1 uncharacterized protein LOC101219576 [Cucumis sativus] (e value: 3.4e-111; % identity: 86.87)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQF-HRPRNRSLSPTSRTQAIARGQQ
        M DI TK P KP            SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQF HR RNRSLSPTSRTQAIARGQQ
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQF-HRPRNRSLSPTSRTQAIARGQQ

Query:  ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRT
        ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGD  SL+RDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRT
Subjt:  ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRT

Query:  DSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        DSGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  DSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

XP_008441584.1 PREDICTED: uncharacterized protein LOC103485667 [Cucumis melo] (e value: 1.9e-114; % identity: 88.37)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE
        M DI TKFP      P PIP    SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQFHR RNRSLSPTSRTQAIARGQQE
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE

Query:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD
        LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG   SLSRDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRTD
Subjt:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD

Query:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        SGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

XP_038885358.1 uncharacterized protein LOC120075766 [Benincasa hispida] (e value: 5.3e-112; % identity: 86.54)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE
        M DIRTKFPRK  H PD      ++YYKSP+ D   EDCIIFRGWDSAAA+DDDSQSESGV SPTLW SNSRTSPQFHRPRNRSLSPTSR QAIARGQQE
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE

Query:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGD--ATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLR
        LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGD  A SL+RDDSSSETSFRRD SKTRSETR LVTRSRSVD+GGFYLKMF PLPFGQVSAKKK+NLR
Subjt:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGD--ATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLR

Query:  TDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        TDSGLSG SRVSPKPPPV++DWWRKRS+VAGGENEGSISGGSM SSGSSNSTSSERSNSR
Subjt:  TDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

TrEMBL top hits
A0A0A0KCW4 Uncharacterized protein (e value: 1.7e-111; % identity: 86.87)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQF-HRPRNRSLSPTSRTQAIARGQQ
        M DI TK P KP            SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQF HR RNRSLSPTSRTQAIARGQQ
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQF-HRPRNRSLSPTSRTQAIARGQQ

Query:  ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRT
        ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGD  SL+RDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRT
Subjt:  ELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRT

Query:  DSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        DSGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  DSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

A0A1S3B3A7 uncharacterized protein LOC103485667 (e value: 9.4e-115; % identity: 88.37)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE
        M DI TKFP      P PIP    SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQFHR RNRSLSPTSRTQAIARGQQE
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE

Query:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD
        LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG   SLSRDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRTD
Subjt:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD

Query:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        SGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

A0A5D3DJL8 Uncharacterized protein (e value: 9.4e-115; % identity: 88.37)
Query:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE
        M DI TKFP      P PIP    SYYKSPEH+  SEDCIIFRGWDSAAA+DDDSQSESGV SPTLWASNSRT+PQFHR RNRSLSPTSRTQAIARGQQE
Subjt:  MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQE

Query:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD
        LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG   SLSRDDSSSETSFRRDPSK R ETRALVTRSRSVD+GGFYLKMFFPLPFGQVSAKKK+NLRTD
Subjt:  LMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTD

Query:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
        SGLSGSSRVSPKPPPVD+DWWRKRSSV+GGEN+GSISGGSMTSSGSSNSTSSERSNSR
Subjt:  SGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR

A0A6J1FJY2 uncharacterized protein LOC111444710 (e value: 6.8e-89; % identity: 84.75)
Query:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG
        T+EDCIIFRGWDSAA  DDDSQSESGVCSPTLW SNSRT+ QFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL+NS+R QDG
Subjt:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG

Query:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG
        DA SLSRDDS+SETSFRRD SKTRSETRALVTRSRSVD+GGFYLKMF PLPFGQ+SAKKK NLRTDSGL+ SSRVSPKPPPVDR+WWRKRS      NEG
Subjt:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG

Query:  SISGGSMTSSGSSNSTSSERSNS
        S+SG      GSSNSTSSERS +
Subjt:  SISGGSMTSSGSSNSTSSERSNS

A0A6J1K1H6 uncharacterized protein LOC111489776 (e value: 3.4e-88; % identity: 84.3)
Query:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG
        T+EDCIIFRGWDSAA  DDDSQ ESGVCSPTLW SNSRT+ QFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL+NS+R QDG
Subjt:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG

Query:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG
        DA SLSRDDS+SETSFRRD SKTRSETRALVTRSRSVD+GGFYLKMF PLPFGQ+SAKKK NLRTDSGL+ SSRVSPKPPPVDR+WWRKRS      NEG
Subjt:  DATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG

Query:  SISGGSMTSSGSSNSTSSERSNS
        S+SG      GSSNSTSSERS +
Subjt:  SISGGSMTSSGSSNSTSSERSNS

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G21390.1 embryo defective 2170 (e value: 1.4e-22; % identity: 40)
Query:  AVDDDSQSESGVCSPTLW-ASNSRTSPQFHRPRNR-SLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSE
        ++ +D  S+SGVCSPTLW  S  ++ P FHRP +  SLSP S+ QAIARGQ+ELMEMV  MPES YELSLKDLVE  +     ++  D      +  S  
Subjt:  AVDDDSQSESGVCSPTLW-ASNSRTSPQFHRPRNR-SLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSE

Query:  TSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQV--SAKKKSNLRTDSGLSGSSRVSPKPPPV---DRDWWRKRSSVAGGENEGSISGGSMT
                KT+S+ R    RS   +N GF LK+ F +  G +  + KKK   + D     S R S     V   D++WW +            +S  S  
Subjt:  TSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQV--SAKKKSNLRTDSGLSGSSRVSPKPPPV---DRDWWRKRSSVAGGENEGSISGGSMT

Query:  SSGSSNSTSSERSNS
         SGSS+S +S RS S
Subjt:  SSGSSNSTSSERSNS

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1) (e value: 1.0e-20; % identity: 38.22)
Query:  DSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRD
        D  S+SGVCSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPES YELSLKDLVE    N++ ++  D      +    E   R+ 
Subjt:  DSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRD

Query:  PSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRV-------SPKP-------PPVDRDWW-------RKRSSVAGGEN
          KT+S+      R+  V+N GF LK+ FP+  G   AKKK+N + D+    SS         SP+P          D+DWW       R+  SV    N
Subjt:  PSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRV-------SPKP-------PPVDRDWW-------RKRSSVAGGEN

Query:  EGSISGGSMTSSGSSNSTSSERSNS
            SG S +S GSS+ ++S+RS +
Subjt:  EGSISGGSMTSSGSSNSTSSERSNS

AT1G76980.2 FUNCTIONS IN: molecular_function unknown (e value: 6.0e-21; % identity: 38.6)
Query:  DSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRD
        D  S+SGVCSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPES YELSLKDLVE    N++ ++  D      +    E   R+ 
Subjt:  DSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRD

Query:  PSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRV-------SPKP-------PPVDRDWW-------RKRSSVAGGEN
          KT+S+      R+  V+N GF LK+ FP+  G   AKKK+N + D+    SS         SP+P          D+DWW       R+  SV    N
Subjt:  PSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRV-------SPKP-------PPVDRDWW-------RKRSSVAGGEN

Query:  EGS--ISGGSMTSSGSSNSTSSERSNSR
         GS   SGGS + S S  S +S R  +R
Subjt:  EGS--ISGGSMTSSGSSNSTSSERSNSR
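
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one block. It assumes the midline is passed exactly as wide as the aligned query segment, and `midline_stats` is a hypothetical helper, not the method the database itself uses. The % identity reported for each hit covers the whole alignment, so all blocks of a hit would have to be concatenated to approximate it.

```python
def midline_stats(midline):
    """Return (identities, positives, aligned_length) for a BLAST-style midline."""
    # Letters are identical positions; '+' marks a conservative substitution.
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Last block of the first GenBank hit (KAG6578929.1), midline copied from above.
midline = "S+SG      GSSNSTSSERS +"
ident, pos, length = midline_stats(midline)
print(f"{ident}/{length} identical, {pos}/{length} positive "
      f"({100 * ident / length:.2f}% identity in this block)")
```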


Sequences
CDS sequence
ATGGACGATATTAGGACGAAATTCCCAAGAAAACCTACCCATATCCCAGATCCTATCCCACGAACAGCCACTTCCTATTATAAATCTCCAGAACATGACTCCACATCCGA
AGATTGCATCATTTTCAGAGGCTGGGACAGTGCTGCTGCCGTCGACGATGACTCTCAATCCGAATCCGGGGTTTGTTCACCCACGCTTTGGGCTTCCAATTCTCGAACCA
GCCCCCAATTTCACCGCCCCCGTAATCGCAGCCTCTCCCCAACTTCCCGAACCCAAGCCATAGCCAGAGGCCAACAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAA
TCATCTTACGAGCTTTCTCTCAAAGATCTCGTCGAACATCACTTGACTAATTCGAAACGCCAACAAGACGGTGACGCTACTTCCCTTTCAAGAGACGATTCCAGCTCTGA
AACTTCCTTCCGAAGAGACCCTAGCAAGACCCGGAGTGAAACTAGGGCACTCGTTACCAGAAGTAGAAGCGTCGATAACGGTGGATTTTACCTCAAAATGTTCTTCCCAC
TGCCTTTTGGGCAGGTTTCGGCTAAAAAGAAGAGTAATCTCAGAACCGATTCGGGGTTGAGTGGTAGTTCGAGAGTGTCTCCTAAGCCACCGCCAGTGGACAGAGACTGG
TGGAGGAAGAGATCGTCGGTAGCCGGCGGTGAGAACGAGGGTAGTATCTCCGGCGGAAGCATGACGAGTAGCGGCAGTAGTAATAGCACTAGCAGCGAAAGAAGCAATAG
CAGGTAA
mRNA sequence
ATGGACGATATTAGGACGAAATTCCCAAGAAAACCTACCCATATCCCAGATCCTATCCCACGAACAGCCACTTCCTATTATAAATCTCCAGAACATGACTCCACATCCGA
AGATTGCATCATTTTCAGAGGCTGGGACAGTGCTGCTGCCGTCGACGATGACTCTCAATCCGAATCCGGGGTTTGTTCACCCACGCTTTGGGCTTCCAATTCTCGAACCA
GCCCCCAATTTCACCGCCCCCGTAATCGCAGCCTCTCCCCAACTTCCCGAACCCAAGCCATAGCCAGAGGCCAACAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAA
TCATCTTACGAGCTTTCTCTCAAAGATCTCGTCGAACATCACTTGACTAATTCGAAACGCCAACAAGACGGTGACGCTACTTCCCTTTCAAGAGACGATTCCAGCTCTGA
AACTTCCTTCCGAAGAGACCCTAGCAAGACCCGGAGTGAAACTAGGGCACTCGTTACCAGAAGTAGAAGCGTCGATAACGGTGGATTTTACCTCAAAATGTTCTTCCCAC
TGCCTTTTGGGCAGGTTTCGGCTAAAAAGAAGAGTAATCTCAGAACCGATTCGGGGTTGAGTGGTAGTTCGAGAGTGTCTCCTAAGCCACCGCCAGTGGACAGAGACTGG
TGGAGGAAGAGATCGTCGGTAGCCGGCGGTGAGAACGAGGGTAGTATCTCCGGCGGAAGCATGACGAGTAGCGGCAGTAGTAATAGCACTAGCAGCGAAAGAAGCAATAG
CAGGTAA
Protein sequence
MDDIRTKFPRKPTHIPDPIPRTATSYYKSPEHDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWASNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPE
SSYELSLKDLVEHHLTNSKRQQDGDATSLSRDDSSSETSFRRDPSKTRSETRALVTRSRSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDW
WRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSR
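
The protein sequence corresponds to the CDS read codon by codon. As a rough sketch, assuming the standard genetic code applies, the snippet below translates the first 30 and last 15 nucleotides of the CDS listed above so the result can be compared with the two ends of the protein sequence. `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGACGATATTAGGACGAAATTCCCAAGA"  # first 30 nt of the CDS above
cds_end = "AGCAATAGCAGGTAA"                   # last 15 nt, including the stop codon
print(translate(cds_start))  # MDDIRTKFPR
print(translate(cds_end))    # SNSR
```

The 777-nt CDS holds 259 codons, which matches the 258 residues of the protein plus the terminal TAA stop codon.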