CuGenDBv2

Moc06g32120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g32120
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Nudix hydrolase domain-containing protein
Genome location: chr6:24180103..24183138
RNA-Seq Expression: Moc06g32120
Synteny: Moc06g32120
Gene Ontology terms: NA
InterPro domains: IPR015797 - NUDIX hydrolase-like domain superfamily
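
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `chromosome:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the record itself does not state this). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr6:24180103..24183138")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 3036 bp.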


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138298.1 uncharacterized protein LOC111009508 [Momordica charantia] | e value: 1.1e-163 | % identity: 99.65
Query:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
Subjt:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKW+
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
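
The % identity value of a hit can be approximated from the match line printed between each Query and Subjt line. The sketch below assumes the usual BLAST-style convention: a letter on the match line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. It also assumes the 8-character label width (`Query:  `) seen in this record. The function names are hypothetical.

```python
from typing import Iterable, Tuple

def identity_from_block(query_line: str, match_line: str, prefix: int = 8) -> Tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block."""
    query = query_line[prefix:].rstrip()
    # Trailing spaces on the match line may have been stripped, so pad it back.
    match = match_line[prefix:prefix + len(query)].ljust(len(query))
    identical = sum(1 for q, m in zip(query, match) if m.isalpha() and m == q)
    return identical, len(query)

def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Combine all (query_line, match_line) blocks of one hit into a percentage."""
    identical = total = 0
    for query_line, match_line in blocks:
        same, columns = identity_from_block(query_line, match_line)
        identical += same
        total += columns
    return 100.0 * identical / total

# Toy block (not taken from this record): 5 identical columns out of 8.
toy_query = "Query:  MKTAY-LV"
toy_match = " " * 8 + "MK +Y LV"
print(percent_identity([(toy_query, toy_match)]))
```

Gap columns in the query count toward the alignment length here, which is how BLAST reports identity; a different denominator would give a different percentage.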

XP_022922297.1 uncharacterized protein LOC111430319 [Cucurbita moschata] | e value: 4.8e-122 | % identity: 80.69
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKW+
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

XP_022974352.1 uncharacterized protein LOC111472979 [Cucurbita maxima] | e value: 9.1e-121 | % identity: 80
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFK PLL  Q NP RRFLK PSMS SH      PH   RL  H FASPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADS PPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEE+ IA + AVSVKKHFWKW+
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

XP_023550799.1 uncharacterized protein LOC111808831 [Cucurbita pepo subsp. pepo] | e value: 6.3e-122 | % identity: 80.69
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKW+
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

XP_038874472.1 uncharacterized protein LOC120067121 isoform X1 [Benincasa hispida] | e value: 3.1e-121 | % identity: 81.18
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF--SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLS-HPHPKTRLDNHHFASPQSLSDWLGPR
        MPS P P+PPP P SNL HLNKST LPDF+LAALSLFVFF  SSSSKSFKFP    Q NP RRFLKIPS S+     P ++  +  F SPQSLS+WL PR
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF--SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLS-HPHPKTRLDNHHFASPQSLSDWLGPR

Query:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
        LPSDSFASWGV PGTKNVHNLWLEIS+GETSLADSNPPIRT+ V+SLRILD H+RVLVESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
Subjt:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS

Query:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWIV
        IIGD DC +IVRIVP+SYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLP+ EFCTVEEEEY  SEET IA + AVSVKKHFWKWIV
Subjt:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWIV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJQ6 Uncharacterized protein | e value: 1.0e-109 | % identity: 74.48
Query:  PPPPLPP--PHPISNLTHLNKS-TPLPDFWLAALSLFVFF---SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPR
        PPPP+PP  P PISNLTHLNKS   LPDF+LAALSLF F    SSSSKSFKFP    Q NP RRF KIPS+S+  P+  ++  +  F SPQSLS+WL PR
Subjt:  PPPPLPP--PHPISNLTHLNKS-TPLPDFWLAALSLFVFF---SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPR

Query:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
        LPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIRT+ V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGS
Subjt:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS

Query:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        I+GD D  ++VRIVP+SY++KIEER+SVSYPGL A YVLHSMDVWVEGLPD +FCTVEEEEY  SE+T IA   AVSVKKHFWKW+
Subjt:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

A0A1S3AUP2 uncharacterized protein LOC103483001 | e value: 3.7e-112 | % identity: 75.61
Query:  PSPPPPLPP--PHPISNLTHLNKST-PLPDFWLAALSLFVFFSSS--SKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        P PPPP+PP  P PISNLTHLNKST  LPDF+LAALSLF FFSSS  SKSFKFP    Q NP RRFLKIPS S+  P+  ++  +  F SPQSLS+WL P
Subjt:  PSPPPPLPP--PHPISNLTHLNKST-PLPDFWLAALSLFVFFSSS--SKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIR + V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        SI+ D DC  +VRIVP+SYK+KIEER+SVSYPGLPACYVLHSMD+ VEGLPD +FCTVE+EEY  SEET IA + AVSVKKHFWKW+
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

A0A6J1CCN3 uncharacterized protein LOC111009508 | e value: 5.5e-164 | % identity: 99.65
Query:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
Subjt:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKW+
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

A0A6J1E2U8 uncharacterized protein LOC111430319 | e value: 2.3e-122 | % identity: 80.69
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKW+
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

A0A6J1IA21 uncharacterized protein LOC111472979 | e value: 4.4e-121 | % identity: 80
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFK PLL  Q NP RRFLK PSMS SH      PH   RL  H FASPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADS PPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEE+ IA + AVSVKKHFWKW+
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G24460.1 unknown protein | e value: 8.3e-80 | % identity: 54.42
Query:  PLPPPHPISNLTHLNKSTP--------LPDFWLAALSLFVFFSSSSKSFKFPLLPFQF--NPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        P PP  P+ N  ++N   P        LPD +LAA+SL   +SS       P   F F  NP RR +   S     P P T+     FA+PQSLSDWL  
Subjt:  PLPPPHPISNLTHLNKSTP--------LPDFWLAALSLFVFFSSSSKSFKFPLLPFQF--NPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFA+WGVKPGTKNVHNLWLE+S+GETSLADS PP+RTV VV++R++ K+ R+LVE+HQELSDG++R R RPLSEKMKP E+P+ AV+RA+KEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SII-GDLD-CCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETE-----IAHKAAVSVKKHFWKWI
        SI  GD D   + ++I+P +Y  ++EERNS+SYPGLPA Y LHS++  VEGLP+++FCT EE+EYE  + T+     +A   AV+VK+H+WKW+
Subjt:  SII-GDLD-CCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETE-----IAHKAAVSVKKHFWKWI


Sequences
CDS sequence
ATGATCCTTGATATGCCATCACCCCCACCTCCACTTCCACCGCCACACCCCATCTCCAATCTTACCCACCTAAACAAATCCACGCCTCTTCCTGATTTTTGGCTC
GCAGCTCTCTCTCTTTTCGTTTTCTTCTCTTCCTCCTCCAAATCCTTCAAATTCCCTCTTCTCCCTTTTCAATTCAACCCTCATCGCCGCTTTCTGAAGATACCC
TCCATGTCTCTCTCGCATCCCCACCCCAAGACACGCCTCGACAATCACCACTTCGCATCTCCTCAATCCCTCTCCGATTGGCTCGGACCCCGCTTGCCCTCCGAC
TCTTTTGCTTCCTGGGGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGATCTCAGAAGGAGAAACTTCCCTTGCCGACTCCAATCCTCCCATT
CGCACCGTTCAGGTTGTTTCTCTTCGAATTCTTGATAAACATAACCGGGTTCTCGTCGAATCCCACCAGGAACTCTCGGATGGCACCCTACGGAATCGAAATCGA
CCCTTGTCGGAGAAGATGAAGCCGAATGAAACCCCTGAATCCGCGGTCTATCGGGCAGTGAAAGAAGAGCTCGGTTCCATTATTGGCGATCTCGATTGCTGCGAA
ATTGTGAGGATTGTGCCAAATTCGTATAAAATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCTGCTTGTTACGTTTTGCATTCGATGGATGTT
TGGGTGGAAGGTTTACCCGACGAGGAGTTCTGCACGGTGGAGGAAGAAGAATACGAAAAATCTGAGGAGACTGAGATTGCCCACAAGGCTGCTGTGTCCGTGAAG
AAGCATTTCTGGAAATGGATTGTGGGGTGA
mRNA sequence
ATGATCCTTGATATGCCATCACCCCCACCTCCACTTCCACCGCCACACCCCATCTCCAATCTTACCCACCTAAACAAATCCACGCCTCTTCCTGATTTTTGGCTC
GCAGCTCTCTCTCTTTTCGTTTTCTTCTCTTCCTCCTCCAAATCCTTCAAATTCCCTCTTCTCCCTTTTCAATTCAACCCTCATCGCCGCTTTCTGAAGATACCC
TCCATGTCTCTCTCGCATCCCCACCCCAAGACACGCCTCGACAATCACCACTTCGCATCTCCTCAATCCCTCTCCGATTGGCTCGGACCCCGCTTGCCCTCCGAC
TCTTTTGCTTCCTGGGGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGATCTCAGAAGGAGAAACTTCCCTTGCCGACTCCAATCCTCCCATT
CGCACCGTTCAGGTTGTTTCTCTTCGAATTCTTGATAAACATAACCGGGTTCTCGTCGAATCCCACCAGGAACTCTCGGATGGCACCCTACGGAATCGAAATCGA
CCCTTGTCGGAGAAGATGAAGCCGAATGAAACCCCTGAATCCGCGGTCTATCGGGCAGTGAAAGAAGAGCTCGGTTCCATTATTGGCGATCTCGATTGCTGCGAA
ATTGTGAGGATTGTGCCAAATTCGTATAAAATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCTGCTTGTTACGTTTTGCATTCGATGGATGTT
TGGGTGGAAGGTTTACCCGACGAGGAGTTCTGCACGGTGGAGGAAGAAGAATACGAAAAATCTGAGGAGACTGAGATTGCCCACAAGGCTGCTGTGTCCGTGAAG
AAGCATTTCTGGAAATGGATTGTGGGGTGA
Protein sequence
MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPRLPSD
SFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIIGDLDCCE
IVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWIVG
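
The protein sequence can be checked against the CDS by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code applies and that the CDS is given in frame starting at its first base. The names `CODON_TABLE` and `translate` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(sequence) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(sequence), 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First six and last ten codons of the CDS listed above.
print(translate("ATGATCCTTGATATGCCA"))
print(translate("AAGCATTTCTGGAAATGGATTGTGGGGTGA"))
```

The two fragments translate to `MILDMP` and `KHFWKWIVG`, matching the start and end of the protein sequence. Passing the full CDS text, line breaks included, should reproduce the whole protein.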