; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006157 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006157
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNudix hydrolase domain-containing protein
Genome locationscaffold96:760731..761612
RNA-Seq ExpressionMS006157
SyntenyMS006157
Gene Ontology termsNA
InterPro domainsIPR015797 - NUDIX hydrolase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138298.1 uncharacterized protein LOC111009508 [Momordica charantia]2.2e-167100Show/hide
Query:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
Subjt:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS
        SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS

XP_022922297.1 uncharacterized protein LOC111430319 [Cucurbita moschata]1.8e-12480.74Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKWVS DS+D
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

XP_022974352.1 uncharacterized protein LOC111472979 [Cucurbita maxima]3.4e-12380.07Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFK PLL  Q NP RRFLK PSMS SH      PH   RL  H FASPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADS PPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEE+ IA + AVSVKKHFWKWVS DS+D
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

XP_023550799.1 uncharacterized protein LOC111808831 [Cucurbita pepo subsp. pepo]2.4e-12480.74Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKWVS DS+D
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]9.3e-12180.56Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF--SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLS-HPHPKTRLDNHHFASPQSLSDWLGPR
        MPS P P+PPP P SNL HLNKST LPDF+LAALSLFVFF  SSSSKSFKFP    Q NP RRFLKIPS S+     P ++  +  F SPQSLS+WL PR
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF--SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLS-HPHPKTRLDNHHFASPQSLSDWLGPR

Query:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
        LPSDSFASWGV PGTKNVHNLWLEIS+GETSLADSNPPIRT+ V+SLRILD H+RVLVESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
Subjt:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS

Query:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSA
        IIGD DC +IVRIVP+SYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLP+ EFCTVEEEEY  SEET IA + AVSVKKHFWKWV +
Subjt:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSA

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ6 Uncharacterized protein5.0e-11274.66Show/hide
Query:  PPPPLPP--PHPISNLTHLNKS-TPLPDFWLAALSLFVFF---SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPR
        PPPP+PP  P PISNLTHLNKS   LPDF+LAALSLF F    SSSSKSFKFP    Q NP RRF KIPS+S+  P+  ++  +  F SPQSLS+WL PR
Subjt:  PPPPLPP--PHPISNLTHLNKS-TPLPDFWLAALSLFVFF---SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPR

Query:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS
        LPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIRT+ V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGS
Subjt:  LPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGS

Query:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        I+GD D  ++VRIVP+SY++KIEER+SVSYPGL A YVLHSMDVWVEGLPD +FCTVEEEEY  SE+T IA   AVSVKKHFWKWVS +SVD
Subjt:  IIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

A0A1S3AUP2 uncharacterized protein LOC1034830011.8e-11475.77Show/hide
Query:  PSPPPPLPP--PHPISNLTHLNKST-PLPDFWLAALSLFVFFSSS--SKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        P PPPP+PP  P PISNLTHLNKST  LPDF+LAALSLF FFSSS  SKSFKFP    Q NP RRFLKIPS S+  P+  ++  +  F SPQSLS+WL P
Subjt:  PSPPPPLPP--PHPISNLTHLNKST-PLPDFWLAALSLFVFFSSS--SKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIR + V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        SI+ D DC  +VRIVP+SYK+KIEER+SVSYPGLPACYVLHSMD+ VEGLPD +FCTVE+EEY  SEET IA + AVSVKKHFWKWVS +SVD
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

A0A6J1CCN3 uncharacterized protein LOC1110095081.1e-167100Show/hide
Query:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
Subjt:  MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS
        SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS
Subjt:  SIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS

A0A6J1E2U8 uncharacterized protein LOC1114303198.7e-12580.74Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFKFPLL  Q NP RRFLK PSMS SH      PH   RL  H F SPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADSNPPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEET IA + AVSVKKHFWKWVS DS+D
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

A0A6J1IA21 uncharacterized protein LOC1114729791.6e-12380.07Show/hide
Query:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW
        MPS PPP+PPP PIS+L HL +S PLPDF+LAALSLFVF  SSSS+SFK PLL  Q NP RRFLK PSMS SH      PH   RL  H FASPQSLSDW
Subjt:  MPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFF-SSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSH------PHPKTRLDNHHFASPQSLSDW

Query:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
        L PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSLADS PPIRTVQV+SLRI+D H R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE
Subjt:  LGPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD
        ELGSI+GD DC EIV+IVP+SYKMKIEERNS SYPGLPACYVLHSMDV VEGLP E+FCTVEEEEY  SEE+ IA + AVSVKKHFWKWVS DS+D
Subjt:  ELGSIIGDLDCCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G24460.1 unknown protein6.9e-8254.82Show/hide
Query:  PLPPPHPISNLTHLNKSTP--------LPDFWLAALSLFVFFSSSSKSFKFPLLPFQF--NPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP
        P PP  P+ N  ++N   P        LPD +LAA+SL   +SS       P   F F  NP RR +   S     P P T+     FA+PQSLSDWL  
Subjt:  PLPPPHPISNLTHLNKSTP--------LPDFWLAALSLFVFFSSSSKSFKFPLLPFQF--NPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGP

Query:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFA+WGVKPGTKNVHNLWLE+S+GETSLADS PP+RTV VV++R++ K+ R+LVE+HQELSDG++R R RPLSEKMKP E+P+ AV+RA+KEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SII-GDLD-CCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETE-----IAHKAAVSVKKHFWKWVSADSVD
        SI  GD D   + ++I+P +Y  ++EERNS+SYPGLPA Y LHS++  VEGLP+++FCT EE+EYE  + T+     +A   AV+VK+H+WKWVS DS+ 
Subjt:  SII-GDLD-CCEIVRIVPNSYKMKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETE-----IAHKAAVSVKKHFWKWVSADSVD

Query:  S
        S
Subjt:  S


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCTTGATATGCCATCACCCCCACCTCCACTTCCACCGCCACACCCCATCTCCAATCTTACCCACCTAAACAAATCCACGCCTCTTCCTGATTTTTGGCTCGCAGC
TCTCTCTCTTTTCGTTTTCTTCTCTTCCTCCTCCAAATCCTTCAAATTCCCTCTTCTCCCTTTTCAATTCAACCCTCATCGCCGCTTTCTGAAGATACCCTCCATGTCTC
TCTCGCATCCCCACCCCAAGACACGCCTCGACAATCACCACTTCGCATCTCCTCAATCCCTCTCCGATTGGCTCGGACCCCGCTTGCCCTCCGACTCTTTTGCTTCCTGG
GGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGATCTCAGAAGGAGAAACTTCCCTTGCCGACTCCAATCCTCCCATTCGCACCGTTCAGGTTGTTTC
TCTTCGAATTCTTGATAAACATAACCGGGTTCTCGTCGAATCCCACCAGGAACTCTCGGATGGCACCCTACGGAATCGAAATCGACCCTTGTCGGAGAAGATGAAGCCGA
ATGAAACCCCTGAATCCGCCGTCTATCGGGCAGTGAAAGAAGAGCTCGGTTCCATTATTGGCGATCTCGATTGCTGCGAAATTGTGAGGATTGTGCCAAATTCGTATAAA
ATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCTGCTTGTTACGTTTTGCATTCGATGGATGTTTGGGTGGAAGGTTTACCCGACGAGGAGTTCTGCAC
GGTGGAGGAAGAAGAATACGAAAAATCTGAGGAGACTGAGATTGCCCACAAGGCTGCTGTGTCCGTGAAGAAGCATTTCTGGAAATGGGTTAGTGCCGATTCTGTGGATT
CT
mRNA sequenceShow/hide mRNA sequence
ATGATCCTTGATATGCCATCACCCCCACCTCCACTTCCACCGCCACACCCCATCTCCAATCTTACCCACCTAAACAAATCCACGCCTCTTCCTGATTTTTGGCTCGCAGC
TCTCTCTCTTTTCGTTTTCTTCTCTTCCTCCTCCAAATCCTTCAAATTCCCTCTTCTCCCTTTTCAATTCAACCCTCATCGCCGCTTTCTGAAGATACCCTCCATGTCTC
TCTCGCATCCCCACCCCAAGACACGCCTCGACAATCACCACTTCGCATCTCCTCAATCCCTCTCCGATTGGCTCGGACCCCGCTTGCCCTCCGACTCTTTTGCTTCCTGG
GGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGATCTCAGAAGGAGAAACTTCCCTTGCCGACTCCAATCCTCCCATTCGCACCGTTCAGGTTGTTTC
TCTTCGAATTCTTGATAAACATAACCGGGTTCTCGTCGAATCCCACCAGGAACTCTCGGATGGCACCCTACGGAATCGAAATCGACCCTTGTCGGAGAAGATGAAGCCGA
ATGAAACCCCTGAATCCGCCGTCTATCGGGCAGTGAAAGAAGAGCTCGGTTCCATTATTGGCGATCTCGATTGCTGCGAAATTGTGAGGATTGTGCCAAATTCGTATAAA
ATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCTGCTTGTTACGTTTTGCATTCGATGGATGTTTGGGTGGAAGGTTTACCCGACGAGGAGTTCTGCAC
GGTGGAGGAAGAAGAATACGAAAAATCTGAGGAGACTGAGATTGCCCACAAGGCTGCTGTGTCCGTGAAGAAGCATTTCTGGAAATGGGTTAGTGCCGATTCTGTGGATT
CT
Protein sequenceShow/hide protein sequence
MILDMPSPPPPLPPPHPISNLTHLNKSTPLPDFWLAALSLFVFFSSSSKSFKFPLLPFQFNPHRRFLKIPSMSLSHPHPKTRLDNHHFASPQSLSDWLGPRLPSDSFASW
GVKPGTKNVHNLWLEISEGETSLADSNPPIRTVQVVSLRILDKHNRVLVESHQELSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIIGDLDCCEIVRIVPNSYK
MKIEERNSVSYPGLPACYVLHSMDVWVEGLPDEEFCTVEEEEYEKSEETEIAHKAAVSVKKHFWKWVSADSVDS