; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002658 (gene) of Chayote v1 genome

Gene IDSed0002658
OrganismSechium edule (Chayote v1)
DescriptionEnzymatic polyprotein
Genome locationLG04:6380923..6386279
RNA-Seq ExpressionSed0002658
SyntenySed0002658
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032919.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.3e-8070.3Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K SIVR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

KAA0038027.1 Enzymatic polyprotein [Cucumis melo var. makuwa]7.4e-8069.31Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLS ++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]7.4e-8069.31Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F+++++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.5e-8069.8Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.3e-8070.3Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K SIVR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

TrEMBL top hitse value%identityAlignment
A0A5A7SUR9 Enzymatic polyprotein1.6e-8070.3Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K SIVR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

A0A5A7UR29 Enzymatic polyprotein3.6e-8069.31Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F+++++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

A0A5A7UX67 Enzymatic polyprotein3.6e-8069.31Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F+++++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

A0A5D3BEY3 Enzymatic polyprotein7.2e-8169.8Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD+I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K S+VR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

A0A5D3BG41 Enzymatic polyprotein1.6e-8070.3Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LKNAPSEFQ IMNDIFN +QEF+IVYIDDVL+FS  +DQHFKHL+ F NV+K NGLVVS  KIKLFQT+IRFLG++IN G+IKP  RS++F  KFPD I 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS
        DKTQLQRFLGC+NY+ +F++D++ IC  L++RLKKNPKPWTD+HT AVQ IK+L KSIPCLSL++++A LI++TDASDIGYGGILKQ ++ K SIVR+HS
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHS

Query:  GV
        G+
Subjt:  GV

SwissProt top hitse value%identityAlignment
P03554 Enzymatic polyprotein1.5e-3539.81Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LK APS FQ  M++ F  F++F  VY+DD+L+FS N + H  H+        ++G+++S KK +LF+ +I FLG +I+ G  KP    ++  +KFPD + 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI
        DK QLQRFLG L Y ++++  +  I K L  +LK+N P  WT + T  +QK+K  ++  P L     +  LI+ETDASD  +GG+LK          E I
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI

Query:  VRFHSG
         R+ SG
Subjt:  VRFHSG

P03555 Enzymatic polyprotein1.2e-3539.81Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LK APS FQ  M++ F  F++F  VY+DD+L+FS N + H  H+        ++G+++S KK +LF+ +I FLG +I+ G  KP    ++  +KFPD + 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI
        DK QLQRFLG L Y ++++  +  I K L  +LK+N P  WT + T  +QK+K  ++  P L     +  LI+ETDASD  +GG+LK          E I
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI

Query:  VRFHSG
         R+ SG
Subjt:  VRFHSG

P03556 Enzymatic polyprotein1.2e-3539.81Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LK APS FQ  M++ F  F++F  VY+DD+L+FS N + H  H+        ++G+++S KK +LF+ +I FLG +I+ G  KP    ++  +KFPD + 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI
        DK QLQRFLG L Y ++++  +  I K L  +LK+N P  WT + T  +QK+K  ++  P L     +  LI+ETDASD  +GG+LK          E I
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI

Query:  VRFHSG
         R+ SG
Subjt:  VRFHSG

Q00962 Enzymatic polyprotein5.7e-3538.35Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LK APS FQ  M++ F  F++F  VY+DD+++FS N + H  H+        ++G+++S KK +LF+ +I FLG +I+ G  KP    ++  +KFPD + 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI
        DK QLQRFLG L Y ++++ ++  + + L  +LK+N P  WT + T  +QK+K  ++  P L     +  LI+ETDASD  +GG+LK          E I
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI

Query:  VRFHSG
         R+ SG
Subjt:  VRFHSG

Q02964 Enzymatic polyprotein1.2e-3539.81Show/hide
Query:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM
        LK APS FQ  M++ F  F++F  VY+DD+L+FS N + H  H+        ++G+++S KK +LF+ +I FLG +I+ G  KP    ++  +KFPD + 
Subjt:  LKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIM

Query:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI
        DK QLQRFLG L Y ++++  +  I K L  +LK+N P  WT + T  +QK+K  ++  P L     +  LI+ETDASD  +GG+LK          E I
Subjt:  DKTQLQRFLGCLNYVAEFLQDIKPICKSLFERLKKN-PKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCV----DKKESI

Query:  VRFHSG
         R+ SG
Subjt:  VRFHSG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACGTCATGCCCTTTGGCTTAAAAACGCTCCATCAGAATTTCAAAATATCATGAATGATATTTTCAATGGATTTCAAGAATTTTCAATTGTTTACATTGATGATGT
TTTGATTTTTTCTCAAAATATTGATCAACATTTTAAACACCTTCAAACGTTTTACAATGTTGTCAAGAGAAATGGCCTAGTTGTATCGGCCAAGAAAATCAAACTCTTTC
AAACCCAAATCCGTTTCCTTGGATTTGATATCAATTTGGGCCTCATAAAGCCCAGCCACCGGTCCATTGATTTTGCTTCTAAATTCCCGGACCAAATCATGGACAAAACC
CAATTACAACGATTTTTGGGTTGTCTTAATTATGTGGCCGAATTCTTACAGGATATCAAGCCCATATGCAAATCTCTTTTTGAAAGACTAAAAAAGAATCCAAAGCCTTG
GACCGACAAGCATACTGCTGCCGTCCAAAAAATTAAGGCTTTAGTCAAATCAATACCTTGTTTGAGTTTGCTTAATGACAAAGCAAACCTCATAGTGGAGACTGACGCAT
CCGATATCGGATATGGAGGAATCTTGAAACAGTGTGTTGATAAAAAGGAATCGATAGTGCGATTCCACTCTGGCGTATGA
mRNA sequenceShow/hide mRNA sequence
GTTTTACAGACGGCAAACCCTACTTTTCAAAAAAATAAAAAGAAAAACCAACGGAAAACCCTCCCCGTATTTCTCTCTGTAGATTTGCTCCGTTCTTGCGGCGAACCCTC
CGACGATGACGCCAAATCCATTTTCGCCGCCGCATGAACTGCTGCTGTACGGCGCCGATCTCTCAGGAACCTCCGAAGTTGACGCCAAAGCTATTTTCGCCGCTGCATTT
GCTGCTTCGTTCCTGCCCCGATCTGTCCAGATCTGTTGACGGAGAAGCTGAACCACAGGTGCAATAGCTAGATCCGGACATGCTTTCTTTAAATTTGCTCCATCTTTGGC
CGATCTCTCCAGGTAAGCCAAATTTCTTTGCAAGATCTTCAATAATTTGTTGCATTAAGTTCAATCTTTCCGTATTAAACATGAAATTTGGGATAGTTTTGGATCCGAGT
GTGTTGACAGTAGTAGGATGTTCCTTTTGTTACAATTCTTTATCTGGACTAAGCTTCTCTGTATGGTTGAGTAGAAATTCCAATCAATAAAATGAACATTGAGCCAAGAG
GAGAGTTAAAAAGAAAAATGCGAAAAAAATTCCTTTACCAGGAAAAAGTATTTTGATTAATTCTACCTATCTTTTATTAAGTCATTGTTTAATGATGGCTTGAAATCTAA
GCCTGAGTGCCTCTCCTTATGTTGGAAGGACAGCCCTGCCTGTCTCTTTTAAACACAGAATTATGCAACTCATTTTTACCTTGACTAAGAATTCTTCATATTCGATCCTT
AGGTAACAAACTATAATATTAAAATTTGCTCCATGCTCAAGGTGTTTAGGCAAATTCAAAGCATAAAAGGGGTCTCTTTCCGCCAACAAGAGTTAGTAGCTTAGTGGCAT
CAAGGTACACCTATTGACCATATGAGGTCATGGGTTTGAATCCCCTTTCCATCTCTTTTTCGAACTCCGCCAGACCATCTCTTGCATTACATAGTGCTGCTGAAGTTGGC
AGAAAATCAGAGTAAATTCTCCATACATTAGGTGCTTCATTTTTCTATGAATATTACAATTCCATAGGGGTGTTTGCGTTTGGTGTTGAAGAGAGGAGAGGAAGGGTAAG
GAGTTTGGATCCCAACTCCATGTGTATTTATAGGATATCAATCTCTCCTCCAACATTACATCAAATCTTTTATTTAATCTTTTAAATGTCACATCAAACTTTTTTTTGTA
ATTTTTAACCATATAATATAATAAAATATATTTTATCTAATCTTACAAATATCATATCAAACTTTATCTAATTTTCAACCATATAATATAAAATAAAATATATTAAATTT
ATTAATTAATATAAATATACAATGATTACATTAAAAAAATAATATAAAATAAACTTCATATCTCTAAATCACTCAACTTACTCTTTCTCTCCTCTCCTCTTTTCAACTTC
AAAATTCAATCGCCCCCAATTGGACCAGTAGCCACATTTAACATGAGATTATATCCTATAAGAAGATTGAGGAACAATTGGTTAATCCCAGAATAAAACTTCGGATTAAC
CAATTCCAAATGAAACTCGAACAGGACATCTATTCAGATTTCCCCAATGCATTTTGAGAAAGAAAACAACATGTCGTTTCTCTCCCCTATACAAAGGACTTTTCGGAAGA
AGCAATTCCGACAAAGTCAAGACTAATTCAAATGAATGCTAATCTTCTTGCTATTTGCAAGAAGGAAATTGATGATTTGCTGCAGAAGAAGTTGATCAGGCCGTCAAAAA
GCCCATGGTCTTGTGTAGCATTCTATGTCAACAACCAGGCCGAAAAGGAGCGAGGCGTCCCCAGGCTGGTAATCAACTATAAACCCTTAAACAAAGTTTTGGAATGGATT
CGATATCCGATTCCAAATAAAACGGATTTAATCAAGCGCATTTCAAGCGCAAAGGTTTTTTCAAAATTCGATCTCAAATCAGGTTTCTGGCAAATCCAAATTGTCAAAAA
AGATCGATACAAAACCGCTTTCAATGTCCCGTTTGGACAATATGAATGGAACGTCATGCCCTTTGGCTTAAAAACGCTCCATCAGAATTTCAAAATATCATGAATGATAT
TTTCAATGGATTTCAAGAATTTTCAATTGTTTACATTGATGATGTTTTGATTTTTTCTCAAAATATTGATCAACATTTTAAACACCTTCAAACGTTTTACAATGTTGTCA
AGAGAAATGGCCTAGTTGTATCGGCCAAGAAAATCAAACTCTTTCAAACCCAAATCCGTTTCCTTGGATTTGATATCAATTTGGGCCTCATAAAGCCCAGCCACCGGTCC
ATTGATTTTGCTTCTAAATTCCCGGACCAAATCATGGACAAAACCCAATTACAACGATTTTTGGGTTGTCTTAATTATGTGGCCGAATTCTTACAGGATATCAAGCCCAT
ATGCAAATCTCTTTTTGAAAGACTAAAAAAGAATCCAAAGCCTTGGACCGACAAGCATACTGCTGCCGTCCAAAAAATTAAGGCTTTAGTCAAATCAATACCTTGTTTGA
GTTTGCTTAATGACAAAGCAAACCTCATAGTGGAGACTGACGCATCCGATATCGGATATGGAGGAATCTTGAAACAGTGTGTTGATAAAAAGGAATCGATAGTGCGATTC
CACTCTGGCGTATGAAATGACGCACAGAGGAATTACTCAACGGTAAAAAAAGAAATCCTTGCAATTGTATTGTGTATACAGAAATTCCAAGGTGAATTAATCAATAAACG
ATTTTTAATCAAAACATAATCCAGAGCATCAAAATTTATTTTAGAAAAAGATGTAAAAAATCTGGTTTCTAAACAAATTTTTGCAAGATGGCAAGAAATTTTATCATGTT
TTAACTTCGAGGTTTTGCCTATAAAAGGGACGAAAAACGCCATGGCAGATTACCTCTCAAGAGAGTTCTTCTCTCAATCTTCTCTCTCTCTTCCTTCTCATGAGTGAAAG
AATAAGTGGTCGCGATAGAGGCCATGGTGCCTCCTCCTCCTCAAGGCAGTCTTTTGCTGCTCAGTAGCCTTCAGTTGCTTCTTCAAAGAAATCCTCATGTGCCAAGATGT
CACCTGACTCATATGCCGATGATTCTGATTTCATCTTGGTGAGATCAAAAGGCAAATCCAAAGCTGGGTCATCAACCCAGATCCAGTCTGCACATCCTATGCAGTCGACT
CAAAATCCGTCGCCAACGACGTTTGCAAATGCTGTCGTTCCCGACAGATTCATTCCTAAACCCGAAATCAAGGGTTATTTAAAAAAGAATACTCCTCATGGAGAATTCGT
CATTGTGCCAGAATTCGACGACCCCAACGTTTCAAAAATCGTCGAAAACATATTCCCAACTGGCTTTTTCTATATGCCAGAAACTCATCAAAAAACCTGTCGGTTTTATG
AATTCATCCTCGTGGATACAAAGTCCGCCGAGATAATTCATCATCCCGATCCAAAAGATCTAGAACAGATCGCATATTCAAAATTAAAAATTTTCAAAGTTCTAAATCCC
ACGGCATGGAACCAGAGTATCCATACCGAAAAATCATTTTCGAGGAAATTCACTCCTCAAACCTACTCATATCGCGATTATGTTCGTGCCTGGTATCACGAATTGTTTTA
CTAGAATTTCAAACATTCATGGTTTATTAATTTCTGCCCGAACGCACAGAAGGTTCATTTCCCGATGTGGTTCGTAAGCTGATGGATGGTGTTTGTTCTTACCACCAACA
TCTTCCCAGTCCCAATTCAAGAATCATTCTACTATTTTTCGAAGAATATCTTCAAAAATAGTTTTCCAATTACAATGACTTTCTCCATGTACTTTCAAATCCCATGGATT
ATATCATGAAATTTCGAAGTGCGTGTTTCCACTCTCACCAAGAGTTTGGTGAAAGTCATCTCAATCAAATGGTGGGATAAATTCAACACATCCCACTCAGATATTCGATT
CATGAAAGTTTGGTTCGCCAAGAATGGTCACCTTCAAGACCTTTCTTCTCAACGGAACCAAGAATTCTTCAATAAAAAAGCGCAGTTGATTGCTGCCCTAGCTCAAACCT
CATCTGAATCTGATTTGCAGAAAAGATGGAGTGAGCTAGCTTCATTAGCGGATGATTCAGTCTCATCCAAACAATCTGAAGTCGAACCATCAAACTCAGATGAACAATGA
ACCGTAATCGTTCTAAAACTAAAATTATTCATAGACCGAAATCGTCATTAAACTATCAACGATCGAAAAGTCGTTAAACTAAGAAACTTCATAGACTGCAAAGTCATTAA
ACTATCATTAGACCGCAACGTCGTTAAACTATCAAAGACTGCAATTGTCATTAAACTACGATTACTCAAATTATATTACAATTACAAAGGACCACAAAAGGTCCAACTAA
AAAAGGGTTGTAATGGTCCAAAAAAAAAAAAATTTGTACATACATGTTAAATTTTGTTCCTGTAAATTTCTATAAATAAACGAGAAAAAAGCCATTCAAGACGGAGACAA
AAATTTGCGGAGTTTAGCACACCCAGAGCCTATTTCTGGCTCGGAAAATACTTGTATTTTCAACACCCGAGAATTAGATTTTCAAAACCAGAGGCTTAAGACTACAGGTC
CGGTGCGAAGTACCAGAGGCAATAAGGCGCGTAAATGCGATGTTAGCAAAGCGCAGTGAGTCCATTGGTCTCCAGCATAGGATACGGCTGAAGTGTACGTGAAAATCTTG
AATTCGTTCGATGTCTACTTCGGATATGCGTCCAAAGTGGAAGAGGGTATGCTCAATAATGCAGATTTGAAGATTCGACCTGTCGGCTTGTCATCTCAACTTTACTTTTT
CATTTTAACGTTTTGTAATAATTTAGATTCCTTTACTATTTGTTTTTTACTGTTTGTCTTTTACTATTTGTCGTTTGTAATAATGTAGTTACTGTTTACGTTTGTCAGTT
TGTAATAATTTAGTTTTACTTTTTACTTTTTCAAAAGTGGTTTCGAGTGCAAGAGTCCTGTCTTGCCTCCCACTTCTTATATTTTAATTTTCTATTTTCAATTTTCATTT
CAAGTCCGTAGTCTATATAAGACTGCGAGTTTGTAAGTTCAAAGCATTCGATGCATTTTGCTCTCTACTTCACTCTGAAGATAGAATGATTCCTGCTTTGAGTCTTTGGA
TATTCGGTTGTAAAGATCCTCCCTCTGTTTGGTATCAGAACCAACCATATCCAAATAAAATATTTCTCATATATCTTTGTATCTTTTATCATAAATAAAAGTTTGGAAGA
TTGTTA
Protein sequenceShow/hide protein sequence
MERHALWLKNAPSEFQNIMNDIFNGFQEFSIVYIDDVLIFSQNIDQHFKHLQTFYNVVKRNGLVVSAKKIKLFQTQIRFLGFDINLGLIKPSHRSIDFASKFPDQIMDKT
QLQRFLGCLNYVAEFLQDIKPICKSLFERLKKNPKPWTDKHTAAVQKIKALVKSIPCLSLLNDKANLIVETDASDIGYGGILKQCVDKKESIVRFHSGV