CuGenDBv2

Moc04g00060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g00060
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:19383..28577
RNA-Seq Expression: Moc04g00060
Synteny: Moc04g00060
Gene Ontology terms:
  GO:0050789 - regulation of biological process (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
BBH07150.1 TatD related DNase [Prunus dulcis] | e value: 1.0e-46 | % identity: 41.92
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+  +D+  V  +W SR   W+   +   +GGI +LWN Q +++ +S++G FS          +++      G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
          L   C   WC+G DFNV R+ +EK +  +GR+T+SM++FN  I+  NL DP L N SFTWSN+R      RLDRFLV   W D F + R   L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP
        DH PI++    +KWGP+PFRFEN+WL  P
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP

RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 1.2e-45 | % identity: 36.06
Query:  KETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNG--WFTGVYGPSSSSTGRADFW
        +ETK+ + D+ FV S+W+ + V W++L A   +GGI+ILW+  K   +  +LG+FS T+ F+            +G  G  W T VYGP  +   R DFW
Subjt:  KETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNG--WFTGVYGPSSSSTGRADFW

Query:  RELEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFR
         EL+ L       WC+G DFNV R +SEK    + R+T +M+ F+  I    LLDPPL N +FTWSNM+  P   RLDRFL   +W   F+      L R
Subjt:  RELEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFR

Query:  TTSDHFPIQMAFEAIKWGPTPFRFENVWLDSPQL---LPMWNKEVYGLIGVQKADLIRIISDIN-KKEEMQTLTVEEIDRRQHLKTQPLFSEEYGKMVGH
         TSDH PI +    +KWGPTPFRFEN+WL  P+      +W +E  G  G +    +R +  +  K ++   +T  ++  R+ L    L   +   ++  
Subjt:  TTSDHFPIQMAFEAIKWGPTPFRFENVWLDSPQL---LPMWNKEVYGLIGVQKADLIRIISDIN-KKEEMQTLTVEEIDRRQHLKTQPLFSEEYGKMVGH

Query:  YSNFPPDVSSKVSDRQMLWKSLRGILFGME
          N  PD+   V +R +  K L  +L   E
Subjt:  YSNFPPDVSSKVSDRQMLWKSLRGILFGME

RVW74143.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 1.5e-45 | % identity: 40.91
Query:  KETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWRE
        +ETK++  D+ FV SLW++R+  W  L A   +GGIL++W+ +K++    +LG+FS          +++ F++      W + VYGP+S++  R DFW E
Subjt:  KETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWRE

Query:  LEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTT
        L  +       WC+G DFNV R  SEK     GR+T SMK+ +  I    L+DPPL + SFTWSNM+  P   RLDRFL   +W  +F       L R T
Subjt:  LEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTT

Query:  SDHFPIQMAFEAIKWGPTPFRFENVWLDSPQLLPMWNKEVYG
        SDH+ I +     KWGPTPFRFEN+WL  P       KE++G
Subjt:  SDHFPIQMAFEAIKWGPTPFRFENVWLDSPQLLPMWNKEVYG

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis] | e value: 1.0e-46 | % identity: 41.92
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+  +D+  V  +W SR   W+   +   +GGI +LWN Q +++ +S++G FS          +++      G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
          L   C   WC+G DFNV R+ +EK +  +GR+T+SM++FN  I+  NL DP L N SFTWSN+R      RLDRFLV   W D F + R   L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP
        DH PI++    +KWGP+PFRFEN+WL  P
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP

VVA41200.1 PREDICTED: RNA-directed DNA polymerase, partial [Prunus dulcis] | e value: 4.7e-47 | % identity: 41.57
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+   D+     +W SR   W+   ++  +GGI ++WN Q  +IS+  +G FS          +++     +G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
         GL   C   WCIG DFNV R++SEK +   GR+T SMK FN  I+  NL DP L N SFTWSN R      RLDRFL  ++W D F + + + L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRI
        DH PI +    +KWGP PFRFEN+ L +  Q + +WNKEV+G L+  +K    RI
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRI

TrEMBL top hits (e value, % identity, alignment)
A0A4Y1RS61 TatD related DNase | e value: 5.1e-47 | % identity: 41.92
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+  +D+  V  +W SR   W+   +   +GGI +LWN Q +++ +S++G FS          +++      G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
          L   C   WC+G DFNV R+ +EK +  +GR+T+SM++FN  I+  NL DP L N SFTWSN+R      RLDRFLV   W D F + R   L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP
        DH PI++    +KWGP+PFRFEN+WL  P
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment) | e value: 5.1e-47 | % identity: 41.92
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+  +D+  V  +W SR   W+   +   +GGI +LWN Q +++ +S++G FS          +++      G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
          L   C   WC+G DFNV R+ +EK +  +GR+T+SM++FN  I+  NL DP L N SFTWSN+R      RLDRFLV   W D F + R   L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP
        DH PI++    +KWGP+PFRFEN+WL  P
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDSP

A0A5E4GN72 PREDICTED: RNA-directed DNA polymerase (Fragment) | e value: 2.3e-47 | % identity: 41.57
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+   D+     +W SR   W+   ++  +GGI ++WN Q  +IS+  +G FS          +++     +G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
         GL   C   WCIG DFNV R++SEK +   GR+T SMK FN  I+  NL DP L N SFTWSN R      RLDRFL  ++W D F + + + L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRI
        DH PI +    +KWGP PFRFEN+ L +  Q + +WNKEV+G L+  +K    RI
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRI

M5VS59 Reverse transcriptase domain-containing protein (Fragment) | e value: 1.3e-47 | % identity: 41.38
Query:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL
        ETK+ ++D+  V  +W SR   W+   +   +GGI +LWN Q +++ +S++G FS          +++      G + W +G+YGP      R  FW EL
Subjt:  ETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGRADFWREL

Query:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS
          L   C   WC+G DFNV R+ +EK +  +GR+T+SM++FN  I+  NL DP L N SFTWSN+R      RLDRFLV   W D F + R   L R TS
Subjt:  EGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRTTS

Query:  DHFPIQMAFEAIKWGPTPFRFENVWLDSPQLL
        DH PI++    +KWGP+PFRFEN+WL+ P  +
Subjt:  DHFPIQMAFEAIKWGPTPFRFENVWLDSPQLL

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | e value: 7.8e-48 | % identity: 32.27
Query:  TREEDKIEEEEHS--RPKETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTG
        TR E ++   + +  + +ET +   D+     +W SR   W+   ++  +GGI+++WN Q I+IS+  +G FS          +++     +G + W +G
Subjt:  TREEDKIEEEEHS--RPKETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTG

Query:  VYGPSSSSTGRADFWRELEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKD
        +YG       R  FW EL GL   C   WCIG DFNV R++SEK +   GR+T SMK FN  I+  NL DP L N SFTWSN R      RLDRFL F+ 
Subjt:  VYGPSSSSTGRADFWRELEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKD

Query:  WLDVFANQRVSKLFRTTSDHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRII------------SDINKKEEMQTLTV
        W D F + + + L R TSDH PIQ+    +KWGP PFRFEN+ L +  Q +  WNKEV+G L+  +K    RI             + + K+ E     V
Subjt:  WLDVFANQRVSKLFRTTSDHFPIQMAFEAIKWGPTPFRFENVWLDS-PQLLPMWNKEVYG-LIGVQKADLIRII------------SDINKKEEMQTLTV

Query:  EEIDRRQHLKTQPLFSEEYGKMVGHYSNFPPDVSSKVSDRQMLWKSLRGILFGMEGNIGRLGILLNGLGLRYLHNREEWELDFYAISTKRNFSIANC---
         ++  ++ LK +     ++ +          D ++K   R    +  R  +  +E  +   G+++N           EWE++   I+  +N   +N    
Subjt:  EEIDRRQHLKTQPLFSEEYGKMVGHYSNFPPDVSSKVSDRQMLWKSLRGILFGMEGNIGRLGILLNGLGLRYLHNREEWELDFYAISTKRNFSIANC---

Query:  WNTEALTWD
        W  E L W+
Subjt:  WNTEALTWD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
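
The alignment blocks above follow the usual BLAST pairwise layout: the middle line repeats a residue letter where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch. As a rough illustration of how identity and positive counts can be read off that middle line, here is a minimal Python sketch. The function name `alignment_stats` is hypothetical, and the input is only the final block of the BBH07150.1 alignment, so its percentage is not expected to equal the whole-alignment % identity reported for that hit.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    Assumes `midline` is the match line already stripped of its left
    indentation, so it has exactly one character per query column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the match line marks an identical residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    # "+" marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / len(query), 2),
    }


# Last block of the BBH07150.1 alignment, copied from the record.
query = "DHFPIQMAFEAIKWGPTPFRFENVWLDSP"
midline = "DH PI++    +KWGP+PFRFEN+WL  P"

stats = alignment_stats(query, midline)
print(stats)
```

Summing identities and lengths over every block of a hit, rather than a single block, would be the way to approximate the reported whole-alignment figure.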

Sequences
CDS sequence
ATGGCTCCACGGAGATTATCCACTATGATAGAAAACAAATCCTTCGATCAGAAATATAAAGGAAGAGTGGTCCAGATTGAAGAACTTCACCTACAAAGAAGATTT
ACAGTTTCCTTTGAAGAAGTGATTGTAGTTTGGTTAACAGATGCTATCTCAGACCTTCTTCTCTCACCTCAAAACCAGAAGTTTTTTCGGAAGACCCATTGTGCA
AATGGGGAAGATTTCAAAGGCTGGATATCCTTTAAGCAACTTTTGCTAGATTTCCTGAATGGGTTCGAATCAACGAGCCTTGCAAAGCTCCCTCATAGGCCATCA
TCAGAAAAATCATATGCTTCAGCTCTCTTGTGCCCATCTCCGACACAAAGGAAGAAAACAGAAGTGCAGCCACTGACTAAAGAAGTTAGAATCAAGATTCAAGAT
GAAGAGACCTCCTTTTTAGCGCACGTATCGACCTTCGAGGATGTGAAATTACTGATCGGAAAAGAAGCCAAGGTTCACGGCAGCTTCCCGCCGGAAGCTGCAGCA
CGTTTCCATGGAGGATCGGCGGGAAACATAGGTCTCAGTCCAATGGACCGTTGGAGGACAGAGGATGGTTTCTTTTACCCAGTGGTCATCGCAAATCCCCCCTTG
ATGAAAGACCCTGCTGCATCGAGACACTCAACGACAGAGACCTGCGAAAAGAGAGAAGAAAAAGAGGAGAATGCTAAGAAGAAGGAAATCCAATTAGAAGTCTCA
TTAGAGTCTTCTTGGTCAAGCGAAGAACTCCCACGGGTAGAAGGGTTGCATCTAGGGGATCCTTCTACAGAATTTCCAGATGGCTTTCATAGTTGTTTTGATTTA
AGTGTTGAAGAAGAGACACAAACGGCTTTTGAGAAACAGGGAATAGAGCTTATACTAGCCGAGTTCGAACCTTTAGACTGCCAGCTACCAGAGACTAGGGAAGAA
GACAAAATAGAGGAGGAAGAACACTCTCGACCAAAGGAGACGAAGCAAAGTTCCATAGACAAAAGCTTTGTTAGATCGCTTTGGAGCTCTAGACATGTCGGCTGG
ATTTCTTTAAATGCTCAAAACACAGCAGGGGGTATTCTAATTTTATGGAATGAACAAAAGATCAATATCTCTAACTCTCTTCTGGGCACTTTCTCCACTACTTTA
CATTTTAGTCTTACTAATGGCATTACTTTACATTTTAGTCTTACTAATGGCAGGAACGGCTGGTTTACTGGGGTTTATGGTCCCTCTTCATCATCTACAGGCAGA
GCCGATTTTTGGAGGGAATTAGAGGGCTTAGTCAGCAAATGCCATGGTGCTTGGTGTATAGGAGCTGACTTCAACGTGGCTAGATGGCTCTCAGAAAAACAGAGC
AAGAAACAAGGGCGAATCACTCGTAGTATGAAAGAGTTTAATGCTCTTATTGAAGCAATGAACTTATTAGATCCTCCTTTGGACAACGGCTCGTTCACTTGGTCA
AATATGAGAGGCACCCCGACTCTTTCAAGGTTAGATAGATTCCTTGTCTTTAAAGATTGGCTTGATGTTTTTGCCAATCAAAGAGTTTCTAAGCTGTTCCGCACA
ACTTCGGACCACTTCCCCATTCAGATGGCTTTTGAGGCTATTAAATGGGGTCCTACCCCCTTTCGTTTTGAGAATGTATGGCTCGATTCCCCACAGCTTCTTCCT
ATGTGGAATAAGGAAGTCTATGGCCTTATCGGTGTTCAGAAGGCTGATTTGATCAGAATTATTAGTGATATTAATAAGAAAGAGGAAATGCAGACCCTAACAGTA
GAGGAGATTGATAGAAGGCAACACCTGAAGACACAGCCCCTTTTCTCGGAAGAATATGGAAAGATGGTGGGGCATTATTCAAATTTTCCTCCAGATGTCAGCTCC
AAAGTCAGTGACAGACAGATGCTATGGAAAAGCTTACGAGGGATTTTATTTGGAATGGAGGGAAATATAGGCCGATTGGGAATCTTGTTAAATGGTCTTGGACTG
CGCTACCTTCACAACAGGGAGGAATGGGAGCTAGATTTCTATGCTATCTCAACTAAGAGGAATTTTTCTATAGCAAATTGTTGGAACACAGAGGCCTTGACTTGG
GACCTTGGCTTACGGAGGAACCTGTTTGACAGAGAACTTGATAGATGGGCGATGTTCACCGGGAAAATTGAAGGGCTGGTCTTAGGACAAGATAATGATTCCATG
TGCTGGACAGCTGATAGCAAAGGAGTTTTTACAGTGAAATCGGCCTTTATGGCCCTCACCTCCCCCTCTCCTAAGTTAAATGCAGCCACGGCATCTTTTATCTGG
AACTTAAGGAGCGCTGAAACGGTGGATCATCTCTTTTTGCATTGTCCTTTTGCAGCTACAACATGGAATTACTTGGCATCTTATCTAGATTTGGCTCTCAGCTTA
CCAAGGAAAATAGAGGATTTTATAGAAGAAGGCTTTGGTGGATCTTTGTTAAAAGACAAAGCCTTGCTTAGATTTGAAGATGGTAGGCTATTGAAGAAGGAAGAG
ATGAAGGGCAAATGGTGGAAAATCGGAGATCTTTATCTGAAGATGGAACTTAGTTGTCTTAGGGAAGAGGGGAAAAAAAGAGGTCGTATGAATGCTATCATTAGG
GGTGTTCATCGGTTGGTCGGATCGGTTTTCAGGCCTAAATCGACTCTGACCGACACACTGGCTACTGCGACGGCGTGGGCGATTGGCAAGTGTGGCATGTGGGTG
GGGGCCGTGGGGCTGCTGCAGGAACTGAGCGGCGGGCGGGATCCTGGGTTGCGGCGAGATAAGGACGAAAATCAGGCTTTGAAACTGGAAATGACTCGTAATATG
GCTGTTGGTTGGGGGAAGGAGTCAGAAAGGCTAGTGAAGGAGATTTCCCATGGAGTTCAAAGGAAGTCATACGATTCCGCTGTAATGGAATCAAGGTTGGAAGTT
GTAATGTCAGGAAATTCTAGAGATGAAGTATTCCCCGTTGGTGTGGTGAAACACAGACTCCATTGTATGATAGACAGTGGGTTAGAAAAGCTGAAGAAGTACTAG
mRNA sequence
ATGGCTCCACGGAGATTATCCACTATGATAGAAAACAAATCCTTCGATCAGAAATATAAAGGAAGAGTGGTCCAGATTGAAGAACTTCACCTACAAAGAAGATTT
ACAGTTTCCTTTGAAGAAGTGATTGTAGTTTGGTTAACAGATGCTATCTCAGACCTTCTTCTCTCACCTCAAAACCAGAAGTTTTTTCGGAAGACCCATTGTGCA
AATGGGGAAGATTTCAAAGGCTGGATATCCTTTAAGCAACTTTTGCTAGATTTCCTGAATGGGTTCGAATCAACGAGCCTTGCAAAGCTCCCTCATAGGCCATCA
TCAGAAAAATCATATGCTTCAGCTCTCTTGTGCCCATCTCCGACACAAAGGAAGAAAACAGAAGTGCAGCCACTGACTAAAGAAGTTAGAATCAAGATTCAAGAT
GAAGAGACCTCCTTTTTAGCGCACGTATCGACCTTCGAGGATGTGAAATTACTGATCGGAAAAGAAGCCAAGGTTCACGGCAGCTTCCCGCCGGAAGCTGCAGCA
CGTTTCCATGGAGGATCGGCGGGAAACATAGGTCTCAGTCCAATGGACCGTTGGAGGACAGAGGATGGTTTCTTTTACCCAGTGGTCATCGCAAATCCCCCCTTG
ATGAAAGACCCTGCTGCATCGAGACACTCAACGACAGAGACCTGCGAAAAGAGAGAAGAAAAAGAGGAGAATGCTAAGAAGAAGGAAATCCAATTAGAAGTCTCA
TTAGAGTCTTCTTGGTCAAGCGAAGAACTCCCACGGGTAGAAGGGTTGCATCTAGGGGATCCTTCTACAGAATTTCCAGATGGCTTTCATAGTTGTTTTGATTTA
AGTGTTGAAGAAGAGACACAAACGGCTTTTGAGAAACAGGGAATAGAGCTTATACTAGCCGAGTTCGAACCTTTAGACTGCCAGCTACCAGAGACTAGGGAAGAA
GACAAAATAGAGGAGGAAGAACACTCTCGACCAAAGGAGACGAAGCAAAGTTCCATAGACAAAAGCTTTGTTAGATCGCTTTGGAGCTCTAGACATGTCGGCTGG
ATTTCTTTAAATGCTCAAAACACAGCAGGGGGTATTCTAATTTTATGGAATGAACAAAAGATCAATATCTCTAACTCTCTTCTGGGCACTTTCTCCACTACTTTA
CATTTTAGTCTTACTAATGGCATTACTTTACATTTTAGTCTTACTAATGGCAGGAACGGCTGGTTTACTGGGGTTTATGGTCCCTCTTCATCATCTACAGGCAGA
GCCGATTTTTGGAGGGAATTAGAGGGCTTAGTCAGCAAATGCCATGGTGCTTGGTGTATAGGAGCTGACTTCAACGTGGCTAGATGGCTCTCAGAAAAACAGAGC
AAGAAACAAGGGCGAATCACTCGTAGTATGAAAGAGTTTAATGCTCTTATTGAAGCAATGAACTTATTAGATCCTCCTTTGGACAACGGCTCGTTCACTTGGTCA
AATATGAGAGGCACCCCGACTCTTTCAAGGTTAGATAGATTCCTTGTCTTTAAAGATTGGCTTGATGTTTTTGCCAATCAAAGAGTTTCTAAGCTGTTCCGCACA
ACTTCGGACCACTTCCCCATTCAGATGGCTTTTGAGGCTATTAAATGGGGTCCTACCCCCTTTCGTTTTGAGAATGTATGGCTCGATTCCCCACAGCTTCTTCCT
ATGTGGAATAAGGAAGTCTATGGCCTTATCGGTGTTCAGAAGGCTGATTTGATCAGAATTATTAGTGATATTAATAAGAAAGAGGAAATGCAGACCCTAACAGTA
GAGGAGATTGATAGAAGGCAACACCTGAAGACACAGCCCCTTTTCTCGGAAGAATATGGAAAGATGGTGGGGCATTATTCAAATTTTCCTCCAGATGTCAGCTCC
AAAGTCAGTGACAGACAGATGCTATGGAAAAGCTTACGAGGGATTTTATTTGGAATGGAGGGAAATATAGGCCGATTGGGAATCTTGTTAAATGGTCTTGGACTG
CGCTACCTTCACAACAGGGAGGAATGGGAGCTAGATTTCTATGCTATCTCAACTAAGAGGAATTTTTCTATAGCAAATTGTTGGAACACAGAGGCCTTGACTTGG
GACCTTGGCTTACGGAGGAACCTGTTTGACAGAGAACTTGATAGATGGGCGATGTTCACCGGGAAAATTGAAGGGCTGGTCTTAGGACAAGATAATGATTCCATG
TGCTGGACAGCTGATAGCAAAGGAGTTTTTACAGTGAAATCGGCCTTTATGGCCCTCACCTCCCCCTCTCCTAAGTTAAATGCAGCCACGGCATCTTTTATCTGG
AACTTAAGGAGCGCTGAAACGGTGGATCATCTCTTTTTGCATTGTCCTTTTGCAGCTACAACATGGAATTACTTGGCATCTTATCTAGATTTGGCTCTCAGCTTA
CCAAGGAAAATAGAGGATTTTATAGAAGAAGGCTTTGGTGGATCTTTGTTAAAAGACAAAGCCTTGCTTAGATTTGAAGATGGTAGGCTATTGAAGAAGGAAGAG
ATGAAGGGCAAATGGTGGAAAATCGGAGATCTTTATCTGAAGATGGAACTTAGTTGTCTTAGGGAAGAGGGGAAAAAAAGAGGTCGTATGAATGCTATCATTAGG
GGTGTTCATCGGTTGGTCGGATCGGTTTTCAGGCCTAAATCGACTCTGACCGACACACTGGCTACTGCGACGGCGTGGGCGATTGGCAAGTGTGGCATGTGGGTG
GGGGCCGTGGGGCTGCTGCAGGAACTGAGCGGCGGGCGGGATCCTGGGTTGCGGCGAGATAAGGACGAAAATCAGGCTTTGAAACTGGAAATGACTCGTAATATG
GCTGTTGGTTGGGGGAAGGAGTCAGAAAGGCTAGTGAAGGAGATTTCCCATGGAGTTCAAAGGAAGTCATACGATTCCGCTGTAATGGAATCAAGGTTGGAAGTT
GTAATGTCAGGAAATTCTAGAGATGAAGTATTCCCCGTTGGTGTGGTGAAACACAGACTCCATTGTATGATAGACAGTGGGTTAGAAAAGCTGAAGAAGTACTAG
Protein sequence
MAPRRLSTMIENKSFDQKYKGRVVQIEELHLQRRFTVSFEEVIVVWLTDAISDLLLSPQNQKFFRKTHCANGEDFKGWISFKQLLLDFLNGFESTSLAKLPHRPS
SEKSYASALLCPSPTQRKKTEVQPLTKEVRIKIQDEETSFLAHVSTFEDVKLLIGKEAKVHGSFPPEAAARFHGGSAGNIGLSPMDRWRTEDGFFYPVVIANPPL
MKDPAASRHSTTETCEKREEKEENAKKKEIQLEVSLESSWSSEELPRVEGLHLGDPSTEFPDGFHSCFDLSVEEETQTAFEKQGIELILAEFEPLDCQLPETREE
DKIEEEEHSRPKETKQSSIDKSFVRSLWSSRHVGWISLNAQNTAGGILILWNEQKINISNSLLGTFSTTLHFSLTNGITLHFSLTNGRNGWFTGVYGPSSSSTGR
ADFWRELEGLVSKCHGAWCIGADFNVARWLSEKQSKKQGRITRSMKEFNALIEAMNLLDPPLDNGSFTWSNMRGTPTLSRLDRFLVFKDWLDVFANQRVSKLFRT
TSDHFPIQMAFEAIKWGPTPFRFENVWLDSPQLLPMWNKEVYGLIGVQKADLIRIISDINKKEEMQTLTVEEIDRRQHLKTQPLFSEEYGKMVGHYSNFPPDVSS
KVSDRQMLWKSLRGILFGMEGNIGRLGILLNGLGLRYLHNREEWELDFYAISTKRNFSIANCWNTEALTWDLGLRRNLFDRELDRWAMFTGKIEGLVLGQDNDSM
CWTADSKGVFTVKSAFMALTSPSPKLNAATASFIWNLRSAETVDHLFLHCPFAATTWNYLASYLDLALSLPRKIEDFIEEGFGGSLLKDKALLRFEDGRLLKKEE
MKGKWWKIGDLYLKMELSCLREEGKKRGRMNAIIRGVHRLVGSVFRPKSTLTDTLATATAWAIGKCGMWVGAVGLLQELSGGRDPGLRRDKDENQALKLEMTRNM
AVGWGKESERLVKEISHGVQRKSYDSAVMESRLEVVMSGNSRDEVFPVGVVKHRLHCMIDSGLEKLKKY
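
The protein sequence should be the translation of the CDS sequence listed earlier in this record. As a quick consistency check, the sketch below translates the first 30 and last 21 nucleotides of the CDS and prints the result for comparison with the start and end of the protein. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order
# with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 21 nucleotides of the CDS, copied from the record.
cds_start = "ATGGCTCCACGGAGATTATCCACTATGATA"
cds_end = "GAAAAGCTGAAGAAGTACTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end; '*' is the stop codon
```

Passing the full CDS (with line breaks removed) to `translate` and dropping the trailing `*` allows the same comparison over the whole protein.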