CuGenDBv2

PI0021082 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0021082
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr01:7350174..7355535
RNA-Seq Expression: PI0021082
Synteny: PI0021082
Gene Ontology terms:
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA
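
The genome location field of this record follows a `chromosome:start..end` pattern. A minimal sketch of splitting such a string and computing the span length is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("chr01:7350174..7355535")
    print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.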


Homology
GenBank top hits
KAA0044242.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e value: 9.9e-66; % identity: 60.63)
Query:  EAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLC
        EAMLLKLK+AI LFEWCSG KVNWDKS LSGVN+G +EL  MA KL CK E+LP LYLGL LGGYPRQK+FW+P+IDRVHKKLDRWK FNISR GRQTLC
Subjt:  EAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLC

Query:  TSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDA
         S+LA+ PTYYLSIFAIP+ V SAL+KLMRN                                        LLAKWGWRY+KED  L  +VIRS +G   
Subjt:  TSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDA

Query:  FDWHTLNKSGNSLKSPWISIS
        F  HT  KSGN L+SPW S S
Subjt:  FDWHTLNKSGNSLKSPWISIS

TYJ97221.1 putative LRR receptor-like serine/threonine-protein kinase [Cucumis melo var. makuwa] (e value: 8.3e-89; % identity: 55.26)
Query:  MLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTS
        MLLKLK+AI+LFE CS QKVNW+KSALSGV++  + L + A+++ CK E LP++YL L LGGYP+ + FWQPVIDR+HKKLDRWK FNI R GRQ LC +
Subjt:  MLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTS

Query:  VLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFD
        VL SL TYYLS     K+  S LEK+ RNFFWEG+SGSK+NHLV W  V+ S    GLGLG +K  N+ALLAKWGWR+  ED +  R++I SIHG++ FD
Subjt:  VLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFD

Query:  WHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAM
        W TL K GNSL+SPW SIS+ WR VE LA  KLG                          +IA+LP G VA HWD  T SWS+ F R LK+ +I +F++ 
Subjt:  WHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAM

Query:  ILLL
        + LL
Subjt:  ILLL

TYK06564.1 hypothetical protein E5676_scaffold453G00250 [Cucumis melo var. makuwa] (e value: 1.5e-61; % identity: 54.73)
Query:  YYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKS
        YY+S+FA+P  V+S+L++L+RN FWEG+SGS+INHLV W +VT       LGL G+ N+NT LLAKWGW +++E+ +L R+V+RSI+G+++  WHT+ K 
Subjt:  YYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKS

Query:  GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAMILLLSSK
        GNSLKSPWI IS   R++E L S+KLGNG R AFWSD WV+ +   T F  LF+++++P+GSVAA WD  T SWSI+F RL KEE+I +FQ ++ LLS++
Subjt:  GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAMILLLSSK

Query:  K
        +
Subjt:  K

TYK14440.1 uncharacterized protein E5676_scaffold186G00990 [Cucumis melo var. makuwa] (e value: 3.9e-62; % identity: 65.91)
Query:  AIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKS
        AIP+ V  ALEKL+RNFFWEGNSGSKINH V+W KVT S  DG LGLGGI+N++ ALLAKWGWRY+KE+ AL R+V+RSIHGR+ FDW T +KS NSL+S
Subjt:  AIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKS

Query:  PWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLL
        PW+ ISR W +VEILA +KLG GRR  FW+D W    PL T F   F+I++LP  SVA HWD  T SWSIVF RLL
Subjt:  PWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLL

XP_038880332.1 uncharacterized protein LOC120071973 [Benincasa hispida] (e value: 4.6e-87; % identity: 55.73)
Query:  TQSYCKFKEAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNIS
        T  +CK+ + M+  L+  I++FEWCS QKVNW+KSA+ G+N+   ++  +A +L CK + LPL+YLGL LGGYP+   FWQPVID++  KLD+W+RFN+S
Subjt:  TQSYCKFKEAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNIS

Query:  RVGRQTLCTSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVI
        R G+ TLC SV ++LPTYYLS+F +P+ V+  +E+ M+NFFWEG+ G KINHLV W  VT ++ DGGLGLGG++ +N A LAKWGWR +  +  L  +V+
Subjt:  RVGRQTLCTSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVI

Query:  RSIHGRDAFDWHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPL
        +SIH RD FDWHT  K   +L+S WISISR+W +VE LA YKLGNG R AF SDPW D +PL
Subjt:  RSIHGRDAFDWHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPL

TrEMBL top hits
A0A540M4H0 zf-RVT domain-containing protein (e value: 2.1e-61; % identity: 42.91)
Query:  GQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTSVLASLPTYYLSIFAIP
        G K+N  K  L+G+N   E+L+R+A+  GC+  + P+ YLGL LGG PR   FW PV++++ K+L  WK+  +SR GR TL  SVL SLP YY+S+F IP
Subjt:  GQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTSVLASLPTYYLSIFAIP

Query:  KIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKSPWI
          VI  LEKLM+ F WEG    K NHLV W  V  S+ +GGLG+G ++N+N ALLAKW WR+ KE  +L  +VIRS +G     W+       S +SPW 
Subjt:  KIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKSPWI

Query:  SISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTK--SWSIVFCRLLKEEKIPD
         IS   +       +++GNG R  FW D W++  PL   FP LF ++ + + ++++  D  T   SW+  F R L E +I +
Subjt:  SISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTK--SWSIVFCRLLKEEKIPD

A0A5A7TLN3 Transposon Tf2-9 polyprotein (e value: 4.8e-66; % identity: 60.63)
Query:  EAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLC
        EAMLLKLK+AI LFEWCSG KVNWDKS LSGVN+G +EL  MA KL CK E+LP LYLGL LGGYPRQK+FW+P+IDRVHKKLDRWK FNISR GRQTLC
Subjt:  EAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLC

Query:  TSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDA
         S+LA+ PTYYLSIFAIP+ V SAL+KLMRN                                        LLAKWGWRY+KED  L  +VIRS +G   
Subjt:  TSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDA

Query:  FDWHTLNKSGNSLKSPWISIS
        F  HT  KSGN L+SPW S S
Subjt:  FDWHTLNKSGNSLKSPWISIS

A0A5D3BF26 Putative LRR receptor-like serine/threonine-protein kinase (e value: 4.0e-89; % identity: 55.26)
Query:  MLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTS
        MLLKLK+AI+LFE CS QKVNW+KSALSGV++  + L + A+++ CK E LP++YL L LGGYP+ + FWQPVIDR+HKKLDRWK FNI R GRQ LC +
Subjt:  MLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRVHKKLDRWKRFNISRVGRQTLCTS

Query:  VLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFD
        VL SL TYYLS     K+  S LEK+ RNFFWEG+SGSK+NHLV W  V+ S    GLGLG +K  N+ALLAKWGWR+  ED +  R++I SIHG++ FD
Subjt:  VLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFD

Query:  WHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAM
        W TL K GNSL+SPW SIS+ WR VE LA  KLG                          +IA+LP G VA HWD  T SWS+ F R LK+ +I +F++ 
Subjt:  WHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAM

Query:  ILLL
        + LL
Subjt:  ILLL

A0A5D3C5F2 Uncharacterized protein (e value: 7.1e-62; % identity: 54.73)
Query:  YYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKS
        YY+S+FA+P  V+S+L++L+RN FWEG+SGS+INHLV W +VT       LGL G+ N+NT LLAKWGW +++E+ +L R+V+RSI+G+++  WHT+ K 
Subjt:  YYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKS

Query:  GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAMILLLSSK
        GNSLKSPWI IS   R++E L S+KLGNG R AFWSD WV+ +   T F  LF+++++P+GSVAA WD  T SWSI+F RL KEE+I +FQ ++ LLS++
Subjt:  GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDFQAMILLLSSK

Query:  K
        +
Subjt:  K

A0A5D3CSP2 Uncharacterized protein (e value: 1.9e-62; % identity: 65.91)
Query:  AIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKS
        AIP+ V  ALEKL+RNFFWEGNSGSKINH V+W KVT S  DG LGLGGI+N++ ALLAKWGWRY+KE+ AL R+V+RSIHGR+ FDW T +KS NSL+S
Subjt:  AIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWHTLNKSGNSLKS

Query:  PWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLL
        PW+ ISR W +VEILA +KLG GRR  FW+D W    PL T F   F+I++LP  SVA HWD  T SWSIVF RLL
Subjt:  PWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLL

SwissProt top hits
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 2.6e-24; % identity: 33.52)
Query:  VIDRVHKKLDRWKRFNISRVGRQTLCTSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLA
        +++RV  ++  W+   +S  GR TL  +VL+S+P + +S   +P+ +++ L++L R F W   +  K  HLV W+KV   + +GGLG+   K+ N AL++
Subjt:  VIDRVHKKLDRWKRFNISRVGRQTLCTSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLA

Query:  KWGWRYVKEDRALCREVI-RSIHGRDAFDWHTLNKSGNSLKSPWISISRTWRR-VEILASYKLGNGRRSAFWSDPWVDVSPL
        K GWR ++E  +L   V+ +  H  +  D   L   G S  S W SI+   R  V     +  G+G++  FW+D WV   PL
Subjt:  KWGWRYVKEDRALCREVI-RSIHGRDAFDWHTLNKSGNSLKSPWISISRTWRR-VEILASYKLGNGRRSAFWSDPWVDVSPL

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 6.8e-09; % identity: 28.21)
Query:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSR-SDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWH
        +LP Y +S F + K++   L   M  F+W      +    VAW K+  S+  DGGLG   +   N ALLAK  +R + +   L   ++RS +    F   
Subjt:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSR-SDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWH

Query:  TLNKS--GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWV-DVSPLNTL
        ++ +   G      W SI      +       +G+G  +  W D W+ D +PL  L
Subjt:  TLNKS--GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWV-DVSPLNTL

Arabidopsis top hits
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.7e-13; % identity: 28)
Query:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRS--IHGRDAFDW
        +LPTY ++ F +PK V   +  ++ +F+W     +K  H  AW  ++  +++GG+G   I+  N ALL K  WR +    +L  +V +S   H  D  + 
Subjt:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRS--IHGRDAFDW

Query:  HTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSP
              G+     W SI  +   +   A   +GNG     W   W+D  P
Subjt:  HTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 4.8e-10; % identity: 28.21)
Query:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSR-SDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWH
        +LP Y +S F + K++   L   M  F+W      +    VAW K+  S+  DGGLG   +   N ALLAK  +R + +   L   ++RS +    F   
Subjt:  SLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSR-SDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCREVIRSIHGRDAFDWH

Query:  TLNKS--GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWV-DVSPLNTL
        ++ +   G      W SI      +       +G+G  +  W D W+ D +PL  L
Subjt:  TLNKS--GNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWV-DVSPLNTL


Sequences
CDS sequence
ATGGGCACGTCGAAGGCTTTACTGTTGGAAGGAACAAGATCCACATCCCACTTCTCCAATATGCTGACGACACAATCTTATTGCAAGTTTAAAGAGGCAATGTTACTAAA
ACTAAAGGATGCCATCCAACTGTTCGAATGGTGCTCGGGTCAAAAAGTCAACTGGGACAAATCTGCACTTAGTGGTGTAAATGTGGGGGCAGAAGAGTTGTCCCGAATGG
CAGAAAAATTAGGTTGCAAAACTGAACAGCTCCCACTATTGTATTTAGGCCTGGCGTTGGGGGGTTATCCAAGACAAAAGATGTTTTGGCAACCAGTAATTGATAGAGTA
CACAAAAAGCTGGACCGATGGAAAAGATTCAATATTTCAAGAGTCGGAAGACAAACTTTATGCACCTCGGTGCTAGCCAGCCTCCCAACTTATTATTTGTCCATTTTTGC
CATCCCCAAGATTGTAATTTCTGCTCTAGAAAAACTGATGAGAAACTTCTTTTGGGAAGGAAACTCCGGTAGCAAAATAAATCACTTAGTGGCATGGGCAAAGGTTACAC
CTTCCCGAAGTGACGGGGGGCTTGGTTTGGGAGGCATCAAAAATCAAAATACAGCTTTATTGGCTAAATGGGGATGGAGATATGTTAAGGAGGATAGAGCCTTGTGTAGA
GAAGTAATAAGGAGCATCCATGGTAGAGATGCTTTTGATTGGCACACCTTGAACAAGTCCGGTAATAGCCTTAAAAGCCCTTGGATTAGTATTTCTAGAACTTGGAGGAG
GGTGGAAATTCTAGCATCATACAAGCTTGGTAATGGAAGAAGATCGGCATTCTGGTCAGACCCTTGGGTAGATGTCTCTCCACTAAACACTTTATTTCCAAATCTGTTCA
AAATAGCTATACTTCCTCATGGCTCAGTGGCTGCCCATTGGGATATGGTCACAAAGTCTTGGTCCATTGTGTTTTGTAGGCTACTAAAAGAGGAGAAAATTCCAGATTTC
CAAGCTATGATCCTCCTTCTCTCGTCTAAGAAATGA
mRNA sequence
GACGTCAGTTTTGGGCTTTGCGGTGAGTTTTGTGGGTTTTTTCTTGAAGATCTGTACTCATTAGCTTTTGGCTTCGACTAGTTCCTATCACGTTCTGGTGTTAGATTGAA
GAGTAGCAGTGTCGTTTAGTTGCTGTCCAGCGGGTTCGATCGGGTTTTCGTCAAGATTTTGGTAATTAGTAAATGTGAGAATTATTAAGATGTGAAATAGAGTTTTAAGT
AAGGATATGCTTATGTTGGTTGGATTTCTTGAAGAGTTTCGACTAGCCATTAAGTTTCGGCTTTGATTTTGCAATACCCATTACCTTTCGTTGTTAGATTTGGAGTCTTT
GGAGATTTTAGCCGCTGTCCAACTTTAGGAAGCTCGGAATTAGGCCTTTTCGCACAACTTGGATAAGATTCTAAGGTTTATGGAGTGGATCTTGAGTACCCATCTAATTT
TAGTGATAACATTGAGTAACTCATAATGGTGGATGCTTGATTTCGAATTTAGAAGGTAGTTAGGATGTTGCCCATTGAAGTTAAGAGTACGTTGCTAAGTTCTTGGTAAG
GTTTAGAGTAACTTTATTGAGTTTTCCTAGAGTTCTTGAATACCCATCTAATTCTAGTGCTAATATTGAAGAGCCCATGATGTTGAAAGCTAGATTTCGAAGTTAGAAGG
TTGTTAGGGTGTTGTCTTGTCACACTCCGCCCCGCATGCATCCTAAGCTTCGTGATGCGGTACGCGATGCGATCTTAGCAGCCCTGATGTATCAAGATGAGATGCCAGAA
CGCTCAACTTAAATCTTTGAATATTGACTTTATCGCCTAGTCAAACTCAAGTAACTTAACAAACTTAAAACTTAAACATTAGGCGATAGAATAAATATTCATAATGCAAC
ATAGAACAACTAATTCACTTTATCACTTGAGATTATGCAAACCAAATATACATTCACACGTTTCCTTAAATACAAGAATTCTAAAATTTAGTTTAAATCGCCTTGAGCAG
CTTCTCAACCTTCAAACGTCTAGCCAGCTCGCCTTACATTTCCCACTTTGGCCTTAATGCCTGGTAACTGGGGGAAGGGAAACTTAAACGATAAGTCAACTACTTAGTGA
GTGATAGCCTTTGAAAACCTTTTTGGGCACACAATACGTAAACCATTTTCATAAAACAGTCTTCAATGCCTCATTTGATAAATCATAAGCAACAGATTAGCCTTTCACCA
TTCTTTTCTCTAGGAACATGAACCATTCCCACGCAGATTTTAGACCCGGTCTTAATTGCAAATGAGGCTGTTGAAGATTATAGAGCTAAAAAGAAAAAGGGGTGGATCCT
GAAACTAGATCTTGAAAAAGCATTTGATAGAGTGGACTGGGGGTTTCTAGAAAAGGTGTTGCACTGCAAGAATTTTGGCCACAAATGGATAACATGGATGATGGGTTGTA
TAAAAAACCCTCGATACACTATATTCATTAACGGAAAACCAAGAGGTAGAATTATGGCATCTAGAGGTATTCGGCAAGGTGATCCTCTCTCACCTTTCTTATTTCTGCTG
GTCAGCGAAGTCTTAGGAGCAATTATCGACAAGATGCACTTAAATGGGCACGTCGAAGGCTTTACTGTTGGAAGGAACAAGATCCACATCCCACTTCTCCAATATGCTGA
CGACACAATCTTATTGCAAGTTTAAAGAGGCAATGTTACTAAAACTAAAGGATGCCATCCAACTGTTCGAATGGTGCTCGGGTCAAAAAGTCAACTGGGACAAATCTGCA
CTTAGTGGTGTAAATGTGGGGGCAGAAGAGTTGTCCCGAATGGCAGAAAAATTAGGTTGCAAAACTGAACAGCTCCCACTATTGTATTTAGGCCTGGCGTTGGGGGGTTA
TCCAAGACAAAAGATGTTTTGGCAACCAGTAATTGATAGAGTACACAAAAAGCTGGACCGATGGAAAAGATTCAATATTTCAAGAGTCGGAAGACAAACTTTATGCACCT
CGGTGCTAGCCAGCCTCCCAACTTATTATTTGTCCATTTTTGCCATCCCCAAGATTGTAATTTCTGCTCTAGAAAAACTGATGAGAAACTTCTTTTGGGAAGGAAACTCC
GGTAGCAAAATAAATCACTTAGTGGCATGGGCAAAGGTTACACCTTCCCGAAGTGACGGGGGGCTTGGTTTGGGAGGCATCAAAAATCAAAATACAGCTTTATTGGCTAA
ATGGGGATGGAGATATGTTAAGGAGGATAGAGCCTTGTGTAGAGAAGTAATAAGGAGCATCCATGGTAGAGATGCTTTTGATTGGCACACCTTGAACAAGTCCGGTAATA
GCCTTAAAAGCCCTTGGATTAGTATTTCTAGAACTTGGAGGAGGGTGGAAATTCTAGCATCATACAAGCTTGGTAATGGAAGAAGATCGGCATTCTGGTCAGACCCTTGG
GTAGATGTCTCTCCACTAAACACTTTATTTCCAAATCTGTTCAAAATAGCTATACTTCCTCATGGCTCAGTGGCTGCCCATTGGGATATGGTCACAAAGTCTTGGTCCAT
TGTGTTTTGTAGGCTACTAAAAGAGGAGAAAATTCCAGATTTCCAAGCTATGATCCTCCTTCTCTCGTCTAAGAAATGAACAGAATTGGATGACAACAGAGTATGGTCTT
TAGAATCTTCGGGCAGATTTTCAAAATCTCCTTGCCCCATCTTCCCCATTGGATAAGGCAACATATAAAGCTCTATGGAAGACTAGCAGCCCTAGGAGAGTCAACATCCT
AATTTGGATTATGGCTTTCGGTCAGCTAAATTGTGCCTTAGCCATGCAAAGAAAAATTCCCAACAAGTGCTTACTGCCTTCGGTTTGCCCCCTTTGCTTGAAAGATAATG
AAAGTTTGCAGCATCTATTTATTTTCTGTCCTTATGTCTCACATTGTTGGCAAAGTATTCTTGCAAATTTTACAGTAGATTGGGCTTTTGACGGTTCCCTTAGTTCAAAT
GTTCAACAATTATTGAAGGGCCCCATCCTACCAAAGAAGCCAAGGCTAATTTGGGCAAATATGTCAAAAGCACTTTTGGCGGAAATCTGGTTTGAGCGTAACCAGTGTAT
CTTTCATGACAAGGCAAGAGTTTGGACTAGCATTATGGATACAGTCAAGAGGAATACCGCAGCTTGGTGTTCTTTGAATATGGCATTCCAAGACTATTCTATCCAAGACA
TCTGCTTAAATTGGGGGACTTTTATTCAGTTCTCATCTCAGTAAGGAATGGTTTTTTTGCACTTCAAAGAAGTTGAAGTTCAAGCTCTCACAATCGCATTTAATGGGAGT
GCAGTTCTCTTATTCAGATCAATCTTAGGGCTTAGTCTGAGGAGGCTTTTTGTCTTTCCTACACAGTTTCTGTTTTATTTGGCTTATTTCTTTGTAATCCGGCTGTACTT
TCCAAGCTTTTTGCCTAGTATTTGGTTCCGGCTTTGTTCGCACATCTTGGATATGATGAGGTCGCTAAGGGGGTGTCAACCTAGTTGAGATGCCCGGGTGCGCCTCCCGA
TCCTTTCTATCTCTCTCCCTCTTTCTCCCTTTCTTTGCTTCTTAATTATTATTATTCTCATTGTATATCTCTCTTGTACTTTGAGTTATTTATTAATATAAGAAGCATGT
CTCCTTTTCAAAAAAAAAATAGTCCTTCTGCTTTCTTGTATTTGGTTAGCTCTCCAAGTATCTTATCTTGAATGTCCAAGGAAGCAGCCATCTTTATTGTCCATGTATAC
AACCCATTGAAAATTTCATCATCAGCCTGAATATTTGGATTTGAGTCATAGAGTTTTGGGTTTAGATAATATTCCGCTGCATGTATAGGACGATGCAACTGAAGCTCCCA
TCGTCGATCAATTATTGTAAAATGTTTTTGTACTTTTCTTCCTTACCATCAAAGGATTTGGCGATGGTCTCCTTAGCTCTATCCACAGCCTTGTAAATGTTTTCCATAGG
AAGCTTCTTCTCACCATCTACCAACCTAAGTACTCGCACTAAAGGGCCAGATACTTTAAGAGCTAACACAATGGTATTCCAAAAAGTTGCGAACAAAATTGTTTGAGCAA
CTCGCTTTCCTTGTTGTTTCTTTGCTCCATTTGTTGTTTTTCCATTCGTCCAAAGTAACAATTTCCTCAAATTGTTTTTCCAACGATGTAAACTTGATAATGTCATGCAA
GCTGTAGGAAAGCGAGTCTTCGCTGGTCTAACTAATTCCTTTTGATTAGTAAACCGCCTCATCATGTTTAACAATCCTAGACGAACATAAATGAAATTGCTAACTTCCAT
GCCTCTTTTTAATGTTTTGTGAATATGTGGGATCTTGTATATATCCTCCAACATCAAATCTAATTAATGAGGGGCACATGGAGACCAAATTAATTTTGATCGTTTTGCTT
CTAACAATTTCCTTATTAACAAAGTAAAATAAGAACAATATTTAGGCTTCTTAATTGTGTAAGCATAAGACTAAATTTCATAAAGAGTTGGTACTATGCATTCTTTCTTA
CCTGCCATAACATTTGCTGAAGCACTATCAGTAACAACTTGTTTGACATTAGCTTTTCCAATGCGCTCTATGAAATTGTCGAGCGACTCAAACTTTTTCTTTCCATCCTT
CACATAAGATGAAGCAATGAATATGGTGCCTTTAGGACTATTATCTGAAAAGTTAATTAATGTCCTATGTCTTCTATCTGTCCATCCATCAGCCATAATGGTGCATCCAA
CCTTTGCCCACTTTACCTTATGGCTCTTCATCAACTCATTTGTAGATTCTAATTCCTTTTTCAAACACGACACTCTCAATTCATGATATGATGGTGGTTTCAATCTAGGA
CTGAATTGTCCTATTGCCTTAATCATAGGGGCAAAGCTATCATAAGTGCAAGTGTGTGGTGTTTGGCCATTTTTTATTACAACCTTTTTGCTTGAAATCGTTGAACTTCT
ACGTTCACATCATTATAACAGTGTCTTCCATAATTCCTTGCTCCATCCCTATTTCTTTGTCTTGTTTGCGTTTTGTTGCAGGTCATTCGTTAGGATGAATTCCCAATGTA
TACCTTCCTGTTAATTTTTCATTTTAGGTTAAACTATGCCTTTAGGTTTTTATTTTTATTGTTGGTTGCTTTTGTTTTGATGCCACTGTTTGCACTTTTATATCTCTAAA
TGGAAGTCTTCAGTTTTGAAGAAAGCAAGAAGAGAGAGAGAGAGAGAAAGAGTTTGTAGATTTGGCAAAAAAGCAGTCTTCATTGTGTGGAGAAAAGTTACTTATTGTTT
CCTCATTTAAGATGTTGCTCACAAAGATTATAAATATGTATTGTACTACTCATGCTGTTTGTTGCTCTCTTCTATGTTTGAA
Protein sequence
MGTSKALLLEGTRSTSHFSNMLTTQSYCKFKEAMLLKLKDAIQLFEWCSGQKVNWDKSALSGVNVGAEELSRMAEKLGCKTEQLPLLYLGLALGGYPRQKMFWQPVIDRV
HKKLDRWKRFNISRVGRQTLCTSVLASLPTYYLSIFAIPKIVISALEKLMRNFFWEGNSGSKINHLVAWAKVTPSRSDGGLGLGGIKNQNTALLAKWGWRYVKEDRALCR
EVIRSIHGRDAFDWHTLNKSGNSLKSPWISISRTWRRVEILASYKLGNGRRSAFWSDPWVDVSPLNTLFPNLFKIAILPHGSVAAHWDMVTKSWSIVFCRLLKEEKIPDF
QAMILLLSSKK
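
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below does this with the standard genetic code, which is assumed here because the record does not name a translation table. The function name `translate` is hypothetical, and for brevity only the first 24 and last 15 nucleotides of the CDS are used as inputs.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    cds_start = "ATGGGCACGTCGAAGGCTTTACTG"  # first 24 nt of the CDS
    cds_end = "TCGTCTAAGAAATGA"             # last 15 nt, ending in the stop codon
    print(translate(cds_start))
    print(translate(cds_end))
```

Passing the full CDS with its line breaks left in works as well, since whitespace is removed before translation.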