CuGenDBv2

PI0014429 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0014429
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr04:19555858..19558360
RNA-Seq Expression: PI0014429
Synteny: PI0014429
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
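
The genome location above is written in a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly), the span covered by the gene can be computed as below. `Location` and `parse_location` are hypothetical names used for illustration only; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr04:19555858..19558360'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr04:19555858..19558360")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 2503 bp on chr04.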


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa]; e value: 7.4e-159; % identity: 48.63
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK+RF F+  V+DEQFV+G ++DL SGV VEV CVYASN+N++RR+LW RLVEITS+WSSP VVM DFNAIRVHSEA   SP+ G+ME+F+LAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN+  L +WP++ V VLPWGIS+H PILFYP+ +   +++SFRFFNHWVED SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF
                             HI  LSEEVR  KEAMD AQRE               +A + F +++  +E   R+ S V       +W E  +   A 
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF

Query:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV
           + R  + R  +L  +DS      +G  V     +   A+       G++       +  Y  LY  + +++            S NQ AF+  RSI+
Subjt:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV

Query:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------
         NILLCQE++                  G +HG  G                  LSR   K+                 L  ADD               
Subjt:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------

Query:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF
              FGELSGL AN  KSS+FVAGV+ E A+ LA  MGFA G LPVRYLG+PLL+GR RS+DCAPLIQ ITSQIRSW+ RVLSFAGRLQLVRSVLRS 
Subjt:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF

Query:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
        QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGG  IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]; e value: 6.9e-173; % identity: 44.28
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK RF F   V+DE+FV G ++DL  GV VEV+CVYASN++ +RR LWR L EITS+WSS  VVMGDFNAIRVHSEA   SP+ G+MEEFDLAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN++ L +WP++R+ VLPWGIS+HSPILFYP+ +   R++SFRFFNHWVE+ SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRS
                             HI  LSEEV  AKEAMD AQREVE +P S   S  A +AT+ FW+A+R EEASLRQKS VRWL LGDQN+AFFHR +RS
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRS

Query:  HIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF--------------------------------------------
         + RN+LLS+VDS+G RV+SH+ + Q+AVN+F NSLGSQ +GYREL P+++++VQF                                            
Subjt:  HIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF--------------------------------------------

Query:  ---------------------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLG
                                                                                   S NQ AF+  RSI+ NILLCQE++G
Subjt:  ---------------------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLG

Query:  DYHGSSGLSRCSMKVDLQKADDS-----------------------------------------------------------------------------
         YH +SG  RC++KVDLQKA DS                                                                             
Subjt:  DYHGSSGLSRCSMKVDLQKADDS-----------------------------------------------------------------------------

Query:  ------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSD
                                                  FGE SGL AN  KSS+FV GV+ E A+ LA  +G +  + P             RS D
Subjt:  ------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSD

Query:  CAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSA
        CAPLIQ ITS+IRSW+ RVLSFAGRLQLVRSVLRS QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGGL IR+G SWN A
Subjt:  CAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSA

Query:  CILKVLWLMLTSSGSLWVAWVEAYILKGQSL
          LK+L   LT+ GSLWVAW+EAYILKG+SL
Subjt:  CILKVLWLMLTSSGSLWVAWVEAYILKGQSL

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]; e value: 6.5e-147; % identity: 45.45
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        +WKK+RF F+  VVDEQFV+G ++DL SGV VEV CVYASN+N++RR+LWRRLVEITS WSSP VVMGDFNAIRVH EA   SP+ G+ME+FDLA RD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSL-------------RVTVLPWG-ISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSD
        LVEP++Q NWFTWTSKV GSG LRRLDRILVN++ L +WP+L              V    WG     SP++      Q  +    R F           
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSL-------------RVTVLPWG-ISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSD

Query:  VVSSVGHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEG
              HI  L+EEV  AKE MDRAQREVE +P S   S   G+AT+AFW+A+R EEASLRQKS +RWLELGDQN+AFFHR +RS + RN+LLS+VD++G
Subjt:  VVSSVGHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEG

Query:  IRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF-----------------------------------------------------------
         RV+SH+ +VQ+AVN+FRNSLGSQ +GYREL+PV++++VQF                                                           
Subjt:  IRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF-----------------------------------------------------------

Query:  ------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKV
                                                                      NQ AF+  RSI+ NILLCQE++  YH +SG  RC++KV
Subjt:  ------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKV

Query:  DLQKADDSFG-----------------------ELSGLIANLDKSSMF-------------------------VAGVDIETATVLADSMGFALGTLPVRY
        DLQKA DS                          LS ++  + +S  F                            +  E A+ LA SMGF LG LPVRY
Subjt:  DLQKADDSFG-----------------------ELSGLIANLDKSSMF-------------------------VAGVDIETATVLADSMGFALGTLPVRY

Query:  LGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVD
        LG+PLL+GR RS+DCAPLIQ ITS+IRSW+  VLSFAGRLQLVRSVLRS QV+W SVF+L   VH+ V  +L SYLWRGKE GRGG KVAWV+
Subjt:  LGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVD

TYK26658.1 uncharacterized protein E5676_scaffold313G003080 [Cucumis melo var. makuwa]; e value: 3.4e-127; % identity: 43.85
Query:  LRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV-----------------------------GH
        LRRLDR+LVNE    +WP++ V VLPWGIS+HSPILFYP+ +   +++SF FFNHWVED SF +VV+ +                              H
Subjt:  LRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV-----------------------------GH

Query:  IIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHE
        I  LSEEVR+AK AMD AQREVE +P S   SH AG++T+ FW+A+R EEASLRQKS +RWL+LGDQN+AFFHR +RS + RN L S+VDS+G R     
Subjt:  IIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHE

Query:  RLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF------------------------------------------------------------------
                          +GYREL PV++++VQF                                                                  
Subjt:  RLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF------------------------------------------------------------------

Query:  -----------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKVDLQKADDS-----------------------------
                               S NQ AF+  RSI+ NILLCQE++G YH +SG  RC++KVDLQKA DS                             
Subjt:  -----------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKVDLQKADDS-----------------------------

Query:  -------------------------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGT
                                                                     FGELSGL AN  KSS+FVAGV+ E A+ LA  MGF  G 
Subjt:  -------------------------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGT

Query:  LPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVC
        L VRYLG+PLL+GR RS+D A LIQ ITS+IRSW+ RVLSFAGRLQLV SVLRSFQV+ ASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC
Subjt:  LPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVC

Query:  RPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
         P EEGGL IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  RPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]; e value: 7.4e-159; % identity: 48.63
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK+RF F+  V+DEQFV+G ++DL SGV VEV CVYASN+N++RR+LW RLVEITS+WSSP VVM DFNAIRVHSEA   SP+ G+ME+F+LAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN+  L +WP++ V VLPWGIS+H PILFYP+ +   +++SFRFFNHWVED SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF
                             HI  LSEEVR  KEAMD AQRE               +A + F +++  +E   R+ S V       +W E  +   A 
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF

Query:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV
           + R  + R  +L  +DS      +G  V     +   A+       G++       +  Y  LY  + +++            S NQ AF+  RSI+
Subjt:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV

Query:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------
         NILLCQE++                  G +HG  G                  LSR   K+                 L  ADD               
Subjt:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------

Query:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF
              FGELSGL AN  KSS+FVAGV+ E A+ LA  MGFA G LPVRYLG+PLL+GR RS+DCAPLIQ ITSQIRSW+ RVLSFAGRLQLVRSVLRS 
Subjt:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF

Query:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
        QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGG  IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPE5 Reverse transcriptase domain-containing protein; e value: 3.6e-159; % identity: 48.63
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK+RF F+  V+DEQFV+G ++DL SGV VEV CVYASN+N++RR+LW RLVEITS+WSSP VVM DFNAIRVHSEA   SP+ G+ME+F+LAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN+  L +WP++ V VLPWGIS+H PILFYP+ +   +++SFRFFNHWVED SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF
                             HI  LSEEVR  KEAMD AQRE               +A + F +++  +E   R+ S V       +W E  +   A 
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF

Query:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV
           + R  + R  +L  +DS      +G  V     +   A+       G++       +  Y  LY  + +++            S NQ AF+  RSI+
Subjt:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV

Query:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------
         NILLCQE++                  G +HG  G                  LSR   K+                 L  ADD               
Subjt:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------

Query:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF
              FGELSGL AN  KSS+FVAGV+ E A+ LA  MGFA G LPVRYLG+PLL+GR RS+DCAPLIQ ITSQIRSW+ RVLSFAGRLQLVRSVLRS 
Subjt:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF

Query:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
        QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGG  IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

A0A5A7TZS0 Reverse transcriptase domain-containing protein; e value: 3.3e-173; % identity: 44.28
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK RF F   V+DE+FV G ++DL  GV VEV+CVYASN++ +RR LWR L EITS+WSS  VVMGDFNAIRVHSEA   SP+ G+MEEFDLAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN++ L +WP++R+ VLPWGIS+HSPILFYP+ +   R++SFRFFNHWVE+ SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRS
                             HI  LSEEV  AKEAMD AQREVE +P S   S  A +AT+ FW+A+R EEASLRQKS VRWL LGDQN+AFFHR +RS
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRS

Query:  HIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF--------------------------------------------
         + RN+LLS+VDS+G RV+SH+ + Q+AVN+F NSLGSQ +GYREL P+++++VQF                                            
Subjt:  HIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF--------------------------------------------

Query:  ---------------------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLG
                                                                                   S NQ AF+  RSI+ NILLCQE++G
Subjt:  ---------------------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLG

Query:  DYHGSSGLSRCSMKVDLQKADDS-----------------------------------------------------------------------------
         YH +SG  RC++KVDLQKA DS                                                                             
Subjt:  DYHGSSGLSRCSMKVDLQKADDS-----------------------------------------------------------------------------

Query:  ------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSD
                                                  FGE SGL AN  KSS+FV GV+ E A+ LA  +G +  + P             RS D
Subjt:  ------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSD

Query:  CAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSA
        CAPLIQ ITS+IRSW+ RVLSFAGRLQLVRSVLRS QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGGL IR+G SWN A
Subjt:  CAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSA

Query:  CILKVLWLMLTSSGSLWVAWVEAYILKGQSL
          LK+L   LT+ GSLWVAW+EAYILKG+SL
Subjt:  CILKVLWLMLTSSGSLWVAWVEAYILKGQSL

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein; e value: 3.2e-147; % identity: 45.45
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        +WKK+RF F+  VVDEQFV+G ++DL SGV VEV CVYASN+N++RR+LWRRLVEITS WSSP VVMGDFNAIRVH EA   SP+ G+ME+FDLA RD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSL-------------RVTVLPWG-ISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSD
        LVEP++Q NWFTWTSKV GSG LRRLDRILVN++ L +WP+L              V    WG     SP++      Q  +    R F           
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSL-------------RVTVLPWG-ISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSD

Query:  VVSSVGHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEG
              HI  L+EEV  AKE MDRAQREVE +P S   S   G+AT+AFW+A+R EEASLRQKS +RWLELGDQN+AFFHR +RS + RN+LLS+VD++G
Subjt:  VVSSVGHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEG

Query:  IRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF-----------------------------------------------------------
         RV+SH+ +VQ+AVN+FRNSLGSQ +GYREL+PV++++VQF                                                           
Subjt:  IRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF-----------------------------------------------------------

Query:  ------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKV
                                                                      NQ AF+  RSI+ NILLCQE++  YH +SG  RC++KV
Subjt:  ------------------------------------------------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKV

Query:  DLQKADDSFG-----------------------ELSGLIANLDKSSMF-------------------------VAGVDIETATVLADSMGFALGTLPVRY
        DLQKA DS                          LS ++  + +S  F                            +  E A+ LA SMGF LG LPVRY
Subjt:  DLQKADDSFG-----------------------ELSGLIANLDKSSMF-------------------------VAGVDIETATVLADSMGFALGTLPVRY

Query:  LGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVD
        LG+PLL+GR RS+DCAPLIQ ITS+IRSW+  VLSFAGRLQLVRSVLRS QV+W SVF+L   VH+ V  +L SYLWRGKE GRGG KVAWV+
Subjt:  LGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVD

A0A5D3DT20 Reverse transcriptase domain-containing protein; e value: 1.6e-127; % identity: 43.85
Query:  LRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV-----------------------------GH
        LRRLDR+LVNE    +WP++ V VLPWGIS+HSPILFYP+ +   +++SF FFNHWVED SF +VV+ +                              H
Subjt:  LRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV-----------------------------GH

Query:  IIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHE
        I  LSEEVR+AK AMD AQREVE +P S   SH AG++T+ FW+A+R EEASLRQKS +RWL+LGDQN+AFFHR +RS + RN L S+VDS+G R     
Subjt:  IIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHE

Query:  RLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF------------------------------------------------------------------
                          +GYREL PV++++VQF                                                                  
Subjt:  RLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQF------------------------------------------------------------------

Query:  -----------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKVDLQKADDS-----------------------------
                               S NQ AF+  RSI+ NILLCQE++G YH +SG  RC++KVDLQKA DS                             
Subjt:  -----------------------SGNQLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKVDLQKADDS-----------------------------

Query:  -------------------------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGT
                                                                     FGELSGL AN  KSS+FVAGV+ E A+ LA  MGF  G 
Subjt:  -------------------------------------------------------------FGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGT

Query:  LPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVC
        L VRYLG+PLL+GR RS+D A LIQ ITS+IRSW+ RVLSFAGRLQLV SVLRSFQV+ ASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC
Subjt:  LPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVC

Query:  RPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
         P EEGGL IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  RPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

A0A5D3DXE4 Reverse transcriptase domain-containing protein; e value: 3.6e-159; % identity: 48.63
Query:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD
        MWKK+RF F+  V+DEQFV+G ++DL SGV VEV CVYASN+N++RR+LW RLVEITS+WSSP VVM DFNAIRVHSEA   SP+ G+ME+F+LAIRD+D
Subjt:  MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSD

Query:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------
        LVEP++Q NWFTWTSKV+GSG LRRLDR+LVN+  L +WP++ V VLPWGIS+H PILFYP+ +   +++SFRFFNHWVED SF +VV+ +         
Subjt:  LVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSV---------

Query:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF
                             HI  LSEEVR  KEAMD AQRE               +A + F +++  +E   R+ S V       +W E  +   A 
Subjt:  --------------------GHIIGLSEEVRSAKEAMDRAQREVEWDPGSVERSHDAGVATDAFWSAIRQEEASLRQKSWV-------RWLELGDQNSAF

Query:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV
           + R  + R  +L  +DS      +G  V     +   A+       G++       +  Y  LY  + +++            S NQ AF+  RSI+
Subjt:  FHRLIRSHIGRNNLLSIVDS------EGIRVTSHERLVQVAVNFFRNSLGSQ-------VVGYRELYPVLEEVVQ----------FSGNQLAFVVWRSIV

Query:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------
         NILLCQE++                  G +HG  G                  LSR   K+                 L  ADD               
Subjt:  HNILLCQEML------------------GDYHGSSG------------------LSRCSMKV----------------DLQKADD---------------

Query:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF
              FGELSGL AN  KSS+FVAGV+ E A+ LA  MGFA G LPVRYLG+PLL+GR RS+DCAPLIQ ITSQIRSW+ RVLSFAGRLQLVRSVLRS 
Subjt:  -----SFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSF

Query:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
        QV+WASVFVL + VH+ V  +L SYLWRGKE GRGG KVAWVDVC P EEGG  IR+G SWN A  LK+LWLMLT+SGSLWVAWVEAYILKG+SL
Subjt:  QVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e value: 2.8e-15; % identity: 34.11
Query:  VPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGG
        +P+L  R        +++ ++S++  W  + LSFAGRL L ++VL S  V   S  +L   + + +  L  ++LW      +    V W  VC P +EGG
Subjt:  VPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGG

Query:  LNIREGKSWNSACILKVLWLMLTSSGSLW
        L +R  KS N A I KV W +L    SLW
Subjt:  LNIREGKSWNSACILKVLWLMLTSSGSLW

Arabidopsis top hits (e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein; e value: 8.5e-12; % identity: 24.7
Query:  NNNMDRRVLWRRLVEITSS---WSSPSVVMGDFNAIRVHSEACSVSPVT---GDMEEFDLAIRDSDLVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQ
        N   +RR LW  +  +++S    +SP +V+GDFN I   +E  S+ P       +E+    +RDSDLV+   +   +TW++  + +  LR+LDR +VN  
Subjt:  NNNMDRRVLWRRLVEITSS---WSSPSVVMGDFNAIRVHSEACSVSPVT---GDMEEFDLAIRDSDLVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQ

Query:  GLMSWPSLRVTVLPWGISNHSPILFY----PAAEQQRRIISFRFFN-----------HWVEDTSFSDVVSSVGHIIGLSEEVRSAKEAMDR-----AQRE
         L ++P+      P   S+H+  +      P   +++    F F +            W ++ +    + S+G ++   +E + A   ++R      Q +
Subjt:  GLMSWPSLRVTVLPWGISNHSPILFY----PAAEQQRRIISFRFFN-----------HWVEDTSFSDVVSSVGHIIGLSEEVRSAKEAMDR-----AQRE

Query:  VEWDPGS--VERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGD
        +  +P        H A    + F +A+   E+  +QKS ++WL+ GD
Subjt:  VEWDPGS--VERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGD

AT1G43760.1 DNAse I-like superfamily protein; e value: 8.0e-18; % identity: 24.91
Query:  VVMGDFNAIRVHSEACSV----SPVTGDMEEFDLAIRDSDLVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFY
        +++GDF+ I   S+  SV     P+ G +EEF   +RDSDLV+   +   +TW++    +  +R+LDR + N     S+PS        G+S+HSP +  
Subjt:  VVMGDFNAIRVHSEACSV----SPVTGDMEEFDLAIRDSDLVEPAIQENWFTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFY

Query:  PAAEQQRRIISFRFFNHWVEDTSF--------SDVVSSVGHIIGLSEEVRSAK----------------------EAMDRAQREVEWDPGS--VERSHDA
             +R    FR+F+      +F         + +    H+  L E +++AK                      ++++  Q ++  +P        H A
Subjt:  PAAEQQRRIISFRFFNHWVEDTSF--------SDVVSSVGHIIGLSEEVRSAK----------------------EAMDRAQREVEWDPGS--VERSHDA

Query:  GVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGS
            + F +A+   E+  RQKS ++WL+ GD N+ FFH++I ++  +N +  +   + +RV +  ++ ++ V ++ + LGS
Subjt:  GVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value: 4.5e-29; % identity: 41.86
Query:  VAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLM
        +AGV       +  S  FA G LPVRYLG+PLL+ +  +SD  PL++ I  +I  W+ R LSFAGRLQL+ SV+ S   FW S F L S     + S+  
Subjt:  VAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQHITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLM

Query:  SYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL
        S+LW G E+    AKVAW DVC P +EGGL IR  K  N      +      S  +   +W+   ILK ++L
Subjt:  SYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKGQSL

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value: 7.0e-06; % identity: 41.18
Query:  VDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKG
        + VC P  EGGL +R    WN+   LK++W + +  GSLWV W   + L+G
Subjt:  VDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKG
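
The % identity values listed for each hit can be related to the alignment blocks. In BLAST-style output the middle line repeats a letter where the two residues are identical, shows `+` for a similar residue, and is blank otherwise. The following is a minimal sketch, assuming identity is the number of letters on the middle line divided by the number of alignment columns (gap characters included); `percent_identity` is a hypothetical helper, not a CuGenDB or BLAST function.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from one BLAST-style alignment block.

    Letters on the middle line mark identical residues; '+' marks a
    similar residue and a space marks a mismatch or gap.
    """
    columns = len(query)  # aligned columns, gap characters ('-') included
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / columns


# Query and middle line copied from the AT3G25720.1 hit.
query = "VDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLWVAWVEAYILKG"
midline = "+ VC P  EGGL +R    WN+   LK++W + +  GSLWV W   + L+G"
print(f"{percent_identity(query, midline):.2f}")
```

For this hit the middle line carries 21 letters over 51 columns, which gives 41.18 and agrees with the value listed for AT3G25720.1. For hits whose alignment is split over several blocks, the blocks would need to be concatenated before counting.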


Sequences
CDS sequence
ATGTGGAAGAAGGATAGGTTTGATTTTACTCCTAGCGTGGTGGATGAGCAGTTTGTTTCAGGTGTGATTTCTGATTTGCATTCTGGTGTGACTGTGGAGGTGTTATGTGT
TTATGCCTCTAATAATAATATGGACCGTCGTGTGCTTTGGCGTCGGTTAGTTGAGATCACTTCTAGTTGGTCGAGTCCAAGTGTGGTTATGGGAGATTTTAATGCAATTC
GAGTGCACTCTGAAGCTTGTAGTGTGAGTCCGGTTACTGGTGATATGGAGGAATTTGATCTTGCCATTCGCGATTCTGACTTGGTTGAGCCAGCTATTCAGGAAAACTGG
TTCACTTGGACTAGCAAGGTGCGAGGTTCAGGTCGGTTGCGTCGGCTTGATCGTATTTTAGTTAATGAGCAGGGGTTAATGTCTTGGCCTAGTCTGCGTGTTACTGTTTT
GCCTTGGGGAATTTCTAACCATTCCCCTATATTATTCTATCCTGCTGCTGAGCAGCAGAGGCGTATTATTTCGTTTCGTTTCTTTAATCATTGGGTGGAGGATACGTCTT
TTAGTGATGTGGTGTCTTCGGTTGGACATATCATAGGCCTTAGTGAGGAGGTGCGCTCGGCAAAAGAGGCTATGGATAGGGCCCAACGAGAGGTTGAGTGGGATCCTGGG
TCTGTGGAGAGGAGTCATGATGCGGGTGTTGCAACTGATGCCTTTTGGTCAGCTATCCGTCAGGAAGAAGCCTCTCTCCGTCAGAAATCATGGGTTAGGTGGTTGGAGCT
TGGGGATCAGAATTCTGCCTTTTTTCATCGCTTGATTCGTTCCCATATTGGTCGTAATAATTTGCTTTCTATTGTTGATTCTGAGGGTATTCGGGTGACATCCCATGAGA
GGTTGGTGCAGGTGGCTGTCAATTTTTTTCGTAATAGTCTTGGGTCCCAGGTGGTTGGCTATAGGGAGCTCTATCCTGTGTTGGAGGAGGTGGTTCAGTTTAGTGGTAAT
CAGTTAGCATTTGTTGTTTGGAGGAGTATTGTTCATAACATTCTGCTTTGTCAGGAGATGTTGGGGGATTATCATGGTTCTTCGGGTCTGTCTAGGTGTTCGATGAAGGT
GGATCTTCAGAAGGCTGATGATTCGTTCGGGGAGTTATCGGGGTTAATTGCTAATCTGGATAAGAGCTCTATGTTTGTGGCGGGGGTTGACATTGAGACTGCTACTGTGT
TGGCTGATAGTATGGGGTTTGCGCTGGGTACTTTGCCTGTTCGTTACCTGGGTGTTCCCTTACTCTCTGGTCGTCAGCGTTCTTCGGATTGTGCTCCGCTCATACAGCAT
ATTACTAGTCAGATTCGGTCTTGGTCAGTTAGAGTTTTATCCTTCGCTGGTAGACTTCAGCTTGTGCGTTCAGTGTTGCGGAGTTTTCAGGTCTTTTGGGCCAGTGTGTT
TGTTTTATCGAGTGAGGTGCATCATGCGGTTGGTAGTCTTTTGATGTCGTATTTGTGGAGGGGTAAGGAGGTTGGGAGAGGGGGTGCAAAGGTTGCTTGGGTGGATGTGT
GTCGTCCGTTAGAGGAAGGGGGTCTAAATATTCGGGAGGGAAAGTCGTGGAACAGTGCGTGTATCTTGAAGGTTCTTTGGCTGATGTTGACTAGTTCTGGGTCTCTTTGG
GTTGCTTGGGTGGAGGCTTATATTCTGAAGGGGCAGTCGTTGAAGGTTCTTTAG
mRNA sequence
ATGTGGAAGAAGGATAGGTTTGATTTTACTCCTAGCGTGGTGGATGAGCAGTTTGTTTCAGGTGTGATTTCTGATTTGCATTCTGGTGTGACTGTGGAGGTGTTATGTGT
TTATGCCTCTAATAATAATATGGACCGTCGTGTGCTTTGGCGTCGGTTAGTTGAGATCACTTCTAGTTGGTCGAGTCCAAGTGTGGTTATGGGAGATTTTAATGCAATTC
GAGTGCACTCTGAAGCTTGTAGTGTGAGTCCGGTTACTGGTGATATGGAGGAATTTGATCTTGCCATTCGCGATTCTGACTTGGTTGAGCCAGCTATTCAGGAAAACTGG
TTCACTTGGACTAGCAAGGTGCGAGGTTCAGGTCGGTTGCGTCGGCTTGATCGTATTTTAGTTAATGAGCAGGGGTTAATGTCTTGGCCTAGTCTGCGTGTTACTGTTTT
GCCTTGGGGAATTTCTAACCATTCCCCTATATTATTCTATCCTGCTGCTGAGCAGCAGAGGCGTATTATTTCGTTTCGTTTCTTTAATCATTGGGTGGAGGATACGTCTT
TTAGTGATGTGGTGTCTTCGGTTGGACATATCATAGGCCTTAGTGAGGAGGTGCGCTCGGCAAAAGAGGCTATGGATAGGGCCCAACGAGAGGTTGAGTGGGATCCTGGG
TCTGTGGAGAGGAGTCATGATGCGGGTGTTGCAACTGATGCCTTTTGGTCAGCTATCCGTCAGGAAGAAGCCTCTCTCCGTCAGAAATCATGGGTTAGGTGGTTGGAGCT
TGGGGATCAGAATTCTGCCTTTTTTCATCGCTTGATTCGTTCCCATATTGGTCGTAATAATTTGCTTTCTATTGTTGATTCTGAGGGTATTCGGGTGACATCCCATGAGA
GGTTGGTGCAGGTGGCTGTCAATTTTTTTCGTAATAGTCTTGGGTCCCAGGTGGTTGGCTATAGGGAGCTCTATCCTGTGTTGGAGGAGGTGGTTCAGTTTAGTGGTAAT
CAGTTAGCATTTGTTGTTTGGAGGAGTATTGTTCATAACATTCTGCTTTGTCAGGAGATGTTGGGGGATTATCATGGTTCTTCGGGTCTGTCTAGGTGTTCGATGAAGGT
GGATCTTCAGAAGGCTGATGATTCGTTCGGGGAGTTATCGGGGTTAATTGCTAATCTGGATAAGAGCTCTATGTTTGTGGCGGGGGTTGACATTGAGACTGCTACTGTGT
TGGCTGATAGTATGGGGTTTGCGCTGGGTACTTTGCCTGTTCGTTACCTGGGTGTTCCCTTACTCTCTGGTCGTCAGCGTTCTTCGGATTGTGCTCCGCTCATACAGCAT
ATTACTAGTCAGATTCGGTCTTGGTCAGTTAGAGTTTTATCCTTCGCTGGTAGACTTCAGCTTGTGCGTTCAGTGTTGCGGAGTTTTCAGGTCTTTTGGGCCAGTGTGTT
TGTTTTATCGAGTGAGGTGCATCATGCGGTTGGTAGTCTTTTGATGTCGTATTTGTGGAGGGGTAAGGAGGTTGGGAGAGGGGGTGCAAAGGTTGCTTGGGTGGATGTGT
GTCGTCCGTTAGAGGAAGGGGGTCTAAATATTCGGGAGGGAAAGTCGTGGAACAGTGCGTGTATCTTGAAGGTTCTTTGGCTGATGTTGACTAGTTCTGGGTCTCTTTGG
GTTGCTTGGGTGGAGGCTTATATTCTGAAGGGGCAGTCGTTGAAGGTTCTTTAG
Protein sequence
MWKKDRFDFTPSVVDEQFVSGVISDLHSGVTVEVLCVYASNNNMDRRVLWRRLVEITSSWSSPSVVMGDFNAIRVHSEACSVSPVTGDMEEFDLAIRDSDLVEPAIQENW
FTWTSKVRGSGRLRRLDRILVNEQGLMSWPSLRVTVLPWGISNHSPILFYPAAEQQRRIISFRFFNHWVEDTSFSDVVSSVGHIIGLSEEVRSAKEAMDRAQREVEWDPG
SVERSHDAGVATDAFWSAIRQEEASLRQKSWVRWLELGDQNSAFFHRLIRSHIGRNNLLSIVDSEGIRVTSHERLVQVAVNFFRNSLGSQVVGYRELYPVLEEVVQFSGN
QLAFVVWRSIVHNILLCQEMLGDYHGSSGLSRCSMKVDLQKADDSFGELSGLIANLDKSSMFVAGVDIETATVLADSMGFALGTLPVRYLGVPLLSGRQRSSDCAPLIQH
ITSQIRSWSVRVLSFAGRLQLVRSVLRSFQVFWASVFVLSSEVHHAVGSLLMSYLWRGKEVGRGGAKVAWVDVCRPLEEGGLNIREGKSWNSACILKVLWLMLTSSGSLW
VAWVEAYILKGQSLKVL
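
The protein sequence corresponds to the CDS read in frame from its first base. As a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the snippet below translates the opening and closing codons of the CDS. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTGGAAGAAGGATAGG"))  # first 18 nt of the CDS
print(translate("AAGGTTCTTTAG"))        # last 12 nt of the CDS
```

The two calls print `MWKKDR` and `KVL`, which match the start and end of the protein sequence listed above. Passing the full CDS (with or without line breaks) should reproduce the complete protein under the same assumption.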