CuGenDBv2

PI0005251 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0005251
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr03:13098261..13103341
RNA-Seq Expression: PI0005251
Synteny: PI0005251
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa] (e value: 1.2e-61; % identity: 31.46)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    SN++ R+LW  LVEITS WS  GVVM DFNAI VH EA       G++E+F+LA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+T P++ V+VLPW IS+H  ILFYPS +    + SFRFFN W+ED SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------
         KEAMD AQ E+                                                          + PG                          
Subjt:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------

Query:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----
                 +V   C++ +  D       +F S+ +         +E   L Q   ++C+                    + +  GD  S F  V     
Subjt:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----

Query:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------
              +    F+FHHRCEKV LTHLTF DDLMIF AA+   I F+R+ L++FGELSGL AN  KSS+FV GV+ E AS LA  MG+   +LP       
Subjt:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------

Query:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------
                                      S  G ++  R VLRS   L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                  
Subjt:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------

Query:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR
          G++    W++         +   S S+W   +   +  G   W +  RV RS   W  +RG        W   R ++       R Q V PCLS+ D 
Subjt:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR

Query:  --WV------------WEAIRPRSVRVCWAG
          WV            WEAIRPR  RV W G
Subjt:  --WV------------WEAIRPRSVRVCWAG

KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa] (e value: 8.8e-04; % identity: 69.23)
Query:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL
        KIVV+P EEVI Q +++WENSLVGQL+DA L +A+IQRL
Subjt:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL

KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] (e value: 3.3e-91; % identity: 32.3)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSKWLATSPSLRVSVLPWR
        +YA++    SN++ R+LWR LVEITSGWS  GVVMGDF AI VH EA       G++E FDLA+R+ADL+EP VQ N       WL   P+L V+VL W 
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSKWLATSPSLRVSVLPWR

Query:  ISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSS--GLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIR
        IS+HS ILFYPS +Q R +ASFRFFN W+ED SF+ VV    G  E V      M      ++R    + R    GLAT+AFW A RLE+ASL QKS IR
Subjt:  ISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSS--GLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIR

Query:  WLELGDTNSAFF---------------------------------------------------------------------------------IVRFVP-
        WL+LGD N  FF                                                                                 +V  +P 
Subjt:  WLELGDTNSAFF---------------------------------------------------------------------------------IVRFVP-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSM
                                                  GF+FH  CEKV LT LTF DDLMIF  A+   + FVRETL++FGEL GL ANLGK S+
Subjt:  ------------------------------------------GFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSM

Query:  FVVGVDGEAASVLADSMGYPACSLP-----------GYSPYSLVGCVRRIVLRSPT--------------------VLLVFRASVFVLPASVHHEVDRIL
        FV G   E AS LA +MG+   +LP              P      ++RI  R  +                     L V+ ASVFVLP+ VH++VD+IL
Subjt:  FVVGVDGEAASVLADSMGYPACSLP-----------GYSPYSLVGCVRRIVLRSPT--------------------VLLVFRASVFVLPASVHHEVDRIL

Query:  MSYLWRGR-----RLGDGVQRLHGWSVIALESM-------------SIWRL-------------------------VMVGD-VEC-----GWTLGCRVAR
         SYLWR R      +   ++ L  W + +L S+             S+W +                         + VGD   C      W  G  +  
Subjt:  MSYLWRGR-----RLGDGVQRLHGWSVIALESM-------------SIWRL-------------------------VMVGD-VEC-----GWTLGCRVAR

Query:  SLGRWVRGWFMMRRAVR-------------------------RGQGVRPCLSIGDRWVW--------------EAIRPRSVRVCWAGLLLGGANISRHSF
         +G  V      RR  R                         R Q VRPCLS+ DRWVW              + IRPR  RV W GLL GG N+ +HSF
Subjt:  SLGRWVRGWFMMRRAVR-------------------------RGQGVRPCLSIGDRWVW--------------EAIRPRSVRVCWAGLLLGGANISRHSF

Query:  CAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
        CAWL I      RDRL RWD S+P S +LC+GGVESRDHLFFS PFG D+W+R+++ MASSHR+  W VELSW
Subjt:  CAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] (e value: 4.7e-98; % identity: 31.82)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    S+ + R LWR L EITS WS  GVVMGDFNAI VH EA       G++EEFDLA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+  P++R++VLPW IS+HS ILFYPS +    + SFRFFN W+E+ SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIRWLELGDTNSAFF--------------------------------------
        AKEAMD AQ EVER+P S   S  A LAT+ FW+A RLEEASLRQKS +RWL LGD N+AFF                                      
Subjt:  AKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIRWLELGDTNSAFF--------------------------------------

Query:  --------------------IVR-----------------------------------------------------------------------------
                            IV+                                                                             
Subjt:  --------------------IVR-----------------------------------------------------------------------------

Query:  -------------------------------------------------FVPG-----------------------------------------------
                                                         F+PG                                               
Subjt:  -------------------------------------------------FVPG-----------------------------------------------

Query:  ---------------------------------------------------------------------FKFHHRCEKVCLTHLTFVDDLMIFYAAETTF
                                                                             F+FHHRCEKV LTHLTF DDLMIF AA+   
Subjt:  ---------------------------------------------------------------------FKFHHRCEKVCLTHLTFVDDLMIFYAAETTF

Query:  IGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMG---YPACSL--PGYSPYSL--VGC---VRRIV--LRSPTV----------------
        I F+RE L++FGE SGL AN  KSS+FVVGV+ E AS LA  +G      CSL  P  S + L  + C   ++RI   +RS T                 
Subjt:  IGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMG---YPACSL--PGYSPYSL--VGC---VRRIV--LRSPTV----------------

Query:  --LLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-------------------GVQRLHGWSV-----IALESM-SIWRLVMVGDVECG---WTLGC
          L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                    G++    W++     I L ++ S+W   M   +  G   W +  
Subjt:  --LLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-------------------GVQRLHGWSV-----IALESM-SIWRLVMVGDVECG---WTLGC

Query:  RVARS----------------LGR-------------------------WVRGWFMMRRAVRRGQGVRPCLSIGDRWV--------------WEAIRPRS
        RV RS                +G                          W R    +     R Q V PCLS+ D WV              WEAI PR 
Subjt:  RVARS----------------LGR-------------------------WVRGWFMMRRAVRRGQGVRPCLSIGDRWV--------------WEAIRPRS

Query:  VRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
         RV W GLL GG NI +HSFCAWLAI      RDRL RWD SIP SC+LC+GGVESRDHLFFS PFG D+W+R+ + M SSHR+G W VELSW
Subjt:  VRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] (e value: 1.4e-78; % identity: 33)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    SN++ R+LW  LVEITS WS  GVVM DFNAI VH EA       G++E+F+LA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+T P++ V+VLPW IS+H  ILFYPS +    + SFRFFN W+ED SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------
         KEAMD AQ E+                                                          + PG                          
Subjt:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------

Query:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----
                 +V   C++ +  D       +F S+ +         +E   L Q   ++C+                    + +  GD  S F  V     
Subjt:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----

Query:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------
              +    F+FHHRCEKV LTHLTF DDLMIF AA+   I F+R+ L++FGELSGL AN  KSS+FV GV+ E AS LA  MG+   +LP       
Subjt:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------

Query:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------
                                      S  G ++  R VLRS   L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                  
Subjt:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------

Query:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR
          G++    W++         +   S S+W   +   +  G   W +  RV RS   W  +RG        W   R ++       R Q V PCLS+ D 
Subjt:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR

Query:  WVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
         VW  +R R           GG +IS     AW AIR R  R         +L +GGVESRDHLFFS  FG D+W+R++R MASS+R+G W VELSW
Subjt:  WVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] (e value: 8.8e-04; % identity: 69.23)
Query:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL
        KIVV+P EEVI Q +++WENSLVGQL+DA L +A+IQRL
Subjt:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] (e value: 1.0e-76; % identity: 28.79)
Query:  LVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK------------------WLATSPSLRVSVLPWRIS
        + EI++GW  SG+VMGDFN I +H EA       GD+EE D+ +READLVEP VQ NW TWT+K                   L+  P++RV+VLPW IS
Subjt:  LVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK------------------WLATSPSLRVSVLPWRIS

Query:  NHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSSG-------------------------------------------------------------
        NHS IL YPS ++ + + SFRFFN W++++SF DVVSS                                                              
Subjt:  NHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSSG-------------------------------------------------------------

Query:  -----LSEEVCSAKEAMDR--------------------------AQHEVERDPGSVERSCLAGLATDAF--------WSAT------------------
             ++ ++  A+++++                            + EV R   S++     G   D +        W+ T                  
Subjt:  -----LSEEVCSAKEAMDR--------------------------AQHEVERDPGSVERSCLAGLATDAF--------WSAT------------------

Query:  ----RLEEASLRQKSC------------------------------------------------------------------------IRW---------
            RLE+   R  SC                                                                        + W         
Subjt:  ----RLEEASLRQKSC------------------------------------------------------------------------IRW---------

Query:  -------------------------------------LELGDTNSAFFIVRFV-----------PGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGF
                                             L  GD  S F  V  +             F+FH  CEKV LTHLTF DDLMIF AA+   + F
Subjt:  -------------------------------------LELGDTNSAFFIVRFV-----------PGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGF

Query:  VRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP--------------------------------GYSPYSLVGCVR--RIVLRS
        ++ET+++FGELSGL ANL KSS+F+VGV+   AS LA +MG+    LP                                     S  G ++  R VLRS
Subjt:  VRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP--------------------------------GYSPYSLVGCVR--RIVLRS

Query:  PTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGDGVQR--------------------------------------------------------LH
           L V+ ASVF+LP  VH +VD+IL SYLWRG+  G G  +                                                        + 
Subjt:  PTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGDGVQR--------------------------------------------------------LH

Query:  GW-----------SVIALESMSIWRLVMVGDVECGWTLGCRVARSLGRWV-------RGWFMMRRAVRRG------------------QGVRPCLSIGDR
        GW           S I+++ +  WR  M G +ECGW  G  + +  G  V       R   ++   VR G                  QGVRP  S+ DR
Subjt:  GW-----------SVIALESMSIWRLVMVGDVECGWTLGCRVARSLGRWV-------RGWFMMRRAVRRG------------------QGVRPCLSIGDR

Query:  WV--------------WEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIR
        WV              WE IRP S RV W+GLL    NI +HSF AWLAI      RDRLS+WDRSIP SC+LC G  ESRDHLFFS PFG++IW+RI+ 
Subjt:  WV--------------WEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIR

Query:  AMASSHRVGTWEVELSW
         M+SSHR+G W VELSW
Subjt:  AMASSHRVGTWEVELSW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPE5 Reverse transcriptase domain-containing protein (e value: 4.3e-04; % identity: 69.23)
Query:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL
        KIVV+P EEVI Q +++WENSLVGQL+DA L +A+IQRL
Subjt:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL

A0A5A7SPE5 Reverse transcriptase domain-containing protein (e value: 7.7e-62; % identity: 29.91)
Query:  VVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK------------------WLATSPSLRVSVLPWRISNHSLILFYPSVE
        VVM DFNAI  H EA       G++E+FD+A+R+ADLVEP VQGNW TWTSK                  WL+  P++ V+VLPW IS+HS IL YPS +
Subjt:  VVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK------------------WLATSPSLRVSVLPWRISNHSLILFYPSVE

Query:  QQRCIASFRFFNLWLEDTSFSDVVS---SGLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEE-----------------------
        Q   + SFR FN W++D SF          LSEEV  AKEAMD AQ EVER+P S   S  A LAT+ FW+A RLE+                       
Subjt:  QQRCIASFRFFNLWLEDTSFSDVVS---SGLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEE-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------ASLRQKSCIR----------------WLE--LGDTNSAFFIVR-----------FVPGFKFHH---RCE-KVCL------
                               R  SC                  WL   +    SAF + R            V G+  +    RC  KV L      
Subjt:  --------------------ASLRQKSCIR----------------WLE--LGDTNSAFFIVR-----------FVPGFKFHH---RCE-KVCL------

Query:  --------------THLTFVD-----------DLMIFYAAETTFIG----------------FVRETL------------------RQFGELSGLVANLG
                      T L FV             +MI  + E  F G                 V E L                  ++FGELSGL AN  
Subjt:  --------------THLTFVD-----------DLMIFYAAETTFIG----------------FVRETL------------------RQFGELSGLVANLG

Query:  KSSMFVVGVDGEAASVLADSMGYPACSLP-GYSPYSLV-------GCV-------RRIVLRSPTV----------------LLVFRASVFVLPASVHHEV
        KSS+F+ GV+ E AS LA  MG+   +LP  Y    L+        CV        RI  RS  V                L V+ A VFVLPA VH+E 
Subjt:  KSSMFVVGVDGEAASVLADSMGYPACSLP-GYSPYSLV-------GCV-------RRIVLRSPTV----------------LLVFRASVFVLPASVHHEV

Query:  DRI-------------------------------LMSYLWRGRRLGDGVQRL-HGWSVIALESMSIWRLVMVGDVECGWTLGCRVARSLGRWVRGWFMMR
          +                               + +Y+ +GR L D   R+   W + A+      +L     ++ G    CRV      W+  W + R
Subjt:  DRI-------------------------------LMSYLWRGRRLGDGVQRL-HGWSVIALESMSIWRLVMVGDVECGWTLGCRVARSLGRWVRGWFMMR

Query:  RAVRRGQGVRPCLSIGDR--------------WV-----------WEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLS------RWDRSIPQSC
         A+    G R       R              W+           WEAIRPR  RV W GLL GG NI +HSFCAWLAI+DRL       RWD S+P SC
Subjt:  RAVRRGQGVRPCLSIGDR--------------WV-----------WEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLS------RWDRSIPQSC

Query:  LLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSWCIRYTVRARV
        +LCEGG+ESRDHLFFS PFG D+W+R++R MASSHR+G W VELSW     +R  V
Subjt:  LLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSWCIRYTVRARV

A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein (e value: 1.6e-91; % identity: 32.3)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSKWLATSPSLRVSVLPWR
        +YA++    SN++ R+LWR LVEITSGWS  GVVMGDF AI VH EA       G++E FDLA+R+ADL+EP VQ N       WL   P+L V+VL W 
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSKWLATSPSLRVSVLPWR

Query:  ISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSS--GLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIR
        IS+HS ILFYPS +Q R +ASFRFFN W+ED SF+ VV    G  E V      M      ++R    + R    GLAT+AFW A RLE+ASL QKS IR
Subjt:  ISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSS--GLSEEVCSAKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIR

Query:  WLELGDTNSAFF---------------------------------------------------------------------------------IVRFVP-
        WL+LGD N  FF                                                                                 +V  +P 
Subjt:  WLELGDTNSAFF---------------------------------------------------------------------------------IVRFVP-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSM
                                                  GF+FH  CEKV LT LTF DDLMIF  A+   + FVRETL++FGEL GL ANLGK S+
Subjt:  ------------------------------------------GFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSM

Query:  FVVGVDGEAASVLADSMGYPACSLP-----------GYSPYSLVGCVRRIVLRSPT--------------------VLLVFRASVFVLPASVHHEVDRIL
        FV G   E AS LA +MG+   +LP              P      ++RI  R  +                     L V+ ASVFVLP+ VH++VD+IL
Subjt:  FVVGVDGEAASVLADSMGYPACSLP-----------GYSPYSLVGCVRRIVLRSPT--------------------VLLVFRASVFVLPASVHHEVDRIL

Query:  MSYLWRGR-----RLGDGVQRLHGWSVIALESM-------------SIWRL-------------------------VMVGD-VEC-----GWTLGCRVAR
         SYLWR R      +   ++ L  W + +L S+             S+W +                         + VGD   C      W  G  +  
Subjt:  MSYLWRGR-----RLGDGVQRLHGWSVIALESM-------------SIWRL-------------------------VMVGD-VEC-----GWTLGCRVAR

Query:  SLGRWVRGWFMMRRAVR-------------------------RGQGVRPCLSIGDRWVW--------------EAIRPRSVRVCWAGLLLGGANISRHSF
         +G  V      RR  R                         R Q VRPCLS+ DRWVW              + IRPR  RV W GLL GG N+ +HSF
Subjt:  SLGRWVRGWFMMRRAVR-------------------------RGQGVRPCLSIGDRWVW--------------EAIRPRSVRVCWAGLLLGGANISRHSF

Query:  CAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
        CAWL I      RDRL RWD S+P S +LC+GGVESRDHLFFS PFG D+W+R+++ MASSHR+  W VELSW
Subjt:  CAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

A0A5A7TZS0 Reverse transcriptase domain-containing protein (e value: 2.3e-98; % identity: 31.82)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    S+ + R LWR L EITS WS  GVVMGDFNAI VH EA       G++EEFDLA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+  P++R++VLPW IS+HS ILFYPS +    + SFRFFN W+E+ SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIRWLELGDTNSAFF--------------------------------------
        AKEAMD AQ EVER+P S   S  A LAT+ FW+A RLEEASLRQKS +RWL LGD N+AFF                                      
Subjt:  AKEAMDRAQHEVERDPGSVERSCLAGLATDAFWSATRLEEASLRQKSCIRWLELGDTNSAFF--------------------------------------

Query:  --------------------IVR-----------------------------------------------------------------------------
                            IV+                                                                             
Subjt:  --------------------IVR-----------------------------------------------------------------------------

Query:  -------------------------------------------------FVPG-----------------------------------------------
                                                         F+PG                                               
Subjt:  -------------------------------------------------FVPG-----------------------------------------------

Query:  ---------------------------------------------------------------------FKFHHRCEKVCLTHLTFVDDLMIFYAAETTF
                                                                             F+FHHRCEKV LTHLTF DDLMIF AA+   
Subjt:  ---------------------------------------------------------------------FKFHHRCEKVCLTHLTFVDDLMIFYAAETTF

Query:  IGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMG---YPACSL--PGYSPYSL--VGC---VRRIV--LRSPTV----------------
        I F+RE L++FGE SGL AN  KSS+FVVGV+ E AS LA  +G      CSL  P  S + L  + C   ++RI   +RS T                 
Subjt:  IGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMG---YPACSL--PGYSPYSL--VGC---VRRIV--LRSPTV----------------

Query:  --LLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-------------------GVQRLHGWSV-----IALESM-SIWRLVMVGDVECG---WTLGC
          L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                    G++    W++     I L ++ S+W   M   +  G   W +  
Subjt:  --LLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-------------------GVQRLHGWSV-----IALESM-SIWRLVMVGDVECG---WTLGC

Query:  RVARS----------------LGR-------------------------WVRGWFMMRRAVRRGQGVRPCLSIGDRWV--------------WEAIRPRS
        RV RS                +G                          W R    +     R Q V PCLS+ D WV              WEAI PR 
Subjt:  RVARS----------------LGR-------------------------WVRGWFMMRRAVRRGQGVRPCLSIGDRWV--------------WEAIRPRS

Query:  VRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
         RV W GLL GG NI +HSFCAWLAI      RDRL RWD SIP SC+LC+GGVESRDHLFFS PFG D+W+R+ + M SSHR+G W VELSW
Subjt:  VRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

A0A5D3DXE4 Reverse transcriptase domain-containing protein (e value: 6.9e-79; % identity: 33)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    SN++ R+LW  LVEITS WS  GVVM DFNAI VH EA       G++E+F+LA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+T P++ V+VLPW IS+H  ILFYPS +    + SFRFFN W+ED SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------
         KEAMD AQ E+                                                          + PG                          
Subjt:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------

Query:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----
                 +V   C++ +  D       +F S+ +         +E   L Q   ++C+                    + +  GD  S F  V     
Subjt:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----

Query:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------
              +    F+FHHRCEKV LTHLTF DDLMIF AA+   I F+R+ L++FGELSGL AN  KSS+FV GV+ E AS LA  MG+   +LP       
Subjt:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------

Query:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------
                                      S  G ++  R VLRS   L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                  
Subjt:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------

Query:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR
          G++    W++         +   S S+W   +   +  G   W +  RV RS   W  +RG        W   R ++       R Q V PCLS+ D 
Subjt:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR

Query:  WVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW
         VW  +R R           GG +IS     AW AIR R  R         +L +GGVESRDHLFFS  FG D+W+R++R MASS+R+G W VELSW
Subjt:  WVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVGTWEVELSW

A0A5D3DXE4 Reverse transcriptase domain-containing protein (e value: 4.3e-04; % identity: 69.23)
Query:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL
        KIVV+P EEVI Q +++WENSLVGQL+DA L +A+IQRL
Subjt:  KIVVVPSEEVIAQSVRMWENSLVGQLVDALLLFAIIQRL

A0A5D3DXE4 Reverse transcriptase domain-containing protein (e value: 5.9e-62; % identity: 31.46)
Query:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------
        +YA++    SN++ R+LW  LVEITS WS  GVVM DFNAI VH EA       G++E+F+LA+R+ADLVEP VQGNW TWTSK                
Subjt:  IYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEA-------GDVEEFDLAVREADLVEPFVQGNWLTWTSK----------------

Query:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS
          WL+T P++ V+VLPW IS+H  ILFYPS +    + SFRFFN W+ED SF +VV+                                    LSEEV  
Subjt:  --WLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVS----------------------------------SGLSEEVCS

Query:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------
         KEAMD AQ E+                                                          + PG                          
Subjt:  AKEAMDRAQHEVE---------------------------------------------------------RDPG--------------------------

Query:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----
                 +V   C++ +  D       +F S+ +         +E   L Q   ++C+                    + +  GD  S F  V     
Subjt:  ---------SVERSCLAGLATD-------AFWSATR---------LEEASLRQ---KSCI--------------------RWLELGDTNSAFFIV-----

Query:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------
              +    F+FHHRCEKV LTHLTF DDLMIF AA+   I F+R+ L++FGELSGL AN  KSS+FV GV+ E AS LA  MG+   +LP       
Subjt:  ------RFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVGVDGEAASVLADSMGYPACSLP-------

Query:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------
                                      S  G ++  R VLRS   L V+ ASVFVLPA VH+EVD+IL SYLWRG+  G                  
Subjt:  -------------------------GYSPYSLVGCVR--RIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGD-----------------

Query:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR
          G++    W++         +   S S+W   +   +  G   W +  RV RS   W  +RG        W   R ++       R Q V PCLS+ D 
Subjt:  --GVQRLHGWSV---------IALESMSIWRLVMVGDVECG---WTLGCRVARSLGRW--VRG--------WFMMRRAV------RRGQGVRPCLSIGDR

Query:  --WV------------WEAIRPRSVRVCWAG
          WV            WEAIRPR  RV W G
Subjt:  --WV------------WEAIRPRSVRVCWAG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 2.6e-09; % identity: 27.73)
Query:  CLSIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW-TRIIRAMASSH
        C        W A+RPRS  V WA  +       +H+F  W      L  + RL+ W   +  +C LC   +E RDHLF +  F   +W T  +R    + 
Subjt:  CLSIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW-TRIIRAMASSH

Query:  RVGTWEVELSWCIRYTVRA
            W   + W ++   R+
Subjt:  RVGTWEVELSWCIRYTVRA

AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.7e-08; % identity: 38.96)
Query:  AIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW
        A+ P++  V W   +    ++ +H+F  W      L  RDRL  W  SIP  CLLC    ESR HLFF  PF   +W
Subjt:  AIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.7e-08; % identity: 31.18)
Query:  GQGVRPCLSIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIR------DRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDI
        G   +PC +  +   W A R   ++V W   +       ++S  AW+AI+      DR+  W+     SC+LC   VE+RDHLFF+ P+  ++
Subjt:  GQGVRPCLSIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIR------DRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 8.8e-10 | 37.97
Query:  WEAIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW
        W A+ P+S  V W   +    ++ +H+F  W      L  RDRL  W  SIP  CLLC    +SR HLFF   F   +W
Subjt:  WEAIRPRSVRVCWAGLLLGGANISRHSFCAW------LAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIW

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 3.9e-10 | 32.76
Query:  SIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVG
        S   R  WE IR  S  V WA ++     I R S   W++       RDRL  W  +IP S +LC  G E+  HLFF   F   IW         S   G
Subjt:  SIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAI------RDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYDIWTRIIRAMASSHRVG

Query:  TWEVELSWCIRYTVRA
              SW ++  +R+
Subjt:  TWEVELSWCIRYTVRA


Sequences
CDS sequence
ATGGCGATTAGACATCCGAACTGGCGTGGAATTAAACCGAAGGGCATCAGAAAAAGGATTATTTTCGGTGAAATGATGGCGGAGGCTACAGTTGGGTTGCGTTTGTCTGA
AGGGGTGGCGGCGCGAGGTGGTGGTGCTGTTGGTAACCTAGAACCGACCGATTCAGGACCTCGGATGGGTTCTCCGGTTGAAGGAATGGGACCGAATTCATCTGGGCTTA
AAGAGACGTCAATTGGTGGGCTTTTGAATGGGCCATCAAATGAGAAGGTGCTGGGCCAAGTAGTGGATGAGGTTTCTTCAACTGGGCTTGGGAAGTCAGATGGCGGGTCG
AAGTCTCCGCTGGGTGAAAGTAAAGGTGGGCGGTACTCATCTAGGCTTCAACTTGGGCCAAAGGCGGTTGTTTTGGGTAAAATTGAGAAACAGGTGCATGGTATTCCTAA
GGCAATTGGGTCGCCCACAATGAATTTGAAATTGCCTAAGCAAGATACTGAGTTGGAGTGGCTTCGGGAATGCAAATTCTTGGGCGTCTCTGTTTGGTTCTTCATCAGGA
AATGCTCTCCCTTATACTGCTCCCTCATCGGTTGTTTAAAAATTGTGGTTGTTCCATCTGAGGAGGTTATTGCTCAAAGTGTTCGGATGTGGGAAAACTCGTTAGTGGGC
CAACTTGTGGATGCTTTGTTACTGTTTGCCATTATTCAACGACTTATTGAGAAAATTTTGGGGAAAATCGAAATGCCAACCATTACGTTATTGAGAATGGGCTCATTTGC
TGTCAATTTCGTCGTCCCAAATCGGTTGAGTGGATTCTTTACCGTGGGCCATGGCATCTTGGAGGGAATCTATGCTACTTCGCAAATGGGTCCCAGTAATGTGGATTGTC
GTGTGCTTTGGCGTTGGTTAGTTGAAATCACTTCTGGATGGTCGATTTCGGGTGTTGTCATGGGTGATTTTAATGCTATTCATGTGCACTTTGAAGCTGGTGATGTGGAA
GAGTTTGATCTTGCTGTTCGTGAGGCTGATTTGGTGGAGCCGTTTGTTCAGGGTAATTGGTTAACTTGGACGAGTAAGTGGTTAGCTACCTCGCCAAGTTTGCGTGTTTC
GGTTTTGCCTTGGAGGATTTCTAATCATTCACTTATTCTATTCTATCCTAGTGTTGAGCAACAGAGGTGTATTGCGTCGTTTCGTTTCTTTAATCTTTGGCTAGAGGATA
CTTCATTCAGTGATGTGGTGTCTTCGGGCCTTAGTGAGGAGGTGTGCTCTGCTAAAGAGGCTATGGATAGGGCCCAGCATGAGGTTGAAAGGGATCCTGGGTCTGTTGAG
AGGAGTTGTCTTGCTGGGCTAGCGACTGATGCTTTTTGGTCGGCTACCCGTCTAGAAGAAGCCTCTCTTCGTCAAAAATCTTGTATTAGATGGTTGGAGCTTGGTGATAC
GAATTCTGCTTTTTTCATCGTTCGGTTCGTTCCCGGTTTTAAGTTTCATCATCGTTGTGAGAAGGTTTGTTTGACCCATCTGACCTTCGTTGATGATCTTATGATTTTCT
ATGCTGCTGAAACCACTTTCATTGGCTTTGTGCGTGAGACTCTCCGTCAGTTTGGGGAGTTATCGGGGTTGGTTGCTAATCTGGGGAAGAGCTCCATGTTTGTGGTGGGG
GTTGACGGAGAGGCTGCTTCTGTGTTGGCGGATAGTATGGGGTACCCTGCTTGTTCGTTACCTGGGTATTCCCCTTACTCTCTAGTAGGCTGCGTTCGTCGAATTGTGCT
CCGCTCACCCACCGTATTACTAGTCTTTCGGGCTAGTGTGTTTGTTTTACCGGCGAGTGTGCATCATGAGGTTGATAGGATTTTGATGTCGTATCTCTGGAGGGGTAGGA
GATTGGGAGATGGGGTGCAAAGGTTGCATGGGTGGAGCGTGATAGCCTTAGAGAGCATGTCCATATGGAGGTTGGTGATGGTTGGAGATGTCGAGTGTGGTTGGACCCTT
GGTTGTAGGGTGGCTCGATCATTAGGTAGATGGGTGAGAGGGTGGTTTATGATGCGACGAGCAGTAAGGAGGGGTCAGGGTGTCCGTCCGTGCCTGAGTATTGGGGATCG
GTGGGTGTGGGAGGCTATTCGTCCTAGGAGTGTCAGGGTTTGTTGGGCTGGTTTGTTGTTGGGTGGGGCTAATATTTCCAGGCATTCTTTTTGTGCTTGGTTGGCCATCA
GGGATAGGTTGAGTAGGTGGGATAGGTCGATTCCTCAGTCGTGTCTTTTGTGTGAGGGGGGTGTTGAGTCTCGGGACCATCTATTTTTTTCTTATCCTTTTGGGTATGAT
ATTTGGACTCGTATCATTCGAGCTATGGCTTCCTCTCACAGGGTTGGGACGTGGGAAGTTGAGTTGTCTTGGTGTATCCGGTATACGGTTCGTGCTCGTGTTGTATCTTG
GCGGGAGGATGTTCAGGATCTTATTTAG
mRNA sequence
ATGGCGATTAGACATCCGAACTGGCGTGGAATTAAACCGAAGGGCATCAGAAAAAGGATTATTTTCGGTGAAATGATGGCGGAGGCTACAGTTGGGTTGCGTTTGTCTGA
AGGGGTGGCGGCGCGAGGTGGTGGTGCTGTTGGTAACCTAGAACCGACCGATTCAGGACCTCGGATGGGTTCTCCGGTTGAAGGAATGGGACCGAATTCATCTGGGCTTA
AAGAGACGTCAATTGGTGGGCTTTTGAATGGGCCATCAAATGAGAAGGTGCTGGGCCAAGTAGTGGATGAGGTTTCTTCAACTGGGCTTGGGAAGTCAGATGGCGGGTCG
AAGTCTCCGCTGGGTGAAAGTAAAGGTGGGCGGTACTCATCTAGGCTTCAACTTGGGCCAAAGGCGGTTGTTTTGGGTAAAATTGAGAAACAGGTGCATGGTATTCCTAA
GGCAATTGGGTCGCCCACAATGAATTTGAAATTGCCTAAGCAAGATACTGAGTTGGAGTGGCTTCGGGAATGCAAATTCTTGGGCGTCTCTGTTTGGTTCTTCATCAGGA
AATGCTCTCCCTTATACTGCTCCCTCATCGGTTGTTTAAAAATTGTGGTTGTTCCATCTGAGGAGGTTATTGCTCAAAGTGTTCGGATGTGGGAAAACTCGTTAGTGGGC
CAACTTGTGGATGCTTTGTTACTGTTTGCCATTATTCAACGACTTATTGAGAAAATTTTGGGGAAAATCGAAATGCCAACCATTACGTTATTGAGAATGGGCTCATTTGC
TGTCAATTTCGTCGTCCCAAATCGGTTGAGTGGATTCTTTACCGTGGGCCATGGCATCTTGGAGGGAATCTATGCTACTTCGCAAATGGGTCCCAGTAATGTGGATTGTC
GTGTGCTTTGGCGTTGGTTAGTTGAAATCACTTCTGGATGGTCGATTTCGGGTGTTGTCATGGGTGATTTTAATGCTATTCATGTGCACTTTGAAGCTGGTGATGTGGAA
GAGTTTGATCTTGCTGTTCGTGAGGCTGATTTGGTGGAGCCGTTTGTTCAGGGTAATTGGTTAACTTGGACGAGTAAGTGGTTAGCTACCTCGCCAAGTTTGCGTGTTTC
GGTTTTGCCTTGGAGGATTTCTAATCATTCACTTATTCTATTCTATCCTAGTGTTGAGCAACAGAGGTGTATTGCGTCGTTTCGTTTCTTTAATCTTTGGCTAGAGGATA
CTTCATTCAGTGATGTGGTGTCTTCGGGCCTTAGTGAGGAGGTGTGCTCTGCTAAAGAGGCTATGGATAGGGCCCAGCATGAGGTTGAAAGGGATCCTGGGTCTGTTGAG
AGGAGTTGTCTTGCTGGGCTAGCGACTGATGCTTTTTGGTCGGCTACCCGTCTAGAAGAAGCCTCTCTTCGTCAAAAATCTTGTATTAGATGGTTGGAGCTTGGTGATAC
GAATTCTGCTTTTTTCATCGTTCGGTTCGTTCCCGGTTTTAAGTTTCATCATCGTTGTGAGAAGGTTTGTTTGACCCATCTGACCTTCGTTGATGATCTTATGATTTTCT
ATGCTGCTGAAACCACTTTCATTGGCTTTGTGCGTGAGACTCTCCGTCAGTTTGGGGAGTTATCGGGGTTGGTTGCTAATCTGGGGAAGAGCTCCATGTTTGTGGTGGGG
GTTGACGGAGAGGCTGCTTCTGTGTTGGCGGATAGTATGGGGTACCCTGCTTGTTCGTTACCTGGGTATTCCCCTTACTCTCTAGTAGGCTGCGTTCGTCGAATTGTGCT
CCGCTCACCCACCGTATTACTAGTCTTTCGGGCTAGTGTGTTTGTTTTACCGGCGAGTGTGCATCATGAGGTTGATAGGATTTTGATGTCGTATCTCTGGAGGGGTAGGA
GATTGGGAGATGGGGTGCAAAGGTTGCATGGGTGGAGCGTGATAGCCTTAGAGAGCATGTCCATATGGAGGTTGGTGATGGTTGGAGATGTCGAGTGTGGTTGGACCCTT
GGTTGTAGGGTGGCTCGATCATTAGGTAGATGGGTGAGAGGGTGGTTTATGATGCGACGAGCAGTAAGGAGGGGTCAGGGTGTCCGTCCGTGCCTGAGTATTGGGGATCG
GTGGGTGTGGGAGGCTATTCGTCCTAGGAGTGTCAGGGTTTGTTGGGCTGGTTTGTTGTTGGGTGGGGCTAATATTTCCAGGCATTCTTTTTGTGCTTGGTTGGCCATCA
GGGATAGGTTGAGTAGGTGGGATAGGTCGATTCCTCAGTCGTGTCTTTTGTGTGAGGGGGGTGTTGAGTCTCGGGACCATCTATTTTTTTCTTATCCTTTTGGGTATGAT
ATTTGGACTCGTATCATTCGAGCTATGGCTTCCTCTCACAGGGTTGGGACGTGGGAAGTTGAGTTGTCTTGGTGTATCCGGTATACGGTTCGTGCTCGTGTTGTATCTTG
GCGGGAGGATGTTCAGGATCTTATTTAG
Protein sequence
MAIRHPNWRGIKPKGIRKRIIFGEMMAEATVGLRLSEGVAARGGGAVGNLEPTDSGPRMGSPVEGMGPNSSGLKETSIGGLLNGPSNEKVLGQVVDEVSSTGLGKSDGGS
KSPLGESKGGRYSSRLQLGPKAVVLGKIEKQVHGIPKAIGSPTMNLKLPKQDTELEWLRECKFLGVSVWFFIRKCSPLYCSLIGCLKIVVVPSEEVIAQSVRMWENSLVG
QLVDALLLFAIIQRLIEKILGKIEMPTITLLRMGSFAVNFVVPNRLSGFFTVGHGILEGIYATSQMGPSNVDCRVLWRWLVEITSGWSISGVVMGDFNAIHVHFEAGDVE
EFDLAVREADLVEPFVQGNWLTWTSKWLATSPSLRVSVLPWRISNHSLILFYPSVEQQRCIASFRFFNLWLEDTSFSDVVSSGLSEEVCSAKEAMDRAQHEVERDPGSVE
RSCLAGLATDAFWSATRLEEASLRQKSCIRWLELGDTNSAFFIVRFVPGFKFHHRCEKVCLTHLTFVDDLMIFYAAETTFIGFVRETLRQFGELSGLVANLGKSSMFVVG
VDGEAASVLADSMGYPACSLPGYSPYSLVGCVRRIVLRSPTVLLVFRASVFVLPASVHHEVDRILMSYLWRGRRLGDGVQRLHGWSVIALESMSIWRLVMVGDVECGWTL
GCRVARSLGRWVRGWFMMRRAVRRGQGVRPCLSIGDRWVWEAIRPRSVRVCWAGLLLGGANISRHSFCAWLAIRDRLSRWDRSIPQSCLLCEGGVESRDHLFFSYPFGYD
IWTRIIRAMASSHRVGTWEVELSWCIRYTVRARVVSWREDVQDLI
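
The protein sequence above can be checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, and the function name `translate` is a hypothetical helper introduced here for illustration. It is demonstrated on the first and last codons of the CDS listed above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. A codon containing any
    character other than A, C, G or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS and the final four (including the stop).
print(translate("ATGGCGATTAGACATCCGAACTGG"))
print(translate("GATCTTATTTAG"))
```

Pasting the complete CDS into `translate` and comparing the result with the protein sequence gives a quick consistency check of the record.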