CuGenDBv2

Lag0018493 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018493
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:28291429..28295633
RNA-Seq Expression: Lag0018493
Synteny: Lag0018493
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR002156 - Ribonuclease H domain
    IPR043502 - DNA/RNA polymerase superfamily
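The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the span can be parsed and its length computed as follows. The names `Location` and `parse_location` are hypothetical and not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr5:28291429..28295633")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4205 bp on chr5.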


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica] (e value: 9.9e-126; % identity: 34.68)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T  M+  L Q ++REE+E  + + +PTKAPG DG P LF+QK
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------
        YW +VG +   +CL ILN + SV+++NHT I LIPKV  P                                                            
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------

Query:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA
             +KG+    ALKLD++K YDRVEW F+RA+M +LGF   W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Subjt:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA

Query:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
        +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  KS +    N        +  +L +       +YLGLP+   
Subjt:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
        + R + F+ + D++W                                    +PKGL   ++ + ARFWW  +  K+ IHW KW+ LCK +  GGL F DL
Subjt:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        E F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---
         +  +                         V D    S QW++P L+ +  D++V  I+++P AS    D  IWH+++ G Y VKSGY+L   ++ +   
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---

Query:  ESPSDTDISSKWWKRLWS
        E  +  D++SK+WK++W+
Subjt:  ESPSDTDISSKWWKRLWS
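
In the alignments on this page, the unlabeled middle line marks each aligned column. Assuming the standard BLAST-style midline convention applies here (a letter for an identical residue, `+` for a conservative substitution, a space otherwise), the following minimal sketch counts identities and positives for the final block of the alignment above. The helper name `midline_stats` is hypothetical, and a per-block count is not the same as the overall % identity reported for the hit, which covers the full alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    # Pad the midline: trailing spaces are often lost in copied text.
    midline = midline.ljust(len(query))
    # Letters on the midline mark identical residues.
    identities = sum(1 for c in midline if c.isalpha())
    # '+' marks a conservative (positive-scoring) substitution.
    plus = midline.count("+")
    return identities, identities + plus, len(query)


# Query and midline strings copied from the last block above.
query = "ESPSDTDISSKWWKRLWS"
midline = "E  +  D++SK+WK++W+"
ident, pos, cols = midline_stats(query, midline)
print(f"{ident}/{cols} identical, {pos}/{cols} positive")
```

For this short block the sketch reports 7 of 18 columns identical and 14 of 18 positive.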

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 3.7e-141; % identity: 32.78)
Query:  TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEK
        + AL+ WG  ++ V WD  +Q+K +   I  AY++P  +DFTI+H LE        N ++G+     L L E     ++ E + K  +     +A D E 
Subjt:  TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEK

Query:  VLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR--------
        ++  IP ++T +++  L   Y++EE+E  +R+ +PTKA GPDGFP LFYQ YW +VGP+T++ CL  LN    +K WN T I LIPK+  PR        
Subjt:  VLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR--------

Query:  --------------------------------------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN
                                                                + G  G AALKLD+SK +DRVEW+++  +M ++GF   WI  I 
Subjt:  --------------------------------------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN

Query:  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYE
         CIST  FSI +NG   G F PSRG+RQGDPLSPYLFLLC+EGLSA++     +G L GI    ++  I HL FADDSLIFL++   E   L+ ++  Y 
Subjt:  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYE

Query:  CASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKE
         ASGQ IN  KS + F  NV  + Q YL  IL +K   + G+YLGLPS F R R +                                  +++HW KW  
Subjt:  CASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKE

Query:  LCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP
        +C P+E GGLNF DLE F QA++AK  WR L  P   V++VLK +YF  + +L++   S SSYFWKGF+WG DLL  G+R ++GNG++++   DPW+PRP
Subjt:  LCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP

Query:  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVK
         TFK L                      RF    L       + TV   I     WD+  + H   +ED   I+ +P +S    D W+WH+DK G Y V+
Subjt:  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVK

Query:  SGYKLGMYQRIEESPSDTDISSKWW-------------------------------------------------------------KRLWSTL-GYSDMV
        SGYKL M+ +   + + T+     W                                                             +++W TL  +   +
Subjt:  SGYKLGMYQRIEESPSDTDISSKWW-------------------------------------------------------------KRLWSTL-GYSDMV

Query:  RAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSN--DRIFQTRDMVSQ--MISGGEDFILNVD
         AE  ++  + W  +   +   DL    +  W +WNDRN + H + + PV  +C+W   +L  +  A  SN   R       V Q    S      LN D
Subjt:  RAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSN--DRIFQTRDMVSQ--MISGGEDFILNVD

Query:  AVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIE
        A     +T+ G   ++   S  LVA     +P P SPL  E+  ILEGL   ++   + +E
Subjt:  AVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIE

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e value: 9.5e-129; % identity: 31.71)
Query:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKY
        + K+NTI G+    G W    + I +A  SYF +I++S  PS    E+V + IP KVT +M+ +L +++++EEV   +++ +P KAPGPDG   +F+QKY
Subjt:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKY

Query:  WDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRR-----------------------------------------------------------
        W +VG    D  L +LN    + + N TNI LIPK  NP+R                                                           
Subjt:  WDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRR-----------------------------------------------------------

Query:  -----KGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK
              GK+G+ A+KLD+SK +DRVEW FI  VME++GF   W +L+  CI++ S+SI+ING A G+ YPSRGLRQGDPLSP LFLLC+EGLSA++  A 
Subjt:  -----KGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK

Query:  RNGLM-GIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHR
        RN L+ GI++    PK+ HLFFADDS++F KA+ EE   L+ I+  YE ASGQ IN DKS I F  N   +T+  + +IL          YLGLPS   R
Subjt:  RNGLM-GIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHR

Query:  SRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLE
        S+S+ F  + ++V      W                              LP+GL D +  +   FWWG  + + ++ W  WK +C  +  GGL F +L+
Subjt:  SRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLE

Query:  IFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELT
         F  AMLAKQAWR+L  P S V RVLK RYFP+ D+L +++ S+ SY W+     L++++ G R ++GNG  + +  D W+P P T+KV+  +I + E  
Subjt:  IFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELT

Query:  VVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGYKLGMYQRIEESP-
        +V                    + DP+         +  W +  L+ + L  +V+ I+R+P S   P D+ IW  +K G + VKS Y +  +  I+ +  
Subjt:  VVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGYKLGMYQRIEESP-

Query:  ---SDTDISSKWWKRLW----------------------------------STLGYSDMV------------RAEFIM---------------NIQDRWI
           S+ D     WK+LW                                  ST     +V             A  +                +  D  +
Subjt:  ---SDTDISSKWWKRLW----------------------------------STLGYSDMV------------RAEFIM---------------NIQDRWI

Query:  HICNTVSILDLERICLGSWALWNDRNCVFHK-RPIPPVGVRCDWTL--DYLSEYQSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVWSKHTTTSGVRV
        H+C++ +   LE   + SWA+W +RN + H   P+ P  V   W +  + L +++ A  S D I      +         F +NVD   S     S + V
Subjt:  HICNTVSILDLERICLGSWALWNDRNCVFHK-RPIPPVGVRCDWTL--DYLSEYQSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVWSKHTTTSGVRV

Query:  VLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKI
        ++   +G +VA L K +P   +   VE +A+ +G+ +   L++S++
Subjt:  VLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKI

XP_024156142.1 uncharacterized protein LOC112164137 [Rosa chinensis] (e value: 4.1e-124; % identity: 33.13)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        N+ KRN ISG+   DG+W TE   +      YF  +F++ SP   D        P+ VT +M+  L +++  EE+   + + +P KAPGPDGF  +FYQ+
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVP--------------------------------------------------------------
        YW +VG   +      +N +  +++ N T + LIPKV                                                               
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVP--------------------------------------------------------------

Query:  --NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T
            R  G  GY ALKLD+SK YDRVEW FI AVM  +GF   WIN I  C++T S+S ++NGE +GH  P+RGLRQGD +SPYLFLLC+EGLS +L   
Subjt:  --NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T

Query:  AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
         +++ L GIA+   +P I HLFFADDS +F+KA  EE   +K I+  YE ASGQ +N  KS+I F +NV    Q  L+ +  ++  D    YLGLP+   
Subjt:  AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
         S+ + F+ I+++       W                              LPK L   +    A FWWG S+  ++IHW  W ++C P+E+GGL F ++
Subjt:  RSRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        E F QA+LAKQ WR+L  P+S + + LK +YFP++D + + V    SY W+  + G  LL+ G+R ++G G  + V  DPWIPRPY+F+           
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGM
        + V++G+                    +LTV D I P S  W +  L+ +   ++V  I ++P S   P DR IWHFDK G Y VKSGY        L  
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGM

Query:  YQRIEESPSDTDISSKWWKRLWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQS
        +     S  D D+    W+R+W        VR      + NI    +++   V+ LD ERIC            VF +  +    + C W    L     
Subjt:  YQRIEESPSDTDISSKWWKRLWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQS

Query:  AHRSNDRIFQTRDMVSQMISGGED-FILNVDAVWSK
         H +N       DM+  +     D F + + A+WS+
Subjt:  AHRSNDRIFQTRDMVSQMISGGED-FILNVDAVWSK

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] (e value: 9.2e-124; % identity: 33.01)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        N+ KRN ISG+   DG+W TE   +      YF  +F++ SP   + E      P+ VT  M+  L +++  EE+   + + +P KAPGPDGF  +FYQ+
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVP--------------------------------------------------------------
        YW +VG   +      +N +  +++ N T + LIPKV                                                               
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVP--------------------------------------------------------------

Query:  --NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T
            R  G  GY ALKLD+SK YDRVEW FI AVM  +GF   WI  I  C++T S+S ++NGE +GH  P+RGLRQGD +SPYLFLLC+EGLS +L   
Subjt:  --NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T

Query:  AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
         +++ L GIA+   +P I HLFFADDS +F+KA  EE   +K I+  YE ASGQ +N  KS+I F +NV    Q  L+ +  ++  D    YLGLP+   
Subjt:  AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
         S+++ F+ I+++       W                              LPK L   +    A FWWG S+  ++IHW  W ++C P+E+GGL F ++
Subjt:  RSRSKDFKGILDRV------W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        E F QA+LAKQ WR+L  P+S + + LK +YFP++D + + V    SY W+  + G  LL+ G+R ++G+G  + V  DPWIPRPY+F+           
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGM
        + V++G+                    +LTV D I P S  W +  L+ +   ++V  I ++P S   P DR IWHFDK G Y VKSGY        L  
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEIIRLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGM

Query:  YQRIEESPSDTDISSKWWKRLWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQS
        +     S  D D+    W+R+W        VR      + NI    +++   V+ LD ERIC            VF +  +    + C W    L     
Subjt:  YQRIEESPSDTDISSKWWKRLWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQS

Query:  AHRSNDRIFQTRDMVSQMISGGED-FILNVDAVWSK
         H +N       DM+  +     D F + + A+WS+
Subjt:  AHRSNDRIFQTRDMVSQMISGGED-FILNVDAVWSK

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein (e value: 4.8e-126; % identity: 34.68)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T  M+  L Q ++REE+E  + + +PTKAPG DG P LF+QK
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------
        YW +VG +   +CL ILN + SV+++NHT I LIPKV  P                                                            
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------

Query:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA
             +KG+    ALKLD++K YDRVEW F+RA+M +LGF   W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Subjt:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA

Query:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
        +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  KS +    N        +  +L +       +YLGLP+   
Subjt:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
        + R + F+ + D++W                                    +PKGL   ++ + ARFWW  +  K+ IHW KW+ LCK +  GGL F DL
Subjt:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        E F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---
         +  +                         V D    S QW++P L+ +  D++V  I+++P AS    D  IWH+++ G Y VKSGY+L   ++ +   
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---

Query:  ESPSDTDISSKWWKRLWS
        E  +  D++SK+WK++W+
Subjt:  ESPSDTDISSKWWKRLWS

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.8e-141; % identity: 32.78)
Query:  TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEK
        + AL+ WG  ++ V WD  +Q+K +   I  AY++P  +DFTI+H LE        N ++G+     L L E     ++ E + K  +     +A D E 
Subjt:  TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEK

Query:  VLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR--------
        ++  IP ++T +++  L   Y++EE+E  +R+ +PTKA GPDGFP LFYQ YW +VGP+T++ CL  LN    +K WN T I LIPK+  PR        
Subjt:  VLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR--------

Query:  --------------------------------------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN
                                                                + G  G AALKLD+SK +DRVEW+++  +M ++GF   WI  I 
Subjt:  --------------------------------------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN

Query:  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYE
         CIST  FSI +NG   G F PSRG+RQGDPLSPYLFLLC+EGLSA++     +G L GI    ++  I HL FADDSLIFL++   E   L+ ++  Y 
Subjt:  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYE

Query:  CASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKE
         ASGQ IN  KS + F  NV  + Q YL  IL +K   + G+YLGLPS F R R +                                  +++HW KW  
Subjt:  CASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKE

Query:  LCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP
        +C P+E GGLNF DLE F QA++AK  WR L  P   V++VLK +YF  + +L++   S SSYFWKGF+WG DLL  G+R ++GNG++++   DPW+PRP
Subjt:  LCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP

Query:  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVK
         TFK L                      RF    L       + TV   I     WD+  + H   +ED   I+ +P +S    D W+WH+DK G Y V+
Subjt:  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVK

Query:  SGYKLGMYQRIEESPSDTDISSKWW-------------------------------------------------------------KRLWSTL-GYSDMV
        SGYKL M+ +   + + T+     W                                                             +++W TL  +   +
Subjt:  SGYKLGMYQRIEESPSDTDISSKWW-------------------------------------------------------------KRLWSTL-GYSDMV

Query:  RAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSN--DRIFQTRDMVSQ--MISGGEDFILNVD
         AE  ++  + W  +   +   DL    +  W +WNDRN + H + + PV  +C+W   +L  +  A  SN   R       V Q    S      LN D
Subjt:  RAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSN--DRIFQTRDMVSQ--MISGGEDFILNVD

Query:  AVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIE
        A     +T+ G   ++   S  LVA     +P P SPL  E+  ILEGL   ++   + +E
Subjt:  AVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIE

A0A803PI64 Uncharacterized protein (e value: 2.1e-126; % identity: 33.37)
Query:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKY
        + K+N I G+    G+W +EA  + +  E+YF +IF S S     FE+V+  IP KVT DM+  L +D++ EE+   V+   PTKAPG DG P LFY K+
Subjt:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKY

Query:  WDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR------------------------------------------------------------
        W  +    +  CL +LN   +++  N T I LIPKV  P+                                                            
Subjt:  WDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR------------------------------------------------------------

Query:  -RKGK---QGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK
         RK +    G  ALKLD++K YDRVEW F+ AVM RLGF   W++ I  C+++ SFS +INGE KG   P RGLRQGDPLSP+LFL C+E LS+++   +
Subjt:  -RKGK---QGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK

Query:  RNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHR
          G L GI        + HLFFADDSL+F+ A  +     + I+  Y  ASGQ +N  KS+ CF  NV ++T+  L+ ++ ++  DN G YLGLPS   R
Subjt:  RNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHR

Query:  SRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDV
        ++              K  LD I            + +K+IHW KW+ LC+P+++GGL F DL +F QA+LAKQ WR +  P+   +RVLK  YFP    
Subjt:  SRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDV

Query:  LESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP
        LE+   +N+S+ W+  +WG  L+  G R ++GNG SVRV+ DPW+PRP TFKV       PE                             L V D    
Subjt:  LESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP

Query:  SIQWDIPKLQHVLLDEDVQEIIRLPASETT-PDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSDTDISSKWWKRLWSTLGYSDMVRAEFIMN-IQDRWI
          QWD   ++ V    D + I+ +P S+    D+ + H+ K G Y VKSGY++      E   S      +WWK+LW         +  ++++ + D + 
Subjt:  SIQWDIPKLQHVLLDEDVQEIIRLPASETT-PDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSDTDISSKWWKRLWSTLGYSDMVRAEFIMN-IQDRWI

Query:  HICNT--VSIL----------DLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEY--QSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVW
         +  T  +S+L           LE   L SW +WN RN V H    P      +W   YL E+  ++  RSN  + + R  V  ++   +   +NVDA  
Subjt:  HICNT--VSIL----------DLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEY--QSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVW

Query:  SKHTTTSGVRVVLHTKSGKLV----AILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISK
              SG+  V+   +G ++     +LQ+ +P    PL + ++AI +GL      R+ +
Subjt:  SKHTTTSGVRVVLHTKSGKLV----AILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISK

M5VU98 Reverse transcriptase domain-containing protein (e value: 1.2e-132; % identity: 32.67)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        N+ +RN I G+E ++G W T    I      YF D+F S   S    E++L  +  KVT DM   L  D+S +E++  V +  P+KAPGPDG P LFYQK
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNP------------------------------------------------------------
        YW +VG   V    A L     ++  NHT + LIPKV  P                                                            
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNP------------------------------------------------------------

Query:  ----RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA
            RR+G++G  ALKLD+SK YDRVEW F+  +M  +GFP  W+ ++ DC++T S+S ++NGE     YP+RGLRQGDPLSPYLFLLC+EG + +L  A
Subjt:  ----RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA

Query:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
        +R G L GI +   +P + HLFFADDS +F KA+    G LK I   YE ASGQ IN  KS + F  N+  DTQS L+S+L +   D+  +YLGLP    
Subjt:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
        R+++  F+ + +RVW                                    LP+GL   I  + ARFWWG     ++IHW +W+ LCK + +GG+ F  L
Subjt:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        + F  AMLAKQ WR++  P S  +R+LK +YFP ++  E+ + S  S  WK       +L+ G R +IG+G SVR+  D W+PRP TF V+   +   E 
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPAS-ETTPDRWIWHFDKFGGYMVKSGYKLGMYQRI---E
        T V + I                          C   S QWD+ KL ++ L  DV +I+R+P S    PDR +W++DK G + VKS Y++ +       +
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPAS-ETTPDRWIWHFDKFGGYMVKSGYKLGMYQRI---E

Query:  ESPSDTDISSKWWKRLWSTLGYSDM-------------VRAEFI---MNIQDRWIHICN-TVSILDLERICLGSWALWNDRNCVFH------KRPIPPVG
        ES S    +   W+ +W+    + +              +A  I   +++QD  +   + T S L +  +C  + A WN      H      + P   VG
Subjt:  ESPSDTDISSKWWKRLWSTLGYSDM-------------VRAEFI---MNIQDRWIHICN-TVSILDLERICLGSWALWNDRNCVFH------KRPIPPVG

Query:  VRCDWTLDYLSEYQSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSL
            +  ++++   +  +  DR+   RD V            N D  +   +    V VV     G  VA + K +    S    E++A  EG+ +  SL
Subjt:  VRCDWTLDYLSEYQSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) (e value: 4.8e-126; % identity: 34.68)
Query:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK
        ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T  M+  L Q ++REE+E  + + +PTKAPG DG P LF+QK
Subjt:  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQK

Query:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------
        YW +VG +   +CL ILN + SV+++NHT I LIPKV  P                                                            
Subjt:  YWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPR-----------------------------------------------------------

Query:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA
             +KG+    ALKLD++K YDRVEW F+RA+M +LGF   W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Subjt:  -----RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA

Query:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH
        +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  KS +    N        +  +L +       +YLGLP+   
Subjt:  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFH

Query:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL
        + R + F+ + D++W                                    +PKGL   ++ + ARFWW  +  K+ IHW KW+ LCK +  GGL F DL
Subjt:  RSRSKDFKGILDRVW------------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDL

Query:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL
        E F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Subjt:  EIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL

Query:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---
         +  +                         V D    S QW++P L+ +  D++V  I+++P AS    D  IWH+++ G Y VKSGY+L   ++ +   
Subjt:  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIE---

Query:  ESPSDTDISSKWWKRLWS
        E  +  D++SK+WK++W+
Subjt:  ESPSDTDISSKWWKRLWS

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 7.6e-20; % identity: 20.52)
Query:  IRQVKDKIKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVL-KFIPRKVTPDMSHTLTQD
        ++++ +     ++R   ID  +   ++    K ++N I  ++   G   T+  +I      Y+K ++ ++  +  + +  L  +   ++  +   +L + 
Subjt:  IRQVKDKIKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVL-KFIPRKVTPDMSHTLTQD

Query:  YSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRRK--------------------------
         +  E+  ++      K+PGPDGF   FYQ+Y + + P  +    +I         +   +I+LIPK      K                          
Subjt:  YSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRRK--------------------------

Query:  ----------------GKQG---------------------YAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFY
                        G QG                     +  + +D  K +D+++  F+   + +LG    ++ +I       + +II+NG+    F 
Subjt:  ----------------GKQG---------------------YAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFY

Query:  PSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPS
           G RQG PLSP LF +  E L+  +   K   + GI +     K+    FADD +++L+       +L  +++++   SG  INV KSQ  F  N   
Subjt:  PSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPS

Query:  DTQSYLSSILQMKSADNLGSYLGL
         T+S +   L    A     YLG+
Subjt:  DTQSYLSSILQMKSADNLGSYLGL

P08548 LINE-1 reverse transcriptase homolog (e value: 1.4e-18; % identity: 22.53)
Query:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLK--FIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQ
        K  ++ IS +   +    T+  +I K    Y+K +++ +  +  + ++ L+   +PR    ++   L +  S  E+   ++     K+PGPDGF + FYQ
Subjt:  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLK--FIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQ

Query:  KYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKV-PNPRRK-----------------------------------------GKQG----------
         + + + P  ++    I         +   NI LIPK   +P RK                                         G QG          
Subjt:  KYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKV-PNPRRK-----------------------------------------GKQG----------

Query:  -----------YAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA
                   +  L +D  K +D ++  F+   ++++G    ++ LI    S  + +II+NG     F    G RQG PLSP LF +  E L+  +   
Subjt:  -----------YAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA

Query:  KRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSI---LQMKSADNLGSYL
        +   + GI +   S +I    FADD +++L+ + +    L  ++ +Y   SG  IN  KS    + N     ++   SI   +  K    LG YL
Subjt:  KRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSI---LQMKSADNLGSYL

P0C2F6 Putative ribonuclease H protein At1g65750    4.8e-14    24.4
Query:  LDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESE---VR
        +  + LP+ +L+ +  L   F WGS+  KK+ H  KW ++C P+++GGL     +   +A+++K  WR+L    S    VL+ +Y    ++ +S     +
Subjt:  LDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESE---VR

Query:  SNSSYFWKGFIWGL-DLLKSGIRKKIGNGNSVRVMVDPWIP-RPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQW
         + S  W+    GL D++  G+    G+G  +R   D W+  +P      G +  D + TVV K                           D  +P   W
Subjt:  SNSSYFWKGFIWGL-DLLKSGIRKKIGNGNSVRVMVDPWIP-RPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQW

Query:  DIPKLQHVLLDEDVQEI--IRLPASETTPDRWIWHFDKFGGYMVKSGYKL
        D  K+     +    E+  + L       DR  W F + G + V+S Y++
Subjt:  DIPKLQHVLLDEDVQEI--IRLPASETTPDRWIWHFDKFGGYMVKSGYKL

P92555 Uncharacterized mitochondrial protein AtMg01250    1.4e-13    58.82
Query:  IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDS
        IING  +G   PSRGLRQGDPLSPYLF+LC+E LS +   A+  G L GI ++ +SP+I HL FADD+
Subjt:  IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg00310    5.4e-26    41.67
Query:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFW
        L K L   ++S    FWW S + K++I W  W++LCK  E+ GGL F DL  F QA+LAKQ++R++  P + ++R+L+ RYFP S ++E  V +  SY W
Subjt:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFW

Query:  KGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI
        +  I G +LL  G+ + IG+G   +V +D WI
Subjt:  KGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI

Arabidopsis top hits    e value    %identity    Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein    9.5e-10    28.57
Query:  LKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRV----MVDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGY
        +K RYF    +L+++VR   SY W   + G+ LLK G R  IG+G ++R+    +VD   PRP   +   YK    E+T+                    
Subjt:  LKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRV----MVDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGY

Query:  KIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRL-PASETTPDRWIWHFDKFGGYMVKSGYKL
         +F+ + +          WD  K+   +   D   I R+  A    PD+ IW+++  G Y V+SGY L
Subjt:  KIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRL-PASETTPDRWIWHFDKFGGYMVKSGYKL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    4.1e-05    37.04
Query:  NIVLIPKVPNP--RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWI
        NIV + +  +   R+KG +G+  LKLD+ K YDR+ W ++   +   GFP  W+
Subjt:  NIVLIPKVPNP--RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWI

AT4G29090.1 Ribonuclease H-like superfamily protein    3.5e-28    30.43
Query:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWK
        LPK +   I S+ A FWW +    K +HWK W  L   + +GG+ F D+E F  A+L KQ WR+L+ PES +A+V K RYF  SD L + + S  S+ WK
Subjt:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWK

Query:  GFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI-PRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPS--------IQWD
              ++L+ G R  +GNG  + +    W+  +P +                      + LR  R+    Y      L V D I  S        I+  
Subjt:  GFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI-PRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPS--------IQWD

Query:  IPKLQHVLLDEDVQEIIRLPASETTPDRWIWHFDKFGGYMVKSGY--------KLGMYQRIEESPSDTDISSKWWK
         P+++  L+ E        P      D + W +   G Y VKSGY        K    Q + E PS   I  K WK
Subjt:  IPKLQHVLLDEDVQEIIRLPASETTPDRWIWHFDKFGGYMVKSGY--------KLGMYQRIEESPSDTDISSKWWK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    3.8e-27    41.67
Query:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFW
        L K L   ++S    FWW S + K++I W  W++LCK  E+ GGL F DL  F QA+LAKQ++R++  P + ++R+L+ RYFP S ++E  V +  SY W
Subjt:  LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFW

Query:  KGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI
        +  I G +LL  G+ + IG+G   +V +D WI
Subjt:  KGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    9.8e-15    58.82
Query:  IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDS
        IING  +G   PSRGLRQGDPLSPYLF+LC+E LS +   A+  G L GI ++ +SP+I HL FADD+
Subjt:  IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIFHLFFADDS


Sequences
CDS sequence
ATGCGCTTCCGCTGCGTGGGGGCTTTTATTCCCTTCACGAAGGCCCTTAAAGAGTGGGGTTACAAAAAAAACAGAGTTCGATGGGATAATATTCGCCAAGTTAAGGACAA
GATTAAGGTAGCTTATGATAGACCTACGCTAATTGATTTCACTATTGTGCATCGGCTTGAGAGTCACCTCAATAAACATAAAAGAAACACAATCAGTGGTGTGGAAGGCG
CAGATGGGTTGTGGCTTACAGAGGCGGACCAGATTCACAAGGCCTTTGAATCGTATTTTAAAGACATTTTCAATTCCCAATCTCCTTCTGCCACGGATTTCGAAAAAGTG
TTGAAGTTTATTCCTCGAAAAGTCACACCAGATATGAGTCACACGCTTACTCAAGATTACTCGCGCGAGGAGGTGGAAGGAGTGGTTCGAAAGTTTTATCCTACAAAAGC
GCCAGGTCCAGATGGTTTCCCTACCTTATTCTACCAAAAATATTGGGATATGGTAGGACCTCAAACGGTGGATGAATGTTTAGCAATTCTGAATCGTAAGCGTTCAGTCA
AGGATTGGAACCATACCAATATTGTGCTCATCCCAAAGGTTCCAAACCCACGGAGGAAGGGGAAGCAGGGCTATGCGGCTCTTAAGCTTGATATAAGTAAGACCTATGAT
AGAGTAGAATGGTCTTTTATTAGGGCGGTTATGGAAAGATTGGGTTTCCCATGCGACTGGATTAATCTGATTAATGATTGCATTTCTACTGCTTCTTTTTCTATTATTAT
TAATGGGGAGGCTAAAGGGCATTTTTACCCGTCAAGAGGTTTGAGACAAGGAGACCCTTTGTCCCCGTACTTGTTTTTGCTATGCTCAGAGGGTTTGTCTGCTATTTTGG
GTACTGCCAAGAGGAATGGTCTCATGGGGATTGCTATGACCCCGTCATCACCAAAAATCTTCCATCTATTTTTTGCAGACGATAGTCTCATCTTTCTAAAGGCCTCAACG
GAGGAATTTGGTCATTTAAAGATTATCATGGCTGACTATGAATGTGCGTCGGGTCAAAGCATTAATGTGGATAAATCTCAAATATGTTTCTTCAGGAATGTGCCAAGCGA
TACTCAGTCTTACCTTAGCTCAATTTTGCAAATGAAGTCTGCGGACAATTTGGGATCTTACCTTGGCTTGCCTTCGTCTTTCCACCGTAGTCGGAGCAAGGATTTCAAGG
GTATTCTTGATCGTGTATGGCTACCGAAAGGTCTTCTAGACAGTATTTCTTCATTATGTGCGAGGTTTTGGTGGGGCTCTTCTGATACGAAAAAGCGTATTCATTGGAAG
AAGTGGAAGGAGTTGTGTAAGCCAGAAGAGCAGGGCGGTCTGAATTTTTGGGACTTAGAGATATTCATTCAAGCAATGCTCGCAAAACAAGCCTGGCGTGTCCTGACCCT
TCCAGAGTCAACGGTGGCAAGAGTTCTTAAAGGGAGGTATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATAT
GGGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTT
GGTTACAAGATATTTGATCCAGAGTTGACAGTAGTTCTTAAAGGGATTGGATTTTCCATCCTCAGACGTTTTAGAATCAGAGTGCTTGGTTACAAGATATTTGATCCAGA
GTTGACAGTAGTGGATTGTATTCTGCCGTCTATTCAATGGGATATTCCGAAACTCCAACATGTTCTTTTGGATGAGGATGTTCAAGAGATTATAAGACTCCCAGCTAGCG
AGACAACTCCGGATAGATGGATTTGGCATTTTGATAAATTTGGAGGTTATATGGTTAAGAGTGGGTACAAATTGGGTATGTATCAGAGAATAGAGGAGTCACCTTCGGAC
ACTGATATCAGTTCCAAGTGGTGGAAGAGACTTTGGTCGACTTTGGGGTATAGTGATATGGTGAGGGCAGAGTTCATTATGAATATTCAGGACCGGTGGATACATATCTG
CAATACTGTTTCAATATTGGATCTGGAAAGGATCTGTTTGGGATCTTGGGCTCTATGGAACGATCGGAATTGCGTGTTTCATAAGCGGCCAATTCCTCCGGTGGGGGTCC
GGTGTGATTGGACTTTGGATTATCTTTCTGAATACCAATCAGCTCACCGGTCCAACGATCGAATATTCCAAACGAGGGATATGGTCTCTCAGATGATTTCAGGTGGGGAG
GATTTTATTCTAAATGTCGACGCAGTGTGGTCCAAACATACCACGACCAGTGGAGTGAGGGTAGTATTACACACAAAGTCGGGTAAGTTGGTGGCTATTTTACAAAAAGG
GATTCCTTTACCTTCTTCTCCATTATGTGTTGAGGTGATTGCGATACTTGAAGGTCTTAATATGACTTCTTCTTTGAGGATTAGTAAGATTGAGGCCACCATTGTGATAA
TCCCAGAAAGCGTTGTCTCTGGCGACGACAAGGAAGCTCACAGCGGCACTGCGACTGCGTTGGTCTCCAGTGAAGAAGCTATATCAAATGTTAATCGAGAAACCAAGGAG
GATGTTTCTTGA
mRNA sequence
ATGCGCTTCCGCTGCGTGGGGGCTTTTATTCCCTTCACGAAGGCCCTTAAAGAGTGGGGTTACAAAAAAAACAGAGTTCGATGGGATAATATTCGCCAAGTTAAGGACAA
GATTAAGGTAGCTTATGATAGACCTACGCTAATTGATTTCACTATTGTGCATCGGCTTGAGAGTCACCTCAATAAACATAAAAGAAACACAATCAGTGGTGTGGAAGGCG
CAGATGGGTTGTGGCTTACAGAGGCGGACCAGATTCACAAGGCCTTTGAATCGTATTTTAAAGACATTTTCAATTCCCAATCTCCTTCTGCCACGGATTTCGAAAAAGTG
TTGAAGTTTATTCCTCGAAAAGTCACACCAGATATGAGTCACACGCTTACTCAAGATTACTCGCGCGAGGAGGTGGAAGGAGTGGTTCGAAAGTTTTATCCTACAAAAGC
GCCAGGTCCAGATGGTTTCCCTACCTTATTCTACCAAAAATATTGGGATATGGTAGGACCTCAAACGGTGGATGAATGTTTAGCAATTCTGAATCGTAAGCGTTCAGTCA
AGGATTGGAACCATACCAATATTGTGCTCATCCCAAAGGTTCCAAACCCACGGAGGAAGGGGAAGCAGGGCTATGCGGCTCTTAAGCTTGATATAAGTAAGACCTATGAT
AGAGTAGAATGGTCTTTTATTAGGGCGGTTATGGAAAGATTGGGTTTCCCATGCGACTGGATTAATCTGATTAATGATTGCATTTCTACTGCTTCTTTTTCTATTATTAT
TAATGGGGAGGCTAAAGGGCATTTTTACCCGTCAAGAGGTTTGAGACAAGGAGACCCTTTGTCCCCGTACTTGTTTTTGCTATGCTCAGAGGGTTTGTCTGCTATTTTGG
GTACTGCCAAGAGGAATGGTCTCATGGGGATTGCTATGACCCCGTCATCACCAAAAATCTTCCATCTATTTTTTGCAGACGATAGTCTCATCTTTCTAAAGGCCTCAACG
GAGGAATTTGGTCATTTAAAGATTATCATGGCTGACTATGAATGTGCGTCGGGTCAAAGCATTAATGTGGATAAATCTCAAATATGTTTCTTCAGGAATGTGCCAAGCGA
TACTCAGTCTTACCTTAGCTCAATTTTGCAAATGAAGTCTGCGGACAATTTGGGATCTTACCTTGGCTTGCCTTCGTCTTTCCACCGTAGTCGGAGCAAGGATTTCAAGG
GTATTCTTGATCGTGTATGGCTACCGAAAGGTCTTCTAGACAGTATTTCTTCATTATGTGCGAGGTTTTGGTGGGGCTCTTCTGATACGAAAAAGCGTATTCATTGGAAG
AAGTGGAAGGAGTTGTGTAAGCCAGAAGAGCAGGGCGGTCTGAATTTTTGGGACTTAGAGATATTCATTCAAGCAATGCTCGCAAAACAAGCCTGGCGTGTCCTGACCCT
TCCAGAGTCAACGGTGGCAAGAGTTCTTAAAGGGAGGTATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATAT
GGGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTT
GGTTACAAGATATTTGATCCAGAGTTGACAGTAGTTCTTAAAGGGATTGGATTTTCCATCCTCAGACGTTTTAGAATCAGAGTGCTTGGTTACAAGATATTTGATCCAGA
GTTGACAGTAGTGGATTGTATTCTGCCGTCTATTCAATGGGATATTCCGAAACTCCAACATGTTCTTTTGGATGAGGATGTTCAAGAGATTATAAGACTCCCAGCTAGCG
AGACAACTCCGGATAGATGGATTTGGCATTTTGATAAATTTGGAGGTTATATGGTTAAGAGTGGGTACAAATTGGGTATGTATCAGAGAATAGAGGAGTCACCTTCGGAC
ACTGATATCAGTTCCAAGTGGTGGAAGAGACTTTGGTCGACTTTGGGGTATAGTGATATGGTGAGGGCAGAGTTCATTATGAATATTCAGGACCGGTGGATACATATCTG
CAATACTGTTTCAATATTGGATCTGGAAAGGATCTGTTTGGGATCTTGGGCTCTATGGAACGATCGGAATTGCGTGTTTCATAAGCGGCCAATTCCTCCGGTGGGGGTCC
GGTGTGATTGGACTTTGGATTATCTTTCTGAATACCAATCAGCTCACCGGTCCAACGATCGAATATTCCAAACGAGGGATATGGTCTCTCAGATGATTTCAGGTGGGGAG
GATTTTATTCTAAATGTCGACGCAGTGTGGTCCAAACATACCACGACCAGTGGAGTGAGGGTAGTATTACACACAAAGTCGGGTAAGTTGGTGGCTATTTTACAAAAAGG
GATTCCTTTACCTTCTTCTCCATTATGTGTTGAGGTGATTGCGATACTTGAAGGTCTTAATATGACTTCTTCTTTGAGGATTAGTAAGATTGAGGCCACCATTGTGATAA
TCCCAGAAAGCGTTGTCTCTGGCGACGACAAGGAAGCTCACAGCGGCACTGCGACTGCGTTGGTCTCCAGTGAAGAAGCTATATCAAATGTTAATCGAGAAACCAAGGAG
GATGTTTCTTGA
Protein sequence
MRFRCVGAFIPFTKALKEWGYKKNRVRWDNIRQVKDKIKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKV
LKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRRKGKQGYAALKLDISKTYD
RVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKAST
EEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWK
KWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVL
GYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSD
TDISSKWWKRLWSTLGYSDMVRAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSNDRIFQTRDMVSQMISGGE
DFILNVDAVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIEATIVIIPESVVSGDDKEAHSGTATALVSSEEAISNVNRETKE
DVS