; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G24830 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G24830
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr3:21996701..22000110
RNA-Seq ExpressionCSPI03G24830
SyntenyCSPI03G24830
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]0.0e+0054.74Show/hide
Query:  CRAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMT-------------------YERFCVYASNSNIER
        C+AV DFL  S+VGFCC+LETRV+E NF ++S +FG+SW F+ +Y+NS +GR+W+MWK NRF F T                    E FCVYASNSNIER
Subjt:  CRAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMT-------------------YERFCVYASNSNIER

Query:  HVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNV
         +LWRQ+ EI++GW   G+VMGDF  I +HSEAF  A   G+ME  D+ IR+ADL+EP VQ                       VND+ L  WPN+ VNV
Subjt:  HVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNV

Query:  LPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMET
        L WGIS+HSPIL YPS Q+++ V SFRFFNHWV++ SF  VV   W +   VSP+V+ +RNL+  K IL  HFG         VRL K ++ +  R    
Subjt:  LPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMET

Query:  NL------LSEKPLCARNRESDDGNRLTNHDQVS---------------QVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVN
         L         + L      +  G++   + ++S                 ALQVPI  EEVRR LFSMDSGKAPGPDG+S+G FKG           VN
Subjt:  NL------LSEKPLCARNRESDDGNRLTNHDQVS---------------QVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVN

Query:  TIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWD
           +TLIPKR GA+R+E+FRPISCCNVIYKCIS+ILADRLR WLP+F+S                      V GYHL+ G P CT+KVDLQKAYD +NWD
Subjt:  TIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWD

Query:  FLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCT
        FLFGL IAI  P++FVSW++AC                FFHGRKG+RQG+PLS F FVMVM+V SRMLN PPQ FQFHQ CEKVKLT LT ADDLMIFC 
Subjt:  FLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCT

Query:  ADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA----
        AD  S+SF++ET+++FGEL GL+ANL K S F+ G  +  ASRLAANMGF +G+LPVRYL LPLL+GRLR SDC PLIQ ITSRIRSW+ARVLSFA    
Subjt:  ADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA----

Query:  ----VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILK
            V RSLQVYWASVF+LP  VH  VDKILRSYLWR                          +RDG SWNI STLKILWLL   S SLWVAWVEAYILK
Subjt:  ----VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILK

Query:  GRSMLGWVDLGVLGLSCVSGISIKL-MLRWR-WAMAG-SVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDW
         +S+   VD GV    C+  I  K   L++  W   G    C      W+QG  I++Q GERV+YDA SRR+AR  +F+G DG+W+WP VS++L+D+WD 
Subjt:  GRSMLGWVDLGVLGLSCVSGISIKL-MLRWR-WAMAG-SVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDW

Query:  IQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFS
        +Q VRP  SV DRWVWVPG    FSIASA +TIRP   RV W GLLW  GN+PKHSFCAWL I+++LGTRDRL +WD S+P+S +LC G  ESRD LFFS
Subjt:  IQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFS

Query:  CPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
        CPFG ++WSR+L  M+SSHRI YWGVELSWIC+QGIG  VRRKLW +L CAT YFIWKE NH LHGG  R  +LIF
Subjt:  CPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]0.0e+0057.75Show/hide
Query:  SSKFGDSW--GFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIR
        SS   + W   ++ +Y+NSGVGR+W+MWK  RF F+T+                    E  CVYASNS+ ER  LWR + EI++ W   G+VMGDFN IR
Subjt:  SSKFGDSW--GFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIR

Query:  LHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRF
        +HSEAFGG+   G+MEE D+ IR+ADLVEP+VQGNWFTWTSK+ G G++RRLDR+LVNDE LS WP MR+NVLPWGIS+HSPIL YPS Q +  VVSFRF
Subjt:  LHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRF

Query:  FNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSE-----------------------
        FNHWV+E SF++VV+  W++   VS +V+++RNL +LK ILR  FG+HI+++SE+V +AK+AMD AQRE+E N LS+                       
Subjt:  FNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSE-----------------------

Query:  ------------------------KPLCARNR----ESDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREE
                                +   +RN        DG+R+++HD V+Q+                                     ALQ+PI REE
Subjt:  ------------------------KPLCARNR----ESDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREE

Query:  VRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSF
        VRR LFSMDSGKAPGPDG+S+GF+KGAW+V              T Y P  VN  AITLIPK  GA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF
Subjt:  VRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSF

Query:  VSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLR
        +S NQ AFIPGRSII+NILLCQELVGGYHL+ G PRCT+KVDLQKAYD VNWDFLFGL IAIG P++FVSW+RAC                FF+GRKGLR
Subjt:  VSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLR

Query:  QGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAAN
        QGDPLS FLFVMVMEVLSRMLN  PQ+F+FH  CEKVKLTHLT ADDLMIFC AD  S+SFI+E +++FGE SGLFAN  KSS F+VGVN+  AS LAA 
Subjt:  QGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAAN

Query:  MGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAK
        +G S    P            LRS DC PLIQ ITSRIRSW+ARVLSFA        VLRSLQVYWASVF+LP  VH +VDKILRSYLWRGKEEGRGG K
Subjt:  MGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAK

Query:  VAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGT
        VAW +VCLPF+EGG  IRDG SWNIA+TLKI   LL   GSLWVAW+EAYILKG+S+  W VD  V    C     ++ +LR R  M   V         
Subjt:  VAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGT

Query:  IIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPK
             GERV+YDA SRR+A+  DF+  +G+W WP VSL+L+D+W+ +Q V P  SV D WVWVPG    FSIASAWE I P   RV W GLLW  GNIPK
Subjt:  IIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPK

Query:  HSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIY
        HSFCAWLAI+DRL TRDRL +WD SIPLSC+LC G  ESRD LFFSCPFG ++WSR+   M SSHRIG+WGVELSWIC++GIGKGVRRKLW +LWCATIY
Subjt:  HSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIY

Query:  FIWKERNHNLHGGAVREPMLIF
        FIW ERNH LHGG  R+P+++F
Subjt:  FIWKERNHNLHGGAVREPMLIF

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0053.6Show/hide
Query:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV
        +AV DFL  S+VGFCC LETRV+E NF S+S +FG+SW ++ +Y+NSGVGR+W+MWK NRF F T+ R   + + +                      +V
Subjt:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV

Query:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS
        M DFN IR HSEA GG+   G+ME+ DM IR+ADLVEP+VQGNWFTWTSK+ G G+MRRLDR+L+ND+ LS WP M VNVLPWGIS+HSPIL+YPS Q++
Subjt:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS

Query:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----------------
          VVSFR FNHWV++ SF+                                 FG+HIR++SE+VR+AK+AMD AQRE+E N +S                
Subjt:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----------------

Query:  ----EKPLCARNRE---SDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREEVRRGLFSMDSGKAPGPDGYS
            EKP   RN E       +R+++HD V+Q+                                     ALQVPI REEVRR LFSMDSGKAPGPDG+S
Subjt:  ----EKPLCARNRE---SDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREEVRRGLFSMDSGKAPGPDGYS

Query:  IGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILL
        +GFFKGAW+V              T Y P  VN  AITLIPK NGA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF+S NQ AFI GRSII+NILL
Subjt:  IGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILL

Query:  CQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRM
        CQELVGGYHL+ G PRCT+KVDLQKAYD VNWDFLFGL I+I  P++FVSW+ AC                FFHGRKG+RQGDPLSLFLFVMVMEVLSRM
Subjt:  CQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRM

Query:  LNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG
        LN  PQ+FQFH  CE                               KRFGELSGLFAN  KSS F+ GVN+  ASRLAA MGF  G+LPVRYLGLPLL+G
Subjt:  LNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG

Query:  RLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDG
        RLRS+DC PLIQ ITSRIRS SARVLSFA        VL SLQVYWA VF+LP  VH                                 +EGG  IRDG
Subjt:  RLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDG

Query:  SSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYD
        ++W  ASTLKILWL+L  SGSLWVAWVEAY+LKGRS+  W VD  V    C+  I     KL    R  +     C      W+Q G I++Q GERV+YD
Subjt:  SSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYD

Query:  AGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDR
        A SRR+A   +F+G DG+W WP                                   FSIASAWE IRP   RV W GLLW  GNIPKHSFCAWLAI+DR
Subjt:  AGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDR

Query:  LGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHG
        LGTRDR  +WD S+PLSC+LC G  ESRD LFFSCPFG ++WSR+L  M+SSHRIG+WGVELSWIC+QGI KGVRRKLW +LWCATIYFIW ERNH LHG
Subjt:  LGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHG

Query:  GAVREPMLIF
        G   +P++IF
Subjt:  GAVREPMLIF

TYK19523.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0054.39Show/hide
Query:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV
        +AV DFL  S+VGFCC LETRV+E NF S+S +FG+SW ++ +Y+NSGVGR+W+MWK NRF F T+ R   + + +                      +V
Subjt:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV

Query:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS
        M DFN IR HSEA GG+   G+ME+ DM IR+ADLVEP+VQ NWFTWTSK+ G G+MRRLDR+L+ND+ LS WP M VNVLPWGIS+HSPIL+YPS Q++
Subjt:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS

Query:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----EKPLCARNRESD
          V                       ++   VSP+V ++RNL  LK ILR  FG+HIR++SE+VR+AK+AMD AQRE++   L+       L      S 
Subjt:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----EKPLCARNRESD

Query:  DG------NRLTNH---------------DQVSQV--------ALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNY
        DG      N  +N                D + Q         ALQVPI REEVRR LFSMDSGKAPGPDG+S+GFFKGAW+V              T Y
Subjt:  DG------NRLTNH---------------DQVSQV--------ALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNY

Query:  FPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAY
         P  VN  AITLIPK NGA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF+S NQ AFI GRSII+NILLCQELVGGYHL+ G PRCT+KVDLQKAY
Subjt:  FPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAY

Query:  DFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADD
        D VNWDFLFGL I+I  P++FVSW+ AC                FFHGRKG+RQGDPLSLFLFVMVMEVLSRMLN  PQ+FQFH  CE            
Subjt:  DFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADD

Query:  LMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLS
                           KRFGELSGLFAN  KSS F+ GVN+  ASRLAA MGF  G+LPVRYLGLPLL+GRLRS+DC PLIQ ITSRIRS SARVLS
Subjt:  LMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLS

Query:  FA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWV
        FA        VL SLQVYWA VF+LP  VH                                 +EGG  IRDG++W  ASTLKILWL+L  SGSLWVAWV
Subjt:  FA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWV

Query:  EAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLD
        EAY+LKGRS+  W VD  V    C+  I     KL    R  +     C      W+Q G I+++ GERV+YDA SRR+A   +F+G DG+W WP     
Subjt:  EAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLD

Query:  LMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYES
                                      FSIASAWE IRP   RV W GLLW  GNIPKHSFCAWLAI+DRLGTRDR  +WD S+PLSC+LC G  ES
Subjt:  LMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYES

Query:  RDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
        RD LFFSCPFG ++WSR+L  M+SSHRIG+WGVELSWIC+QGI KGVRRKLW +LWCATIYFIW ERNH LHGG   +P++IF
Subjt:  RDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]0.0e+0089.97Show/hide
Query:  MAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGIS
        MAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQ NWFTWT+KIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGIS
Subjt:  MAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGIS

Query:  NHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAK----DAMDRAQREMETNL
        NHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSS WTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDV L +    +  +R +R      
Subjt:  NHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAK----DAMDRAQREMETNL

Query:  -----LSEKPLCARNRES-------DDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRN
             ++ + + AR   +        DGNRLTNHDQVSQ ALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRN
Subjt:  -----LSEKPLCARNRES-------DDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRN

Query:  GADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGM
        GADRLEDFRPISCCNVIYKCISRILADRL EWLPSFVSGNQ AFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYD VNWDFLFGL IAIGM
Subjt:  GADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGM

Query:  PIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKE
         IRFVSWVRAC                FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLT ADDLMIFC ADN+SMSFIKE
Subjt:  PIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKE

Query:  TIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQV
        TIKRFGELSGLFANLAKSS FLVGVNSS+ASRLAANMGFSIGHLPVRYLGLPLL GRL+S DCDPLIQ ITSRIRSWSARVLSFA        VLRSLQV
Subjt:  TIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQV

Query:  YWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLG
        YWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGG AIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLG
Subjt:  YWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLG

Query:  VLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPG
        VLGLSCVSGISIKL+LRWRWAMAGS+ECGWIQGGTIIQQFGERVIYDAGSRRDAR VDFM +DGDWRWPLVSLDLMDIWDWIQGVRPS SVEDRWVWVPG
Subjt:  VLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPG

Query:  SLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSH
        SLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSF AWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRD LFFSCPFGWE+WSRILLFMSSSH
Subjt:  SLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSH

Query:  RIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
        RIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPML F
Subjt:  RIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

TrEMBL top hitse value%identityAlignment
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein0.0e+0054.74Show/hide
Query:  CRAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMT-------------------YERFCVYASNSNIER
        C+AV DFL  S+VGFCC+LETRV+E NF ++S +FG+SW F+ +Y+NS +GR+W+MWK NRF F T                    E FCVYASNSNIER
Subjt:  CRAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMT-------------------YERFCVYASNSNIER

Query:  HVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNV
         +LWRQ+ EI++GW   G+VMGDF  I +HSEAF  A   G+ME  D+ IR+ADL+EP VQ                       VND+ L  WPN+ VNV
Subjt:  HVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNV

Query:  LPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMET
        L WGIS+HSPIL YPS Q+++ V SFRFFNHWV++ SF  VV   W +   VSP+V+ +RNL+  K IL  HFG         VRL K ++ +  R    
Subjt:  LPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMET

Query:  NL------LSEKPLCARNRESDDGNRLTNHDQVS---------------QVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVN
         L         + L      +  G++   + ++S                 ALQVPI  EEVRR LFSMDSGKAPGPDG+S+G FKG           VN
Subjt:  NL------LSEKPLCARNRESDDGNRLTNHDQVS---------------QVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVN

Query:  TIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWD
           +TLIPKR GA+R+E+FRPISCCNVIYKCIS+ILADRLR WLP+F+S                      V GYHL+ G P CT+KVDLQKAYD +NWD
Subjt:  TIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWD

Query:  FLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCT
        FLFGL IAI  P++FVSW++AC                FFHGRKG+RQG+PLS F FVMVM+V SRMLN PPQ FQFHQ CEKVKLT LT ADDLMIFC 
Subjt:  FLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCT

Query:  ADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA----
        AD  S+SF++ET+++FGEL GL+ANL K S F+ G  +  ASRLAANMGF +G+LPVRYL LPLL+GRLR SDC PLIQ ITSRIRSW+ARVLSFA    
Subjt:  ADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA----

Query:  ----VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILK
            V RSLQVYWASVF+LP  VH  VDKILRSYLWR                          +RDG SWNI STLKILWLL   S SLWVAWVEAYILK
Subjt:  ----VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILK

Query:  GRSMLGWVDLGVLGLSCVSGISIKL-MLRWR-WAMAG-SVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDW
         +S+   VD GV    C+  I  K   L++  W   G    C      W+QG  I++Q GERV+YDA SRR+AR  +F+G DG+W+WP VS++L+D+WD 
Subjt:  GRSMLGWVDLGVLGLSCVSGISIKL-MLRWR-WAMAG-SVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDW

Query:  IQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFS
        +Q VRP  SV DRWVWVPG    FSIASA +TIRP   RV W GLLW  GN+PKHSFCAWL I+++LGTRDRL +WD S+P+S +LC G  ESRD LFFS
Subjt:  IQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFS

Query:  CPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
        CPFG ++WSR+L  M+SSHRI YWGVELSWIC+QGIG  VRRKLW +L CAT YFIWKE NH LHGG  R  +LIF
Subjt:  CPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

A0A5A7TZS0 Reverse transcriptase domain-containing protein0.0e+0057.75Show/hide
Query:  SSKFGDSW--GFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIR
        SS   + W   ++ +Y+NSGVGR+W+MWK  RF F+T+                    E  CVYASNS+ ER  LWR + EI++ W   G+VMGDFN IR
Subjt:  SSKFGDSW--GFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIR

Query:  LHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRF
        +HSEAFGG+   G+MEE D+ IR+ADLVEP+VQGNWFTWTSK+ G G++RRLDR+LVNDE LS WP MR+NVLPWGIS+HSPIL YPS Q +  VVSFRF
Subjt:  LHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRF

Query:  FNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSE-----------------------
        FNHWV+E SF++VV+  W++   VS +V+++RNL +LK ILR  FG+HI+++SE+V +AK+AMD AQRE+E N LS+                       
Subjt:  FNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSE-----------------------

Query:  ------------------------KPLCARNR----ESDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREE
                                +   +RN        DG+R+++HD V+Q+                                     ALQ+PI REE
Subjt:  ------------------------KPLCARNR----ESDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREE

Query:  VRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSF
        VRR LFSMDSGKAPGPDG+S+GF+KGAW+V              T Y P  VN  AITLIPK  GA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF
Subjt:  VRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSF

Query:  VSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLR
        +S NQ AFIPGRSII+NILLCQELVGGYHL+ G PRCT+KVDLQKAYD VNWDFLFGL IAIG P++FVSW+RAC                FF+GRKGLR
Subjt:  VSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLR

Query:  QGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAAN
        QGDPLS FLFVMVMEVLSRMLN  PQ+F+FH  CEKVKLTHLT ADDLMIFC AD  S+SFI+E +++FGE SGLFAN  KSS F+VGVN+  AS LAA 
Subjt:  QGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAAN

Query:  MGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAK
        +G S    P            LRS DC PLIQ ITSRIRSW+ARVLSFA        VLRSLQVYWASVF+LP  VH +VDKILRSYLWRGKEEGRGG K
Subjt:  MGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAK

Query:  VAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGT
        VAW +VCLPF+EGG  IRDG SWNIA+TLKI   LL   GSLWVAW+EAYILKG+S+  W VD  V    C     ++ +LR R  M   V         
Subjt:  VAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGSVECGWIQGGT

Query:  IIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPK
             GERV+YDA SRR+A+  DF+  +G+W WP VSL+L+D+W+ +Q V P  SV D WVWVPG    FSIASAWE I P   RV W GLLW  GNIPK
Subjt:  IIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPK

Query:  HSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIY
        HSFCAWLAI+DRL TRDRL +WD SIPLSC+LC G  ESRD LFFSCPFG ++WSR+   M SSHRIG+WGVELSWIC++GIGKGVRRKLW +LWCATIY
Subjt:  HSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIY

Query:  FIWKERNHNLHGGAVREPMLIF
        FIW ERNH LHGG  R+P+++F
Subjt:  FIWKERNHNLHGGAVREPMLIF

A0A5A7UP65 Reverse transcriptase0.0e+0053.6Show/hide
Query:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV
        +AV DFL  S+VGFCC LETRV+E NF S+S +FG+SW ++ +Y+NSGVGR+W+MWK NRF F T+ R   + + +                      +V
Subjt:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV

Query:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS
        M DFN IR HSEA GG+   G+ME+ DM IR+ADLVEP+VQGNWFTWTSK+ G G+MRRLDR+L+ND+ LS WP M VNVLPWGIS+HSPIL+YPS Q++
Subjt:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS

Query:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----------------
          VVSFR FNHWV++ SF+                                 FG+HIR++SE+VR+AK+AMD AQRE+E N +S                
Subjt:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----------------

Query:  ----EKPLCARNRE---SDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREEVRRGLFSMDSGKAPGPDGYS
            EKP   RN E       +R+++HD V+Q+                                     ALQVPI REEVRR LFSMDSGKAPGPDG+S
Subjt:  ----EKPLCARNRE---SDDGNRLTNHDQVSQV-------------------------------------ALQVPIGREEVRRGLFSMDSGKAPGPDGYS

Query:  IGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILL
        +GFFKGAW+V              T Y P  VN  AITLIPK NGA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF+S NQ AFI GRSII+NILL
Subjt:  IGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILL

Query:  CQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRM
        CQELVGGYHL+ G PRCT+KVDLQKAYD VNWDFLFGL I+I  P++FVSW+ AC                FFHGRKG+RQGDPLSLFLFVMVMEVLSRM
Subjt:  CQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRM

Query:  LNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG
        LN  PQ+FQFH  CE                               KRFGELSGLFAN  KSS F+ GVN+  ASRLAA MGF  G+LPVRYLGLPLL+G
Subjt:  LNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG

Query:  RLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDG
        RLRS+DC PLIQ ITSRIRS SARVLSFA        VL SLQVYWA VF+LP  VH                                 +EGG  IRDG
Subjt:  RLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDG

Query:  SSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYD
        ++W  ASTLKILWL+L  SGSLWVAWVEAY+LKGRS+  W VD  V    C+  I     KL    R  +     C      W+Q G I++Q GERV+YD
Subjt:  SSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYD

Query:  AGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDR
        A SRR+A   +F+G DG+W WP                                   FSIASAWE IRP   RV W GLLW  GNIPKHSFCAWLAI+DR
Subjt:  AGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDR

Query:  LGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHG
        LGTRDR  +WD S+PLSC+LC G  ESRD LFFSCPFG ++WSR+L  M+SSHRIG+WGVELSWIC+QGI KGVRRKLW +LWCATIYFIW ERNH LHG
Subjt:  LGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHG

Query:  GAVREPMLIF
        G   +P++IF
Subjt:  GAVREPMLIF

A0A5D3D7P6 Reverse transcriptase0.0e+0054.39Show/hide
Query:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV
        +AV DFL  S+VGFCC LETRV+E NF S+S +FG+SW ++ +Y+NSGVGR+W+MWK NRF F T+ R   + + +                      +V
Subjt:  RAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIV

Query:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS
        M DFN IR HSEA GG+   G+ME+ DM IR+ADLVEP+VQ NWFTWTSK+ G G+MRRLDR+L+ND+ LS WP M VNVLPWGIS+HSPIL+YPS Q++
Subjt:  MGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRS

Query:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----EKPLCARNRESD
          V                       ++   VSP+V ++RNL  LK ILR  FG+HIR++SE+VR+AK+AMD AQRE++   L+       L      S 
Subjt:  QHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLS----EKPLCARNRESD

Query:  DG------NRLTNH---------------DQVSQV--------ALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNY
        DG      N  +N                D + Q         ALQVPI REEVRR LFSMDSGKAPGPDG+S+GFFKGAW+V              T Y
Subjt:  DG------NRLTNH---------------DQVSQV--------ALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTV--------------TNY

Query:  FPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAY
         P  VN  AITLIPK NGA+RLEDFRPISCCNV+YKCIS+ILADRLR WLPSF+S NQ AFI GRSII+NILLCQELVGGYHL+ G PRCT+KVDLQKAY
Subjt:  FPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAY

Query:  DFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADD
        D VNWDFLFGL I+I  P++FVSW+ AC                FFHGRKG+RQGDPLSLFLFVMVMEVLSRMLN  PQ+FQFH  CE            
Subjt:  DFVNWDFLFGLPIAIGMPIRFVSWVRAC----------------FFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADD

Query:  LMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLS
                           KRFGELSGLFAN  KSS F+ GVN+  ASRLAA MGF  G+LPVRYLGLPLL+GRLRS+DC PLIQ ITSRIRS SARVLS
Subjt:  LMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLS

Query:  FA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWV
        FA        VL SLQVYWA VF+LP  VH                                 +EGG  IRDG++W  ASTLKILWL+L  SGSLWVAWV
Subjt:  FA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWV

Query:  EAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLD
        EAY+LKGRS+  W VD  V    C+  I     KL    R  +     C      W+Q G I+++ GERV+YDA SRR+A   +F+G DG+W WP     
Subjt:  EAYILKGRSMLGW-VDLGVLGLSCVSGI---SIKLMLRWRWAMAGSVEC-----GWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLD

Query:  LMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYES
                                      FSIASAWE IRP   RV W GLLW  GNIPKHSFCAWLAI+DRLGTRDR  +WD S+PLSC+LC G  ES
Subjt:  LMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYES

Query:  RDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
        RD LFFSCPFG ++WSR+L  M+SSHRIG+WGVELSWIC+QGI KGVRRKLW +LWCATIYFIW ERNH LHGG   +P++IF
Subjt:  RDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

A0A5D3DXE4 Reverse transcriptase domain-containing protein1.7e-29755.3Show/hide
Query:  LAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQ
        ++ S VGFCC+LETRV+E NF S+S +F +SW ++ +Y+NSGVGR+W+MWK NRF F T+                    E FCVYASNSNIER +LW +
Subjt:  LAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTY--------------------ERFCVYASNSNIERHVLWRQ

Query:  MAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGIS
        + EI++ W   G+VM DFN IR+HSEAF G+   G+ME+ ++ IR+ADLVEP+VQGNWFTWTSK+ G G++RRLDR+LVND+ LS WP M VNVLPWGIS
Subjt:  MAEISAGWRGSGIVMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGIS

Query:  NHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNL----
        +H PIL YPS Q S  VVSFRFFNHWV++ SF++VV+  W++   VSP+V ++RNL  LK ILR  FG+HI+ +SE+VR+ K+AMD AQREM  N     
Subjt:  NHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNL----

Query:  LSEKPLCARNRES--DDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRNGADRLEDFRP
        L  + +  R      DD  +    ++  Q ALQ+PI REEVRR LFSMDSGKAPGPDG+S+                +N  AITLIPK NGA+RLEDF P
Subjt:  LSEKPLCARNRES--DDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRNGADRLEDFRP

Query:  ISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRA
        ISC NV+YKCIS+ILADRLR WLPSF+S NQ AFIPGRSII+NILLCQEL+          R  +   +               PI I   +        
Subjt:  ISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRA

Query:  CFFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVN
         FFHGRKG+RQGDPLS FLFVMVMEVLSRMLN  PQ+FQFH  CEKVKLTHLT ADDLMIFC AD  S+ FI++ +++FGELSGLFAN  KSS F+ GVN
Subjt:  CFFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVN

Query:  SSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWR
        +  AS LA  MGF+ G+LPVRYLGLPLL+GRLRS+DC PLIQ ITS+IRSW+ARVLSFA        VLRSLQVYWASVF+LP  VH +VDKILRSYLWR
Subjt:  SSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILRSYLWR

Query:  GKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGS
        GKEEGRGG KVAW +VCLPF+EGGF IRDG SWNIASTLKILWL+L  SGSLWVAWVEAYILKGRS+  W VD                      +  G 
Subjt:  GKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGW-VDLGVLGLSCVSGISIKLMLRWRWAMAGS

Query:  VECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSG
          C W             ++    SR           DG W WP VSL+L+D+W+ +Q V P  SV D  VWV G    FSI+SAWE IRP   RV    
Subjt:  VECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGWSG

Query:  LLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKL
         LWD                                        G  ESRD LFFSC FG ++WSR+L  M+SS+RIG+WGVELSWIC+QGIGKGVRRKL
Subjt:  LLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.9e-3124.24Show/hide
Query:  DGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFK--------------GAWTVTNYFPQAVNTIAITLIPK-RNGADRLEDFRPI
        D   L   +Q    +L  PI   E+   + S+ + K+PGPDG++  F++               +       P +    +I LIPK      + E+FRPI
Subjt:  DGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFK--------------GAWTVTNYFPQAVNTIAITLIPK-RNGADRLEDFRPI

Query:  SCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC
        S  N+  K +++ILA+R+++ +   +  +Q+ FIPG     NI     ++   +  +      I +D +KA+D +   F+      +G+   ++  +RA 
Subjt:  SCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRAC

Query:  F--------FHGRK--------GLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGL
        +         +G+K        G RQG PLS  LF +V+EVL+R +    +        E+VKL+    ADD++++      S   + + I  F ++SG 
Subjt:  F--------FHGRK--------GLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGL

Query:  FANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG--RLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFM
          N+ KS  FL   N    S++   + F+I    ++YLG+ L      L   +  PL++ I      W            ++  A+L + +  + A    
Subjt:  FANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSG--RLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFM

Query:  LPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILW
        LPM    +++K    ++W  K      A++A   +      GG  + D   +  A+  K  W
Subjt:  LPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILW

P08548 LINE-1 reverse transcriptase homolog7.8e-2923.52Show/hide
Query:  PIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSEKPLCARNRESDDGNRLTNHDQVSQVALQV---PIGREEVRRGLFSMDS
        P+ N+ R  R +KS++ +     IR  ++++      + +   E    L S K      +E D      +  ++SQ  +++   PI   E+   + ++  
Subjt:  PIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSEKPLCARNRESDDGNRLTNHDQVSQVALQV---PIGREEVRRGLFSMDS

Query:  GKAPGPDGYSIGFFKG-----AWTVTNYF---------PQAVNTIAITLIPKRNGAD--RLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAF
         K+PGPDG++  F++         + N F         P       ITLIPK  G D  R E++RPIS  N+  K +++IL +R+++ +   +  +Q+ F
Subjt:  GKAPGPDGYSIGFFKG-----AWTVTNYF---------PQAVNTIAITLIPKRNGAD--RLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAF

Query:  IPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF----------------FHGRKGLRQGDPLSLF
        IPG     NI     ++   +  +      + +D +KA+D +   F+      IG+   F+  + A +                F  R G RQG PLS  
Subjt:  IPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF----------------FHGRKGLRQGDPLSLF

Query:  LFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHL
        LF +VMEVL+  +         H   E++KL+    ADD++++      S + + E IK +  +SG   N  KS  F+   N+     +  ++ F++   
Subjt:  LFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHL

Query:  PVRYLGLPLLSG--RLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKE
         ++YLG+ L      L   + + L + I   +  W            ++  ++L +++  + A     P+   +D++KI+  ++W  K+
Subjt:  PVRYLGLPLLSG--RLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFMLPMKVHRDVDKILRSYLWRGKE

P11369 LINE-1 retrotransposable element ORF2 protein2.9e-3124.78Show/hide
Query:  NHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFK--------------GAWTVTNYFPQAVNTIAITLIPK-RNGADRLEDFRPISCCNVI
        N DQV    L  PI  +E+   + S+ + K+PGPDG+S  F++                  V    P +     ITLIPK +    ++E+FRPIS  N+ 
Subjt:  NHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFK--------------GAWTVTNYFPQAVNTIAITLIPK-RNGADRLEDFRPISCCNVI

Query:  YKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF-----
         K +++ILA+R++E + + +  +Q+ FIPG     NI     ++   +  +      I +D +KA+D +   F+  +    G+   +++ ++A +     
Subjt:  YKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF-----

Query:  ---FHGRK--------GLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAK
            +G K        G RQG PLS +LF +V+EVL+R +    +        E+VK++ L  ADD++++ +   +S   +   I  FGE+ G   N  K
Subjt:  ---FHGRK--------GLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAK

Query:  SSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLS--GRLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFMLPMKVH
        S  FL   N      +     FSI    ++YLG+ L      L   +   L + I   +R W            ++  A+L +++  + A    +P +  
Subjt:  SSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLS--GRLRSSDCDPLIQHITSRIRSWS---------ARVLSFAVL-RSLQVYWASVFMLPMKVH

Query:  RDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILW
         +++  +  ++W  K+      ++A   +      GG  + D   +  A  +K  W
Subjt:  RDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.8e-2024.79Show/hide
Query:  DRAQREMETNLLSEKPLCA-RNRESDDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAW---------TVTNYF-----PQ
        DRA R    NL S  P+      E  DG  + +  +  +  L+ PI  +E+ + L  M   K+PG DG +I FF+  W          +T  F     P 
Subjt:  DRAQREMETNLLSEKPLCA-RNRESDDGNRLTNHDQVSQVALQVPIGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAW---------TVTNYF-----PQ

Query:  AVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFV
        +     ++L+PK+     ++++RP+S  +  YK +++ ++ RL+  L   +  +Q   +PGR+I DN+ L ++L+  +    G     + +D +KA+D V
Subjt:  AVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTIKVDLQKAYDFV

Query:  NWDFLFGLPIAIGMPIRFVSWVRACFFHGR----------------KGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMI
        +  +L G   A     +FV +++  +                    +G+RQG PLS  L+ + +E    +L          +   +V L+    ADD +I
Subjt:  NWDFLFGLPIAIGMPIRFVSWVRACFFHGR----------------KGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLMI

Query:  FCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRL-AANMGFSIGHLPVRYLGLPLLSGRLR-SSDCDPLIQHITSRIRSWS--ARVL
            D   +   +E  + +   S    N +KSS  L G  S +   L  A    S     ++YLG+ L +     S +   L + + +R+  W   A+VL
Subjt:  FCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRL-AANMGFSIGHLPVRYLGLPLLSGRLR-SSDCDPLIQHITSRIRSWS--ARVL

Query:  SF----AVLRSL---QVYWASVFMLP-MKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGG
        S      V+  L   Q+++  + + P  +    + + L  +LW GK     G         LP  EGG
Subjt:  SF----AVLRSL---QVYWASVFMLP-MKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGG

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM2.6e-1625.61Show/hide
Query:  IGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTN--------------YFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLR-
        I  +++R    S+ S  +PGPDG +    K A  V +                P ++       IPK   A R +DFRPIS  +V+ + ++ ILA RL  
Subjt:  IGREEVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTN--------------YFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLR-

Query:  --EWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTI-KVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF---------------
           W P      Q  F+P     DN  +   ++   H H+    C I  +D+ KA+D ++   ++    A G P  FV +V+  +               
Subjt:  --EWLPSFVSGNQLAFIPGRSIIDNILLCQELVGGYHLHRGNPRCTI-KVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACF---------------

Query:  -FHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKV--KLTHLTS-ADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVG
         F   +G++QGDPLS  LF +VM+ L R L          +   KV   +T+  + ADDL++F          + +T+  F  + GL  N  K   F VG
Subjt:  -FHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKV--KLTHLTS-ADDLMIFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVG

Query:  VNSSEASR--LAANMGFSIGHLPV---------RYLGLPL-LSGRLR---SSDCDPLIQHITSRIRSWSARVLSFAVLRSLQVYW-ASVFMLPMKVHRDV
        +      +  +     F +G   +         +YLG+    +GR+R   + D  P +Q +T        R+ +   +   Q+Y   ++  + + V R  
Subjt:  VNSSEASR--LAANMGFSIGHLPV---------RYLGLPL-LSGRLR---SSDCDPLIQHITSRIRSWSARVLSFAVLRSLQVYW-ASVFMLPMKVHRDV

Query:  DKILRSYLWR
        DK++R Y+ R
Subjt:  DKILRSYLWR

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.2e-2121.05Show/hide
Query:  EENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASN---V
        E N  +I      SW   +NY  S +GR+WI+W  +  V                   +++++  ++        I++GDF+ I   S+ +        +
Subjt:  EENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGIVMGDFNTIRLHSEAFGGASN---V

Query:  GDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMD
          +EE    +R++DLV+   +G  +TW++      ++R+LDR + N +  S +P+        G+S+HSP ++   N   +    FR+F+      +F+ 
Subjt:  GDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRFFNHWVKESSFMD

Query:  VVSSTWTKDTRV-SPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETN--------------------------------------
         ++  W +   V S + ++  +L+  K   +    +    I    + A D+++  Q ++ TN                                      
Subjt:  VVSSTWTKDTRV-SPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETN--------------------------------------

Query:  ------------LLSEKPLCARNRESDDGNRLTNHDQVSQV--------------------------------------ALQVPIGREEVRRGLFSMDSG
                    L ++     +    DD  R+ N  QV ++                                       L      +E+   +F+M   
Subjt:  ------------LLSEKPLCARNRESDDGNRLTNHDQVSQV--------------------------------------ALQVPIGREEVRRGLFSMDSG

Query:  KAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCIS
        KAPGPD ++  FF  +W V              T +  +  N  AITLIPK  G D+L  FRP+SCC V+YK I+
Subjt:  KAPGPDGYSIGFFKGAWTV--------------TNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCIS

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-2934.18Show/hide
Query:  WIQGGTIIQQFGERVIYDAGSRR---DARRVDFMGQDGDWRWPLV-SLDLMDIWDWIQGV-RPSP-SVEDRWVWVPGSL--DSFSIASAWETIRPHSSRV
        W   G +I+  G+   Y + S R   +AR V+ +G +G W+ PL  S     I D I  +  PSP ++ED + WV G +    FS A  W+ IRP +  +
Subjt:  WIQGGTIIQQFGERVIYDAGSRR---DARRVDFMGQDGDWRWPLV-SLDLMDIWDWIQGV-RPSP-SVEDRWVWVPGSL--DSFSIASAWETIRPHSSRV

Query:  GWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRI-GYWGVELSWICNQGIGKG
         W+  +W +G +PKH+F  W++  DRL TR RL+ W       C LC    ESRD L FSC F  ++W      +    R+   W   LSW+  +     
Subjt:  GWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRI-GYWGVELSWICNQGIGKG

Query:  VRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF
            L  +   A IY IW++RN+ LH      P++IF
Subjt:  VRRKLWHLLWCATIYFIWKERNHNLHGGAVREPMLIF

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-2237.41Show/hide
Query:  SAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMW----SRILLFMSSSHRIGY
        + W  + P   RV W   +W +G IPKH+F AW+ +R RL T+DR+  W    P  C+ C  + E+R  LFF C F  E+W    SR+ +F       G 
Subjt:  SAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMW----SRILLFMSSSHRIGY

Query:  WGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVR
            + W+ N    K V   L  L   A++Y IWKERN  LH  A R
Subjt:  WGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.8e-4432.2Show/hide
Query:  LVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILR
        + GV  ++ + +  +  F+ G LPVRYLGLPLL+ ++ +SD  PL++ I  RI  W+AR LSFA        V+ SL  +W S F LP    +++D I  
Subjt:  LVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFA--------VLRSLQVYWASVFMLPMKVHRDVDKILR

Query:  SYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRD------GSSWNIASTLKI---LWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLGVLGLSCVSGISI
        S+LW G E     AKVAW +VC P DEGG  IR       GS W+I+    +   +W  ++K  +L   +V+  I  G +   W D              
Subjt:  SYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRD------GSSWNIASTLKI---LWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLGVLGLSCVSGISI

Query:  KLMLRWRWAMAGSVECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPS--PSVEDRWVWVPGSLD----SFS
                         W + G +I   G R   D G    A   + +      R    +  L+ I D I  VR     S ED   W  G+ D     F+
Subjt:  KLMLRWRWAMAGSVECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPS--PSVEDRWVWVPGSLD----SFS

Query:  IASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEM
            W   R    +V W   +W     PK+S  AW+AI++RL T DR+  W+     SC+LC    E+RD LFF+CP+  E+
Subjt:  IASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEM

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.1e-2331.9Show/hide
Query:  EDRWVW---VPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEM
        +D ++W   +    + FS    W  + P S  V W   +W + ++PKH+F  W+   +RL TRDRL  W  SIP  C+LC  + +SR  LFF C F   +
Subjt:  EDRWVW---VPGSLDSFSIASAWETIRPHSSRVGWSGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEM

Query:  WSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVR
        W R     ++ +        L+W+ +    K +   +  L + + +Y IW+ERN  LH G  R
Subjt:  WSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCATIYFIWKERNHNLHGGAVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTATCAAAGGGACACAAATATATCTGTAGAGCGGTGAAGGACTTCTTAGCCGTGTCTACAGTGGGTTTCTGTTGTATTCTAGAGACTAGAGTTCAAGAGGAGAA
CTTTGTATCGATATCCAGCAAATTTGGTGATTCATGGGGGTTCACTAGTAACTACAACAATAGTGGAGTTGGTCGGATGTGGATTATGTGGAAACATAACAGATTCGTGT
TCATGACTTATGAGAGGTTCTGTGTGTATGCCTCTAATAGTAATATCGAGAGACATGTTTTATGGAGGCAAATGGCTGAGATCTCTGCTGGTTGGAGAGGGTCGGGTATT
GTCATGGGTGATTTTAACACCATCAGACTCCACTCTGAAGCCTTCGGGGGTGCTTCGAATGTGGGTGATATGGAGGAGATTGACATGGATATTCGAGAGGCGGACCTTGT
TGAACCTGCAGTTCAGGGGAACTGGTTTACTTGGACAAGTAAAATACATGGGTTGGGTTTGATGAGGAGACTTGATCGTATCCTAGTGAATGATGAGGGGCTTAGTAAAT
GGCCTAACATGAGGGTTAACGTCCTCCCATGGGGTATTTCTAATCATTCTCCCATACTTGTCTATCCCAGTAATCAGCGAAGCCAGCATGTGGTGTCTTTTCGTTTCTTT
AACCATTGGGTTAAAGAATCGTCCTTTATGGATGTTGTGTCCTCTACTTGGACCAAAGATACTCGAGTTTCTCCAATTGTGAATATTGTGAGGAACTTAAGAAATCTCAA
GTCGATCCTTCGCAACCATTTTGGTAAGCATATTCGGACCATCAGTGAGGATGTTCGTCTTGCCAAGGATGCCATGGACCGAGCTCAGAGAGAGATGGAAACGAACCTTC
TATCGGAGAAGCCGCTATGTGCCAGAAATCGCGAATCAGATGATGGAAATCGATTGACTAATCATGATCAGGTTTCTCAGGTGGCCCTCCAGGTGCCTATTGGCAGGGAG
GAAGTGAGACGTGGTCTGTTTTCCATGGATAGTGGAAAGGCTCCAGGCCCTGATGGGTATTCGATTGGCTTCTTTAAAGGAGCTTGGACGGTGACCAATTACTTCCCTCA
AGCGGTGAATACTATTGCCATTACACTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGGATTTCAGGCCTATATCTTGCTGTAACGTTATTTACAAATGCATTTCAA
GAATATTGGCAGATAGGCTTCGTGAGTGGCTTCCTTCTTTTGTTAGTGGAAACCAGTTAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTCTTTGTCAGGAG
CTTGTAGGGGGCTACCATTTGCACAGAGGTAATCCTAGGTGCACTATAAAGGTTGACCTCCAAAAAGCTTATGATTTTGTTAATTGGGATTTCCTCTTTGGTTTGCCGAT
TGCCATAGGTATGCCTATAAGATTTGTGAGTTGGGTTCGAGCGTGTTTTTTCCATGGGAGGAAAGGACTTAGACAAGGTGATCCTTTATCCCTGTTCTTATTTGTGATGG
TCATGGAGGTGCTATCTCGCATGTTGAACCACCCACCTCAAAATTTTCAATTCCACCAATTTTGTGAGAAGGTCAAGTTAACTCATCTTACTTCTGCAGATGACCTGATG
ATCTTTTGTACTGCTGATAATCATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTTTCGGGACTGTTTGCTAATCTTGCTAAAAGCTCAAATTTTCT
TGTGGGGGTTAATAGTTCGGAAGCTTCTCGGCTTGCTGCTAATATGGGTTTTTCCATTGGTCATCTCCCTGTTCGTTATCTTGGGCTTCCTCTCCTCTCTGGAAGATTGC
GGAGCTCTGATTGTGATCCCCTTATTCAGCATATTACCAGTCGTATTCGATCTTGGTCTGCTAGAGTGTTATCTTTTGCAGTCCTTAGGAGCCTTCAGGTTTATTGGGCT
AGTGTGTTCATGCTTCCTATGAAAGTCCATAGAGACGTTGATAAGATTTTGCGGTCTTATTTGTGGAGAGGTAAGGAGGAGGGTAGAGGTGGTGCTAAAGTTGCATGGGA
TGAGGTTTGTCTTCCTTTTGATGAAGGAGGTTTTGCTATTCGCGATGGATCTTCTTGGAATATTGCAAGCACGTTGAAGATCTTATGGTTGCTGCTAGTTAAATCTGGTA
GTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGATGCTGGGGTGGGTCGATCTTGGTGTTTTAGGGCTATCTTGCGTAAGCGGGATATCCATAAAG
CTCATGTTAAGATGGAGGTGGGCAATGGCAGGAAGTGTAGAGTGTGGTTGGATTCAGGGTGGGACAATTATCCAGCAGTTTGGGGAGAGGGTGATATATGATGCAGGTAG
TCGGCGGGATGCGAGGCGTGTGGATTTCATGGGTCAGGATGGAGATTGGAGGTGGCCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATTGGATTCAGGGAGTGAGGC
CGAGTCCGAGTGTTGAGGATAGGTGGGTCTGGGTGCCAGGGAGTCTTGATAGTTTTTCGATCGCAAGTGCGTGGGAAACTATTCGTCCTCATAGTAGTAGGGTTGGCTGG
TCAGGTTTACTTTGGGATAGGGGAAATATTCCTAAGCACTCCTTCTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTACTAGAGATAGGTTAAGTCAATGGGATAGGTC
GATTCCTTTATCGTGTATGCTGTGTGGTGGGAACTATGAGTCTCGTGATCGTTTGTTTTTTTCTTGTCCTTTTGGGTGGGAGATGTGGTCGAGAATCCTTTTGTTTATGT
CATCTTCTCATAGGATAGGTTATTGGGGGGTTGAGTTATCTTGGATCTGTAATCAGGGTATTGGGAAGGGTGTAAGGAGAAAACTGTGGCACCTTCTCTGGTGTGCTACA
ATTTATTTCATTTGGAAGGAGCGAAATCATAATCTTCATGGAGGTGCTGTTCGGGAGCCTATGCTTATTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTATCAAAGGGACACAAATATATCTGTAGAGCGGTGAAGGACTTCTTAGCCGTGTCTACAGTGGGTTTCTGTTGTATTCTAGAGACTAGAGTTCAAGAGGAGAA
CTTTGTATCGATATCCAGCAAATTTGGTGATTCATGGGGGTTCACTAGTAACTACAACAATAGTGGAGTTGGTCGGATGTGGATTATGTGGAAACATAACAGATTCGTGT
TCATGACTTATGAGAGGTTCTGTGTGTATGCCTCTAATAGTAATATCGAGAGACATGTTTTATGGAGGCAAATGGCTGAGATCTCTGCTGGTTGGAGAGGGTCGGGTATT
GTCATGGGTGATTTTAACACCATCAGACTCCACTCTGAAGCCTTCGGGGGTGCTTCGAATGTGGGTGATATGGAGGAGATTGACATGGATATTCGAGAGGCGGACCTTGT
TGAACCTGCAGTTCAGGGGAACTGGTTTACTTGGACAAGTAAAATACATGGGTTGGGTTTGATGAGGAGACTTGATCGTATCCTAGTGAATGATGAGGGGCTTAGTAAAT
GGCCTAACATGAGGGTTAACGTCCTCCCATGGGGTATTTCTAATCATTCTCCCATACTTGTCTATCCCAGTAATCAGCGAAGCCAGCATGTGGTGTCTTTTCGTTTCTTT
AACCATTGGGTTAAAGAATCGTCCTTTATGGATGTTGTGTCCTCTACTTGGACCAAAGATACTCGAGTTTCTCCAATTGTGAATATTGTGAGGAACTTAAGAAATCTCAA
GTCGATCCTTCGCAACCATTTTGGTAAGCATATTCGGACCATCAGTGAGGATGTTCGTCTTGCCAAGGATGCCATGGACCGAGCTCAGAGAGAGATGGAAACGAACCTTC
TATCGGAGAAGCCGCTATGTGCCAGAAATCGCGAATCAGATGATGGAAATCGATTGACTAATCATGATCAGGTTTCTCAGGTGGCCCTCCAGGTGCCTATTGGCAGGGAG
GAAGTGAGACGTGGTCTGTTTTCCATGGATAGTGGAAAGGCTCCAGGCCCTGATGGGTATTCGATTGGCTTCTTTAAAGGAGCTTGGACGGTGACCAATTACTTCCCTCA
AGCGGTGAATACTATTGCCATTACACTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGGATTTCAGGCCTATATCTTGCTGTAACGTTATTTACAAATGCATTTCAA
GAATATTGGCAGATAGGCTTCGTGAGTGGCTTCCTTCTTTTGTTAGTGGAAACCAGTTAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTCTTTGTCAGGAG
CTTGTAGGGGGCTACCATTTGCACAGAGGTAATCCTAGGTGCACTATAAAGGTTGACCTCCAAAAAGCTTATGATTTTGTTAATTGGGATTTCCTCTTTGGTTTGCCGAT
TGCCATAGGTATGCCTATAAGATTTGTGAGTTGGGTTCGAGCGTGTTTTTTCCATGGGAGGAAAGGACTTAGACAAGGTGATCCTTTATCCCTGTTCTTATTTGTGATGG
TCATGGAGGTGCTATCTCGCATGTTGAACCACCCACCTCAAAATTTTCAATTCCACCAATTTTGTGAGAAGGTCAAGTTAACTCATCTTACTTCTGCAGATGACCTGATG
ATCTTTTGTACTGCTGATAATCATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTTTCGGGACTGTTTGCTAATCTTGCTAAAAGCTCAAATTTTCT
TGTGGGGGTTAATAGTTCGGAAGCTTCTCGGCTTGCTGCTAATATGGGTTTTTCCATTGGTCATCTCCCTGTTCGTTATCTTGGGCTTCCTCTCCTCTCTGGAAGATTGC
GGAGCTCTGATTGTGATCCCCTTATTCAGCATATTACCAGTCGTATTCGATCTTGGTCTGCTAGAGTGTTATCTTTTGCAGTCCTTAGGAGCCTTCAGGTTTATTGGGCT
AGTGTGTTCATGCTTCCTATGAAAGTCCATAGAGACGTTGATAAGATTTTGCGGTCTTATTTGTGGAGAGGTAAGGAGGAGGGTAGAGGTGGTGCTAAAGTTGCATGGGA
TGAGGTTTGTCTTCCTTTTGATGAAGGAGGTTTTGCTATTCGCGATGGATCTTCTTGGAATATTGCAAGCACGTTGAAGATCTTATGGTTGCTGCTAGTTAAATCTGGTA
GTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGATGCTGGGGTGGGTCGATCTTGGTGTTTTAGGGCTATCTTGCGTAAGCGGGATATCCATAAAG
CTCATGTTAAGATGGAGGTGGGCAATGGCAGGAAGTGTAGAGTGTGGTTGGATTCAGGGTGGGACAATTATCCAGCAGTTTGGGGAGAGGGTGATATATGATGCAGGTAG
TCGGCGGGATGCGAGGCGTGTGGATTTCATGGGTCAGGATGGAGATTGGAGGTGGCCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATTGGATTCAGGGAGTGAGGC
CGAGTCCGAGTGTTGAGGATAGGTGGGTCTGGGTGCCAGGGAGTCTTGATAGTTTTTCGATCGCAAGTGCGTGGGAAACTATTCGTCCTCATAGTAGTAGGGTTGGCTGG
TCAGGTTTACTTTGGGATAGGGGAAATATTCCTAAGCACTCCTTCTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTACTAGAGATAGGTTAAGTCAATGGGATAGGTC
GATTCCTTTATCGTGTATGCTGTGTGGTGGGAACTATGAGTCTCGTGATCGTTTGTTTTTTTCTTGTCCTTTTGGGTGGGAGATGTGGTCGAGAATCCTTTTGTTTATGT
CATCTTCTCATAGGATAGGTTATTGGGGGGTTGAGTTATCTTGGATCTGTAATCAGGGTATTGGGAAGGGTGTAAGGAGAAAACTGTGGCACCTTCTCTGGTGTGCTACA
ATTTATTTCATTTGGAAGGAGCGAAATCATAATCTTCATGGAGGTGCTGTTCGGGAGCCTATGCTTATTTTCTAG
Protein sequenceShow/hide protein sequence
MVLSKGHKYICRAVKDFLAVSTVGFCCILETRVQEENFVSISSKFGDSWGFTSNYNNSGVGRMWIMWKHNRFVFMTYERFCVYASNSNIERHVLWRQMAEISAGWRGSGI
VMGDFNTIRLHSEAFGGASNVGDMEEIDMDIREADLVEPAVQGNWFTWTSKIHGLGLMRRLDRILVNDEGLSKWPNMRVNVLPWGISNHSPILVYPSNQRSQHVVSFRFF
NHWVKESSFMDVVSSTWTKDTRVSPIVNIVRNLRNLKSILRNHFGKHIRTISEDVRLAKDAMDRAQREMETNLLSEKPLCARNRESDDGNRLTNHDQVSQVALQVPIGRE
EVRRGLFSMDSGKAPGPDGYSIGFFKGAWTVTNYFPQAVNTIAITLIPKRNGADRLEDFRPISCCNVIYKCISRILADRLREWLPSFVSGNQLAFIPGRSIIDNILLCQE
LVGGYHLHRGNPRCTIKVDLQKAYDFVNWDFLFGLPIAIGMPIRFVSWVRACFFHGRKGLRQGDPLSLFLFVMVMEVLSRMLNHPPQNFQFHQFCEKVKLTHLTSADDLM
IFCTADNHSMSFIKETIKRFGELSGLFANLAKSSNFLVGVNSSEASRLAANMGFSIGHLPVRYLGLPLLSGRLRSSDCDPLIQHITSRIRSWSARVLSFAVLRSLQVYWA
SVFMLPMKVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGFAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSMLGWVDLGVLGLSCVSGISIK
LMLRWRWAMAGSVECGWIQGGTIIQQFGERVIYDAGSRRDARRVDFMGQDGDWRWPLVSLDLMDIWDWIQGVRPSPSVEDRWVWVPGSLDSFSIASAWETIRPHSSRVGW
SGLLWDRGNIPKHSFCAWLAIRDRLGTRDRLSQWDRSIPLSCMLCGGNYESRDRLFFSCPFGWEMWSRILLFMSSSHRIGYWGVELSWICNQGIGKGVRRKLWHLLWCAT
IYFIWKERNHNLHGGAVREPMLIF