CuGenDBv2

CSPI04G12030 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G12030
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr4:10344102..10350491
RNA-Seq Expression: CSPI04G12030
Synteny: CSPI04G12030
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily
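
The genome location of this record uses a `chromosome:start..end` notation. The following minimal Python sketch shows one way to split such a string into its parts and compute the span of the gene. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for illustration, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings such as 'Chr4:10344102..10350491'.
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chromosome:start..end' string into its three components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomeLocation(match["chrom"], int(match["start"]), int(match["end"]))


loc = parse_location("Chr4:10344102..10350491")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the location listed above corresponds to a span of 6390 bp on Chr4.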


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 53.08)
Query:  KNRGMISIRDKRKRAEVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDV-------IPMEEPCLRWVSVAF---WRLEFE------
        K R ++S+RD+ K  EV++ +SF +L+EV   DKW L+IV+ SP P++V   A+     +  V         + E     VS  F   W   +       
Subjt:  KNRGMISIRDKRKRAEVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDV-------IPMEEPCLRWVSVAF---WRLEFE------

Query:  -------RRTLSPFPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTR
               ++    F   +V  QF+   + DL+ G  VEV CVYASNSNIERR+LWR++ EI++GW  P       GV+   G + ++  +          
Subjt:  -------RRTLSPFPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTR

Query:  SRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIV
         +G    +   I  + L+  L  + VND+ L  WPN+ VNVL WGISDHSPIL YPS Q++++V SFRFFNHWVE+ SF  VV   W +   VSP+V+ +
Subjt:  SRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIV

Query:  RNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSN
        RNL+  K IL RHFG                                      LAT  FW AVR+++ ++ QKS+IRWLKL DQN  FFHRS+      N
Subjt:  RNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSN

Query:  ALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNY
        +L S             Q I YRELS  I+ IVQFRW++EC Q LQ PI  EEVRRVLFSMDSGKAPGPDG+SVG FK                      
Subjt:  ALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNY

Query:  FPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAY
           GVN T +TLIPKR GA+R+E+F PISCC+VIYKCIS+ILADRLRVWLP+F+S                      V  YHL+ GKP CT+KVDLQKAY
Subjt:  FPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAY

Query:  DSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDD
        DS+NWDFLFGLLIAI TPL+FVSW++AC+TSPMFSIMINGSLEGFFHGRKG+RQG+PLSPF FVMVM+V SRMLN PPQ FQFHQ CEKV+LT LTF DD
Subjt:  DSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDD

Query:  LMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLS
        LMIFC AD  S+SF++ET+++FGEL  L+ANL K SIF+ G  +  AS LAAN    +G+L VRYL LPLL+ RL+ SDC PLIQRITS IRSW+ARVLS
Subjt:  LMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLS

Query:  FAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSG-------
        FAGR QLV SV RSLQVYWASVF+LP  VH  +DKILR+YLWR                          +RDG SWNI STLKILWLL   S        
Subjt:  FAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSG-------

Query:  ------RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMD
              +SL  +D+GV RSWC R ILRK+D LK HV +EVG+G  CRVWL PW+QG PI++Q GERV+YDA SR +ARL +F+G DG+W+W  VS++L+D
Subjt:  ------RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMD

Query:  IWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDH
        +WDRVQ VRP  SV DRWVWVPG    F I S  +TIRP   RV W GLLWGG N+PKHSFCAWL I+++LGTRDRL RWD S+P+S +LC G  ESRDH
Subjt:  IWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDH

Query:  LFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVHG
        LFFSC FG ++WSR+L +M+SSHRI YWGVELSWI +QGIG SVRRKLWR+L CAT YFIW+E NHRLHG   R  +++FQ I +CI+AR  SW +  H 
Subjt:  LFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVHG

Query:  LI
        LI
Subjt:  LI

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 62.92)
Query:  FPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRT----------WGTWRSLTWLFERRTLLNTRSRG
        F  +++ ++F+ G +TDL+ G  VEV+CVYASNS+ ERR LWR + EI++ W      +     +R            G             L+    +G
Subjt:  FPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRT----------WGTWRSLTWLFERRTLLNTRSRG

Query:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNL
          FTWTSK+ GSG++ RLDR+LVNDE LS WP MR+NVLPWGISDHSPIL YPS Q + +VVSFRFFNHWVEE SF++VV+  W++   VS +V+++RNL
Subjt:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNL

Query:  RNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALR
         +LK ILRR FGRHI+++SE+V +A + MD A+RE+E N LS+  S  ASLAT  FW AVR++E ++RQKS++RWL L DQNTAFFHRSVRSR S N+L 
Subjt:  RNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALR

Query:  SVIDPDENRLTNHD----------------QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEG
        S++D D +R+++HD                Q I YRELS  I++IVQF+W+EECCQ LQ PI REEVRRVLFSMDSGKAPGPDG+SVGF+KGAW+VVGE 
Subjt:  SVIDPDENRLTNHD----------------QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEG

Query:  FCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGK
        FC+ VLHFFET Y P GVN TAITLIPK  GA+RLEDF PISCC+V+YKCIS+ILADRLR+WLPSF+S NQ AFIPGRSII+NILLCQELVG YHL+ GK
Subjt:  FCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGK

Query:  PRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFC
        PRCT+KVDLQKAYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTS MFSIMINGSLEGFF+GRKGLRQGDPLSPFLFVMVMEVLSRMLN  PQ+F+FH  C
Subjt:  PRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFC

Query:  EKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSR-RLQSSDCDPLIQR
        EKV+LTHLTF DDLMIFC AD  S+SFI+E +++FGE S LFAN  KSSIF+VGVN+  AS LAA   +     S   L  P  S   L+S DC PLIQR
Subjt:  EKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSR-RLQSSDCDPLIQR

Query:  ITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKIL-
        ITS IRSW+ARVLSFAGRLQLV SVLRSLQVYWASVF+LP  VH ++DKILR+YLWRG EEGRGG KVAW +VCLPF+EGGL IRDG SWNIA+TLKIL 
Subjt:  ITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKIL-

Query:  ---------WL-LLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD
                 W+   +  G+SL ++D+ V RSWC R ILRK++ +K HV                           GERV+YDA SR +A+L DF+  +G+
Subjt:  ---------WL-LLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD

Query:  WRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSC
        W W  VSL+L+D+W+RVQ V P  SV D WVWVPG    F I S WE I P   RV W GLLWGG NIPKHSFCAWLAI+DRL TRDRL RWD SIPLSC
Subjt:  WRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSC

Query:  LLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIK
        +LC G  ESRDHLFFSC FG ++WSR+  +M SSHRIG+WGVELSWI ++GIGK VRRKLWR+LWCATIYFIW ERNHRLHG   R+P+++F LI + I+
Subjt:  LLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIK

Query:  ARAASWSDGVH
        ARA SW +  H
Subjt:  ARAASWSDGVH

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus] (e value: 0.0e+00; % identity: 73.17)
Query:  MAEISAGWRGPDSTLKPSGVLR----------TWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGIS
        MAEISAGWRG    +     +R            G    +        L+    +   FTWT+KIHG GLM RLDRILVNDEGLS WPNMRVNVLPWGIS
Subjt:  MAEISAGWRGPDSTLKPSGVLR----------TWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGIS

Query:  DHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEE
        +HSPILVYPSNQRSQ VVSFRFFNHWV+E+SFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILR HFG+HIRTISEDV L           M    L E 
Subjt:  DHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEE

Query:  ASNHASLATVNFWKAVR-VKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQ
                   F++  R V E+A                    ++ V++RQS NAL S+IDPD NRLTNHDQ                        Q LQ
Subjt:  ASNHASLATVNFWKAVR-VKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQ

Query:  SPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRL
         PIGREEVRR LFSMDSGKAPGPDGYS+GFFKGAWTV              TNYFPQ VNT AITLIPKRNGADRLEDF PISCC+VIYKCISRILADRL
Subjt:  SPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRL

Query:  RVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFF
          WLPSFVSGNQ AFIPGRSIIDNILLCQELVG YHLHRG PRCT+KVDLQKAYDSVNWDFLFGLLIAIG  +RFVSWVRAC TS MFSI+INGSLEGFF
Subjt:  RVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFF

Query:  HGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSK
        HGRKGLRQGDPLS FLFVMVMEVLSRMLN+PPQNFQFHQFCEKV+LTHLTF DDLMIFC ADN+SMSFIKETIKRFGELS LFANL KSSIFLVGVNSSK
Subjt:  HGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSK

Query:  ASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNE
        AS LAAN   SIGHL VRYLGLPLL  RLQS DCDPLIQRITS IRSWSARVLSFAGRLQLV SVLRSLQVYWASVFMLPMKVHRD+DKILR+YLWRG E
Subjt:  ASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNE

Query:  EGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLE-IDAGVSRSWCFREILRKQDILKAHVKMEVGNGR
        EGRGGAKVAWDEVCLPFDEGGL IRDGSSWNIASTLKILWLLLVKS              GRS+L  +D GV    C   I  K   L    +  +    
Subjt:  EGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLE-IDAGVSRSWCFREILRKQDILKAHVKMEVGNGR

Query:  KCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRV
        +C      WIQGG IIQQFGERVIYDAGSR DARLVDFM RDGDWRW LVSLDLMDIWD +QGVRPS SVEDRWVWVPGS DSF I S WETIRPHSSRV
Subjt:  KCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRV

Query:  GWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSV
        GWSGLLW   NIPKHSF AWLAIRDRLGTRDRLS+WDRSIPLSC+LCGGNYESRDHLFFSC FGWEIWSRILL MSSSHRIGYWGVELSWI NQGIGK V
Subjt:  GWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSV

Query:  RRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVF
        RRKLW LLWCATIYFIW+ERNH LHG A+REPM+ F
Subjt:  RRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVF

XP_031740402.1 uncharacterized protein LOC116403409 [Cucumis sativus] (e value: 0.0e+00; % identity: 81.92)
Query:  MRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAK
        MRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWV+EASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMD+A+
Subjt:  MRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAK

Query:  REMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHD----------------QNI
        REMETNSLSEEASNHASLATVNFWKAVRV+E AMRQKS+ RWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPD NRLTNHD                QNI
Subjt:  REMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHD----------------QNI

Query:  SYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGAD
        SY ELSTSIEEIVQFRWTEECCQ LQSPIGREEVRRVLFSMD GKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGAD
Subjt:  SYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGAD

Query:  RLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLR
        RLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYD VNWDFLFGLLIAI     
Subjt:  RLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLR

Query:  FVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIK
                                                                                      DDLMIFCTADNHSMSFIKETIK
Subjt:  FVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIK

Query:  RFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWA
        RFGELS LFANL KS IFLVGVNSSKAS LAAN   SIGHL VRYLGLPLLSRRL+SSDCDPLIQRITS IRSWSARVLSFAGRLQLV SVLRSLQVYWA
Subjt:  RFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWA

Query:  SVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAGVSRS
        SVFMLPMKVHRD+DKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS              GRSLLEIDAGVSRS
Subjt:  SVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAGVSRS

Query:  WCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD
        WCFREILRK+DILK HVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD
Subjt:  WCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD

XP_031745634.1 uncharacterized protein LOC116406053 [Cucumis sativus] (e value: 0.0e+00; % identity: 60.48)
Query:  PGIVLETFVFDSVPVWIKLERIPLELWTYAGLAIIA--------------KRRRLSYVRICVELNVESSMSAEITFNLRGAKFIVTVAYEWKLRKCNLCR
        PGIV E+FVFDSV V IKL RIPLELWT AGLA++A              +RRRLSY RICVELNV+S M AE+T NLRG +FIVTV YEWK +KCNLCR
Subjt:  PGIVLETFVFDSVPVWIKLERIPLELWTYAGLAIIA--------------KRRRLSYVRICVELNVESSMSAEITFNLRGAKFIVTVAYEWKLRKCNLCR

Query:  SFGHLSSTCPK---IEVSKKEAVSKEDPVQEIVPTKEVVTTCEKFGDVV---------------------------------KNRGMISIRDKRKRAEVS
        SFGH  +TCPK    E SKKEA SKE PV+E+VPTKEVV  C ++ DVV                                 KNRGMISI D+ KRAEVS
Subjt:  SFGHLSSTCPK---IEVSKKEAVSKEDPVQEIVPTKEVVTTCEKFGDVV---------------------------------KNRGMISIRDKRKRAEVS

Query:  ITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAFWRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKVEVLC
        ITNSFNNLMEVDKGDKWPLSIVD SPPPLRVDDS+MVLLST GDVIPM E                    +P   +            DLISGEKVEVLC
Subjt:  ITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAFWRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKVEVLC

Query:  VYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLR----------TWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGL
        VYA NSNIERRVLWR+MAEISAGWRGP   L     +R            G WRSLTWLFERRTLLN   +G  FTWTSKIHGSGLM RLDRILVNDEGL
Subjt:  VYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLR----------TWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGL

Query:  STWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDT
        STWPNMRVNVLPW                                ASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDT
Subjt:  STWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDT

Query:  MDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHD--------------
        MD+A+REMETNSLSEEASNHASLAT                                         SSNALRSVIDPD NRLTNHD              
Subjt:  MDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNHD--------------

Query:  --QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPK
          QNISY ELSTSIEEIVQFRWTEECCQ LQSPIGR EVRRVLFSMD GKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPK
Subjt:  --QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPK

Query:  RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAI
        RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYD VNWDFLFGLLIAI
Subjt:  RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAI

Query:  GTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFI
        GTPLRF                                                                                              
Subjt:  GTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFI

Query:  KETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSL
                                      ++S LAAN   SIGHL VRYLGLPLLSRRL+SSDCDPLIQRITS IRSWSARVLSFAGRLQLV SVLRSL
Subjt:  KETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSL

Query:  QVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSGRSLLEIDAGVSRSWCFREILRK
        QVYWASVFMLPMKVHRD+DKILRAYLWR          + W+ + L  D  G        W                            RSWCFREILRK
Subjt:  QVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSGRSLLEIDAGVSRSWCFREILRK

Query:  QDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFG
        +DILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFG
Subjt:  QDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein (e value: 0.0e+00; % identity: 53.08)
Query:  KNRGMISIRDKRKRAEVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDV-------IPMEEPCLRWVSVAF---WRLEFE------
        K R ++S+RD+ K  EV++ +SF +L+EV   DKW L+IV+ SP P++V   A+     +  V         + E     VS  F   W   +       
Subjt:  KNRGMISIRDKRKRAEVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDV-------IPMEEPCLRWVSVAF---WRLEFE------

Query:  -------RRTLSPFPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTR
               ++    F   +V  QF+   + DL+ G  VEV CVYASNSNIERR+LWR++ EI++GW  P       GV+   G + ++  +          
Subjt:  -------RRTLSPFPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTR

Query:  SRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIV
         +G    +   I  + L+  L  + VND+ L  WPN+ VNVL WGISDHSPIL YPS Q++++V SFRFFNHWVE+ SF  VV   W +   VSP+V+ +
Subjt:  SRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIV

Query:  RNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSN
        RNL+  K IL RHFG                                      LAT  FW AVR+++ ++ QKS+IRWLKL DQN  FFHRS+      N
Subjt:  RNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSN

Query:  ALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNY
        +L S             Q I YRELS  I+ IVQFRW++EC Q LQ PI  EEVRRVLFSMDSGKAPGPDG+SVG FK                      
Subjt:  ALRSVIDPDENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNY

Query:  FPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAY
           GVN T +TLIPKR GA+R+E+F PISCC+VIYKCIS+ILADRLRVWLP+F+S                      V  YHL+ GKP CT+KVDLQKAY
Subjt:  FPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAY

Query:  DSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDD
        DS+NWDFLFGLLIAI TPL+FVSW++AC+TSPMFSIMINGSLEGFFHGRKG+RQG+PLSPF FVMVM+V SRMLN PPQ FQFHQ CEKV+LT LTF DD
Subjt:  DSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDD

Query:  LMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLS
        LMIFC AD  S+SF++ET+++FGEL  L+ANL K SIF+ G  +  AS LAAN    +G+L VRYL LPLL+ RL+ SDC PLIQRITS IRSW+ARVLS
Subjt:  LMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLS

Query:  FAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSG-------
        FAGR QLV SV RSLQVYWASVF+LP  VH  +DKILR+YLWR                          +RDG SWNI STLKILWLL   S        
Subjt:  FAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSG-------

Query:  ------RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMD
              +SL  +D+GV RSWC R ILRK+D LK HV +EVG+G  CRVWL PW+QG PI++Q GERV+YDA SR +ARL +F+G DG+W+W  VS++L+D
Subjt:  ------RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMD

Query:  IWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDH
        +WDRVQ VRP  SV DRWVWVPG    F I S  +TIRP   RV W GLLWGG N+PKHSFCAWL I+++LGTRDRL RWD S+P+S +LC G  ESRDH
Subjt:  IWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDH

Query:  LFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVHG
        LFFSC FG ++WSR+L +M+SSHRI YWGVELSWI +QGIG SVRRKLWR+L CAT YFIW+E NHRLHG   R  +++FQ I +CI+AR  SW +  H 
Subjt:  LFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVHG

Query:  LI
        LI
Subjt:  LI

A0A5A7TZS0 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 62.92)
Query:  FPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRT----------WGTWRSLTWLFERRTLLNTRSRG
        F  +++ ++F+ G +TDL+ G  VEV+CVYASNS+ ERR LWR + EI++ W      +     +R            G             L+    +G
Subjt:  FPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRT----------WGTWRSLTWLFERRTLLNTRSRG

Query:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNL
          FTWTSK+ GSG++ RLDR+LVNDE LS WP MR+NVLPWGISDHSPIL YPS Q + +VVSFRFFNHWVEE SF++VV+  W++   VS +V+++RNL
Subjt:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNL

Query:  RNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALR
         +LK ILRR FGRHI+++SE+V +A + MD A+RE+E N LS+  S  ASLAT  FW AVR++E ++RQKS++RWL L DQNTAFFHRSVRSR S N+L 
Subjt:  RNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALR

Query:  SVIDPDENRLTNHD----------------QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEG
        S++D D +R+++HD                Q I YRELS  I++IVQF+W+EECCQ LQ PI REEVRRVLFSMDSGKAPGPDG+SVGF+KGAW+VVGE 
Subjt:  SVIDPDENRLTNHD----------------QNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEG

Query:  FCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGK
        FC+ VLHFFET Y P GVN TAITLIPK  GA+RLEDF PISCC+V+YKCIS+ILADRLR+WLPSF+S NQ AFIPGRSII+NILLCQELVG YHL+ GK
Subjt:  FCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGK

Query:  PRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFC
        PRCT+KVDLQKAYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTS MFSIMINGSLEGFF+GRKGLRQGDPLSPFLFVMVMEVLSRMLN  PQ+F+FH  C
Subjt:  PRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFC

Query:  EKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSR-RLQSSDCDPLIQR
        EKV+LTHLTF DDLMIFC AD  S+SFI+E +++FGE S LFAN  KSSIF+VGVN+  AS LAA   +     S   L  P  S   L+S DC PLIQR
Subjt:  EKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSR-RLQSSDCDPLIQR

Query:  ITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKIL-
        ITS IRSW+ARVLSFAGRLQLV SVLRSLQVYWASVF+LP  VH ++DKILR+YLWRG EEGRGG KVAW +VCLPF+EGGL IRDG SWNIA+TLKIL 
Subjt:  ITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKIL-

Query:  ---------WL-LLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD
                 W+   +  G+SL ++D+ V RSWC R ILRK++ +K HV                           GERV+YDA SR +A+L DF+  +G+
Subjt:  ---------WL-LLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGD

Query:  WRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSC
        W W  VSL+L+D+W+RVQ V P  SV D WVWVPG    F I S WE I P   RV W GLLWGG NIPKHSFCAWLAI+DRL TRDRL RWD SIPLSC
Subjt:  WRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSC

Query:  LLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIK
        +LC G  ESRDHLFFSC FG ++WSR+  +M SSHRIG+WGVELSWI ++GIGK VRRKLWR+LWCATIYFIW ERNHRLHG   R+P+++F LI + I+
Subjt:  LLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIK

Query:  ARAASWSDGVH
        ARA SW +  H
Subjt:  ARAASWSDGVH

A0A5A7UP65 Reverse transcriptase (e value: 0.0e+00; % identity: 53.67)
Query:  EVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAF-WRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKV
        EV   NSF +L+EV   DKW LSI++GSP   R      + L +   V       L   SV F   LE   R  +    + V+ +F          G   
Subjt:  EVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAF-WRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKV

Query:  EVLCVYASNSNIER-RVLWRR-------------------MAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGL
        +  C Y SNS + R  V+W++                   + E     RG       S +    G             L+    +G  FTWTSK+ GSG+
Subjt:  EVLCVYASNSNIER-RVLWRR-------------------MAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGL

Query:  MNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRH
        M RLDR+L+ND+ LS WP M VNVLPWGISDHSPIL+YPS Q++ +VVSFR FNHWV++ SF+                               R FGRH
Subjt:  MNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRH

Query:  IRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH-
        IR++SE+VR+A + MD A+RE+E N +S+  S  ASLAT  FW AVR+++                +N  F   SV SR SS+    V     N  +N  
Subjt:  IRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH-

Query:  -DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPK
          Q I YREL+  I++IVQF+W+EECCQ LQ PI REEVRRVLFSMDSGKAPGPDG+SVGFFKGAW+V+GE FCD VLHFFET Y P GVN TAITLIPK
Subjt:  -DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPK

Query:  RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAI
         NGA+RLEDF PISCC+V+YKCIS+ILADRLRVWLPSF+S NQ AFI GRSII+NILLCQELVG YHL+ GKPRCT+KVDLQKAYDSVNWDFLFGL I+I
Subjt:  RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAI

Query:  GTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFI
         TPL+FVSW+ ACVTSPMFSIMINGSLEGFFHGRKG+RQGDPLS FLFVMVMEVLSRMLN  PQ+FQFH  CE                           
Subjt:  GTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFI

Query:  KETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSL
            KRFGELS LFAN  KSSIF+ GVN+  AS LAA      G+L VRYLGLPLL+ RL+S+DC PLIQRITS IRS SARVLSFAGRLQLV SVL SL
Subjt:  KETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSL

Query:  QVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDA
        QVYWA VF+LP  VH                                 +EGGL IRDG++W  ASTLKILWL+L  S              GRSL ++D+
Subjt:  QVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDA

Query:  GVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSV
         V RSWC R ILRKQ+ LK HV+M+VGNG +CRVWL PW+Q G I++Q GERV+YDA SR +A L +F+G DG+W W                       
Subjt:  GVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSV

Query:  EDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSR
                     F I S WE IRP   RV W GLLWGG NIPKHSFCAWLAI+DRLGTRDR  RWD S+PLSC+LC G  ESRDHLFFSC FG ++WSR
Subjt:  EDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSR

Query:  ILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH
        +L +M+SSHRIG+WGVELSWI +QGI K VRRKLWR+LWCATIYFIW ERNHRLHG    +P+V+F LI + I+ARA SW +  +
Subjt:  ILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH

A0A5A7V3Z0 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 60.87)
Query:  LMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGR
        ++ RLDR+LVN++  S WP M VNVLPWGISDHSPIL YPS Q + +VVSF FFNHWVE+ SF++VV+  W +   VSPIV+++RNL +LK I+RR FGR
Subjt:  LMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGR

Query:  HIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH
        HI+++SE+VR A   +D A+RE+E N +S+  S+ A L+T  FW AVR++E ++RQKS+IRWLKL DQNTAFFHRSVRSR S N LRS++D D  R    
Subjt:  HIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH

Query:  DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKR
           I YRELS  I++IVQF+W+EECCQ LQ PI REEVRRVLFSMDSGKAPGPDG+S                              GVN TAITLIPK 
Subjt:  DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKR

Query:  NGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIG
        NGA+RLEDF PISCC+ +YKCIS+ILADRLR WLPSF+S NQ AFIPGRSII+NILLCQELVG YHL+ GKPRCT+KVDLQKAYDSVNWDFLFGLLIAIG
Subjt:  NGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIG

Query:  TPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIK
        TPL+FVSW+RACVTSPMFSIMINGSLEGFFHGRKG+RQGDPLS FLFVMVMEVLSRMLN  PQ+F FH  CEKV+LTHLTF DDLMIFC A+  S+ FI+
Subjt:  TPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIK

Query:  ETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ
        E +++FGELS LFAN  KSSIF+ GVN+  AS LA       G+LSVRYLGLPLL+ RL+S+D  PLIQRITS IRSW+ARVLSFAGRLQLVHSVLRS Q
Subjt:  ETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ

Query:  VYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAG
        VYWASVF+LP  VH ++DKILR+YLWRG EEGRGG KVAW +VCLPF+EGGL IRDG SWNIASTLKILWL+L  S              GRSL ++D+ 
Subjt:  VYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAG

Query:  VSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVE
        V +SWC R ILRK++ LK  V+M+VGNG   RVWL PW+  G I++Q GERV+YDA SR  ARL DF+  DG+W W  VSL+L+D+W+RVQ V P  SV 
Subjt:  VSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVE

Query:  DRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI
        D WVWVPG    F I S WE +RP   RV W GLLWGG NI KH FCAWLAI+DRLGT DRL RWD S+P+ C+L                     W   
Subjt:  DRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI

Query:  LLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH
                   ++G   SW+   G+G         +   + + F+          +  R+P+V+F LI S I+ARA SW    H
Subjt:  LLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH

A0A5D3D7P6 Reverse transcriptase    0.0e+00    52.62
Query:  EVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAF-WRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKV
        EV   NSF +L+EV   DKW LSI++GSP   R      + L +   V       L   SV F   LE   R  +    + V+ +F          G   
Subjt:  EVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMVLLSTTGDVIPMEEPCLRWVSVAF-WRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKV

Query:  EVLCVYASNSNIER-RVLWRR-------------------MAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGL
        +  C Y SNS + R  V+W++                   + E     RG       S +    G             L+    +   FTWTSK+ GSG+
Subjt:  EVLCVYASNSNIER-RVLWRR-------------------MAEISAGWRGPDSTLKPSGVLRTWGTWRSLTWLFERRTLLNTRSRGTGFTWTSKIHGSGL

Query:  MNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRH
        M RLDR+L+ND+ LS WP M VNVLPWGISDHSPIL+YPS Q++ +V                       ++   VSP+V ++RNL  LK ILRR FGRH
Subjt:  MNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVRNLRNLKSILRRHFGRH

Query:  IRTISEDVRLANDTMDQAKREMETNSLS-EEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH
        IR++SE+VR+A + MD A+RE++  SL+  E SN    A V+    V    V                              SN+L S            
Subjt:  IRTISEDVRLANDTMDQAKREMETNSLS-EEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDPDENRLTNH

Query:  DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKR
         Q I YREL+  I++IVQF+W+EECCQ LQ PI REEVRRVLFSMDSGKAPGPDG+SVGFFKGAW+V+GE FCD VLHFFET Y P GVN TAITLIPK 
Subjt:  DQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKR

Query:  NGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIG
        NGA+RLEDF PISCC+V+YKCIS+ILADRLRVWLPSF+S NQ AFI GRSII+NILLCQELVG YHL+ GKPRCT+KVDLQKAYDSVNWDFLFGL I+I 
Subjt:  NGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIG

Query:  TPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIK
        TPL+FVSW+ ACVTSPMFSIMINGSLEGFFHGRKG+RQGDPLS FLFVMVMEVLSRMLN  PQ+FQFH  CE                            
Subjt:  TPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIK

Query:  ETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ
           KRFGELS LFAN  KSSIF+ GVN+  AS LAA      G+L VRYLGLPLL+ RL+S+DC PLIQRITS IRS SARVLSFAGRLQLV SVL SLQ
Subjt:  ETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ

Query:  VYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAG
        VYWA VF+LP  VH                                 +EGGL IRDG++W  ASTLKILWL+L  S              GRSL ++D+ 
Subjt:  VYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKS--------------GRSLLEIDAG

Query:  VSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVE
        V RSWC R ILRKQ+ LK HV+M+VGNG +CRVWL PW+Q G I+++ GERV+YDA SR +A L +F+G DG+W W                        
Subjt:  VSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVE

Query:  DRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI
                    F I S WE IRP   RV W GLLWGG NIPKHSFCAWLAI+DRLGTRDR  RWD S+PLSC+LC G  ESRDHLFFSC FG ++WSR+
Subjt:  DRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI

Query:  LLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH
        L +M+SSHRIG+WGVELSWI +QGI K VRRKLWR+LWCATIYFIW ERNHRLHG    +P+V+F LI + I+ARA SW +  +
Subjt:  LLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARAASWSDGVH

SwissProt top hits    e value    %identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    4.8e-40    25
Query:  ELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFET----NYFPQGVNTTAITLIPK-RNG
        E+ T ++     R  +E  ++L  PI   E+  ++ S+ + K+PGPDG++  F++       E     +L  F++       P      +I LIPK    
Subjt:  ELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFET----NYFPQGVNTTAITLIPK-RNG

Query:  ADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTP
          + E+F PIS  ++  K +++ILA+R++  +   +  +Q  FIPG     NI     ++   +  + K    + +D +KA+D +   F+   L  +G  
Subjt:  ADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTP

Query:  LRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKET
          ++  +RA    P  +I++NG     F  + G RQG PLSP LF +V+EVL+R +    +        E+V+L+   F DD++++      S   + + 
Subjt:  LRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKET

Query:  IKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS--RRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ
        I  F ++S    N+ KS  FL   N    S +      +I    ++YLG+ L    + L   +  PL++ I      W     S+ GR+ +V   +    
Subjt:  IKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS--RRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQ

Query:  VYW--ASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILW
        +Y   A    LPM    +++K    ++W         A++A   +      GG+ + D   +  A+  K  W
Subjt:  VYW--ASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILW

P08548 LINE-1 reverse transcriptase homolog    2.6e-41    24.04
Query:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRS--QQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVR
        T +T+ S  HG+   +++D IL +   LS +   ++ ++P   SDH  I V  +N R+      +++  N  +++   +D +    TK    +   N   
Subjt:  TGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRS--QQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVNIVR

Query:  NLRNL----KSILRRHF----GRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASL-ATVNFWKAVRVKEVAMRQKSQ-IRWLKLDDQNTAFFHR
        N +NL    K++LR  F        +T  E+V      + Q ++E  +N           + A +N  +  R+ +   + KS     +   D+  A   R
Subjt:  NLRNL----KSILRRHF----GRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASL-ATVNFWKAVRVKEVAMRQKSQ-IRWLKLDDQNTAFFHR

Query:  SVRSRQSSNALRS-----VIDPDE-NRLTNHDQNISYRELSTSIEEIVQF-------RWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFK
          R +   +++R+       DP E  ++ N      Y     +++EI Q+       R +++  + L  PI   E+   + ++   K+PGPDG++  F++
Subjt:  SVRSRQSSNALRS-----VIDPDE-NRLTNHDQNISYRELSTSIEEIVQF-------RWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFK

Query:  GAWTVVGEGFCDVVLHFFET----NYFPQGVNTTAITLIPKRNGAD--RLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNIL
               E    ++L+ F+        P       ITLIPK  G D  R E++ PIS  ++  K +++IL +R++  +   +  +Q  FIPG     NI 
Subjt:  GAWTVVGEGFCDVVLHFFET----NYFPQGVNTTAITLIPKRNGAD--RLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNIL

Query:  LCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSR
            ++   +  + K    + +D +KA+D++   F+   L  IG    F+  + A  + P  +I++NG     F  R G RQG PLSP LF +VMEVL+ 
Subjt:  LCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSR

Query:  MLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS
         +         H   E+++L+   F DD++++      S + + E IK +  +S    N  KS  F+   N+     +  +   ++    ++YLG+ L  
Subjt:  MLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS

Query:  --RRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVH-SVL-RSLQVYWASVFMLPMKVHRDIDKILRAYLW
          + L   + + L + I   +  W     S+ GR+ +V  S+L +++  + A     P+   +D++KI+  ++W
Subjt:  --RRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVH-SVL-RSLQVYWASVFMLPMKVHRDIDKILRAYLW

P0C2F6 Putative ribonuclease H protein At1g65750    1.0e-29    25
Query:  LPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGG
        +P+L +R+       +++R++S +  W  + LSFAGRL L  +VL S+ V+  S  +LP  +   +D++ R +LW    E +    V W +VC P  EGG
Subjt:  LPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGG

Query:  LDIRDGSSWNIASTLKILWLLL-------------------VKSGRSLLEIDAGVSRSWCFREI-LRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPII
        L +R   S N A   K+ W LL                   ++  R L+      S +W  R I +  +D++   V    G+G++ R W   W+ G P++
Subjt:  LDIRDGSSWNIASTLKILWLLL-------------------VKSGRSLLEIDAGVSRSWCFREI-LRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPII

Query:  Q-QFGERV----------IYDAGSRWD-ARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIR----PHSSRV
        +   GER           ++  G  WD A++  +   +       V LDL      V G R      DR  W       F + S +E +     P  +  
Subjt:  Q-QFGERV----------IYDAGSRWD-ARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIR----PHSSRV

Query:  GWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVEL-SWIY-NQGIGK
         +   LW      +     WL     + T +   R   S    C +C G  ES  H+   C     IW R+   +    + G++   L  W+Y N G   
Subjt:  GWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVEL-SWIY-NQGIGK

Query:  SVRRKLWRLLWCATIYFIWQERNHRLHG
              W  ++   I++ W+ R   + G
Subjt:  SVRRKLWRLLWCATIYFIWQERNHRLHG

P11369 LINE-1 retrotransposable element ORF2 protein    3.8e-37    24
Query:  IDPDENRLTNHDQNISYRELSTSIEEIVQF----------RWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHF
        I  D   + N  ++   R  ST +E + +           +  ++    L SPI  +E+  V+ S+ + K+PGPDG+S  F++       E    ++   
Subjt:  IDPDENRLTNHDQNISYRELSTSIEEIVQF----------RWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHF

Query:  FE----TNYFPQGVNTTAITLIPK-RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRC
        F         P       ITLIPK +    ++E+F PIS  ++  K +++ILA+R++  + + +  +Q  FIPG     NI     ++   +  + K   
Subjt:  FE----TNYFPQGVNTTAITLIPK-RNGADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRC

Query:  TMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKV
         + +D +KA+D +   F+  +L   G    +++ ++A  + P+ +I +NG        + G RQG PLSP+LF +V+EVL+R +    +        E+V
Subjt:  TMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKV

Query:  RLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS--RRLQSSDCDPLIQRIT
        +++ L   DD++++ +   +S   +   I  FGE+     N  KS  FL   N      +      SI   +++YLG+ L    + L   +   L + I 
Subjt:  RLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLS--RRLQSSDCDPLIQRIT

Query:  SHIRSWSARVLSFAGRLQLVHSVLRSLQVYW--ASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILW
          +R W     S+ GR+ +V   +    +Y   A    +P +   +++  +  ++W  N++ R    +  D+       GG+ + D   +  A  +K  W
Subjt:  SHIRSWSARVLSFAGRLQLVHSVLRSLQVYW--ASVFMLPMKVHRDIDKILRAYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILW

P14381 Transposon TX1 uncharacterized 149 kDa protein    1.2e-33    26.28
Query:  QTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRIL
        + L++PI  +E+ + L  M   K+PG DG ++ FF+  W  +G  F  V+   F+    P       ++L+PK+     ++++ P+S  S  YK +++ +
Subjt:  QTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCISRIL

Query:  ADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSL
        + RL+  L   +  +Q   +PGR+I DN+ L ++L+  +    G     + +D +KA+D V+  +L G L A     +FV +++    S    + IN SL
Subjt:  ADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSL

Query:  EGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGV
               +G+RQG PLS  L+ + +E    +L          +    +R+    + DD +I    D   +   +E  + +   S    N  KSS  L G 
Subjt:  EGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIFLVGV

Query:  NSSKASWL-AANRDLSIGHLSVRYLGLPLLSRRLQ-SSDCDPLIQRITSHIRSWS--ARVLSFAGRLQLVHSVLRSLQVYWASVFMLP-MKVHRDIDKIL
         S K  +L  A RD+S     ++YLG+ L +     S +   L + + + +  W   A+VLS  GR  +++ ++ S Q+++  + + P  +    I + L
Subjt:  NSSKASWL-AANRDLSIGHLSVRYLGLPLLSRRLQ-SSDCDPLIQRITSHIRSWS--ARVLSFAGRLQLVHSVLRSLQVYWASVFMLP-MKVHRDIDKIL

Query:  RAYLWRGNEEGRGGAKVAWDEVCLPFDEGG
          +LW G      G         LP  EGG
Subjt:  RAYLWRGNEEGRGGAKVAWDEVCLPFDEGG

Arabidopsis top hits    e value    %identity    Alignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    2.4e-26    28.57
Query:  RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQ
        R+   +++  S SW +R + + +++ +  V  +VG+G   + W   W   GP+I   G       G   DA                  + L+D      
Subjt:  RSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQ

Query:  GVRPSPSVEDRWVWVPGSHDSFLITSEWET---IRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFF
                +D ++W    H    I S  +T   + P +  V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP  CLLC  + ESR HLFF
Subjt:  GVRPSPSVEDRWVWVPGSHDSFLITSEWET---IRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFF

Query:  SCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKAR
         C F   +W R     ++          L W+ N    K+    + RL + A +Y IW+ERN  LH    R    V + I+  I+AR
Subjt:  SCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKAR

AT1G43760.1 DNAse I-like superfamily protein    1.2e-38    28.93
Query:  LLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRV-S
        L++  SRG  +TW++    + ++ +LDR + N +  S++P+        G+SDHSP ++   N   +    FR+F+      +F+  ++ AW +   V S
Subjt:  LLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRV-S

Query:  PIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETN-SLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSV
         + ++  +L+  K   +    +    I    + A D+++  + ++ TN S S     H +    NF+ A    E   RQKS+I+WL+  D NT FFH+ +
Subjt:  PIVNIVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETN-SLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSV

Query:  RSRQSSNALRSVIDPDENRLTN-----------------HDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGF
         + Q+ N ++ +   D+ R+ N                  D +I   +    I++I  FR  +     L +    +E+   +F+M   KAPGPD ++  F
Subjt:  RSRQSSNALRSVIDPDENRLTN-----------------HDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGF

Query:  FKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCIS
        F  +W VV +     V  FF T +  +  N TAITLIPK  G D+L  F P+SCC+V+YK I+
Subjt:  FKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNGADRLEDFSPISCCSVIYKCIS

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    5.9e-33    32.14
Query:  ILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSR-----WDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGV-RPSP-SVEDR
        +L  + + +  VK  +GNGR    W   W   GP+I+  G     D GSR      +AR+V+ +G +G    L  S     I D +  +  PSP ++ED 
Subjt:  ILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSR-----WDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGV-RPSP-SVEDR

Query:  WVWVPGS--HDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI
        + WV G      F     W+ IRP +  + W+  +W    +PKH+F  W++  DRL TR RL+ W       C LC    ESRDHL FSC F  ++W   
Subjt:  WVWVPGS--HDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI

Query:  LLLMSSSHRI-GYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQL----IRSCIKAR
           +    R+   W   LSW+  +    S    L ++   A IY IW++RN+ LH      P+++F++    IR+ I +R
Subjt:  LLLMSSSHRI-GYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQL----IRSCIKAR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.2e-54    32.95
Query:  LVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILR
        + GV  +  + +  +   + G L VRYLGLPLL++++ +SD  PL+++I   I  W+AR LSFAGRLQL+ SV+ SL  +W S F LP    ++ID I  
Subjt:  LVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILR

Query:  AYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRD------GSSWNIASTLKILWLLLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRK
        ++LW G E     AKVAW +VC P DEGGL IR       GS W+I             SG + L        SW +++IL+ + +    VK ++ NG  
Subjt:  AYLWRGNEEGRGGAKVAWDEVCLPFDEGGLDIRD------GSSWNIASTLKILWLLLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRK

Query:  CRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPS--PSVEDRWVWVPGSHDSF---LITSE-WETIRP
           W   W + G +I   G R   D G    A + + +      R    +  L+ I D +  VR     S ED   W  G+ D F     T E W   R 
Subjt:  CRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPS--PSVEDRWVWVPGSHDSF---LITSE-WETIRP

Query:  HSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQG
           +V W   +W     PK+S  AW+AI++RL T DR+  W+     SC+LC    E+RDHLFF+C +  E+                            
Subjt:  HSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQG

Query:  IGKSVRRKLWRLLWCATIYFIWQERNHRLHG
                L R  +  T++ +W+ERN R HG
Subjt:  IGKSVRRKLWRLLWCATIYFIWQERNHRLHG

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    5.5e-31    30
Query:  ILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVS-------LDLMDIWDRVQGVRPSPSVEDRWVWVPG
        + +  +  EVG+G   + W   WI  GP+I+  G       G   DA + D + R   W W+  S       + L ++    QG+      +D ++W   
Subjt:  ILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGSRWDARLVDFMGRDGDWRWLLVS-------LDLMDIWDRVQGVRPSPSVEDRWVWVPG

Query:  SH---DSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMS
         H   + F     W  + P S  V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP  CLLC  + +SR HLFF C F   +W R     +
Subjt:  SH---DSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMS

Query:  SSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKAR
        + +        L+W+ +    K++   + RL + + +Y IW+ERN RLH    R    + + I+  I+AR
Subjt:  SSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKAR


Sequences
CDS sequence
ATGAAGCAATCGTCCAAGGAAGCTATGTCGGTGAAGAGAAGGGCGTTGACGAGCCTTCAACGTGTTCATGACTTGGCGGCTAGTGAACGCAGAGCGGCAGCGGCGAACGG
TGACCTAACTACAAGGATGAAACTGTCTGAAATAGAACCGGTCGAGATGAACCATGGTCCAAATGGGATTGAGAAGGAGATCTTACATGATGGGCCAAATACAACTGACG
GACCTTCCACCAATAGAAATGGGTCTGGGATGGAGGCTTCTACAAGGATTGCAAAGAATTCAGGGGGTGAAGGCAAAAAACCGAATTCTTGGGCTTCCTTATTTGGTTCA
ACAAGTGGGACCTCCTTGACCTACACTCCTCCAACTACCGTAGGAGAGAAATTTGTGGTGACACCGTCGGAAGAAGTTATTAGTCAAGGAGTTAGTGTTCCAGGTATTGT
TCTTGAAACCTTTGTTTTTGATTCAGTACCTGTTTGGATAAAGTTAGAGAGGATCCCTTTGGAGTTATGGACATATGCTGGTTTGGCTATTATAGCGAAAAGACGTCGTT
TGTCTTATGTTCGTATATGTGTGGAATTGAATGTGGAAAGTAGTATGTCTGCCGAAATAACTTTCAACCTTAGAGGAGCAAAATTCATTGTAACCGTTGCTTACGAGTGG
AAACTGAGGAAATGTAATTTGTGTAGATCATTTGGACACTTGTCGAGCACCTGCCCTAAAATTGAAGTTAGTAAAAAGGAAGCTGTAAGTAAAGAGGATCCTGTTCAAGA
GATAGTTCCTACTAAAGAGGTGGTTACGACGTGTGAGAAGTTTGGCGATGTTGTGAAAAATAGGGGGATGATCTCTATTAGGGACAAAAGGAAAAGGGCGGAAGTGAGTA
TCACTAACTCCTTTAACAACCTCATGGAGGTGGATAAAGGAGATAAATGGCCTTTATCTATAGTGGATGGCTCCCCTCCTCCCTTACGAGTAGATGACTCTGCTATGGTC
CTGTTGAGTACTACAGGTGATGTAATTCCTATGGAAGAGCCGTGTCTACGGTGGGTTTCTGTTGCATTCTGGAGACTAGAGTTTGAGAGGAGAACTTTGTCTCCATTTCC
AGCAAATTTGGTTGCCGACCAGTTTATTTATGGTGTGGTAACAGATTTGATCTCTGGGGAGAAGGTAGAGGTTTTGTGTGTGTATGCCTCTAATAGTAATATTGAGAGAC
GTGTTTTATGGAGGCGAATGGCTGAGATCTCTGCTGGTTGGAGAGGGCCAGACTCCACTCTGAAGCCTTCGGGGGTGCTCCGAACATGGGGGACATGGAGGAGTTTGACA
TGGCTATTCGAGAGGCGGACCTTGTTGAACACGCGGTCTAGGGGAACTGGGTTTACTTGGACTAGTAAAATACATGGGTCGGGTTTGATGAATAGACTTGATCGTATCCT
AGTGAATGATGAGGGGCTTAGTACATGGCCTAACATGAGGGTTAACGTCCTCCCGTGGGGTATTTCTGATCATTCTCCCATACTTGTCTATCCCAGTAATCAGCGAAGCC
AACAAGTGGTCTCTTTTCGTTTCTTTAACCATTGGGTTGAAGAAGCGTCCTTTATGGATGTTGTGTCCTCTGCTTGGACCAAAGATACTAGAGTTTCTCCAATTGTGAAT
ATTGTGAGGAACTTAAGAAATCTCAAGTCGATTCTTCGCAGACATTTTGGTAGGCATATCCGGACCATCAGTGAGGATGTTCGTCTTGCCAATGATACCATGGACCAAGC
TAAAAGAGAGATGGAGACGAACTCTCTGTCGGAGGAGGCGAGTAATCACGCGAGCTTAGCCACGGTAAACTTTTGGAAAGCGGTTAGAGTGAAGGAAGTCGCTATGCGCC
AGAAGTCGCAAATCAGATGGTTGAAGCTAGATGACCAGAATACTGCCTTTTTTCATCGATCTGTTCGATCAAGGCAGAGCAGCAATGCTTTGAGATCAGTTATTGACCCA
GATGAAAATCGATTGACTAACCATGATCAGAATATTAGCTATAGAGAGCTCTCTACTAGTATTGAGGAGATTGTTCAGTTTAGATGGACTGAGGAGTGTTGCCAGACCCT
CCAGTCGCCTATTGGCAGGGAGGAAGTGAGACGTGTTCTGTTCTCCATGGATAGTGGAAAGGCTCCAGGCCCTGATGGGTATTCAGTTGGCTTCTTCAAAGGAGCTTGGA
CGGTGGTTGGAGAGGGTTTCTGTGATGTCGTCTTACACTTCTTTGAGACCAATTACTTCCCTCAAGGGGTGAATACGACTGCTATTACACTTATTCCTAAAAGGAATGGT
GCTGATCGATTGGAGGATTTCAGCCCTATATCTTGTTGCAGCGTTATTTATAAGTGCATTTCAAGAATATTGGCAGATAGGCTTCGTGTGTGGCTTCCTTCTTTTGTTAG
TGGAAATCAGCCAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTCTTTGTCAGGAGCTTGTAGGGGCATACCATTTGCACAGAGGTAAACCTCGGTGCACTA
TGAAGGTTGACCTCCAAAAAGCTTATGATTCTGTTAATTGGGATTTCCTCTTTGGCCTGCTGATTGCCATAGGTACGCCTTTAAGATTTGTGAGTTGGGTTCGAGCGTGT
GTGACCTCTCCGATGTTCTCCATTATGATTAATGGATCGTTGGAAGGTTTTTTCCATGGGAGGAAAGGACTTAGACAAGGTGACCCTTTATCCCCGTTCTTATTTGTGAT
GGTCATGGAGGTGCTTTCTCGCATGTTGAACAACCCACCTCAGAATTTTCAATTCCACCAGTTTTGTGAGAAGGTCAGATTAACTCATCTTACTTTTGTAGATGACCTGA
TGATCTTTTGTACTGCTGATAATCATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTTTCGAGACTGTTTGCTAATCTTCCTAAAAGCTCAATTTTT
CTTGTGGGGGTTAATAGTTCGAAAGCTTCTTGGCTTGCTGCTAACAGGGATTTGTCCATTGGTCATCTCTCTGTTCGTTATCTTGGGCTTCCTCTCCTCTCTAGAAGATT
GCAGAGCTCTGATTGTGATCCCCTTATTCAGCGTATTACCAGTCATATTCGGTCTTGGTCTGCTAGAGTGTTATCTTTTGCAGGTAGACTTCAGCTTGTTCACTCAGTCC
TTAGGAGCCTTCAGGTTTATTGGGCTAGTGTGTTCATGCTTCCTATGAAAGTCCACAGAGACATTGATAAGATCTTGAGGGCTTATTTGTGGAGAGGTAACGAGGAGGGA
AGAGGTGGTGCTAAAGTTGCCTGGGATGAGGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGATATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTGAAGATATT
ATGGTTGCTACTTGTTAAATCTGGGAGATCGCTCTTGGAGATCGATGCTGGGGTGAGTCGATCTTGGTGCTTTAGGGAAATCTTGCGTAAGCAGGATATCCTTAAAGCTC
ATGTTAAGATGGAGGTGGGCAATGGAAGGAAGTGTAGAGTGTGGTTGGTTCCATGGATTCAGGGTGGGCCGATTATCCAGCAGTTTGGGGAGAGGGTGATCTATGATGCG
GGTAGTCGGTGGGATGCGAGGCTGGTGGATTTCATGGGTCGGGATGGTGATTGGAGGTGGCTGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGGGTTCAGGGAGT
GAGGCCGAGTCCGAGTGTTGAGGATAGGTGGGTCTGGGTGCCGGGGAGTCATGATAGTTTTTTGATCACCAGTGAGTGGGAGACTATTCGTCCTCATAGTAGTAGAGTTG
GCTGGTCGGGTTTACTATGGGGTGGGGAAAATATTCCTAAGCACTCCTTTTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTACTAGAGATAGGTTAAGTCGGTGGGAT
AGGTCGATTCCTTTATCGTGTTTGCTTTGTGGAGGGAACTATGAGTCTCGTGATCATTTGTTTTTTTCTTGTCATTTTGGGTGGGAGATTTGGTCAAGGATCCTTTTGCT
TATGTCATCTTCTCATAGAATCGGTTATTGGGGGGTTGAGTTATCTTGGATTTATAATCAGGGTATTGGGAAGAGTGTGAGGAGAAAACTGTGGCGCCTTCTCTGGTGTG
CTACAATTTATTTCATTTGGCAGGAGCGAAATCATCGTCTTCATGGAGTTGCTATTCGGGAGCCTATGGTTGTATTCCAGCTCATTCGGTCGTGTATTAAAGCGCGTGCT
GCTTCTTGGTCGGATGGTGTTCATGGTCTTATTTAA
mRNA sequence
ATGAAGCAATCGTCCAAGGAAGCTATGTCGGTGAAGAGAAGGGCGTTGACGAGCCTTCAACGTGTTCATGACTTGGCGGCTAGTGAACGCAGAGCGGCAGCGGCGAACGG
TGACCTAACTACAAGGATGAAACTGTCTGAAATAGAACCGGTCGAGATGAACCATGGTCCAAATGGGATTGAGAAGGAGATCTTACATGATGGGCCAAATACAACTGACG
GACCTTCCACCAATAGAAATGGGTCTGGGATGGAGGCTTCTACAAGGATTGCAAAGAATTCAGGGGGTGAAGGCAAAAAACCGAATTCTTGGGCTTCCTTATTTGGTTCA
ACAAGTGGGACCTCCTTGACCTACACTCCTCCAACTACCGTAGGAGAGAAATTTGTGGTGACACCGTCGGAAGAAGTTATTAGTCAAGGAGTTAGTGTTCCAGGTATTGT
TCTTGAAACCTTTGTTTTTGATTCAGTACCTGTTTGGATAAAGTTAGAGAGGATCCCTTTGGAGTTATGGACATATGCTGGTTTGGCTATTATAGCGAAAAGACGTCGTT
TGTCTTATGTTCGTATATGTGTGGAATTGAATGTGGAAAGTAGTATGTCTGCCGAAATAACTTTCAACCTTAGAGGAGCAAAATTCATTGTAACCGTTGCTTACGAGTGG
AAACTGAGGAAATGTAATTTGTGTAGATCATTTGGACACTTGTCGAGCACCTGCCCTAAAATTGAAGTTAGTAAAAAGGAAGCTGTAAGTAAAGAGGATCCTGTTCAAGA
GATAGTTCCTACTAAAGAGGTGGTTACGACGTGTGAGAAGTTTGGCGATGTTGTGAAAAATAGGGGGATGATCTCTATTAGGGACAAAAGGAAAAGGGCGGAAGTGAGTA
TCACTAACTCCTTTAACAACCTCATGGAGGTGGATAAAGGAGATAAATGGCCTTTATCTATAGTGGATGGCTCCCCTCCTCCCTTACGAGTAGATGACTCTGCTATGGTC
CTGTTGAGTACTACAGGTGATGTAATTCCTATGGAAGAGCCGTGTCTACGGTGGGTTTCTGTTGCATTCTGGAGACTAGAGTTTGAGAGGAGAACTTTGTCTCCATTTCC
AGCAAATTTGGTTGCCGACCAGTTTATTTATGGTGTGGTAACAGATTTGATCTCTGGGGAGAAGGTAGAGGTTTTGTGTGTGTATGCCTCTAATAGTAATATTGAGAGAC
GTGTTTTATGGAGGCGAATGGCTGAGATCTCTGCTGGTTGGAGAGGGCCAGACTCCACTCTGAAGCCTTCGGGGGTGCTCCGAACATGGGGGACATGGAGGAGTTTGACA
TGGCTATTCGAGAGGCGGACCTTGTTGAACACGCGGTCTAGGGGAACTGGGTTTACTTGGACTAGTAAAATACATGGGTCGGGTTTGATGAATAGACTTGATCGTATCCT
AGTGAATGATGAGGGGCTTAGTACATGGCCTAACATGAGGGTTAACGTCCTCCCGTGGGGTATTTCTGATCATTCTCCCATACTTGTCTATCCCAGTAATCAGCGAAGCC
AACAAGTGGTCTCTTTTCGTTTCTTTAACCATTGGGTTGAAGAAGCGTCCTTTATGGATGTTGTGTCCTCTGCTTGGACCAAAGATACTAGAGTTTCTCCAATTGTGAAT
ATTGTGAGGAACTTAAGAAATCTCAAGTCGATTCTTCGCAGACATTTTGGTAGGCATATCCGGACCATCAGTGAGGATGTTCGTCTTGCCAATGATACCATGGACCAAGC
TAAAAGAGAGATGGAGACGAACTCTCTGTCGGAGGAGGCGAGTAATCACGCGAGCTTAGCCACGGTAAACTTTTGGAAAGCGGTTAGAGTGAAGGAAGTCGCTATGCGCC
AGAAGTCGCAAATCAGATGGTTGAAGCTAGATGACCAGAATACTGCCTTTTTTCATCGATCTGTTCGATCAAGGCAGAGCAGCAATGCTTTGAGATCAGTTATTGACCCA
GATGAAAATCGATTGACTAACCATGATCAGAATATTAGCTATAGAGAGCTCTCTACTAGTATTGAGGAGATTGTTCAGTTTAGATGGACTGAGGAGTGTTGCCAGACCCT
CCAGTCGCCTATTGGCAGGGAGGAAGTGAGACGTGTTCTGTTCTCCATGGATAGTGGAAAGGCTCCAGGCCCTGATGGGTATTCAGTTGGCTTCTTCAAAGGAGCTTGGA
CGGTGGTTGGAGAGGGTTTCTGTGATGTCGTCTTACACTTCTTTGAGACCAATTACTTCCCTCAAGGGGTGAATACGACTGCTATTACACTTATTCCTAAAAGGAATGGT
GCTGATCGATTGGAGGATTTCAGCCCTATATCTTGTTGCAGCGTTATTTATAAGTGCATTTCAAGAATATTGGCAGATAGGCTTCGTGTGTGGCTTCCTTCTTTTGTTAG
TGGAAATCAGCCAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTCTTTGTCAGGAGCTTGTAGGGGCATACCATTTGCACAGAGGTAAACCTCGGTGCACTA
TGAAGGTTGACCTCCAAAAAGCTTATGATTCTGTTAATTGGGATTTCCTCTTTGGCCTGCTGATTGCCATAGGTACGCCTTTAAGATTTGTGAGTTGGGTTCGAGCGTGT
GTGACCTCTCCGATGTTCTCCATTATGATTAATGGATCGTTGGAAGGTTTTTTCCATGGGAGGAAAGGACTTAGACAAGGTGACCCTTTATCCCCGTTCTTATTTGTGAT
GGTCATGGAGGTGCTTTCTCGCATGTTGAACAACCCACCTCAGAATTTTCAATTCCACCAGTTTTGTGAGAAGGTCAGATTAACTCATCTTACTTTTGTAGATGACCTGA
TGATCTTTTGTACTGCTGATAATCATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTTTCGAGACTGTTTGCTAATCTTCCTAAAAGCTCAATTTTT
CTTGTGGGGGTTAATAGTTCGAAAGCTTCTTGGCTTGCTGCTAACAGGGATTTGTCCATTGGTCATCTCTCTGTTCGTTATCTTGGGCTTCCTCTCCTCTCTAGAAGATT
GCAGAGCTCTGATTGTGATCCCCTTATTCAGCGTATTACCAGTCATATTCGGTCTTGGTCTGCTAGAGTGTTATCTTTTGCAGGTAGACTTCAGCTTGTTCACTCAGTCC
TTAGGAGCCTTCAGGTTTATTGGGCTAGTGTGTTCATGCTTCCTATGAAAGTCCACAGAGACATTGATAAGATCTTGAGGGCTTATTTGTGGAGAGGTAACGAGGAGGGA
AGAGGTGGTGCTAAAGTTGCCTGGGATGAGGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGATATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTGAAGATATT
ATGGTTGCTACTTGTTAAATCTGGGAGATCGCTCTTGGAGATCGATGCTGGGGTGAGTCGATCTTGGTGCTTTAGGGAAATCTTGCGTAAGCAGGATATCCTTAAAGCTC
ATGTTAAGATGGAGGTGGGCAATGGAAGGAAGTGTAGAGTGTGGTTGGTTCCATGGATTCAGGGTGGGCCGATTATCCAGCAGTTTGGGGAGAGGGTGATCTATGATGCG
GGTAGTCGGTGGGATGCGAGGCTGGTGGATTTCATGGGTCGGGATGGTGATTGGAGGTGGCTGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGGGTTCAGGGAGT
GAGGCCGAGTCCGAGTGTTGAGGATAGGTGGGTCTGGGTGCCGGGGAGTCATGATAGTTTTTTGATCACCAGTGAGTGGGAGACTATTCGTCCTCATAGTAGTAGAGTTG
GCTGGTCGGGTTTACTATGGGGTGGGGAAAATATTCCTAAGCACTCCTTTTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTACTAGAGATAGGTTAAGTCGGTGGGAT
AGGTCGATTCCTTTATCGTGTTTGCTTTGTGGAGGGAACTATGAGTCTCGTGATCATTTGTTTTTTTCTTGTCATTTTGGGTGGGAGATTTGGTCAAGGATCCTTTTGCT
TATGTCATCTTCTCATAGAATCGGTTATTGGGGGGTTGAGTTATCTTGGATTTATAATCAGGGTATTGGGAAGAGTGTGAGGAGAAAACTGTGGCGCCTTCTCTGGTGTG
CTACAATTTATTTCATTTGGCAGGAGCGAAATCATCGTCTTCATGGAGTTGCTATTCGGGAGCCTATGGTTGTATTCCAGCTCATTCGGTCGTGTATTAAAGCGCGTGCT
GCTTCTTGGTCGGATGGTGTTCATGGTCTTATTTAATGTTCTTTTCGTTTGCTTGTCCCCGGGCTGTGGGGTTTCTCTTCTCTTTATTAACTTTGTTGTCTTTCTCTCTT
CTTCTTGTCCTTTTAATGTTTGGAACACTAGTTGGTTTCTTTTGGTTTTAGTTCTTGTTCCCGAGATGTGGGGTTGTTTTGGGTACTTATGGGTTGTTTTGTCTAACTAC
TTGTTCTATGAGTGTTGTTCGCTCTTTTGTCTTGACCTCAGGCTGTGAGGTCCACCTTGTTGTTGTATAATATTATCATTACCTTTTCAAAAAAAAAATCAAACTCAATG
CTAAGTTTGACTGCTCCCTTCGAAGTGACAACAAGCCTCGTTTTCGCCCTACTGTGGATCGAGGAGGATTCCTCCTCTTTTCAAAGCCCTCAATCAAACTTGACGTTATG
TTTAAAACACTAAAAATAGATGGTCTAGTTTTGGTTAGGCCAAAAGCGATGTTTTCCCCCAAAAGCTCTAATAGCTAGGGAACTCAACATGTTCTACTTGACCTAACCAA
CTTGACTCTGTATTTTTTATGAGCTTTTTTTAATGCATATTGCATATGTTTTTTTTTATAATTGTTGATTGTCTCACTTTTTTATGGTCTGCATTTACGAGCCTCTTTTC
TTTCTTTGTTATGAACTGCGGTAACTAGGCATTGCTAGAACCGATGGGGTAACATTTCTATGGTTCTACACTTAGTTTCTAAGTGTGATTGCTCACCTCTAATTTTCAAA
ACACTTAATTTCTAAGCACATACAATGTTCACCTAGAAAAAAATGAAAATTGTTGTAGTTAAGATCAAAACTAAATAGTTATGAAATAAGACATAAGAGATTTGATTTTA
ACAACCTTCCCGTTGTACTTGGGATCCACTCTGGCTTTGTGCTGGAAAATTGGTTTAGAGCTGATCATAAGCTTTCGGTTGCTCTTCTGTAACCTTTAGATACTCTTCAA
AGCTATAAGCTTTGCTTCCTGACGAAGAATCTTCTACTATTACAATACATAAGAAAATTCTATTTCATAAACCCTGTATCATAATTGAGGCTGATAGAATGATTATTCTA
TAACCCTACCGATAGTTTCATTCTTAACATTCATAGACACGATGCATGGCAGCTATCCAATTCTATTTCATGTTCTCGAGAATGGCTCGGTTTAACAATAATGTTAATTT
TTGTTTTCTATTTTAAAATAATAACGCTGCTTACTAAT
Protein sequence
MKQSSKEAMSVKRRALTSLQRVHDLAASERRAAAANGDLTTRMKLSEIEPVEMNHGPNGIEKEILHDGPNTTDGPSTNRNGSGMEASTRIAKNSGGEGKKPNSWASLFGS
TSGTSLTYTPPTTVGEKFVVTPSEEVISQGVSVPGIVLETFVFDSVPVWIKLERIPLELWTYAGLAIIAKRRRLSYVRICVELNVESSMSAEITFNLRGAKFIVTVAYEW
KLRKCNLCRSFGHLSSTCPKIEVSKKEAVSKEDPVQEIVPTKEVVTTCEKFGDVVKNRGMISIRDKRKRAEVSITNSFNNLMEVDKGDKWPLSIVDGSPPPLRVDDSAMV
LLSTTGDVIPMEEPCLRWVSVAFWRLEFERRTLSPFPANLVADQFIYGVVTDLISGEKVEVLCVYASNSNIERRVLWRRMAEISAGWRGPDSTLKPSGVLRTWGTWRSLT
WLFERRTLLNTRSRGTGFTWTSKIHGSGLMNRLDRILVNDEGLSTWPNMRVNVLPWGISDHSPILVYPSNQRSQQVVSFRFFNHWVEEASFMDVVSSAWTKDTRVSPIVN
IVRNLRNLKSILRRHFGRHIRTISEDVRLANDTMDQAKREMETNSLSEEASNHASLATVNFWKAVRVKEVAMRQKSQIRWLKLDDQNTAFFHRSVRSRQSSNALRSVIDP
DENRLTNHDQNISYRELSTSIEEIVQFRWTEECCQTLQSPIGREEVRRVLFSMDSGKAPGPDGYSVGFFKGAWTVVGEGFCDVVLHFFETNYFPQGVNTTAITLIPKRNG
ADRLEDFSPISCCSVIYKCISRILADRLRVWLPSFVSGNQPAFIPGRSIIDNILLCQELVGAYHLHRGKPRCTMKVDLQKAYDSVNWDFLFGLLIAIGTPLRFVSWVRAC
VTSPMFSIMINGSLEGFFHGRKGLRQGDPLSPFLFVMVMEVLSRMLNNPPQNFQFHQFCEKVRLTHLTFVDDLMIFCTADNHSMSFIKETIKRFGELSRLFANLPKSSIF
LVGVNSSKASWLAANRDLSIGHLSVRYLGLPLLSRRLQSSDCDPLIQRITSHIRSWSARVLSFAGRLQLVHSVLRSLQVYWASVFMLPMKVHRDIDKILRAYLWRGNEEG
RGGAKVAWDEVCLPFDEGGLDIRDGSSWNIASTLKILWLLLVKSGRSLLEIDAGVSRSWCFREILRKQDILKAHVKMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDA
GSRWDARLVDFMGRDGDWRWLLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSHDSFLITSEWETIRPHSSRVGWSGLLWGGENIPKHSFCAWLAIRDRLGTRDRLSRWD
RSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRILLLMSSSHRIGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERNHRLHGVAIREPMVVFQLIRSCIKARA
ASWSDGVHGLI