CuGenDBv2

Lag0025263 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025263
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:10598931..10602786
RNA-Seq Expression: Lag0025263
Synteny: Lag0025263
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
                  IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (columns: hit, e value, %identity, alignment)
PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis] (e value: 8.0e-181; %identity: 37.49)
Query:  RKKLNWKRRARMGHINESSTQDQLSKKMKSSSE---LEEGKKRTRLEGASLNHIDSKVCWKG--KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWL
        R++L WK       +N S  + +  K +  S     L     +  L+  S NHID  +  +G  + WRF G+YGFP+A  + +TWNL++ L    N PW+
Subjt:  RKKLNWKRRARMGHINESSTQDQLSKKMKSSSE---LEEGKKRTRLEGASLNHIDSKVCWKG--KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWL

Query:  LGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCR
        +GGD+NE+    +K GG  R+  L+ + + +L  CEL +++F G  FTW+G R G ++   LDRF C+L +  LF  +  R+LD   SDH PI   V  +
Subjt:  LGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCR

Query:  RMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQ----VSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIH-NLEF
        +   +K  ++ FKFEE W   E C++++K+   W+    V    +L   + S   AL  W       +R +I + +  L   YD++           L+ 
Subjt:  RMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQ----VSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIH-NLEF

Query:  ELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMN
        +L+ LL +E++FW+QR++  WL  GD N+ +FH++   R++ N ++G+ + +G W+ +   +ED  + YF  +F SS PE  +  + L G+   VS+  N
Subjt:  ELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMN

Query:  EKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAI
          L     K ++  AI +M+P+K+PGPDGFS  FFQ +W  VG   V    E   + +SL   N T + LIPKV  P+ +    PISLCNV YKI +K +
Subjt:  EKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAI

Query:  ANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLING
        ANRLK +L  +IS  QSAF+ GRLI+DN ++  E  H +K  R+       +KLD+SKA+DRVEW+FLE +M K+GF   WI  IM C++T ++S +ING
Subjt:  ANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLING

Query:  IPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAI
         P+G +IPSRGLRQGD +SPYLFLL +E LS  I  A   G+L GV     +PS+SHL FADD+ +F +A   +   +K+LL  YE  SG+ +N+ KS I
Subjt:  IPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAI

Query:  LFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCC
         F ++++   +  +++IL V +V     YLG+P  LS SK + F F+  KIR   QGWR    S A KE+LIK+V QAIPSYV+S F+ P+ LC E+   
Subjt:  LFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCC

Query:  FAQFWWGLSEVKRKIHW---------------------------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQG
         AQFWWG     RKIHW                                             LL+ KYFP  S LEA+L    S+ W+S+L GR++L +G
Subjt:  FAQFWWGLSEVKRKIHW---------------------------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQG

Query:  IRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV---FSQALVADFISE-SGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS
        +R +VG+GESI  + DPW+P     RP   +PV     +  VAD I   +  W    + E     E++ +  IP+ LR +   ++  +     YS
Subjt:  IRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV---FSQALVADFISE-SGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 2.5e-182; %identity: 35.17)
Query:  LEGASLNHIDSKV-CWKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYE---------------------------------
        ++  S +HID+ V    GKIWR  G+YG  EA+ K+ TW L++ L  L +  W   GDFNE+L+ +E                                 
Subjt:  LEGASLNHIDSKV-CWKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYE---------------------------------

Query:  ------------------------------------------------------------------------------------KYGGPRRASHLLEEFR
                                                                                            K GG  R+S+++ EF+
Subjt:  ------------------------------------------------------------------------------------KYGGPRRASHLLEEFR

Query:  SSLNDCELKEMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPI--EASVDCRRMVWRKSGRRPFKFEEFWTHYEACED
         S+  C L +M F G  FTW   R G+  I E LDR +C+ ++ S F    + +L    SDH PI  E  V C+++ ++K+      +E+ W+ YEAC +
Subjt:  SSLNDCELKEMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPI--EASVDCRRMVWRKSGRRPFKFEEFWTHYEACED

Query:  IIKTH------GDWQ--VSSASVLAKNLNSCSEALSKWDSDVR----NSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSRE
        I+++         W+  V     +AK   +  +  SK + + R    N +  ++K  KQ    A D         I  LE ++  +L +EE++WKQRSR 
Subjt:  IIKTH------GDWQ--VSSASVLAKNLNSCSEALSKWDSDVR----NSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSRE

Query:  DWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDM
        DWL  GDKN+ +FH KAS RR+ N+I G+ D +G+W +DP  IE  F  +F  +F SS P +++I +AL+G+ P+VSQ+MN  L   FT  DI +A+++M
Subjt:  DWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDM

Query:  YPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAF
         PTKAPGPDG  A FFQK+W  VG    + CL +LN   +L   N T I LIPKV  P++V +F PISLCNV Y+I  KAIANRLK IL  IIS  QSAF
Subjt:  YPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAF

Query:  IQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLS
        I  RLITDN+I+G+ECLH I+ ++     +  +KLD+SKA+DRVEW FLE+ M  LGF  +WISLIM CITT  FSVLING P G I P RGLRQG PLS
Subjt:  IQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLS

Query:  PYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILN
        PYLF+L AE  S L+++A  +  + G L      +++HLLFADD+LVF KA+ ++  ++K +   Y   SG+  NF KS++ F    ++++   + SI  
Subjt:  PYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILN

Query:  VNQVKDLGSYLGVPSSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW--
        +  V     YLG+P  L R+K   F  V  K+   +  W   LFS   KEILIK+V QA+P+Y +S+FK PK LC++I    A+FWWG  + K  IHW  
Subjt:  VNQVKDLGSYLGVPSSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW--

Query:  -------------------------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWI
                                                   +++ +Y+ + +   A++G + SF W+S+LWG +++ +G+R R+G+G+ ++ ++D WI
Subjt:  -------------------------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWI

Query:  PRESLLRPLCLNPVFSQALVADFISESGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS
        PR +  +P+    +  + +VAD I     W    L +    ++I+ I +I +   K    +L  ++ +  YS
Subjt:  PRESLLRPLCLNPVFSQALVADFISESGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS

XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber] (e value: 1.0e-180; %identity: 38.19)
Query:  ASLNHIDSKVC-WKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTW
        +S NHID+ V     + WRF G+YG  +   K +TW LIR L+   + PWL  GDFNE+LW +EK G   R   L+  FR  L++C L ++ F G  FTW
Subjt:  ASLNHIDSKVC-WKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTW

Query:  KGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA----SVLAKN
        +G R G  + E LDR + +  + +LF     R+L+   SDH+ I  +++           RPFKFE+ W   E C + I +   W  SS      ++A+ 
Subjt:  KGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA----SVLAKN

Query:  LNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPH--MDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGI
        +  C   L+ W       +R  I+   + L  A ++      D+  ++ L+ EL+ LL++E + W+QR+R  +L  GD+N+ +FH KAS R + N+I G+
Subjt:  LNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPH--MDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGI

Query:  RDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ
        R++  +W  D   + D   +YF  +F +S P +  +V  L  + P V+Q+MN +LL  F K ++  A+N M    APGPDG   +F+ K+WN +G     
Subjt:  RDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ

Query:  ECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKD
          L+ LNN    SE N+TNI LIPKV +P+ + D+ PISLCNV YK+ +K +ANR K +L  +ISE QSAF  GRLITDNI++ +E LH +K ++     
Subjt:  ECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKD

Query:  MATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLC
           +KLD+SKA+DRVEW+F+EE+M KLGFD RWI+LI+ CI+T ++SVLING+P   I PSRGLRQGDPLSPYLFL+ +EGL  LI +A     + GV  
Subjt:  MATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLC

Query:  SPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVL
            P ++HL FADD+LVFC+A+  E   ++ LL +YE  SG+ +N NK+++ F +S     +  +   L V  +K    YLG+PS + ++K     F+ 
Subjt:  SPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVL

Query:  NKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW-------------------------------
         ++   +QGW+  L S A +EIL+K+V QAIP++ +S FK P +LC +I     +FWWG    +RKIHW                               
Subjt:  NKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW-------------------------------

Query:  --------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLC-LNPVFSQALVADFIS-ES
                        + K+FP+GSIL+A+ G   SF WKS+L GR ++ +G++ RVGNG +I  ++D W+P     + +  LN +   A V+  I  + 
Subjt:  --------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLC-LNPVFSQALVADFIS-ES

Query:  GAWNESLLIEAVGIDEIDIIRRIPIDL
          WNE ++       +   I+ IP+ L
Subjt:  GAWNESLLIEAVGIDEIDIIRRIPIDL

XP_023927486.1 uncharacterized protein LOC112038880 [Quercus suber] (e value: 6.1e-181; %identity: 37.67)
Query:  LEGASLNHIDSKVCWKGK--IWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGS
        ++ +SLNHID  V  KGK   WRF G+YG PEA+ K +TWNL+R+LH     PWL  GDFNE+L  YEK GG  R+   + EFR  ++DC   ++ + G 
Subjt:  LEGASLNHIDSKVCWKGK--IWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGS

Query:  PFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTH--GDWQVSSASVLA
         ++W+G R    + E LDR +    + +L        L +  SDH PI   ++   +  R    +PF+FE  W     C + +KT     + +S++ ++ 
Subjt:  PFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTH--GDWQVSSASVLA

Query:  KNLNSCSEALSKWDSDVRNSMRTKIKECKQAL-KAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISG
        + +  C E L +W      S++ +++E  + L KA  + A   D  ++  L  E++ LL++E + W+QR+R   L  GD+N+ +FH KAS R + N I G
Subjt:  KNLNSCSEALSKWDSDVRNSMRTKIKECKQAL-KAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISG

Query:  IRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRI-VDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVT
        + D   SW  D A + D  + ++  +F   T E+S I +  L  + P V+++MN  L   FTK +++ A+ +M P KAPGPDG   +FFQ +W  +G   
Subjt:  IRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRI-VDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVT

Query:  VQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVI
         +  L+ LN+     E+N T + LIPKV NP+++ +F PISLCNV YK+ +K +AN LK +L  I+SE QSAF  GR+ITDNI++  E LH +K  +T  
Subjt:  VQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVI

Query:  KDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGV
             +KLD+SKA+DRVEW FL+ ++ K+GF  RW+ L+M CITT ++S+LING P   I PSRGLRQGDPLSPYLFLL  EGL  LIS+A   G++ G+
Subjt:  KDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGV

Query:  LCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAF
            + P ++HL FADD+L+FC+A+  +  H++ LL+ Y   SG+ +N  K+ + F ++ +++ +  +  +L V ++K    Y G+PS + R K    A+
Subjt:  LCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAF

Query:  VLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWL----------------------------
        + ++I   +QGW++ L S A +E+L+K+V QAIP+Y +S FK P +LC EI     +FWWG    +R+IHW+                            
Subjt:  VLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWL----------------------------

Query:  -----------------LRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNP-VFSQALVADFISE
                          + KYFP GSI +A+     SF WKS+L GREL+++G++ R+GNG  +  F D W+P   L R     P   + ALV+  I+ 
Subjt:  -----------------LRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNP-VFSQALVADFISE

Query:  -SGAWNESLLIEAVGIDEIDIIRRIPIDL
            W E+ +      +E  II+ IP+ L
Subjt:  -SGAWNESLLIEAVGIDEIDIIRRIPIDL

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] (e value: 1.0e-180; %identity: 37.74)
Query:  NHIDSKVCWKG--KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWKG
        NHID  +   G    WRF G+YG  +  L++ TW LI  +    + PWL+GGDFNE+L   EK GGP R +  +E FR  +  C L ++ F G  FTW+G
Subjt:  NHIDSKVCWKG--KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWKG

Query:  NRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGR-RPFKFEEFWTHYEACEDIIKTHGDWQVSSAS----VLAKNL
         R G +I   LDRF+    +  LF  +   +L    SDH PI   V+ R  + RK  R R F+FEE W H   C +++K    W+  + +     +   +
Subjt:  NRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGR-RPFKFEEFWTHYEACEDIIKTHGDWQVSSAS----VLAKNL

Query:  NSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNA----PHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISG
            +AL  W       ++ +I+  +  L   YD +    P  + L    LE +L+ LL  E  +W+QRSR  WL  GD N+ +FH +AS R++ N ISG
Subjt:  NSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNA----PHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISG

Query:  IRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTV
        + + +G W  + + +E+  + YF  +F +S+P+   +   L      V+  MN +L+  F + +I +A+N M+P KAPGPDGFS +F+Q+YW+ VG   +
Subjt:  IRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTV

Query:  QECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIK
              +N+   L E N T + LIPKV   + +    PISLCNV YK+ +K +ANRLK +L+DII+  QSAF+ GR I+DN ++  E  H +K       
Subjt:  QECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIK

Query:  DMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVL
            +KLD+SKA+DRVEW F+E +M  +GFD+ WI  IMGC+TT ++S L+NG P+G +IP+RGLRQGD +SPYLFLL AEGLS ++S    +  L G+ 
Subjt:  DMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVL

Query:  CSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFV
         +  +PS++HL FADD+ VF KA   E   +K +L  YE  SG+ +NF KS I F ++++   +  L+ +  V +V     YLG+P+ +S SK++ F F+
Subjt:  CSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFV

Query:  LNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWL-----------------------------
        + K R  M+ W+    S+A KE++IKSV Q++P+YV+S F+ PK LC+E+  C A+FWWG SE  RKIHWL                             
Subjt:  LNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWL-----------------------------

Query:  ----------------LRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL-CLNPVFSQALVADFIS-E
                        L+ KYFP+   + A +    S+ W+SL+ G+ LL +G+R +VG+G  I  + DPWIPR    RP   +        VAD I  +
Subjt:  ----------------LRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL-CLNPVFSQALVADFIS-E

Query:  SGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS
        S  W    L E    DE+D+IR+IP+ LR     ++  ++ +  YS
Subjt:  SGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYS

TrEMBL top hits (columns: hit, e value, %identity, alignment)
A0A2N9FNH6 Reverse transcriptase domain-containing protein (e value: 4.4e-185; %identity: 38.11)
Query:  EEGKKRTRLEGASLNHIDSKVCWKGKI-WRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELK
        +EG + T L  +S  HID  +   G   W F G YG P+ + ++ +W L+R L   ++ PWL+ GDFNELL   EK G   R  + +E FR +L+DCELK
Subjt:  EEGKKRTRLEGASLNHIDSKVCWKGKI-WRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELK

Query:  EMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVS
        +M + G+ FTW   R G   ++E LDR +C+ ++ SLF  A  R++ +  SDH  +   +   +   +   +R F+FE  W   E CE++++    WQ  
Subjt:  EMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVS

Query:  SASVL----AKNLNSCSEALSKWDSDVRNSMRTKIKECKQAL----KAAYDNAPHMDFLSIHN-LEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFH
         +  L    ++ + +C  AL +W   +    + ++ + + A     K   +N  + +  +  N    +L+ +L +EE +W+QRS   WL  GD+N+ +FH
Subjt:  SASVL----AKNLNSCSEALSKWDSDVRNSMRTKIKECKQAL----KAAYDNAPHMDFLSIHN-LEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFH

Query:  RKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAV
          AS R++ N I G+RDA G        +     +YF  IF++S P  S I   +  +S  V+Q+MN+ LL+ FT  +I  A+  M+PTKAPGPDG +A+
Subjt:  RKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAV

Query:  FFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGH
        F+QK+W+ VG       LE L++ K L   N T+I LIPK+ +P+ +  F PISLCNV YKI +K +ANRLK +L  IIS+ QSAF+ GRLITDNI+V  
Subjt:  FFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGH

Query:  ECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFL
        E LH +K  R        +KLD+SKA+DRVEW FLE +M+KLGFD+RW++LIM C+T+ ++SV++NG P G I P+RG+RQGDPLSPYLFL+ AEGL+ L
Subjt:  ECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFL

Query:  ISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVP
        + +A   G + G+      P +SHL FADD+L+FC+AN  E  ++ A+L TYE  SG+ +N  K+++ F  + + D +  + ++L  +   DLG YLG+P
Subjt:  ISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVP

Query:  SSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------
          + R K + F  +  KI K + GW+  L S A +EILIKSV QAIP Y +S F+ P +LC EI    ++FWWG    ++KIHW                
Subjt:  SSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------

Query:  -----------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV
                                     LL+ KYFP+ S +EA +    SF W+S+   R ++ +G R R+GNG  +  ++D WI   +  + +    +
Subjt:  -----------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV

Query:  F-SQALVADFIS-ESGAWNESLLIEAVGIDEIDIIRRIPIDL
          + A V+D I  E+  WN SL+       E   I+ IP+ L
Subjt:  F-SQALVADFIS-ESGAWNESLLIEAVGIDEIDIIRRIPIDL

A0A2N9I946 Uncharacterized protein (e value: 3.4e-185; %identity: 38.4)
Query:  LEGASLNHIDSKVCW-KGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSP
        ++  S NHID+ V    G  WR  G YG PE  L+  +W L+R L+S+ N PWL+ GDFNE+L   E++G   R    +  FR +L+DC L+++ ++G  
Subjt:  LEGASLNHIDSKVCW-KGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSP

Query:  FTWKGNRR-GIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAS--VLA
        F+W   R  G  +   LDR + N E+  LF      ++ +  SDH  +   ++   +    + ++PF+FE  W     CED IK+     VS     ++A
Subjt:  FTWKGNRR-GIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAS--VLA

Query:  KNLNSCSEALSKWD-SDVRNSMRTKIKECKQALKAAYDNAPHMDFLS--IHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEI
        + + +C   L +W+ S VR  +  ++ E K+      +++P  ++ S  ++ L  E++ L+E+EEIFW+QRSR  WL  GD+N+ ++H  AS R++TN I
Subjt:  KNLNSCSEALSKWD-SDVRNSMRTKIKECKQALKAAYDNAPHMDFLS--IHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEI

Query:  SGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPV
         G+RD +G W  +   I +  + YF  +F SS P+   I + +  +   VS  MN+ LL  F+  +I++A+  M P+KAPGPDG +A+FFQKYW+ VG  
Subjt:  SGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPV

Query:  TVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTV
             L+  ++ + L   N TNIVLIPKV NP+ +  F PISLCNV YKI +K + NR+K+IL +IIS+ QSAF+ GRLI+DNII+  E LH +K  R  
Subjt:  TVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTV

Query:  IKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSG
               KLD+SKA+DRVEW FL+ I+LKLGF RRW+ LIM C+T+ ++SV++NG+P G I PSRGLRQGDPLSPYLFLL AEGLS LI +A  +  + G
Subjt:  IKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSG

Query:  VLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFA
        +      P +SHL FADD+++FC+A++ +   + A+L  YE  SG+ IN  K+AI F ++     +  + S+   +       YLG+P  L RSK + F 
Subjt:  VLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFA

Query:  FVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------
         + ++I K +QGW+  L S A +EILIK+V QAIP Y +S FK P  LC EI     QFWWG    +R+IHW                            
Subjt:  FVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------

Query:  -----------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLC-LNPVFSQALVADFIS
                         +L+ KYFP  S LEAQ+  + S+ W+S+   R +L  G+R RVGNG +I  ++D W+P  S  R +  L+   S+  V   I 
Subjt:  -----------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLC-LNPVFSQALVADFIS

Query:  ESG-AWNESLLIEAVGIDEIDIIRRIPIDLR-----------KSGSFMLKDWNSQFRYSGSNWTDSQFNFVLKN
        E+   W+E  L +     ++DII++IP+ LR           KSG+F ++   S   +   + + S  N +  N
Subjt:  ESG-AWNESLLIEAVGIDEIDIIRRIPIDLR-----------KSGSFMLKDWNSQFRYSGSNWTDSQFNFVLKN

A0A2N9J3U0 Reverse transcriptase domain-containing protein (e value: 4.4e-185; %identity: 38.11)
Query:  EEGKKRTRLEGASLNHIDSKVCWKGKI-WRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELK
        +EG + T L  +S  HID  +   G   W F G YG P+ + ++ +W L+R L   ++ PWL+ GDFNELL   EK G   R  + +E FR +L+DCELK
Subjt:  EEGKKRTRLEGASLNHIDSKVCWKGKI-WRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELK

Query:  EMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVS
        +M + G+ FTW   R G   ++E LDR +C+ ++ SLF  A  R++ +  SDH  +   +   +   +   +R F+FE  W   E CE++++    WQ  
Subjt:  EMRFSGSPFTWKGNRRGIQ-IWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVS

Query:  SASVL----AKNLNSCSEALSKWDSDVRNSMRTKIKECKQAL----KAAYDNAPHMDFLSIHN-LEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFH
         +  L    ++ + +C  AL +W   +    + ++ + + A     K   +N  + +  +  N    +L+ +L +EE +W+QRS   WL  GD+N+ +FH
Subjt:  SASVL----AKNLNSCSEALSKWDSDVRNSMRTKIKECKQAL----KAAYDNAPHMDFLSIHN-LEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFH

Query:  RKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAV
          AS R++ N I G+RDA G        +     +YF  IF++S P  S I   +  +S  V+Q+MN+ LL+ FT  +I  A+  M+PTKAPGPDG +A+
Subjt:  RKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAV

Query:  FFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGH
        F+QK+W+ VG       LE L++ K L   N T+I LIPK+ +P+ +  F PISLCNV YKI +K +ANRLK +L  IIS+ QSAF+ GRLITDNI+V  
Subjt:  FFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGH

Query:  ECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFL
        E LH +K  R        +KLD+SKA+DRVEW FLE +M+KLGFD+RW++LIM C+T+ ++SV++NG P G I P+RG+RQGDPLSPYLFL+ AEGL+ L
Subjt:  ECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFL

Query:  ISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVP
        + +A   G + G+      P +SHL FADD+L+FC+AN  E  ++ A+L TYE  SG+ +N  K+++ F  + + D +  + ++L  +   DLG YLG+P
Subjt:  ISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVP

Query:  SSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------
          + R K + F  +  KI K + GW+  L S A +EILIKSV QAIP Y +S F+ P +LC EI    ++FWWG    ++KIHW                
Subjt:  SSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------

Query:  -----------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV
                                     LL+ KYFP+ S +EA +    SF W+S+   R ++ +G R R+GNG  +  ++D WI   +  + +    +
Subjt:  -----------------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPV

Query:  F-SQALVADFIS-ESGAWNESLLIEAVGIDEIDIIRRIPIDL
          + A V+D I  E+  WN SL+       E   I+ IP+ L
Subjt:  F-SQALVADFIS-ESGAWNESLLIEAVGIDEIDIIRRIPIDL

A0A803PWX1 Uncharacterized protein (e value: 6.1e-187; %identity: 39.25)
Query:  KKRTRLE--GASLNHIDSKVCWKG-KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKE
        KK  +L+   +S+ HI + V   G   W   G YG PEA+L+  +W L+R+L      PWL  GDFNE++   EK GG  R    ++ F+  ++DC   +
Subjt:  KKRTRLE--GASLNHIDSKVCWKG-KIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKE

Query:  MRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCR---RMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQV
           S    TW       QI E LDR +CN E+   F  A  + LDW  SDHR +  ++  R       +   +  F FEE W   E C +II      + 
Subjt:  MRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCR---RMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQV

Query:  SSASVLA--KNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIR
             ++    +N C +AL  W+   +  +  +I + K+ L           + +I ++E +L+ LLE++E +W+QRSR  WL WGD+N+ +FH KAS R
Subjt:  SSASVLA--KNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIR

Query:  RQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYW
        R+ NEI G++D  G W +D  L+      Y+ G+F  S  ++  + + L  + P+VS  MNE+L+  F+  ++ +A+  M PTKAPG DG  A+F+QK+W
Subjt:  RQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYW

Query:  NTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSI
        + +    +  CL VLNN   LS  N T + LIPKV  P+++ +F PISLCNV YKI +K +ANRL+  L  ++S+ QSAF++GRLI DN IVG+ECLH +
Subjt:  NTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSI

Query:  KENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANA
        ++NR        +KLD++KA+DRVEW FLE +MLKLG+D  W+S IM C+T+  FS LING  +G++ P RGLRQGDPLSP+LFLL AE  S LI  A  
Subjt:  KENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANA

Query:  KGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRS
         G L G+       SVSHL FADD+LVF  ANE E    K LL  Y   SG+ +NF+KS + F R +    +  L++I+ V  V + G YLG+PS + R+
Subjt:  KGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRS

Query:  KSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------
        K + F F+ NK+   ++GW+ S FS A KE+LIK+V QAIP+Y +S F+ PK     I    A+FWWG SE   KIHW                      
Subjt:  KSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------

Query:  -----------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPVFSQALV
                               +L+  Y+P+G ++EA+ G   SF W+SL+WG++++ +G R R+GNG S+    DPW+PR    +     P+  Q  V
Subjt:  -----------------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPVFSQALV

Query:  AD
         D
Subjt:  AD

A0A803Q9W0 Uncharacterized protein (e value: 4.1e-183; %identity: 38.24)
Query:  HIDSKVCWKGK----IWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWK
        HID+   W  K     WRF G YG P+ + +  +W L++ +    N PWL GGDFNE+   +EK GG  +  +L++ F  +++ C L+E+ + GS FTW 
Subjt:  HIDSKVCWKGK----IWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWK

Query:  GNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEA---SVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIK---THGDWQVSSASVLAK
          R    I+E LDR + N  +  ++A A  ++L    SDH P+     ++ C     ++ G R F +E+ W   E C+ II+     G+  ++SA+ L +
Subjt:  GNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEA---SVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIK---THGDWQVSSASVLAK

Query:  NLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMD-FLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGI
         LN+C   L +W+   R +   KIK+ K+ ++  Y N    D F  +  +E +L+  L +EE+FWKQRSR  WL  GD+N+ +FH+KA+ RR+ N I+G+
Subjt:  NLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMD-FLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGI

Query:  RDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ
         D    W      IE+T   +F  +F ++   ++      R +  R+S+  NE+LL+ FT  DI+ A++ +   KAPG DG   +F++K+W  +G    +
Subjt:  RDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ

Query:  ECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKD
         CL++LNN+K   + NKT + LIPK+  PK+VGD+ PISLCNV+YKI  K +ANR+K  LK++ISE QSAFI+GRLI DN I+G E LH +K+ R     
Subjt:  ECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKD

Query:  MATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLC
           +KLD+SKA+DRVEW FLE +M+ LG+D+RW+  IM CI + +FS+L+NG   G+I PSRGLRQGDPLSPY+FLL +EGLS LI EA     L G+  
Subjt:  MATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLC

Query:  SPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVL
           +  +SHL FADD+ +F  A  S+   +K++L  Y  +SG+ INF+KS +   + +N      L++IL V  V     YLG+P+S+ + K + F  + 
Subjt:  SPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLGVPSSLSRSKSKDFAFVL

Query:  NKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW-------------------------------
         KIR  +QGW+ SLFS A +EIL+K++ QAIP+Y++S F+ PK L K+I    A+FWWG S+ K+K HW                               
Subjt:  NKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW-------------------------------

Query:  --------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPVFSQALVADFISESGA
                      +L+  Y+ + + LEA++G   S+ W+S+LWGR+++ +GIR RV  G  +   +D W+PR S         V     +     E G 
Subjt:  --------------LLRDKYFPSGSILEAQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPVFSQALVADFISESGA

Query:  WNESLLIEAVGIDEIDII
        WN   + E    D++ +I
Subjt:  WNESLLIEAVGIDEIDII

SwissProt top hits (e value, %identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 7.8e-38, %identity: 27.59)
Query:  RRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGIS-PRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQK
        +R+ N+I  I++ +G  T DP  I+ T   Y+  ++ +       +   L   + PR++Q+  E L    T  +I   IN +   K+PGPDGF+A F+Q+
Subjt:  RRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGIS-PRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQK

Query:  YWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVG-NPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECL
        Y   + P  ++    +       + + + +I+LIPK G +  +  +F PISL N++ KI  K +ANR++  +K +I  +Q  FI G     NI      +
Subjt:  YWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVG-NPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECL

Query:  HSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSR-GLRQGDPLSPYLFLLVAEGLSFLIS
          I  NR   K+   I +D  KAFD+++  F+ + + KLG D  ++ +I      P  ++++NG  K E  P + G RQG PLSP LF +V E L+  I 
Subjt:  HSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSR-GLRQGDPLSPYLFLLVAEGLSFLIS

Query:  EANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKS-AILFPRSMNADRK--GFLSSILNVNQVKDLGSYLGV
        +   +  + G+        +S  LFADD +V+ +       ++  L+S +  +SG  IN  KS A L+  +   + +  G L   +   ++K LG  + +
Subjt:  EANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKS-AILFPRSMNADRK--GFLSSILNVNQVKDLGSYLGV

Query:  PSSLSRSKSKDFAFVLNKIRKSMQGWRR------SLFSIARKEILIKSVGQ--AIPSYVLSIFKFPKSLCKEIMCCFAQFWW
           +     +++  +L +I++    W+          +I +  IL K + +  AIP       K P +   E+     +F W
Subjt:  PSSLSRSKSKDFAFVLNKIRKSMQGWRR------SLFSIARKEILIKSVGQ--AIPSYVLSIFKFPKSLCKEIMCCFAQFWW

P08548 LINE-1 reverse transcriptase homolog (e value: 1.1e-39, %identity: 23.56)
Query:  IGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSP----FTWKGNRRGIQIWEWLDRF
        I +Y  P  N        +  + +L +S  ++ GDFN  L   ++    + +  +L +  S++   +L ++  +  P    +T+  +  G   +  +D  
Subjt:  IGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSP----FTWKGNRRGIQIWEWLDRF

Query:  ICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRR------MVWRKSGRRPFKFEEFWTHYEACEDIIK-----------THGDWQVSSASVLAKNLNS
        + +    +L  F     +  +FSDH  I+  ++  R        W+ +       ++ W   E  ++I K               W  + A VL     +
Subjt:  ICNLEFESLFAFAGSRNLDWMFSDHRPIEASVDCRR------MVWRKSGRRPFKFEEFWTHYEACEDIIK-----------THGDWQVSSASVLAKNLNS

Query:  CSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWG----DKNSGWFHRKASIRRQTNEISGIR
            L K + +  N++   +K+ +   K  + N        I  +  EL+  +E + I  +    + W +      DK      RK   +R  + IS IR
Subjt:  CSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWG----DKNSGWFHRKASIRRQTNEISGIR

Query:  DAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGIS-PRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ
        +     T DP+ I+     Y+  ++         I   L     PR+SQK  E L    +  +I   I ++   K+PGPDGF++ F+Q +   + P+ + 
Subjt:  DAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGIS-PRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQ

Query:  ECLEVLNNHKSLSEWNKTNIVLIPKVG-NPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIK
            +       + + + NI LIPK G +P    ++ PISL N++ KI  K + NR++  +K II  +Q  FI G     NI      +  I  N+   K
Subjt:  ECLEVLNNHKSLSEWNKTNIVLIPKVG-NPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIK

Query:  DMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVL
        D   + +D  KAFD ++  F+   + K+G +  ++ LI    + P  ++++NG+         G RQG PLSP LF +V E L+  I E  A   + G+ 
Subjt:  DMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVL

Query:  CSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSI---LNVNQVKDLGSYLGVPSSLSRSKSKDF
            S  +   LFADD +V+ +        +  ++  Y  +SG  IN +KS      + N   K    SI   +   ++K LG YL     +     +++
Subjt:  CSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSI---LNVNQVKDLGSYLGVPSSLSRSKSKDF

Query:  AFVLNKIRKSMQGWRRSLFS-IARKEILIKSV-GQAIPSYVLSIFKFPKSLCKEIMCCFAQFWW
          +  +I + +  W+    S + R  I+  S+  +AI ++     K P S  K++      F W
Subjt:  AFVLNKIRKSMQGWRRSLFS-IARKEILIKSV-GQAIPSYVLSIFKFPKSLCKEIMCCFAQFWW

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.1e-39, %identity: 27.86)
Query:  ISGIRDAEGSWTEDPALIEDTFISYFCGIFKS---STPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNT
        I+ IR+ +G  T DP  I++T  S++  ++ +   +  E  + +D  R   P+++Q   + L S  +  +IE  IN +   K+PGPDGFSA F+Q +   
Subjt:  ISGIRDAEGSWTEDPALIEDTFISYFCGIFKS---STPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNT

Query:  VGPVTVQECLEVLNNHKSLSEWNKTNIVLIPK-VGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIK
        + P+  +   ++       + + +  I LIPK   +P ++ +F PISL N++ KI  K +ANR++  +K II  +Q  FI G     NI      +H I 
Subjt:  VGPVTVQECLEVLNNHKSLSEWNKTNIVLIPK-VGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIK

Query:  ENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSR-GLRQGDPLSPYLFLLVAEGLSFLISEANA
         N+   K+   I LD  KAFD+++  F+ +++ + G    ++++I    + P  ++ +NG  K E IP + G RQG PLSPYLF +V E L+  I +   
Subjt:  ENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSR-GLRQGDPLSPYLFLLVAEGLSFLISEANA

Query:  KGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKS-AILFPRSMNADRKGFLSSILNV--NQVKDLGSYLGVPSSL
        +  + G+        +S  L ADD +V+    ++    +  L++++  + G  IN NKS A L+ ++  A+++   ++  ++  N +K LG  + +   +
Subjt:  KGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKS-AILFPRSMNADRKGFLSSILNV--NQVKDLGSYLGVPSSL

Query:  SRSKSKDFAFVLNKIRKSMQGWRR------SLFSIARKEILIKSVGQ--AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWLLRDKYFPSG
             K+F  +  +I++ ++ W+          +I +  IL K++ +  AIP       K P     E+     +F W  ++  R    LL+DK    G
Subjt:  SRSKSKDFAFVLNKIRKSMQGWRR------SLFSIARKEILIKSVGQ--AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWLLRDKYFPSG

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.1e-34, %identity: 25.98)
Query:  WTHYEACEDIIKTHGDWQVSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWK-QRSREDW
        W  + A +D   T   W       L       ++++S   +    ++  ++ + +Q L  + D A   ++L       E  R +E+ +      RSR   
Subjt:  WTHYEACEDIIKTHGDWQVSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLEFELDRLLEEEEIFWK-QRSREDW

Query:  LWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYP
        L   D+ S +F+     +    +I+ +   +G+  EDP  I D   S++  +F S  P      + L    P VS++  E+L +  T  ++ +A+  M  
Subjt:  LWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNEKLLSLFTKCDIEKAINDMYP

Query:  TKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQ
         K+PG DG +  FFQ +W+T+GP   +   E     +      +  + L+PK G+ + + ++ P+SL + +YKI  KAI+ RLK +L ++I  +QS  + 
Subjt:  TKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDIISEEQSAFIQ

Query:  GRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPY
        GR I DN+ +  + LH  +  RT +  +A + LD  KAFDRV+  +L   +    F  +++  +     +    V IN      +   RG+RQG PLS  
Subjt:  GRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPY

Query:  LFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFL-SSILNV
        L+ L  E    L+     +  L+G++       V    +ADD ++    +  +L   +     Y A S   IN++KS+ L   S+  D   FL  +  ++
Subjt:  LFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFL-SSILNV

Query:  NQVKDLGSYLGV
        +    +  YLGV
Subjt:  NQVKDLGSYLGV

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 4.2e-15, %identity: 30.92)
Query:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------------------------LLRDKYFPSGSILE
        A+P Y +S F+  K LCK++     +FWW   E KRKI W                                              LLR +YFP  S++E
Subjt:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------------------------LLRDKYFPSGSILE

Query:  AQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL
          +G   S+ W+S++ GRELLS+G+ R +G+G     + D WI  E+ L PL
Subjt:  AQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL

Arabidopsis top hits (e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.4e-26, %identity: 23.99)
Query:  VSARKKLNWKRRARMGHINESSTQDQLSKKMKSSSELEEGKKRTRLEGASLNHIDSKVCWKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWL--
        +S++ +  W+ +    +I+ SS  D +SK   S   L  G   +     ++  +DS +      WR    Y   E    +  W+    +   + +  L  
Subjt:  VSARKKLNWKRRARMGHINESSTQDQLSKKMKSSSELEEGKKRTRLEGASLNHIDSKVCWKGKIWRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWL--

Query:  LGGDFNELLWDYEKYGGPRRASHL--LEEFRSSLNDCELKEMRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVD
        L GDF+++    + Y   + +  +  LEEF++ L D +L ++   G  +TW  ++    I   LDR I N ++ S F  A +       SDH P    ++
Subjt:  LGGDFNELLWDYEKYGGPRRASHL--LEEFRSSLNDCELKEMRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFAFAGSRNLDWMFSDHRPIEASVD

Query:  CRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLE----
               K  ++ F++  F          + TH  + VS      + +   S   S     +   ++   K CK   +  + N  H    ++ +LE    
Subjt:  CRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMDFLSIHNLE----

Query:  ----------FELDRLLEEE--------EIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSS----
                  F ++ +  ++        E F++Q+SR  WL  GD N+ +FH+     +  N I  +R  +    E+   +++  ++Y+  +  S     
Subjt:  ----------FELDRLLEEE--------EIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSS----

Query:  TPEKSRIVDALRGISP-RVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGN
        TP+    V  ++ I P R +  +  +L +L +  +I  A+  M   KAPGPD F+A FF + W  V   T+    E       L  +N T I LIPKV  
Subjt:  TPEKSRIVDALRGISP-RVSQKMNEKLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGN

Query:  PKEVGDFWPISLCNVNYKITT
          ++  F P+S C V YKI T
Subjt:  PKEVGDFWPISLCNVNYKITT

AT4G20520.1 RNA binding; RNA-directed DNA polymerases (e value: 9.9e-12, %identity: 38.55)
Query:  IANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWI
        +  RLK ++ ++I   Q++FI GR+ TDNI+   E +HS++  + V K    +KLDL KA+DR+ W +LE+ ++  GF   W+
Subjt:  IANRLKMILKDIISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 5.6e-15, %identity: 26.91)
Query:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW---------------------------------------------LLRDKYFPSGSILEA
        A+P+Y ++ F  PK++CK+I+   A FWW   +  + +HW                                             + + +YF     L A
Subjt:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW---------------------------------------------LLRDKYFPSGSILEA

Query:  QLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWI---PRESLLRPLCLNP-----VFSQALVADFISESGAWNESLLIEAVGIDEIDIIRRI
         LG   SF WKS+   +E+L QG R  VGNGE I+ ++  W+   P  + LR   + P     V S   V+D I ESG      +IE +   E++  R++
Subjt:  QLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWI---PRESLLRPLCLNP-----VFSQALVADFISESGAWNESLLIEAVGIDEIDIIRRI

Query:  PIDLRKSGSFMLKDWNSQFRYSG
          +LR  G  +L  +   +  SG
Subjt:  PIDLRKSGSFMLKDWNSQFRYSG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 3.0e-16, %identity: 30.92)
Query:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------------------------LLRDKYFPSGSILE
        A+P Y +S F+  K LCK++     +FWW   E KRKI W                                              LLR +YFP  S++E
Subjt:  AIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHW----------------------------------------------LLRDKYFPSGSILE

Query:  AQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL
          +G   S+ W+S++ GRELLS+G+ R +G+G     + D WI  E+ L PL
Subjt:  AQLGYSLSFRWKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 5.1e-16, %identity: 59.7)
Query:  LINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADD
        +ING P+G + PSRGLRQGDPLSPYLF+L  E LS L   A  +G L G+  S +SP ++HLLFADD
Subjt:  LINGIPKGEIIPSRGLRQGDPLSPYLFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADD


Sequences
CDS sequence
ATGCTGAAATCAGGTGAAGCTGGGGGAGAGCGATGGATTACTATCAGCCATGGTGCAAAAGAATGCAAGAATGATCAAGAAGGTGGGGGATCCAACAAGAATAGCTTTGA
ATTCAGCGCATGGTTAAAGTTCCAGGGTTATTTTAGAGGAACGAGGGCACAAACTCCTCCAGCCAATGAGGAGTCTCTTGATCTGAATAGTTCTAGACATCCGGAGAACT
CTGAGGACATTGTTCATAATACTGTGTATACTCCCGCCCTGGCTGTCGAGGAGGGAGCAATGGACTTTAATCTTGCAAAGGATTTTGAGGGAGAACAAAAGAATGAACAT
GTCCTCTCGAATGATTCTGTATTCGAGGTGGCGATGGAGGAGGATGGAAATCAGATCATTGAAAGTAGTAGACTGAAGGAGGTTCATAAGGTTTCAGGTACTCTGAGTCA
AGGTGAAGGGTCCAGTGGAGGATTTATTGTTTCAGCTAGAAAGAAGCTGAACTGGAAGAGGCGTGCAAGGATGGGACATATTAATGAGTCTTCAACTCAAGATCAGTTGT
CCAAGAAAATGAAGTCTAGTTCTGAGCTCGAGGAAGGAAAGAAGAGGACTAGACTCGAGGGAGCTTCATTGAATCACATTGATTCTAAGGTTTGCTGGAAAGGAAAAATC
TGGAGGTTCATTGGTCTTTATGGCTTCCCAGAGGCCAATCTAAAATATAAAACCTGGAACCTTATCCGGCACCTTCATAGCCTGGAGAATTCTCCTTGGCTTCTAGGAGG
TGATTTCAATGAGCTGTTATGGGATTATGAAAAGTATGGAGGCCCAAGAAGAGCGAGTCATCTGTTAGAGGAATTTCGAAGTTCTCTGAATGACTGTGAGCTAAAGGAGA
TGCGCTTCTCTGGCAGCCCTTTTACCTGGAAAGGAAATCGAAGGGGAATTCAAATCTGGGAGTGGTTGGACAGGTTTATTTGTAACCTTGAATTCGAGTCTCTGTTTGCT
TTTGCAGGGTCGCGTAACTTGGATTGGATGTTCTCGGATCACAGGCCAATTGAGGCCTCAGTCGACTGTCGTCGTATGGTTTGGAGGAAATCAGGGAGGCGCCCCTTTAA
ATTTGAGGAGTTCTGGACCCATTATGAGGCTTGCGAGGATATCATTAAGACTCATGGAGACTGGCAGGTTTCTTCAGCATCTGTATTAGCAAAAAACTTGAATTCCTGCT
CTGAAGCTTTAAGCAAATGGGACAGTGATGTGAGAAATTCGATGCGGACCAAAATTAAAGAATGTAAACAAGCCCTAAAGGCAGCCTATGACAATGCTCCCCATATGGAT
TTCCTCTCTATTCATAATCTGGAATTCGAATTAGATAGATTGCTGGAAGAAGAAGAGATTTTTTGGAAGCAAAGATCTCGTGAAGATTGGCTGTGGTGGGGAGACAAAAA
TTCAGGTTGGTTTCATAGGAAGGCTTCTATCCGAAGGCAAACTAATGAGATTTCTGGAATTCGTGATGCAGAGGGTTCATGGACTGAGGATCCTGCCTTAATTGAGGACA
CTTTTATATCATACTTCTGTGGTATTTTCAAGTCCTCTACGCCCGAGAAGAGCAGAATAGTTGATGCCTTGCGAGGTATTTCTCCTAGAGTTTCTCAGAAGATGAATGAA
AAACTTTTGTCCCTTTTCACGAAGTGTGATATTGAAAAGGCGATTAATGATATGTACCCCACTAAGGCACCAGGGCCAGATGGATTCTCTGCGGTTTTTTTTCAGAAGTA
CTGGAATACGGTAGGCCCTGTCACAGTTCAGGAGTGCCTGGAGGTTCTTAATAATCACAAGAGTCTATCTGAATGGAACAAGACAAATATTGTGTTGATCCCTAAGGTAG
GCAATCCAAAGGAGGTGGGGGATTTCTGGCCTATAAGCCTTTGTAATGTTAATTACAAAATCACTACAAAGGCCATAGCGAATCGGTTAAAGATGATTTTGAAGGACATC
ATTTCGGAGGAACAATCAGCATTCATTCAAGGTCGGTTGATCACAGACAATATAATAGTGGGGCATGAGTGTCTTCATTCAATCAAGGAGAATCGAACGGTGATTAAAGA
TATGGCAACTATCAAACTAGATCTCAGTAAGGCATTCGATAGAGTTGAGTGGCTCTTTTTGGAGGAGATTATGTTAAAACTGGGGTTTGACAGGCGTTGGATAAGTCTTA
TTATGGGTTGTATCACCACCCCTGCCTTTTCTGTCTTGATAAATGGGATTCCTAAAGGAGAGATAATTCCTAGTAGGGGCTTGCGACAGGGTGACCCCTTGTCGCCGTAT
CTCTTCTTGCTTGTAGCAGAGGGCCTATCTTTTTTAATCTCCGAAGCTAATGCTAAAGGTAATTTGTCAGGGGTCCTTTGTTCCCCATCGAGCCCAAGTGTTTCCCACTT
GTTGTTTGCTGACGACAACTTGGTCTTCTGCAAGGCCAATGAATCTGAGTTAGTTCATATGAAAGCCCTTTTGTCTACCTATGAAGCCATTTCAGGGGAATTCATAAACT
TCAATAAGTCTGCAATCTTATTTCCTAGGAGTATGAATGCAGATAGAAAAGGTTTCTTGAGCAGCATCCTTAATGTTAATCAGGTTAAAGATCTAGGGTCCTATCTGGGG
GTTCCTTCATCTTTGTCTAGGAGTAAGTCAAAGGACTTTGCTTTTGTTCTCAACAAAATCAGGAAGTCTATGCAAGGCTGGAGGAGATCTCTTTTCTCAATTGCGAGGAA
AGAGATTTTGATCAAAAGTGTTGGGCAGGCCATCCCATCTTATGTTCTGAGTATTTTCAAGTTCCCTAAAAGCCTTTGTAAGGAGATCATGTGTTGCTTTGCTCAGTTTT
GGTGGGGCTTGAGTGAGGTTAAGAGGAAAATACATTGGCTTTTGAGGGATAAATACTTCCCCTCGGGCTCTATTTTGGAGGCTCAATTGGGATATTCTCTTTCTTTTCGT
TGGAAAAGCCTTTTATGGGGTCGAGAGCTTTTAAGCCAAGGCATTCGCAGGAGGGTGGGCAATGGCGAATCCATTGTTTGTTTTCAGGATCCATGGATTCCAAGAGAGTC
CTTATTGAGGCCTTTGTGCTTGAATCCTGTTTTCTCCCAAGCGTTGGTAGCTGATTTTATCTCAGAATCTGGAGCTTGGAACGAAAGTTTGCTAATTGAAGCAGTTGGTA
TTGATGAAATTGATATAATCAGGCGAATCCCTATAGATCTAAGGAAGTCCGGTAGCTTTATGTTGAAGGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACC
GATTCGCAATTTAATTTCGTTCTAAAAAATACTGGTACATTGGCAATTACAAATTCATAA
mRNA sequence
ATGCTGAAATCAGGTGAAGCTGGGGGAGAGCGATGGATTACTATCAGCCATGGTGCAAAAGAATGCAAGAATGATCAAGAAGGTGGGGGATCCAACAAGAATAGCTTTGA
ATTCAGCGCATGGTTAAAGTTCCAGGGTTATTTTAGAGGAACGAGGGCACAAACTCCTCCAGCCAATGAGGAGTCTCTTGATCTGAATAGTTCTAGACATCCGGAGAACT
CTGAGGACATTGTTCATAATACTGTGTATACTCCCGCCCTGGCTGTCGAGGAGGGAGCAATGGACTTTAATCTTGCAAAGGATTTTGAGGGAGAACAAAAGAATGAACAT
GTCCTCTCGAATGATTCTGTATTCGAGGTGGCGATGGAGGAGGATGGAAATCAGATCATTGAAAGTAGTAGACTGAAGGAGGTTCATAAGGTTTCAGGTACTCTGAGTCA
AGGTGAAGGGTCCAGTGGAGGATTTATTGTTTCAGCTAGAAAGAAGCTGAACTGGAAGAGGCGTGCAAGGATGGGACATATTAATGAGTCTTCAACTCAAGATCAGTTGT
CCAAGAAAATGAAGTCTAGTTCTGAGCTCGAGGAAGGAAAGAAGAGGACTAGACTCGAGGGAGCTTCATTGAATCACATTGATTCTAAGGTTTGCTGGAAAGGAAAAATC
TGGAGGTTCATTGGTCTTTATGGCTTCCCAGAGGCCAATCTAAAATATAAAACCTGGAACCTTATCCGGCACCTTCATAGCCTGGAGAATTCTCCTTGGCTTCTAGGAGG
TGATTTCAATGAGCTGTTATGGGATTATGAAAAGTATGGAGGCCCAAGAAGAGCGAGTCATCTGTTAGAGGAATTTCGAAGTTCTCTGAATGACTGTGAGCTAAAGGAGA
TGCGCTTCTCTGGCAGCCCTTTTACCTGGAAAGGAAATCGAAGGGGAATTCAAATCTGGGAGTGGTTGGACAGGTTTATTTGTAACCTTGAATTCGAGTCTCTGTTTGCT
TTTGCAGGGTCGCGTAACTTGGATTGGATGTTCTCGGATCACAGGCCAATTGAGGCCTCAGTCGACTGTCGTCGTATGGTTTGGAGGAAATCAGGGAGGCGCCCCTTTAA
ATTTGAGGAGTTCTGGACCCATTATGAGGCTTGCGAGGATATCATTAAGACTCATGGAGACTGGCAGGTTTCTTCAGCATCTGTATTAGCAAAAAACTTGAATTCCTGCT
CTGAAGCTTTAAGCAAATGGGACAGTGATGTGAGAAATTCGATGCGGACCAAAATTAAAGAATGTAAACAAGCCCTAAAGGCAGCCTATGACAATGCTCCCCATATGGAT
TTCCTCTCTATTCATAATCTGGAATTCGAATTAGATAGATTGCTGGAAGAAGAAGAGATTTTTTGGAAGCAAAGATCTCGTGAAGATTGGCTGTGGTGGGGAGACAAAAA
TTCAGGTTGGTTTCATAGGAAGGCTTCTATCCGAAGGCAAACTAATGAGATTTCTGGAATTCGTGATGCAGAGGGTTCATGGACTGAGGATCCTGCCTTAATTGAGGACA
CTTTTATATCATACTTCTGTGGTATTTTCAAGTCCTCTACGCCCGAGAAGAGCAGAATAGTTGATGCCTTGCGAGGTATTTCTCCTAGAGTTTCTCAGAAGATGAATGAA
AAACTTTTGTCCCTTTTCACGAAGTGTGATATTGAAAAGGCGATTAATGATATGTACCCCACTAAGGCACCAGGGCCAGATGGATTCTCTGCGGTTTTTTTTCAGAAGTA
CTGGAATACGGTAGGCCCTGTCACAGTTCAGGAGTGCCTGGAGGTTCTTAATAATCACAAGAGTCTATCTGAATGGAACAAGACAAATATTGTGTTGATCCCTAAGGTAG
GCAATCCAAAGGAGGTGGGGGATTTCTGGCCTATAAGCCTTTGTAATGTTAATTACAAAATCACTACAAAGGCCATAGCGAATCGGTTAAAGATGATTTTGAAGGACATC
ATTTCGGAGGAACAATCAGCATTCATTCAAGGTCGGTTGATCACAGACAATATAATAGTGGGGCATGAGTGTCTTCATTCAATCAAGGAGAATCGAACGGTGATTAAAGA
TATGGCAACTATCAAACTAGATCTCAGTAAGGCATTCGATAGAGTTGAGTGGCTCTTTTTGGAGGAGATTATGTTAAAACTGGGGTTTGACAGGCGTTGGATAAGTCTTA
TTATGGGTTGTATCACCACCCCTGCCTTTTCTGTCTTGATAAATGGGATTCCTAAAGGAGAGATAATTCCTAGTAGGGGCTTGCGACAGGGTGACCCCTTGTCGCCGTAT
CTCTTCTTGCTTGTAGCAGAGGGCCTATCTTTTTTAATCTCCGAAGCTAATGCTAAAGGTAATTTGTCAGGGGTCCTTTGTTCCCCATCGAGCCCAAGTGTTTCCCACTT
GTTGTTTGCTGACGACAACTTGGTCTTCTGCAAGGCCAATGAATCTGAGTTAGTTCATATGAAAGCCCTTTTGTCTACCTATGAAGCCATTTCAGGGGAATTCATAAACT
TCAATAAGTCTGCAATCTTATTTCCTAGGAGTATGAATGCAGATAGAAAAGGTTTCTTGAGCAGCATCCTTAATGTTAATCAGGTTAAAGATCTAGGGTCCTATCTGGGG
GTTCCTTCATCTTTGTCTAGGAGTAAGTCAAAGGACTTTGCTTTTGTTCTCAACAAAATCAGGAAGTCTATGCAAGGCTGGAGGAGATCTCTTTTCTCAATTGCGAGGAA
AGAGATTTTGATCAAAAGTGTTGGGCAGGCCATCCCATCTTATGTTCTGAGTATTTTCAAGTTCCCTAAAAGCCTTTGTAAGGAGATCATGTGTTGCTTTGCTCAGTTTT
GGTGGGGCTTGAGTGAGGTTAAGAGGAAAATACATTGGCTTTTGAGGGATAAATACTTCCCCTCGGGCTCTATTTTGGAGGCTCAATTGGGATATTCTCTTTCTTTTCGT
TGGAAAAGCCTTTTATGGGGTCGAGAGCTTTTAAGCCAAGGCATTCGCAGGAGGGTGGGCAATGGCGAATCCATTGTTTGTTTTCAGGATCCATGGATTCCAAGAGAGTC
CTTATTGAGGCCTTTGTGCTTGAATCCTGTTTTCTCCCAAGCGTTGGTAGCTGATTTTATCTCAGAATCTGGAGCTTGGAACGAAAGTTTGCTAATTGAAGCAGTTGGTA
TTGATGAAATTGATATAATCAGGCGAATCCCTATAGATCTAAGGAAGTCCGGTAGCTTTATGTTGAAGGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACC
GATTCGCAATTTAATTTCGTTCTAAAAAATACTGGTACATTGGCAATTACAAATTCATAA
Protein sequence
MLKSGEAGGERWITISHGAKECKNDQEGGGSNKNSFEFSAWLKFQGYFRGTRAQTPPANEESLDLNSSRHPENSEDIVHNTVYTPALAVEEGAMDFNLAKDFEGEQKNEH
VLSNDSVFEVAMEEDGNQIIESSRLKEVHKVSGTLSQGEGSSGGFIVSARKKLNWKRRARMGHINESSTQDQLSKKMKSSSELEEGKKRTRLEGASLNHIDSKVCWKGKI
WRFIGLYGFPEANLKYKTWNLIRHLHSLENSPWLLGGDFNELLWDYEKYGGPRRASHLLEEFRSSLNDCELKEMRFSGSPFTWKGNRRGIQIWEWLDRFICNLEFESLFA
FAGSRNLDWMFSDHRPIEASVDCRRMVWRKSGRRPFKFEEFWTHYEACEDIIKTHGDWQVSSASVLAKNLNSCSEALSKWDSDVRNSMRTKIKECKQALKAAYDNAPHMD
FLSIHNLEFELDRLLEEEEIFWKQRSREDWLWWGDKNSGWFHRKASIRRQTNEISGIRDAEGSWTEDPALIEDTFISYFCGIFKSSTPEKSRIVDALRGISPRVSQKMNE
KLLSLFTKCDIEKAINDMYPTKAPGPDGFSAVFFQKYWNTVGPVTVQECLEVLNNHKSLSEWNKTNIVLIPKVGNPKEVGDFWPISLCNVNYKITTKAIANRLKMILKDI
ISEEQSAFIQGRLITDNIIVGHECLHSIKENRTVIKDMATIKLDLSKAFDRVEWLFLEEIMLKLGFDRRWISLIMGCITTPAFSVLINGIPKGEIIPSRGLRQGDPLSPY
LFLLVAEGLSFLISEANAKGNLSGVLCSPSSPSVSHLLFADDNLVFCKANESELVHMKALLSTYEAISGEFINFNKSAILFPRSMNADRKGFLSSILNVNQVKDLGSYLG
VPSSLSRSKSKDFAFVLNKIRKSMQGWRRSLFSIARKEILIKSVGQAIPSYVLSIFKFPKSLCKEIMCCFAQFWWGLSEVKRKIHWLLRDKYFPSGSILEAQLGYSLSFR
WKSLLWGRELLSQGIRRRVGNGESIVCFQDPWIPRESLLRPLCLNPVFSQALVADFISESGAWNESLLIEAVGIDEIDIIRRIPIDLRKSGSFMLKDWNSQFRYSGSNWT
DSQFNFVLKNTGTLAITNS