CuGenDBv2

Lag0015318 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015318
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:10322992..10329039
RNA-Seq Expression: Lag0015318
Synteny: Lag0015318
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis] | e value: 1.3e-151 | % identity: 36.07
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        R  +W L+++L    + PW++GGD+N+I   S+K GG  +S RL+++ K  L FC L D+ F G +F W R    G+ V+ RLDRF  +LP  +      
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIV-QKTEESMMKLASW---NFNRLKEK---------
        V HL+   SDH  IL     + ++  +  KK++ +FEE WLL + C ++VK +W    G G  +++  + + +   L SW   +F  ++++         
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIV-QKTEESMMKLASW---NFNRLKEK---------

Query:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN
                            +L+ LL +E+ +WR RA+  WLK GD NTK+FH    +RK+KN++  + +++G W    + +  + + YF  LF+SS P 
Subjt:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN

Query:  --QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPK
          Q ++E     +     E   R L R   K E+  A+KNM+PSK+ GPDG    FFQ +W+++G+DI     +       L ++ S++  LIPKV+VP+
Subjt:  --QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPK

Query:  KMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFG
         M +  PISLC VVYKI +K LANR+K  LDS+I P Q+ F+PGRLI+DN L+ FE  H +  + +GK GY A+ LDMSKAYDRVEW ++  VM K+GFG
Subjt:  KMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFG

Query:  DNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETI
        + WI  +M CV +V Y   IN  P     P RGLRQGD +S                                            + IF +A   +CE +
Subjt:  DNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETI

Query:  KKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQA
        K  L  YE  SGQ +N+ KS    SKN++     G +  LGV  V+    YL                          GW+E+  S  GK+VLIKAVAQA
Subjt:  KKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQA

Query:  IPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TS
        IP+Y MSCF +P+ +C+E++++ A+FWWG  GD +K HW++W+KLC  K   GLGFR++ +FN A+LTKQ                              
Subjt:  IPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TS

Query:  LGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMW-IREELVDGRVADV
        L      TWRSIL GR + ++GMR++VG    IR+ DDPW+P   +++P   + E L + RVAD+
Subjt:  LGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMW-IREELVDGRVADV

XP_010688579.1 PREDICTED: uncharacterized protein LOC104902491 [Beta vulgaris subsp. vulgaris] | e value: 7.5e-147 | % identity: 36.96
Query:  SWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEH
        +W L+R++   ++ P +  GDFN+I+   EK GG  +S+RL+D F+  +  C + D+G++G  F W+R ++   +++ERLDR +AN    N     E+ H
Subjt:  SWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEH

Query:  LNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQG---GGACEIVQK-----------TEESMMKLASWNFNRLKEK--
        L  + SDH  +L      N    R  K  K  FE  WL  +EC +IV++AWG  +G   G   E V +             +   K A    NRL+++  
Subjt:  LNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQG---GGACEIVQK-----------TEESMMKLASWNFNRLKEK--

Query:  -------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFT
                     +LD + + EE+YW  RAR + L+ GD+NTK+FH  AS RK +N+I  + D NG W +G+D IG +   YF++LFSS  P    +E  
Subjt:  -------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFT

Query:  TKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPIS
         + ++  + +    EL  P +  +I  AL +M+P+KA G DG HA FFQ +W I+G DI    L   N D +L S+  +   LIPK   P  M+ F PIS
Subjt:  TKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPIS

Query:  LCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMK
        LC V+YKI++K LAN++K+ L ++I P+Q+ F+P RLITDN L+ FE  HA+  K     G  A+ LDMSKAYDRVEW ++ KVMEK+GF   WI +VM 
Subjt:  LCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMK

Query:  CVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFL--------------------------------------------KANWKNCETIKKFLGEYEA
        CV SV +  KIN        P RGLRQGDP+S  +FL                                             A+   C  +   + +YE 
Subjt:  CVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFL--------------------------------------------KANWKNCETIKKFLGEYEA

Query:  VSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCF
         SGQ +N  K+  + S+N+ R       N LGV  VE    YL                          GWKE+L S  GK+VLIKAV QAIPTY MS F
Subjt:  VSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCF

Query:  MLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTW
         LP G+ +EI+ L A+FWWG    ++K HW  W+ LC  K   GLGFRD+  FNQA+L KQ                               G NP  TW
Subjt:  MLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTW

Query:  RSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREG
        RSI   + L  +G++W VGS   IR+ DD W+  EG
Subjt:  RSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREG

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] | e value: 4.4e-155 | % identity: 37.78
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        R DSW L+  L+S    PW+  GDFN+IL  +EK GG  +S+  +D F+  +++C   D+G+ G  + W       + +  RLDR +A      K   M+
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAW----------GKNQGGGAC-------------EIVQKTEESMMKL-
        V HL     DH  +L + NS    I    + ++  FE  W   ++C  I++ +W          G ++    C             +I +K ++   +L 
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAW----------GKNQGGGAC-------------EIVQKTEESMMKL-

Query:  ----------ASWNFNRLKEKELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP
                   S   NRL+E E++ LL++EETYW  RA+  WLK GDRNTK+FH  AS R+++N+I  I D  G W + E++I   AI YF  ++SSS P
Subjt:  ----------ASWNFNRLKEKELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP

Query:  NQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKK
        +Q  IE  T+ I  K+ E+    L R F+K E+  ALK ++P+KA GPDG  A FFQ YW I+G +++ + L++LN +  +  L  +   LIPK   PK+
Subjt:  NQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKK

Query:  MERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGD
        M  F PISLC VVYK+I+K LANR+K  L  +I  +Q+ F   RLITDNVL+ FE +H +  K AGKEG++A+ LDMSKA+DRVEW +I KVME++GF +
Subjt:  MERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGD

Query:  NWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFL--------------------------------------------KANWKNCETIK
         W + VM+C+ SV Y + IN       YP RGLRQGDPLS  +FL                                            KA ++ C  ++
Subjt:  NWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFL--------------------------------------------KANWKNCETIK

Query:  KFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLG-------------------------VKLVESLGYYL-GWKERLFSMGGKKVLIKAVAQAI
          LG+YE  SGQ IN DKS+   S N  +E+     N LG                           L E +G+ L GWK +L SMGGK++LIKAVAQAI
Subjt:  KFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLG-------------------------VKLVESLGYYL-GWKERLFSMGGKKVLIKAVAQAI

Query:  PTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSL
        PTYTMSCF+LP+G+C+++ ++   FWWG    + K  W+SWK++C SK   GLGFR++  FN AML KQ                              L
Subjt:  PTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSL

Query:  GSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYK
        GS+P  +WRSI    ++ ++G RWRVG+   I I +D W+P    YK
Subjt:  GSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] | e value: 1.1e-150 | % identity: 36.86
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        +++SW L++ L +F   PW++ GDFN  L  SEK   RQ     I+ F+  LS C L D+GF+G  + W  +       K RLDR VAN    ++ Q   
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGG--GACEIVQKTEESMMKLASW-----------------NF
        V HL+ H SDH  +L    S ++   R    R  +FEESWLL DEC  +++EAWG   G   G   + +K +   ++L +W                   
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGG--GACEIVQKTEESMMKLASW-----------------NF

Query:  NRLKE---------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP
        +RL E               K++D LL+++E YW  R+R +WL+ GDRNTK+FH  AS R+RKN I  I++S G WVE  + +G VA +YF  LF +   
Subjt:  NRLKE---------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP

Query:  NQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKK
        +Q  +E     +  K+ ED R  L   F+  E++ AL  M P+KA GPDG +A F+Q +W I+G+ +    LD LN    L  +  +   LIPKV+ P++
Subjt:  NQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKK

Query:  MERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGD
        M  F PISLC V+YKII+K LANR+KQ L  +I  +Q+ F+PGRLITDNVL+ +E +H +  +  GK+G +A+ LD+SKAYDRVEW ++  +MEK+GF  
Subjt:  MERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGD

Query:  NWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIK
         WI +VM CV +  + + +N  P     P RG+RQGDP+S                                            +++F +A     ETI 
Subjt:  NWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIK

Query:  KFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAI
        + L  YE  SGQ+IN +KS+   S N +          LGVK V+    YL                          GWK  L S  GK++LIKAVAQAI
Subjt:  KFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAI

Query:  PTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSL
        PTYTMS F +P  +C E+  LCA+FWWG VG+++K HW SW KL A K+  G+GFRD+  FN AML KQ                               
Subjt:  PTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSL

Query:  GSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIP
          N    WRS++  + + + G  WRVG+   I    D W+P
Subjt:  GSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIP

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata] | e value: 2.2e-146 | % identity: 35.92
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        R +SW  ++ L      PW+  GDFN+I   +EK GGR + +R ++ F   +++C   +V F G K+ W      G  ++ERLDR +AN   ++     +
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIVQK-TEESMMKLASWN----------FNRLKEK--
        + HL+   SDH  +      K ++  + ++K   RFE  WL    C +IVK AW   +  GA  I++   E     L  WN           + L++K  
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIVQK-TEESMMKLASWN----------FNRLKEK--

Query:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN
                             L+  LE+E+  WR R+R +W + GDRNT +FH  AS R +KN ID I D  G W E E  I  VA+ YF+KLF+SSKP 
Subjt:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN

Query:  QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKM
         E        ++ K+  D   EL R ++  E+  ALK M P KA GPDG    FFQ++W+  GE ++   LD LN      +   +   LIPK+  PK +
Subjt:  QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKM

Query:  ERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDN
          + PISLC V YKI +KA+ANR+K+ L S+I  +Q+ F+ GRLITDNVL+ FE +H I  K  GK G +A+ LDMSKAYDRVEW ++ K+MEKLGF  N
Subjt:  ERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDN

Query:  WINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKK
          + +M+C+ +V Y +KIN  P     P RG+RQGDPLS                                            ++IF KA    C+ +++
Subjt:  WINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKK

Query:  FLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIP
         LG YE  SGQ +N  K++   S N  +E         G ++++    YL                          GWKE+L S  GK++LIKAVA A+P
Subjt:  FLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIP

Query:  TYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLG
        TYTMSCF LP  +C+E+  +  KFWWG V ++ +  W+SW K+C SK   G+GF+++ LFN A+L KQ                             SLG
Subjt:  TYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLG

Query:  SNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMWIREEL-VDGRVADV
        +NP  +WRSI+  + L K+G++WRVG+ A IR+ +D W+P   ++K +  R  L  D RVAD+
Subjt:  SNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMWIREEL-VDGRVADV

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EWI8 Uncharacterized protein | e value: 1.6e-150 | % identity: 37.12
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        R+ SWEL+RRL    D PW++ GDFN+I+   EK G   +    + EF+  L+ C L+D+GF+G +F W    +  + V ERLDR VA    ++      
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGAC-EIVQKTEESMMKLASWNFNRLKE-------------
        ++H+ +  SDH  ++ +  +  +   RN +KR+  FE +WL  + C + + +AW  +Q G A   + QK ++  M L SW+   L++             
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGAC-EIVQKTEESMMKLASWNFNRLKE-------------

Query:  -------------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN
                            EL  LL++EE YWR R+R  WL+ GDRNT +FH  AS RK+ N+I  I+D+   W   E  I  V   YF ++++++ P 
Subjt:  -------------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN

Query:  QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKM
           I+   +++   +  D  +EL +PF++ E+  AL  M+PSKA GPDG  A FFQ +W I+G D++   LD LNK   L SL  +   LIPKVK P+ M
Subjt:  QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKM

Query:  ERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDN
          F PISLC V+YKII+K L NRMK  L  V+  SQ+ F+PGR+I+DN++I FE IH +  K  GK   +A  LDMSKAY+RVEW Y+ K+M KLGF + 
Subjt:  ERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDN

Query:  WINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKK
        W+  +M+CV SV Y + +N  P     P RGLRQGDPLS                                            ++IF +A   +C+ +++
Subjt:  WINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKK

Query:  FLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIP
         L  YE  SGQ IN DK+A   S+N +           G         YL                          GWKE+  S  GK++LIKAV QAIP
Subjt:  FLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIP

Query:  TYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ-TSLGSNPFLTWRSILWGRDLFKQGMRWRVGS
        TY MSCF LP G+C+EI+ +  +FWWG  G+++K HW+S KKLC +K   G+GFRD+  FNQA+L +Q   L  NP      +   + +   G+RWRVG+
Subjt:  TYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ-TSLGSNPFLTWRSILWGRDLFKQGMRWRVGS

Query:  EAHIRIKDDPWIPREGNYK
          +IRI  D WI     Y+
Subjt:  EAHIRIKDDPWIPREGNYK

A0A2N9IPS8 Reverse transcriptase domain-containing protein | e value: 4.3e-148 | % identity: 33.98
Query:  TVFDKKSSAPSLSKKARGDRQRLAGPNG------RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDK
        T + +      +  K +G   RL G  G      RK+SW L++ L     SPW+  GDFN+IL ++E+ G   + +  I +F+  +  C L D+G+ G+ 
Subjt:  TVFDKKSSAPSLSKKARGDRQRLAGPNG------RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDK

Query:  FMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGK--NQGGGACE
        + WRR+ +   +V  RLDR +A++  +       V HL   NSDH  IL         + R  KK+  RFE  W+  ++C +++  AWG    +G     
Subjt:  FMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGK--NQGGGACE

Query:  IVQKTEESMMKLASWNFNR-------LKEK------------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSID
        +V+K +     L  W+  R       +K K                        +L+ LLE+EE +WR R+R  W+  GD+NTK+FH   + R+R N I 
Subjt:  IVQKTEESMMKLASWNFNR-------LKEK------------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSID

Query:  RIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDI
         ++D +G W   +  I  +A++YF+ +F+SS P+ E I    + +++ +      +L+  F+K E+  ALK M P+KA GPDG  A F+Q YWDI+G ++
Subjt:  RIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDI

Query:  SKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGK
        ++  L IL+    L  +  +   LIPKVK P+ +  F PISLC V+YKI++K LANR+K+ L  VI  +Q+ F+PGRLITDNVL+ FE +H++  K  GK
Subjt:  SKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGK

Query:  EGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS---------------------------
        +G +A+ LDMSKAYDRVEW ++  +M  +GF   WI  +M C+ SV Y V IN      F   RG+RQGD LS                           
Subjt:  EGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS---------------------------

Query:  -----------------TMIFLKANWKNCETIKKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL----------------
                         +++F +A   NCE +   L +YE  SGQ +N  K++   +K+ +    R   ++  V  ++S   YL                
Subjt:  -----------------TMIFLKANWKNCETIKKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL----------------

Query:  ----------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLT
                  GWKE+  S  G++VLIKAVAQ+IPTY+MSCF LP+ +C ++N + + FWWG     KKAHW+ W KLC SK   GLGFRD+  FN A+L 
Subjt:  ----------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLT

Query:  KQ----------------------------TSLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYK
        KQ                             +LG+ P   WRSI   R + + G++W +G    ++I +DPW+P   ++K
Subjt:  KQ----------------------------TSLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYK

A0A2N9J6I3 Uncharacterized protein | e value: 6.6e-149 | % identity: 38.1
Query:  GDRQRLAGPNG------RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERL
        G+  RL G  G      R  SWEL+RRL    +  W++ GDFN+I    EK G   +  R +  F+  L+ C L D+GF G +F W    N+G+ V ERL
Subjt:  GDRQRLAGPNG------RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERL

Query:  DRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWG-KNQGGGACEIVQKTEESMMKLASWNFN
        DR VA    ++      ++H  +  SDH  +L + ++     ++  KKR+  FE +WL  + C +++ +AW  ++ G     +VQK ++  M L SW+ +
Subjt:  DRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWG-KNQGGGACEIVQKTEESMMKLASWNFN

Query:  --------------RLKE------------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIG
                      RLKE                  ++L  LL +EE YWR R+R  WL+ GDRNT +FH  A+ RK+ N+I  I+DSN  W   +  I 
Subjt:  --------------RLKE------------------KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIG

Query:  VVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSL
         V   YF  +++SS P    I+  T++++  +      +L  PF++ E+ RAL  M+PSKA GPDG  A FFQ +W I+G D++   LD LN    L SL
Subjt:  VVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSL

Query:  YSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRV
          +   LIPKVK P+ M +F PISLC V+YKII+K L NRMK  L  V+  SQ+ F+PGR+I+DN++I FE +H +  K  GK   +A+ LDMSKAYDRV
Subjt:  YSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRV

Query:  EWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFLKANWKNCETIKKFLGEYEAV-----------SGQTINFD
        EW Y+ K+M KLGF   W+  +M+CV SV Y + +N  P     P RGLRQGDPLS  +FL         ++K   E E+V            GQ IN  
Subjt:  EWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFLKANWKNCETIKKFLGEYEAV-----------SGQTINFD

Query:  KSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEE
        K+A   S+N +        NF G         YL                          GWKE+  S  GK VLIKAV QAIPTY MSCF  P G+CEE
Subjt:  KSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEE

Query:  INKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTWRSILWGRDL
        I+ +  +FWWG     +K HW+S KKLC +K+  G+GFRD+  FNQA+L +Q                              +  N    WRSI   +++
Subjt:  INKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTWRSILWGRDL

Query:  FKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM
         + GMRWRVGS   IRI  D WIP    YK M
Subjt:  FKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM

A0A2P6SDG4 Putative RNA-directed DNA polymerase | e value: 6.4e-152 | % identity: 36.07
Query:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME
        R  +W L+++L    + PW++GGD+N+I   S+K GG  +S RL+++ K  L FC L D+ F G +F W R    G+ V+ RLDRF  +LP  +      
Subjt:  RKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKME

Query:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIV-QKTEESMMKLASW---NFNRLKEK---------
        V HL+   SDH  IL     + ++  +  KK++ +FEE WLL + C ++VK +W    G G  +++  + + +   L SW   +F  ++++         
Subjt:  VEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIV-QKTEESMMKLASW---NFNRLKEK---------

Query:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN
                            +L+ LL +E+ +WR RA+  WLK GD NTK+FH    +RK+KN++  + +++G W    + +  + + YF  LF+SS P 
Subjt:  --------------------ELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN

Query:  --QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPK
          Q ++E     +     E   R L R   K E+  A+KNM+PSK+ GPDG    FFQ +W+++G+DI     +       L ++ S++  LIPKV+VP+
Subjt:  --QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPK

Query:  KMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFG
         M +  PISLC VVYKI +K LANR+K  LDS+I P Q+ F+PGRLI+DN L+ FE  H +  + +GK GY A+ LDMSKAYDRVEW ++  VM K+GFG
Subjt:  KMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFG

Query:  DNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETI
        + WI  +M CV +V Y   IN  P     P RGLRQGD +S                                            + IF +A   +CE +
Subjt:  DNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETI

Query:  KKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQA
        K  L  YE  SGQ +N+ KS    SKN++     G +  LGV  V+    YL                          GW+E+  S  GK+VLIKAVAQA
Subjt:  KKFLGEYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYL--------------------------GWKERLFSMGGKKVLIKAVAQA

Query:  IPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TS
        IP+Y MSCF +P+ +C+E++++ A+FWWG  GD +K HW++W+KLC  K   GLGFR++ +FN A+LTKQ                              
Subjt:  IPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TS

Query:  LGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMW-IREELVDGRVADV
        L      TWRSIL GR + ++GMR++VG    IR+ DDPW+P   +++P   + E L + RVAD+
Subjt:  LGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMW-IREELVDGRVADV

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | e value: 1.0e-149 | % identity: 36.55
Query:  PWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILAS
        PW   GDFN+I+   EK GG  +++  +D F++ ++ C   D+G+ G  + W         +  RLDR +A    I +  +++V HL    SDH  +  S
Subjt:  PWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILAS

Query:  WNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACE-IVQKTEESMMKLASWNFNRLKE------------------------------
            N QI +  + R+  FE  W   ++C  I++  WG        E +    +    +LASWN + L++                              
Subjt:  WNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACE-IVQKTEESMMKLASWNFNRLKE------------------------------

Query:  --KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFE
          +EL+ LL++EE +W  R++  WLK GDRNTK+FH  AS R+++N+I  + D  G W E  D+I   A+ YF+ ++S+S P+  +++  T  I   + E
Subjt:  --KELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFE

Query:  DQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIA
        +   EL R F++ EI  ALK ++P+K+ GPDG  A FFQ YWDI+G ++S + L++LN    L  +  +   LIPK   PK+M  F PISLC V+YK+I+
Subjt:  DQRRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIA

Query:  KALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVK
        K LANR+K  L  +I  +Q+ F   RLITDNVLI +E +H +  K  GK+ ++A  LDMSKA+DRVEW +I +VM K+GF + WI+ +M+C+ SV Y V 
Subjt:  KALANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVK

Query:  INDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKKFLGEYEAVSGQTINFDK
        IN        P RGLRQGDPLS                                            +++F KAN + CE +K+ L +YEA SGQ +N DK
Subjt:  INDCPTSEFYPERGLRQGDPLS--------------------------------------------TMIFLKANWKNCETIKKFLGEYEAVSGQTINFDK

Query:  SACMTSKNINRESVRGFSNFLG-------------------------VKLVESLGYYL-GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEI
        S+   S N   E      N LG                          ++ E +G  L GWK +L S GGK++LIKAVAQAIPTYTMSCF+LPK +C+E+
Subjt:  SACMTSKNINRESVRGFSNFLG-------------------------VKLVESLGYYL-GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEI

Query:  NKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTWRSILWGRDLF
         K+   FWWG    + K  W+SW+K+C  K   GLGFR++  FN A+L KQ                             SLGSNP  TWRSI    ++ 
Subjt:  NKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------TSLGSNPFLTWRSILWGRDLF

Query:  KQGMRWRVGSEAHIRIKDDPWIPREGNYK
        K+G RWRVG+   I I DD W+P    YK
Subjt:  KQGMRWRVGSEAHIRIKDDPWIPREGNYK

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 2.0e-25 | % identity: 23.15
Query:  KKKLRKDQESMETVFDKKSSAPSLSKKARGDRQRLAGPNGRKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDV
        K  +++++ ++  ++   + AP   K+   D QR           +L        DS  +I GDFN  L   +++  RQK  +   E  S L    L+D+
Subjt:  KKKLRKDQESMETVFDKKSSAPSLSKKARGDRQRLAGPNGRKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDV

Query:  GFR--GDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNV--KKRKPRFEESWL---------LFDECN
         +R    K       +       ++D  V +  +++K ++ E+  +  + SDH  I      KN   +R+   K       + W+         +F E N
Subjt:  GFR--GDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNV--KKRKPRFEESWL---------LFDECN

Query:  Q---------------------IVKEAWGKNQGGGACEIVQKTEESMMKLASWNFNRLKEKELDLL------LEEEETYWRLRARKDWLKWGDRNTKWFH
        +                     I   A+ + Q     + +    + + K    +    + +E+  +      +E ++T  ++   + W  + +R  K   
Subjt:  Q---------------------IVKEAWGKNQGGGACEIVQKTEESMMKLASWNFNRLKEKELDLL------LEEEETYWRLRARKDWLKWGDRNTKWFH

Query:  FNA---SHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN-QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDG
          A     ++ KN ID IK+  G        I     EY+K L+++   N +E+  F       +L +++   L RP +  EI   + ++   K+ GPDG
Subjt:  FNA---SHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN-QELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNMNPSKALGPDG

Query:  AHATFFQNYWDIIGEDISKVFLDILNKDGEL-GSLYSSWTKLIPKV-KVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITD
          A F+Q Y + +   + K+F  I  K+G L  S Y +   LIPK  +   K E F PISL  +  KI+ K LANR++Q +  +I+  Q  FIPG     
Subjt:  AHATFFQNYWDIIGEDISKVFLDILNKDGEL-GSLYSSWTKLIPKV-KVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITD

Query:  NVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIF
        N+      I  I    A  + ++ +++D  KA+D+++  ++ K + KLG    ++  +    +     + +N      F  + G RQG PLS ++F
Subjt:  NVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIF

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.3e-16 | % identity: 30.72
Query:  GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTK--------Q
        GW+E+  S  G+  L KAV  ++P ++MS  +LP+ I   +++L   F WG   +KKK H + W K+C+ K+  GLG R     N+A+++K        +
Subjt:  GWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTK--------Q

Query:  TSLGS----------------------NPFLTWRSILWG-RDLFKQGMRWRVGSEAHIRIKDDPWI
         SL +                      +   TWRSI  G RD+   G+ W  G    IR   D W+
Subjt:  TSLGS----------------------NPFLTWRSILWG-RDLFKQGMRWRVGSEAHIRIKDDPWI

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 9.5e-28; % identity: 28.57)
Query:  LEEEETYWRLRARKDWL--KWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN-QELIEFTTKDIKAKLFEDQRRE
        +E   T  R+   + W   K    +         HR  K  I++I++  G      + I      ++K+L+S+   N  E+ +F  +    KL +DQ   
Subjt:  LEEEETYWRLRARKDWL--KWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPN-QELIEFTTKDIKAKLFEDQRRE

Query:  LERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPK-VKVPKKMERFWPISLCKVVYKIIAKALA
        L  P S  EIE  + ++   K+ GPDG  A F+Q + + +   + K+F  I  +     S Y +   LIPK  K P K+E F PISL  +  KI+ K LA
Subjt:  LERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPK-VKVPKKMERFWPISLCKVVYKIIAKALA

Query:  NRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDC
        NR+++ + ++I+P Q  FIPG     N+      IH I       + ++ ++LD  KA+D+++  ++ KV+E+ G    ++N +          +K+N  
Subjt:  NRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDC

Query:  PTSEFYPERGLRQGDPLSTMIF
               + G RQG PLS  +F
Subjt:  PTSEFYPERGLRQGDPLSTMIF

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 7.8e-30; % identity: 29.68)
Query:  LRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIER
        +R+R   L   DR +++F+     +  +  I  +   +G  +E  + I   A  +++ LFS    + +  E     +   + E ++  LE P +  E+ +
Subjt:  LRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIER

Query:  ALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELG-SLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIY
        AL+ M  +K+ G DG    FFQ +WD +G D  +V  +   K GEL  S   +   L+PK    + ++ + P+SL    YKI+AKA++ R+K  L  VI+
Subjt:  ALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELG-SLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIY

Query:  PSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLR
        P Q+  +PGR I DNV +  + +H     G        ++LD  KA+DRV+  Y+   ++   FG  ++  +     S   +VKIN   T+     RG+R
Subjt:  PSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLR

Query:  QGDPLSTMIF
        QG PLS  ++
Subjt:  QGDPLSTMIF

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 2.0e-22; % identity: 35.53)
Query:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFE-GLGFRDISLFNQAMLTKQT---------------------------
        A+P Y MSCF L K +C+++     +FWW    +K+K  W++W+KLC SKE + GLGFRD+  FNQA+L KQ+                           
Subjt:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFE-GLGFRDISLFNQAMLTKQT---------------------------

Query:  -SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM
         S+G+ P   WRSI+ GR+L  +G+   +G   H ++  D WI  E    P+
Subjt:  -SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 6.1e-14; % identity: 24.2)
Query:  DSPWIIGGDFNKILFDSEKNGGRQKS--KRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHK-
        D   I+ GDF++I   S+     Q S   R ++EF++ L    LVD+  RG  + W    +   +++ +LDR +AN    +              SDH  
Subjt:  DSPWIIGGDFNKILFDSEKNGGRQKS--KRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRFVANLPMINKIQKMEVEHLNYHNSDHK-

Query:  --LILASWNSKNRQINR--NVKKRKPRFEES----W----------LLFDECNQIVKEAWGKNQGGGACEIVQKTEESMMKLAS-------------WNF
          +IL +   ++++  R  +     P F  S    W              E  +  K+        G   I  KT+E++  L S             +  
Subjt:  --LILASWNSKNRQINR--NVKKRKPRFEES----W----------LLFDECNQIVKEAWGKNQGGGACEIVQKTEESMMKLAS-------------WNF

Query:  NRLKEKELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP--NQELIEFTTKDIK
          +  K+ +      E+++R ++R  WL+ GD NT++FH      + KN I  ++  +   VE    +  + + Y+  L  S       + ++   KDI 
Subjt:  NRLKEKELDLLLEEEETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKP--NQELIEFTTKDIK

Query:  AKLFEDQ-RRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKV
             D     L    S  EI  A+  M  +KA GPD   A FF   W ++ +       +       L    ++   LIPKV    ++  F P+S C V
Subjt:  AKLFEDQ-RRELERPFSKGEIERALKNMNPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKV

Query:  VYKII
        VYKII
Subjt:  VYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 6.1e-14; % identity: 34.38)
Query:  LANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYV
        +  R+K  + ++I P+QA+FIPGR+ TDN++   E +H++  K  G +G++ + LD+ KAYDR+ W Y+   +   GF + W+ ++ +     R V
Subjt:  LANRMKQALDSVIYPSQATFIPGRLITDNVLIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYV

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.1e-18; % identity: 34.51)
Query:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------T
        A+PTYTM+CF+LPK +C++I  + A FWW    + K  HW +W  L   K   G+GF+DI  FN A+L KQ                             
Subjt:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFEGLGFRDISLFNQAMLTKQ----------------------------T

Query:  SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWI
         LGS P   W+SI   +++ +QG R  VG+   I I    W+
Subjt:  SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.5e-23; % identity: 35.53)
Query:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFE-GLGFRDISLFNQAMLTKQT---------------------------
        A+P Y MSCF L K +C+++     +FWW    +K+K  W++W+KLC SKE + GLGFRD+  FNQA+L KQ+                           
Subjt:  AIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMSWKKLCASKEFE-GLGFRDISLFNQAMLTKQT---------------------------

Query:  -SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM
         S+G+ P   WRSI+ GR+L  +G+   +G   H ++  D WI  E    P+
Subjt:  -SLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPM


Sequences
CDS sequence
ATGTATGGATCAAAAGAGCCATTGGGAACAGAAAACGACAAACCAGAGAGGAGTGAACTGGGTCAGATAATAATTTCTGAAATAGAGGGTCTAATCCAATTAGCTAATGG
GGCTGATGGAAAGGAAAAAAAGAGAGACAGTAAAGAAGACAAGAGATTTCTGTCGGTAGGAAAAAATGAAACTAAAAAAGGGGAAGTAGGGAGTGAGTTGTGGAAAAATG
GAAAAGGAAAAGTTAAAGCGTTAACCAGTGACCACAGTACAAAAATGACAAAGGAAAATTTCAATAGAATAGGAGAAGATACAGAAGAAGTGAGCAGTCATAAAGAAAGG
ATAAATGATTTAGATCCCATGACGATGGATTCTCAAACCACAGGAAGAGTTTTGGTGAATTCTAAGGATAAGGAAAACGAAAACAAGAAAGTCAATTCAAACAGAGGTGT
CAGTCAAATGAGACCGAAATCTGAGTACGGGAAAGGAGCAGGTAAAACTTTGGAGATTAAGAATGGGAAAGAAACAGAGGAAGTCCGAAGGAACAACGTTAAAATGTGGA
AAAGAATCGCTAGAACCAACAATGATATGAGCAATGAGTTCAGACTATCTGAGGAGATACAGAAGAAAAAGCTTAGAAAGGACCAAGAAAGCATGGAAACCGTGTTTGAC
AAAAAAAGCAGTGCACCAAGCCTCTCGAAGAAAGCGAGGGGGGATCGGCAGAGGCTCGCGGGCCCCAATGGCAGGAAAGACTCGTGGGAGCTCATCAGAAGGCTCCACTC
TTTTGACGATTCCCCATGGATTATAGGTGGGGATTTCAACAAGATTTTATTCGATTCCGAAAAGAATGGTGGTAGACAAAAAAGTAAGAGGCTTATAGATGAATTCAAAT
CTACTTTGAGCTTTTGTCATTTGGTGGATGTAGGCTTCAGAGGTGATAAGTTCATGTGGAGAAGAAGAGATAATAAAGGAGATATGGTCAAAGAAAGATTGGATAGATTT
GTGGCTAACTTGCCCATGATTAACAAGATCCAAAAAATGGAGGTGGAGCATCTCAACTACCACAATTCAGACCATAAGCTTATTCTTGCTTCCTGGAACTCTAAAAATAG
GCAAATCAATAGAAATGTGAAGAAAAGAAAACCTAGATTTGAGGAGAGTTGGCTACTTTTTGATGAGTGTAATCAGATTGTGAAAGAGGCTTGGGGTAAAAATCAGGGAG
GGGGAGCTTGTGAGATTGTCCAGAAAACTGAAGAGAGCATGATGAAGCTGGCTTCATGGAACTTTAACAGATTGAAGGAGAAAGAGTTGGACCTTTTGTTAGAAGAAGAA
GAGACGTATTGGAGGCTGAGAGCGAGGAAAGATTGGCTTAAATGGGGTGATAGAAATACTAAATGGTTTCACTTCAATGCCTCTCATAGAAAAAGGAAAAACTCGATTGA
TAGGATTAAGGATTCGAATGGAGCTTGGGTGGAAGGAGAGGACAATATTGGAGTGGTGGCAATAGAGTACTTCAAAAAGCTCTTCTCATCTTCCAAACCCAATCAGGAGT
TGATTGAATTTACTACAAAAGATATCAAAGCCAAGTTATTTGAGGATCAAAGAAGAGAGTTAGAGCGCCCTTTCTCTAAAGGTGAAATTGAGAGAGCTTTGAAAAATATG
AATCCGTCTAAAGCGCTAGGTCCTGATGGTGCCCATGCTACGTTTTTCCAGAATTACTGGGATATCATAGGTGAAGACATCTCAAAAGTTTTCCTTGACATCCTGAACAA
GGATGGTGAGTTAGGGTCATTGTATAGTTCATGGACTAAGCTGATTCCCAAAGTTAAAGTCCCAAAAAAGATGGAAAGATTTTGGCCTATCAGTTTGTGTAAAGTGGTTT
ATAAAATCATAGCTAAAGCCCTAGCTAACAGAATGAAGCAAGCTCTAGATAGTGTTATTTATCCATCTCAGGCGACTTTTATTCCAGGAAGGCTTATAACGGATAATGTG
CTTATCGAGTTTGAGTGTATTCACGCTATTATCTGGAAAGGGGCGGGTAAGGAAGGGTACATTGCCATGAATTTGGATATGAGCAAGGCGTACGATCGTGTGGAATGGGC
TTATATTCATAAAGTCATGGAGAAGTTGGGGTTTGGAGACAATTGGATCAACAAGGTGATGAAATGTGTGGAATCTGTCAGATATGTTGTGAAGATCAATGACTGCCCTA
CCTCAGAGTTCTATCCAGAAAGAGGCCTTCGCCAAGGAGATCCCCTATCCACTATGATATTCCTCAAAGCCAACTGGAAAAATTGTGAGACCATCAAAAAATTCTTGGGA
GAATACGAAGCGGTATCGGGTCAAACTATAAATTTTGACAAATCGGCATGTATGACGAGTAAGAATATCAATAGGGAATCGGTAAGAGGGTTCAGTAACTTCCTTGGGGT
TAAACTTGTTGAGTCTTTGGGATACTACCTTGGTTGGAAGGAAAGGCTTTTCTCTATGGGAGGAAAGAAGGTGTTGATTAAAGCGGTGGCTCAAGCTATTCCAACGTACA
CTATGAGTTGTTTTATGCTCCCAAAAGGTATTTGTGAGGAGATCAATAAGCTTTGTGCTAAGTTCTGGTGGGGCTTTGTAGGGGATAAAAAGAAGGCCCATTGGATGAGT
TGGAAAAAGCTTTGCGCTAGTAAGGAGTTTGAAGGGCTTGGTTTTAGAGATATTAGTTTGTTTAATCAAGCAATGCTCACAAAGCAAACCTCGCTTGGGTCAAACCCTTT
CTTAACTTGGAGAAGTATCCTTTGGGGAAGGGACCTTTTTAAACAAGGCATGAGATGGAGGGTGGGGAGTGAAGCTCATATTAGGATTAAAGATGATCCCTGGATCCCAA
GAGAAGGGAATTATAAACCAATGTGGATTAGGGAAGAGTTAGTTGATGGTAGAGTAGCTGATGTGCAATTTGCATACCGCAAGAAGAAGGCCGAGTCCGTGGAGCACGTG
ATGTGGAACTGCAAAATTGCTAGGTTTGTTTGGGGTCATTTTTTCCCTATCTTACAGGAGTTTTTGGATTTTTTCAAGGATGGATGGGTTGCTAAGGATAAATGGGTGAA
GCTGCTGGAGTTGATCAAAGCTGAAGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATATGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGA
GCTGGGAACATAATCGTACAAGATGGAATTCACTCCTTCCTGACTTTAGGGAAGCAGATGAGTGTTCCCTTAAGTGGTTACTCCGAGTCTTGAACAAAGGGCCATACTCT
CTCAATGGCACGAGAGGGTTTTCTGTTTGA
mRNA sequence
ATGTATGGATCAAAAGAGCCATTGGGAACAGAAAACGACAAACCAGAGAGGAGTGAACTGGGTCAGATAATAATTTCTGAAATAGAGGGTCTAATCCAATTAGCTAATGG
GGCTGATGGAAAGGAAAAAAAGAGAGACAGTAAAGAAGACAAGAGATTTCTGTCGGTAGGAAAAAATGAAACTAAAAAAGGGGAAGTAGGGAGTGAGTTGTGGAAAAATG
GAAAAGGAAAAGTTAAAGCGTTAACCAGTGACCACAGTACAAAAATGACAAAGGAAAATTTCAATAGAATAGGAGAAGATACAGAAGAAGTGAGCAGTCATAAAGAAAGG
ATAAATGATTTAGATCCCATGACGATGGATTCTCAAACCACAGGAAGAGTTTTGGTGAATTCTAAGGATAAGGAAAACGAAAACAAGAAAGTCAATTCAAACAGAGGTGT
CAGTCAAATGAGACCGAAATCTGAGTACGGGAAAGGAGCAGGTAAAACTTTGGAGATTAAGAATGGGAAAGAAACAGAGGAAGTCCGAAGGAACAACGTTAAAATGTGGA
AAAGAATCGCTAGAACCAACAATGATATGAGCAATGAGTTCAGACTATCTGAGGAGATACAGAAGAAAAAGCTTAGAAAGGACCAAGAAAGCATGGAAACCGTGTTTGAC
AAAAAAAGCAGTGCACCAAGCCTCTCGAAGAAAGCGAGGGGGGATCGGCAGAGGCTCGCGGGCCCCAATGGCAGGAAAGACTCGTGGGAGCTCATCAGAAGGCTCCACTC
TTTTGACGATTCCCCATGGATTATAGGTGGGGATTTCAACAAGATTTTATTCGATTCCGAAAAGAATGGTGGTAGACAAAAAAGTAAGAGGCTTATAGATGAATTCAAAT
CTACTTTGAGCTTTTGTCATTTGGTGGATGTAGGCTTCAGAGGTGATAAGTTCATGTGGAGAAGAAGAGATAATAAAGGAGATATGGTCAAAGAAAGATTGGATAGATTT
GTGGCTAACTTGCCCATGATTAACAAGATCCAAAAAATGGAGGTGGAGCATCTCAACTACCACAATTCAGACCATAAGCTTATTCTTGCTTCCTGGAACTCTAAAAATAG
GCAAATCAATAGAAATGTGAAGAAAAGAAAACCTAGATTTGAGGAGAGTTGGCTACTTTTTGATGAGTGTAATCAGATTGTGAAAGAGGCTTGGGGTAAAAATCAGGGAG
GGGGAGCTTGTGAGATTGTCCAGAAAACTGAAGAGAGCATGATGAAGCTGGCTTCATGGAACTTTAACAGATTGAAGGAGAAAGAGTTGGACCTTTTGTTAGAAGAAGAA
GAGACGTATTGGAGGCTGAGAGCGAGGAAAGATTGGCTTAAATGGGGTGATAGAAATACTAAATGGTTTCACTTCAATGCCTCTCATAGAAAAAGGAAAAACTCGATTGA
TAGGATTAAGGATTCGAATGGAGCTTGGGTGGAAGGAGAGGACAATATTGGAGTGGTGGCAATAGAGTACTTCAAAAAGCTCTTCTCATCTTCCAAACCCAATCAGGAGT
TGATTGAATTTACTACAAAAGATATCAAAGCCAAGTTATTTGAGGATCAAAGAAGAGAGTTAGAGCGCCCTTTCTCTAAAGGTGAAATTGAGAGAGCTTTGAAAAATATG
AATCCGTCTAAAGCGCTAGGTCCTGATGGTGCCCATGCTACGTTTTTCCAGAATTACTGGGATATCATAGGTGAAGACATCTCAAAAGTTTTCCTTGACATCCTGAACAA
GGATGGTGAGTTAGGGTCATTGTATAGTTCATGGACTAAGCTGATTCCCAAAGTTAAAGTCCCAAAAAAGATGGAAAGATTTTGGCCTATCAGTTTGTGTAAAGTGGTTT
ATAAAATCATAGCTAAAGCCCTAGCTAACAGAATGAAGCAAGCTCTAGATAGTGTTATTTATCCATCTCAGGCGACTTTTATTCCAGGAAGGCTTATAACGGATAATGTG
CTTATCGAGTTTGAGTGTATTCACGCTATTATCTGGAAAGGGGCGGGTAAGGAAGGGTACATTGCCATGAATTTGGATATGAGCAAGGCGTACGATCGTGTGGAATGGGC
TTATATTCATAAAGTCATGGAGAAGTTGGGGTTTGGAGACAATTGGATCAACAAGGTGATGAAATGTGTGGAATCTGTCAGATATGTTGTGAAGATCAATGACTGCCCTA
CCTCAGAGTTCTATCCAGAAAGAGGCCTTCGCCAAGGAGATCCCCTATCCACTATGATATTCCTCAAAGCCAACTGGAAAAATTGTGAGACCATCAAAAAATTCTTGGGA
GAATACGAAGCGGTATCGGGTCAAACTATAAATTTTGACAAATCGGCATGTATGACGAGTAAGAATATCAATAGGGAATCGGTAAGAGGGTTCAGTAACTTCCTTGGGGT
TAAACTTGTTGAGTCTTTGGGATACTACCTTGGTTGGAAGGAAAGGCTTTTCTCTATGGGAGGAAAGAAGGTGTTGATTAAAGCGGTGGCTCAAGCTATTCCAACGTACA
CTATGAGTTGTTTTATGCTCCCAAAAGGTATTTGTGAGGAGATCAATAAGCTTTGTGCTAAGTTCTGGTGGGGCTTTGTAGGGGATAAAAAGAAGGCCCATTGGATGAGT
TGGAAAAAGCTTTGCGCTAGTAAGGAGTTTGAAGGGCTTGGTTTTAGAGATATTAGTTTGTTTAATCAAGCAATGCTCACAAAGCAAACCTCGCTTGGGTCAAACCCTTT
CTTAACTTGGAGAAGTATCCTTTGGGGAAGGGACCTTTTTAAACAAGGCATGAGATGGAGGGTGGGGAGTGAAGCTCATATTAGGATTAAAGATGATCCCTGGATCCCAA
GAGAAGGGAATTATAAACCAATGTGGATTAGGGAAGAGTTAGTTGATGGTAGAGTAGCTGATGTGCAATTTGCATACCGCAAGAAGAAGGCCGAGTCCGTGGAGCACGTG
ATGTGGAACTGCAAAATTGCTAGGTTTGTTTGGGGTCATTTTTTCCCTATCTTACAGGAGTTTTTGGATTTTTTCAAGGATGGATGGGTTGCTAAGGATAAATGGGTGAA
GCTGCTGGAGTTGATCAAAGCTGAAGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATATGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGA
GCTGGGAACATAATCGTACAAGATGGAATTCACTCCTTCCTGACTTTAGGGAAGCAGATGAGTGTTCCCTTAAGTGGTTACTCCGAGTCTTGAACAAAGGGCCATACTCT
CTCAATGGCACGAGAGGGTTTTCTGTTTGA
Protein sequence
MYGSKEPLGTENDKPERSELGQIIISEIEGLIQLANGADGKEKKRDSKEDKRFLSVGKNETKKGEVGSELWKNGKGKVKALTSDHSTKMTKENFNRIGEDTEEVSSHKER
INDLDPMTMDSQTTGRVLVNSKDKENENKKVNSNRGVSQMRPKSEYGKGAGKTLEIKNGKETEEVRRNNVKMWKRIARTNNDMSNEFRLSEEIQKKKLRKDQESMETVFD
KKSSAPSLSKKARGDRQRLAGPNGRKDSWELIRRLHSFDDSPWIIGGDFNKILFDSEKNGGRQKSKRLIDEFKSTLSFCHLVDVGFRGDKFMWRRRDNKGDMVKERLDRF
VANLPMINKIQKMEVEHLNYHNSDHKLILASWNSKNRQINRNVKKRKPRFEESWLLFDECNQIVKEAWGKNQGGGACEIVQKTEESMMKLASWNFNRLKEKELDLLLEEE
ETYWRLRARKDWLKWGDRNTKWFHFNASHRKRKNSIDRIKDSNGAWVEGEDNIGVVAIEYFKKLFSSSKPNQELIEFTTKDIKAKLFEDQRRELERPFSKGEIERALKNM
NPSKALGPDGAHATFFQNYWDIIGEDISKVFLDILNKDGELGSLYSSWTKLIPKVKVPKKMERFWPISLCKVVYKIIAKALANRMKQALDSVIYPSQATFIPGRLITDNV
LIEFECIHAIIWKGAGKEGYIAMNLDMSKAYDRVEWAYIHKVMEKLGFGDNWINKVMKCVESVRYVVKINDCPTSEFYPERGLRQGDPLSTMIFLKANWKNCETIKKFLG
EYEAVSGQTINFDKSACMTSKNINRESVRGFSNFLGVKLVESLGYYLGWKERLFSMGGKKVLIKAVAQAIPTYTMSCFMLPKGICEEINKLCAKFWWGFVGDKKKAHWMS
WKKLCASKEFEGLGFRDISLFNQAMLTKQTSLGSNPFLTWRSILWGRDLFKQGMRWRVGSEAHIRIKDDPWIPREGNYKPMWIREELVDGRVADVQFAYRKKKAESVEHV
MWNCKIARFVWGHFFPILQEFLDFFKDGWVAKDKWVKLLELIKAEDCPLICMGESGQYADSISLPFWGQDRMGSWEHNRTRWNSLLPDFREADECSLKWLLRVLNKGPYS
LNGTRGFSV