; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012566 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012566
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:42262374..42265723
RNA-Seq ExpressionLag0012566
SyntenyLag0012566
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]5.0e-22839.87Show/hide
Query:  LAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGD-DKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLW
        L  W+K+  +G      N  I  +K  +++PL   D +++ K E ++  +L +EE+YWK RSR DWL+ GDKNTK+FHS+AS R++KN I G+ D QG W
Subjt:  LAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGD-DKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLW

Query:  VEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILN
        V++ E I      +F++LFTSSNP   +I +A++ +  +VS E    +  PF+  +I +AL  M PTKAPGPDG  A FFQ +W IVGE +++TCL ILN
Subjt:  VEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILN

Query:  EEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLD
        E+  +  LN T+IALIPKV  P+++ ++RPISLCN VY+I++KA+ANR+KP+L+ IIS +QSAFIP RLI+DNV++ +EC+H +   +  + G +A KLD
Subjt:  EEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLD

Query:  MSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSL
        +SKAYDRVEW F+ Q M  LGF   W+  IM CI T  FSVLING    + KP RG+RQG PLSPY+FILCAE  S L ++ E    + GLK      ++
Subjt:  MSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSL

Query:  THLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKAL
        THL FADDSL+ S+A+ ++C+ ++ I   Y   SGQ  N +KSS   S   S +    + +   +K+     +YLG+P   GRNK   F +VK KV   +
Subjt:  THLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKAL

Query:  HGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKH
          W  +LFSAGGKEILIKAV QA+P Y MS FK+PK  C+DI +  A+FWWG+  DK   HW  W  + ++K  GG+GFR+L  FNQA++AKQ WR++++
Subjt:  HGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKH

Query:  PNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHWIDHIVK
        PNSL+++V++ RYY+NS F  AK+GSNPSF WRSILWG ++  KG+RWRIG+G+ + + +D WI     F+P+    L     V  LID E  W    ++
Subjt:  PNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHWIDHIVK

Query:  GNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGI
         +F+  D EAIL I L +   +DE++W  D KG +SVKS Y LA +     +   S+ S     WK  W      + KI  WRA++N LP   N+ K+  
Subjt:  GNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGI

Query:  DTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEE---DRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIK
             C  C+ + E+ +HV   CK ++K+W+       A   +   ++   +F   +++++      + +L +   W IW+ RN    +G K D + +  
Subjt:  DTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEE---DRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIK

Query:  EVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRFL----
        + ++    +  + +        +  + Q  W PP     K+N DA+        GLG + RD+ G  +  G K+   + ++   E EA+   L+      
Subjt:  EVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRFL----

Query:  ---LIESADCSEILLL--------------AVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVD
           LI  +DC E++ L                +V   +     V FS   R  N  AH+LA+ A+ +S  D
Subjt:  ---LIESADCSEILLL--------------AVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVD

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]4.1e-22240.13Show/hide
Query:  MKLAAWNKDRLKGSIHGAVNRKIME---------IKELEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTI
        +++ A    +   +I+G + +KI +         ++EL++D       ++++  +E+  LL++EE YW  R++  WL+ GD+NTK+FH++AS R+K+NTI
Subjt:  MKLAAWNKDRLKGSIHGAVNRKIME---------IKELEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTI

Query:  NGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGED
         GI D QG W + EE I   A  YF  +++SS+P   +I++  + I  +V++E  E + R F+K E+  AL+ + P KAPGPDG  A+FFQ YW IVG +
Subjt:  NGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGED

Query:  ISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSG
        ++   L +LN    +  LN T I+LIPK ++PKRM  +RPISLCN VYK+ISK LANR+KP+L  IIS +QSAF   RLI+DNVLVAFE +H L++K +G
Subjt:  ISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSG

Query:  KEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHG
        KEGF+A KLDMSKA+DRVEW FI ++M ++GF   W   +M CI +V +S+LINGVA     PSRG+RQGDPLSP +F+LCAEGLSAL ++      + G
Subjt:  KEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHG

Query:  LKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFW
        + IN  CP +THLFFADDS++  +A    C  +R IL +YE  SGQ++N DKSS   S N +++    + N LG   ++   +YLG+PS  GR+K+++F 
Subjt:  LKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFW

Query:  KVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAML
         +K+KV   L GWK +L S GGKEILIKAV QAIP YTMSCF +P+  CDD++R+   FWWG    + K  W+SWKR+C SK +GG+GFR L+ FN AML
Subjt:  KVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAML

Query:  AKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFK---PVLVNKLLEGAYVNAL
        AKQ+WRI+ +PNSLV +VL+ RY+   D L AK+GS+PS++WRSI    E+  +G RWR+GNG+ I I ED W+ +   +K   P + N   E   V++L
Subjt:  AKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFK---PVLVNKLLEGAYVNAL

Query:  IDEEGHWID-HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAG-KASCSDPSEARADWKAFWKSKAKPREKICAWRAV
        ID +  W     ++  F+P + E IL IPL     +D+++W  + KG FSVKSAY++A S+++   +  CS+    R  WK  W      + KI AWRA 
Subjt:  IDEEGHWID-HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAG-KASCSDPSEARADWKAFWKSKAKPREKICAWRAV

Query:  QNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYF--FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV
         + LP   NI K+GI  +  C +C    E   H    C+ +  VW ++  +P++             ++    L      +  +L   + W IW  RN +
Subjt:  QNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYF--FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV

Query:  --KIKGLKPDHKAIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKW---QI
              L P    ++    AN    ++ +    +   +  R SQ  W  PP G +K+N D +  +      +G + RDSNG  + A  K +  ++   Q+
Subjt:  --KIKGLKPDHKAIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKW---QI

Query:  KCLEME-------------AMLESLRFLLI----ESADCSEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLA
        + L +E              +LE     +I    +++  +E+  +   + +++ S    +F    R  N VAH LA+LA
Subjt:  KCLEME-------------AMLESLRFLLI----ESADCSEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.8e-23341.23Show/hide
Query:  LAAWNKDRLKGSIHGAVNRKIMEIKE-----LEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDR
        L+ WNK      I G + RKI E KE     +  D       +++   KE+ +LL+ EEI W+ RSR  WL  GD+NTK+FH++AS R+++NTINGI D 
Subjt:  LAAWNKDRLKGSIHGAVNRKIMEIKE-----LEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDR

Query:  QGLWVEEEERIGAVANDYFERLFTSSNP-RIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTC
         G W +  E I  VA  YF+ +++SS P RI E+  AI T    V++E    + + F++ EIE AL  M PTKAPGPDG  A+FFQ YW+IVG DI    
Subjt:  QGLWVEEEERIGAVANDYFERLFTSSNP-RIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTC

Query:  LKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFI
        L +LN    +  +N T I L+PK+ +P +M  +RPISLCN VYK+ISK LANR+K +L +IIS +QSAF+ GRLI+DNVLVAFE +H L +K+ GKEGF 
Subjt:  LKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFI

Query:  ATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINN
        A KLDMSKAYDRVEW FI+Q+M K+GF E W++ +M CI +V +S+L+NG A     P+RG+RQGDP+SPYIF+LCA+G S+L +       + G+ I  
Subjt:  ATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINN

Query:  YCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDK
         CP +THLFFADDSL+  +AN   C+T+  IL+ YE  SGQ++N+DKSS   S N   +    +   LG        +YLG+PS  G++K ++F +VK++
Subjt:  YCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDK

Query:  VWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSW
        V + L GWKE+L S GG+EILIKAV QAIP YTMSCF+IPK  C++I+ +  +FWWG  G + K  W+SWK+LC++K+NGGMGFR L+ FN AMLAKQ W
Subjt:  VWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSW

Query:  RIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEG-AYVNALID-EEGH
        R+I +PNSLV+++ + RYY + D   AK+G++PS+TWRSI  G E+  +G RWR+GNG  I I ED W+ +   +K +   K  +    V+ALID E   
Subjt:  RIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEG-AYVNALID-EEGH

Query:  WIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSL---MEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALP
        W D +V+  F+P +A  IL+IPL     +D+I+W  + KG FSVKSAYY+A  +   +E G++S  D       W+  W     P+ +I AW+   NALP
Subjt:  WIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSL---MEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALP

Query:  CQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKP
          LN+++KG++    C  C  + ES  H+F  C+V+++VW  +  +      L      I++    +       D ++   + W IW  RN +  + L  
Subjt:  CQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKP

Query:  DHK-----AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEA
          +     AI   +E    +  Y + +           S G W  PP G +KIN D +  E      +G + RD+ G    A    +  ++ ++  E+EA
Subjt:  DHK-----AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEA

Query:  MLESLRFLLIESADCSEILL----------------------LAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLA
        +      LL +     +I++                      +   + ++ +S G    +  +RE N+ AH LA+ A
Subjt:  MLESLRFLLIESADCSEILL----------------------LAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLA

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]1.4e-22741.19Show/hide
Query:  LAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGDDK------VDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITD
        L +WN+     ++ G + +KIME K    + +   D +      +++  KEL  LL+ EEI W+ RS+  W R GD+NTK+FH+RAS R+KKN+I+ + +
Subjt:  LAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGDDK------VDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITD

Query:  RQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTC
          G+W + +E I A A  YFE ++TSS+P    I + I  I +RV+DE    + + F+  E+ KAL+ + PTKAPGPDG  A FF +YWDIVG  I+   
Subjt:  RQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTC

Query:  LKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFI
        L +LN    ++ +N T I+LIPK + P RM ++RPISLCNT YKIISK LANR K +L  IIS +QSAF P RLI+DNVLVAFE +H LN+K  GKE ++
Subjt:  LKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFI

Query:  ATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINN
        + KLDMSKA+DRVEW+FI+ +M KLGF E W+  IM C+ +V +SVLING A     PSRGIRQGDPLSP +F+LCAEGLSAL         ++G+ I  
Subjt:  ATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINN

Query:  YCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDK
         CP +THLFFADDSL+  +A +  C  +  IL  YE  SGQ++N DKSS   S N S++    + N LG    +   +YLG+PS  G++K ++F +VKD+
Subjt:  YCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDK

Query:  VWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSW
        V K L GWK +L S GG+EILIKAV QA+P YTMSCF++PK  C D++ L   FWWG    + K  W+SW+++CRSK +GGMGFR ++ FN AMLAKQ W
Subjt:  VWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSW

Query:  RIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--VNALID-EEG
        RI+ +PNSL+++V + +Y+   D L +K GSNPS+ WRSI    ++  KG RWR+GNGR I I +D W+ +    K V+  ++  G +  V++LID +  
Subjt:  RIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--VNALID-EEG

Query:  HWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEA-RADWKAFWKSKAKPREKICAWRAVQNALPC
         W   ++   F+P +A  IL IPL     +D ++W  + +G F+VKSAYY+A  L+++ +   S    +    WK  W+ K  P+ KI AWR   N LP 
Subjt:  HWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEA-RADWKAFWKSKAKPREKICAWRAVQNALPC

Query:  QLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV--KIKGLK
          N+  +G+  +  C LC    E+  H    C+ ++  W ++        + C   EI ++    +       D +L   + W IW  RN    +  G  
Subjt:  QLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV--KIKGLK

Query:  PDHKAIIKEVEANRVNWNYLERVEANPL--KLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAML
        P        ++A  +    L   +A  L   L   L    W  PPSG +KIN+DA+  +D+    +G V RD  G  + A  K +   +  +  E  AM 
Subjt:  PDHKAIIKEVEANRVNWNYLERVEANPL--KLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAML

Query:  E--------SLRFLLIESADCSEILLLAVEVEA------------MAASIGGVSFSSCRRESNEVAHSLARLA
        E         +   + ES   S I  ++ ++              +AAS    SF   +RE N+VAH LAR+A
Subjt:  E--------SLRFLLIESADCSEILLLAVEVEA------------MAASIGGVSFSSCRRESNEVAHSLARLA

XP_023912601.1 uncharacterized protein LOC112024185 [Quercus suber]1.0e-22041.32Show/hide
Query:  LLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMR
        +++ EEI W+ RS+  W R GD+NTK+FH+RAS R+KKN+I+G+ +  G+W + +E I A A  YFE ++TSS+P    I + I  I +RV+DE    + 
Subjt:  LLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMR

Query:  RPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRM
        + F+  E+ KAL+ + PTKAP PDG  A+FF +YWDIVG  I+   L +LN    ++ +N T I+LIPK + P RM ++RPISLCNT YKIISK LANR 
Subjt:  RPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRM

Query:  KPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATK
        K +L  IIS +QSAF P RLI+DNVLVAF+ +H LN+K  GKE +++ KLDMSKA+DRVEW+FI+ +M KLGF E W+  IM C+ +V +SVLING A  
Subjt:  KPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATK

Query:  VFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSK
           PSRGIRQGDPLSP +F+LCA+GLSAL         ++G+ I   CP +THLFFADDSL+  +A +  C  +  IL  YE  S Q++N DKSS   S 
Subjt:  VFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSK

Query:  NISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKF
        N S++    + N LG    +   +YLG+PS  G++K ++F +VKD+V K L GWK +L S GG+EILIKAV Q +P YTMSCF++PK  C D++ L   F
Subjt:  NISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKF

Query:  WWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWR
        WWG    + K  W+SW+++CRSK +GGMGFR ++ FN AMLAKQ WRI+ +PNSL+++V + +Y+   D L +K GSNPS+ WRSI    ++  KG RWR
Subjt:  WWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWR

Query:  IGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--VNALID-EEGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQS
        +GNGR I I +D W+ +    K V+  +   G +  V++LID +   W   ++   F+P +A  IL IPL     +D ++W  +  G F+VKSAYY+A  
Subjt:  IGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--VNALID-EEGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQS

Query:  LMEA-GKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCR
        L+++  +   +  S     WK  W+ K  P+ KI AWR   N LP   N+  +G+  +  C LC    E+  H    C+ ++  W ++        + C 
Subjt:  LMEA-GKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCR

Query:  REEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV--KIKGLKPDHKAIIKEVEANRVNWNYLERVEANPL--KLESRLSQGSWNPPPSGCWKINSD
          EI ++    +       D +L   + W IW  RN    +  G  P        ++A  +    L   +A  L   L   L    W  PP+G +KIN+D
Subjt:  REEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSV--KIKGLKPDHKAIIKEVEANRVNWNYLERVEANPL--KLESRLSQGSWNPPPSGCWKINSD

Query:  ASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLE--------SLRFLLIESADCSEILLLAVEVEA------------MAASIGGVS
        A+  +D+    +G V RD  G  + A  K +   +  +  E  A+ E         +   + ES   S I  ++ ++              +AAS    S
Subjt:  ASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLE--------SLRFLLIESADCSEILLLAVEVEA------------MAASIGGVS

Query:  FSSCRRESNEVAHSLARLA
        F   +RE N+VAH LAR+A
Subjt:  FSSCRRESNEVAHSLARLA

TrEMBL top hitse value%identityAlignment
A0A2N9FN47 Uncharacterized protein8.3e-22141.48Show/hide
Query:  MKLAAWNKDRLKGSIHGAVNRKIMEIKELE-KDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQG
        MKL  W++ +++ +     ++K   ++ELE + P      +V+   KELC L+ +EEI+WK RSR  WL  GD NT++FH  AS RKK NT++G+ D   
Subjt:  MKLAAWNKDRLKGSIHGAVNRKIMEIKELE-KDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQG

Query:  LWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKI
        +W  + + I ++  +YF  LF+SSNP    I +  Q +D RV+ E    +  PFS  EI+ AL  M P+KAPGPDG  ALFFQ YW++VG  ++   L  
Subjt:  LWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKI

Query:  LNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATK
        LN  + +  +N T I LIPKV  P  M  +RPISLCN +YKIISK L NRMK +L ++IS SQSAF+PGR+I+DN+LV+FE +H L NKR GK G +A K
Subjt:  LNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATK

Query:  LDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCP
        LDMSKAYDRVEW ++R I+LKLGF E WV  IM C+ +V +SVL+NG      KP RG+RQGDPLSPY+F++CAEGLSAL  + E    +HG+ I    P
Subjt:  LDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCP

Query:  SLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWK
         ++HLFFADDS+I   A  S+C+ +  +L  YE  SGQ+VN  K++   S N ++    ++    G   +    +YLG+P   GR K + F  +KD+V +
Subjt:  SLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWK

Query:  ALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRII
         L GWKE+L S  G+EILIKAV+QAIP Y MSCFK+P   C +I  +  K+WWG  G ++K HW+S ++L  +K+ GGMGFREL +FN AMLA+Q WR+I
Subjt:  ALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRII

Query:  KHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLE-GAYVNALID-EEGHWID
        + PNSL+ +VL+ +Y+ N  FL A +  NPS TWRSI   +++ + G+ WR+G G  I+I +D W+      K +   ++LE  A V  LI+ E+G W  
Subjt:  KHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLE-GAYVNALID-EEGHWID

Query:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIV
         ++   F+PSDAE I  IPL +R  +D+++W  + KG FSVK+AY L          S S  S  +  W   W +K +P+ +   WRA +N LP Q  + 
Subjt:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIV

Query:  KKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKE---EDRDLAVGIIWHIWNYRNSVKIKGLKPDHK
         K I ++  C  C  + E+G H+ W C  +Q+VW      S     +    ++   F   L    ++    + ++ + + W +W  RN +  +G+  +  
Subjt:  KKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKE---EDRDLAVGIIWHIWNYRNSVKIKGLKPDHK

Query:  AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFA
         I     A+ V   +LE      ++ E   S  SW PP  G +K++    +  D    G+G + RD  G  + A
Subjt:  AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFA

A0A2N9GWG4 Uncharacterized protein6.3e-22141.48Show/hide
Query:  MKLAAWNKDRLKGSIHGAVNRKIMEIKELE-KDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQG
        MKL  W++ +++ +     ++K   ++ELE + P      +V+   KELC L+ +EEI+WK RSR  WL  GD NT++FH  AS RKK NT++G+ D   
Subjt:  MKLAAWNKDRLKGSIHGAVNRKIMEIKELE-KDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQG

Query:  LWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKI
        +W  + + I ++  +YF  LF+SSNP    I +  Q +D RV+ E    +  PFS  EI+ AL  M P+KAPGPDG  ALFFQ YW++VG  ++   L  
Subjt:  LWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKI

Query:  LNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATK
        LN  + +  +N T I LIPKV  P  M  +RPISLCN +YKIISK L NRMK +L ++IS SQSAF+PGR+I+DN+LV+FE +H L NKR GK G +A K
Subjt:  LNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATK

Query:  LDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCP
        LDMSKAYDRVEW ++R I+LKLGF E WV  IM C+ +V +SVL+NG      KP RG+RQGDPLSPY+F++CAEGLSAL  + E    +HG+ I    P
Subjt:  LDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCP

Query:  SLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWK
         ++HLFFADDS+I   A  S+C+ +  +L  YE  SGQ+VN  K++   S N ++    ++    G   +    +YLG+P   GR K + F  +KD+V +
Subjt:  SLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWK

Query:  ALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRII
         L GWKE+L S  G+EILIKAV+QAIP Y MSCFK+P   C +I  +  K+WWG  G ++K HW+S ++L  +K+ GGMGFREL +FN AMLA+Q WR+I
Subjt:  ALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRII

Query:  KHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLE-GAYVNALID-EEGHWID
        + PNSL+ +VL+ +Y+ N  FL A +  NPS TWRSI   +++ + G+ WR+G G  I+I +D W+      K +   ++LE  A V  LI+ E+G W  
Subjt:  KHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLE-GAYVNALID-EEGHWID

Query:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIV
         ++   F+PSDAE I  IPL +R  +D+++W  + KG FSVK+AY L          S S  S  +  W   W +K +P+ +   WRA +N LP Q  + 
Subjt:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIV

Query:  KKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKE---EDRDLAVGIIWHIWNYRNSVKIKGLKPDHK
         K I ++  C  C  + E+G H+ W C  +Q+VW      S     +    ++   F   L    ++    + ++ + + W +W  RN +  +G+  +  
Subjt:  KKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKE---EDRDLAVGIIWHIWNYRNSVKIKGLKPDHK

Query:  AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFA
         I     A+ V   +LE      ++ E   S  SW PP  G +K++    +  D    G+G + RD  G  + A
Subjt:  AIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFA

A0A7N2L6Z9 Reverse transcriptase domain-containing protein1.3e-22641.64Show/hide
Query:  KLAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLW
        +LA+WN   L+        ++ M  K  E+D       +++   +EL  LL++EEI+W  RS+  WL+ GD+NTK+FH+RAS R+K+NTI+G+ D+ G W
Subjt:  KLAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLW

Query:  VEEEERIGAVANDYFERLFTSSNP-RIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKIL
         E+ + I   A  YFE ++++SNP  +DE+  AI T+   +++E    + R F++ EI  AL+ + PTK+PGPDG  A+FFQ YWDIVG ++S   L +L
Subjt:  VEEEERIGAVANDYFERLFTSSNP-RIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKIL

Query:  NEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKL
        N    +  +N T I LIPK S+PKRM  +RPISLCN +YK+ISK LANR+K  L  II+ +QSAF   RLI+DNVL+A+E +H L +K+ GK+ F+ATKL
Subjt:  NEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKL

Query:  DMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPS
        DMSKA+DRVEW FI ++M K+GF E W+  IM CI +V +SV+ING       P+RG+RQGDPLSPY+F+LCAEGLSAL         L+G+ +   CP 
Subjt:  DMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPS

Query:  LTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKA
        +THLFFADDSL+  +AN+  CE ++ IL++YEA SGQ+VN DKSS   S N + +    + N LG    +   +YLG+PS  GR+K  +F ++K++V   
Subjt:  LTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKA

Query:  LHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIK
        L GWK +L S+GGKEILIKAV QAIP YTMSCF +PK+ CD+++++   FWWG    + K  W+SW+++C+ K  GG+GFR L  FN A+LAKQ+WRI+ 
Subjt:  LHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIK

Query:  HPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEG-AYVNALIDEEGHW--ID
        +P SL +++L+ +Y+   D L A +GSNPS+TWRSI    E+  KG RWR+GNGR I I +D W+ S   +K +   ++ E    V++LID +  W  ID
Subjt:  HPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEG-AYVNALIDEEGHW--ID

Query:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGK-ASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNI
         I +  F+P DAEAIL IPL      D I+W  + KG FSVKSAY++A +L+E+ +   CS        WK  WK     + KI AWRA  N LP   N+
Subjt:  HIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGK-ASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNI

Query:  VKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAV--GIIWHIWNYRNSVKIKGLKPDHK
          +G++ N+ C +C  + E   H    C  +  VW       W    L    E        L  L     R L +   I W IW+ RN   ++    D  
Subjt:  VKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAV--GIIWHIWNYRNSVKIKGLKPDHK

Query:  AIIKEVE-ANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDK-GAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLR
        + ++  E A R+  +Y   ++ +   ++S L    W+ PP G +K+N D +   D  G+ G+G V RD +G  I A  K +   +  +  E+ A+ + L 
Subjt:  AIIKEVE-ANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDK-GAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLR

Query:  FLLIESADCSEILLLAVEVEAMAA------------SIGGV----------SFSSCRRESNEVAHSLARLA
         LL +  D  +I+L +  + A+ A             + G+          SFS  +R+ N VAH LA+ A
Subjt:  FLLIESADCSEILLLAVEVEAMAA------------SIGGV----------SFSSCRRESNEVAHSLARLA

A0A803QEG9 Uncharacterized protein5.4e-22037.65Show/hide
Query:  KLAAWNKDRLKGSIHGAVNRKI----MEIKELEKDPLFPGD--DKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGIT
        KL  W+  +     +G + +KI     ++++L   P   G   + +  AE  L +LLE+EE+YW+ RSR DWL  GDKNTK+FH++A+ RK  N I  + 
Subjt:  KLAAWNKDRLKGSIHGAVNRKI----MEIKELEKDPLFPGD--DKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGIT

Query:  DRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRT
        +  G        I ++ +DY+  +FT+       ++  +  I   V+  + E +  PF+ L++E AL++M   K PG DG  A+F+Q +W IVG+ +SR 
Subjt:  DRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRT

Query:  CLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGF
         L +LN   D S  N T I LIPK+  P  +  +RPISLCN + K+++K +  R K +L  +IS +QSAF+  RLI+DNVLVAFE +HA+ NK SG+ G 
Subjt:  CLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGF

Query:  IATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKIN
         + KLDMSKA+DRVEWTFI ++M K+GF   W+  IM C+ T +FS L+NG  T    P+RG+RQG PLSPY+F++CAEGLS L   E+   NLHG K+ 
Subjt:  IATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKIN

Query:  NYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKD
           P ++HL FADDSL+  +A++S+C  I+R+L  Y   SGQ +N DKS    S N +    +     L + +S    +YLG+PS SGR+K +LF  +K+
Subjt:  NYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKD

Query:  KVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQS
        ++WK +  W E+LFSAGG+E+L+KAVVQ+IP Y MSCF++P   C+ ++ + A FWWG   +  K HW SW  LC++K  GGMGFR    FNQA+LAKQ+
Subjt:  KVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQS

Query:  WRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHW
        WRI+++PNSL+S +L+ +Y+ ++ FL A MG +PS TW+ I WGREL   G+RW+IG GR ++   DPW+     F P+  +    G   N LI +E  W
Subjt:  WRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHW

Query:  IDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLN
           +++  F P D E IL +PL    ++D+++W     GNF+V+SAY+LA SL     +S S  + A   WK FW  +   + KI AWR + +ALP   +
Subjt:  IDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLN

Query:  IVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYF-FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLK----
        +V++ I T+  C +C+   ES  H  +GCK ++ VW  F F  +W+     +  + +M     L  ++ + + +    I+W IW  RN V + G      
Subjt:  IVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYF-FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLK----

Query:  ----------------PDHKAIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKIC
                          HK +         +     +  + P    + LS  SW PP + C+K+N+DA+        G+G V RD++GS   A    I 
Subjt:  ----------------PDHKAIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKIC

Query:  KKWQIKCLEMEAMLESLRFLLIESADC---------------------SEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDDS
          ++   +E +A+  SL + L +                         SE   L  +V ++ +    VS S   R +N  AH LA+ A+   G+D++
Subjt:  KKWQIKCLEMEAMLESLRFLLIESADC---------------------SEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDDS

A0A803QH07 Uncharacterized protein7.0e-22039.18Show/hide
Query:  GDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQT
        G  ++ ++EK L +LL  EE YW  RSR DWL+ GDKNTK+FH++AS RK  NTI  +T+  GL V  +E +  V   Y+E+LF S +   D +Q  I  
Subjt:  GDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQT

Query:  IDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCN
        I   +     + +  PFS  E+  AL+SM P K+PG DG  A+F+Q+YWDIVG  +++  L ILN+  +++ LN + I LIPKV +P  M  YRPISLCN
Subjt:  IDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCN

Query:  TVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIE
         +YK+ISK +  R + +L  +IS +QSAF+  RLI+DN+LVAFE IH L +K  G+ G+ A KLDMSKA+DRVEW ++  +MLK+GF   WV  IM CI 
Subjt:  TVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIE

Query:  TVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSG
        T  FS  +NG      +P RG+RQGDPLSPY+F++C+EGLS L   EE+  +L GL++    PS++HL FADDSL+  +A + +   ++R L  Y   SG
Subjt:  TVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSG

Query:  QRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIP
        Q +N DKS    S N S       + +L + +++    YLG+PS SGR+K +LF  +K+KVWK LH W E++FS GGKE+L+KAVVQ+IP Y MSCFK+ 
Subjt:  QRVNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIP

Query:  KATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSI
        K  C+ ++ + A FWWG+  +  K HW  WK LC+SK  GGMGFR    FNQA+LAKQ+WRI   P+SL+S++L+ RY+  + FL A +G +PS+TW+SI
Subjt:  KATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSI

Query:  LWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNF
         WGREL +KG+R+++GNG +I   +DPWI S D F+PV      +   V+ALI++   W   ++   F P D E IL+IPL   +++D ++W     G++
Subjt:  LWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNF

Query:  SVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWE--YF
        +VKS ++LA  L +  ++S SD +  R  WK FW     P+ +I AW+ +Q+ LP    + K+ +  +  C LC S  ES  H  +GC  ++ +W+   F
Subjt:  SVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWE--YF

Query:  FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEANRVNWNYLERVEANPLKLESRLS---QGS---
          D    +S+        ++   L  ++ +ED +L + ++W IW  RN V   G      +II        +++  + + +      + LS   Q S   
Subjt:  FPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEANRVNWNYLERVEANPLKLESRLS---QGS---

Query:  -------WNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRF---------------------LLIESAD
               W PP    +K+N DA+   D+   G+G + R  +G  + A  K +   ++   +E +A+  +L +                     L    +D
Subjt:  -------WNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRF---------------------LLIESAD

Query:  CSEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDD
         S    L  ++  + +    V  S  +R +N+ AH LAR A+   G+D+
Subjt:  CSEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDD

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog9.3e-4425.42Show/hide
Query:  LKGSIHGAVNRKIMEIKELEKD----PLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEE
        LK +    VN  +  +K+LEK+    P      ++ K   EL ++  +  I    +S+  +    +K  K   +    ++ K+ I+ I +       +  
Subjt:  LKGSIHGAVNRKIMEIKELEKD----PLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEE

Query:  RIGAVANDYFERLFTSSNPRIDEIQQAIQTID-KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKIL-NEEK
         I  + N+Y+++L++     + EI Q ++     R+S +  E + RP S  EI   ++++   K+PGPDG  + F+Q++     E++    L +  N EK
Subjt:  RIGAVANDYFERLFTSSNPRIDEIQQAIQTID-KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKIL-NEEK

Query:  DVSPLNTTW---IALIPKV-SDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKL
        +    NT +   I LIPK   DP R + YRPISL N   KI++K L NR++  + +II   Q  FIPG     N+  +   I  + NK   K+  I + +
Subjt:  DVSPLNTTW---IALIPKV-SDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKL

Query:  DMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPS
        D  KA+D ++  F+ + + K+G E ++++ I         ++++NGV  K F    G RQG PLSP +F +  E L+     E+++  +H   I +    
Subjt:  DMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPS

Query:  LTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGI----KLSNSIGEYLGMPSQSGRNKNKLFWKVKDK
        L+   FADD ++     + +   +  ++KEY  VSG ++N  KS   +  N + Q    + +S+      K    +G YL    +    +N  +  ++ +
Subjt:  LTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKSSCMVSKNISKQHALLLSNSLGI----KLSNSIGEYLGMPSQSGRNKNKLFWKVKDK

Query:  VWKALHGWKERLFSAGGKEILIKAVV--QAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQ
        + + ++ WK    S  G+  ++K  +  +AI N+     K P +   D++++   F W      +K   ++   L    + GG+   +L ++ ++++ K 
Subjt:  VWKALHGWKERLFSAGGKEILIKAVV--QAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQ

Query:  SW
        +W
Subjt:  SW

P0C2F6 Putative ribonuclease H protein At1g657503.8e-3725.92Show/hide
Query:  MPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGG
        MP    R     F ++ ++V   + GW+E+  S  G+  L KAV+ ++P ++MS   +P++  + +D+L   F WGS  +KKK H + W ++C  K+ GG
Subjt:  MPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGG

Query:  MGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYY----RNSDFLGAKMGSNPSFTWRSILWG-RELFMKGMRWRIGNGRYIKICEDPWIGSIDGFK
        +G R  +  N+A+++K  WR+++  NSL + VL+ +Y+    R+S +L  K   + S TWRSI  G R++   G+ W  G+G+ I+   D W+      K
Subjt:  MGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYY----RNSDFLGAKMGSNPSFTWRSILWG-RELFMKGMRWRIGNGRYIKICEDPWIGSIDGFK

Query:  PVLVNKLLEGAYVNALIDEEGHW----------IDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEA
        P+L     E       +  +  W          ID     N        +L++  GAR   D + W     G FSV+SAY +  ++ E  + + +     
Subjt:  PVLVNKLLEGAYVNALIDEEGHW----------IDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEA

Query:  RADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLF
         + +   WK +   R K   W     A+  +    ++ +  +  C +C+   ES  HV   C     +W    P         RR++    F + L +  
Subjt:  RADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLF

Query:  KEE--DRD---------LAVGIIWHIWNYR--NSVKIKGLKPDHKAIIKE--VEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCE
         +   DR          +   IIW  W +R  N         D    +KE  VE  R +   +  V     ++E  +    W  P  G  K+N+D +   
Subjt:  KEE--DRD---------LAVGIIWHIWNYR--NSVKIKGLKPDHKAIIKE--VEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCE

Query:  DKGAGGLGWVARDSNGS
        + G    G V RD  G+
Subjt:  DKGAGGLGWVARDSNGS

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-3924.52Show/hide
Query:  KKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTID-KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSY
        + K  IN I + +G    + E I      +++RL+++    +DE+ + +      +++ +  + +  P S  EIE  + S+   K+PGPDG  A F+Q++
Subjt:  KKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTID-KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSY

Query:  WDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPK-VSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIH
         + +   + +   KI  E    +      I LIPK   DP +++ +RPISL N   KI++K LANR++  +  II   Q  FIPG     N+  +   IH
Subjt:  WDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPK-VSDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIH

Query:  ALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDRE
         +N  +      I+  LD  KA+D+++  F+ +++ + G +  ++  I         ++ +NG   +      G RQG PLSPY+F +  E L+    ++
Subjt:  ALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDRE

Query:  ESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKS-SCMVSKNISKQHALLLSNSLGIKLSNSIGEYLG--MPS
        + +    G++I      ++ L  ADD ++     +++   +  ++  +  V G ++N +KS + + +KN   +  +  +    I ++N+I +YLG  +  
Subjt:  ESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVNLDKS-SCMVSKNISKQHALLLSNSLGIKLSNSIGEYLG--MPS

Query:  QSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVV--QAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGM
        +     +K F  +K ++ + L  WK+   S  G+  ++K  +  +AI  +     KIP    ++++    KF W +     K   ++   L   + +GG+
Subjt:  QSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVV--QAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGM

Query:  GFRELEIFNQAMLAKQSW
           +L+++ +A++ K +W
Subjt:  GFRELEIFNQAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.0e-3726.29Show/hide
Query:  CKLLEEEEIYWKM----------RSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTID
        C+ LE +E    M          RSR   L   D+ +++F++    +  +  I  +    G  +E+ E I   A  +++ LF+      D  ++    + 
Subjt:  CKLLEEEEIYWKM----------RSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRIDEIQQAIQTID

Query:  KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTV
          VS+  +E +  P +  E+ +ALR M   K+PG DG    FFQ +WD +G D  R   +   + +         ++L+PK  D + +  +RP+SL +T 
Subjt:  KRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTV

Query:  YKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETV
        YKI++KA++ R+K +L E+I   QS  +PGR I DNV +  + +H    +R+G        LD  KA+DRV+  ++   +    F   +V  +     + 
Subjt:  YKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRKIMGCIETV

Query:  RFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQR
           V IN   T      RG+RQG PLS  ++ L  E    L  +      L GL +      +    +ADD +IL   +  + E  +   + Y A S  R
Subjt:  RFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQR

Query:  VNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGM-PSQSGRNKNKLFWKVKDKVWKALHGWK--ERLFSAGGKEILIKAVVQAIPNYTMSCFKI
        +N  KSS ++  ++      L      I   + I +YLG+  S      ++ F ++++ V   L  WK   ++ S  G+ ++I  +V +   Y + C   
Subjt:  VNLDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGM-PSQSGRNKNKLFWKVKDKVWKALHGWK--ERLFSAGGKEILIKAVVQAIPNYTMSCFKI

Query:  PKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMG
         +     I R    F W  +G     HW+S        + GG G
Subjt:  PKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMG

P93295 Uncharacterized mitochondrial protein AtMg003104.2e-3646.85Show/hide
Query:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE-NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLG
        A+P Y MSCF++ K  C  +     +FWW S  +K+K  W++W++LC+SKE +GG+GFR+L  FNQA+LAKQS+RII  P++L+S++LR RY+ +S  + 
Subjt:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE-NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLG

Query:  AKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWI
          +G+ PS+ WRSI+ GREL  +G+   IG+G + K+  D WI
Subjt:  AKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.9e-1629.47Show/hide
Query:  EIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRI--DEIQQAIQTIDKRVSDENREGMRRPF
        E +++ +SR  WL+ GD NT++FH      + KN I  +     + VE   ++  +   Y+  L  S +  +  D +Q+       R +D     +    
Subjt:  EIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGAVANDYFERLFTSSNPRI--DEIQQAIQTIDKRVSDENREGMRRPF

Query:  SKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIIS
        S  EI  A+ +M   KAPGPD   A FF   W +V +       +       +   N T I LIPKV+   ++  +RP+S C  VYKII+
Subjt:  SKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKVSDPKRMDQYRPISLCNTVYKIIS

AT3G09510.1 Ribonuclease H-like superfamily protein5.1e-2923.19Show/hide
Query:  LRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEG---HWIDHIVKGNFIP
        ++ RY+++   L AK+    S+ W S+L G  L  KG R  IG+G+ I+I  D  + S    +P+   +  +   +N L + +G    W D  +      
Subjt:  LRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEG---HWIDHIVKGNFIP

Query:  SDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVC
        SD   I  I L      D+I+W  +  G ++V+S Y+L          + + P  +       W     P+ K   WRA+  AL     +  +G+  +  
Subjt:  SDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVC

Query:  CFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILR----QLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEA
        C  C  + ES  H  + C  +   W     DS   R+     +       IL         +  + L V +IW IW  RN+V     +      +   +A
Subjt:  CFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILR----QLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEA

Query:  NRVNWNYLERVEANPLKLESRLSQG--SWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRFLLIE---
           +W    +          ++++    W  PP+   K N DA +   K     GW+ R+  G+PI  G  K+         E +A+L +L+   I    
Subjt:  NRVNWNYLERVEANPLKLESRLSQG--SWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLESLRFLLIE---

Query:  ----SADCSEILLL-------------AVEVEAMAASIGGVSFSSCRRESNEVAHSLAR----LAVFHSG
              DC  ++ L               ++   A     + F   RR+ N++AH LA+     + F+SG
Subjt:  ----SADCSEILLL-------------AVEVEAMAASIGGVSFSSCRRESNEVAHSLAR----LAVFHSG

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.9e-2121.78Show/hide
Query:  YLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE
        YLG+P  + +     +  + +K+   +  W  R  S  G+  LI +V+ ++ N+ MS F++P A   +ID +C+ F W       K   ++W  +C  K+
Subjt:  YLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE

Query:  NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPW--IG---SID
         GG+G R L+  N+       W I                   +  LG       S+ W+ IL  R L    ++  I NG       D W  IG    + 
Subjt:  NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPW--IG---SID

Query:  GFKPVL-VNKLLEGAYVNALIDE--EGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADW
        G +  + +   L  +   A+++     H  D +++   + ++         G  S +D + W    KGN  +    +  +    A +       + + +W
Subjt:  GFKPVL-VNKLLEGAYVNALIDE--EGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADW

Query:  -KAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKV-------WEYFFPDSWAFRSLCRREEIIMEFPRIL
         K  W S A P+  + AW A++N L     ++      +  C LC    E+  H+F+ C  S +V       ++      W  R+  R  E+     +++
Subjt:  -KAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKV-------WEYFFPDSWAFRSLCRREEIIMEFPRIL

Query:  RQ----LFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEANRVNWNYLER-VEANPLKLESRLSQ
        +     + K  D +L    + ++ N   SV+ +  K   + I +  +++ ++ + L+R +E   L L+S  SQ
Subjt:  RQ----LFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEANRVNWNYLER-VEANPLKLESRLSQ

AT4G29090.1 Ribonuclease H-like superfamily protein1.8e-6327.13Show/hide
Query:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGA
        A+P YTM+CF +PK  C  I  + A FWW +  + K  HW +W  L   K  GG+GF+++E FN A+L KQ WR++  P SL++KV + RY+  SD L A
Subjt:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGA

Query:  KMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--------VNALIDEEG-HWIDHIVKGNFIPSDAEAILN
         +GS PSF W+SI   +E+  +G R  +GNG  I I    W+ S      + + ++    Y        V+ LIDE G  W   +++  F   + + I  
Subjt:  KMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWIGSIDGFKPVLVNKLLEGAY--------VNALIDEEG-HWIDHIVKGNFIPSDAEAILN

Query:  IPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLM--EAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRS
        +  G R   D   W     G+++VKS Y++   ++   +     S+PS     ++  WKS+  P+ +   W+ + N+LP    +  + +     C  C S
Subjt:  IPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLM--EAGKASCSDPSEARADWKAFWKSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRS

Query:  KEESGAHVFWGCKVSQKVWEYF-----FPDSWAFRSLCRREEIIMEFPRILR----QLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEAN
         +E+  H+ + C  ++  W            WA       + I +    +          E+   L   ++W +W  RN +  +G + + + +++  E +
Subjt:  KEESGAHVFWGCKVSQKVWEYF-----FPDSWAFRSLCRREEIIMEFPRILR----QLFKEEDRDLAVGIIWHIWNYRNSVKIKGLKPDHKAIIKEVEAN

Query:  RVNWNYLERVEANPLKLE-SRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAM---------------
           W      E+   K + +R S G W PPP    K N+DA+W  D    G+GWV R+  G   + G + + K   +   E+EAM               
Subjt:  RVNWNYLERVEANPLKLE-SRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAM---------------

Query:  -LESLRFLLIESADCSEI----LLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDDSSFAFLDS
          ES   +LIE  +  EI         +++ + +    V F    RE N +A  +AR ++     D   ++ + S
Subjt:  -LESLRFLLIESADCSEI----LLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDDSSFAFLDS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.0e-3746.85Show/hide
Query:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE-NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLG
        A+P Y MSCF++ K  C  +     +FWW S  +K+K  W++W++LC+SKE +GG+GFR+L  FNQA+LAKQS+RII  P++L+S++LR RY+ +S  + 
Subjt:  AIPNYTMSCFKIPKATCDDIDRLCAKFWWGSVGDKKKAHWMSWKRLCRSKE-NGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLG

Query:  AKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWI
          +G+ PS+ WRSI+ GREL  +G+   IG+G + K+  D WI
Subjt:  AKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKICEDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTGGCGGCTTGGAACAAAGACAGGTTGAAAGGTTCTATCCATGGCGCGGTTAACAGGAAGATCATGGAAATCAAAGAATTGGAAAAAGACCCTCTTTTTCCAGG
AGACGATAAGGTAGATAAAGCTGAAAAGGAGTTGTGCAAGCTCCTTGAAGAAGAGGAGATATATTGGAAGATGAGATCGAGGGAGGATTGGCTTCGTTGGGGTGACAAGA
ATACCAAATGGTTCCATTCAAGAGCCTCTCATAGGAAGAAGAAGAATACGATTAATGGTATAACAGATCGTCAGGGGTTGTGGGTGGAGGAAGAAGAAAGGATCGGAGCA
GTTGCAAATGATTATTTTGAACGGTTGTTTACCTCCTCCAATCCGAGAATCGATGAGATCCAGCAAGCTATTCAAACCATTGACAAAAGAGTGTCGGATGAGAACAGAGA
GGGTATGAGAAGGCCTTTCTCCAAGCTGGAAATTGAAAAGGCTTTGAGAAGCATGAAGCCCACAAAAGCTCCAGGTCCTGATGGAGCCCACGCCTTGTTCTTCCAAAGCT
ATTGGGACATTGTGGGAGAAGATATCTCTAGGACTTGCCTCAAGATTCTGAACGAAGAGAAAGATGTAAGCCCCCTTAATACCACGTGGATCGCGCTAATCCCTAAGGTT
TCTGACCCTAAGAGAATGGATCAATATAGACCGATCAGTTTGTGTAATACGGTTTATAAAATCATTTCAAAAGCTCTAGCAAATAGGATGAAACCTATGCTAGACGAGAT
CATTTCCCTCTCCCAATCGGCTTTTATCCCGGGGAGGCTCATTTCTGATAATGTTTTGGTTGCCTTTGAGTGTATTCATGCCTTAAATAATAAAAGATCTGGGAAGGAAG
GGTTCATTGCAACAAAGCTTGATATGAGCAAAGCGTATGACCGAGTGGAATGGACTTTCATTCGTCAGATTATGCTCAAGTTGGGGTTTGAAGAGAGCTGGGTGAGGAAA
ATCATGGGATGTATCGAAACAGTCCGATTTTCTGTGCTGATCAATGGGGTAGCTACGAAAGTTTTCAAGCCGAGCAGGGGCATTCGCCAAGGAGATCCTCTTTCTCCCTA
CATTTTCATTCTCTGTGCCGAAGGACTCTCGGCGCTTTTCGACAGGGAAGAAAGCCTTTCTAACCTTCATGGCTTAAAAATTAATAATTATTGTCCTTCTTTAACTCATC
TTTTCTTTGCTGATGACAGTTTGATTTTGTCCAGGGCCAATCAATCGAATTGCGAGACGATTAGGCGAATCCTAAAGGAGTATGAAGCGGTCTCGGGCCAGCGGGTGAAC
CTAGACAAGTCCTCTTGTATGGTTAGCAAAAACATCAGCAAGCAACACGCCCTTCTTCTCAGTAACTCCTTGGGAATCAAGCTATCAAACTCTATTGGAGAATATTTGGG
AATGCCTTCCCAATCTGGGAGGAACAAAAATAAGCTCTTTTGGAAAGTTAAGGATAAAGTATGGAAGGCTCTTCATGGTTGGAAGGAAAGGCTCTTCTCTGCTGGTGGAA
AGGAAATCCTGATTAAAGCCGTGGTGCAAGCTATCCCAAACTACACAATGAGTTGCTTCAAAATTCCGAAGGCTACCTGCGATGACATTGATAGGCTTTGTGCAAAGTTT
TGGTGGGGCTCGGTGGGAGACAAGAAGAAGGCTCACTGGATGAGCTGGAAAAGATTATGCCGGAGTAAAGAGAATGGGGGTATGGGTTTTAGGGAGCTCGAAATTTTCAA
CCAAGCTATGCTTGCAAAGCAAAGCTGGAGAATTATTAAGCACCCTAATAGCCTTGTCTCCAAAGTTTTAAGAGGCAGATATTACAGAAACTCGGATTTCCTTGGGGCCA
AGATGGGGTCTAATCCCTCTTTTACATGGAGAAGTATTTTATGGGGCAGGGAGCTCTTTATGAAGGGGATGAGATGGAGGATTGGGAACGGTAGGTATATCAAAATCTGT
GAAGATCCTTGGATTGGATCCATAGATGGATTTAAACCTGTCTTGGTTAATAAGTTGCTGGAGGGTGCTTATGTTAACGCCCTCATTGATGAAGAGGGGCATTGGATCGA
TCACATCGTAAAAGGCAACTTTATCCCCTCTGATGCAGAAGCAATTCTCAACATCCCTTTGGGAGCTCGAAGCTCTAAGGATGAAATTGTTTGGGGGATGGATCCTAAAG
GAAATTTCTCGGTTAAAAGTGCCTATTACCTTGCCCAATCGTTGATGGAAGCAGGGAAAGCCTCCTGTTCAGATCCCTCGGAGGCTAGAGCAGATTGGAAGGCCTTTTGG
AAGTCTAAAGCAAAGCCTAGAGAAAAAATCTGTGCTTGGAGGGCTGTCCAAAATGCTCTTCCATGCCAATTAAACATTGTTAAGAAGGGAATTGATACTAATGTCTGTTG
TTTTTTGTGCAGGTCTAAGGAAGAGTCGGGCGCTCATGTTTTCTGGGGTTGCAAAGTTTCTCAAAAAGTTTGGGAATATTTTTTCCCCGACTCATGGGCTTTTCGCTCTC
TATGCAGGAGGGAGGAGATTATTATGGAATTTCCTAGGATCCTGAGACAACTTTTCAAGGAGGAAGATAGGGACCTTGCGGTTGGTATCATATGGCATATATGGAACTAC
AGAAATTCTGTCAAAATCAAAGGTTTAAAGCCAGACCACAAAGCTATCATTAAAGAGGTGGAGGCTAACAGAGTCAACTGGAATTACCTGGAACGCGTTGAAGCCAATCC
GCTTAAGTTGGAGAGTCGTCTGAGTCAGGGGAGTTGGAATCCTCCTCCGAGTGGTTGCTGGAAGATCAATTCAGATGCTTCCTGGTGTGAAGACAAAGGAGCGGGGGGTT
TAGGCTGGGTCGCCCGTGACTCAAATGGGTCTCCAATTTTCGCCGGTTTCAAAAAGATTTGCAAAAAATGGCAGATTAAATGTTTGGAAATGGAGGCTATGCTTGAGAGC
TTAAGATTCTTACTGATCGAATCGGCTGACTGCTCTGAGATCCTCTTGCTGGCGGTCGAGGTCGAGGCTATGGCGGCTTCAATTGGGGGTGTTTCATTCTCCTCGTGCCG
GCGGGAAAGCAACGAAGTAGCGCATTCTTTGGCGCGTTTGGCTGTCTTCCACAGTGGCGTGGATGATTCTAGCTTTGCTTTTTTGGATTCTCCCAATTCGGGAGAAGATT
GTTTGCATGTAAACTGTTTGCCTCCATGGCTGAACTCTTCGCCTGCTTTGAGGAGTTGTTTTGGAGGTTGTAGCTTGGTTGTTCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACTGGCGGCTTGGAACAAAGACAGGTTGAAAGGTTCTATCCATGGCGCGGTTAACAGGAAGATCATGGAAATCAAAGAATTGGAAAAAGACCCTCTTTTTCCAGG
AGACGATAAGGTAGATAAAGCTGAAAAGGAGTTGTGCAAGCTCCTTGAAGAAGAGGAGATATATTGGAAGATGAGATCGAGGGAGGATTGGCTTCGTTGGGGTGACAAGA
ATACCAAATGGTTCCATTCAAGAGCCTCTCATAGGAAGAAGAAGAATACGATTAATGGTATAACAGATCGTCAGGGGTTGTGGGTGGAGGAAGAAGAAAGGATCGGAGCA
GTTGCAAATGATTATTTTGAACGGTTGTTTACCTCCTCCAATCCGAGAATCGATGAGATCCAGCAAGCTATTCAAACCATTGACAAAAGAGTGTCGGATGAGAACAGAGA
GGGTATGAGAAGGCCTTTCTCCAAGCTGGAAATTGAAAAGGCTTTGAGAAGCATGAAGCCCACAAAAGCTCCAGGTCCTGATGGAGCCCACGCCTTGTTCTTCCAAAGCT
ATTGGGACATTGTGGGAGAAGATATCTCTAGGACTTGCCTCAAGATTCTGAACGAAGAGAAAGATGTAAGCCCCCTTAATACCACGTGGATCGCGCTAATCCCTAAGGTT
TCTGACCCTAAGAGAATGGATCAATATAGACCGATCAGTTTGTGTAATACGGTTTATAAAATCATTTCAAAAGCTCTAGCAAATAGGATGAAACCTATGCTAGACGAGAT
CATTTCCCTCTCCCAATCGGCTTTTATCCCGGGGAGGCTCATTTCTGATAATGTTTTGGTTGCCTTTGAGTGTATTCATGCCTTAAATAATAAAAGATCTGGGAAGGAAG
GGTTCATTGCAACAAAGCTTGATATGAGCAAAGCGTATGACCGAGTGGAATGGACTTTCATTCGTCAGATTATGCTCAAGTTGGGGTTTGAAGAGAGCTGGGTGAGGAAA
ATCATGGGATGTATCGAAACAGTCCGATTTTCTGTGCTGATCAATGGGGTAGCTACGAAAGTTTTCAAGCCGAGCAGGGGCATTCGCCAAGGAGATCCTCTTTCTCCCTA
CATTTTCATTCTCTGTGCCGAAGGACTCTCGGCGCTTTTCGACAGGGAAGAAAGCCTTTCTAACCTTCATGGCTTAAAAATTAATAATTATTGTCCTTCTTTAACTCATC
TTTTCTTTGCTGATGACAGTTTGATTTTGTCCAGGGCCAATCAATCGAATTGCGAGACGATTAGGCGAATCCTAAAGGAGTATGAAGCGGTCTCGGGCCAGCGGGTGAAC
CTAGACAAGTCCTCTTGTATGGTTAGCAAAAACATCAGCAAGCAACACGCCCTTCTTCTCAGTAACTCCTTGGGAATCAAGCTATCAAACTCTATTGGAGAATATTTGGG
AATGCCTTCCCAATCTGGGAGGAACAAAAATAAGCTCTTTTGGAAAGTTAAGGATAAAGTATGGAAGGCTCTTCATGGTTGGAAGGAAAGGCTCTTCTCTGCTGGTGGAA
AGGAAATCCTGATTAAAGCCGTGGTGCAAGCTATCCCAAACTACACAATGAGTTGCTTCAAAATTCCGAAGGCTACCTGCGATGACATTGATAGGCTTTGTGCAAAGTTT
TGGTGGGGCTCGGTGGGAGACAAGAAGAAGGCTCACTGGATGAGCTGGAAAAGATTATGCCGGAGTAAAGAGAATGGGGGTATGGGTTTTAGGGAGCTCGAAATTTTCAA
CCAAGCTATGCTTGCAAAGCAAAGCTGGAGAATTATTAAGCACCCTAATAGCCTTGTCTCCAAAGTTTTAAGAGGCAGATATTACAGAAACTCGGATTTCCTTGGGGCCA
AGATGGGGTCTAATCCCTCTTTTACATGGAGAAGTATTTTATGGGGCAGGGAGCTCTTTATGAAGGGGATGAGATGGAGGATTGGGAACGGTAGGTATATCAAAATCTGT
GAAGATCCTTGGATTGGATCCATAGATGGATTTAAACCTGTCTTGGTTAATAAGTTGCTGGAGGGTGCTTATGTTAACGCCCTCATTGATGAAGAGGGGCATTGGATCGA
TCACATCGTAAAAGGCAACTTTATCCCCTCTGATGCAGAAGCAATTCTCAACATCCCTTTGGGAGCTCGAAGCTCTAAGGATGAAATTGTTTGGGGGATGGATCCTAAAG
GAAATTTCTCGGTTAAAAGTGCCTATTACCTTGCCCAATCGTTGATGGAAGCAGGGAAAGCCTCCTGTTCAGATCCCTCGGAGGCTAGAGCAGATTGGAAGGCCTTTTGG
AAGTCTAAAGCAAAGCCTAGAGAAAAAATCTGTGCTTGGAGGGCTGTCCAAAATGCTCTTCCATGCCAATTAAACATTGTTAAGAAGGGAATTGATACTAATGTCTGTTG
TTTTTTGTGCAGGTCTAAGGAAGAGTCGGGCGCTCATGTTTTCTGGGGTTGCAAAGTTTCTCAAAAAGTTTGGGAATATTTTTTCCCCGACTCATGGGCTTTTCGCTCTC
TATGCAGGAGGGAGGAGATTATTATGGAATTTCCTAGGATCCTGAGACAACTTTTCAAGGAGGAAGATAGGGACCTTGCGGTTGGTATCATATGGCATATATGGAACTAC
AGAAATTCTGTCAAAATCAAAGGTTTAAAGCCAGACCACAAAGCTATCATTAAAGAGGTGGAGGCTAACAGAGTCAACTGGAATTACCTGGAACGCGTTGAAGCCAATCC
GCTTAAGTTGGAGAGTCGTCTGAGTCAGGGGAGTTGGAATCCTCCTCCGAGTGGTTGCTGGAAGATCAATTCAGATGCTTCCTGGTGTGAAGACAAAGGAGCGGGGGGTT
TAGGCTGGGTCGCCCGTGACTCAAATGGGTCTCCAATTTTCGCCGGTTTCAAAAAGATTTGCAAAAAATGGCAGATTAAATGTTTGGAAATGGAGGCTATGCTTGAGAGC
TTAAGATTCTTACTGATCGAATCGGCTGACTGCTCTGAGATCCTCTTGCTGGCGGTCGAGGTCGAGGCTATGGCGGCTTCAATTGGGGGTGTTTCATTCTCCTCGTGCCG
GCGGGAAAGCAACGAAGTAGCGCATTCTTTGGCGCGTTTGGCTGTCTTCCACAGTGGCGTGGATGATTCTAGCTTTGCTTTTTTGGATTCTCCCAATTCGGGAGAAGATT
GTTTGCATGTAAACTGTTTGCCTCCATGGCTGAACTCTTCGCCTGCTTTGAGGAGTTGTTTTGGAGGTTGTAGCTTGGTTGTTCCCTAA
Protein sequenceShow/hide protein sequence
MKLAAWNKDRLKGSIHGAVNRKIMEIKELEKDPLFPGDDKVDKAEKELCKLLEEEEIYWKMRSREDWLRWGDKNTKWFHSRASHRKKKNTINGITDRQGLWVEEEERIGA
VANDYFERLFTSSNPRIDEIQQAIQTIDKRVSDENREGMRRPFSKLEIEKALRSMKPTKAPGPDGAHALFFQSYWDIVGEDISRTCLKILNEEKDVSPLNTTWIALIPKV
SDPKRMDQYRPISLCNTVYKIISKALANRMKPMLDEIISLSQSAFIPGRLISDNVLVAFECIHALNNKRSGKEGFIATKLDMSKAYDRVEWTFIRQIMLKLGFEESWVRK
IMGCIETVRFSVLINGVATKVFKPSRGIRQGDPLSPYIFILCAEGLSALFDREESLSNLHGLKINNYCPSLTHLFFADDSLILSRANQSNCETIRRILKEYEAVSGQRVN
LDKSSCMVSKNISKQHALLLSNSLGIKLSNSIGEYLGMPSQSGRNKNKLFWKVKDKVWKALHGWKERLFSAGGKEILIKAVVQAIPNYTMSCFKIPKATCDDIDRLCAKF
WWGSVGDKKKAHWMSWKRLCRSKENGGMGFRELEIFNQAMLAKQSWRIIKHPNSLVSKVLRGRYYRNSDFLGAKMGSNPSFTWRSILWGRELFMKGMRWRIGNGRYIKIC
EDPWIGSIDGFKPVLVNKLLEGAYVNALIDEEGHWIDHIVKGNFIPSDAEAILNIPLGARSSKDEIVWGMDPKGNFSVKSAYYLAQSLMEAGKASCSDPSEARADWKAFW
KSKAKPREKICAWRAVQNALPCQLNIVKKGIDTNVCCFLCRSKEESGAHVFWGCKVSQKVWEYFFPDSWAFRSLCRREEIIMEFPRILRQLFKEEDRDLAVGIIWHIWNY
RNSVKIKGLKPDHKAIIKEVEANRVNWNYLERVEANPLKLESRLSQGSWNPPPSGCWKINSDASWCEDKGAGGLGWVARDSNGSPIFAGFKKICKKWQIKCLEMEAMLES
LRFLLIESADCSEILLLAVEVEAMAASIGGVSFSSCRRESNEVAHSLARLAVFHSGVDDSSFAFLDSPNSGEDCLHVNCLPPWLNSSPALRSCFGGCSLVVP