CuGenDBv2

Lag0015975 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015975
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:30642383..30646387
RNA-Seq Expression: Lag0015975
Synteny: Lag0015975
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR002156 - Ribonuclease H domain
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036397 - Ribonuclease H superfamily
    IPR043502 - DNA/RNA polymerase superfamily
    IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]; e value: 2.6e-214; % identity: 37.84
Query:  FEEGWTEFEECKEIVKNHWRS----SRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG----SGKGGGSLIVAERELEKLLEE
        +E+ W+ +E C  IV++ W S    S  + ++ FQ      +  LK W++   +G  K    K+ E I +L            G  +   E ++  +L +
Subjt:  FEEGWTEFEECKEIVKNHWRS----SRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG----SGKGGGSLIVAERELEKLLEE

Query:  EEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFS
        EE YWK RSR DWLK GD+NTK+FHSKAS RR++N I  + ++ G+W+ D + I      +F+ LF SSNP+   + E L+    K+S++    +EEPF+
Subjt:  EEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFS

Query:  VAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVL
          +I  AL  M P KAPGPDG  A FFQK+W    E L  TCL +LNE   +  +N++FIALIPK +KP+K+ EF+PISLCNV+Y+++AK +ANRLK +L
Subjt:  VAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVL

Query:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP
          IIS  QSAFI  RLI+DNV++G+EC+H I   +  + GL ALKLD+SKAYDRVEW ++ Q M ++ F   WI LIM C+ +  + VLING P  +  P
Subjt:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP

Query:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISE
        +RG+RQG PLSPYLFI+C E  S LLN+ E    I G+K  +   TITHL FADDSLVF KAS  +C  +K +   Y  ASGQ  N  KS    S   S 
Subjt:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISE

Query:  ATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGS
           + I     +        YLGLP   GRNK   F+ ++ +V   + +W  K FS GGKEILIK +AQA+P Y MS F++PK +C +I +  A FWWG+
Subjt:  ATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGS

Query:  FQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGND
         +DK   HW  W +M  +K +GG+GFRDL  FNQA++AK  WR++R PNSLMA++++ +Y+ NSTF  AK   + S +W+SILWG  +  KG +W++G+ 
Subjt:  FQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGND

Query:  KQVYIDEDPWLMHNNRWKPLRVKDILKGKRVSEILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVA
        K+V + +D W+     ++P+  K +     V+++++ +  W+ + +++ F+  D E IL +   +    DE++W  DKKG +SVKS Y LA+ N NF   
Subjt:  KQVYIDEDPWLMHNNRWKPLRVKDILKGKRVSEILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVA

Query:  SGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCK--------------------------VLE
          S  ++SS +WK  W L    K+KI +W+A+ N LPT +N+ K        C   + Q E+V H+   CK                          + E
Subjt:  SGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCK--------------------------VLE

Query:  ETHQSVTSETR----------------------------------------------SSHGG---------WNPPEAHQWKLNVDATWFDEAEVGGVGWI
           +S T+E                                                + HG          W PP  +  KLNVDA    + +  G+G I
Subjt:  ETHQSVTSETR----------------------------------------------SSHGG---------WNPPEAHQWKLNVDATWFDEAEVGGVGWI

Query:  IRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPEGFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRS
        +RD+ G ++  G K+   +  +++ EA  +  GL Q+A +  S        L+VESD  E+V+L+N      +EI  ++ ++   +   K V F F PR+
Subjt:  IRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPEGFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRS

Query:  TNFLAHSLVR
         N  AH+L +
Subjt:  TNFLAHSLVR

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]; e value: 5.7e-206; % identity: 43.31
Query:  SLKNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASI-RSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAER-
        S+++ PR   K   FE  WT+ E+CK I++  W      S        +  C  +L KW+  T+ G I   I  K   +  L+     +     I   R 
Subjt:  SLKNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASI-RSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAER-

Query:  ELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQK
        E+  LL++EE YW  R++  WLK GDRNTK+FH++AS+RRK+N I  I +E G W  + + I + A  YF N++ SS+P+   ++E+ ++   K++E+  
Subjt:  ELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQK

Query:  KVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVL
        + +   F+  E+ VALK + P KAPGPDG  A+FFQKYW     ++    L VLN    +  +N + I+LIPKT  PK+M +F+PISLCNV+YK+I+K+L
Subjt:  KVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVL

Query:  ANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLING
        ANRLK +L  IIS  QSAF   RLI+DNV+V FE +H ++++ +GK G  A+KLDMSKA+DRVEW +I ++M  M F   W  L+M C+ SVSY +LING
Subjt:  ANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLING

Query:  IPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLC
        +      P RG+RQGDPLSP LF++C EGLSAL+N+      I+GI IN+ CP +THLFFADDS++F KA+ +EC  ++ +L  YE ASGQKIN +KS  
Subjt:  IPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLC

Query:  MISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRI
          S N ++ T  EI   LG  Q++    YLGLP+  GR+K  +F  L+E+V   L  WK K  S+GGKEILIK +AQAIP YTMSCF +P+ +C ++ R+
Subjt:  MISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRI

Query:  CANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKG
          NFWWG    + K  W SWK MC SK  GG+GFR+L+ FN AMLAK +WRIL  PNSL+ ++L+ +YF     L AK   S S  W+SI     +  +G
Subjt:  CANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKG

Query:  YKWKVGNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSEILNEDGS-WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHL
         +W+VGN KQ++I ED WL   + +K +  +    +   VS +++ D   WK E ++  FLP + ETIL +P       D++IW  +KKG FSVKSAYH+
Subjt:  YKWKVGNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSEILNEDGS-WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHL

Query:  A---VNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKV--LEETHQSVTSETR
        A   ++ +     S  DP    ++WK +W L    KIKI  W+A  + LPT  NI K G+  +S C +     E V H    C+   L     S   ET 
Subjt:  A---VNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKV--LEETHQSVTSETR

Query:  SSHGG
         SH G
Subjt:  SSHGG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]; e value: 1.5e-219; % identity: 45.19
Query:  KKPIRFEEGWTEFEECKEIVKNHWRSSRRA-SIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLLEEE
        ++  +FE  WT  E+CK+I++  W SS    S R    ++  C   L +WN++ + G+I   I +K+E +  L +   +G  GG + +  +E+ +LL+ E
Subjt:  KKPIRFEEGWTEFEECKEIVKNHWRSSRRA-SIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLLEEE

Query:  EKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSV
        E  W+ RSR  WL  GDRNTK+FH+KAS RR+RN I+ I++E+G+W   ++ I + A  YF+ ++ SS P    + E+L +  + ++E+    + + F+ 
Subjt:  EKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSV

Query:  AEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLE
         EIE AL  M P KAPGPDG  A+FFQKYW     D+V   L+VLN    M  IN + I L+PK K P KM +F+PISLCNV+YK+I+KVLANRLK +L 
Subjt:  AEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLE

Query:  TIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPK
         IIS  QSAF+ GRLI+DNV+V FE +H + +++ GK G AA+KLDMSKAYDRVEW +I+Q+M  M F + WIKL+M C+ SVSY +L+NG      TP 
Subjt:  TIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPK

Query:  RGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEA
        RG+RQGDP+SPY+F++C +G S+LLN       ISG+ I + CP ITHLFFADDSL+F KA+ +EC  +  +L+ YE ASGQKIN++KS    S N  + 
Subjt:  RGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEA

Query:  TAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSF
           E+ R LG  Q      YLGLP+  G++K  +F  ++ERV + L  WKEK  SVGG+EILIK +AQAIP YTMSCF+IPK +C EI  +   FWWG  
Subjt:  TAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSF

Query:  QDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDK
          + K  W SWK +C +K  GGMGFR+LQ FN AMLAK  WR++  PNSL+A+I + +Y+ +    +AK   S S  W+SI  G  +  +G +W+VGN +
Subjt:  QDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDK

Query:  QVYIDEDPWLMHNNRWKPLR-VKDILKGKRVSEILN-EDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVN-NSNFQ
        ++ I ED WL     +K +   K      RVS +++ E   WK++++++ FLP +A TILS+P  +    D+IIW  ++KG FSVKSAY++AV    N +
Subjt:  QVYIDEDPWLMHNNRWKPLR-VKDILKGKRVSEILN-EDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVN-NSNFQ

Query:  VASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKV
        V   S   + S++W+ +W L   PK++I  WK   NALPT  N+ + GV+    C     + ES  HIF  C+V
Subjt:  VASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKV

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]; e value: 1.0e-202; % identity: 43.22
Query:  RNCKKPIRFEEGWTEFEECKEIVKNHWRSSR-RASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLL
        R  K+   FE  WT+ E+C ++++  W S     +       +  C   L  WNQ  + G+I   I +K   +  ++     G  G ++    +EL  LL
Subjt:  RNCKKPIRFEEGWTEFEECKEIVKNHWRSSR-RASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLL

Query:  EEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSK-VIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEE
        + EE  W+ RS+  W + GDRNTK+FH++AS+RRK+N+I R+ NEDG W CDSK  I   A  YF+N++ SS+P    + E++ +   +++++    + +
Subjt:  EEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSK-VIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEE

Query:  PFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLK
         F+  E+  ALK + P KAPGPDG  A FF  YW      + +  L VLN    M  IN + I+LIPKT +P +M EF+PISLCN  YK+I+KVLANR K
Subjt:  PFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLK

Query:  QVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEV
         +L  IIS  QSAF   RLI+DNV+V FE +H +N++  GK    ++KLDMSKA+DRVEW +I+ +M  + F + WI LIM+CV SVSY VLING     
Subjt:  QVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEV

Query:  FTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKN
         TP RGIRQGDPLSP LF++C EGLSAL++       I+GI I + CP ITHLFFADDSL+F KA  +EC  +  +L  YE ASGQKIN +KS    S N
Subjt:  FTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKN

Query:  ISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFW
         S+     I   LG  Q +    YLGLP+  G++K  +F  +++RV K L  WK K  S+GG+EILIK +AQA+P YTMSCF++PK +C ++  +  +FW
Subjt:  ISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFW

Query:  WGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKV
        WG    + K  W SW+ MC SK  GGMGFR++Q FN AMLAK  WRIL  PNSLMA++ + KYF     L +K   + S  W+SI     +  KG +W+V
Subjt:  WGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKV

Query:  GNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSEILNED-GSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNS
        GN ++++I +D WL      K +  + D      VS +++ D   WK ++I   FLP +A TIL +P       D +IW  +K+G F+VKSAY++A +  
Subjt:  GNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSEILNED-GSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNS

Query:  NFQVASGSDPSNS-SIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCK
        +      S   NS S++WK IW+L+  PKIKI  W+   N LPT+QN+   GV  +S C L     E++ H   +C+
Subjt:  NFQVASGSDPSNS-SIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCK

XP_030924668.1 uncharacterized protein LOC115951644 [Quercus lobata]; e value: 1.3e-205; % identity: 36.97
Query:  KPIRFEEGWTEFEECKEIVKNHW-RSSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQ-QLSNYGSGKGGGSLIVAERELEKLLEEEE
        +P RFE  W     C+E+V + W +   +A+   F  K+ +C   L+ WN+  + G ++ ++ KK E+++ +  N G  K    +     E++KL  +EE
Subjt:  KPIRFEEGWTEFEECKEIVKNHW-RSSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQ-QLSNYGSGKGGGSLIVAERELEKLLEEEE

Query:  KYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVA
          WK RSR  WLK GDRNTK+FH +A+QR +RN I  + +E G+W+ D   +G+    YF+ +F SSNP+      IL S      ED +  +E  F   
Subjt:  KYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVA

Query:  EIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLET
        E++ AL +M+P  APGPDG   +F++ +W    ED+    L  LN G     +N +FI LIPK K PKK+ +F+PISLCNV+YK+IAKV+ANRLK+ L  
Subjt:  EIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLET

Query:  IISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKR
         + ++QSAF+ GRLISDN+++ FE +H +  +  GK G  ALKLDMSKAYDRVEW ++  +M ++   +   ++I+ C++SVSY +L+NG P     P R
Subjt:  IISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKR

Query:  GIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEAT
        G+RQGDPLSPYLF++C  GL  LL + E   +I G+ I+++ P ++HLFFADDS++F +A+  EC  +  +L TYE  +GQKIN  K+    S N    T
Subjt:  GIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEAT

Query:  AAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQ
         + I + LGV        YLGLPA  GR K   F  L+ERVWK +Q WKEK  S+ G+E+LIK + QAIP YTMSCF++PK +  E+  +   FWWG   
Subjt:  AAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQ

Query:  DKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQ
          +K HW SW+ +C +K+ GGMGF++++ FN+A+LAK  WR+++ P SL  ++ + ++F N + L AK   S S  WKSIL  R +  KG  W++G+ + 
Subjt:  DKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQ

Query:  VYIDEDPWL-MHNNRWKPLRVKDILKGKRVSEILNEDGS-WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVA
        V I ED WL +  +R     +  ++   RVS ++N D S WK + +   FLP +A  +L +P  +R  +D I W     G FS  SAY L    S+   A
Subjt:  VYIDEDPWL-MHNNRWKPLRVKDILKGKRVSEILNEDGS-WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVA

Query:  SGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNS------------------CCFLFRCQEESVEHIF-------WNCK----
        S S+       WK IWKL+   KIK  +W+  NNALPT  N+++  +  +                   CC +   ++E    +F       WN +    
Subjt:  SGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNS------------------CCFLFRCQEESVEHIF-------WNCK----

Query:  -----------------VLEETHQSVTSETRS----SHGGWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMI
                         +L++   S  SET      +   W PPE    K+N DA  F+ +   G+G I+RD  G  +GA          +  +EA   +
Subjt:  -----------------VLEETHQSVTSETRS----SHGGWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMI

Query:  EGLKQIALKACSYPEGFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVRDVA
          ++  A +          ++++E D+  I+  I +    LS     + ++L L P    + F    RS N +A SL +  A
Subjt:  EGLKQIALKACSYPEGFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVRDVA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FFZ2 Reverse transcriptase domain-containing protein; e value: 3.2e-210; % identity: 37.14
Query:  KKPIRFEEGWTEFEECKEIVKNHWRSS-RRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGS--GKGGGSLIVAERELEKLLEE
        +KP RFEE WT  + C+ ++++ W+       + +   K+ +C   L+ W++ T  G+I + I K+ E + +++   S  G+    +   +REL  LL +
Subjt:  KKPIRFEEGWTEFEECKEIVKNHWRSS-RRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGS--GKGGGSLIVAERELEKLLEE

Query:  EEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFS
        EE+ W+ RSR +WL  GDRNT++FH +A+QR+++N++ R+  +DG W      +     +Y+K+LF+++  NP+ V+++++     ++ +    +   F+
Subjt:  EEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFS

Query:  VAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVL
          E+E+ALK M+P KAPGPD    +F+QKYW     D+    L  LN G+ +  IN++ I LIPK + P+++ EF+PISLCNVIYK+I+KVLANRLK +L
Subjt:  VAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVL

Query:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP
         +I+  +QSAFI GRLI+DN++V FE +H + ++++GK G  ALKLDMSKAYDRVEW+Y++ +M  M F   W+ L+M+C+ +VSY +L+NG P     P
Subjt:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP

Query:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISE
         RG+RQGDPLSPYLF++C EGL +L+ +E+    + G+ I++  P ITHLFFADDSL+F KA+  + + I+ +L  YE ASGQ++N  K+    SK+   
Subjt:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISE

Query:  ATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGS
        A   +I   LGV        YLGLP+  GR K   F +++ERVW  L+ WKEK  S  G+EILIK++AQAIP Y MSCFR+P  +  EI  +   FWWG 
Subjt:  ATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGS

Query:  FQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGND
          DK K HW  W+ +C SK  GGMG RDL  FN+A+LAK  WR+L  P+SL +K+ + KYF + + L+A+ +   S  WKSI+  R L  KG  W+VG  
Subjt:  FQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGND

Query:  KQVYIDEDPWL--MHNNRWKPLRVKDILKGKRVSEILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQ
          + I  D WL   H++         I        I +E  SWK EL++  FLP +A  IL +P   R   D ++W A K G ++V+S YHL +N  +  
Subjt:  KQVYIDEDPWL--MHNNRWKPLRVKDILKGKRVSEILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQ

Query:  VASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEETHQSV------------------
          S SD +  + +W +IW L   PKI+  LW+A +N+LPT  N+    +  +  C     Q ES  H  W CK ++   QS+                  
Subjt:  VASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEETHQSV------------------

Query:  -----------------------------------TSETRSSHGGWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLE
                                            + T+S+   W PPE  ++K+N D   F E    GVG IIR+  G ++G+   +I     +  +E
Subjt:  -----------------------------------TSETRSSHGGWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLE

Query:  ATTMIEGLKQIALKACSYPEGFDHELV-VESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR
        A+         A  A  + +     LV +E D+  IV+ +  +    +    +I +I   A   + V+F+   R  N +AH L +
Subjt:  ATTMIEGLKQIALKACSYPEGFDHELV-VESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR

A0A2N9GIC4 Reverse transcriptase domain-containing protein; e value: 7.8e-209; % identity: 37.81
Query:  KNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQ--GKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQL-SNYGSGKGGGSLIVAERE
        KN     ++P RFEE W     C++ +   W S+ R     FQ   K+  C F+L  W++    G+I   I + + E++Q  ++   G     L    ++
Subjt:  KNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQ--GKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQL-SNYGSGKGGGSLIVAERE

Query:  LEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKK
        L  L E+EEK W+ RSR  WL  GDRNTK+FHS+A+QR +RN+I  + ++   W   +  +     KY+ +LF +S  +P+ + E++      ++ED  K
Subjt:  LEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKK

Query:  VMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLA
         +   F+  E+E+ALK M+P KAPGPDG   +F+QK+W     D+    L  LN G+ +  IN++FI+LIPKTK P+++ EF+PISLCNVIYK+I+KVLA
Subjt:  VMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLA

Query:  NRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGI
        NRLK +L  ++S++QSAF+ GRLI+DNV+V FE +H +++ + G+ G  ALKLDMSKAYDRVEW ++ ++M  M F + WI ++++C+ +VSY +L+NG 
Subjt:  NRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGI

Query:  PQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCM
        P     P RG+RQGDPLSPYLF++C EGL +L+ +     +I GI + ++ P I+HLFFADDSL+F KA+   C  I+ +L  YE ASGQ++N +K+   
Subjt:  PQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCM

Query:  ISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRIC
         SKN  EA+   I   L V        YLGLP+  GRN+   F +++ERVW+ L+ WKEK  S  G+EILIK +AQAIP Y+MSCF++P  +C E+  + 
Subjt:  ISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRIC

Query:  ANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGY
          FWW +  + RK HW +W+ +C  K KGGMGFRD++ FN A+LAK  WR+L + +SL  ++ + K+F + + L        S  W+SIL  R +  KG 
Subjt:  ANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGY

Query:  KWKVGNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSE-ILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLA
         W+VGN + + I +  W +  +  K +     IL+   V + I+    +W   LI   F   DAE I  +P  N    D+IIW ++  G ++V+S Y   
Subjt:  KWKVGNDKQVYIDEDPWLMHNNRWKPLRVK-DILKGKRVSE-ILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLA

Query:  VNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEET-HQSVTSETRSSHGG
        V  ++  +   S P+    IWKSIW L+   K ++  WKA   ALPT  N+QK  +   + C +   ++E   H  W+CK L+    + V +    S GG
Subjt:  VNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEET-HQSVTSETRSSHGG

Query:  ----------------WNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTM---IEGLKQIALKACSYPEGFDHEL
                        W P   H++K+N D   F E    G+G I+RDS+  ++ +  +K+     I  +EA  +   I+ + +I L    +        
Subjt:  ----------------WNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTM---IEGLKQIALKACSYPEGFDHEL

Query:  VVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR
          E D+  IV  +N     L+    LI +   LA   +  +F    R  N LAH+L R
Subjt:  VVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR

A0A2N9GQ35 Reverse transcriptase domain-containing protein; e value: 8.6e-208; % identity: 41.32
Query:  KKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQ--GKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAERELEKLLEEE
        KK  RFE  WT+ E+C+ ++   W    R   R F+   K+  C   L  W+Q    GS+ A+I  K E++Q  +N         L+  + EL  LLE+E
Subjt:  KKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQ--GKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAERELEKLLEEE

Query:  EKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSV
        E +W+ RSR  W+  GD+NTK+FH+  +QRR+ N I  + ++D  W  +   I + A  YF+N+F SS P  + +   L+   S ++ D    +   F+ 
Subjt:  EKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSV

Query:  AEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLE
         E+  AL+ M P KAPGPDG  A+F+Q YW     ++    L +++ G  +  IN + IAL+PK   P+K+ +F+PI+LCNVIYK+I+KVLANRLK++L 
Subjt:  AEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLE

Query:  TIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPK
         I+S +QSAF+ GRLI+DNV+V FE +H+++ +R+G+ G  ALKLDMSKAYDRVEW ++  +M  + F + WI LIM C++SVSY VLING     FT  
Subjt:  TIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPK

Query:  RGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEA
        RGIRQGD LSPYLF++C EGLS LL + E+   I+G+  ++  P +THLFFADDSL+F +A+   C  +  +L+ YE ASGQ++N  K+    +KN +  
Subjt:  RGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEA

Query:  TAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSF
           +I     V +  S   YLGLP+  GR+K + F  ++ RVW+ +  WKEKF S  G+EIL+K +AQ+IP YTMSCF++P+++C ++N + +NFWWG  
Subjt:  TAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSF

Query:  QDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDK
           RK HW  W  MC SK  GG+GFRD+++FN+A+LAK  WR ++  NSL++++ + KYF + +FL+AK +   S  W+S++  R +   G +W +GN  
Subjt:  QDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDK

Query:  QVYIDEDPWLMHNNRWKP-LRVKDILKGKRVSEILNEDG-SWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQV
         V I EDPWL  +   +P L ++++   +RVS ++N +G  WK E ++  F   +   I S+P   RA ND + W A K G F+V+SAYH+ V +     
Subjt:  QVYIDEDPWLMHNNRWKP-LRVKDILKGKRVSEILNEDG-SWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQV

Query:  ASGSDPSNSSI-----IWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEE
        A+  D  NS +      WK +W L   PK+K  LW+A    LPT   + +  +  +  C     +EESV HI W+C V  +
Subjt:  ASGSDPSNSSI-----IWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEE

A0A7N2LIH6 Uncharacterized protein; e value: 1.3e-208; % identity: 36.45
Query:  LSLKNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAER
        L+  N  R  KK   FEE WT  EECKEIV+  W   R  S    Q ++  C   L++WNQ +  G++   I +K+  +QQL +     +    +   ++
Subjt:  LSLKNGPRNCKKPIRFEEGWTEFEECKEIVKNHWRSSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAER

Query:  ELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQK
        E+ +L   EE  WK RSR  WL++GD+N+K+FH+ ASQRR++N I  ++++ G W  D +   +    YFK+++ S+ P    V   L++   +++ +  
Subjt:  ELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQK

Query:  KVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVL
          +++ F   E+  AL+ M P KAPGPDG   +F+QKYW      + +  L+ LN G     IN ++I LIPKTK P+K+ EF+PISLCNVIYK+I+KVL
Subjt:  KVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVL

Query:  ANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLING
        ANRLK+VL  +I   QSAF+ GR+I+DNV+V FE +H+IN RR GK GL A+KLDMSKAYDRVEW Y+  +M  M F   WI LIM CV SVS+ VLING
Subjt:  ANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLING

Query:  IPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLC
         P+  FTP RG+RQGDP+SPYLF++C EGLSA++ ++E    I G+   +  P I+HLFFADDS++F +A+  EC  + +VL+ YE  SGQK+N +K+  
Subjt:  IPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLC

Query:  MISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRI
          S+N  +          G         YLGLP   GR K   F R++++V + +  WK K  S  G+E+LIK +AQA P YTM+ F++P ++C E+N +
Subjt:  MISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRI

Query:  CANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKG
          +FWWG    ++K  W SWKN+C  K  GGMGF+DL+ FN A+LAK  WR+ + PNSL  ++L+ KYF+NS+F++A+     S +W+SI+  + +  +G
Subjt:  CANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKG

Query:  YKWKVGNDKQVYIDEDPWLMHNNRWKPLRVKD-ILKGKRVSEILNED-GSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHL
         +W VG+ + + I +  WL      K +  +   ++G+RV+ +++++ G WK  L+Q+ F+P +AE ILS+P  +  L D ++W     GCF+VKSAY  
Subjt:  YKWKVGNDKQVYIDEDPWLMHNNRWKPLRVKD-ILKGKRVSEILNED-GSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHL

Query:  AVN-----NSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEET---------
        A                SD S  S IWK+IW L+   KIK  LW+A    LPT + +    +  + CC  F  + E+  H  WNC V +E          
Subjt:  AVN-----NSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEET---------

Query:  -------------------------------------------------HQSVTSETR--------------------SSHGGWNPPEAHQWKLNVDATW
                                                          +S+T E R                      H  W+PP    +K+NVDA  
Subjt:  -------------------------------------------------HQSVTSETR--------------------SSHGGWNPPEAHQWKLNVDATW

Query:  FDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLK---QIALKACSYPEGFDHELVVESDASEIVKLIN--RETVDLSEILTLIGEIL
        F E    G+G +IR++ G ++GA  KK+   L     EA     G+     + LK           +VVE DA  +++ +        + +I+      L
Subjt:  FDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLK---QIALKACSYPEGFDHELVVESDASEIVKLIN--RETVDLSEILTLIGEIL

Query:  ALAPPAKVVNFIFSPRSTNFLAHSLVRDVAKFGDFLFFCFDPSP
         +    K V+   + R  N  AH L R+     D++ +  +  P
Subjt:  ALAPPAKVVNFIFSPRSTNFLAHSLVRDVAKFGDFLFFCFDPSP

M5VU98 Reverse transcriptase domain-containing protein; e value: 1.1e-207; % identity: 37.59
Query:  FEEGWTEFEECKEIVKNHWRS-SRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLLEEEEKYWK
        FE  WT   +C++ +K  W S      +     K+    + L++W++ T  G IK        ++  L     S +      V ++ L++LL + E YW 
Subjt:  FEEGWTEFEECKEIVKNHWRS-SRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYG-SGKGGGSLIVAERELEKLLEEEEKYWK

Query:  LRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVAEIEV
         RSRE+WLK GD+NT +FH KA+ RR+RN I  + + +G W    + I      YF +LF SS  +   ++EIL +   K++ D ++V+   FS  EI+ 
Subjt:  LRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVAEIEV

Query:  ALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISN
        A+  M P KAPGPDG   LF+QKYW    +D+V      L   + +  +N++F+ LIPK K+P+ M + +PISLCNV+Y++ AK LANR+K V++++IS 
Subjt:  ALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISN

Query:  TQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQ
        +QSAF+ GRLI+DN +V FE  H +  RR G+ G  ALKLDMSKAYDRVEWE++ ++M+ M FP  W++++MDCV +VSY  L+NG P  +  P RG+RQ
Subjt:  TQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQ

Query:  GDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEI
        GDPLSPYLF++C EG + LL++ E    + GI I +  PT++HLFFADDS VF KA+   C  +K + + YE ASGQ+IN  KS    S NI   T + +
Subjt:  GDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEI

Query:  SRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRK
        +  LGV + +S   YLGLP   GRNK + FR L+ERVWK LQ W+E+  S+ GKE+L+K +AQ+IP+Y MSCF +P+ +C EI ++ A FWWG   + RK
Subjt:  SRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRK

Query:  KHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYID
         HW  W+ +C +K +GGMGFR LQ FN AMLAK  WR++  P+SL +++L+ KYF  + F +A      S VWKSI   R + + G ++++G+ K V I 
Subjt:  KHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYID

Query:  EDPWLMHNNRWKPLRVK-DILKGKRVSEILNEDGS--WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVN-NSNFQVASG
         D W+     +  +    D ++  +VSE++  +GS  W  + +   FLP+D   I+ +P   RA  D I+W  DK G F+VKSAY +A+   S  +  S 
Subjt:  EDPWLMHNNRWKPLRVK-DILKGKRVSEILNEDGS--WKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVN-NSNFQVASG

Query:  SDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIF---------WNCKVL-EETHQSVTSETRSSHG---
        S  S++ ++W+ IW      K+KI  W+  ++ LPT  N+ K GVD    C       ES  H+          WN  +L    HQ V        G   
Subjt:  SDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIF---------WNCKVL-EETHQSVTSETRSSHG---

Query:  ------------------------GWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPE
                                 W  P + + K N D  +   +  G VG + RD+ G  + A  K +   L     E     EG+        + P 
Subjt:  ------------------------GWNPPEAHQWKLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPE

Query:  GFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR-DVAKFGDFLFFCFDP
              + E D++ +V  I R   D S I T++ ++  L        F F+PR  N +AH L R  +    +F++F   P
Subjt:  GFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVR-DVAKFGDFLFFCFDP

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 1.7e-48 | 25.48
Query:  VEGGKGTMHKCKKLE------SEHS-LKLIIE-KSLLGLRKDGLSLKN-------------------GPRNCKKPIRFEEGWTEFEEC---KEIVKNHWR
        + G K  + KCK+ E      S+HS +KL +  K+L   R     L N                      N  K   ++  W  F+     K I  N + 
Subjt:  VEGGKGTMHKCKKLE------SEHS-LKLIIE-KSLLGLRKDGLSLKN-------------------GPRNCKKPIRFEEGWTEFEEC---KEIVKNHWR

Query:  SSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAERELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKA
          +R   RS    +TS + +L+K  Q   K S +  I K   E+++             I  ++ L+K+ E    +++  ++ D         +      
Subjt:  SSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAERELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKA

Query:  SQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFT-SKISEDQKKVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFF
         ++R++N ID I N+ G    D   I     +Y+K+L+ +   N E +   L ++T  ++++++ + +  P + +EI   + ++  +K+PGPDG  A F+
Subjt:  SQRRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFT-SKISEDQKKVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFF

Query:  QKYWGDTKEDLVHTCLEVLNEGKGMGPINNSF----IALIPKT-KKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISNTQSAFILGRLISDNVV
        Q+Y    KE+LV   L++    +  G + NSF    I LIPK  +   K E F+PISL N+  K++ K+LANR++Q ++ +I + Q  FI G     N+ 
Subjt:  QKYWGDTKEDLVHTCLEVLNEGKGMGPINNSF----IALIPKT-KKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISNTQSAFILGRLISDNVV

Query:  VGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGL
             I  IN  R+       + +D  KA+D+++  ++ + +  +     ++K+I    +  +  +++NG   E F  K G RQG PLSP LF +  E L
Subjt:  VGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGL

Query:  SALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEISRELGVTQSNSIGHYL
        +  + +E+    I GI++ K    +    FADD +V+ +       N+ +++  +   SG KIN+ KS   +  N +  T ++I  EL  T ++    YL
Subjt:  SALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEISRELGVTQSNSIGHYL

Query:  GLPAQSGRNKGIMFRR----LRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSC--FRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMC
        G+  Q  R+   +F+     L + + +    WK    S  G+  ++K       IY  +    ++P     E+ +    F W   + +  K   S KN  
Subjt:  GLPAQSGRNKGIMFRR----LRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSC--FRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMC

Query:  LSKDKGGMGFRDLQLFNQAMLAKISW
             GG+   D +L+ +A + K +W
Subjt:  LSKDKGGMGFRDLQLFNQAMLAKISW

P0C2F6 Putative ribonuclease H protein At1g65750 | 3.5e-33 | 27.17
Query:  LPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGG
        +P    R     F  + ERV   +  W+EK  S  G+  L K +  ++P+++MS   +P++I   ++++   F WGS  +K+K+H   W  +C  K +GG
Subjt:  LPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGG

Query:  MGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKY----FSNSTFLKAKAKGSASIVWKSILWG-RTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWK
        +G R  +  N+A+++K+ WR+L+E NSL   +L+ KY      +S +L    KGS S  W+SI  G R +   G  W  G+ +Q+    D W+      K
Subjt:  MGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKY----FSNSTFLKAKAKGSASIVWKSILWG-RTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWK

Query:  PLRVKDILKGKRVSE---ILNED-----GSWKEELIQEAFLPIDAETILSMPKRNRAL--------NDEIIWGADKKGCFSVKSAYHLAVNNSNFQVASG
        PL   D   G+R ++   ++ +D       W        F  ID  T  +     RA+         D + W   + G FSV+SAY +            
Subjt:  PLRVKDILKGKRVSE---ILNED-----GSWKEELIQEAFLPIDAETILSMPKRNRAL--------NDEIIWGADKKGCFSVKSAYHLAVNNSNFQVASG

Query:  SDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNC
            N +  +  +WK+R   ++K  LW   N A+ T +   +  +  ++ C + +   ES+ H+  +C
Subjt:  SDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNC

P11369 LINE-1 retrotransposable element ORF2 protein | 1.4e-45 | 27
Query:  DRNTKWFHSKASQ-----------RRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSF-TSKISEDQKKVMEEPFSVAEIE
        ++   WF  K ++            R +  I++I NE G    D + I      ++K L+ +   N + + + L  +   K+++DQ   +  P S  EIE
Subjt:  DRNTKWFHSKASQ-----------RRKRNNIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSF-TSKISEDQKKVMEEPFSVAEIE

Query:  VALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSF----IALIPKTKK-PKKMEEFKPISLCNVIYKVIAKVLANRLKQVL
          + ++  +K+PGPDG  A F+Q +    KEDL+    ++ ++ +  G + NSF    I LIPK +K P K+E F+PISL N+  K++ K+LANR+++ +
Subjt:  VALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKGMGPINNSF----IALIPKTKK-PKKMEEFKPISLCNVIYKVIAKVLANRLKQVL

Query:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP
        + II   Q  FI G     N+      IH IN  +     +  + LD  KA+D+++  ++ +++        ++ +I          + +NG   E    
Subjt:  ETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTP

Query:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMI------
        K G RQG PLSPYLF +  E L+  + +++    I GI+I K    I+ L  ADD +V+    +     +  ++ ++    G KIN NKS+  +      
Subjt:  KRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMI------

Query:  -SKNISEATAAEI----SRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSC--FRIPKNICG
          K I E T   I     + LGVT +  +           +N    F+ L++ + + L+ WK+   S  G+  ++K       IY  +    +IP     
Subjt:  -SKNISEATAAEI----SRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSC--FRIPKNICG

Query:  EINRICANFWWGSFQDKRKKHWFSWKNMCLSKDK---GGMGFRDLQLFNQAMLAKISWRILRE
        E+      F W + + +  K         L KDK   GG+   DL+L+ +A++ K +W   R+
Subjt:  EINRICANFWWGSFQDKRKKHWFSWKNMCLSKDK---GGMGFRDLQLFNQAMLAKISWRILRE

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.0e-40 | 25.65
Query:  TLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSL----IVAERELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDS
        ++ G   A I     E+  L    SG    +L    +  +  L  + + + +   +RSR   L   DR +++F++   ++  R  I  +  EDG+ + D 
Subjt:  TLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSL----IVAERELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRNNIDRILNEDGSWICDS

Query:  KVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKG
        + I + A  +++NLF     +P+A +E+       +SE +K+ +E P ++ E+  AL+ M   K+PG DG    FFQ +W     D      E   +G+ 
Subjt:  KVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEVLNEGKG

Query:  MGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKA
              + ++L+PK    + ++ ++P+SL +  YK++AK ++ RLK VL  +I   QS  + GR I DNV +  + +H    RR+G   LA L LD  KA
Subjt:  MGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKA

Query:  YDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLF
        +DRV+ +Y+   +   +F   ++  +     S    V IN          RG+RQG PLS  L+ +  E    LL +      ++G+ + +    +    
Subjt:  YDRVEWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLF

Query:  FADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNK-GIMFRRLRERVWKALQNW
        +ADD ++       +    +   + Y  AS  +IN +KS  ++  ++         R+  ++  + I  YLG+   +        F  L E V   L  W
Subjt:  FADDSLVFFKASRKECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNK-GIMFRRLRERVWKALQNW

Query:  K--EKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMG
        K   K  S+ G+ ++I  +  +   Y + C    +    +I R   +F W        KHW S     L   +GG G
Subjt:  K--EKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMG

P93295 Uncharacterized mitochondrial protein AtMg00310 | 7.4e-31 | 40.13
Query:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSK-DKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLK
        A+P+Y MSCFR+ K +C ++      FWW S ++KRK  W +W+ +C SK D GG+GFRDL  FNQA+LAK S+RI+ +P++L++++LR +YF +S+ ++
Subjt:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSK-DKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLK

Query:  AKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL
               S  W+SI+ GR L  +G    +G+     +  D W+M      PL
Subjt:  AKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL

Arabidopsis top hits | e value | % identity | Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein | 1.3e-17 | 24.77
Query:  LRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPLRVKDILKGKRVSEILNEDGS---WKEELIQEAFLP
        ++ +YF + + L AK +   S  W S+L G  L  KG +  +G+ + + I  D  ++ ++  +PL  ++  K   ++ +    GS   W +  I +    
Subjt:  LRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPLRVKDILKGKRVSEILNEDGS---WKEELIQEAFLP

Query:  IDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSC
         D   I  +        D+IIW  +  G ++V+S Y L  ++ +  + + + P  S  +   IW L   PK+K  LW+A++ AL T + +   G+  +  
Subjt:  IDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSC

Query:  CFLFRCQEESVEHIFWNC
        C     + ES+ H  + C
Subjt:  CFLFRCQEESVEHIFWNC

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.1e-18 | 23.48
Query:  YLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKD
        YLGLP  + +     +  L E++   +  W  +  S  G+  LI ++  ++  + MS FR+P     EI+ IC++F W   +   KK   +W ++C  KD
Subjt:  YLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKD

Query:  KGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL
        +GG+G R L+  N+     IS                     N+T          S +WK IL  R L     K  + N        D W       K  
Subjt:  KGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL

Query:  RVKDILKGKRVSEI-LNEDGSWKEELIQEAFLPIDAETILSMPK-----RNRAL---NDEIIW---GADKKGCFSVKSAYHLAVNNSNFQVASGSDPSNS
        R+ D+   +   ++ +    S  E ++         +T+L +       R++ L    D + W   G   K CF+ K  +           A+  +P   
Subjt:  RVKDILKGKRVSEI-LNEDGSWKEELIQEAFLPIDAETILSMPK-----RNRAL---NDEIIW---GADKKGCFSVKSAYHLAVNNSNFQVASGSDPSNS

Query:  SIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNC
           +K +W   + PK  +L W A+ N L T   +       +S C L     E+ +H+F+ C
Subjt:  SIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNC

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.3e-54 | 26.12
Query:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKA
        A+P YTM+CF +PK +C +I  + A+FWW + Q+ +  HW +W ++   K +GG+GF+D++ FN A+L K  WR+L  P SLMAK+ + +YF  S  L A
Subjt:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKA

Query:  KAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWL--------MHNNRWKPLRVKDILKGKRVSEILNEDG-SWKEELIQEAFLPIDAETILS
              S VWKSI   + +  +G +  VGN + + I    WL        +   R  P     +    +VS++++E G  W++++I+  F  ++ + I  
Subjt:  KAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWL--------MHNNRWKPLRVKDILKGKRVSEILNEDG-SWKEELIQEAFLPIDAETILS

Query:  MPKRNRALNDEIIWGADKKGCFSVKSAYHL--AVNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRC
        +    R + D   W     G ++VKS Y +   + N        S+PS +  I++ IWK ++ PKI+  LWK ++N+LP    +    +   S C     
Subjt:  MPKRNRALNDEIIWGADKKGCFSVKSAYHL--AVNNSNFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRC

Query:  QEESVEHIFWNCKV-------------------------------------------------------------------------------LEETHQS
         +E+V H+ + C                                                                                 LEE    
Subjt:  QEESVEHIFWNCKV-------------------------------------------------------------------------------LEETHQS

Query:  VTSET--------RSSHGGWNPPEAHQW-KLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPEGFDHE
          +E+        RSS G W PP  HQW K N DATW  + E  G+GW++R+  G +   G + +  KL+ ++LEA   +E ++   L    +   +   
Subjt:  VTSET--------RSSHGGWNPPEAHQW-KLNVDATWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPEGFDHE

Query:  LVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVRDVAKFGDFLFFCFDPSPFS
        ++ ESD+  +++++N + +    +   I ++  L      V F+F PR  N LA  + R+        F  +DP  +S
Subjt:  LVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIFSPRSTNFLAHSLVRDVAKFGDFLFFCFDPSPFS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 5.2e-32 | 40.13
Query:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSK-DKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLK
        A+P+Y MSCFR+ K +C ++      FWW S ++KRK  W +W+ +C SK D GG+GFRDL  FNQA+LAK S+RI+ +P++L++++LR +YF +S+ ++
Subjt:  AIPIYTMSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSK-DKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLK

Query:  AKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL
               S  W+SI+ GR L  +G    +G+     +  D W+M      PL
Subjt:  AKAKGSASIVWKSILWGRTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.6e-17 | 55.88
Query:  LINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDS
        +ING PQ + TP RG+RQGDPLSPYLFI+CTE LS L  R +    + GI+++ + P I HL FADD+
Subjt:  LINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDS


Sequences
CDS sequence
ATGTTTCCTCCGTCTGGCTTCAAGCAGCGGTTTGCTGGTCTGTTGCGACAGTGCGACTTGACGGGGCAACGCGGTGGATTTGGTGGCGGAGCAGTGTTCGGTGGTGGCAA
GAGTAGAGGGGGAGAGATTGTCGAAGGGGGGAAGGGGACTATGCATAAATGCAAAAAATTAGAGTCTGAGCATTCTTTAAAGCTAATCATCGAGAAGAGCCTATTAGGTT
TGAGGAAGGATGGACTGAGTTTGAAGAATGGACCAAGGAACTGTAAAAAGCCTATTAGGTTTGAGGAAGGATGGACTGAGTTTGAAGAATGTAAAGAGATTGTTAAAAAC
CATTGGAGATCATCGAGAAGAGCCTCCATTAGGTCTTTTCAAGGTAAGGTCACCAGCTGCATCTTTAAACTCAAAAAGTGGAACCAGATTACACTCAAAGGGTCTATAAA
GGCAGCCATTGCAAAGAAGGAGGAAGAAATTCAGCAGTTATCAAATTATGGTAGTGGTAAAGGTGGAGGAAGCCTGATTGTTGCAGAAAGGGAGCTTGAGAAGTTGCTCG
AGGAGGAGGAAAAATATTGGAAGTTAAGGTCTAGGGAGGATTGGCTTAAGTGGGGGGATAGGAACACAAAATGGTTCCATTCCAAAGCTAGCCAAAGGAGAAAAAGAAAC
AATATTGATCGAATCCTCAATGAAGATGGTTCATGGATCTGTGACAGCAAAGTTATAGGGGAAGGAGCCACAAAATACTTCAAGAACCTGTTTGAATCTTCTAACCCAAA
CCCTGAAGCAGTAAAGGAGATCTTGCAAAGCTTTACTAGCAAAATCTCGGAAGACCAAAAGAAGGTAATGGAGGAGCCTTTTTCTGTAGCAGAGATTGAAGTGGCTTTAA
AAAACATGAGTCCGAGAAAGGCCCCAGGTCCTGATGGGGCCCACGCTCTGTTTTTTCAGAAGTATTGGGGAGATACAAAGGAGGATCTTGTGCATACCTGTCTGGAAGTG
CTCAATGAAGGCAAAGGGATGGGTCCGATTAACAACTCTTTCATTGCCCTTATACCAAAGACAAAGAAGCCGAAAAAAATGGAAGAATTCAAGCCTATCAGCCTTTGTAA
TGTGATCTACAAGGTGATTGCGAAGGTGTTGGCTAACAGGTTAAAGCAGGTCCTTGAGACTATCATATCAAATACTCAATCGGCTTTCATTCTGGGGAGACTCATTTCAG
ATAACGTGGTAGTGGGTTTTGAATGCATCCACGCTATAAACAATAGAAGATCGGGGAAAGCGGGCCTTGCAGCTCTTAAGCTCGATATGAGCAAGGCTTACGATCGTGTC
GAATGGGAGTACATCAGGCAATTGATGATTCACATGAATTTCCCTCAGAACTGGATCAAGCTTATCATGGATTGCGTGGAATCGGTTAGCTATCAAGTTCTTATTAATGG
TATCCCCCAGGAAGTTTTCACCCCCAAAAGAGGTATTCGTCAAGGAGATCCGCTTTCACCCTATTTGTTCATCATGTGCACTGAAGGGCTTTCGGCTCTGCTTAACAGGG
AAGAATCTCTCTCTAACATCTCTGGTATTAAAATTAACAAACATTGCCCCACTATAACTCATCTCTTTTTTGCAGATGACAGTCTCGTGTTTTTCAAAGCTTCAAGAAAG
GAATGTCTGAATATCAAAAGAGTCTTGAAGACCTATGAGCTGGCTTCTGGCCAGAAGATCAACCTTAACAAATCCTTGTGTATGATAAGCAAAAATATTAGTGAGGCTAC
AGCGGCGGAGATCAGTAGAGAGTTGGGAGTTACTCAATCCAACTCTATAGGCCACTACCTTGGCCTCCCAGCCCAATCGGGAAGAAACAAGGGTATTATGTTTAGAAGGC
TTAGGGAGAGAGTTTGGAAAGCGCTTCAAAACTGGAAAGAGAAGTTTTTCTCAGTGGGCGGGAAGGAGATCCTTATTAAAACCATTGCCCAAGCCATCCCCATCTATACT
ATGAGCTGTTTTAGGATTCCTAAGAATATTTGTGGGGAAATCAATAGGATTTGTGCTAATTTCTGGTGGGGCTCTTTTCAAGATAAAAGGAAAAAGCATTGGTTTAGTTG
GAAGAATATGTGCTTGAGCAAAGACAAAGGGGGGATGGGTTTTCGGGACCTTCAGCTTTTTAACCAAGCGATGCTTGCAAAGATTAGTTGGAGAATCCTCAGGGAGCCTA
ATAGCCTTATGGCCAAAATCCTTAGGGGTAAATATTTCAGCAATAGCACCTTTCTGAAAGCTAAAGCCAAAGGAAGTGCTTCAATCGTGTGGAAAAGTATCTTGTGGGGT
AGAACTTTATTTGATAAAGGCTATAAATGGAAAGTTGGGAACGACAAGCAAGTTTACATTGATGAAGACCCTTGGTTGATGCACAACAATAGATGGAAGCCCCTTAGAGT
TAAGGATATTTTAAAAGGTAAAAGAGTTTCAGAGATTCTGAATGAGGATGGTTCTTGGAAAGAGGAGTTAATCCAAGAAGCATTTCTCCCTATCGATGCTGAAACCATCC
TTAGTATGCCTAAGAGGAACCGTGCCCTGAATGATGAGATTATCTGGGGGGCGGACAAAAAAGGGTGTTTTTCGGTCAAGAGCGCATATCACTTGGCAGTCAATAACTCC
AACTTTCAAGTGGCTTCTGGCTCAGACCCTTCAAATTCCTCCATAATTTGGAAGTCAATCTGGAAACTCAGGAGTTGGCCGAAGATTAAAATTCTTTTGTGGAAAGCCAT
GAACAATGCTTTACCCACTCTACAAAATATTCAGAAAATTGGGGTGGATACTAATTCGTGCTGCTTTTTGTTTAGGTGCCAAGAGGAGAGCGTGGAACATATCTTCTGGA
ATTGTAAAGTGTTGGAAGAGACGCACCAGTCAGTGACGTCAGAGACTCGATCGAGTCACGGGGGGTGGAATCCGCCGGAGGCCCACCAATGGAAATTGAACGTTGATGCT
ACCTGGTTCGATGAAGCAGAGGTCGGAGGGGTGGGGTGGATCATCCGCGACTCGGCAGGTTCTCTGATAGGTGCTGGCGGAAAGAAAATTACAAGGAAGTTAGAGATAAA
CATGCTCGAAGCCACAACTATGATAGAAGGCCTCAAGCAAATTGCTCTTAAAGCGTGTTCTTACCCAGAAGGATTTGATCACGAGCTGGTGGTTGAATCGGACGCATCTG
AGATCGTGAAGCTGATTAACCGGGAGACGGTCGATTTATCGGAAATTTTGACCTTGATCGGTGAGATCCTTGCATTGGCACCACCTGCGAAGGTGGTGAATTTCATTTTC
AGCCCTCGTTCCACCAATTTTTTGGCGCACTCCCTTGTGCGCGATGTAGCCAAATTTGGCGATTTCTTGTTCTTCTGTTTTGATCCTTCTCCTTTCTCGAGAATGGATGC
TTCTGATGGGGTTTGCTACTTTTGGCCCCCCTGTATCTCCGATTACCTGGAGAAGTTGGTTGTTGTACCTAACTCTTCTTCCTCTTAA
mRNA sequence
ATGTTTCCTCCGTCTGGCTTCAAGCAGCGGTTTGCTGGTCTGTTGCGACAGTGCGACTTGACGGGGCAACGCGGTGGATTTGGTGGCGGAGCAGTGTTCGGTGGTGGCAA
GAGTAGAGGGGGAGAGATTGTCGAAGGGGGGAAGGGGACTATGCATAAATGCAAAAAATTAGAGTCTGAGCATTCTTTAAAGCTAATCATCGAGAAGAGCCTATTAGGTT
TGAGGAAGGATGGACTGAGTTTGAAGAATGGACCAAGGAACTGTAAAAAGCCTATTAGGTTTGAGGAAGGATGGACTGAGTTTGAAGAATGTAAAGAGATTGTTAAAAAC
CATTGGAGATCATCGAGAAGAGCCTCCATTAGGTCTTTTCAAGGTAAGGTCACCAGCTGCATCTTTAAACTCAAAAAGTGGAACCAGATTACACTCAAAGGGTCTATAAA
GGCAGCCATTGCAAAGAAGGAGGAAGAAATTCAGCAGTTATCAAATTATGGTAGTGGTAAAGGTGGAGGAAGCCTGATTGTTGCAGAAAGGGAGCTTGAGAAGTTGCTCG
AGGAGGAGGAAAAATATTGGAAGTTAAGGTCTAGGGAGGATTGGCTTAAGTGGGGGGATAGGAACACAAAATGGTTCCATTCCAAAGCTAGCCAAAGGAGAAAAAGAAAC
AATATTGATCGAATCCTCAATGAAGATGGTTCATGGATCTGTGACAGCAAAGTTATAGGGGAAGGAGCCACAAAATACTTCAAGAACCTGTTTGAATCTTCTAACCCAAA
CCCTGAAGCAGTAAAGGAGATCTTGCAAAGCTTTACTAGCAAAATCTCGGAAGACCAAAAGAAGGTAATGGAGGAGCCTTTTTCTGTAGCAGAGATTGAAGTGGCTTTAA
AAAACATGAGTCCGAGAAAGGCCCCAGGTCCTGATGGGGCCCACGCTCTGTTTTTTCAGAAGTATTGGGGAGATACAAAGGAGGATCTTGTGCATACCTGTCTGGAAGTG
CTCAATGAAGGCAAAGGGATGGGTCCGATTAACAACTCTTTCATTGCCCTTATACCAAAGACAAAGAAGCCGAAAAAAATGGAAGAATTCAAGCCTATCAGCCTTTGTAA
TGTGATCTACAAGGTGATTGCGAAGGTGTTGGCTAACAGGTTAAAGCAGGTCCTTGAGACTATCATATCAAATACTCAATCGGCTTTCATTCTGGGGAGACTCATTTCAG
ATAACGTGGTAGTGGGTTTTGAATGCATCCACGCTATAAACAATAGAAGATCGGGGAAAGCGGGCCTTGCAGCTCTTAAGCTCGATATGAGCAAGGCTTACGATCGTGTC
GAATGGGAGTACATCAGGCAATTGATGATTCACATGAATTTCCCTCAGAACTGGATCAAGCTTATCATGGATTGCGTGGAATCGGTTAGCTATCAAGTTCTTATTAATGG
TATCCCCCAGGAAGTTTTCACCCCCAAAAGAGGTATTCGTCAAGGAGATCCGCTTTCACCCTATTTGTTCATCATGTGCACTGAAGGGCTTTCGGCTCTGCTTAACAGGG
AAGAATCTCTCTCTAACATCTCTGGTATTAAAATTAACAAACATTGCCCCACTATAACTCATCTCTTTTTTGCAGATGACAGTCTCGTGTTTTTCAAAGCTTCAAGAAAG
GAATGTCTGAATATCAAAAGAGTCTTGAAGACCTATGAGCTGGCTTCTGGCCAGAAGATCAACCTTAACAAATCCTTGTGTATGATAAGCAAAAATATTAGTGAGGCTAC
AGCGGCGGAGATCAGTAGAGAGTTGGGAGTTACTCAATCCAACTCTATAGGCCACTACCTTGGCCTCCCAGCCCAATCGGGAAGAAACAAGGGTATTATGTTTAGAAGGC
TTAGGGAGAGAGTTTGGAAAGCGCTTCAAAACTGGAAAGAGAAGTTTTTCTCAGTGGGCGGGAAGGAGATCCTTATTAAAACCATTGCCCAAGCCATCCCCATCTATACT
ATGAGCTGTTTTAGGATTCCTAAGAATATTTGTGGGGAAATCAATAGGATTTGTGCTAATTTCTGGTGGGGCTCTTTTCAAGATAAAAGGAAAAAGCATTGGTTTAGTTG
GAAGAATATGTGCTTGAGCAAAGACAAAGGGGGGATGGGTTTTCGGGACCTTCAGCTTTTTAACCAAGCGATGCTTGCAAAGATTAGTTGGAGAATCCTCAGGGAGCCTA
ATAGCCTTATGGCCAAAATCCTTAGGGGTAAATATTTCAGCAATAGCACCTTTCTGAAAGCTAAAGCCAAAGGAAGTGCTTCAATCGTGTGGAAAAGTATCTTGTGGGGT
AGAACTTTATTTGATAAAGGCTATAAATGGAAAGTTGGGAACGACAAGCAAGTTTACATTGATGAAGACCCTTGGTTGATGCACAACAATAGATGGAAGCCCCTTAGAGT
TAAGGATATTTTAAAAGGTAAAAGAGTTTCAGAGATTCTGAATGAGGATGGTTCTTGGAAAGAGGAGTTAATCCAAGAAGCATTTCTCCCTATCGATGCTGAAACCATCC
TTAGTATGCCTAAGAGGAACCGTGCCCTGAATGATGAGATTATCTGGGGGGCGGACAAAAAAGGGTGTTTTTCGGTCAAGAGCGCATATCACTTGGCAGTCAATAACTCC
AACTTTCAAGTGGCTTCTGGCTCAGACCCTTCAAATTCCTCCATAATTTGGAAGTCAATCTGGAAACTCAGGAGTTGGCCGAAGATTAAAATTCTTTTGTGGAAAGCCAT
GAACAATGCTTTACCCACTCTACAAAATATTCAGAAAATTGGGGTGGATACTAATTCGTGCTGCTTTTTGTTTAGGTGCCAAGAGGAGAGCGTGGAACATATCTTCTGGA
ATTGTAAAGTGTTGGAAGAGACGCACCAGTCAGTGACGTCAGAGACTCGATCGAGTCACGGGGGGTGGAATCCGCCGGAGGCCCACCAATGGAAATTGAACGTTGATGCT
ACCTGGTTCGATGAAGCAGAGGTCGGAGGGGTGGGGTGGATCATCCGCGACTCGGCAGGTTCTCTGATAGGTGCTGGCGGAAAGAAAATTACAAGGAAGTTAGAGATAAA
CATGCTCGAAGCCACAACTATGATAGAAGGCCTCAAGCAAATTGCTCTTAAAGCGTGTTCTTACCCAGAAGGATTTGATCACGAGCTGGTGGTTGAATCGGACGCATCTG
AGATCGTGAAGCTGATTAACCGGGAGACGGTCGATTTATCGGAAATTTTGACCTTGATCGGTGAGATCCTTGCATTGGCACCACCTGCGAAGGTGGTGAATTTCATTTTC
AGCCCTCGTTCCACCAATTTTTTGGCGCACTCCCTTGTGCGCGATGTAGCCAAATTTGGCGATTTCTTGTTCTTCTGTTTTGATCCTTCTCCTTTCTCGAGAATGGATGC
TTCTGATGGGGTTTGCTACTTTTGGCCCCCCTGTATCTCCGATTACCTGGAGAAGTTGGTTGTTGTACCTAACTCTTCTTCCTCTTAA
Protein sequence
MFPPSGFKQRFAGLLRQCDLTGQRGGFGGGAVFGGGKSRGGEIVEGGKGTMHKCKKLESEHSLKLIIEKSLLGLRKDGLSLKNGPRNCKKPIRFEEGWTEFEECKEIVKN
HWRSSRRASIRSFQGKVTSCIFKLKKWNQITLKGSIKAAIAKKEEEIQQLSNYGSGKGGGSLIVAERELEKLLEEEEKYWKLRSREDWLKWGDRNTKWFHSKASQRRKRN
NIDRILNEDGSWICDSKVIGEGATKYFKNLFESSNPNPEAVKEILQSFTSKISEDQKKVMEEPFSVAEIEVALKNMSPRKAPGPDGAHALFFQKYWGDTKEDLVHTCLEV
LNEGKGMGPINNSFIALIPKTKKPKKMEEFKPISLCNVIYKVIAKVLANRLKQVLETIISNTQSAFILGRLISDNVVVGFECIHAINNRRSGKAGLAALKLDMSKAYDRV
EWEYIRQLMIHMNFPQNWIKLIMDCVESVSYQVLINGIPQEVFTPKRGIRQGDPLSPYLFIMCTEGLSALLNREESLSNISGIKINKHCPTITHLFFADDSLVFFKASRK
ECLNIKRVLKTYELASGQKINLNKSLCMISKNISEATAAEISRELGVTQSNSIGHYLGLPAQSGRNKGIMFRRLRERVWKALQNWKEKFFSVGGKEILIKTIAQAIPIYT
MSCFRIPKNICGEINRICANFWWGSFQDKRKKHWFSWKNMCLSKDKGGMGFRDLQLFNQAMLAKISWRILREPNSLMAKILRGKYFSNSTFLKAKAKGSASIVWKSILWG
RTLFDKGYKWKVGNDKQVYIDEDPWLMHNNRWKPLRVKDILKGKRVSEILNEDGSWKEELIQEAFLPIDAETILSMPKRNRALNDEIIWGADKKGCFSVKSAYHLAVNNS
NFQVASGSDPSNSSIIWKSIWKLRSWPKIKILLWKAMNNALPTLQNIQKIGVDTNSCCFLFRCQEESVEHIFWNCKVLEETHQSVTSETRSSHGGWNPPEAHQWKLNVDA
TWFDEAEVGGVGWIIRDSAGSLIGAGGKKITRKLEINMLEATTMIEGLKQIALKACSYPEGFDHELVVESDASEIVKLINRETVDLSEILTLIGEILALAPPAKVVNFIF
SPRSTNFLAHSLVRDVAKFGDFLFFCFDPSPFSRMDASDGVCYFWPPCISDYLEKLVVVPNSSSS