; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035056 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035056
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:14230216..14234938
RNA-Seq ExpressionLag0035056
SyntenyLag0035056
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]2.5e-22337.06Show/hide
Query:  WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPWMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRC
        WRFTG YG P T ++  +W LL  L      PW+  GDFN IL   EK GGR +   ++ GFR AVD C   D GF+G +FTW    P    +  R+DR 
Subjt:  WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPWMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRC

Query:  FGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAPLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGR
               T FP  +V HL+ ++SDH PI ++++  +     H   ++RFEE W+     ME +R  W    +  +PR V  +  +    L  W RG  G 
Subjt:  FGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAPLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGR

Query:  YGDRIREASEEVQRALGRLSTSGSSADLQVAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVS
           +I+    ++   L    T  +    +   A+L+++  + E+YW+QRSR  WLK GDRNT++FH +AS R++RN I GL D  G+ +    E+   V 
Subjt:  YGDRIREASEEVQRALGRLSTSGSSADLQVAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVS

Query:  EYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETM
         YF+ +F+S+     D + V   +R  V++EMNR L+  F  EEI  AL Q+HP+KAPGPD  S  F+++ W VVG DVV   L  L+ G     +N T 
Subjt:  EYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETM

Query:  IVLIPKVKNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYAL-KHRRGKTGWASLKLDMSKAYDRVEWRY
        + LIPKV +P  +++ RPISL NV+YK+ AK L NR+K +L ALIS +QSAF+PGR + DN+I+ +E ++ + K   G+ G+ +LKLDMSKAYDRVEW +
Subjt:  IVLIPKVKNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYAL-KHRRGKTGWASLKLDMSKAYDRVEWRY

Query:  LERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLF
        LE +ML MGF++ WV LI  C+++V +SF +NG   G V P R LRQGDPLSPYLFLLCAE  S ++++AE    + G+ L R  P +SHLFFADDS LF
Subjt:  LERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLF

Query:  FRAKKSEAWSVPCTVPSVHEGIGQ-------------------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGG
         +A +S    +   +    +  GQ                          L+V+    H++YLGLP+ + ++R     ++K+ +WK+IQGWK KL S  G
Subjt:  FRAKKSEAWSVPCTVPSVHEGIGQ-------------------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGG

Query:  REILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGR
        +E+LLK + QA+P Y M+CF  PK L ++I  MMA +WW   E  R+IHW+SWK +C+PK  GGLGFR+L  FN AL+AKQ WR+   P S +AR+LK R
Subjt:  REILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGR

Query:  YFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL--------
        Y+     LEA +G  PS++WR+L   K +LE G RWR+G+G+ ++I+   W+P   + KV S+P++   +  V+ L+  +   T  W  + L        
Subjt:  YFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL--------

Query:  ------MPLS----------HPGTRGFIFIPK-----------------LSSGLVEGML--------EDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDL
              +PLS          H    G   +                    ++G   G+         E   P K+++FLWR  LN LPT  NL RRGV  
Subjt:  ------MPLS----------HPGTRGFIFIPK-----------------LSSGLVEGML--------EDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDL

Query:  LDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQG-----AERPSGLVEWAKG
        L  CVFCG+  E+  H F +C  A+++         L++V A       +D+   +   + E++++  W++W+ RN+V         +E  SG +     
Subjt:  LDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQG-----AERPSGLVEWAKG

Query:  MWWL---FVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLG
         + L    +    V  KW             P  G  K+N   +         +G++VRD +G
Subjt:  MWWL---FVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLG

PKA56961.1 Putative ribonuclease H protein [Apostasia shenzhenica]6.5e-22435.57Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE-------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETP
        FL ETK+ SS F+++K    F S F+V S GRSGGL L+W                            WRFTG YG P    K  +W  L  L    + P
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE-------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETP

Query:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL
        W+  GDFN IL+Q+EKEGG A+ E +++ FREA+  C L D G+ G  FTW N + G   + ER+DR   +M  + LFPH+ V HL+ + SDH  +++ L
Subjt:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL

Query:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSA----DL
                      +RFE  WL DP ++ +V ++W +S    +   +          L  W +   G     I+++    Q AL  +           ++
Subjt:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSA----DL

Query:  QVAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSV
        +  +  L  +   EEV WKQR+R  WL+ GD NT +FH+ A+FR+K+N I  L D  G   ++   +  ++ +YF NIF++   + +  +++T  + + +
Subjt:  QVAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSV

Query:  TDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKL
        + +    L + F +EE+  AL QIHP+KAPGPD    +FF+K W ++G DVV+  L ILNEG S   +N T IVLIPK      +  FRPISLCNV YK+
Subjt:  TDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKL

Query:  VAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYAL-KHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFS
        +AKAL NR+K +L+++IS NQSAF+P R + DN I+ YEC++++ K +RG   + +LKLDMSKAYDRVEW +LE++M+ M F   ++  I  C++TV ++
Subjt:  VAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYAL-KHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFS

Query:  FNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIG-----
         + NG   G++ P R LRQGDPLSPY FL+CAEGLSS++   E   ++ G++ +RR P I+HLFFADDSLLFFRA+  EA ++   +    +  G     
Subjt:  FNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIG-----

Query:  --------------------QILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQ
                            Q+L V+  ++ ++YLGLP  + + +      +K+R+ K+I+GWK +L S+GG+EI++K+I QAIP Y+M+ F+ P  L +
Subjt:  --------------------QILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQ

Query:  DISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKE
        +I +++++FWW    + + IHW  W  +CK K  GG+GFR+L +FN ALL KQ WR+ ++  S   +VLKGRYFPS    EA +G  PSF+WR++   K+
Subjt:  DISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKE

Query:  LLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPI-SLPPDATVAGLMTALRGGTKPWWNNTLM---------------PL------------SHPGTR
        +++MG  WR+GNG+ +KI+   WIPR  + +  SPI +L    TV  L+  + G     WN +L+               PL            +HP  R
Subjt:  LLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPI-SLPPDATVAGLMTALRGGTKPWWNNTLM---------------PL------------SHPGTR

Query:  -----GFIFIPKLS-------SGLVEGMLEDDA----------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDV-CVFCGKRGESSMHFFWEC
              + FI  ++       +G+    L+  A                PSKIK+FLWR   + LPT   L  R + +  + C+ C    E+S+H   +C
Subjt:  -----GFIFIPKLS-------SGLVEGMLEDDA----------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDV-CVFCGKRGESSMHFFWEC

Query:  KWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCV
         +  ++    G    + K  +        D+ E  +  K E   +  WS+W  RN  RL+G +  S +      + +L        GK            
Subjt:  KWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCV

Query:  ASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM
        + PRN   K+NFDA+  +  +R  LG+I R+ LG+V+
Subjt:  ASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM

XP_023896927.1 uncharacterized protein LOC112008817 [Quercus suber]9.3e-22335.13Show/hide
Query:  IETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPWMV
        +ET+ + S  ERL+ RL + + F V      GGLALLWS++                        WRFTG YG P+  ++  +W++L+HL       W+ 
Subjt:  IETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPWMV

Query:  GGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAPL
         GDFN I    EK GG  +PE ++  FR+ +D C L D G++G  FTWCNR+     VW R+DR    +     FP A + HL  S SDH+P+ L    L
Subjt:  GGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAPL

Query:  VRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEARL
            +    + +RFE  WL D     VV   W  S +      V  +  +C   L+LW +   G     +    + +  A G      + A ++     +
Subjt:  VRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEARL

Query:  EAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNRH
          +   EE  W QRS+  WL++GD+NT++FH +A+ R K+N I GL D+ G   +   +I  +++ Y+ N+F +  P+E  +D V +GV+  V+D MN  
Subjt:  EAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNRH

Query:  LMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKALVN
        L++PF+ EE+  ALKQ+ P  APGPD L   F++  W  VG +V    LS+LN G  P  +N T I LIPK+K+P +V+EFRPISLCNV+YKL+AK L N
Subjt:  LMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKALVN

Query:  RMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFR
        R+K ML  LIS+ QSAF+  R + DN ++ +E ++ LK +R GKTG+ S+KLDMSK YDRVEW YLE+IM KMGF++ W++LI  C+ +V +S  +NG  
Subjt:  RMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFR

Query:  CGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPC----------------------
         G + P+R LRQGDPLSPYLFLL  EGL ++  EA+    I G+ L    P ISHL FADDSL+F +A  +E+  V                        
Subjt:  CGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPC----------------------

Query:  ---TVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMA
           T P V   I   L +     + QYLGLPS + + +  +   +KER+WK+++GWK KL S  GREIL+K+++QAIP ++M+CF+ PK LI +I +++ 
Subjt:  ---TVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMA

Query:  RFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMGVR
        +FWW    + ++IHW+SW  +C+PK  GG+GF++L  FN +LLAKQ WR+         RV K ++FP+    + EV  + S+ W++++  K ++E+G  
Subjt:  RFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMGVR

Query:  WRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTK---------PWWNNTL--MPLS----------HPGTRGFIF---------
        WR+G+G+ +KI G  W+P+++  K+ SP++ LPP A V  ++       K         P+  N +  +PLS          H  + G            
Subjt:  WRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTK---------PWWNNTL--MPLS----------HPGTRGFIF---------

Query:  ------IP-----KLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSILREAGFGEILE
              +P     + SS +   +     P K++  +WR     LPT+ NL RR V     C  C    E ++H  W C      W K         E+++
Subjt:  ------IP-----KLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSILREAGFGEILE

Query:  KVRAGCCLLLCRDIREVI----EGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFD
          R    +    D+ E++    +      L ++ W +W+ RN  R+  +      +      + L  +   V+ +     ++  G    P    YK+NFD
Subjt:  KVRAGCCLLLCRDIREVI----EGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFD

Query:  ASFLSHLSRADLGIIVRDPLGQVM
         +  S L  A LG+++RD  GQV+
Subjt:  ASFLSHLSRADLGIIVRDPLGQVM

XP_023920731.1 uncharacterized protein LOC112032201 [Quercus suber]1.2e-22536.23Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW
        FL+ETK+ + +    K    +     V S G+SGGLALLW                           WRFTG YG+P T  +  TWALL+ L      PW
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW

Query:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA
        +  GDFN I  Q EK GG  +P  +++ FR  +  C  ID G+ G  FTW    P  G ++ R+DR F N A K LFP + V H+  S SDH  + + + 
Subjt:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA

Query:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA
                    ++RFE  WL DP   EVV+ +W       +   +      C + L  W +   G  G +I++   ++Q          +SA++     
Subjt:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA

Query:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN
         L      E   W QRSR  W+  GDRNT +FH +AS R++RN I G+ DS G  +++      ++ EYFENIF S+     D  ++   V+  VT+EMN
Subjt:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN

Query:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL
          L R F  +E+  ALKQ+HP K+PGPD +   F++  W + G  V +  L  LN G+ P   NET IVLIPKVKNP++++++RPISLCNV+ +L +K +
Subjt:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL

Query:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECI-YALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING
         NR+K  L  +IS+NQSAF+  R + DN I+ +E + Y  K   GK    +LKLDMSKA+DRVEW  L RIM KMGF   WV+L+ +C++TV +S  ING
Subjt:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECI-YALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING

Query:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQILQ------
           G + PSR LRQGDP+SP+LFL CAEGLS++L +A +T  + G+    R P ISHLFFADDS++F +A   +   +   +    +  GQ+L       
Subjt:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQILQ------

Query:  -------------------VQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM
                            ++   H  YLGLPS + +++ +T + +KE++  ++ GWK KL S  G+E+L+K++ QAIP Y+M+ F+ P  L  D++ M
Subjt:  -------------------VQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM

Query:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG
        + RFWW   +   ++ W+SW+ MC PK  GGLGFRDL+ FN ALL KQ WR+   P S + RVLK RYFP++ FL AE+G +PS+ WR+++  + +LE G
Subjt:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG

Query:  VRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL---------------MPLSHPGTRGFI---FIP--------
         RW+VG+GE I I+   W+PR +  +V S P SLP D  V+ L+    G     WN +L               +PLS    R  I   + P        
Subjt:  VRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL---------------MPLSHPGTRGFI---FIP--------

Query:  --KLSSGLVEGMLEDDA------------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEI
          K++  + +G L+ +A                  P+KIK+F WR C N LPT  NL  R V     C  C    E++ H FW+C  AK +   A     
Subjt:  --KLSSGLVEGMLEDDA------------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEI

Query:  LEKVRAGCCLLLCRDI--REVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKW-----GGNGWSGEGCV--ASPRNG
           +R    +    D+   + +  E  E  V   W +W  RNK R  GA R S           + VR   ++G++       + +  +  +    P+  
Subjt:  LEKVRAGCCLLLCRDI--REVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKW-----GGNGWSGEGCV--ASPRNG

Query:  WYKVNFDASFLSHLSRADLGIIVRDPLGQVM
        WYK N DA+    L    +G+I+RD  G V+
Subjt:  WYKVNFDASFLSHLSRADLGIIVRDPLGQVM

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]1.3e-22435.19Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW
        FL+ET+ + S  ERL+ RL F + F V      GGLALLW+++                        WRFTG YG P+  ++  +W++L+HL      PW
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW

Query:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA
        +  GDFN I    EK GG  +PE ++  FR+ +D C L D GF+G  FTWCNR+     VW R+DR          FP A + HL  S SDH+P+ L ++
Subjt:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA

Query:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA
          ++   A   + +RFE  WL D     VV   W  +        +  +  +C   L+LW +   G     +    + +  A G      + A +++   
Subjt:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA

Query:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN
         +  +   EE  W QRS+  WL++GD+NT++FH +A+ R K+N I GL ++ G   +   +I  L++ Y+ N+F++  P+E  ++ V  GV+  V++ MN
Subjt:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN

Query:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL
          L++PF+ EE+ +ALKQ+ P+ APGPD     F++  W  VG +V    LS+LN G  P  +N T I LIPK K+P +V+EFRPISLCNV+YKL+AK L
Subjt:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL

Query:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING
         NR+K ML  LIS++QSAF+  R + DN ++ +E ++ LK +R GK G+ S+KLDMSKAYDRVEW YLE+IM +MGF   W++LI  C+ +V +S  +NG
Subjt:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING

Query:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPC--------------------
           G + P+R LRQGDPLSPYLFLL  EGL ++  EA+ +  I G+ L    P ISHL FADDSL+F RA  +E+  +                      
Subjt:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPC--------------------

Query:  -----TVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM
             T P V   I   L + +T+   QYLGLPS + +++  +   +KER+WK+++GWK +L S  GREIL+K+++QAIP Y+M+CFR PK LI DI ++
Subjt:  -----TVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM

Query:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG
        + +FWW    E ++IHW+SW+ +C+PK  GG+GF++L  FN ++LAKQ WR+         RV K ++FP+    ++EV  + S+ W++++  + ++++G
Subjt:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG

Query:  VRWRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTKPWWNNTL---------------MPLS----------HPGTRGFIF---
          WR+G+G+ +KI G  W+P++T  K+ SPIS LPP + V  L+       +  W + L               +PLS          H  T G      
Subjt:  VRWRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTKPWWNNTL---------------MPLS----------HPGTRGFIF---

Query:  ------------IP-----KLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSILREAG
                    IP     + SS   +G+     P K++ F+WR     LPT+ NL +R V     C  C    E ++H  W C      W K       
Subjt:  ------------IP-----KLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSILREAG

Query:  FGEILEKVRAGCCLLLCRDIREVIE-----GEKFEE--LVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWL-FVRLGGVVGKWGGNGWSGEGCVAS-
          E+++     C    C    +++E     G++ +   L ++ W +W+ RN VR++ +      +     M+ L F    G + +          C ++ 
Subjt:  FGEILEKVRAGCCLLLCRDIREVIE-----GEKFEE--LVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWL-FVRLGGVVGKWGGNGWSGEGCVAS-

Query:  ----PRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM
            P +  +K+NFD +  S L  A LG+++RD  G V+
Subjt:  ----PRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM

TrEMBL top hitse value%identityAlignment
A0A2N9F6L9 Reverse transcriptase domain-containing protein3.8e-23836.55Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW
        FL+ET       E L+  L F+S   V S  + GGLAL W+++                        WR TG+YG P+  ++  TW L++ L G ++  W
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW

Query:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA
           GDFN I+   E  G  ++P+ ++  FR  +D C L+D GF G  FTWCN +    T W R+DR   N+     FP A V HLD  +SDH+ + L+  
Subjt:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA

Query:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA
        P          + +RFEE W+ D G  + ++ +WG+  +  +   V+ +  +C   L  W R   G  G +I     E+++A        S  +LQ+   
Subjt:  PLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEA

Query:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN
        RL ++F +EE  W+QRSR  WL  GDRNT++FH++A+ R +RN I GL D+ GV+      +  L   Y+ ++FT+  P +  I+ V A V + VT++MN
Subjt:  RLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMN

Query:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL
        + L+R F   E+ +ALKQ+ P KAPGPD +   F++K W VVG DV +  LS LN G     +N T I LIPK KNP RV+EFRPISLCNV+YKL++K L
Subjt:  RHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKAL

Query:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING
         NR+K +L  ++S +QSAF+PGR + DN ++ +E ++ + H + G+ G  +LKLDMSKAYDRVEW +LE+IM K+GF Q W+SL+  C+STV +S  +NG
Subjt:  VNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNING

Query:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQ---------
           G +KPSR LRQGDPLSPYLFLLCAEGL S++ +A     I G+ L R+ P I+HLFFADDSLLF +A       +   +    +  GQ         
Subjt:  FRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQ---------

Query:  ----------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM
                         L+V I   + +YLGLPS + +NR+ +   +KERVW++++GWK KL S  GREIL+K++ QAIP YSM+CFR P KL  D+ +M
Subjt:  ----------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSM

Query:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG
        + RFWWS   E R+IHWVSW+ +C+ K  GGLGFRDL  FN ALLAKQ WR++ +  S   RV K ++FP    ++     R S+ W++++  +E++  G
Subjt:  MARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMG

Query:  VRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDATVAGLMTALRGGTKPWWNNTL---------------MPLSH---------PGTRGFIF-----
          WRVGNG+ I I+   W+  D + K+ +P    P+  +   +  L    +  W++ L               +PLS+         PG    ++     
Subjt:  VRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDATVAGLMTALRGGTKPWWNNTL---------------MPLSH---------PGTRGFIF-----

Query:  ----------------IPKLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEILE
                         P         +   + P K ++F W+     LPT  NL +R + +   C  CG+  E ++H  W CK  + +  +  + + L 
Subjt:  ----------------IPKLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEILE

Query:  KVRAG-CCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASF
          ++G    LL + I +  + E  E+ +++ W++W  RNK+RL         V   K   +L   L     K   +    E     PR+   K+N+D + 
Subjt:  KVRAG-CCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASF

Query:  LSHLSRADLGIIVRDPLGQVMLS
         S  + A +G+IVRD  G V+ S
Subjt:  LSHLSRADLGIIVRDPLGQVMLS

A0A2N9I509 Uncharacterized protein2.8e-23337.53Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW
        FL+ET++     E L++RL    CF VD  G  GGLALLW                         S WR T  YG+P+   +P+TWALL+ L   ++ PW
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW

Query:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA
        +V GDFN I+   E+ G  A+  +++ GFR+ +  C L D GF G  FTW NR+ G   V  R+DR   N A   LFP   V +   S SDH  I++ L 
Subjt:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA

Query:  PLVRMVDAHGSR---IYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGK-CLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQ
        PL+     +  R   ++RFE AWL + G  +V+  +W +SP  G+      E  K C   L  W           I     +++    + + S     + 
Subjt:  PLVRMVDAHGSR---IYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGK-CLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQ

Query:  VAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVT
        +    L  +  +EE+ W+Q+SR SWL+ GDRNT++FH  AS RRK NLI GL DS     ++P +I  +  +YF  +F SS P+   ++ V   V   VT
Subjt:  VAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVT

Query:  DEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLV
          MN  L+RPF  EEI  AL Q+HP+KAPGPD ++  FF+K W VVG DV    L  L+ G     +N T IVLIPKVK P R+S+FRPISLCNVVYK+V
Subjt:  DEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLV

Query:  AKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALK-HRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSF
        +K LVNRMK ML  +IS +QSAF+PGR + DN I+ +E ++ LK HR G     + KLDMSKAYDRVEW YL  I+LK+GF   WV L+  CV++  +S 
Subjt:  AKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALK-HRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSF

Query:  NINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQIL---
         +NG   G V PSR LRQGDPLSPYLFL+CAEGLS+++ +AE  + I G+ + R  P ISHLFFADDS++F RA  S+   +   +    +  GQ++   
Subjt:  NINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQIL---

Query:  QVQITTSHN----------------------QYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQD
        +  I  SHN                      +YLGLP  + + +      +K+R+W+++QGWK KL S  G+E+L+K+++QA+P Y+M+CF+FP  L  +
Subjt:  QVQITTSHN----------------------QYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQD

Query:  ISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKEL
        ISSM   FWW   +  R+IHW+S   + KPKC GG+GFRDL++FN+ALLA+Q WR+L+ P S + R LK +YFP+S FLEA V    SF+WR+L   +++
Subjt:  ISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKEL

Query:  LEMGVRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL--------------MPLSH---------PGTRGFIFI
        L MG+RWRVGNG +IK++   W+P  +  KV S P  L   ATV  L   +  GT  W  + L              +PLS            ++  +F 
Subjt:  LEMGVRWRVGNGEQIKIYGSNWIPRDTNLKV-SSPISLPPDATVAGLMTALRGGTKPWWNNTL--------------MPLSH---------PGTRGFIFI

Query:  PKLSSGLVEGMLEDDAPS----------------------KIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGF
         + +  ++      D PS                      K+K+F+W+ C N +PT   L  +G+     C +C +  E+  H  W C++A+++ + +  
Subjt:  PKLSSGLVEGMLEDDAPS----------------------KIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGF

Query:  GEILEKVRAGCCLLLCRDIREVIE-------GEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWG-GNGWSGEGCVASPR
              V     + +  + REV+E           E      W++W+ RN+            +     +  L      + G+ G  +  +G+     P+
Subjt:  GEILEKVRAGCCLLLCRDIREVIE-------GEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWG-GNGWSGEGCVASPR

Query:  NGWYKVNFDASFLSHLSRADLGIIVRDPLGQVMLSATFT
         G YK+N     +S+ SR  +GI++RD LG V  S   T
Subjt:  NGWYKVNFDASFLSHLSRADLGIIVRDPLGQVMLSATFT

A0A7N2LIH6 Uncharacterized protein2.6e-23437.26Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW-------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETP
        FL+ETK    + +  + +LGF     V S GRSGGLALLW                            WR TG YG+P T  + ++W LL+ L    E P
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW-------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETP

Query:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL
        W+V GDFN I++  EK G + +  ++++ FRE +  C LID GF G RFTWCN + G      R+DR   N A   +FP A+V H+  S SDH  + L L
Subjt:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL

Query:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAE
          +       G + + FEE W       E+V  +W       S   V     +C ++L+ W +   G     I++    +Q+         ++ ++Q  +
Subjt:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAE

Query:  ARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEM
          +  +   EEV WKQRSR SWL++GD+N+++FH  AS RR++N I GL+D  GV  ++      L+ +YF++I++S+ P+  D+ +     R  VT EM
Subjt:  ARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEM

Query:  NRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKA
        N  L + F+  E++ AL+Q+HP KAPGPD +S  F++K W +VG  V  C L  LN GV P  +N+T I LIPK KNP +++EFRPISLCNV+YK+++K 
Subjt:  NRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAKA

Query:  LVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNIN
        L NR+K +L+ +I + QSAF+PGR + DN I+ +E ++++  RR GK G  ++KLDMSKAYDRVEW YLE +M KMGF   W+SLI  CV++V FS  IN
Subjt:  LVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNIN

Query:  GFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQ--------
        G   G   PSR LRQGDP+SPYLFLLC EGLS+M+ + E    I G+  AR+ P ISHLFFADDS++F RA   E   V   +    E  GQ        
Subjt:  GFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQ--------

Query:  -----------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISS
                         I   QI   H +YLGLP  + + +      +K++V ++I GWKGKL S  GRE+L+K++ QA P Y+MN F+ P  L  +++S
Subjt:  -----------------ILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISS

Query:  MMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEM
        MM  FWW     ++++ WVSWK +CKPK  GG+GF+DL+ FN ALLAKQ WR+ + P S   RVLK +YF +S F+EA++G +PS+IWR++M  K +++ 
Subjt:  MMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEM

Query:  GVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTKPWWNNTLMP---LSHPGTRGFIFIPKLSSGLVEGML-----------
        G RW VG+G  I+I+ + W+P   + KV +  S       VA L++  RG     W  TL+    + H      + IP  S  L + ++           
Subjt:  GVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPIS-LPPDATVAGLMTALRGGTKPWWNNTLMP---LSHPGTRGFIFIPKLSSGLVEGML-----------

Query:  ---------------EDDA----------------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILRE
                       E +A                      P+KIK FLWR C   LPT   L+ R +   D C FCG+  E+S H  W C  AK   + 
Subjt:  ---------------EDDA----------------------PSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILRE

Query:  AGFG-----EILEKVRAGCCLLLCRDIREVIEGEK-FEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLG----GVVGK--WGGNGWSGE
         GF       ++E +     LL         +G+K +E   ++ WS+W+ RN VR  G  +    +      +   VR      G V K       WS  
Subjt:  AGFG-----EILEKVRAGCCLLLCRDIREVIEGEK-FEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLG----GVVGK--WGGNGWSGE

Query:  GCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM
             P   WYKVN DA+         +G+++R+  GQ+M
Subjt:  GCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM

A0A7N2R0C3 Reverse transcriptase domain-containing protein1.2e-23937.24Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW-------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETP
        FL ETK    R + L+ +LG     +V S GRSGGLA+LW                         +  WR TG YG+P    +P +W LL+ L      P
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLW-------------------------SSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETP

Query:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL
        W+V GDFN IL   EK G   +   ++  FRE +  C L+D GF G+RFTWCN + G      R+DR   N     LFP A+V H   + SDH   LLSL
Subjt:  WMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSL

Query:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEV-LRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVA
        +   R       R + FEE W  + G  EV+ R+W   P   +P        KC +  L+ W R   G     +++    +Q+         S+ ++Q  
Subjt:  APLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEV-LRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVA

Query:  EARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDE
        +  +  + L EE+ W QRSR  W+K+GDRNTR+FH  A+ RR++N I G++DS G  R+   E+  ++ EYF+ I++S+ P+E         V R VT++
Subjt:  EARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDE

Query:  MNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAK
        MN  L+R F++EE++ AL Q+HP K+PGPD +S  FF+K W VVGP VV+  +  L  GV P  VNET I LIPKVK P +++E+RPISLCNV+YKLV+K
Subjt:  MNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVAK

Query:  ALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNI
         L NR+K +L  ++ + QSAF+PGR + DN ++ +E ++ +  RR GK G  ++KLDMSKAYDRVEW YLE IM +MGF + W+SL+  CV+TV FS  I
Subjt:  ALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRR-GKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNI

Query:  NGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSV--------------------
        NG   G + P+R LRQGDP+SPYLFLLCAEGLS+ML   E    +SG+++ RR P ISHL FADD ++F +A   E   V                    
Subjt:  NGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSV--------------------

Query:  -----PCTVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDIS
               T   V EG+ ++   +I   H +YLGLP  + + +      +K++V ++I  WKG+L S  GREIL+K++ QA P Y+MNCF  P  L  +++
Subjt:  -----PCTVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDIS

Query:  SMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLE
        S++  FWW   ++++++ W++W  +CKPK  GG+GF+DL+ FN ALLAKQ WR+ + P S   RVLK RYFPSS F+EA++G  PS+ WR+L+  +E++E
Subjt:  SMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLE

Query:  MGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDA-TVAGLMTALRGGTKPWWNNTLMPLSHPGTRGFIFIPKLSSGLVEG---------------
         G RW +GNG+Q++I+   W+P   + KV SP     +   V  L+    GG   W  N +  +  P     I    +S  L E                
Subjt:  MGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDA-TVAGLMTALRGGTKPWWNNTLMPLSHPGTRGFIFIPKLSSGLVEG---------------

Query:  ----------------------------------MLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREA
                                          + +   PSK+K FLWR C N LPT   L  R V + + C  CG R ES+ H  W+C+ A ++ RE+
Subjt:  ----------------------------------MLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREA

Query:  GFGEILEKVRAGCCLLLCRDIREVI----EGEK---FEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGG----NGWSGEGC
            +L K+R        RD  E++    EG +   +E  V   W +W  RN ++ +G  + + ++     +     R G +  K         W     
Subjt:  GFGEILEKVRAGCCLLLCRDIREVI----EGEK---FEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGG----NGWSGEGC

Query:  VASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM
           PR GWYK N D +     +   +G+++R+  GQ+M
Subjt:  VASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVM

A0A803PWX1 Uncharacterized protein2.1e-23636.57Show/hide
Query:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW
        FL+E+K+   + ER+   L F S ++VD +G SGGL L+W  E                        W  TG YG P+   +  +W LL++L+ + + PW
Subjt:  FLIETKVQSSRFERLKLRLGFASCFSVDSIGRSGGLALLWSSE------------------------WRFTGIYGYPQTEDKPSTWALLKHLRGDAETPW

Query:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA
        +  GDFN I+   EK GGR +P   ++GF+E +D C  IDFG +    TWCN +     + ER+DR   N      F  A+V+ LD+  SDHR +++++ 
Subjt:  MVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLA

Query:  PLV---RMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQV
          V   +   A     + FEEAW  +    E++   W      G P     +  KC + L+ W + +  R  + I + ++++   L      G    +Q 
Subjt:  PLV---RMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQV

Query:  AEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTD
         E +L  +  ++E YW+QRSR  WL+WGDRNT++FH +AS RRK+N I+GL D  GV + +   +  +V +Y++ +F  S   +  +  V   V+  V+ 
Subjt:  AEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTD

Query:  EMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVA
         MN  LM  F  EE+  A+K ++P KAPG D L   F++K W  +  +V+  CL++LN G     +N+T++ LIPKV  P ++ EFRPISLCNV+YK+V+
Subjt:  EMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLCNVVYKLVA

Query:  KALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTGW-ASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFN
        K L NR++  L+ ++S +QSAF+ GR + DNAI+GYEC++ ++  R + G   +LKLDM+KAYDRVEWR+LE +MLK+G++  WVS I RC+++V+FSF 
Subjt:  KALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTGW-ASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFN

Query:  INGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSE-----------------------
        ING   G V P R LRQGDPLSP+LFLLCAE  S ++  AE + ++ G+   R+   +SHLFFADDSL+F  A + E                       
Subjt:  INGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSE-----------------------

Query:  ---AWSVPCTVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQD
            +    T+P V   +  I+ V++  ++ +YLGLPSF+ + +    +F+K +VW +++GWKG  FS  G+E+L+K+++QAIP Y+M+CFR PKK I  
Subjt:  ---AWSVPCTVPSVHEGIGQILQVQITTSHNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQD

Query:  ISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKEL
        I SM ARFWW   E+D +IHW  W  +CK K  GGLGFRDL +FNQALLAKQ WR +R P S  +RVLK  Y+P+ G +EA+ G   SF+WR+L+WGK++
Subjt:  ISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKEL

Query:  LEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDATVAGLMTALRG---------GTKPWWNNTLMPLSHPGTRGFIFIPKLSSGLVEGMLEDD
        ++ G RWR+GNG  +++    W+PR    K+     LP    V  L     G         G  P+ +  +   +    + +  + KL            
Subjt:  LEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDATVAGLMTALRG---------GTKPWWNNTLMPLSHPGTRGFIFIPKLSSGLVEGMLEDD

Query:  APSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRG-ESSMHFFWECKWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWW
         P K+K F+W++  + +PT   L  R + +   C  C     E+  H  W C+  + +   AGF   L++      L     +  +   ++FE  ++L W
Subjt:  APSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRG-ESSMHFFWECKWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVVLWW

Query:  SMWSTRNKVRLQGAERP--SGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVMLSAT
        ++W  RN V   G  +P  S +++W       F R   V  + G      E     PR G +KVN DA   +  + A L  +VRD  G+VM++AT
Subjt:  SMWSTRNKVRLQGAERP--SGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQVMLSAT

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog2.2e-3323.36Show/hide
Query:  RRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDID-VVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRK
        +R ++LI  + +    +  +P EI  +++EY++ +++    + ++ID  + A     ++ +    L RP    EI   ++ +   K+PGPD  +  F++ 
Subjt:  RRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDID-VVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRK

Query:  SWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKV-KNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECI
            + P ++    +I  EG+ P    E  I LIPK  K+P+R   +RPISL N+  K++ K L NR++  +  +I  +Q  FIPG     N       I
Subjt:  SWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKV-KNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECI

Query:  YALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEA
          +   + K     L +D  KA+D ++  ++ R + K+G E  ++ LIE   S    +  +NG +          RQG PLSP LF +  E L+  + E 
Subjt:  YALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEA

Query:  EATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQILQVQ-----ITTSHNQ-------------------YLGLPSFMPK
        +A   I G+ +      I    FADD +++    +     +   +       G  +        I T++NQ                   YLG+  ++ K
Subjt:  EATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQILQVQ-----ITTSHNQ-------------------YLGLPSFMPK

Query:  NRTSTLK----FVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNC--FRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGL
        +     K     +++ + + +  WK    S  GR  ++K  +     Y+ N    + P    +D+  ++  F W+  +       +S K        GG+
Subjt:  NRTSTLK----FVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNC--FRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGL

Query:  GFRDLEVFNQALLAKQCW
           DL ++ ++++ K  W
Subjt:  GFRDLEVFNQALLAKQCW

P0C2F6 Putative ribonuclease H protein At1g657502.1e-2823.52Show/hide
Query:  LPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGG
        +P    +    T   + ERV  ++ GW+ K  S  GR  L K+++ ++P +SM+    P+ ++  +  +   F W    E ++ H V W  +C PK  GG
Subjt:  LPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGG

Query:  LGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRP----SFIWRNLMWG-KELLEMGVRWRVGNGEQIKIYGSNWI-------
        LG R  +  N+AL++K  WR+L+E  S    VL+ +Y    G +     + P    S  WR++  G ++++  GV W  G+G+QI+ +   W+       
Subjt:  LGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRP----SFIWRNLMWG-KELLEMGVRWRVGNGEQIKIYGSNWI-------

Query:  ------PRDTNLKVSSPISLPP--------------------DATVAGLMTALRGGTKPWWNNTLMPLSHPGTRGFIFIPKL----SSGLVEGMLEDDAP
              P D +  V+  + +P                      A V  L+T  R     W  +     S       + + ++     +     + +   P
Subjt:  ------PRDTNLKVSSPISLPP--------------------DATVAGLMTALRGGTKPWWNNTLMPLSHPGTRGFIFIPKL----SSGLVEGMLEDDAP

Query:  SKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSI--LREAGF--GEILEKV------RAGCCLLLCRDIREVIE
         ++K FLW +    + T +   RR +   +VC  C    ES +H   +C      W + +   R+ GF    + E +      R+GC         E I 
Subjt:  SKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECK-----WAKSI--LREAGF--GEILEKV------RAGCCLLLCRDIREVIE

Query:  GEKFEELVVLWWSMWSTRN--KVRLQGAERPSGLVEWAKGMWWLFVRLGGV-VGKWGGN-----GWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGII
              +++ W   W   N      +  +R   + EWA  ++      G V VG          GW       SP  GW KVN D +   +   A  G +
Subjt:  GEKFEELVVLWWSMWSTRN--KVRLQGAERPSGLVEWAKGMWWLFVRLGGV-VGKWGGN-----GWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGII

Query:  VRDPLG
        +RD  G
Subjt:  VRDPLG

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-3523.47Show/hide
Query:  RKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNR----HLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFF
        R + LI  + +  G +  +P EI   +  +++ ++++      ++D +   + R    ++N+    HL  P   +EI   +  +   K+PGPD  S  F+
Subjt:  RKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAGVRRSVTDEMNR----HLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFF

Query:  RKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPK-VKNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYE
        +     + P + +    I  EG  P    E  I LIPK  K+P+++  FRPISL N+  K++ K L NR++  + A+I  +Q  FIPG     N      
Subjt:  RKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPK-VKNPSRVSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYE

Query:  CIYALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLL
         I+ +   + K     + LD  KA+D+++  ++ +++ + G +  ++++I+   S    +  +NG +   +      RQG PLSPYLF +  E L+  + 
Subjt:  CIYALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLL

Query:  EAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIG------------------------QILQVQITTSHNQYLG--LPS
        + +    I G+++ +    IS L  ADD +++    K+    +   + S  E +G                        +     I T++ +YLG  L  
Subjt:  EAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIG------------------------QILQVQITTSHNQYLG--LPS

Query:  FMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNC--FRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGG
         +        K +K+ + + ++ WK    S  GR  ++K  +     Y  N    + P +   ++   + +F W+   +  RI     K + K K   GG
Subjt:  FMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNC--FRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGG

Query:  LGFRDLEVFNQALLAKQCWRILRE
        +   DL+++ +A++ K  W   R+
Subjt:  LGFRDLEVFNQALLAKQCWRILRE

P14381 Transposon TX1 uncharacterized 149 kDa protein1.7e-4125.27Show/hide
Query:  DAETPWMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGE----RFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRS
        D++   ++GGDFN  L   ++   + +  SE +  RE +    L+D           FT+   + G      RIDR + +  L +    + ++   FS  
Subjt:  DAETPWMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGE----RFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRS

Query:  DHRPILLSLAPLVRMVDAHGSRIYRFEEAWLLDPGFMEVVR---RSWGASPTVGSPRGVAGETGKC-LEVL-RLWGRGRCGRYGDRIREASEEVQRALGR
        +   + +S+AP +       +  + F  + L D GF + VR   R W A     +      + GK  L++L + + +   G+    I   + EV     R
Subjt:  DHRPILLSLAPLVRMVDAHGSRIYRFEEAWLLDPGFMEVVR---RSWGASPTVGSPRGVAGETGKC-LEVL-RLWGRGRCGRYGDRIREASEEVQRALGR

Query:  LSTSGSSADLQVAEARLEAIFLEEEVYWK---QRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSER
        LS S   A       R EA+   E+   +    RSR   L   DR +R+F+     +  R  I  L    G   ++P  I      +++N+F+    S  
Subjt:  LSTSGSSADLQVAEARLEAIFLEEEVYWK---QRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSER

Query:  DIDVVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSE
          + +  G+   V++     L  P   +E+  AL+ +  NK+PG D L+  FF+  W  +GPD  R       +G  P      ++ L+PK  +   +  
Subjt:  DIDVVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSE

Query:  FRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTG--WASLKLDMSKAYDRVEWRYLERIMLKMGFEQGW
        +RP+SL +  YK+VAKA+  R+K +L  +I  +QS  +PGR + DN  L    I  L H   +TG   A L LD  KA+DRV+ +YL   +    F   +
Subjt:  FRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTG--WASLKLDMSKAYDRVEWRYLERIMLKMGFEQGW

Query:  VSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFR-----AKKSEAW
        V  ++   ++      IN      +   R +RQG PLS  L+ L  E    +L       R++GL L      +    +ADD +L  +      +  E  
Subjt:  VSLIERCVSTVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFR-----AKKSEAW

Query:  SVPCTVPSVH----------EGIGQI-------LQVQITTSHNQYLGLPSFMPKNRTS-TLKFVKERVWKQIQGWKG--KLFSVGGREILLKSIVQAIPC
         V     S            EG  ++         +   +   +YLG+     +   S     ++E V  ++  WKG  K+ S+ GR +++  +V +   
Subjt:  SVPCTVPSVH----------EGIGQI-------LQVQITTSHNQYLGLPSFMPKNRTS-TLKFVKERVWKQIQGWKG--KLFSVGGREILLKSIVQAIPC

Query:  YSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLG
        Y + C    ++ I  I   +  F W G       HWVS      P   GG G
Subjt:  YSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLG

P93295 Uncharacterized mitochondrial protein AtMg003105.1e-3848.32Show/hide
Query:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLE
        A+P Y+M+CFR  K L + ++S M  FWWS  E  R+I WV+W+ +CK K   GGLGFRDL  FNQALLAKQ +RI+ +P + L+R+L+ RYFP S  +E
Subjt:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLE

Query:  AEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNL
          VG RPS+ WR+++ G+ELL  G+   +G+G   K++   WI  +T L
Subjt:  AEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.4e-2926.54Show/hide
Query:  GDFN--AILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAP
        GDF+  A    H      + P   L  F+  +    L+D    G  +TW N +     +  ++DR   N    + FP A         SDH P ++ L  
Subjt:  GDFN--AILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFTWCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAP

Query:  LVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCL-EVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQ---V
        L +       + +R+       P F+  +  +W     VGS     GE  K   +  +L  R   G    + +EA + ++    +L T+ S +  +   V
Subjt:  LVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCL-EVLRLWGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQ---V

Query:  AEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTA-GVRR---
        A  +        E +++Q+SR  WL+ GD NTR+FH      + +NLI+ L     V  +   ++  ++  Y+ ++  S      D D++T   V+R   
Subjt:  AEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTA-GVRR---

Query:  ----SVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLC
               D +   L      +EI  A+  +  NKAPGPD  +  FF +SW VV    +         G      N T I LIPKV    ++S FRP+S C
Subjt:  ----SVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSRVSEFRPISLC

Query:  NVVYKLV
         VVYK++
Subjt:  NVVYKLV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.6e-1640.23Show/hide
Query:  LVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIER
        +V R+K ++  LI   Q++FIPGR   DN +   E +++++ ++G  GW  LKLD+ KAYDR+ W YLE  ++  GF + W+  I R
Subjt:  LVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIER

AT4G29090.1 Ribonuclease H-like superfamily protein3.9e-4127.19Show/hide
Query:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEA
        A+P Y+M CF  PK + + I S++A FWW   +E + +HW +W  +   K  GG+GF+D+E FN ALL KQ WR+L  P S +A+V K RYF  S  L A
Subjt:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEA

Query:  EVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPD--ATVAGLMTA---LRGGTKPWWNNTLMPL-----------
         +G RPSF+W+++   +E+L  G R  VGNGE I I+   W+             +PP   A+V+ ++     +    + W  + +  L           
Subjt:  EVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPD--ATVAGLMTA---LRGGTKPWWNNTLMPL-----------

Query:  SHPGTRGFI------------------------FIPKLSS----------GLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKR
          PG R  +                         I K SS           + + + +     KI+ FLW+   N LP    L  R +     C+ C   
Subjt:  SHPGTRGFI------------------------FIPKLSS----------GLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKR

Query:  GESSMHFFWEC-----KWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVV-LWWSMWSTRNKVRLQGAE-RPSGLVEWAKG--MWWLFV
         E+  H  ++C      WA S +     GE  + +      +         + EK  +LV  L W +W  RN++  +G E     ++  A+     W   
Subjt:  GESSMHFFWEC-----KWAKSILREAGFGEILEKVRAGCCLLLCRDIREVIEGEKFEELVV-LWWSMWSTRNKVRLQGAE-RPSGLVEWAKG--MWWLFV

Query:  RLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQV
              G       S  G    P + W K N DA++     R  +G ++R+  G+V
Subjt:  RLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGIIVRDPLGQV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.6e-3948.32Show/hide
Query:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLE
        A+P Y+M+CFR  K L + ++S M  FWWS  E  R+I WV+W+ +CK K   GGLGFRDL  FNQALLAKQ +RI+ +P + L+R+L+ RYFP S  +E
Subjt:  AIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPK-CYGGLGFRDLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLE

Query:  AEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNL
          VG RPS+ WR+++ G+ELL  G+   +G+G   K++   WI  +T L
Subjt:  AEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.4e-1453.62Show/hide
Query:  FNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDS
        F ING   G V PSR LRQGDPLSPYLF+LC E LS +   A+   R+ G++++   P I+HL FADD+
Subjt:  FNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATAGGGATGACGAAAACTAGAACAGAGTTTATGAACTTGAAGATAGGGATGACGATGAACAGACGGTTTGGCCCCTGGGTTTGTCGTTCTTACCGCAGAAAATG
GTCACGGCAACGGTGTGGCCCCTGGGTTATTTCAAATACATGGGACGACGACGACGACGGTGTGGTTGGCCGTCAGAGAGTGAGAGGCGACAGTGAAGAGATGATCGAAA
TTTGGGGAAGAAGGAGAGGGAGGAGAATAGATAGATTTAGGGTTAAGGGAATGCAAAACGACAAGTTTTTAATAGAGACCAAGGTGCAGTCTTCGCGTTTTGAGAGGTTA
AAGTTGAGGTTGGGTTTTGCTTCCTGTTTCAGTGTAGACAGTATTGGAAGAAGTGGGGGTCTAGCTCTGCTATGGAGTTCGGAGTGGCGCTTTACAGGGATCTATGGGTA
TCCGCAGACTGAGGATAAACCGTCGACATGGGCTCTGTTGAAACATCTTCGGGGGGATGCTGAGACTCCGTGGATGGTGGGTGGTGATTTTAATGCGATCTTATATCAAC
ATGAGAAGGAAGGAGGAAGGGCTAAACCAGAATCTGAGTTAAACGGTTTTCGAGAGGCTGTGGATGCATGCTGTTTGATTGATTTTGGGTTCACAGGGGAGAGGTTTACG
TGGTGTAATAGAAAGCCAGGTATGGGAACAGTCTGGGAGAGAATTGACAGATGTTTTGGAAATATGGCGCTAAAAACTCTTTTCCCACACGCTGAGGTGAAACATCTGGA
CTTCAGCCGATCGGATCACCGTCCTATCTTGCTCTCATTAGCGCCGTTGGTTCGAATGGTTGATGCGCATGGGAGCAGAATTTATAGATTTGAGGAGGCCTGGTTGCTGG
ATCCGGGATTTATGGAGGTGGTTAGAAGGAGTTGGGGGGCAAGTCCAACAGTTGGATCGCCGAGGGGAGTGGCAGGGGAGACTGGAAAATGCCTGGAGGTGCTGAGGCTT
TGGGGAAGGGGGAGGTGTGGTAGGTATGGGGATAGAATAAGGGAAGCGTCTGAGGAGGTTCAGAGGGCTTTGGGTCGGTTGAGCACCTCTGGGTCTAGTGCAGACCTGCA
GGTAGCAGAGGCCAGATTGGAGGCGATTTTTCTGGAGGAAGAGGTTTACTGGAAACAACGCTCTAGGGAGAGTTGGCTGAAGTGGGGTGACAGGAACACCCGCTGGTTTC
ATACACAGGCGTCTTTTAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGACAGTGGTGGTGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCGGAGTAC
TTTGAGAACATCTTCACGTCTAGTTGTCCGTCAGAAAGGGATATTGATGTCGTTACAGCAGGGGTGAGGAGATCAGTAACAGATGAGATGAACAGACACCTGATGAGGCC
TTTCCAACAGGAGGAGATTTTCCTTGCTCTGAAGCAGATACATCCTAATAAGGCTCCGGGCCCGGATGAGCTTTCAGGGGCGTTCTTTAGGAAGTCATGGGGGGTGGTGG
GACCGGACGTTGTGAGGTGTTGTCTGAGTATTCTTAATGAGGGTGTATCTCCAGGGCCAGTCAATGAGACCATGATTGTCTTGATTCCGAAAGTCAAAAATCCCAGTAGG
GTCTCGGAGTTTAGGCCTATTTCCCTCTGTAATGTTGTGTATAAGCTAGTTGCCAAAGCACTGGTGAACAGAATGAAAGGAATGCTGAACGCACTAATCTCTCAAAACCA
GAGTGCCTTCATCCCGGGGCGTTGTGTGGTGGACAATGCCATATTGGGTTATGAGTGCATTTATGCATTGAAGCACAGGAGAGGGAAAACTGGGTGGGCTTCACTTAAGC
TTGACATGAGCAAGGCCTACGACCGGGTGGAATGGAGGTATTTGGAACGGATTATGCTGAAAATGGGATTTGAGCAAGGATGGGTCAGTCTGATTGAGCGTTGTGTATCC
ACGGTGCGGTTTTCTTTTAACATAAACGGGTTTAGATGTGGGGATGTGAAGCCTAGTCGGGACCTGCGACAGGGGGACCCGTTATCCCCCTATCTGTTCTTATTATGTGC
GGAGGGGCTGTCCAGTATGCTTTTGGAGGCTGAAGCTACCAATAGGATTTCAGGCCTAAAGCTGGCTAGACGATGCCCGATAATATCTCACCTATTTTTTGCAGATGATA
GTTTGCTTTTCTTCAGGGCTAAGAAGAGCGAGGCCTGGAGTGTCCCATGTACGGTGCCCAGTGTGCATGAAGGCATAGGCCAGATTCTTCAGGTCCAGATTACAACATCC
CATAACCAATACTTGGGGCTCCCTTCGTTCATGCCCAAGAACAGAACAAGCACTTTGAAGTTTGTCAAGGAACGGGTGTGGAAGCAGATTCAGGGGTGGAAGGGGAAACT
ATTCTCTGTAGGAGGCCGTGAAATCCTGCTTAAATCCATTGTGCAGGCTATCCCATGCTACTCCATGAACTGCTTCCGGTTTCCGAAAAAGCTCATTCAGGATATTAGCA
GTATGATGGCACGGTTCTGGTGGAGCGGAGTGGAGGAGGACCGAAGAATACATTGGGTGAGTTGGAAGGGCATGTGTAAACCGAAGTGCTATGGGGGTTTAGGCTTCAGG
GACTTGGAGGTTTTTAATCAGGCCCTCCTGGCAAAGCAGTGTTGGAGAATATTGCGTGAGCCTGCGTCTTTCCTAGCCCGAGTTCTGAAAGGGCGATATTTCCCTTCCTC
TGGTTTTTTGGAGGCAGAAGTGGGGGTCCGTCCATCGTTCATATGGAGAAACCTAATGTGGGGTAAGGAGCTTTTAGAGATGGGTGTTCGTTGGCGAGTTGGGAATGGGG
AACAAATCAAGATCTATGGATCTAACTGGATTCCGAGGGATACTAATTTGAAAGTTAGCTCGCCGATATCCTTGCCTCCAGATGCCACGGTGGCGGGACTGATGACGGCT
TTGAGGGGTGGGACCAAACCCTGGTGGAACAACACTTTAATGCCGCTGAGTCACCCTGGCACGAGGGGCTTCATCTTCATCCCCAAACTCTCTTCAGGATTGGTGGAAGG
GATGTTGGAAGATGACGCTCCAAGTAAGATAAAGGTGTTCCTTTGGAGATTGTGTCTGAACAGGCTGCCGACAATTGACAACCTAATAAGAAGGGGTGTGGACCTGCTAG
ATGTATGCGTTTTCTGCGGGAAGCGGGGGGAATCTAGCATGCATTTTTTTTGGGAATGCAAATGGGCCAAATCGATCCTACGAGAAGCGGGTTTTGGGGAGATCTTGGAG
AAAGTCCGAGCAGGATGTTGTCTACTACTCTGTCGGGACATCAGAGAGGTGATAGAGGGGGAGAAGTTTGAGGAACTGGTGGTGTTATGGTGGTCGATGTGGTCGACCCG
GAATAAAGTTCGTCTTCAAGGGGCTGAGAGGCCGAGTGGGCTAGTGGAATGGGCGAAGGGTATGTGGTGGCTTTTCGTGAGGTTGGGAGGAGTAGTGGGGAAGTGGGGCG
GGAATGGTTGGTCGGGAGAGGGTTGTGTGGCGAGCCCGAGGAATGGCTGGTATAAGGTGAATTTTGATGCCTCTTTCCTTTCTCATTTGTCCAGGGCCGACCTAGGGATA
ATTGTGCGGGATCCTTTGGGCCAAGTAATGCTGTCGGCGACCTTTACTAAGGATAATGTGAGGATGTTGACATGGCTGAAGGATATGCGGCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATAGGGATGACGAAAACTAGAACAGAGTTTATGAACTTGAAGATAGGGATGACGATGAACAGACGGTTTGGCCCCTGGGTTTGTCGTTCTTACCGCAGAAAATG
GTCACGGCAACGGTGTGGCCCCTGGGTTATTTCAAATACATGGGACGACGACGACGACGGTGTGGTTGGCCGTCAGAGAGTGAGAGGCGACAGTGAAGAGATGATCGAAA
TTTGGGGAAGAAGGAGAGGGAGGAGAATAGATAGATTTAGGGTTAAGGGAATGCAAAACGACAAGTTTTTAATAGAGACCAAGGTGCAGTCTTCGCGTTTTGAGAGGTTA
AAGTTGAGGTTGGGTTTTGCTTCCTGTTTCAGTGTAGACAGTATTGGAAGAAGTGGGGGTCTAGCTCTGCTATGGAGTTCGGAGTGGCGCTTTACAGGGATCTATGGGTA
TCCGCAGACTGAGGATAAACCGTCGACATGGGCTCTGTTGAAACATCTTCGGGGGGATGCTGAGACTCCGTGGATGGTGGGTGGTGATTTTAATGCGATCTTATATCAAC
ATGAGAAGGAAGGAGGAAGGGCTAAACCAGAATCTGAGTTAAACGGTTTTCGAGAGGCTGTGGATGCATGCTGTTTGATTGATTTTGGGTTCACAGGGGAGAGGTTTACG
TGGTGTAATAGAAAGCCAGGTATGGGAACAGTCTGGGAGAGAATTGACAGATGTTTTGGAAATATGGCGCTAAAAACTCTTTTCCCACACGCTGAGGTGAAACATCTGGA
CTTCAGCCGATCGGATCACCGTCCTATCTTGCTCTCATTAGCGCCGTTGGTTCGAATGGTTGATGCGCATGGGAGCAGAATTTATAGATTTGAGGAGGCCTGGTTGCTGG
ATCCGGGATTTATGGAGGTGGTTAGAAGGAGTTGGGGGGCAAGTCCAACAGTTGGATCGCCGAGGGGAGTGGCAGGGGAGACTGGAAAATGCCTGGAGGTGCTGAGGCTT
TGGGGAAGGGGGAGGTGTGGTAGGTATGGGGATAGAATAAGGGAAGCGTCTGAGGAGGTTCAGAGGGCTTTGGGTCGGTTGAGCACCTCTGGGTCTAGTGCAGACCTGCA
GGTAGCAGAGGCCAGATTGGAGGCGATTTTTCTGGAGGAAGAGGTTTACTGGAAACAACGCTCTAGGGAGAGTTGGCTGAAGTGGGGTGACAGGAACACCCGCTGGTTTC
ATACACAGGCGTCTTTTAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGACAGTGGTGGTGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCGGAGTAC
TTTGAGAACATCTTCACGTCTAGTTGTCCGTCAGAAAGGGATATTGATGTCGTTACAGCAGGGGTGAGGAGATCAGTAACAGATGAGATGAACAGACACCTGATGAGGCC
TTTCCAACAGGAGGAGATTTTCCTTGCTCTGAAGCAGATACATCCTAATAAGGCTCCGGGCCCGGATGAGCTTTCAGGGGCGTTCTTTAGGAAGTCATGGGGGGTGGTGG
GACCGGACGTTGTGAGGTGTTGTCTGAGTATTCTTAATGAGGGTGTATCTCCAGGGCCAGTCAATGAGACCATGATTGTCTTGATTCCGAAAGTCAAAAATCCCAGTAGG
GTCTCGGAGTTTAGGCCTATTTCCCTCTGTAATGTTGTGTATAAGCTAGTTGCCAAAGCACTGGTGAACAGAATGAAAGGAATGCTGAACGCACTAATCTCTCAAAACCA
GAGTGCCTTCATCCCGGGGCGTTGTGTGGTGGACAATGCCATATTGGGTTATGAGTGCATTTATGCATTGAAGCACAGGAGAGGGAAAACTGGGTGGGCTTCACTTAAGC
TTGACATGAGCAAGGCCTACGACCGGGTGGAATGGAGGTATTTGGAACGGATTATGCTGAAAATGGGATTTGAGCAAGGATGGGTCAGTCTGATTGAGCGTTGTGTATCC
ACGGTGCGGTTTTCTTTTAACATAAACGGGTTTAGATGTGGGGATGTGAAGCCTAGTCGGGACCTGCGACAGGGGGACCCGTTATCCCCCTATCTGTTCTTATTATGTGC
GGAGGGGCTGTCCAGTATGCTTTTGGAGGCTGAAGCTACCAATAGGATTTCAGGCCTAAAGCTGGCTAGACGATGCCCGATAATATCTCACCTATTTTTTGCAGATGATA
GTTTGCTTTTCTTCAGGGCTAAGAAGAGCGAGGCCTGGAGTGTCCCATGTACGGTGCCCAGTGTGCATGAAGGCATAGGCCAGATTCTTCAGGTCCAGATTACAACATCC
CATAACCAATACTTGGGGCTCCCTTCGTTCATGCCCAAGAACAGAACAAGCACTTTGAAGTTTGTCAAGGAACGGGTGTGGAAGCAGATTCAGGGGTGGAAGGGGAAACT
ATTCTCTGTAGGAGGCCGTGAAATCCTGCTTAAATCCATTGTGCAGGCTATCCCATGCTACTCCATGAACTGCTTCCGGTTTCCGAAAAAGCTCATTCAGGATATTAGCA
GTATGATGGCACGGTTCTGGTGGAGCGGAGTGGAGGAGGACCGAAGAATACATTGGGTGAGTTGGAAGGGCATGTGTAAACCGAAGTGCTATGGGGGTTTAGGCTTCAGG
GACTTGGAGGTTTTTAATCAGGCCCTCCTGGCAAAGCAGTGTTGGAGAATATTGCGTGAGCCTGCGTCTTTCCTAGCCCGAGTTCTGAAAGGGCGATATTTCCCTTCCTC
TGGTTTTTTGGAGGCAGAAGTGGGGGTCCGTCCATCGTTCATATGGAGAAACCTAATGTGGGGTAAGGAGCTTTTAGAGATGGGTGTTCGTTGGCGAGTTGGGAATGGGG
AACAAATCAAGATCTATGGATCTAACTGGATTCCGAGGGATACTAATTTGAAAGTTAGCTCGCCGATATCCTTGCCTCCAGATGCCACGGTGGCGGGACTGATGACGGCT
TTGAGGGGTGGGACCAAACCCTGGTGGAACAACACTTTAATGCCGCTGAGTCACCCTGGCACGAGGGGCTTCATCTTCATCCCCAAACTCTCTTCAGGATTGGTGGAAGG
GATGTTGGAAGATGACGCTCCAAGTAAGATAAAGGTGTTCCTTTGGAGATTGTGTCTGAACAGGCTGCCGACAATTGACAACCTAATAAGAAGGGGTGTGGACCTGCTAG
ATGTATGCGTTTTCTGCGGGAAGCGGGGGGAATCTAGCATGCATTTTTTTTGGGAATGCAAATGGGCCAAATCGATCCTACGAGAAGCGGGTTTTGGGGAGATCTTGGAG
AAAGTCCGAGCAGGATGTTGTCTACTACTCTGTCGGGACATCAGAGAGGTGATAGAGGGGGAGAAGTTTGAGGAACTGGTGGTGTTATGGTGGTCGATGTGGTCGACCCG
GAATAAAGTTCGTCTTCAAGGGGCTGAGAGGCCGAGTGGGCTAGTGGAATGGGCGAAGGGTATGTGGTGGCTTTTCGTGAGGTTGGGAGGAGTAGTGGGGAAGTGGGGCG
GGAATGGTTGGTCGGGAGAGGGTTGTGTGGCGAGCCCGAGGAATGGCTGGTATAAGGTGAATTTTGATGCCTCTTTCCTTTCTCATTTGTCCAGGGCCGACCTAGGGATA
ATTGTGCGGGATCCTTTGGGCCAAGTAATGCTGTCGGCGACCTTTACTAAGGATAATGTGAGGATGTTGACATGGCTGAAGGATATGCGGCCGTGA
Protein sequenceShow/hide protein sequence
MKIGMTKTRTEFMNLKIGMTMNRRFGPWVCRSYRRKWSRQRCGPWVISNTWDDDDDGVVGRQRVRGDSEEMIEIWGRRRGRRIDRFRVKGMQNDKFLIETKVQSSRFERL
KLRLGFASCFSVDSIGRSGGLALLWSSEWRFTGIYGYPQTEDKPSTWALLKHLRGDAETPWMVGGDFNAILYQHEKEGGRAKPESELNGFREAVDACCLIDFGFTGERFT
WCNRKPGMGTVWERIDRCFGNMALKTLFPHAEVKHLDFSRSDHRPILLSLAPLVRMVDAHGSRIYRFEEAWLLDPGFMEVVRRSWGASPTVGSPRGVAGETGKCLEVLRL
WGRGRCGRYGDRIREASEEVQRALGRLSTSGSSADLQVAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEY
FENIFTSSCPSERDIDVVTAGVRRSVTDEMNRHLMRPFQQEEIFLALKQIHPNKAPGPDELSGAFFRKSWGVVGPDVVRCCLSILNEGVSPGPVNETMIVLIPKVKNPSR
VSEFRPISLCNVVYKLVAKALVNRMKGMLNALISQNQSAFIPGRCVVDNAILGYECIYALKHRRGKTGWASLKLDMSKAYDRVEWRYLERIMLKMGFEQGWVSLIERCVS
TVRFSFNINGFRCGDVKPSRDLRQGDPLSPYLFLLCAEGLSSMLLEAEATNRISGLKLARRCPIISHLFFADDSLLFFRAKKSEAWSVPCTVPSVHEGIGQILQVQITTS
HNQYLGLPSFMPKNRTSTLKFVKERVWKQIQGWKGKLFSVGGREILLKSIVQAIPCYSMNCFRFPKKLIQDISSMMARFWWSGVEEDRRIHWVSWKGMCKPKCYGGLGFR
DLEVFNQALLAKQCWRILREPASFLARVLKGRYFPSSGFLEAEVGVRPSFIWRNLMWGKELLEMGVRWRVGNGEQIKIYGSNWIPRDTNLKVSSPISLPPDATVAGLMTA
LRGGTKPWWNNTLMPLSHPGTRGFIFIPKLSSGLVEGMLEDDAPSKIKVFLWRLCLNRLPTIDNLIRRGVDLLDVCVFCGKRGESSMHFFWECKWAKSILREAGFGEILE
KVRAGCCLLLCRDIREVIEGEKFEELVVLWWSMWSTRNKVRLQGAERPSGLVEWAKGMWWLFVRLGGVVGKWGGNGWSGEGCVASPRNGWYKVNFDASFLSHLSRADLGI
IVRDPLGQVMLSATFTKDNVRMLTWLKDMRP