CuGenDBv2

Lag0041161 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041161
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:13013473..13016013
RNA-Seq Expression: Lag0041161
Synteny: Lag0041161
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CCA66044.1 hypothetical protein [Beta vulgaris subsp. vulgaris]; e value: 2.8e-136; % identity: 35.02
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE------------------------------------------ALLKHLRGDVENP
        ++FL+ETM+  +  E LK +LGFA+ F V S GR+GGL + W  E                                          +L++ L  D+  P
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE------------------------------------------ALLKHLRGDVENP

Query:  WMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLSL
         ++GGDFN I+   EKEGG  +    +                 NG   TW      +  + ER+DR   + +  T++P   V H    + DH  I L  
Subjt:  WMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLSL

Query:  TPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAAE
            R    Q  +   FE +WLLDP   E ++ +W  S    +   + G        +K W   + G  G ++     ++ R   +  +S +       E
Subjt:  TPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAAE

Query:  ARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM
         +L+ +  ++E                         AS R+KRN ++GL D SG   +E  +I  + ++YF +IFTS+  +   ++ V   V   VT+E 
Subjt:  ARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM

Query:  NRRLMRPFLQEEILLALKQIHPNKAPGPDGL---------------------SGGHGRVS----------------------EFRPISLCSVVYKLVSKA
        N  L++PF +EE+ +AL Q+HP KAPGPDG+                     S  HG +S                      EFRPI+LC+VVYKLVSKA
Subjt:  NRRLMRPFLQEEILLALKQIHPNKAPGPDGL---------------------SGGHGRVS----------------------EFRPISLCSVVYKLVSKA

Query:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN
        LV ++K  L  L+S+N+SAF+PGR + DNA++  E  H++K R R + G  ++KLDMSKAYDRVEWG+L +++L MGF+  WV+LI  CVS+V +SF IN
Subjt:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN

Query:  EVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSV
           CG VTP+RGLR GDPLSPYLF+L A+  S                              + SL F +A   E   + EIL  YE+ASGQ +N+DKS 
Subjt:  EVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSV

Query:  ISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISS
        +SFS   + + +E +  ILQ++    H +YLG+PS   ++RT+    + +++WK++ GWK KL S  G+EILLKS++QAIP Y M  ++ P ++IQ I S
Subjt:  ISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISS

Query:  MMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
         MARFWW   +  RRIHW +W  +C  KC+GG+GFRDL +FN ALL +
Subjt:  MMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

CCA66050.1 hypothetical protein [Beta vulgaris subsp. vulgaris]; e value: 1.7e-133; % identity: 34.62
Query:  VFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE--------------------------------------------ALLKHLRGDVEN
        +F++ET V  +  E+ K  LGF+  F V  VGR+GGL + W  E                                            AL+K L  + E 
Subjt:  VFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE--------------------------------------------ALLKHLRGDVEN

Query:  PWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS
        P + GGDFN IL   EKEGG ++    + G R                  TW   R     + ER+DR   + +   LFP+A + H      DH  I+L 
Subjt:  PWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS

Query:  LTPMVRMVDAQGCKICR-----FEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSS-
             R +  +G    R     FE  WLLD    EVV+ +W A+  G     +  + G     ++ W +   G    +I    K++  A G  +TS+ S 
Subjt:  LTPMVRMVDAQGCKICR-----FEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSS-

Query:  -------AELQAAEARLEAI------------------FLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVR
                EL    A+ EA                   +   +AS R+KRNLI G+ D  G  + E  EI  +V  YF+ IFTSS  ++ D   V   V+
Subjt:  -------AELQAAEARLEAI------------------FLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVR

Query:  RTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGLSG-------------------------------------------GHGRVSEFRPISLCSVV
        R+VT E N  L++P+ +EEI  AL  +HP KAPGPDG+                                                 VSEFRPISLC+V+
Subjt:  RTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGLSG-------------------------------------------GHGRVSEFRPISLCSVV

Query:  YKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTV
        YK+ SKA+V ++K  L  + ++N+SAF+PGR + DN+++  E  H +KKR   + G  ++KLDMSKAYDRVEWG+L +++L MGF+  WV+L+  CV+TV
Subjt:  YKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTV

Query:  WFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQT
         +SF IN   CG VTPSRGLRQGDPLSP+LF+L A+  S                              +DSL F +A   E   + +IL  YE ASGQ 
Subjt:  WFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQT

Query:  VNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKT
        +N++KS +SFS   +   +E +  +L ++    H +YLG+P+   +++    + + +++WK++ GWK KL S  G+E+L+K+++QA+P Y M  ++ P  
Subjt:  VNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKT

Query:  LIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        +IQ+I S MARFWW G  + R++HW+SW+ MCKPKC GG+GF+DL +FN ALL K
Subjt:  LIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

CCA66054.1 hypothetical protein [Beta vulgaris subsp. vulgaris]; e value: 5.3e-135; % identity: 34.43
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE------------------------------------------ALLKHLRGDVENP
        ++F++ETM+     E LK  LGF++ F V SVGR+GGL L W  E                                          +LL+HL  D   P
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE------------------------------------------ALLKHLRGDVENP

Query:  WMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLSL
         ++GGDFN IL   EKEGG  +   E+                  G  +TW   R  +  + ER+DR   + +   L+P +  +H    + DH  I+L  
Subjt:  WMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLSL

Query:  TPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAAE
            R       +   FE +WLLD     VV+ SW  S  G    G     G+C+    RW   +      +I  A K +  A     +  +  E    E
Subjt:  TPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAAE

Query:  ARLEAI-------------------------FLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM
         +L+ +                         +   +AS R+KRN ++GL D  G  R+E   I  + + YF +IFTSS  +   ++ V + +   VT+E 
Subjt:  ARLEAI-------------------------FLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM

Query:  NRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-----------------------GGHG--------------------RVSEFRPISLCSVVYKLVSKA
        N +L+ PF ++EIL AL+Q+HP KAPGPDG+                         GH                     + +EFRPI+LC+V+YKL+SKA
Subjt:  NRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-----------------------GGHG--------------------RVSEFRPISLCSVVYKLVSKA

Query:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN
        +V ++K  L  +IS+N+SAF+PGR + DNA++  E  H++K R R + G  ++KLDMSKAYDRVEWG+L +++L MGF+  WV+LI   VS+V +SF IN
Subjt:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN

Query:  EVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSV
           CG V P+RGLRQGDPLSPYLF++ A+  S                              +DSL F +A   E   + +IL  YE ASGQ +N++KS 
Subjt:  EVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSV

Query:  ISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISS
        +S+S   + S ++ +  IL ++    H +YLG+PS   +++ +    + +++WK++ GWK KL S  G+E+LLKS++QAIP Y M  ++FP  +IQ I S
Subjt:  ISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISS

Query:  MMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
         MARFWW   +  R+IHW +W  MC  KC+GG+GF+DL IFN ALL +
Subjt:  MMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

OMO59710.1 reverse transcriptase [Corchorus capsularis]; e value: 8.5e-133; % identity: 35.8
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSEALL------------------------------------KHLRGDV--------E
        +VFL ET +   + + ++ +    +CF V + GRSGGLA+ W+    L                                    +HL  D+        E
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSEALL------------------------------------KHLRGDV--------E

Query:  NPWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILL
          W   GDFN +L+Q EK+GGR +PE+++                  G  FTW         + ER+DR        + FP A + HL  S  DH PILL
Subjt:  NPWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILL

Query:  SLTPMVRMVDAQGCKICR---FEEAWLLDPRFMEVVKRSWG-ASRLGGSSKGV--AGETGKCMATMKRWGRGRCGR----------YGDRIREATK-EVQ
        +     R    Q C  C+   FE  W  +    ++V   W     LG   + V      GK      R  R R              G  +R + + E++
Subjt:  SLTPMVRMVDAQGCKICR---FEEAWLLDPRFMEVVKRSWG-ASRLGGSSKGV--AGETGKCMATMKRWGRGRCGR----------YGDRIREATK-EVQ

Query:  RALGRLSTSVSSAELQ------AAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM
          + RL     S  LQ       +E      F   +AS RRK+N I  L  ++G +  +P EI  + S YF+ +F S  S ++  D +   V  ++T EM
Subjt:  RALGRLSTSVSSAELQ------AAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEM

Query:  NRRLMRPFLQEEILLALKQIHPNKAPGPDG---------------------LSGGHG----------------------RVSEFRPISLCSVVYKLVSKA
        N  L+  F  EEI  ALKQIHP KAPGPDG                     L   HG                       +++FRPISLC+V+YK++SK 
Subjt:  NRRLMRPFLQEEILLALKQIHPNKAPGPDG---------------------LSGGHG----------------------RVSEFRPISLCSVVYKLVSKA

Query:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN
        LVN++K IL + IS+++SAF+PGR + DN ++ +E +H+LK R+ GK G+ +LKLDMSKAYDRVEW +LE IML+MGF++ WV+LI RCV +V FS  +N
Subjt:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNIN

Query:  EVRCGDVT----PSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNF
            GDVT    P  GLRQGDPLSPYLFL+C EGLS                              +DSL F KA   E+ +V++ L  YE  SGQ +NF
Subjt:  EVRCGDVT----PSRGLRQGDPLSPYLFLLCAEGLS------------------------------NDSLFFFKAREGEARRVQEILGSYERASGQTVNF

Query:  DKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQ
        +KSV+ FS     S ++ +  I  V   +   +YLGLP+F+ +N+ S   ++KE++ K+I  W  +  S GGRE+++KS++QAIP Y+MN F FP+ L  
Subjt:  DKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQ

Query:  DISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        DI+ M++RFWW    + R I+W+ W+ +CK K +GG+GFRD+E FNQALLAK
Subjt:  DISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]; e value: 3.1e-135; % identity: 35.49
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE-------------------------------------------ALLKHLRGDVEN
        ++FL ET + S R E  KL+LGF  CF VDSVGRSGGLALLW  +                                            LLK L   V  
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE-------------------------------------------ALLKHLRGDVEN

Query:  PWMVGGDFNAILYQHEKEGGRAKPESELN-----------------GERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS
        PW+V GDFN IL   EK GG  + + ++                  G RFTW NRR     V ER+DR   N     +FP   V H   +  DH P+ L 
Subjt:  PWMVGGDFNAILYQHEKEGGRAKPESELN-----------------GERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS

Query:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWG------------------ASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGD-------RIR
              +V  +  ++ RFE  W+ +     +++R WG                  A+ LG  +K   G   K +AT KR  R +C    D         +
Subjt:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWG------------------ASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGD-------RIR

Query:  EATKEVQRALGR--LSTSVSSAELQAAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVT
        +A  EVQ+ L R  L     S      E    + +   +AS RR++N I  L D+SG+  Q+  ++  L++EYF+ +FT++     D++ V + V   VT
Subjt:  EATKEVQRALGR--LSTSVSSAELQAAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVT

Query:  DEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSEFRPISLCSVVYKLV
         EMN  L++P++ EE+ +ALKQ+HP+KAPGPDG+                                                +V++FRPISLC+V+YK++
Subjt:  DEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSEFRPISLCSVVYKLV

Query:  SKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALK-KRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSF
        SK + N++K +L  +IS ++SAF+PGR + DN ++ YE +H L+ KR+G+ G+ SLKLDMSKAYDRV+W +LE+IM  +GF++  + LI +CV TV FS 
Subjt:  SKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALK-KRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSF

Query:  NINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYERASGQTVNFD
         +N    G + PSRGLRQGDPLSPYLFLLC EGL                              ++DS+ F KA      ++Q +L  YERASGQ +N +
Subjt:  NINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYERASGQTVNFD

Query:  KSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQD
        K+ + FS      ++  I Q+     +  + +YLG P  + +++      +K++VW+++  WKG L S GGRE+L+K++  +IP Y+M+CF FPKTL  +
Subjt:  KSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQD

Query:  ISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        +  MMARFWW       +IHW  W+ +C  K  GG+GFRDL +FN ALLAK
Subjt:  ISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EDY7 Reverse transcriptase domain-containing protein; e value: 1.9e-138; % identity: 37.78
Query:  WNSEALLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEV
        WN   LL+ L      PW   GDFN I+   E  G   +P+ ++ G R                 FTWCN R    T W R+DR   N+     F +A V
Subjt:  WNSEALLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEV

Query:  KHLDFSRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRA
        +HLD    DH  +LL+  P  R V+    K  RFEE W  D    E ++ +W ++  G +   VA +   C   +  W R + G    ++RE  +E   A
Subjt:  KHLDFSRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRA

Query:  LGRLSTSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATR
                S  +L+  ++ +  +  +EE                         AS RR+RN I GL DD GV R+E  E  GL+ ++FE+IF +  S   
Subjt:  LGRLSTSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATR

Query:  DIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDG------------------------LSGGH-------------------GRVSE
        +I+   A V   ++ E+N  L   F  +E+ LALKQ+ P KAPGPDG                        L+ G                     R++E
Subjt:  DIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDG------------------------LSGGH-------------------GRVSE

Query:  FRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWV
        FRPISLC+V YKL+SK + N++KGIL  +IS+ +SAF+PGR + DN ++ +E +H +   + GK G  ++KLDMSKAYDRVEW +LE+IM KMGF   WV
Subjt:  FRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWV

Query:  DLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEIL
         LI  C+STV +S  +N    G + PSRG+RQGDPLSPYLFLLCAEGL                              ++DSL F KA   E   +QEIL
Subjt:  DLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEIL

Query:  GSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFY
          YE+ASGQ VN DK+ + FS  T  ++Q  I   L V I   + +YLGLPS + + R ++   +KE+VW +I GWKGK+ S  GREI++K++ QA+P Y
Subjt:  GSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFY

Query:  SMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        +M+CF+ P  L QDI SM+ +FWWS  ++  +I WV W  +C+PK  GG+GFR++  FN ALL K
Subjt:  SMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

A0A2N9F6L9 Reverse transcriptase domain-containing protein; e value: 3.8e-139; % identity: 37.76
Query:  LLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDF
        L++ L G  +  W   GDFN I+   E  G  ++P+ ++                 NG  FTWCN R    T W R+DR   N+     FP+A V HLD 
Subjt:  LLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDF

Query:  SRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLS
         + DH  + L+  P       +  K  RFEE W+ D    + +K +WG+ + G +   V+ +  +C   +  W R   G  G +I     E+++A     
Subjt:  SRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLS

Query:  TSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVV
           S   LQ    RL ++F +EE                         A+ R +RN I GL D++GV+      +  L   Y+ ++FT+       I+ V
Subjt:  TSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVV

Query:  TARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSEFRPIS
         A+V + VT++MN+ L+R F   E+ +ALKQ+ P KAPGPDG+                                           +    RV+EFRPIS
Subjt:  TARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSEFRPIS

Query:  LCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGR
        LC+V+YKL+SK L N++K IL  ++S ++SAF+PGR + DN ++ +E +H +   + G+ G  +LKLDMSKAYDRVEW +LE+IM K+GF Q W+ L+  
Subjt:  LCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRR-GKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGR

Query:  CVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYER
        C+STV +S  +N    G + PSRGLRQGDPLSPYLFLLCAEGL                              ++DSL F KA      ++Q+IL  YE+
Subjt:  CVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYER

Query:  ASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCF
        ASGQ VN DK+ I FS  T  + Q  I   L+V I   + +YLGLPS + +NR+ +   +KE+VW+++ GWK KL S  GREIL+K++ QAIP YSM+CF
Subjt:  ASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCF

Query:  RFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        R P  L  D+ +M+ RFWWS   E R+IHWVSW+ +C+ K  GGLGFRDL  FN ALLAK
Subjt:  RFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

A0A2N9FCL2 Reverse transcriptase domain-containing protein; e value: 3.6e-137; % identity: 36.47
Query:  WNSEALLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEV
        WN   LL+ L G    PW   GDFN I+   EK G R + E ++ G R                 FTWCN R G+ T W R+DR          F  A V
Subjt:  WNSEALLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEV

Query:  KHLDFSRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRA
         HL+ +  DH PI L+  P+      Q  ++ RFE+ W  DP    V+ R+W     G  +  V+ +  +C   + RW R + G     ++E T++++RA
Subjt:  KHLDFSRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRA

Query:  LGRLSTSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATR
            +  +   ++ +    +  + L+EE                         AS RR+RN I  +  D+  +  E   I    ++Y++ +FT+  +   
Subjt:  LGRLSTSVSSAELQAAEARLEAIFLEEE-------------------------ASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATR

Query:  DIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSE
        D+++V   ++  +T EMN+ LM  F +EE+  A+KQ+ P KAPGPDG+                                           +    RV+E
Subjt:  DIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNKAPGPDGL-------------------------------------------SGGHGRVSE

Query:  FRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHAL-KKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWV
        +RPISLC+V+YKL+SK L N++K IL  +IS+++SAF+PGR + DN ++ +E +H +  KR GK+G  +LKLDMSKAYDRVEWG+L+Q+M+ MGF   W+
Subjt:  FRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHAL-KKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWV

Query:  DLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEIL
         LI  C+STV +S  IN    G + P+RGLRQGDP+SPYLFLLCAEGL                              ++DSL F +A   E +++QEIL
Subjt:  DLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL------------------------------SNDSLFFFKAREGEARRVQEIL

Query:  GSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFY
         +YE+ASGQ +N  K+ + FS  T   +Q  +  IL V     + +YLGLPS + K +      +KE+VW ++ GWK KL S  GRE+L+K++VQAIP Y
Subjt:  GSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFY

Query:  SMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        +MNCF+ P TL ++I  ++ RFWW   ++ R+IHW+ W+ +C  K  GGLGFRDL+ FN ALLAK
Subjt:  SMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

A0A2N9H1U1 Reverse transcriptase domain-containing protein; e value: 6.1e-137; % identity: 35.53
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE-------------------------------------------ALLKHLRGDVEN
        ++F+ ET +  +R E L+ KL F+S   V    + GGL L W  E                                           ALL+ L      
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSE-------------------------------------------ALLKHLRGDVEN

Query:  PWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS
        PW+  GDFN +L   EK+GG  +   ++                  G  FTWCN R  +GTVWER+DR    +     FP+A + HL  +  DH+PI L 
Subjt:  PWMVGGDFNAILYQHEKEGGRAKPESEL-----------------NGERFTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS

Query:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAE----
          P      A+  +  RFEE WL +P   E V  +W   + G     V  +   C   +++W R + G    +++  T +++ A       +S +     
Subjt:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAE----

Query:  ------LQAAEARL---------------EAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDE
              L + E R+                  F  + AS RR+RNLI  L D+ GV       I  L  EYF+++F +S     D + V   +   VT +
Subjt:  ------LQAAEARL---------------EAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDE

Query:  MNRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-------------------------------------------GGHGRVSEFRPISLCSVVYKLVSK
        MN RL +PF ++E+  A+KQ+ P KAPGPDG+S                                               +V++FRPISLC+V+YK++SK
Subjt:  MNRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-------------------------------------------GGHGRVSEFRPISLCSVVYKLVSK

Query:  ALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNI
         L N++K IL  +IS+ +SAF+PGR + DN ++ +E +H +K R  GK G+ +LKLDMSKAYDRVEW +L+ +MLKMGF   WV L+  C+S+V +S  I
Subjt:  ALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKR-RGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNI

Query:  NEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQ
        N    G + P+RGLRQGDPLSPYLFLLCAEGL   + +F    E   R        YE  SGQ +N DK+ + FS  T    Q  I Q+L V +   + +
Subjt:  NEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQISASHNQ

Query:  YLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKC
        YLGLPSF+ ++R  +   +KE++W+++ GWK K+ S  GREIL+K++VQAIP +SM+CF+ P +L QDI  ++ +FWW      R+IHWV+W  +C  K 
Subjt:  YLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKC

Query:  YGGLGFRDLEIFNQALLAK
         GG+GFRD+  FN ALLAK
Subjt:  YGGLGFRDLEIFNQALLAK

A0A2N9I9F4 Reverse transcriptase domain-containing protein; e value: 4.2e-138; % identity: 34.83
Query:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLW-------------------------------------------NSEALLKHLRGDVEN
        ++FL+ET     R E L+ +  F + F V  + + GGL L W                                           NS  LL+ + G    
Subjt:  MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLW-------------------------------------------NSEALLKHLRGDVEN

Query:  PWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS
        PW   GDFN I+   EK G R + E ++ G R                 FTWCN R G  T W R+DR          F QA V HL+ S  DH P+ L+
Subjt:  PWMVGGDFNAILYQHEKEGGRAKPESELNGER-----------------FTWCNRRPGTGTVWERIDRCFGNVALQTLFPQAEVKHLDFSRLDHHPILLS

Query:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAA
          P +  +     +  RFEE W  DP   + V  +W     G     V  +  +C   +++W R   G    +++E T+++++A            + + 
Subjt:  LTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKEVQRALGRLSTSVSSAELQAA

Query:  EARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNK
           ++ + ++EE  +  +R+    L D       EP  I      Y++ +FT+  +   D++ +   +   VTDEMN RL  PF   EI  A+KQ+   K
Subjt:  EARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEMNRRLMRPFLQEEILLALKQIHPNK

Query:  APGPDGL-------------------------------------------SGGHGRVSEFRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGR
        APGPDG+                                           +    RV+E+RPISLC+V+YKL+SK L N++K +L  +IS  +SAF+PGR
Subjt:  APGPDGL-------------------------------------------SGGHGRVSEFRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGR

Query:  CVVDNAILGYECIHAL-KKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLF
         + DN ++ +E +H +  +R GK+G  +LKLDMSKAYDRVEWG+L+Q+M +MGF + W  +I  C+STV +S  IN    G +TP+RGLRQGDP+SPYLF
Subjt:  CVVDNAILGYECIHAL-KKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLF

Query:  LLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQIS
        LLCAEGL                              ++DSL F +A   E +++Q IL  YE+ASGQ +N +K+ + FS  T  ++QE +  IL V   
Subjt:  LLCAEGL------------------------------SNDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSVISFSPCTTPSVQEGIGQILQVQIS

Query:  ASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGM
          + +YLGLPS + K + +    +KE+VW +I GWK KL S  GREIL+K++VQAIP Y+MNCF+ P TL ++I  ++ RFWW    + R+IHW+ W+ +
Subjt:  ASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGM

Query:  CKPKCYGGLGFRDLEIFNQALLAK
        C+PK  GGLGFR+L+ FN ALLAK
Subjt:  CKPKCYGGLGFRDLEIFNQALLAK

SwissProt top hits (e value, % identity, alignment)
P14381 Transposon TX1 uncharacterized 149 kDa protein; e value: 4.1e-21; % identity: 23.01
Query:  REATKEVQRALGRLSTSVSSAELQAAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTD
        +EA + +++   R +   S  +L     R    F   E   +  R  I  L  + G   ++P  I      +++N+F+    +    + +   +   V++
Subjt:  REATKEVQRALGRLSTSVSSAELQAAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTD

Query:  EMNRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-------------------------------------------GGHGRVSEFRPISLCSVVYKLVS
            RL  P   +E+  AL+ +  NK+PG DGL+                                           G    +  +RP+SL S  YK+V+
Subjt:  EMNRRLMRPFLQEEILLALKQIHPNKAPGPDGLS-------------------------------------------GGHGRVSEFRPISLCSVVYKLVS

Query:  KALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNI
        KA+  ++K +L  +I  ++S  +PGR + DN  L  + +H    RR  +  A L LD  KA+DRV+  YL   +    F   +V  +    ++      I
Subjt:  KALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNI

Query:  NEVRCGDVTPSRGLRQGDPLSPYLFLLCAEG---LSNDSLFFFKAREGEAR---------------------RVQEILGSYERASGQTVNFDKS------
        N      +   RG+RQG PLS  L+ L  E    L    L     +E + R                     R QE    Y  AS   +N+ KS      
Subjt:  NEVRCGDVTPSRGLRQGDPLSPYLFLLCAEG---LSNDSLFFFKAREGEAR---------------------RVQEILGSYERASGQTVNFDKS------

Query:  --VISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKG--KLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLI
           + F P     +      I  + +  S  +Y    +F+          ++E V  ++  WKG  K+ S+ GR +++  +V +  +Y + C    +  I
Subjt:  --VISFSPCTTPSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKG--KLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLI

Query:  QDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLG
          I   +  F W G       HWVS      P   GG G
Subjt:  QDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLG

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM; e value: 3.6e-09; % identity: 29.61
Query:  RVSEFRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNA-ILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFE
        R  +FRPIS+ SV+ + ++  L  ++   +N      +  F+P     DNA I+     H+ K  R         LD+SKA+D +    +   +   G  
Subjt:  RVSEFRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNA-ILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFE

Query:  QGWVDLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL
        +G+VD +         S N +     +  P+RG++QGDPLSP LF L  + L
Subjt:  QGWVDLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGL

P92555 Uncharacterized mitochondrial protein AtMg01250; e value: 3.4e-07; % identity: 51.52
Query:  IGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQE
        +G  V   +  F IN    G VTPSRGLRQGDPLSPYLF+LC E LS           G  RR QE
Subjt:  IGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQE

P93295 Uncharacterized mitochondrial protein AtMg00310; e value: 1.4e-16; % identity: 56.34
Query:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPK-CYGGLGFRDLEIFNQALLAK
        A+P Y+M+CFR  K L + ++S M  FWWS  E  R+I WV+W+ +CK K   GGLGFRDL  FNQALLAK
Subjt:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPK-CYGGLGFRDLEIFNQALLAK

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment); e value: 4.0e-08; % identity: 25
Query:  RPFLQEEILLALKQIHPNKAPGPDGLS--------------------------------------GGHGRVSEFRPISLCSVVYKLVSKALVNKMKGILN
        RP  +EEI  A+K   P+ APG DGL+                                      G     S +RPI++ S + +L+ + L  +++  + 
Subjt:  RPFLQEEILLALKQIHPNKAPGPDGLS--------------------------------------GGHGRVSEFRPISLCSVVYKLVSKALVNKMKGILN

Query:  MLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNINE-VRCGDVTPS
        +  +Q   A I G  V  N++L    I + +++R      S  LD+ KA+D V    + + + ++G ++G  + I   +S    +  +    +   +   
Subjt:  MLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGRCVSTVWFSFNINE-VRCGDVTPS

Query:  RGLRQGDPLSPYLF------LLCA
        RG++QGDPLSP+LF      LLC+
Subjt:  RGLRQGDPLSPYLF------LLCA

Arabidopsis top hits | e value | % identity | Alignment
AT4G20520.1 | RNA binding;RNA-directed DNA polymerases | 2.4e-16 | 39.08
Query:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGR
        +V ++K ++  LI   +++FIPGR   DN +   E +H++++++G  GW  LKLD+ KAYDR+ W YLE  ++  GF + W+  I R
Subjt:  LVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGYLEQIMLKMGFEQGWVDLIGR

AT4G29090.1 | Ribonuclease H-like superfamily protein | 2.0e-15 | 47.14
Query:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK
        A+P Y+M CF  PKT+ + I S++A FWW   +EA+ +HW +W  +   K  GG+GF+D+E FN ALL K
Subjt:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK

ATMG00310.1 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 9.7e-18 | 56.34
Query:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPK-CYGGLGFRDLEIFNQALLAK
        A+P Y+M+CFR  K L + ++S M  FWWS  E  R+I WV+W+ +CK K   GGLGFRDL  FNQALLAK
Subjt:  AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPK-CYGGLGFRDLEIFNQALLAK

ATMG01250.1 | RNA-directed DNA polymerase (reverse transcriptase) | 2.4e-08 | 51.52
Query:  IGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQE
        +G  V   +  F IN    G VTPSRGLRQGDPLSPYLF+LC E LS           G  RR QE
Subjt:  IGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQE
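
The % identity column can be checked against the alignment rows. The sketch below assumes the usual BLAST-style convention: in the unlabelled middle row, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and identity is the number of letters divided by the aligned length. The function name `percent_identity` is hypothetical and not part of this page. The example strings are the Query row and middle row of the AT4G29090.1 hit.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Return identical positions / alignment length, as a percentage.

    query_aln: the aligned query row (gap characters count toward the length).
    midline:   the match row; only letters are counted as identities.
    """
    aln_len = len(query_aln)
    if aln_len == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aln_len


# Query row and match row of the AT4G29090.1 alignment
query = "AIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHWVSWKGMCKPKCYGGLGFRDLEIFNQALLAK"
midline = "A+P Y+M CF  PKT+ + I S++A FWW   +EA+ +HW +W  +   K  GG+GF+D+E FN ALL K"

print(f"{percent_identity(query, midline):.2f}")  # prints 47.14
```

Here 33 identical residues over 70 aligned positions reproduce the 47.14 listed for that hit. For alignments that span several Query blocks, the rows would need to be concatenated before calling the function.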


Sequences
CDS sequence
ATGGTTTTCCTTACAGAGACCATGGTGCAGTCTTCACGCTTTGAGAGGTTAAAATTGAAGTTGGGTTTTGCTTCCTGCTTCAGTGTGGACAGTGTTGGAAGAAGTGGTGG
CCTAGCTCTGCTATGGAATTCGGAGGCACTGTTGAAACATCTTCGAGGGGATGTCGAGAACCCGTGGATGGTTGGCGGTGACTTTAATGCGATCTTGTACCAGCATGAGA
AGGAAGGAGGAAGGGCTAAACCAGAATCTGAGCTAAACGGGGAGAGGTTTACATGGTGTAATAGAAGACCGGGTACAGGAACTGTCTGGGAGAGAATCGACAGATGCTTT
GGAAACGTGGCGCTACAAACGCTTTTCCCACAGGCTGAGGTGAAACATCTAGACTTTAGCCGTTTAGATCATCACCCTATTTTGTTATCATTAACGCCGATGGTTCGAAT
GGTTGATGCTCAGGGGTGCAAAATTTGTAGATTTGAGGAGGCCTGGTTGCTAGACCCGAGATTTATGGAGGTGGTTAAGAGAAGTTGGGGGGCAAGTCGGTTAGGTGGAT
CATCGAAGGGAGTGGCAGGGGAGACTGGGAAATGCATGGCGACAATGAAACGTTGGGGAAGGGGGAGGTGTGGTAGGTATGGGGATAGAATAAGGGAAGCTACTAAGGAA
GTTCAGAGGGCGTTGGGTAGATTGAGCACTTCAGTGTCTAGCGCAGAATTACAGGCAGCAGAGGCCAGATTGGAGGCGATTTTCCTGGAGGAAGAGGCTTCATTTAGGAG
GAAGAGGAATCTAATTCGGGGTTTGGTAGATGATAGTGGCGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCAGAGTACTTCGAGAACATTTTTACGTCTA
GTTGTTCGGCAACCAGGGATATTGATGTCGTTACAGCAAGAGTGAGAAGAACAGTAACGGATGAGATGAACAGACGACTGATGAGACCTTTCCTCCAGGAGGAGATCCTC
CTTGCTTTGAAGCAAATACATCCTAATAAAGCTCCGGGCCCGGATGGGCTTTCAGGAGGTCATGGGAGAGTCTCAGAGTTCAGACCCATATCTCTTTGCAGTGTGGTGTA
CAAACTAGTTTCCAAAGCACTAGTGAACAAAATGAAAGGAATCCTGAACATGCTAATCTCCCAAAACCGGAGTGCCTTTATTCCGGGACGATGTGTGGTGGATAACGCCA
TACTGGGGTATGAATGCATCCATGCCTTGAAGAAAAGGAGGGGGAAAATTGGGTGGGCCTCGCTCAAGCTTGACATGAGTAAGGCCTACGATCGAGTGGAATGGGGGTAT
TTGGAACAGATTATGTTGAAAATGGGATTTGAGCAGGGATGGGTTGATCTGATTGGGCGTTGTGTGTCCACGGTGTGGTTTTCTTTTAACATAAACGAGGTTAGATGTGG
TGATGTGACGCCTAGTCGGGGCCTGCGGCAGGGGGACCCTCTATCCCCCTATCTTTTCCTGCTGTGTGCGGAGGGGTTGTCTAACGACAGCTTGTTTTTCTTCAAGGCTA
GGGAGGGAGAGGCCCGGAGAGTGCAGGAGATCCTGGGAAGTTACGAACGGGCGTCCGGACAGACCGTCAATTTTGACAAATCTGTCATTTCTTTTAGCCCATGTACGACA
CCCAGTGTGCAGGAGGGCATAGGCCAGATCCTTCAGGTACAGATTTCAGCATCCCATAACCAATATTTGGGGCTTCCGTCGTTCATGCCCAAGAACAGAACAAGCACTTT
GAAGTTTGTGAAGGAACAGGTGTGGAAGCAGATCCATGGGTGGAAGGGAAAATTATTCTCCATCGGAGGTCGAGAGATCCTACTTAAATCCATTGTGCAGGCTATTCCAT
TCTACTCCATGAACTGCTTTCGATTCCCGAAAACGCTCATCCAGGATATTAGCAGTATGATGGCACGGTTCTGGTGGAGTGGAGTGGAGGAGGCCCGGAGAATACATTGG
GTGAGTTGGAAAGGTATGTGTAAACCGAAGTGCTATGGGGGTTTAGGATTCAGGGACCTGGAGATCTTTAATCAAGCCCTCCTTGCGAAATAG
mRNA sequence
ATGGTTTTCCTTACAGAGACCATGGTGCAGTCTTCACGCTTTGAGAGGTTAAAATTGAAGTTGGGTTTTGCTTCCTGCTTCAGTGTGGACAGTGTTGGAAGAAGTGGTGG
CCTAGCTCTGCTATGGAATTCGGAGGCACTGTTGAAACATCTTCGAGGGGATGTCGAGAACCCGTGGATGGTTGGCGGTGACTTTAATGCGATCTTGTACCAGCATGAGA
AGGAAGGAGGAAGGGCTAAACCAGAATCTGAGCTAAACGGGGAGAGGTTTACATGGTGTAATAGAAGACCGGGTACAGGAACTGTCTGGGAGAGAATCGACAGATGCTTT
GGAAACGTGGCGCTACAAACGCTTTTCCCACAGGCTGAGGTGAAACATCTAGACTTTAGCCGTTTAGATCATCACCCTATTTTGTTATCATTAACGCCGATGGTTCGAAT
GGTTGATGCTCAGGGGTGCAAAATTTGTAGATTTGAGGAGGCCTGGTTGCTAGACCCGAGATTTATGGAGGTGGTTAAGAGAAGTTGGGGGGCAAGTCGGTTAGGTGGAT
CATCGAAGGGAGTGGCAGGGGAGACTGGGAAATGCATGGCGACAATGAAACGTTGGGGAAGGGGGAGGTGTGGTAGGTATGGGGATAGAATAAGGGAAGCTACTAAGGAA
GTTCAGAGGGCGTTGGGTAGATTGAGCACTTCAGTGTCTAGCGCAGAATTACAGGCAGCAGAGGCCAGATTGGAGGCGATTTTCCTGGAGGAAGAGGCTTCATTTAGGAG
GAAGAGGAATCTAATTCGGGGTTTGGTAGATGATAGTGGCGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCAGAGTACTTCGAGAACATTTTTACGTCTA
GTTGTTCGGCAACCAGGGATATTGATGTCGTTACAGCAAGAGTGAGAAGAACAGTAACGGATGAGATGAACAGACGACTGATGAGACCTTTCCTCCAGGAGGAGATCCTC
CTTGCTTTGAAGCAAATACATCCTAATAAAGCTCCGGGCCCGGATGGGCTTTCAGGAGGTCATGGGAGAGTCTCAGAGTTCAGACCCATATCTCTTTGCAGTGTGGTGTA
CAAACTAGTTTCCAAAGCACTAGTGAACAAAATGAAAGGAATCCTGAACATGCTAATCTCCCAAAACCGGAGTGCCTTTATTCCGGGACGATGTGTGGTGGATAACGCCA
TACTGGGGTATGAATGCATCCATGCCTTGAAGAAAAGGAGGGGGAAAATTGGGTGGGCCTCGCTCAAGCTTGACATGAGTAAGGCCTACGATCGAGTGGAATGGGGGTAT
TTGGAACAGATTATGTTGAAAATGGGATTTGAGCAGGGATGGGTTGATCTGATTGGGCGTTGTGTGTCCACGGTGTGGTTTTCTTTTAACATAAACGAGGTTAGATGTGG
TGATGTGACGCCTAGTCGGGGCCTGCGGCAGGGGGACCCTCTATCCCCCTATCTTTTCCTGCTGTGTGCGGAGGGGTTGTCTAACGACAGCTTGTTTTTCTTCAAGGCTA
GGGAGGGAGAGGCCCGGAGAGTGCAGGAGATCCTGGGAAGTTACGAACGGGCGTCCGGACAGACCGTCAATTTTGACAAATCTGTCATTTCTTTTAGCCCATGTACGACA
CCCAGTGTGCAGGAGGGCATAGGCCAGATCCTTCAGGTACAGATTTCAGCATCCCATAACCAATATTTGGGGCTTCCGTCGTTCATGCCCAAGAACAGAACAAGCACTTT
GAAGTTTGTGAAGGAACAGGTGTGGAAGCAGATCCATGGGTGGAAGGGAAAATTATTCTCCATCGGAGGTCGAGAGATCCTACTTAAATCCATTGTGCAGGCTATTCCAT
TCTACTCCATGAACTGCTTTCGATTCCCGAAAACGCTCATCCAGGATATTAGCAGTATGATGGCACGGTTCTGGTGGAGTGGAGTGGAGGAGGCCCGGAGAATACATTGG
GTGAGTTGGAAAGGTATGTGTAAACCGAAGTGCTATGGGGGTTTAGGATTCAGGGACCTGGAGATCTTTAATCAAGCCCTCCTTGCGAAATAG
Protein sequence
MVFLTETMVQSSRFERLKLKLGFASCFSVDSVGRSGGLALLWNSEALLKHLRGDVENPWMVGGDFNAILYQHEKEGGRAKPESELNGERFTWCNRRPGTGTVWERIDRCF
GNVALQTLFPQAEVKHLDFSRLDHHPILLSLTPMVRMVDAQGCKICRFEEAWLLDPRFMEVVKRSWGASRLGGSSKGVAGETGKCMATMKRWGRGRCGRYGDRIREATKE
VQRALGRLSTSVSSAELQAAEARLEAIFLEEEASFRRKRNLIRGLVDDSGVMRQEPGEIVGLVSEYFENIFTSSCSATRDIDVVTARVRRTVTDEMNRRLMRPFLQEEIL
LALKQIHPNKAPGPDGLSGGHGRVSEFRPISLCSVVYKLVSKALVNKMKGILNMLISQNRSAFIPGRCVVDNAILGYECIHALKKRRGKIGWASLKLDMSKAYDRVEWGY
LEQIMLKMGFEQGWVDLIGRCVSTVWFSFNINEVRCGDVTPSRGLRQGDPLSPYLFLLCAEGLSNDSLFFFKAREGEARRVQEILGSYERASGQTVNFDKSVISFSPCTT
PSVQEGIGQILQVQISASHNQYLGLPSFMPKNRTSTLKFVKEQVWKQIHGWKGKLFSIGGREILLKSIVQAIPFYSMNCFRFPKTLIQDISSMMARFWWSGVEEARRIHW
VSWKGMCKPKCYGGLGFRDLEIFNQALLAK
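
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check. It assumes the standard genetic code (the page does not state which translation table was used), and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    outside T/C/A/G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above
print(translate("ATGGTTTTCCTTACAGAGACCATGGTGCAG"))  # prints MVFLTETMVQ
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two records agree.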