; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g014690 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g014690
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:29928368..29933022
RNA-Seq ExpressionLcy06g014690
SyntenyLcy06g014690
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_018805736.2 uncharacterized protein LOC108979499 [Juglans regia]7.9e-7841.43Show/hide
Query:  WIVTGDFNAIIQENEH-------------DEDALNYCALRDLGFIGNRFTWSNRQPGTTFVRKRLDR---------------------------------
        W+  GDFN I+  NE                +AL  C L D+G++GN+FTWSN + GT F ++RLDR                                 
Subjt:  WIVTGDFNAIIQENEH-------------DEDALNYCALRDLGFIGNRFTWSNRQPGTTFVRKRLDR---------------------------------

Query:  ------------------------CDDVQA-------------TDEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWH
                                C   QA              D  L AT   VT E+N  L R +  L+++EAL QM    +PGPDG PA FYQ+ W 
Subjt:  ------------------------CDDVQA-------------TDEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWH

Query:  IFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIK
        + G +V  + L   N       IN T I LIPKK+N   V+DFRPISLCNV YKII+K +AN LK IL +IIS+NQSAFIP RL++DN+IV +E LH + 
Subjt:  IFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIK

Query:  GSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEIS
           + +  Y ++KLDMSKAYDRVEW+FLR  + KMGF  +W+  +   VE+VS+SIL+NGI    F P+RG+R+GDPLSPYLF+MCA+ L S++ K    
Subjt:  GSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEIS

Query:  KKNSGLRIARSALAISHLLF
           SG+  AR  + ISHL F
Subjt:  KKNSGLRIARSALAISHLLF

XP_023923653.1 uncharacterized protein LOC112035046 [Quercus suber]1.3e-8041.9Show/hide
Query:  WIVTGDFNAIIQENEH-----------DE--DALNYCALRDLGFIGNRFTWSNRQPGTTFVRKRLDRC--------------------------------
        W+  GDFNAI+  +E            DE  +AL    L+DLG+ G ++TW+N++PG    R+RLDR                                 
Subjt:  WIVTGDFNAIIQENEH-----------DE--DALNYCALRDLGFIGNRFTWSNRQPGTTFVRKRLDRC--------------------------------

Query:  ------------------------DDVQAT--------------------------DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPD
                                DD +                            +E LQA   +VTEE+   L R     ++ EAL QM PTKAPG D
Subjt:  ------------------------DDVQAT--------------------------DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPD

Query:  GLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADN
        G+ ALFYQK+WHI G  V+S  L     G +  +IN T I+LIPK ++P  ++DFRPISLCNV YKII+KVLANRLK +L  IIS  QSAF+P RLI DN
Subjt:  GLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADN

Query:  IIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAK
        ++V YE LH + G +  K    ++KLD+SKAYDRVEW FLR IM K+GF   WIN +   + + SFS+L+NG       P+RGLRQGDPLSPYLFL+CA+
Subjt:  IIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAK

Query:  GLTSLLIKVEISKKNSGLRIARSALAISHLLF
        G T+LL KVE+ ++  G+ I + A  ISHLLF
Subjt:  GLTSLLIKVEISKKNSGLRIARSALAISHLLF

XP_024041921.1 uncharacterized protein LOC112099052 [Citrus clementina]1.1e-7654.87Show/hide
Query:  TDEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDF
        T   L   + RVT E+N +L+  F   +++EAL QM PTKAPGPDGLPA+FYQK+WH+  Q VISTCL   N+      +N T I LIPKKE P  V DF
Subjt:  TDEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDF

Query:  RPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWIN
        RPISLCNV Y+I+AK +ANRLK ++H IIS  Q+AFIP RLI DNII+GYECLH I+  + +KN   ++KLD+SKAYDR+EW FL   M  +GFS  WIN
Subjt:  RPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWIN

Query:  RIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
         I   V +VSFS+++NG  ++  +P RGLRQG PLSPYLF+MC +  +SLL++ E      GL   +  L ISHLLF
Subjt:  RIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis]3.8e-8056.18Show/hide
Query:  RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCY
        +VT  +N  L + F   ++  AL QMHPTKAPGPDG+P LFYQKYW   G  V    L   N G  P ++N + I LIPKK+NP  V DFRPISLCNV Y
Subjt:  RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCY

Query:  KIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVS
        K+++K +ANRLK +L  +IS +QSAF+P RLI DN++V YE +HF++  R  K  Y SIKLDMSKAYDRVEW FL RIM +MGF + WIN I   V +V 
Subjt:  KIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVS

Query:  FSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        FS++LNG+ T   KPTRGLRQGDPLSPYLFL+C +GL S+L +  ++    G+RI R A  I+HLLF
Subjt:  FSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

XP_042988708.1 uncharacterized protein LOC122316242 [Carya illinoinensis]3.9e-7753.82Show/hide
Query:  EILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRP
        E L     RVTEE+N  L + F   ++++AL QMHPTKAPGPDGLP +FYQKYWH+  +      ++    G LP  IN T I LIPKK+ P  + +FR 
Subjt:  EILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRP

Query:  ISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRI
        ISLCNV YK+I+KVLANRLK IL+ +IS +QSAF+P RLI+DN++V YE +H+++  R  K  Y SIKLD+SKAYDR+EW +L  +M++MGF   WI  +
Subjt:  ISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRI

Query:  GMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
         M V+TVSFS+L+NG       P+RG+RQGDPLSPYLFL+  +GL SLL + E+S+   G+RI R A  I HLLF
Subjt:  GMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

TrEMBL top hitse value%identityAlignment
A0A2N9EX83 Reverse transcriptase domain-containing protein1.2e-8257.61Show/hide
Query:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR
        DEIL      +T ++N  LD  F   ++E AL QM P KAPGPDG+  +FYQKYW+I G  V ++ L C  +G L  +IN T I LIPK +NP  V DFR
Subjt:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR

Query:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR
        PISLCNV YKIIAKVLANRLK IL  IISE+QSAF+P RLI+DNI++ +E LH +K  +  K  Y ++KLDMSKAYDRVEW FL RIM  MGFS SW++ 
Subjt:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR

Query:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        I   V TVS+S+L+NG     F PTRGLRQGDP+SPYLFL+CA+GL +LL K  +SKK  G+ I+R    +SHL F
Subjt:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

A0A2N9F5W1 Reverse transcriptase domain-containing protein6.3e-8156.88Show/hide
Query:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR
        DEIL      +T ++N  LD  F   ++E AL QM P KA G DG+  +FYQKYW+I G  + ++ L C  +G L  +IN T I LIPK +NP  V DFR
Subjt:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR

Query:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR
        PISLCNV YKIIAKVLANRLK IL  IISE+QSAF+P RLI+DNI++ +E LH +K  +  K  Y ++KLDMSKAYDRVEW FL RIM KMGFS SW++ 
Subjt:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR

Query:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        I   V TVS+S+L+NG     F PTRGLRQGDP+SPYLFL+CA+GL +LL K  +SKK  G+ I+R    +SHL F
Subjt:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

A0A2N9I8Z6 Reverse transcriptase domain-containing protein3.5e-7954.71Show/hide
Query:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR
        DEIL+     VT E+N+ LD  F   ++E  LKQM P KAPGPDG+  +FYQ+YWHI G+ + ++ L C  +G L  +IN T + LIPK +N   V DFR
Subjt:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR

Query:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR
        PISLCNV YKIIAKVLANRLK IL  IISE+QSAF+P RLI+DNI++ +E LH ++  +  +  Y ++KLDMSK YDRVEW FL  IM KMGF+ SW++ 
Subjt:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR

Query:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        I   V TVS+S+L+NG     F PTRGLRQGDP+SPY FL+CA+GL +LL+K  +SK   G+ I+R    ++HL F
Subjt:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

A0A2N9J6Y2 Uncharacterized protein3.5e-7954.71Show/hide
Query:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR
        D+IL      VT ++N+ LD  F   ++E A+KQM P KAPGPDG+   FYQKYWHI G  V ++ L C  +G L  +IN T I LIPK +NP  + D+R
Subjt:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR

Query:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR
        PISLCNV YKIIAKVLANRLK IL  IISE+QSAF+P RLI+DNI++ +E LH +K  +  +  Y ++KLDMSKAYDRVEW FL  IM KMGF+ SW++ 
Subjt:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR

Query:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        I   V +VS+S+L+NG     F PTRGLRQGDP+SPYLFL+C +GL +LL +  +SK+  G+ I+R    ++HL F
Subjt:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

A0A7N2LIH6 Uncharacterized protein3.8e-7854.35Show/hide
Query:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR
        D  L+A   RVT E+N  L + F  +++ +AL+QMHPTKAPGPDG+  +FYQKYW I G  V +  L+  N G +P  IN T I LIPK +NP  + +FR
Subjt:  DEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFR

Query:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR
        PISLCNV YKII+KVLANRLK +LH +I E QSAF+P R+I DN+IV +E +H I   R  K    +IKLDMSKAYDRVEW +L  +M KMGF   WI+ 
Subjt:  PISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINR

Query:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        I M V +VSFS+L+NG    +F P+RGLRQGDP+SPYLFL+C +GL++++ K E      G+  AR A  ISHL F
Subjt:  IGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-2528.98Show/hide
Query:  DRCDDVQATDEILQATTL-RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKK
        ++ ++++  D  L   TL R+ +E    L+R     ++   +  +   K+PGPDG  A FYQ+Y       ++        EG LP      +IILIPK 
Subjt:  DRCDDVQATDEILQATTL-RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKK

Query:  -ENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMA
          +     +FRPISL N+  KI+ K+LANR++  +  +I  +Q  FIP      NI      +  I  +R+K   +  I +D  KA+D+++  F+ + + 
Subjt:  -ENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMA

Query:  KMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAIS
        K+G    ++  I    +  + +I+LNG   + F    G RQG PLSP LF +    L  L   +   K+  G+++ +  + +S
Subjt:  KMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAIS

P08548 LINE-1 reverse transcriptase homolog7.3e-2628.93Show/hide
Query:  DDVQATDEILQATTL-RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKK-EN
        ++++  D+ L+A  L R++++    L+R     ++   ++ +   K+PGPDG  + FYQ +       +++       EG LP       I LIPK  ++
Subjt:  DDVQATDEILQATTL-RVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKK-EN

Query:  PCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMG
        P    ++RPISL N+  KI+ K+L NR++  +  II  +Q  FIP      NI      +  I   ++K +   SI  D  KA+D ++  F+ R + K+G
Subjt:  PCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMG

Query:  FSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAIS
           +++  I       + +I+LNG+  K+F    G RQG PLSP LF +    +  L I +   K   G+ I    + +S
Subjt:  FSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAIS

P11369 LINE-1 retrotransposable element ORF2 protein1.0e-2731.78Show/hide
Query:  HLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPK-KENPCMVNDFRPISLCNVCYKIIAKVL
        HL+      ++E  +  +   K+PGPDG  A FYQ +       +     +   EG LP      TI LIPK +++P  + +FRPISL N+  KI+ K+L
Subjt:  HLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPK-KENPCMVNDFRPISLCNVCYKIIAKVL

Query:  ANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNG
        ANR++  + AII  +Q  FIP      NI      +H+I   + K   +  I LD  KA+D+++  F+ +++ + G    ++N I         +I +NG
Subjt:  ANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNG

Query:  IYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHL
           +      G RQG PLSPYLF +    L  L   +   K+  G++I +  + IS L
Subjt:  IYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHL

P14381 Transposon TX1 uncharacterized 149 kDa protein6.0e-2830.15Show/hide
Query:  VTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYK
        V+E     L+    + +L +AL+ M   K+PG DGL   F+Q +W   G            +GELP       + L+PKK +  ++ ++RP+SL +  YK
Subjt:  VTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYK

Query:  IIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSF
        I+AK ++ RLK++L  +I  +QS  +P R I DN+ +  + LHF   +R      A + LD  KA+DRV+  +L   +    F   ++  +     +   
Subjt:  IIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSF

Query:  SILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIK--VEISKKNSGLRIARSALA
         + +N   T      RG+RQG PLS  L+ +  +    LL K    +  K   +R+  SA A
Subjt:  SILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIK--VEISKKNSGLRIARSALA

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM5.3e-1630.54Show/hide
Query:  TKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIP
        + +PGPDG+     ++       ++++  L C   G LP+ I     + IPK        DFRPIS+ +V  + +  +LA RL + ++      Q  F+P
Subjt:  TKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIP

Query:  SRLIADN-IIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSP
        +   ADN  IV     H  K  RS   CY +  LD+SKA+D +    +   +   G    +++ +    E    S+  +G  ++ F P RG++QGDPLSP
Subjt:  SRLIADN-IIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSP

Query:  YLF
         LF
Subjt:  YLF

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.4e-0936.05Show/hide
Query:  DLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKII
        ++  A+  M   KAPGPD   A F+ + W +     I+        G L  + N T I LIPK      ++ FRP+S C V YKII
Subjt:  DLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIPKKENPCMVNDFRPISLCNVCYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.8e-1033.72Show/hide
Query:  LANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRI
        +  RLK ++  +I   Q++FIP R+  DNI+   E +H ++  +  K  +  +KLD+ KAYDR+ W +L   +   GF   W+  I
Subjt:  LANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWINRI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.9e-0942.19Show/hide
Query:  LLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF
        ++NG       P+RGLRQGDPLSPYLF++C + L+ L  + +   +  G+R++ ++  I+HLLF
Subjt:  LLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTTTGTGCTAGGACCAGGTTTATCCAAGATTTGAAGTATCCTGGAGAAACTCCGATCCTTGCCCCTCACCTTATCCTCAAGGGCGGCAATTTTGGCGTCAGCTT
CTTCTCCTCGGCCTCGGTTCCCTGGCCTGTTCCTCTCGGTGACCTCTACGTGTCATCATCTTCTCGTGACTTCTGCGGAACAAAGTTTTCAGATTTACAGGCATGGATAG
TCACTGGAGATTTCAATGCAATCATTCAAGAAAATGAACATGATGAAGATGCCCTTAATTATTGTGCTCTAAGAGATCTTGGTTTCATAGGAAATCGTTTCACCTGGTCT
AATAGGCAACCAGGAACAACTTTTGTTCGCAAACGACTGGACCGATGCGATGATGTGCAGGCAACAGATGAAATCCTACAAGCTACCACATTGAGGGTAACAGAAGAGGT
GAATGTTCATTTAGATCGATCGTTCGACATTTTGGACTTGGAGGAGGCTTTAAAGCAAATGCACCCAACAAAAGCCCCGGGACCAGATGGCTTGCCAGCTCTTTTCTACC
AAAAATATTGGCATATTTTTGGACAAAAGGTTATCAGCACTTGTCTTCGATGTCCTAATGAGGGGGAACTTCCATATCAAATTAACACAACAACGATTATTCTAATCCCA
AAGAAGGAAAACCCATGCATGGTGAATGATTTCCGACCTATTAGCTTATGTAATGTGTGCTATAAAATCATTGCAAAAGTGCTTGCTAATAGATTAAAAACGATTCTTCA
TGCTATAATATCTGAAAATCAAAGTGCTTTCATTCCTAGTAGACTCATAGCTGATAATATTATTGTTGGTTATGAATGTTTGCATTTCATAAAAGGCTCAAGGTCAAAGA
AAAATTGTTATGCATCCATAAAATTGGACATGAGTAAAGCATATGATAGAGTGGAGTGGACTTTTCTTAGAAGAATCATGGCAAAAATGGGTTTTAGCTCGTCGTGGATT
AATAGGATTGGAATGCGTGTTGAAACTGTGTCTTTTTCTATCCTCCTCAATGGGATTTATACTAAAAATTTTAAGCCCACTCGTGGGCTTAGACAGGGAGACCCTTTATC
GCCTTATTTATTTCTGATGTGTGCTAAAGGTTTAACTAGCCTATTAATAAAGGTTGAGATCTCAAAAAAAAATTCTGGTCTTCGAATTGCCCGATCGGCACTAGCTATTT
CTCACCTTCTTTTTTTAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTTTGTGCTAGGACCAGGTTTATCCAAGATTTGAAGTATCCTGGAGAAACTCCGATCCTTGCCCCTCACCTTATCCTCAAGGGCGGCAATTTTGGCGTCAGCTT
CTTCTCCTCGGCCTCGGTTCCCTGGCCTGTTCCTCTCGGTGACCTCTACGTGTCATCATCTTCTCGTGACTTCTGCGGAACAAAGTTTTCAGATTTACAGGCATGGATAG
TCACTGGAGATTTCAATGCAATCATTCAAGAAAATGAACATGATGAAGATGCCCTTAATTATTGTGCTCTAAGAGATCTTGGTTTCATAGGAAATCGTTTCACCTGGTCT
AATAGGCAACCAGGAACAACTTTTGTTCGCAAACGACTGGACCGATGCGATGATGTGCAGGCAACAGATGAAATCCTACAAGCTACCACATTGAGGGTAACAGAAGAGGT
GAATGTTCATTTAGATCGATCGTTCGACATTTTGGACTTGGAGGAGGCTTTAAAGCAAATGCACCCAACAAAAGCCCCGGGACCAGATGGCTTGCCAGCTCTTTTCTACC
AAAAATATTGGCATATTTTTGGACAAAAGGTTATCAGCACTTGTCTTCGATGTCCTAATGAGGGGGAACTTCCATATCAAATTAACACAACAACGATTATTCTAATCCCA
AAGAAGGAAAACCCATGCATGGTGAATGATTTCCGACCTATTAGCTTATGTAATGTGTGCTATAAAATCATTGCAAAAGTGCTTGCTAATAGATTAAAAACGATTCTTCA
TGCTATAATATCTGAAAATCAAAGTGCTTTCATTCCTAGTAGACTCATAGCTGATAATATTATTGTTGGTTATGAATGTTTGCATTTCATAAAAGGCTCAAGGTCAAAGA
AAAATTGTTATGCATCCATAAAATTGGACATGAGTAAAGCATATGATAGAGTGGAGTGGACTTTTCTTAGAAGAATCATGGCAAAAATGGGTTTTAGCTCGTCGTGGATT
AATAGGATTGGAATGCGTGTTGAAACTGTGTCTTTTTCTATCCTCCTCAATGGGATTTATACTAAAAATTTTAAGCCCACTCGTGGGCTTAGACAGGGAGACCCTTTATC
GCCTTATTTATTTCTGATGTGTGCTAAAGGTTTAACTAGCCTATTAATAAAGGTTGAGATCTCAAAAAAAAATTCTGGTCTTCGAATTGCCCGATCGGCACTAGCTATTT
CTCACCTTCTTTTTTTAAGATGA
Protein sequenceShow/hide protein sequence
MEFCARTRFIQDLKYPGETPILAPHLILKGGNFGVSFFSSASVPWPVPLGDLYVSSSSRDFCGTKFSDLQAWIVTGDFNAIIQENEHDEDALNYCALRDLGFIGNRFTWS
NRQPGTTFVRKRLDRCDDVQATDEILQATTLRVTEEVNVHLDRSFDILDLEEALKQMHPTKAPGPDGLPALFYQKYWHIFGQKVISTCLRCPNEGELPYQINTTTIILIP
KKENPCMVNDFRPISLCNVCYKIIAKVLANRLKTILHAIISENQSAFIPSRLIADNIIVGYECLHFIKGSRSKKNCYASIKLDMSKAYDRVEWTFLRRIMAKMGFSSSWI
NRIGMRVETVSFSILLNGIYTKNFKPTRGLRQGDPLSPYLFLMCAKGLTSLLIKVEISKKNSGLRIARSALAISHLLFLR