; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022908 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022908
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:40644772..40646372
RNA-Seq ExpressionLag0022908
SyntenyLag0022908
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO90013.1 reverse transcriptase [Corchorus capsularis]3.0e-5436.83Show/hide
Query:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF
        +VLIPKV NP  V D RPISLCNVI+K++SK + N++K IL  ++ +N SA++PGR + D +++ +E +H +K K RG     +LKLD+ KAYDRVEW+F
Subjt:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF

Query:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTSPTGQIVLEDSQSA
        LE IM +MGF+  WV LV +C+ +VS+S  VNGV+          G R G       + + M        HA+ +G    +   R +P    +     S 
Subjt:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTSPTGQIVLEDSQSA

Query:  SFL---LGQCPK-KAVLSI--------LGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWI-------PCDTN
         FL   LG+C   K +L I        + +   + R FR            PSF+WRSLM G++++  G +WRIGNG  V V+   WI       P   +
Subjt:  SFL---LGQCPK-KAVLSI--------LGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWI-------PCDTN

Query:  LRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
          I SP        V  L    GHW    + + F P++   IL IPL      D +IW    +G Y+V+SGY
Subjt:  LRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]1.4e-6438.98Show/hide
Query:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF
        IVLIPK   P RVT+YRPISLCNVIY LV+KA+ N++K  L+ +IS   SA++P R + DN I+GYEC+H ++     +    +LKLD+ KAYDRVEW F
Subjt:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF

Query:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCH------AQYLGLSSFMPRCRTSPTGQIVL
        L+ ++ ++GF+  W+ L+  CI++ +FS  +NG   G    +   G R G   LS   ++  +E  K   H        Y  +   +     S   Q +L
Subjt:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCH------AQYLGLSSFMPRCRTSPTGQIVL

Query:  EDSQSASFLLGQCPKKAVLSILGVLGYSGREF-------RPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDL
               + L Q P+  V  ++    Y   +F        PSFIWRS++WG+++L+KG RWRIGNGE++ +  S+WIP  T  ++    +L   A+V +L
Subjt:  EDSQSASFLLGQCPKKAVLSILGVLGYSGREF-------RPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDL

Query:  FTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
           +  WN ++I Q F+  +A  I SI L +   ED +IWH+++ GLYSVKSGY
Subjt:  FTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

XP_024041921.1 uncharacterized protein LOC112099052 [Citrus clementina]2.3e-5431.95Show/hide
Query:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF
        I LIPK + P +VTD+RPISLCNVIY++V+KA+ N++K +++ +IS   +A+IP R + DN I+GYEC+H ++     + G  +LKLD+ KAYDR+EW F
Subjt:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF

Query:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVN-----------GVRCGA-TTTYCF--------------------SGQRWGK-LALSRTYYVS----MS
        LE  M  +GF++ W+ L+ RC+SSVSFS  +N           G+R G   + Y F                     G  +GK L +S   +       +
Subjt:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVN-----------GVRCGA-TTTYCF--------------------SGQRWGK-LALSRTYYVS----MS

Query:  EPQKAPC--------------------------------------------------HAQYLGLSSFMPRCRTSPTGQIVLE------DSQSASFLLGQC
            A C                                                  H +YLGL S + R R++    I L+        Q   F  G  
Subjt:  EPQKAPC--------------------------------------------------HAQYLGLSSFMPRCRTSPTGQIVLE------DSQSASFLLGQC

Query:  P---KKAVLSI-----------LGVLG---------YSGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARV
            K A L+I           LG+            +G   +PSFIWRS++WG++++ KG RWR+GNG+ + ++ S+W+P  +  +  +   L  +A V
Subjt:  P---KKAVLSI-----------LGVLG---------YSGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARV

Query:  RDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
         +L      WN  LI   F   +A  I+ IPL +    D VIWH+ K G +SVKSGY
Subjt:  RDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]4.2e-5631.34Show/hide
Query:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF
        I LIPK+ NP +V+DYRPISLCNVIY++V+KA+ N+MK IL+ +IS   SA+IP R + DN I+GYEC+H ++     + G  +LKLD+ KAYDRVEW F
Subjt:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF

Query:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCG--------------------------------ATTTYCFSGQRWGK-------------LALS
        LEQ MLKMGF    V L+ RC++S SFS  +NGV  G                                A       G ++ +             L  S
Subjt:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCG--------------------------------ATTTYCFSGQRWGK-------------LALS

Query:  RTYYVSMS----EPQKAPC-----------HAQYLGLSSFMPRCR----------------------TSPTGQIVL-----------------------E
        R      S    E Q+A             H +YLGL S + R +                       S  G+ VL                       +
Subjt:  RTYYVSMS----EPQKAPC-----------HAQYLGLSSFMPRCR----------------------TSPTGQIVL-----------------------E

Query:  DSQSA------------------------------------------------SFLLGQCPKKAVLSILGVLGYSGREF-------RPSFIWRSLMWGKK
        D Q A                                                ++ L Q P   V  +L    +    F         S+IWRS+MWG++
Subjt:  DSQSA------------------------------------------------SFLLGQCPKKAVLSILGVLGYSGREF-------RPSFIWRSLMWGKK

Query:  LLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSG
        +++KG+RWRIGNG++++++  +W+P     R   P++L  ++ V DL  A   W+   +RQHF   +   IL IPL     ED V+WH++K G YSVKSG
Subjt:  LLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSG

Query:  Y
        Y
Subjt:  Y

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]1.4e-5632.66Show/hide
Query:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV
        +I LIPKVK P  +  YRPISLCNV+YKLVSKA+V ++K  L  +IS+  SA+I  R + DN ++ +E +H+LKG+ RG  G+ ++KLDM KA+DRVEW 
Subjt:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV

Query:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTY----------------CFSG-----QRWGKLALSRTYYVSMSEP--------QKAPC
        F+ Q+MLK GF    V+L+ RC+++VS+SF +NG   G  T Y                C  G     Q    +   +   +S + P          +PC
Subjt:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTY----------------CFSG-----QRWGKLALSRTYYVSMSEP--------QKAPC

Query:  HAQYLGLSSFMPRCRTSPTG-----------------------QIVLEDSQSASFLLGQCPK-KAV----------LSILGVLGY---------------
        H QYLGL SF  R +    G                       +++L+      FL+G     K++            + G LG+               
Subjt:  HAQYLGLSSFMPRCRTSPTG-----------------------QIVLEDSQSASFLLGQCPK-KAV----------LSILGVLGY---------------

Query:  ---------------------------SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASGHW
                                   SG    PS  WRSL WGK+LL KG+RWR+G+G  ++    +W+P +T  +        PN  V DL +    W
Subjt:  ---------------------------SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASGHW

Query:  NGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
        +   ++ +FS  +   ILSIPL     +D +IW    +G+Y+VKSGY
Subjt:  NGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

TrEMBL top hitse value%identityAlignment
A0A1R3J5A5 Reverse transcriptase1.5e-5436.83Show/hide
Query:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF
        +VLIPKV NP  V D RPISLCNVI+K++SK + N++K IL  ++ +N SA++PGR + D +++ +E +H +K K RG     +LKLD+ KAYDRVEW+F
Subjt:  IVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVF

Query:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTSPTGQIVLEDSQSA
        LE IM +MGF+  WV LV +C+ +VS+S  VNGV+          G R G       + + M        HA+ +G    +   R +P    +     S 
Subjt:  LEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTSPTGQIVLEDSQSA

Query:  SFL---LGQCPK-KAVLSI--------LGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWI-------PCDTN
         FL   LG+C   K +L I        + +   + R FR            PSF+WRSLM G++++  G +WRIGNG  V V+   WI       P   +
Subjt:  SFL---LGQCPK-KAVLSI--------LGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWI-------PCDTN

Query:  LRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
          I SP        V  L    GHW    + + F P++   IL IPL      D +IW    +G Y+V+SGY
Subjt:  LRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

A0A803P8W5 Uncharacterized protein2.0e-5937.24Show/hide
Query:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV
        +I LIPKV  P ++ DYRPISLC+V+YKLVSKA+VN+ K +L   IS+N SA++PGR + DN +L +E +H LK K RGR  + +LKLDM KA+D+VEWV
Subjt:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV

Query:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTS------------
        FL+++MLKMGF   WVEL+ RC+SS S SFN+NG   G +              L     +S+S    +  H  +   S    +   S            
Subjt:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTS------------

Query:  --PTGQIVLEDSQSASF--------------LLGQCPKKAVLSILGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVY
           +GQ++  D    SF              +LG          LG+  Y+ R F+            PS  W+S+  G+ LL KG+RW+IG G +V   
Subjt:  --PTGQIVLEDSQSASF--------------LLGQCPKKAVLSILGVLGYSGREFR------------PSFIWRSLMWGKKLLEKGVRWRIGNGERVSVY

Query:  GSSWIPCDTNLRIASPMTLQ--PNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
           W+P +T      P T Q  P   V+   T    W+ +L++QHF   +   IL I +     ED +IWH  ++G Y+VKSGY
Subjt:  GSSWIPCDTNLRIASPMTLQ--PNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

A0A803PI64 Uncharacterized protein2.9e-5531.56Show/hide
Query:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV
        +I LIPKV+ P +V ++RPISLCNVIYK+VSK +V ++ G+++ +IS   SA+I  R + DNAI+GYE +H ++       G  +LKLDM KAYDRVEW+
Subjt:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV

Query:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATT----------------TYC----------------FSGQRWGKLALS------------
        FL  +M ++GFAQ WV+ + RC++S SFSF +NG   G                    +C                  G R+ +L +S            
Subjt:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATT----------------TYC----------------FSGQRWGKLALS------------

Query:  --------------------------------------------RTYYVSMSEPQKAPCHAQYLGLSSFMPR------------------------CRTS
                                                    R     M   ++   H +YLGL SF+ R                        CR  
Subjt:  --------------------------------------------RTYYVSMSEPQKAPCHAQYLGLSSFMPR------------------------CRTS

Query:  PTGQIVLED--SQSASFLLGQ-----------CPKKAVLSILGVLGY--SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRI
          G +   D    + + L  Q           C +    S     G+  +G     SF+WRSL+WGKKL+ KG RWR+GNGE V V    W+P     ++
Subjt:  PTGQIVLED--SQSASFLLGQ-----------CPKKAVLSILGVLGY--SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRI

Query:  ASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
            TL  N  V DL  A G W+ + IR  F+  +A  IL+IP      ED ++ H+ K+G Y+VKSGY
Subjt:  ASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

A0A803Q852 Uncharacterized protein2.9e-5534.53Show/hide
Query:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV
        +I LIPK+  P +++++RPISLCNV+YK+V+K +  +MK  L+  IS+  SA++ GR + DN I+GYE +H++K K  G     +LKLDM KAYD VEW 
Subjt:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV

Query:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQR-----WGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMP-----------R
        FL  +M  +G+ + W+E + RC++SVSF   +NG R G    +   G R     WG    ++  + S  +    P     LG  S              R
Subjt:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQR-----WGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMP-----------R

Query:  CRTSPTGQI--VLEDS--QSASFLLGQCPKKAVLSILGVLGYSGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQ
            P   +  VL++S   + SFL  +CP+ A                 S IW+ ++WG++++ +G RWR+GNG  + V+   W+P         P+   
Subjt:  CRTSPTGQI--VLEDS--QSASFLLGQCPKKAVLSILGVLGYSGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQ

Query:  PNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY
        PN  V  L  A   WN +++ ++F  ++  +IL IP+  +  ED ++W F K G Y VKSGY
Subjt:  PNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGY

M5XIV5 Uncharacterized protein5.9e-5634.24Show/hide
Query:  KVKNPCR-VTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVFLEQI
        +V + CR VT++RP+SLC VIYK+++KA  N++K +L  +ISK+ SA++P R ++DN +  +E +H +KG  +G+    ++KLDM KAYD+VEW FL+ I
Subjt:  KVKNPCR-VTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVFLEQI

Query:  MLKMGFAQAWVELVYRCISSVSFS-------FNVNGVRCGA-------------------TTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLS
        ML++GFA  WV+ +  C+  V+FS       F  N  R G+                     +  F         + +  +    E      +     L+
Subjt:  MLKMGFAQAWVELVYRCISSVSFS-------FNVNGVRCGA-------------------TTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLS

Query:  SFMPRCRTSPTGQIVLEDSQSASFLLGQCPKKAVLSILGVLGY----SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIAS
        S +         Q +L       F         +      L Y    SG  ++PS IW SL+WGK LL+ G+RWR+ NGE + VY   W+P   N +I S
Subjt:  SFMPRCRTSPTGQIVLEDSQSASFLLGQCPKKAVLSILGVLGY----SGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIAS

Query:  PMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGYL
              + +V +LF A+G W+  L++ +F  QE + IL IPL  +  ED +I H+++SG YSVKSGY+
Subjt:  PMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHFEKSGLYSVKSGYL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.0e-0524.29Show/hide
Query:  IVLIPKV-KNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECI-HALKGKSRGRVGWTSLKLDMIKAYDRVEW
        I+LIPK  ++  +  ++RPISL N+  K+++K + N+++  +  LI  +   +IPG     N       I H  + K +  V    + +D  KA+D+++ 
Subjt:  IVLIPKV-KNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECI-HALKGKSRGRVGWTSLKLDMIKAYDRVEW

Query:  VFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGA
         F+ + + K+G    +++++       + +  +NG +  A
Subjt:  VFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGA

P08548 LINE-1 reverse transcriptase homolog7.6e-0828.78Show/hide
Query:  IVLIPKV-KNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGR---CVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRV
        I LIPK  K+P R  +YRPISL N+  K+++K + N+++  +  +I  +   +IPG      +  +I   + I+ LK K         L +D  KA+D +
Subjt:  IVLIPKV-KNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGR---CVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRV

Query:  EWVFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVR
        +  F+ + + K+G    +++L+    S  + +  +NGV+
Subjt:  EWVFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVR

P11369 LINE-1 retrotransposable element ORF2 protein2.9e-0726.43Show/hide
Query:  IVLIPK-VKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHAL-KGKSRGRVGWTSLKLDMIKAYDRVEW
        I LIPK  K+P ++ ++RPISL N+  K+++K + N+++  +  +I  +   +IPG     N       IH + K K +  +    + LD  KA+D+++ 
Subjt:  IVLIPK-VKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHAL-KGKSRGRVGWTSLKLDMIKAYDRVEW

Query:  VFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGA
         F+ +++ + G    ++ ++    S    +  VNG +  A
Subjt:  VFLEQIMLKMGFAQAWVELVYRCISSVSFSFNVNGVRCGA

P14381 Transposon TX1 uncharacterized 149 kDa protein5.8e-0827.82Show/hide
Query:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV
        ++ L+PK  +   + ++RP+SL +  YK+V+KA+  ++K +L  +I  + S  +PGR + DN  L  + +H  +   R  +    L LD  KA+DRV+  
Subjt:  MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWV

Query:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVN
        +L   +    F   +V  +    +S      +N
Subjt:  FLEQIMLKMGFAQAWVELVYRCISSVSFSFNVN

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.3e-0729.31Show/hide
Query:  SFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASG---HWNGNLIRQHFSPQEARYILSIPLWQVACEDAV
        S+ W SL+ G  LL+KG R  IG+G+ + + G   I      R  +         + +LF   G    W+ + I Q     +  +I  I L +    D +
Subjt:  SFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASG---HWNGNLIRQHFSPQEARYILSIPLWQVACEDAV

Query:  IWHFEKSGLYSVKSGY
        IW++  +G Y+V+SGY
Subjt:  IWHFEKSGLYSVKSGY

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.5e-1437.5Show/hide
Query:  MVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVFLEQIMLKMGFAQAWVELVYR
        MV ++K ++  LI    +++IPGR   DN +   E +H+++ K +G  GW  LKLD+ KAYDR+ W +LE  ++  GF + W+  + R
Subjt:  MVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVFLEQIMLKMGFAQAWVELVYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGTGTTGATACCGAAGGTAAAGAATCCTTGTCGTGTGACTGACTACCGTCCTATCTCCCTATGTAATGTAATTTACAAACTAGTGTCGAAAGCCATGGTGAATCA
GATGAAAGGGATCCTGAATGGGCTGATTTCTAAGAATCATAGTGCGTATATCCCGGGTCGATGTGTGGTGGATAATGCTATCCTGGGCTACGAGTGTATTCATGCACTGA
AGGGTAAGTCTAGGGGTAGAGTTGGTTGGACATCTCTGAAGCTGGATATGATCAAGGCCTATGACCGGGTGGAATGGGTTTTCTTGGAGCAGATCATGTTGAAAATGGGT
TTTGCGCAAGCATGGGTGGAGTTGGTCTATCGGTGCATATCCTCAGTGAGCTTTTCCTTCAATGTGAATGGGGTAAGATGTGGGGCGACGACAACCTACTGTTTTTCAGG
ACAGAGGTGGGGGAAGCTCGCACTATCCAGGACATACTACGTTTCTATGAGCGAGCCTCAGAAGGCACCTTGTCATGCTCAGTACCTGGGGCTCTCGTCCTTTATGCCTC
GTTGTAGGACGAGCCCTACTGGCCAAATAGTCCTGGAGGATAGTCAGTCAGCTAGCTTCTTACTTGGCCAGTGTCCTAAAAAGGCGGTACTTTCCATCCTCGGAGTTCTT
GGATACAGCGGTAGGGAGTTTCGACCTTCCTTTATCTGGAGGAGCCTTATGTGGGGGAAGAAGTTGCTGGAAAAGGGGGTTCGCTGGAGGATTGGGAATGGAGAAAGGGT
GAGTGTGTATGGTTCCAGCTGGATTCCATGTGATACGAACCTGAGGATCGCTTCACCGATGACATTACAGCCTAATGCTAGAGTGAGGGACCTGTTTACGGCGTCAGGGC
ACTGGAATGGGAACCTTATTCGGCAGCACTTTAGCCCTCAGGAGGCAAGGTACATTCTTTCTATTCCCCTATGGCAGGTTGCTTGTGAGGATGCCGTCATCTGGCACTTT
GAGAAGTCGGGGCTATATTCAGTAAAGAGTGGGTATCTCACGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGTGTTGATACCGAAGGTAAAGAATCCTTGTCGTGTGACTGACTACCGTCCTATCTCCCTATGTAATGTAATTTACAAACTAGTGTCGAAAGCCATGGTGAATCA
GATGAAAGGGATCCTGAATGGGCTGATTTCTAAGAATCATAGTGCGTATATCCCGGGTCGATGTGTGGTGGATAATGCTATCCTGGGCTACGAGTGTATTCATGCACTGA
AGGGTAAGTCTAGGGGTAGAGTTGGTTGGACATCTCTGAAGCTGGATATGATCAAGGCCTATGACCGGGTGGAATGGGTTTTCTTGGAGCAGATCATGTTGAAAATGGGT
TTTGCGCAAGCATGGGTGGAGTTGGTCTATCGGTGCATATCCTCAGTGAGCTTTTCCTTCAATGTGAATGGGGTAAGATGTGGGGCGACGACAACCTACTGTTTTTCAGG
ACAGAGGTGGGGGAAGCTCGCACTATCCAGGACATACTACGTTTCTATGAGCGAGCCTCAGAAGGCACCTTGTCATGCTCAGTACCTGGGGCTCTCGTCCTTTATGCCTC
GTTGTAGGACGAGCCCTACTGGCCAAATAGTCCTGGAGGATAGTCAGTCAGCTAGCTTCTTACTTGGCCAGTGTCCTAAAAAGGCGGTACTTTCCATCCTCGGAGTTCTT
GGATACAGCGGTAGGGAGTTTCGACCTTCCTTTATCTGGAGGAGCCTTATGTGGGGGAAGAAGTTGCTGGAAAAGGGGGTTCGCTGGAGGATTGGGAATGGAGAAAGGGT
GAGTGTGTATGGTTCCAGCTGGATTCCATGTGATACGAACCTGAGGATCGCTTCACCGATGACATTACAGCCTAATGCTAGAGTGAGGGACCTGTTTACGGCGTCAGGGC
ACTGGAATGGGAACCTTATTCGGCAGCACTTTAGCCCTCAGGAGGCAAGGTACATTCTTTCTATTCCCCTATGGCAGGTTGCTTGTGAGGATGCCGTCATCTGGCACTTT
GAGAAGTCGGGGCTATATTCAGTAAAGAGTGGGTATCTCACGGTGTGA
Protein sequenceShow/hide protein sequence
MIVLIPKVKNPCRVTDYRPISLCNVIYKLVSKAMVNQMKGILNGLISKNHSAYIPGRCVVDNAILGYECIHALKGKSRGRVGWTSLKLDMIKAYDRVEWVFLEQIMLKMG
FAQAWVELVYRCISSVSFSFNVNGVRCGATTTYCFSGQRWGKLALSRTYYVSMSEPQKAPCHAQYLGLSSFMPRCRTSPTGQIVLEDSQSASFLLGQCPKKAVLSILGVL
GYSGREFRPSFIWRSLMWGKKLLEKGVRWRIGNGERVSVYGSSWIPCDTNLRIASPMTLQPNARVRDLFTASGHWNGNLIRQHFSPQEARYILSIPLWQVACEDAVIWHF
EKSGLYSVKSGYLTV