; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035501 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035501
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:22815283..22816778
RNA-Seq ExpressionLag0035501
SyntenyLag0035501
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CCA66050.1 hypothetical protein [Beta vulgaris subsp. vulgaris]6.8e-5233.06Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------
        E P + GGDFN  L  +EKEGG ++ +  + GFR  +D CSL D  F G      RG  R  E R+RE+  R +     L  +      H +R       
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------

Query:  ------GAEFIDLRRP-GCWIRDLW------------RWLRAVGGQVCR-----LDRREEW----------------------QGD------WEMPGG--
              G E +  RR  G W    W             W  A GG++C          + W                      QG+      WE   G  
Subjt:  ------GAEFIDLRRP-GCWIRDLW------------RWLRAVGGQVCR-----LDRREEW----------------------QGD------WEMPGG--

Query:  ---NEALGKGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM
           +E   K E+Y  LR              +  +   +R+KRNLI G+ D GG  + E  EI  +V  YF+ IFTSS P++ D   V   V+RSVT E 
Subjt:  ---NEALGKGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM

Query:  NRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKA
        N  L++P+ +EEI  AL  MHP KAPG DG+  +                                        + S   VS+FRPISLCNV+YK+  KA
Subjt:  NRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKA

Query:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
        +V R+K  L  + ++NQSAF+PGR + DN ++  E  H +KKR    +   ++KLDMSKAYDRVEW +L +++L MGF+  WV+
Subjt:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

OMO59710.1 reverse transcriptase [Corchorus capsularis]5.4e-4931.84Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDALETWHYKCFSH----------TL
        E  W   GDFN  L+Q EK+GGR + ++++  FREA+D C L D G+ G      RG+        R   G    +    +   C +H           L
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDALETWHYKCFSH----------TL

Query:  RGAEFIDLRRP--------------------------GCW---------------------------------IRDLWRWLRAVGGQVCRLDRREEWQGD
           E    RR                            CW                                 I +L + L  + G    +   EE +  
Subjt:  RGAEFIDLRRP--------------------------GCW---------------------------------IRDLWRWLRAVGGQVCRLDRREEWQGD

Query:  WEMPGGNEALGKGESYRQLRPDWRRYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRR
         E+   N  L + ES+      W R +W                KRRK+N I  L    G +  +P EI  + S YF+ +F SS   +K  D +   V  
Subjt:  WEMPGGNEALGKGESYRQLRPDWRRYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRR

Query:  SVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLS-----------GLSLGS------HG----------------------RVSKFRPISLCNVVY
        S+T EMN  L+  F  EEI  ALKQ+HP KAPG DG+            G  + S      HG                       +++FRPISLCNV+Y
Subjt:  SVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLS-----------GLSLGS------HG----------------------RVSKFRPISLCNVVY

Query:  KLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
        K++ K LVNR+K IL + IS++QSAF+PGR + DN+++ +E +H+LK R+ GK  + +LKLDMSKAYDRVEW +LE IML+MGF++ WV+
Subjt:  KLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

XP_010684899.1 PREDICTED: uncharacterized protein LOC104899410 [Beta vulgaris subsp. vulgaris]7.8e-4833.63Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------
        E P + GGDFN  L  +EKEGG ++ +  + GFR  +D CSL +  F G      RG  R  E R+RE+  R +     L  +      H +R       
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------

Query:  ------GAEFIDLRRPG------CWIRD-------LWRWLRAVGGQVCRL--DRREEWQGDWEMPGGN-----------------EALG-----------
              G E +  RR G       W+ D       +  W  A  G++C        E QG  +   G+                 EA+            
Subjt:  ------GAEFIDLRRPG------CWIRD-------LWRWLRAVGGQVCRL--DRREEWQGDWEMPGGN-----------------EALG-----------

Query:  --------KGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM
                K E++  +R              +  +   +R+KRNLI G+ D GG  + E  EI  +V  YF+ IFTSS P+  D   V   V+  VT E 
Subjt:  --------KGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM

Query:  NRRLMRPFHQEEILLALKQMHPNKAPGSDG---LSGLSLGSHGR----VSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVIL
        N  L+ P+ +EEI  AL  MHP KAPG DG    + ++L    +    VS+FRPISLCNV+YK+  KA+V R+K  L  ++++NQSAF+PGR + DN ++
Subjt:  NRRLMRPFHQEEILLALKQMHPNKAPGSDG---LSGLSLGSHGR----VSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVIL

Query:  GYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
          E  H +K R    +   +++LDMSKAYDRVEW YL++++L MGF+  WV+
Subjt:  GYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris]5.9e-4831.33Show/hide
Query:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLR----RVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-----GAEFID
        P ++GGDFN  L  +EK+GG  + +  + GFRE +D+C L D    G+          E R+RE+  R L     L+ +      H +R      A  + 
Subjt:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLR----RVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-----GAEFID

Query:  LRRP---GCWIRDL---WRWLRAVGGQVCRLDRREEWQGD------------------WEMPGGNEALGK------------------------GESYRQ
         + P    C +R      +WL   G   C    RE W G                   W   G  +   K                        GE  ++
Subjt:  LRRP---GCWIRDL---WRWLRAVGGQVCRLDRREEWQGD------------------WEMPGGNEALGK------------------------GESYRQ

Query:  L-------------------------RPDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNR
        L                            +  +   +R+KRN I+GL D  G  R E  E+  LV +YF  IFTSS P+   +D V   V++SVT E N 
Subjt:  L-------------------------RPDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNR

Query:  RLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKALV
         L++P+ +EEI  ALKQMHP KAPG DGL  +                                        + +   VS+FRPISLCNV+YK+  KALV
Subjt:  RLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKALV

Query:  NRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
         R+K  L  ++++NQSAF+PGR + DN ++  E  H++KKR    +   ++KLDMSKAYDRVEW +L +++L MGF+  WV+
Subjt:  NRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata]8.3e-5031.86Show/hide
Query:  IETSSGDAETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLRRVIEGRVREQSGRELTD---ALETW-----HYKCFSHTL
        I +     + PW++ GDFN  ++ +EK G   +   ++ GFR+ +  C L+D GF G+       GR+ EQ      D   A E W       K +   +
Subjt:  IETSSGDAETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLRRVIEGRVREQSGRELTD---ALETW-----HYKCFSHTL

Query:  RGAEF----IDLRRP-----GCWIRDLWRWLRAVGGQV--------CRLDRREEWQGDWEMPGGNEALGKGESYRQLRPD--WRRYS---WRK-------
          ++     + LRR       C+   L  W + V G V         RL   E      E     + L K  +   LR +  W + S   W K       
Subjt:  RGAEF----IDLRRP-----GCWIRDLWRWLRAVGGQV--------CRLDRREEWQGDWEMPGGNEALGKGESYRQLRPD--WRRYS---WRK-------

Query:  --------RRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGL
                RR++N I+GL D+ G  + +  E+ G++  YF+ IF++S P   +       + R V+D+MN  L++ F +EE+  ALKQMHP K+PG + +
Subjt:  --------RRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGL

Query:  SGLSLGSH---------------------------------------GRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVI
        S +   S+                                        ++S+FRPISLCNV+YK+V K L NR+K +L  +IS+ QSAF+PGR + DNV+
Subjt:  SGLSLGSH---------------------------------------GRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVI

Query:  LGYECIHAL-KKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV
        + +E +H + +KR GK    ++KLDMSKAYDRVEW YLE IM ++GF++ W+
Subjt:  LGYECIHAL-KKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase2.6e-4931.84Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDALETWHYKCFSH----------TL
        E  W   GDFN  L+Q EK+GGR + ++++  FREA+D C L D G+ G      RG+        R   G    +    +   C +H           L
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDALETWHYKCFSH----------TL

Query:  RGAEFIDLRRP--------------------------GCW---------------------------------IRDLWRWLRAVGGQVCRLDRREEWQGD
           E    RR                            CW                                 I +L + L  + G    +   EE +  
Subjt:  RGAEFIDLRRP--------------------------GCW---------------------------------IRDLWRWLRAVGGQVCRLDRREEWQGD

Query:  WEMPGGNEALGKGESYRQLRPDWRRYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRR
         E+   N  L + ES+      W R +W                KRRK+N I  L    G +  +P EI  + S YF+ +F SS   +K  D +   V  
Subjt:  WEMPGGNEALGKGESYRQLRPDWRRYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRR

Query:  SVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLS-----------GLSLGS------HG----------------------RVSKFRPISLCNVVY
        S+T EMN  L+  F  EEI  ALKQ+HP KAPG DG+            G  + S      HG                       +++FRPISLCNV+Y
Subjt:  SVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLS-----------GLSLGS------HG----------------------RVSKFRPISLCNVVY

Query:  KLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
        K++ K LVNR+K IL + IS++QSAF+PGR + DN+++ +E +H+LK R+ GK  + +LKLDMSKAYDRVEW +LE IML+MGF++ WV+
Subjt:  KLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

A0A2N9EV35 Uncharacterized protein5.8e-4932.43Show/hide
Query:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLR--RVIEGRVREQSGRELTDALETW-------------------------
        PW+  GDFN  + QNEK G   +S +++  FRE  + C+L+D GF+G         +G    Q   +   A  TW                         
Subjt:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLR--RVIEGRVREQSGRELTDALETW-------------------------

Query:  ---HYKCFSHTLRGAEFID--LRRPGC--WIRDLWRWLRAVGGQVCRLDRREEWQGDWEMPGGNEALGKGESYRQ-LRPDWRRYSWR----------KRR
           H        +   F +  +  P C   IR LW+    VG  +  L  + +      +    E  G    ++Q +R  W +   R          +R+
Subjt:  ---HYKCFSHTLRGAEFID--LRRPGC--WIRDLWRWLRAVGGQVCRLDRREEWQGDWEMPGGNEALGKGESYRQ-LRPDWRRYSWR----------KRR

Query:  KRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL-------
        K N + GL+DS      +P  +  + ++YF+ +FT+S P+   ID     V R VT EMNRRL+ P++  EI  AL QMHP+K+PG DG+S +       
Subjt:  KRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL-------

Query:  --------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALK
                                         + +  ++S +RPISLCNVVYK++ K L NR+K  L  +IS  QSAF+PGR + DN+I+ YE +++LK
Subjt:  --------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALK

Query:  KRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGW
         RR GKT   ++KLDMSKAYDRVEW ++E++M K+GF+Q W
Subjt:  KRR-GKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGW

A0A2N9IXL7 Uncharacterized protein3.4e-4936.92Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG--------RGLRRVIEGRV-REQSGRELTD-----ALETWHYKCFSHT---
        E PWMV GDFN     +E+ G   +++++++ FREA+  C L+D GF+G        RG   ++  R+ R  +  E         +      C  H    
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG--------RGLRRVIEGRV-REQSGRELTD-----ALETWHYKCFSHT---

Query:  -----LRGAEFIDLRRPGCWIRDLWRWLRAVGGQVCRLDRREEWQGDWE-MPGGNEALGKGESYRQLRPDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPG
               G + +D RR   + R    W+R VG +       E   G W  +P G       E  +Q R              NLI+         +  P 
Subjt:  -----LRGAEFIDLRRPGCWIRDLWRWLRAVGGQVCRLDRREEWQGDWE-MPGGNEALGKGESYRQLRPDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPG

Query:  EIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGLS-------LGSHGRVSKFRPISLCNVV
        EI  +   YF +IFTSS P A  I  V + V   VT  MN  L++PF +EE+  AL QM+P+KAPG DG    +       + +  ++++FRPISLCNV+
Subjt:  EIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGLS-------LGSHGRVSKFRPISLCNVV

Query:  YKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKK-RRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV
        YK+  K LVNRMK +L  +IS++QSAF+P R + DNVI+ +E IH LK  R+G     + KLDMSKAYDRVEW YL+ IMLK+GF + WV
Subjt:  YKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKK-RRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV

A0A2N9IZB6 Reverse transcriptase domain-containing protein5.2e-5037.34Show/hide
Query:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGL----RRVIEGRVREQSGRELTDALETWHYKCFSHTLRGAEFIDLRRPGCW
        PWMV  DFN  +  +E+ G   +S +++  F+EA+    L D GF G       RR+ E  VR +  R +++    W+          A+   +  P   
Subjt:  PWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGL----RRVIEGRVREQSGRELTDALETWHYKCFSHTLRGAEFIDLRRPGCW

Query:  IRDLWRWLRAVGGQVCRLDRREEWQGDWEMPGGNEALGKGESYRQLRPD--WR---RYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGE
           +   +R    +   L+   +    ++  G N AL K  +    + +  WR   R +W                +R++ N I+GL+D     R +P E
Subjt:  IRDLWRWLRAVGGQVCRLDRREEWQGDWEMPGGNEALGKGESYRQLRPD--WR---RYSW---------------RKRRKRNLIRGLVDSGGVMRHEPGE

Query:  IVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGLSLGSHGRVSKFRPISLCNVVYKLVFKAL
        +  + + YF  +FTSS PT  D+D V   V   VT +MN  L+RPF  +E+  AL QMHP+KAPG D  S  S+      S+FRPISLCNV+YK++ K L
Subjt:  IVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGLSLGSHGRVSKFRPISLCNVVYKLVFKAL

Query:  VNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKK-RRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
        VNRMK +L  +IS++Q AF+PGR + DNVI+ +E IH LK  R GK    + KLDMSKAYDRVEW YL  +M K+GF   WVD
Subjt:  VNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKK-RRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

F4NCJ4 Reverse transcriptase domain-containing protein3.3e-5233.06Show/hide
Query:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------
        E P + GGDFN  L  +EKEGG ++ +  + GFR  +D CSL D  F G      RG  R  E R+RE+  R +     L  +      H +R       
Subjt:  ETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTG------RGLRRVIEGRVREQSGRELTDA--LETWHYKCFSHTLR-------

Query:  ------GAEFIDLRRP-GCWIRDLW------------RWLRAVGGQVCR-----LDRREEW----------------------QGD------WEMPGG--
              G E +  RR  G W    W             W  A GG++C          + W                      QG+      WE   G  
Subjt:  ------GAEFIDLRRP-GCWIRDLW------------RWLRAVGGQVCR-----LDRREEW----------------------QGD------WEMPGG--

Query:  ---NEALGKGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM
           +E   K E+Y  LR              +  +   +R+KRNLI G+ D GG  + E  EI  +V  YF+ IFTSS P++ D   V   V+RSVT E 
Subjt:  ---NEALGKGESYRQLR------------PDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEM

Query:  NRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKA
        N  L++P+ +EEI  AL  MHP KAPG DG+  +                                        + S   VS+FRPISLCNV+YK+  KA
Subjt:  NRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGL---------------------------------------SLGSHGRVSKFRPISLCNVVYKLVFKA

Query:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD
        +V R+K  L  + ++NQSAF+PGR + DN ++  E  H +KKR    +   ++KLDMSKAYDRVEW +L +++L MGF+  WV+
Subjt:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTE-WASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.1e-1021.4Show/hide
Query:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDV-VTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSG---
        +K+R++N I  + +  G +  +P EI   + EY+++++ +     +++D  +       +  E    L RP    EI+  +  +   K+PG DG +    
Subjt:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDV-VTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSG---

Query:  ---------------LSLGSHG----------------------RVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE
                        S+   G                      +   FRPISL N+  K++ K L NR++  +  LI  +Q  FIPG     N+     
Subjt:  ---------------LSLGSHG----------------------RVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE

Query:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE
         I  + + + K     + +D  KA+D+++  ++ + + K+G +
Subjt:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE

P08548 LINE-1 reverse transcriptase homolog1.0e-1023.05Show/hide
Query:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDID-VVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSG---
        RK+R ++LI  + +    +  +P EI  +++EY++ +++      K+ID  + A     ++ +    L RP    EI   ++ +   K+PG DG +    
Subjt:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDID-VVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSG---

Query:  -----------LSLGSH--------------------------GRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE
                   L+L  +                           R   +RPISL N+  K++ K L NR++  +  +I  +Q  FIPG     N+     
Subjt:  -----------LSLGSH--------------------------GRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE

Query:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE
         I  + K + K +   L +D  KA+D ++  ++ + + K+G E
Subjt:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-0720.58Show/hide
Query:  RKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNR----RLMRPFHQEEILLALKQMHPNKAPGSDGLSG---
        R + LI  + +  G +  +P EI   +  +++ ++++     +++D +   + R    ++N+     L  P   +EI   +  +   K+PG DG S    
Subjt:  RKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNR----RLMRPFHQEEILLALKQMHPNKAPGSDGLSG---

Query:  --------------------------------LSL-----GSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE
                                        ++L         ++  FRPISL N+  K++ K L NR++  +  +I  +Q  FIPG     N+     
Subjt:  --------------------------------LSL-----GSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYE

Query:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE
         IH + K + K     + LD  KA+D+++  ++ +++ + G +
Subjt:  CIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFE

P14381 Transposon TX1 uncharacterized 149 kDa protein6.5e-1329Show/hide
Query:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGL------
        +K+  R  I  L    G    +P  I      +++N+F S  P + D           V++    RL  P   +E+  AL+ M  NK+PG DGL      
Subjt:  RKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVRRSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGL------

Query:  -----------------------------SGLSL----GSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECI
                                     + LSL    G    +  +RP+SL +  YK+V KA+  R+K +L  +I  +QS  +PGR + DNV L  + +
Subjt:  -----------------------------SGLSL----GSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECI

Query:  HALKKRRGKTEWASLKLDMSKAYDRVEWVYL
        H    RR     A L LD  KA+DRV+  YL
Subjt:  HALKKRRGKTEWASLKLDMSKAYDRVEWVYL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.8e-1640.24Show/hide
Query:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV
        +V R+K ++  LI   Q++FIPGR   DN++   E +H++++++G   W  LKLD+ KAYDR+ W YLE  ++  GF + W+
Subjt:  LVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKKRRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCACTATTGAAACGTCTTCGGGGGATGCTGAGACTCCGTGGATGGTTGGCGGTGATTTTAATGCGAACTTGTACCAGAATGAGAAGGAAGGAGGAAGGGCTAAATC
AAAATCTGAACTAAACGGTTTTCGAGAGGCTGTGGACTCGTGCAGTTTGATTGATTTTGGGTTCACAGGGAGAGGTTTACGTCGTGTAATAGAAGGTAGGGTACGGGAAC
AGTCTGGGAGAGAATTGACAGATGCTTTGGAAACATGGCACTACAAATGCTTTTCCCACACGCTGAGGGGAGCAGAATTTATAGATTTGAGGAGGCCTGGTTGTTGGATC
CGGGATTTATGGAGGTGGTTAAGAGCAGTTGGGGGGCAAGTCTGTCGGTTGGATCGCCGAGAGGAGTGGCAGGGGGACTGGGAAATGCCTGGAGGCAATGAGGCGTTGGG
GAAGGGGGAGAGCTACAGGCAGTTGAGGCCAGATTGGAGGCGATATTCCTGGAGGAAGAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGACAGTGGTGGCGTGATGA
GGCATGAGCCTGGAGAGATTGTGGGTCTGGTCTCAGAGTACTTCGAGAACATCTTCACGTCTAGTTGTCCGACAGCCAAGGATATTGATGTTGTTACAGCAGGGGTGAGG
AGATCAGTAACAGATGAGATGAACAGACGACTGATGAGACCTTTCCACCAGGAGGAGATTCTCCTTGCTTTGAAACAAATGCATCCTAATAAAGCTCCGGGGTCGGATGG
GCTTTCAGGGCTTTCTTTAGGAAGTCATGGGAGGGTCTCAAAGTTTAGACCCATATCCCTTTGTAATGTGGTGTACAAGCTAGTCTTCAAAGCACTGGTGAACAGAATGA
AAGGAATTCTGAACATGCTAATCTCCCAAAACCAGAGTGCCTTTATCCCGGGTCGATGTGTGGTGGATAATGTCATACTGGGTTATGAGTGCATCCATGCTTTGAAGAAA
AGGAGGGGGAAAACTGAGTGGGCCTCACTCAAGCTTGACATGAGTAAGGCCTACGATCGGGTGGAATGGGTGTATTTGGAGCAGATTATGCTGAAAATGGGATTCGAGCA
GGGATGGGTCGATTCACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCACTATTGAAACGTCTTCGGGGGATGCTGAGACTCCGTGGATGGTTGGCGGTGATTTTAATGCGAACTTGTACCAGAATGAGAAGGAAGGAGGAAGGGCTAAATC
AAAATCTGAACTAAACGGTTTTCGAGAGGCTGTGGACTCGTGCAGTTTGATTGATTTTGGGTTCACAGGGAGAGGTTTACGTCGTGTAATAGAAGGTAGGGTACGGGAAC
AGTCTGGGAGAGAATTGACAGATGCTTTGGAAACATGGCACTACAAATGCTTTTCCCACACGCTGAGGGGAGCAGAATTTATAGATTTGAGGAGGCCTGGTTGTTGGATC
CGGGATTTATGGAGGTGGTTAAGAGCAGTTGGGGGGCAAGTCTGTCGGTTGGATCGCCGAGAGGAGTGGCAGGGGGACTGGGAAATGCCTGGAGGCAATGAGGCGTTGGG
GAAGGGGGAGAGCTACAGGCAGTTGAGGCCAGATTGGAGGCGATATTCCTGGAGGAAGAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGACAGTGGTGGCGTGATGA
GGCATGAGCCTGGAGAGATTGTGGGTCTGGTCTCAGAGTACTTCGAGAACATCTTCACGTCTAGTTGTCCGACAGCCAAGGATATTGATGTTGTTACAGCAGGGGTGAGG
AGATCAGTAACAGATGAGATGAACAGACGACTGATGAGACCTTTCCACCAGGAGGAGATTCTCCTTGCTTTGAAACAAATGCATCCTAATAAAGCTCCGGGGTCGGATGG
GCTTTCAGGGCTTTCTTTAGGAAGTCATGGGAGGGTCTCAAAGTTTAGACCCATATCCCTTTGTAATGTGGTGTACAAGCTAGTCTTCAAAGCACTGGTGAACAGAATGA
AAGGAATTCTGAACATGCTAATCTCCCAAAACCAGAGTGCCTTTATCCCGGGTCGATGTGTGGTGGATAATGTCATACTGGGTTATGAGTGCATCCATGCTTTGAAGAAA
AGGAGGGGGAAAACTGAGTGGGCCTCACTCAAGCTTGACATGAGTAAGGCCTACGATCGGGTGGAATGGGTGTATTTGGAGCAGATTATGCTGAAAATGGGATTCGAGCA
GGGATGGGTCGATTCACCGTGA
Protein sequenceShow/hide protein sequence
MGTIETSSGDAETPWMVGGDFNANLYQNEKEGGRAKSKSELNGFREAVDSCSLIDFGFTGRGLRRVIEGRVREQSGRELTDALETWHYKCFSHTLRGAEFIDLRRPGCWI
RDLWRWLRAVGGQVCRLDRREEWQGDWEMPGGNEALGKGESYRQLRPDWRRYSWRKRRKRNLIRGLVDSGGVMRHEPGEIVGLVSEYFENIFTSSCPTAKDIDVVTAGVR
RSVTDEMNRRLMRPFHQEEILLALKQMHPNKAPGSDGLSGLSLGSHGRVSKFRPISLCNVVYKLVFKALVNRMKGILNMLISQNQSAFIPGRCVVDNVILGYECIHALKK
RRGKTEWASLKLDMSKAYDRVEWVYLEQIMLKMGFEQGWVDSP