CuGenDBv2

Lag0036622 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036622
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:49511772..49513585
RNA-Seq Expression: Lag0036622
Synteny: Lag0036622
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily
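The genome location above uses a `chrom:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the string can be parsed and the span length computed as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr3:49511772..49513585")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1814 positions on chr3.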


Homology
GenBank top hits (e value, % identity, alignment)
XP_010666352.1 PREDICTED: uncharacterized protein LOC104883509 [Beta vulgaris subsp. vulgaris] (e value: 1.3e-85; % identity: 36.48)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        ++RN+I G++D +G+WQTE   ++     YF  IF+S+ PS+ ++ +VL HV   V++E N +L+ PYT+ E+  A+   HP K   PDG   +F+QR W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
         I+G+   +    IL++ +     N T++ LIPK+    +VS +RPISLCNV YKI +K +V RLK  L  I+ E QS F+PGR+I+DN +I  E  H +
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA
        + +N  + G  A+KLDMSKAYDRVEW +LR++L  +GF   W+NLVM C++   +S ++NG   G + PSRG+RQGDPLSP++F+L  +  S ++   + 
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA

Query:  N------------------------------------------GRSR-------------------------------------------VAIARIPRRI
        N                                          GRS+                                           + + ++P  +
Subjt:  N------------------------------------------GRSR-------------------------------------------VAIARIPRRI

Query:  LSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
        + +I S  A+FWWG RGD+RKMHW  WE++CKP+ +GGL F+DL  FN A+L KQ
Subjt:  LSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 9.6e-89; % identity: 36.94)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        RRRN I GI D  G WQ  T  +     +YF+ I+SS+ P++  I +VL+ +P  V+EEMN  L+  +TR E+E A+   HPTK   PDG   +FFQ+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
        +IVGN  V   L +LNS  S+   N T+I L+PKI++   +S +RPISLCNV YK+++KV+ NRLK +L  II E QS F+ GRLITDN+++  E +HYL
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLAST--
        +HK  GK G+ AIKLDMSKAYDRVEW +++Q++ K+GFH +WI LVM C+T   +SIL+NG  +G I P+RG+RQGDP+SPY+FLLC +G S+LL     
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLAST--

Query:  -----------------------------RAN-----------------------------------------------------------------GRS
                                     +AN                                                                 G+S
Subjt:  -----------------------------RAN-----------------------------------------------------------------GRS

Query:  RVAI-------------------------------------------ARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLIS
        +V I                                            +IP+ +  +I ++  +FWWG RG + K+ W  W++LCK ++ GG+ FR+L +
Subjt:  RVAI-------------------------------------------ARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLIS

Query:  FNQAMLAKQ
        FN AMLAKQ
Subjt:  FNQAMLAKQ

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata] (e value: 1.3e-85; % identity: 42.93)
Query:  RRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWD
        RRN I G+ED  G+W  +   +    E YF +IF+S+NP     D +L  +   V  +  + L      +EV+ A+    P      DG  PVF++  W 
Subjt:  RRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWD

Query:  IVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQ
        IVG    +  L  LNSG   +S N T I LIPKI++ + VS +RPISLCNV YK++ KV+ NRLK  L   + + QS F+ GRLI+DN+++  ETLHYL+
Subjt:  IVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQ

Query:  HKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRAN
         K RGK+G+ A+KLDMSKAYDRVEWS++  I+  LG       ++M C+    +SIL+NG+  G+I+PSRG+RQG PLSPY+FLLC  GL  LL      
Subjt:  HKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRAN

Query:  GRSRVAIA-------------RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
        GR  +  A             ++P+ ++ ++  L  KFWWG   D RK+HW  WE LC+ +E+GG+ F+++  FN A+LAKQ
Subjt:  GRSRVAIA-------------RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

XP_035546285.1 uncharacterized protein LOC108996706 [Juglans regia] (e value: 8.4e-85; % identity: 39.43)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        R++N I  I D   L QT+   ++  F  YF+ +F+STNP+  DI+  L  V   V+++MN  L   +TR EVE+A++   P K   PDGF P FFQ+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
         +VG+      L+ LN      S N+T + LIPK++  R+ S +RPISLCNV+YKI++KV+ NRLK  LN +I   QS FIPG+LITDN+++ +E LH +
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA
        + + +GK+G  AIKLDMSKAYDR+EW YL  ++ KLGF  +WI L+MKC+T   +S+L+NG       P RG+RQGDPLSPY+F+LC EGLSALL  + +
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA

Query:  NGRSR-VAIAR-----------------------------------------------------------IPRRILSQISSLC-----------------
         G +R V +AR                                                             R+IL Q  S+                  
Subjt:  NGRSR-VAIAR-----------------------------------------------------------IPRRILSQISSLC-----------------

Query:  ------------AKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
                     KFWWG++     +HW  WE+L K +  GGL FRDL SFN A+LAKQ
Subjt:  ------------AKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

XP_040374850.1 uncharacterized protein LOC112200370 [Rosa chinensis] (e value: 3.6e-88; % identity: 38.6)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        R+RN I G+ D  G+WQ     +      YF++IFSS        D VL  +  +VS EMN +L+APY+  EV +++   HP+K   PDGF P FFQ+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
        D+VG+   +  +++L         N+T + LIPKI+    +SH RPI+LCNV YKI +KVI NRLK  L DII   QS F+P RLI+DN ++  E  HY+
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA
           +RG+ G+ A+KLD+SKAYDR+EW +L++I+ K+GF  +W+ L+M CL+   FS L+NG   G++ P RG+RQGDPLSPY+FLLC EGLSAL++   +
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA

Query:  NGR-------------------------------------------------------------------SRVAIARIP-------------------RR
        NG+                                                                    RV++A+                     + 
Subjt:  NGR-------------------------------------------------------------------SRVAIARIP-------------------RR

Query:  ILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
        ++ ++  LCA+FWWG+   K  MHWR W+ELCKP+ +GG+ FR L +FN AMLAKQ
Subjt:  ILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EV70 Uncharacterized protein (e value: 1.3e-94; % identity: 42.51)
Query:  RRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWD
        RRN I  + D  G W      +      Y+  +F++++P+  ++ + + +V   V+ EMN  L   +   EVE AI+   P+K   PDG PP+F+Q+ W 
Subjt:  RRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWD

Query:  IVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQ
        +VGN      L+ LNSG+ ++S NHT I LIPK++    V+ +RPISLCNV YK+V KV+VNRLK++L  I+ + QS F+PGRLITDN++I  ETLH++ 
Subjt:  IVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQ

Query:  HKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRAN
        H   G+ G  A+KLDMSKAYDRVEW YL QI+ K+GFH +WI L++ C++   +S+L+NG+  G+IQPSRG+RQGDPLSPY+FLLC EGL +L+    A+
Subjt:  HKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRAN

Query:  GRSRVAIARIPRRI-----------LSQ-------------------------------ISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDL
        G  + + A+I  R+           LSQ                               + +L  +FWW +  D++K+HW  W++LC P++ GG+ FRDL
Subjt:  GRSRVAIARIPRRI-----------LSQ-------------------------------ISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDL

Query:  ISFNQAMLAKQKTE
          FN+A+LAKQ  E
Subjt:  ISFNQAMLAKQKTE

A0A2N9FBQ4 Reverse transcriptase domain-containing protein (e value: 4.0e-93; % identity: 37.01)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        +RRN +  ++D  G+W T    V   F  Y+  +F+++NP+   +D V+ ++   V+  MN ML++ +T  EV  A++   P K   PDG PPVF+Q  W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
         ++G   ++  L  LNSG  +++ NHT + LIPK+++  +V+ +RPISLCNV YKI++KV+ NRLK +L  I+ E QS F+PGRLITDN+++  ETLHY+
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTR-
        QH+  GK G  A+KLDMSKAYDRVEW YL+ ++ ++GFH +WI+++M+C++   +SIL+NGE  G+IQP+RG+RQGDPLSPY+FLLC EGL +L+   + 
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTR-

Query:  ANGRSRVAIA------------------------------------------------------------------------------------------
        A G   V+I+                                                                                          
Subjt:  ANGRSRVAIA------------------------------------------------------------------------------------------

Query:  --------------------RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
                            R+P+R++ +I  L  +FWWG  GD+ KMHW  W+ LCK +  GGL FR L  FN+A+LAKQ
Subjt:  --------------------RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

A0A2N9FK76 Reverse transcriptase domain-containing protein (e value: 1.4e-90; % identity: 43.32)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        +R N I G+ D    WQT+  V+++    YF+ +F S+NP+   I  V   V   V+ +MN  L+ P+T  EV+ A+   HP+K   PDG   +FFQ+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
         IVG       L  LNS   +   N T+I+LIPK++  + ++ +RPISLCNV YK+++KV+VNR+K +L ++I  CQS F+PGR+ITDN+I+  E LH+L
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALL-----
        ++K  GK+G  A KLDMSKAYDRVEW YLR IL KLGFH +W+  VM C+T   +SI++NGE  G I+P RG+RQGDPLSPY+FL+C EGLSALL     
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALL-----

Query:  -----------------------------ASTRANGRSRVAIARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAM
                                     A  +A     ++  + P+ + S+ISS+  +FWWG RG +RK+HW     L + +  GG+ F +L  FN A+
Subjt:  -----------------------------ASTRANGRSRVAIARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAM

Query:  LAKQ
        LA+Q
Subjt:  LAKQ

A0A2N9GHW6 Reverse transcriptase domain-containing protein (e value: 1.6e-89; % identity: 38.38)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        +RRN+I GI D  G+WQ     V+ T   Y+K +F+++ P   + D++L  V   V+ +MN  L A +T AEVE A+      K +  DG  P+F+Q+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
        +IVGN   ++ L+ L  G+ ++  NHT I LIPK+++   V  +RPISLCNV YKI+ KV+ NRLK +L  II E QS F+PGRLI+DN++I  +TLH++
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLA----
        +     + GY A+KLDMSKAYDRVEW++L  I+ K+GF+V W+++VM+C+    +S+L+NGE  G   P RG+RQGDP+SPY+FLLC EGL+ALLA    
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLA----

Query:  STRANG--------------------------------------RSRVA-----------------------------IARIPRRILSQISSLCAKFWWG
        S +  G                                      RS++                              + ++P+++ S++  +   FWWG
Subjt:  STRANG--------------------------------------RSRVA-----------------------------IARIPRRILSQISSLCAKFWWG

Query:  SRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQKTEPRKQNRNFERFW
          G+ RK+HW  W  LCKP+++GG+ FR+L  FN+A+LAK +     +     R W
Subjt:  SRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQKTEPRKQNRNFERFW

A0A2N9GY38 Reverse transcriptase domain-containing protein (e value: 2.1e-94; % identity: 41.95)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW
        +RRN++  +++ EG W T        F  ++  +F +  P Q  ID+V  H+   V+EEMN  L   +T  +V  A++   P K   PDG PP+FFQ+ W
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCW

Query:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL
         ++G+      L  LN+G  +++ NHT I LIPKI++   V  +RPISLCNV YKI++KV+ NRLKI+L  I+ E QS F+PGRLITDN+++  ETLH++
Subjt:  DIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYL

Query:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA
        QH+ +G+IG  A+KLDMSKAYDRVEW YL++++ ++GF  +W+ ++M+C++   +SIL+NGE    I+PSR +RQGDPLSPY+FLLC EG  +LL   + 
Subjt:  QHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRA

Query:  NGRSR----------------------------------------VAIARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLI
         G  +                                        ++  R+P R++ +I  L  +FWWG  GDK KMHW  W  LCK +  GG+ FR+L 
Subjt:  NGRSR----------------------------------------VAIARIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLI

Query:  SFNQAMLAKQ
         FN+A+LAKQ
Subjt:  SFNQAMLAKQ

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 5.7e-28; % identity: 28.47)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPM-KVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRC
        R +N I  I++ +G   T+ T +Q T   Y+K ++++   +  ++D  L+   + ++++E  + L  P T +E+   I S    K   PDGF   F+QR 
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPM-KVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRC

Query:  WDIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKI-RHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLH
         + +    +    +I   G    S+   SI+LIPK  R      ++RPISL N+  KI+ K++ NR++  +  +I   Q  FIPG     N+    ++++
Subjt:  WDIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKI-RHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLH

Query:  YLQHKNRGK-IGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLS
         +QH NR K   +  I +D  KA+D+++  ++ + L+KLG    ++ ++         +I++NG+         G RQG PLSP +F +  E L+
Subjt:  YLQHKNRGK-IGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLS

P08548 LINE-1 reverse transcriptase homolog (e value: 6.6e-24; % identity: 26.32)
Query:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLN--HVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQR
        R ++ I  I +      T+ + +Q     Y+K+++S    + ++ID+ L   H+P ++S++  +ML  P + +E+   I++    K   PDGF   F+Q 
Subjt:  RRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLN--HVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQR

Query:  CWDIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKI-RHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETL
          + +    ++    I   G    ++   +I LIPK  +      +YRPISL N+  KI+ K++ NR++  +  II   Q  FIPG     N+    +++
Subjt:  CWDIVGNTTVSNCLAILNSGASIQSWNHTSILLIPKI-RHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETL

Query:  HYLQHKNRGK-IGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLA
        + +QH N+ K   +  + +D  KA+D ++  ++ + L K+G    ++ L+    +    +I++NG          G RQG PLSP +F +  E L+  + 
Subjt:  HYLQHKNRGK-IGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLA

Query:  STRA
          +A
Subjt:  STRA

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.6e-25; % identity: 28.28)
Query:  IGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPM-KVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVG
        I  I + +G   T+   +Q+T  +++K ++S+   +  ++DK L+   + K++++    L +P +  E+E  I S    K   PDGF   F+Q       
Subjt:  IGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPM-KVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVG

Query:  NTTVSNCLAILN--------SGASIQSWNHTSILLIPK-IRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHE
         T   + + IL+         G    S+   +I LIPK  +    + ++RPISL N+  KI+ K++ NR++  +  II   Q  FIPG     N+     
Subjt:  NTTVSNCLAILN--------SGASIQSWNHTSILLIPK-IRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHE

Query:  TLHYLQHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLS
         +HY+ +K + K  +  I LD  KA+D+++  ++ ++L + G    ++N++    +    +I +NGE    I    G RQG PLSPY+F +  E L+
Subjt:  TLHYLQHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLS

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.2e-25; % identity: 29.67)
Query:  VQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVGNTTVSNCLAILNSGASIQ
        ++D   ++++ +FS    S    +++ + +P+ VSE   + L  P T  E+  A+      K    DG    FFQ  WD +G             G    
Subjt:  VQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVGNTTVSNCLAILNSGASIQ

Query:  SWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQHKNRGKIGYTAIKLDMSKAYD
        S     + L+PK    RL+ ++RP+SL +  YKIV K I  RLK VL ++I   QS  +PGR I DN+ +  + LH+ +   R  +    + LD  KA+D
Subjt:  SWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQHKNRGKIGYTAIKLDMSKAYD

Query:  RVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALL
        RV+  YL   L    F  +++  +      A   + +N      +   RG+RQG PLS  ++ L  E    LL
Subjt:  RVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALL

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 5.4e-10; % identity: 48.39)
Query:  RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCK-PEEIGGLNFRDLISFNQAMLAKQ
        R+ + +  +++S   +FWW S  +KRK+ W  W++LCK  E+ GGL FRDL  FNQA+LAKQ
Subjt:  RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCK-PEEIGGLNFRDLISFNQAMLAKQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 2.5e-10; % identity: 25.71)
Query:  TVVQDTFETYFKEIFSSTNP--SQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVGNTTVSNCLAILNSG
        T V++    Y+  +  S +   +   + ++ +  P + ++ +   L A  +  E+  A+ +    K   PD F   FF   W +V ++T++       +G
Subjt:  TVVQDTFETYFKEIFSSTNP--SQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVGNTTVSNCLAILNSG

Query:  ASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVT
          ++ +N T+I LIPK+     +S +RP+S C V YKI+T
Subjt:  ASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVT

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 6.3e-14; % identity: 40.96)
Query:  IVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWI
        +V RLK ++ ++I   Q++FIPGR+ TDN++   E +H ++ K +G  G+  +KLD+ KAYDR+ W YL   L   GF   W+
Subjt:  IVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQHKNRGKIGYTAIKLDMSKAYDRVEWSYLRQILSKLGFHVEWI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.6e-09; % identity: 38.33)
Query:  IPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ
        +P+ +  QI S+ A FWW ++ + + MHW+ W+ L   +  GG+ F+D+ +FN A+L KQ
Subjt:  IPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQ

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 3.8e-11; % identity: 48.39)
Query:  RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCK-PEEIGGLNFRDLISFNQAMLAKQ
        R+ + +  +++S   +FWW S  +KRK+ W  W++LCK  E+ GGL FRDL  FNQA+LAKQ
Subjt:  RIPRRILSQISSLCAKFWWGSRGDKRKMHWRRWEELCK-PEEIGGLNFRDLISFNQAMLAKQ

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 3.6e-09; % identity: 56.52)
Query:  LMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRANGR
        ++NG   G + PSRG+RQGDPLSPY+F+LCTE LS L    +  GR
Subjt:  LMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRANGR
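The % identity values can be related to the alignment text itself. In BLAST-style output, the middle line repeats a residue where query and subject match and shows `+` for a conservative substitution. The following is a minimal sketch, assuming identity is the number of matching columns divided by the total number of aligned columns (the page does not state its exact formula); `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline character is a residue letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    if not query:
        raise ValueError("empty alignment")
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Query and middle line of the ATMG01250.1 alignment above.
query   = "LMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRANGR"
midline = "++NG   G + PSRG+RQGDPLSPY+F+LCTE LS L    +  GR"
print(round(percent_identity(query, midline), 2))
```

For this alignment the sketch counts 26 identical columns out of 46, in line with the 56.52 listed for the hit.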


Sequences
CDS sequence
ATGGGACGACGAAGGAATTGGATTGGTGGCATAGAGGATGTCGAGGGGCTTTGGCAAACTGAGACGACGGTTGTTCAGGATACCTTTGAGACTTATTTCAAAGAGATTTT
CTCTTCGACAAATCCCTCTCAAAGGGACATTGATAAAGTTCTTAACCATGTACCGATGAAAGTGTCTGAGGAGATGAATCAGATGTTGATGGCTCCGTATACTCGTGCAG
AAGTGGAACTGGCCATCGAGAGTTTTCACCCAACAAAGGTTACAAGGCCTGATGGTTTTCCACCTGTGTTTTTTCAAAGGTGTTGGGATATTGTGGGTAACACAACGGTA
TCAAACTGCCTTGCGATTTTGAACTCAGGAGCATCGATTCAGTCATGGAACCATACAAGCATTCTGCTCATCCCAAAGATTCGTCATGCGAGGTTAGTGTCACATTATCG
TCCAATCAGTTTATGTAATGTTTCCTATAAAATTGTTACAAAGGTCATTGTCAATCGTCTTAAGATAGTGTTAAATGATATTATCGATGAGTGCCAATCGACATTTATCC
CTGGTAGATTAATAACTGATAATCTGATCATTTGTCATGAAACTCTACATTATCTTCAGCATAAGAATAGAGGGAAAATAGGGTACACTGCCATAAAACTTGATATGAGT
AAGGCCTATGACAGGGTGGAGTGGTCATATTTACGTCAAATTCTTTCTAAACTGGGATTTCATGTGGAGTGGATTAATTTGGTGATGAAGTGTTTGACCATGGCCTGTTT
TTCCATTTTAATGAATGGGGAAACTTTTGGTCATATTCAGCCATCCCGGGGAATTAGACAGGGTGACCCATTATCTCCTTACATGTTCTTGTTGTGCACAGAGGGTTTGT
CTGCTTTATTAGCTTCGACTAGAGCGAATGGTCGGTCTAGGGTGGCTATAGCAAGAATTCCTCGAAGGATTCTATCCCAAATATCCTCCCTTTGTGCTAAATTTTGGTGG
GGATCTCGAGGGGATAAGCGAAAGATGCACTGGAGACGTTGGGAGGAGTTATGCAAGCCAGAGGAGATAGGTGGACTAAATTTTCGAGACCTTATTAGTTTCAATCAGGC
AATGCTTGCTAAACAGAAAACCGAGCCAAGGAAGCAAAACCGGAATTTTGAGAGATTTTGGATGATTTTGGAGACTTTTTGA
mRNA sequence
ATGGGACGACGAAGGAATTGGATTGGTGGCATAGAGGATGTCGAGGGGCTTTGGCAAACTGAGACGACGGTTGTTCAGGATACCTTTGAGACTTATTTCAAAGAGATTTT
CTCTTCGACAAATCCCTCTCAAAGGGACATTGATAAAGTTCTTAACCATGTACCGATGAAAGTGTCTGAGGAGATGAATCAGATGTTGATGGCTCCGTATACTCGTGCAG
AAGTGGAACTGGCCATCGAGAGTTTTCACCCAACAAAGGTTACAAGGCCTGATGGTTTTCCACCTGTGTTTTTTCAAAGGTGTTGGGATATTGTGGGTAACACAACGGTA
TCAAACTGCCTTGCGATTTTGAACTCAGGAGCATCGATTCAGTCATGGAACCATACAAGCATTCTGCTCATCCCAAAGATTCGTCATGCGAGGTTAGTGTCACATTATCG
TCCAATCAGTTTATGTAATGTTTCCTATAAAATTGTTACAAAGGTCATTGTCAATCGTCTTAAGATAGTGTTAAATGATATTATCGATGAGTGCCAATCGACATTTATCC
CTGGTAGATTAATAACTGATAATCTGATCATTTGTCATGAAACTCTACATTATCTTCAGCATAAGAATAGAGGGAAAATAGGGTACACTGCCATAAAACTTGATATGAGT
AAGGCCTATGACAGGGTGGAGTGGTCATATTTACGTCAAATTCTTTCTAAACTGGGATTTCATGTGGAGTGGATTAATTTGGTGATGAAGTGTTTGACCATGGCCTGTTT
TTCCATTTTAATGAATGGGGAAACTTTTGGTCATATTCAGCCATCCCGGGGAATTAGACAGGGTGACCCATTATCTCCTTACATGTTCTTGTTGTGCACAGAGGGTTTGT
CTGCTTTATTAGCTTCGACTAGAGCGAATGGTCGGTCTAGGGTGGCTATAGCAAGAATTCCTCGAAGGATTCTATCCCAAATATCCTCCCTTTGTGCTAAATTTTGGTGG
GGATCTCGAGGGGATAAGCGAAAGATGCACTGGAGACGTTGGGAGGAGTTATGCAAGCCAGAGGAGATAGGTGGACTAAATTTTCGAGACCTTATTAGTTTCAATCAGGC
AATGCTTGCTAAACAGAAAACCGAGCCAAGGAAGCAAAACCGGAATTTTGAGAGATTTTGGATGATTTTGGAGACTTTTTGA
Protein sequence
MGRRRNWIGGIEDVEGLWQTETTVVQDTFETYFKEIFSSTNPSQRDIDKVLNHVPMKVSEEMNQMLMAPYTRAEVELAIESFHPTKVTRPDGFPPVFFQRCWDIVGNTTV
SNCLAILNSGASIQSWNHTSILLIPKIRHARLVSHYRPISLCNVSYKIVTKVIVNRLKIVLNDIIDECQSTFIPGRLITDNLIICHETLHYLQHKNRGKIGYTAIKLDMS
KAYDRVEWSYLRQILSKLGFHVEWINLVMKCLTMACFSILMNGETFGHIQPSRGIRQGDPLSPYMFLLCTEGLSALLASTRANGRSRVAIARIPRRILSQISSLCAKFWW
GSRGDKRKMHWRRWEELCKPEEIGGLNFRDLISFNQAMLAKQKTEPRKQNRNFERFWMILETF
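The protein sequence corresponds to a translation of the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a hypothetical helper `translate`; it illustrates the relationship between the two sequences and is not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last codons of the CDS listed above.
print(translate("ATGGGACGACGAAGGAATTGG"))
print(translate("GAGACTTTTTGA"))
```

The two fragments translate to `MGRRRNW` and `ETF`, which match the start and end of the protein sequence listed above.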