; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022631 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022631
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr7:34661929..34666944
RNA-Seq ExpressionLag0022631
SyntenyLag0022631
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW39501.1 microtubule-associated protein AIR9 [Vitis vinifera]7.6e-6139.51Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  ++ +     ++NG  KG ++A+RGL QGDPLSPFLF LV D LSR+I R  + N+ EGF +G+N+  +SH QFADDTIFF + +E+    L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAYE
          +L  F  + GLK+N+                    I  +N + + + R AE + C  S  P  YLGLPLG NP++  FWDPVV+++    A       
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAYE

Query:  DFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGSKALLEIRGKKSLVLNPSRSS
        DFLW G  EGK   LV WD+V KP + GGL +GN+   N  LL KWLW +  E + LWH++I+S Y  H + W       +   E    K  VL PSR  
Subjt:  DFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGSKALLEIRGKKSLVLNPSRSS

Query:  LSYSFGFIRSLSDRDTTDLLSLLSLIEEI
        L ++  F R+LSD +  DL  L+  I+++
Subjt:  LSYSFGFIRSLSDRDTTDLLSLLSLIEEI

RVW87786.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.1e-5941.67Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W +G ++  ++ +     ++NG  KG ++A+RGL QGDPLSPFLF LV D LSR++ R  + N+ EGF +G+N+  +SH QFADDTIFF + +E+    L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL---ASSRE
          +L  F  + GLK+N+                    I  +N + + + R AE +GC  S  P  YLGLPLG NPR+  FWDPV++++ +RL   A    
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL---ASSRE

Query:  AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW
           DFLW G+ EGK   LV WD+V KP   GGL  GN+ + N  LL KWLW +  E + LWH++I+S Y  H + W
Subjt:  AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW

RVW92839.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.6e-6136.46Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  +F +     ++NG  KG I+A+RGL QGDPLSPFLF +V D LSR++ R  + N+FEGF +G+N+  +SH QFADD IFF S +E+  L L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL--------
          +L  F  + GLK+N+                    I  +N     + R AE + C  S  P  YLGLPLG NP+S  FWDPV++++  RL        
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL--------

Query:  ----ASSREAYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW---------------
                    DFLW GV EGK   LVSWD+V K   +GGL +G + + N  LL KWLW +  E  TLWH++I+S Y  H + W               
Subjt:  ----ASSREAYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW---------------

Query:  ----VTG----GGGSKALLEIRGKKSLVLNP---SRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVF
            V G    G     LL +   K++ ++    S    S++F F R+LSD +  DL SL+  ++ I         L  S   K+SW L  S +F
Subjt:  ----VTG----GGGSKALLEIRGKKSLVLNP---SRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVF

RVX14115.1 Protein SWEETIE [Vitis vinifera]7.1e-5933.33Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  +F +     ++NG  KG ++A+RGL QGDPLSPFLF +V D LSR++ +  + N+ EGF++G+N+  +SH QFADDTIFF S +E+  + L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY-
         ++L  F  + GLK+N+                    I  +N E + + R AE + C  S  P  YLGLPLG NP+++ FWDPV++++ +RL   ++AY 
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY-

Query:  --------------------------------------EDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKI
                                               DFLW GV EGK   LV+WD+V KP S+GGL  G + I N  LL KWLW +  E + LWH++
Subjt:  --------------------------------------EDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKI

Query:  IISKYDPHPSEWVTGG--------------------GGSKALLEIRGKK----SLVLNPSRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPS
        I+S Y  H + W                             LL +   K    S +L  +R   S++F F R+LSD +  DL  L+   + +      P 
Subjt:  IISKYDPHPSEWVTGG--------------------GGSKALLEIRGKK----SLVLNPSRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPS

Query:  GLEGSSVFKKSWGLEESLVF
                K+SW L  S +F
Subjt:  GLEGSSVFKKSWGLEESLVF

XP_022151711.1 uncharacterized protein LOC111019624 [Momordica charantia]7.1e-5939.43Show/hide
Query:  SNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLKIN
        S ++NGRP+G+I A+RGL QGDPLSPFLF+LVVD LSR+IS GV+    E FE+ + K  LSH QFAD+T+ FCSG   +F  LN +L FFE++ GLKIN
Subjt:  SNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLKIN

Query:  MGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYL--------------GLPLGHNPRSTLFWDPVVDKVRKRLASSREAYEDF
         G                   +L +NCE  K+  WA      +SS   ++               G+P   N   +LF  PV     K +    +   DF
Subjt:  MGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYL--------------GLPLGHNPRSTLFWDPVVDKVRKRLASSREAYEDF

Query:  LWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGS-------KALLEIRGKKSLVLN
        LWEGV+EG    LV+W  V KPL  GGL VGNLR+ N+  LAKWLW F+ E + LW KII+SKY+ HPS+W+  GG         KA+       SL L 
Subjt:  LWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGS-------KALLEIRGKKSLVLN

Query:  PS----------------RSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVFKKCLDLEECL
         S                   LS +F  I  LS++       LLS+ E ++F      G E SS+   S GL  SL   + L++   L
Subjt:  PS----------------RSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVFKKCLDLEECL

TrEMBL top hitse value%identityAlignment
A0A438DVJ6 Microtubule-associated protein AIR93.7e-6139.51Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  ++ +     ++NG  KG ++A+RGL QGDPLSPFLF LV D LSR+I R  + N+ EGF +G+N+  +SH QFADDTIFF + +E+    L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAYE
          +L  F  + GLK+N+                    I  +N + + + R AE + C  S  P  YLGLPLG NP++  FWDPVV+++    A       
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAYE

Query:  DFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGSKALLEIRGKKSLVLNPSRSS
        DFLW G  EGK   LV WD+V KP + GGL +GN+   N  LL KWLW +  E + LWH++I+S Y  H + W       +   E    K  VL PSR  
Subjt:  DFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGSKALLEIRGKKSLVLNPSRSS

Query:  LSYSFGFIRSLSDRDTTDLLSLLSLIEEI
        L ++  F R+LSD +  DL  L+  I+++
Subjt:  LSYSFGFIRSLSDRDTTDLLSLLSLIEEI

A0A438HTK2 LINE-1 retrotransposable element ORF2 protein3.4e-5941.67Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W +G ++  ++ +     ++NG  KG ++A+RGL QGDPLSPFLF LV D LSR++ R  + N+ EGF +G+N+  +SH QFADDTIFF + +E+    L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL---ASSRE
          +L  F  + GLK+N+                    I  +N + + + R AE +GC  S  P  YLGLPLG NPR+  FWDPV++++ +RL   A    
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL---ASSRE

Query:  AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW
           DFLW G+ EGK   LV WD+V KP   GGL  GN+ + N  LL KWLW +  E + LWH++I+S Y  H + W
Subjt:  AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW

A0A438I862 LINE-1 retrotransposable element ORF2 protein3.7e-6136.46Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  +F +     ++NG  KG I+A+RGL QGDPLSPFLF +V D LSR++ R  + N+FEGF +G+N+  +SH QFADD IFF S +E+  L L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL--------
          +L  F  + GLK+N+                    I  +N     + R AE + C  S  P  YLGLPLG NP+S  FWDPV++++  RL        
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRL--------

Query:  ----ASSREAYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW---------------
                    DFLW GV EGK   LVSWD+V K   +GGL +G + + N  LL KWLW +  E  TLWH++I+S Y  H + W               
Subjt:  ----ASSREAYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEW---------------

Query:  ----VTG----GGGSKALLEIRGKKSLVLNP---SRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVF
            V G    G     LL +   K++ ++    S    S++F F R+LSD +  DL SL+  ++ I         L  S   K+SW L  S +F
Subjt:  ----VTG----GGGSKALLEIRGKKSLVLNP---SRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVF

A0A438JYU3 Protein SWEETIE3.4e-5933.33Show/hide
Query:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL
        W  G ++  +F +     ++NG  KG ++A+RGL QGDPLSPFLF +V D LSR++ +  + N+ EGF++G+N+  +SH QFADDTIFF S +E+  + L
Subjt:  WSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLL

Query:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY-
         ++L  F  + GLK+N+                    I  +N E + + R AE + C  S  P  YLGLPLG NP+++ FWDPV++++ +RL   ++AY 
Subjt:  NHILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY-

Query:  --------------------------------------EDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKI
                                               DFLW GV EGK   LV+WD+V KP S+GGL  G + I N  LL KWLW +  E + LWH++
Subjt:  --------------------------------------EDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKI

Query:  IISKYDPHPSEWVTGG--------------------GGSKALLEIRGKK----SLVLNPSRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPS
        I+S Y  H + W                             LL +   K    S +L  +R   S++F F R+LSD +  DL  L+   + +      P 
Subjt:  IISKYDPHPSEWVTGG--------------------GGSKALLEIRGKK----SLVLNPSRSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPS

Query:  GLEGSSVFKKSWGLEESLVF
                K+SW L  S +F
Subjt:  GLEGSSVFKKSWGLEESLVF

A0A6J1DFI2 uncharacterized protein LOC1110196243.4e-5939.43Show/hide
Query:  SNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLKIN
        S ++NGRP+G+I A+RGL QGDPLSPFLF+LVVD LSR+IS GV+    E FE+ + K  LSH QFAD+T+ FCSG   +F  LN +L FFE++ GLKIN
Subjt:  SNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLKIN

Query:  MGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYL--------------GLPLGHNPRSTLFWDPVVDKVRKRLASSREAYEDF
         G                   +L +NCE  K+  WA      +SS   ++               G+P   N   +LF  PV     K +    +   DF
Subjt:  MGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYL--------------GLPLGHNPRSTLFWDPVVDKVRKRLASSREAYEDF

Query:  LWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGS-------KALLEIRGKKSLVLN
        LWEGV+EG    LV+W  V KPL  GGL VGNLR+ N+  LAKWLW F+ E + LW KII+SKY+ HPS+W+  GG         KA+       SL L 
Subjt:  LWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGS-------KALLEIRGKKSLVLN

Query:  PS----------------RSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVFKKCLDLEECL
         S                   LS +F  I  LS++       LLS+ E ++F      G E SS+   S GL  SL   + L++   L
Subjt:  PS----------------RSSLSYSFGFIRSLSDRDTTDLLSLLSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVFKKCLDLEECL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-0532.69Show/hide
Query:  PYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGL
        P  + I+NG+         G  QG PLSP LF +V++ L+R I +  +    +G ++G+ +V LS   FADD I +      S   L  +++ F  + G 
Subjt:  PYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGL

Query:  KINM
        KIN+
Subjt:  KINM

P08548 LINE-1 reverse transcriptase homolog2.0e-0820.28Show/hide
Query:  FGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESM
        +  P  + I+NG          G  QG PLSP LF +V++ L+  I    +    +G  IG  ++ LS   FADD I +     DS   L  ++  + ++
Subjt:  FGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESM

Query:  FGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSL----------------------PSSYLG----LPLGHNPRSTLFWDPV
         G KIN    +     +    +  V   +       K++    ++  D+  L                      P S+LG    + +   P++   ++ +
Subjt:  FGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSL----------------------PSSYLG----LPLGHNPRSTLFWDPV

Query:  VDKVRKRLASSRE-AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNT-LWHKIIISKYDP
          K         E     F+W      +    ++  L+      GG+ + +LR++ K+++ K  W ++      +W++I   + DP
Subjt:  VDKVRKRLASSRE-AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNT-LWHKIIISKYDP

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-1221.81Show/hide
Query:  FGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESM
        +  P  +  +NG     I    G  QG PLSP+LF +V++ L+R I +  +    +G +IG+ +V +S    ADD I + S  ++S   L +++  F  +
Subjt:  FGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESM

Query:  FGLKINMGLPLG----------HNLRSTLFWDPVVDKILSLNCEVSK--------------------VRRWAEFVGCDMSSLPSSYLG----LPLGHNPR
         G KIN    +             +R T  +  V + I  L   ++K                    +RRW +        LP S++G    + +   P+
Subjt:  FGLKINMGLPLG----------HNLRSTLFWDPVVDKILSLNCEVSK--------------------VRRWAEFVGCDMSSLPSSYLG----LPLGHNPR

Query:  STLFWDPVVDKVRKRLASSRE-AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNT-LWHKIIISKYDPHPSEWV
        +   ++ +  K+  +  +  E A   F+W           ++  L+    + GG+ + +L+++ + ++ K  W +Y +     W++I   + +PH    +
Subjt:  STLFWDPVVDKVRKRLASSRE-AYEDFLWEGVDEGKSMLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNT-LWHKIIISKYDPHPSEWV

Query:  TGGGGSKALLEIRGKKSLVLN
            G+K    I+ KK  + N
Subjt:  TGGGGSKALLEIRGKKSLVLN

P92555 Uncharacterized mitochondrial protein AtMg012509.8e-1144.12Show/hide
Query:  IINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDT
        IING P+G +  +RGL QGDPLSP+LF+L  + LS +  R  +     G  +  N   ++H  FADDT
Subjt:  IINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDT

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)3.1e-0437.89Show/hide
Query:  GRPKGRIRATRGLHQGDPLSPFLFLLVVDAL--SRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLK
        G    +I   RG+ QGDPLSPFLF  V+D L  S   + G+ G       IG+ K+P+    FADD +      ED+ +LL   LA   + F L+
Subjt:  GRPKGRIRATRGLHQGDPLSPFLFLLVVDAL--SRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNHILAFFESMFGLK

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-0522.34Show/hide
Query:  SLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY---------------------------------------EDFLWEGVDEGKSMLLVSWDLV
        +LP  YLGLPL     +T  + P+V+K+R R+      +                                         FLW G +       V+W  V
Subjt:  SLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAY---------------------------------------EDFLWEGVDEGKSMLLVSWDLV

Query:  GKPLSQGGLEVGNLRIHNK---------TLLAKWLWCFYSESNTLWHKIIIS---KYDPH----PSEWVTGGGGSKALLEIRGKKSLV
          P  +GGL + +L+  NK         T L  W+W        L H+ + S   K+D H     S W         L+++ G +  +
Subjt:  GKPLSQGGLEVGNLRIHNK---------TLLAKWLWCFYSESNTLWHKIIIS---KYDPH----PSEWVTGGGGSKALLEIRGKKSLV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.0e-1244.12Show/hide
Query:  IINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDT
        IING P+G +  +RGL QGDPLSP+LF+L  + LS +  R  +     G  +  N   ++H  FADDT
Subjt:  IINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCAATCCGAAGACGACACGTGGAGCGATGGGAGAGTGACATGTGGGGCTTTTGGGTTGCCATACCGATCCAACATCATCAACGGGAGACCAAAAGGAAGAATTCG
GGCTACTCGGGGCCTTCATCAAGGGGACCCCCTATCCCCATTCCTCTTTCTCCTAGTTGTGGATGCCCTTAGTAGAATCATTTCTAGAGGAGTGGATGGCAATATTTTTG
AAGGTTTTGAGATCGGTCAGAACAAGGTTCCTTTATCCCACTTTCAGTTTGCAGATGATACAATCTTCTTTTGCTCCGGAAAAGAGGATTCCTTCTTGCTTTTGAACCAC
ATCTTGGCCTTCTTTGAATCCATGTTCGGCTTGAAAATTAATATGGGTTTGCCATTGGGGCATAATCTGAGATCTACGCTGTTCTGGGACCCTGTGGTGGATAAGATCTT
GAGCTTGAACTGTGAGGTTAGTAAGGTGAGGAGATGGGCAGAGTTTGTTGGTTGTGACATGAGTTCGCTCCCTTCTTCTTACTTAGGTTTGCCATTAGGGCATAATCCGA
GATCTACACTGTTTTGGGACCCTGTGGTGGATAAGGTAAGGAAAAGACTTGCAAGCTCTCGAGAAGCTTATGAGGACTTTCTGTGGGAAGGTGTGGATGAAGGCAAATCA
ATGCTTTTGGTGAGTTGGGATTTAGTGGGGAAACCTTTAAGTCAAGGGGGGCTCGAGGTGGGTAATTTGAGAATTCATAACAAAACTTTGCTGGCAAAATGGCTTTGGTG
CTTTTACTCTGAATCCAACACCCTGTGGCATAAGATTATTATTAGCAAATACGACCCTCATCCTTCTGAATGGGTGACGGGGGGAGGGGGATCAAAGGCACTTCTAGAAA
TCCGTGGAAAGAAATCTCTCGTTTTGAATCCTTCAAGGAGCTCTCTATCCTACTCTTTTGGCTTCATACGCTCGTTGTCTGATAGAGATACTACAGACCTCCTGTCTCTT
CTGTCTTTGATTGAGGAGATCTCCTTCAGATTGATGAGGCCTTCCGGTCTTGAAGGGTCTTCAGTCTTCAAGAAGTCTTGGGGTCTTGAAGAGTCTTTGGTCTTCAAGAA
ATGTTTAGATCTTGAAGAGTGTTTGGTCTTCAAGATTTGTGGATTTCAAGTTCTCCCTGGATGGGCCCTGGTGAATTTTGGGCCTAACAATTTGGGCTTGAGCTTTATTG
GTCTTGGGTTCAACTATTTGGTCTTGACTCTTGTTGTTATGACGTTCCCCGGCTCAAAGACACATGATGTCATTCCCACTTGGGCTCCTTTACTGCTGCAACGTCCTTCA
ACATTAGTACAAGGACATATGACGTCTTTGGACTCTTGTTGTTATGACGTTCCCCGGCCCAAGGACACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCAATCCGAAGACGACACGTGGAGCGATGGGAGAGTGACATGTGGGGCTTTTGGGTTGCCATACCGATCCAACATCATCAACGGGAGACCAAAAGGAAGAATTCG
GGCTACTCGGGGCCTTCATCAAGGGGACCCCCTATCCCCATTCCTCTTTCTCCTAGTTGTGGATGCCCTTAGTAGAATCATTTCTAGAGGAGTGGATGGCAATATTTTTG
AAGGTTTTGAGATCGGTCAGAACAAGGTTCCTTTATCCCACTTTCAGTTTGCAGATGATACAATCTTCTTTTGCTCCGGAAAAGAGGATTCCTTCTTGCTTTTGAACCAC
ATCTTGGCCTTCTTTGAATCCATGTTCGGCTTGAAAATTAATATGGGTTTGCCATTGGGGCATAATCTGAGATCTACGCTGTTCTGGGACCCTGTGGTGGATAAGATCTT
GAGCTTGAACTGTGAGGTTAGTAAGGTGAGGAGATGGGCAGAGTTTGTTGGTTGTGACATGAGTTCGCTCCCTTCTTCTTACTTAGGTTTGCCATTAGGGCATAATCCGA
GATCTACACTGTTTTGGGACCCTGTGGTGGATAAGGTAAGGAAAAGACTTGCAAGCTCTCGAGAAGCTTATGAGGACTTTCTGTGGGAAGGTGTGGATGAAGGCAAATCA
ATGCTTTTGGTGAGTTGGGATTTAGTGGGGAAACCTTTAAGTCAAGGGGGGCTCGAGGTGGGTAATTTGAGAATTCATAACAAAACTTTGCTGGCAAAATGGCTTTGGTG
CTTTTACTCTGAATCCAACACCCTGTGGCATAAGATTATTATTAGCAAATACGACCCTCATCCTTCTGAATGGGTGACGGGGGGAGGGGGATCAAAGGCACTTCTAGAAA
TCCGTGGAAAGAAATCTCTCGTTTTGAATCCTTCAAGGAGCTCTCTATCCTACTCTTTTGGCTTCATACGCTCGTTGTCTGATAGAGATACTACAGACCTCCTGTCTCTT
CTGTCTTTGATTGAGGAGATCTCCTTCAGATTGATGAGGCCTTCCGGTCTTGAAGGGTCTTCAGTCTTCAAGAAGTCTTGGGGTCTTGAAGAGTCTTTGGTCTTCAAGAA
ATGTTTAGATCTTGAAGAGTGTTTGGTCTTCAAGATTTGTGGATTTCAAGTTCTCCCTGGATGGGCCCTGGTGAATTTTGGGCCTAACAATTTGGGCTTGAGCTTTATTG
GTCTTGGGTTCAACTATTTGGTCTTGACTCTTGTTGTTATGACGTTCCCCGGCTCAAAGACACATGATGTCATTCCCACTTGGGCTCCTTTACTGCTGCAACGTCCTTCA
ACATTAGTACAAGGACATATGACGTCTTTGGACTCTTGTTGTTATGACGTTCCCCGGCCCAAGGACACATGA
Protein sequenceShow/hide protein sequence
MDQSEDDTWSDGRVTCGAFGLPYRSNIINGRPKGRIRATRGLHQGDPLSPFLFLLVVDALSRIISRGVDGNIFEGFEIGQNKVPLSHFQFADDTIFFCSGKEDSFLLLNH
ILAFFESMFGLKINMGLPLGHNLRSTLFWDPVVDKILSLNCEVSKVRRWAEFVGCDMSSLPSSYLGLPLGHNPRSTLFWDPVVDKVRKRLASSREAYEDFLWEGVDEGKS
MLLVSWDLVGKPLSQGGLEVGNLRIHNKTLLAKWLWCFYSESNTLWHKIIISKYDPHPSEWVTGGGGSKALLEIRGKKSLVLNPSRSSLSYSFGFIRSLSDRDTTDLLSL
LSLIEEISFRLMRPSGLEGSSVFKKSWGLEESLVFKKCLDLEECLVFKICGFQVLPGWALVNFGPNNLGLSFIGLGFNYLVLTLVVMTFPGSKTHDVIPTWAPLLLQRPS
TLVQGHMTSLDSCCYDVPRPKDT