; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032085 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032085
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:24161268..24164476
RNA-Seq ExpressionLag0032085
SyntenyLag0032085
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAM93462.1 putative reverse transcriptase [Oryza sativa Japonica Group]4.4e-5934.41Show/hide
Query:  WNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDR
        WN TNIV IPK + P ++ + RP  LCN+ YK ++KV+ANRLK IL +I  + Q AF+PGR ITDN++I +E  H+LQNKR GK GYAA+KLDMSKAYDR
Subjt:  WNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDR

Query:  VEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLS-----------VMLQTTYS------------
        VEWP+L  +L RLGF E    LIM+C+++ ++ I VNG     IK  RG+RQGDPLS YLF++C +  S           ++L+ T S            
Subjt:  VEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLS-----------VMLQTTYS------------

Query:  --CS-----------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWG
          CS                                               ++ Q WR+++NP+    +LLKA+YFP+  + +  ++   SY W+  L G
Subjt:  --CS-----------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWG

Query:  MDLLQKGTRRNLGNAK--------------------------------------------LQQHLCDKDVQLICSLPISQTTPDLWTWHNDRTGSYSVRS
        + LL+KG    +GN +                                            +QQ   + D ++I  +P+ +   D   W  D  G +SV+S
Subjt:  MDLLQKGTRRNLGNAK--------------------------------------------LQQHLCDKDVQLICSLPISQTTPDLWTWHNDRTGSYSVRS

Query:  GYKL
         YKL
Subjt:  GYKL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.0e-7129.94Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N  + I+ WN T I  IPK +QPR +SD+RPI LCN++YK ++K I NRLK ++  +  + Q AF+P R+I+DN+IIGHE LH + + + G  G AALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVML-------------------
        LD+SKA+DRVEW YL  ++ ++GF+E  I  I+ CIS+  FSI +NG   G  + SRGIRQGDPLS YLFLLC +GLS ++                   
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVML-------------------

Query:  ---------------------------------------------------------QTTYSC-------------------------------------
                                                                 Q    C                                     
Subjt:  ---------------------------------------------------------QTTYSC-------------------------------------

Query:  ---------------------SISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------
                              +++ VWR L++PNL + K+LK +YF  ++L   S  S SSYFWKGFLWG DLL KG R  +GN               
Subjt:  ---------------------SISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------

Query:  -----------------------------AKLQQHLCDKDVQLICSLPISQ-TTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPE----------
                                       +    C++D  LI S+PIS     D W WH D+ G+YSVRSGYKL M  +   +SA             
Subjt:  -----------------------------AKLQQHLCDKDVQLICSLPISQ-TTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPE----------

Query:  EITNQTGPKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK-----------------------------------------
        ++T  T  K F+W S H  IPT  NL    I     C I     E+  HA F CKRA++                                         
Subjt:  EITNQTGPKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK-----------------------------------------

Query:  ------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKA
              NDRN +IH + +  +E +C W+  +L  + +A
Subjt:  ------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKA

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]5.0e-7138.26Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N++ +I P NHT IV IPK+ +PR V++YRPI LCN+ Y  V K IANRLK  L+ I    Q AF+P R ITDN+IIG+E LH +++ +  K    ALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTK---------------------GLSV
        LD+ KAYDRVEW +L  V+ERLGF    I+LIM+CI++ +FS+++NG A G I   RG+RQG PLS YLFLLC +                     GL  
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTK---------------------GLSV

Query:  MLQTTYS-CSISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGNAK-------------------------
           + ++   +++Q WR+L+NP   + K++KARY+ ++   N    S  S+ W+  LWG  +LQKGTR  +GN +                         
Subjt:  MLQTTYS-CSISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGNAK-------------------------

Query:  -------------------LQQHLCDKDVQLICSLPISQT-TPDLWTWHNDRTGSYSVRSGYKLSM---MNQQETSSAQPEE------ITNQTGP-KHFV
                           + Q     D  +I S+ + +T   D   WH DR G YSV+SGY+L++     +  TSSA   +        N  G  K FV
Subjt:  -------------------LQQHLCDKDVQLICSLPISQT-TPDLWTWHNDRTGSYSVRSGYKLSM---MNQQETSSAQPEE------ITNQTGP-KHFV

Query:  WNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK
        W +    +PT  NLWR  I  E IC I K   E   HAL  CK A+K
Subjt:  WNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata]1.2e-6132.38Show/hide
Query:  NHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRV
        N T I  IPK + P+ VSD+RPI LCN+ YK + KV+ANRLK  L     + Q AF+ GR I+DN+++  ETLHYL+ K +GK G+ ALKLDMSKAYDRV
Subjt:  NHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRV

Query:  EWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------TTYSCS----
        EW ++  +++ LG  E +  +IM C+ S S+SIL+NG+ VGNIK SRG+RQG PLS YLFLLC  GL  +L+                   TY+      
Subjt:  EWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------TTYSCS----

Query:  -------------------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKG
                                                               +++QVWR+++N      ++ KAR+FP+ ++ +    +  SY WK 
Subjt:  -------------------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKG

Query:  FLWGMDLLQKGTRRNLGNAKLQQHLCDK----------------------------------------------DVQLICSLPIS-QTTPDLWTWHNDRT
         L   D+++KG    +GN +  +   DK                                              +  L+  +P+S +  PD  TW    +
Subjt:  FLWGMDLLQKGTRRNLGNAKLQQHLCDK----------------------------------------------DVQLICSLPIS-QTTPDLWTWHNDRT

Query:  GSYSVRSGYKL---SMMNQQETSSAQPEEITNQTG---------PKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFC
        G YS  S YKL   S      +SS Q  + +   G          KHFVW   +N++PT  NL R HI    +C + K  PE   H L FC
Subjt:  GSYSVRSGYKL---SMMNQQETSSAQPEEITNQTG---------PKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFC

XP_030932623.1 uncharacterized protein LOC115958352 [Quercus lobata]1.3e-6332.79Show/hide
Query:  NHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRV
        N T I  IPK + P+ VSD+RPI LCN+ YK + KV+ANRLK  L     + Q AF+ GR I+DN+++  ETLHYL+ K +GK G+ ALKLDMSK YDRV
Subjt:  NHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRV

Query:  EWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------TTYSCS----
        EW ++  +++ LG  E +  +IM C+ S S+SIL+NG+ VGNIK SRG+RQGDPLS YLFLLC  GL  +L+                   TY+      
Subjt:  EWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------TTYSCS----

Query:  -------------------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKG
                                                               +++QVWR+++NP     ++ KAR+FP+ ++ +    +  SY WK 
Subjt:  -------------------------------------------------------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKG

Query:  FLWGMDLLQKGTRRNLGNAKLQQHLCDK----------------------------------------------DVQLICSLPIS-QTTPDLWTWHNDRT
         L   D+++KG    +GN +  +   DK                                              +  L+  +P+S +  PD  TW    +
Subjt:  FLWGMDLLQKGTRRNLGNAKLQQHLCDK----------------------------------------------DVQLICSLPIS-QTTPDLWTWHNDRT

Query:  GSYSVRSGYKL---SMMNQQETSSAQPEEITNQTG---------PKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFC
        G YS  S YKL   S      +SS Q  + +   G          KHFVW   +N++PT  NL R HI    +C + K  PE   HAL FC
Subjt:  GSYSVRSGYKL---SMMNQQETSSAQPEEITNQTG---------PKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFC

TrEMBL top hitse value%identityAlignment
A0A2N9F6W6 Reverse transcriptase domain-containing protein8.4e-6434.77Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N    ++  N T+I  IPK + P L+SDYRPI LCN+ YK ++KVIANRLK++L  I  + Q AF+PGR ITDN+ +  E +H  + KRKGK+G  ALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------
        LDMSKAYDRVEW +L  +L +LGF E  + +IM C+ S  + +L++    G I +SRGIRQGDPLS YLFL+C +GLS +L+                  
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------

Query:  -----------------------------TTYSCS-----------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQ
                                       Y  S           +++Q WR+L+NP     ++ KARYFPS ++   +  ++ S+ W+  L G ++LQ
Subjt:  -----------------------------TTYSCS-----------ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQ

Query:  KGTRRNLGNAKLQQHLCDKDVQLICSLPISQTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITN-------------QTGPKHFVWNSFHN
        KG                    L    P +   P L  W   + GS++V+S Y L     + + S +   + +                 KHF+W ++H 
Subjt:  KGTRRNLGNAKLQQHLCDKDVQLICSLPISQTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITN-------------QTGPKHFVWNSFHN

Query:  SIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAK
        S+PT+ NL R  I     C I K   +TT HAL+ C  A+
Subjt:  SIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAK

A0A2N9FR17 Reverse transcriptase domain-containing protein2.1e-6235.29Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N    +   NHT I  IPK++ P  V+++RPI LCN+ YK ++KV+ANRLKIIL  I  + Q AF+ GR ITDN+++  ETLHY+ + + G+ G  ALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQTT----------------
        LDM+KAYDRVEW +L K++ R+GFH+  I L+ +CIS+ S+SILVNG+  G IK SRG+RQGDPLS YLFLLC +GL  ++                   
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQTT----------------

Query:  -------------------------------YSCSISEQVWR----------VLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQK
                                       Y  +  +QV R            E     + + L+A++FP  T+ N    +  SY W+  L   +L+ K
Subjt:  -------------------------------YSCSISEQVWR----------VLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQK

Query:  GTRRNLGNAKLQQHLCDKDVQLICSLPIS-QTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTGPKHFVWNSFHNSIPTMMNLWRHHI
          +    ++ +       DV+ I  +P+S Q   D   W  +  G YSVRSGY+  +           EE  N  G       +   ++PT +NL + HI
Subjt:  GTRRNLGNAKLQQHLCDKDVQLICSLPIS-QTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTGPKHFVWNSFHNSIPTMMNLWRHHI

Query:  PVEGICPIFKFNPETTEHALFFCKR
        P+   C +     E T HAL+ CK+
Subjt:  PVEGICPIFKFNPETTEHALFFCKR

A0A2N9GY38 Reverse transcriptase domain-containing protein4.0e-6631.2Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N  + ++  NHT I  IPK + P  V D+RPI LCN+ YK ++KV+ANRLKIIL  I  E Q AF+PGR ITDN+++  ETLH++Q+++KG+ G  ALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------
        LDMSKAYDRVEW YL +V+ER+GF    + ++M+CIS+ S+SILVNG+    IK SR +RQGDPLS YLFLLC +G   +LQ                  
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------

Query:  ---------------------------TTYSCS-----------------------------------------------------------ISEQVWRV
                                     Y+ S                                                           +++QVWR+
Subjt:  ---------------------------TTYSCS-----------------------------------------------------------ISEQVWRV

Query:  LENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------------------AKLQQ--HLCD-------
        + NP+    K+ KA+YFP  ++   +  S SSY WK  +   DL++KG+   +GN                           + L Q  HL D       
Subjt:  LENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------------------AKLQQ--HLCD-------

Query:  ----------KDVQLICSLPI-SQTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTG-------------PKHFVWNSFHNSIPTMMN
                   D   I  +P+ S    D   W   R G Y+VRSGY L +++++  +   P + T  T               +HF+W S H S+PT  N
Subjt:  ----------KDVQLICSLPI-SQTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTG-------------PKHFVWNSFHNSIPTMMN

Query:  LWRHHIPVEGICPIFKFNPETTEHALFFCKRAKKNDRNHMIHNRPIPNIEVQCVWILDYLADYHKANQVVSKSILSL
        L   HI  +  C       ETT HAL+ CK  ++  +      R      +Q     D++  +H+  Q++S + L L
Subjt:  LWRHHIPVEGICPIFKFNPETTEHALFFCKRAKKNDRNHMIHNRPIPNIEVQCVWILDYLADYHKANQVVSKSILSL

A0A6J1DX30 uncharacterized protein LOC1110248744.9e-7229.94Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N  + I+ WN T I  IPK +QPR +SD+RPI LCN++YK ++K I NRLK ++  +  + Q AF+P R+I+DN+IIGHE LH + + + G  G AALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVML-------------------
        LD+SKA+DRVEW YL  ++ ++GF+E  I  I+ CIS+  FSI +NG   G  + SRGIRQGDPLS YLFLLC +GLS ++                   
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVML-------------------

Query:  ---------------------------------------------------------QTTYSC-------------------------------------
                                                                 Q    C                                     
Subjt:  ---------------------------------------------------------QTTYSC-------------------------------------

Query:  ---------------------SISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------
                              +++ VWR L++PNL + K+LK +YF  ++L   S  S SSYFWKGFLWG DLL KG R  +GN               
Subjt:  ---------------------SISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN---------------

Query:  -----------------------------AKLQQHLCDKDVQLICSLPISQ-TTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPE----------
                                       +    C++D  LI S+PIS     D W WH D+ G+YSVRSGYKL M  +   +SA             
Subjt:  -----------------------------AKLQQHLCDKDVQLICSLPISQ-TTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPE----------

Query:  EITNQTGPKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK-----------------------------------------
        ++T  T  K F+W S H  IPT  NL    I     C I     E+  HA F CKRA++                                         
Subjt:  EITNQTGPKHFVWNSFHNSIPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK-----------------------------------------

Query:  ------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKA
              NDRN +IH + +  +E +C W+  +L  + +A
Subjt:  ------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKA

A0A803PA46 Uncharacterized protein1.6e-6228.62Show/hide
Query:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK
        +N+  +I P N T I  IPK +QP  V D+RPI LC I YK ++K IANRLK++LN +    Q AF+PGR I+DN+II  E  H ++ K +G++G+ ALK
Subjt:  MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALK

Query:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------
        LDM+KA+DRVEW +LS +L+++ F      LI+DCI++A+F + VNGK  G+I  +RGIRQGDPLS YLFLLC +GLS +++                  
Subjt:  LDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ------------------

Query:  -----------------TTYSCS--------------------------------------------------------------------------ISE
                         +  SC+                                                                          +++
Subjt:  -----------------TTYSCS--------------------------------------------------------------------------ISE

Query:  QVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGNAK--------------------------------------
        Q WR+L++PN  + ++L ARY P+S+    S     S+ W+   WG +LL KG    +GN +                                      
Subjt:  QVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGNAK--------------------------------------

Query:  --LQQHLCDKDVQLICSLPIS-QTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTGP----------------KHFVWNSFHNSIPTM
          LQ       VQ I S+P+      D   W   ++G+YSV+SGY LS       SS  P +I + + P                KHFV+ +  N++PT 
Subjt:  --LQQHLCDKDVQLICSLPIS-QTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTGP----------------KHFVWNSFHNSIPTM

Query:  MNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK--------------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKANQV-VSKSILSLE---DF
         NL    I    +C       ET  HALFFC+  +               ++RN  + N+              +LA+Y  ++Q   S+  ++       
Subjt:  MNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKK--------------NDRNHMIHNRPIPNIEVQCVWILDYLADYHKANQV-VSKSILSLE---DF

Query:  YEMIVKGEDIIMHTDAAFDPVNRKSGLGIVFRDKHGVLQAGLA
        +E    G  + ++TDAA    + K+G G V R+  G + A +A
Subjt:  YEMIVKGEDIIMHTDAAFDPVNRKSGLGIVFRDKHGVLQAGLA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-1426.34Show/hide
Query:  KKESIQP--WNHTNIVFIPK-SRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGK-QGYAA
        +KE I P  +   +I+ IPK  R      ++RPI L NI  K + K++ANR++  +  + +  Q+ FIPG     N+    ++++ +Q+  + K + +  
Subjt:  KKESIQP--WNHTNIVFIPK-SRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGK-QGYAA

Query:  LKLDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLC-------------TKGLSVMLQTTY
        + +D  KA+D+++ P++ K L +LG     + +I       + +I++NG+ +       G RQG PLS  LF +               KG+ +  +   
Subjt:  LKLDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLC-------------TKGLSVMLQTTY

Query:  SCSISEQVWRVLENPNLTLLKLLK
            ++ +   LENP ++   LLK
Subjt:  SCSISEQVWRVLENPNLTLLKLLK

P11369 LINE-1 retrotransposable element ORF2 protein7.9e-1936.94Show/hide
Query:  IVFIPK-SRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWP
        I  IPK  + P  + ++RPI L NI  K + K++ANR++  +  I +  Q+ FIPG     N+      +HY+ NK K K  +  + LD  KA+D+++ P
Subjt:  IVFIPK-SRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWP

Query:  YLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLF
        ++ KVLER G     +++I    S    +I VNG+ +  I    G RQG PLS YLF
Subjt:  YLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLF

P14381 Transposon TX1 uncharacterized 149 kDa protein8.7e-1832.12Show/hide
Query:  IPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSK
        +PK    RL+ ++RP+ L +  YK V K I+ RLK +L ++ +  Q   +PGR+I DN+ +  + LH+    R+     A L LD  KA+DRV+  YL  
Subjt:  IPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSK

Query:  VLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ
         L+   F    +  +    +SA   + +N      +   RG+RQG PLS  L+ L  +    +L+
Subjt:  VLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQ

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)3.2e-1227.68Show/hide
Query:  PWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYD
        PW       IPK       S++RPI + +   + + +++A RL+  +     +   A I G ++ +++++      Y+ ++R+ ++ Y  + LD+ KA+D
Subjt:  PWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYD

Query:  RVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVN-GKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQTT
         V    + + L+RLG  E   + I   +S ++ +I V  G     I   RG++QGDPLS +LF      L   LQ+T
Subjt:  RVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVN-GKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQTT

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)6.3e-0833.33Show/hide
Query:  HYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLF
        HY++ +R   + Y  + LD+ KA+D V  P + + +   G  + +   IM  I+ A  +I+V G+    I    G++QGDPLS  LF
Subjt:  HYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLF

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.3e-1242.5Show/hide
Query:  IANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSKVLERLGFHE
        +  RLK ++ ++    Q +FIPGR  TDN++   E +H ++ K KG +G+  LKLD+ KAYDR+ W YL   L   GF E
Subjt:  IANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRVEWPYLSKVLERLGFHE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.1e-0733.33Show/hide
Query:  ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN
        +++Q +R++  P+  L +LL++RYFP S++   S  +  SY W+  + G +LL +G  R +G+
Subjt:  ISEQVWRVLENPNLTLLKLLKARYFPSSTLFNTSAKSHSSYFWKGFLWGMDLLQKGTRRNLGN

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-0560Show/hide
Query:  LVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLS
        ++NG   G +  SRG+RQGDPLS YLF+LCT+ LS
Subjt:  LVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAAAAGGAGTCGATCCAACCGTGGAATCATACCAATATTGTGTTTATTCCAAAATCTCGACAGCCAAGGTTAGTTTCTGATTATAGACCAATTAGGCTCTGTAA
TATTGCATATAAAACTGTTACAAAGGTTATTGCAAATAGATTGAAGATCATTTTGAACGATATCAAATATGAGTGCCAGTTGGCTTTTATTCCCGGTCGGTCTATAACTG
ATAATATGATTATTGGGCATGAAACTCTTCATTACCTTCAGAATAAAAGGAAAGGGAAGCAGGGCTATGCTGCTCTCAAATTAGATATGAGTAAGGCATATGATAGAGTA
GAATGGCCTTATCTGAGTAAGGTTTTGGAACGATTGGGCTTTCATGAGGACTTGATTCACCTCATTATGGATTGTATTTCCTCTGCTTCCTTTTCTATCCTGGTTAATGG
GAAGGCAGTGGGAAATATTAAATCATCTCGGGGTATACGACAAGGAGATCCCTTGTCTTCTTATCTGTTCCTGCTATGTACGAAAGGCCTCTCAGTCATGTTACAAACAA
CCTATTCTTGTTCGATCTCAGAACAAGTTTGGCGAGTGTTGGAGAATCCAAACTTAACTTTGTTGAAACTGCTTAAGGCGAGGTATTTCCCTTCATCTACATTGTTTAAC
ACTTCGGCCAAGTCTCACTCCTCGTATTTTTGGAAGGGTTTCCTTTGGGGCATGGACTTACTTCAAAAGGGCACTCGTAGAAATCTTGGGAATGCAAAACTTCAACAACA
TTTGTGTGATAAGGACGTTCAATTGATTTGTAGTCTCCCTATTAGTCAAACGACTCCAGATTTATGGACATGGCATAATGATAGGACTGGATCGTACTCTGTTCGAAGTG
GATATAAGCTTAGCATGATGAATCAACAGGAAACTTCTTCGGCTCAACCAGAAGAAATCACCAACCAAACGGGCCCTAAACATTTTGTTTGGAATTCGTTTCATAACTCT
ATTCCAACGATGATGAATCTTTGGCGCCACCATATTCCCGTGGAGGGGATTTGTCCTATTTTCAAATTCAATCCAGAAACGACAGAGCACGCCTTGTTCTTTTGCAAGCG
TGCTAAGAAGAACGATAGAAATCACATGATTCACAACCGTCCTATTCCAAATATTGAGGTTCAATGTGTGTGGATTCTTGATTACTTGGCAGATTACCATAAGGCTAACC
AGGTGGTATCCAAATCTATTTTGTCATTGGAAGATTTTTATGAAATGATTGTTAAAGGGGAGGACATAATCATGCACACGGATGCGGCGTTTGACCCAGTAAATCGGAAA
TCTGGTCTAGGAATTGTTTTTAGAGACAAACATGGTGTTCTGCAGGCAGGGCTAGCTTGGTGTTGCAATTCTGAAGTGTCTTATCATCGTGATCCTCCCGGAACTCCAGA
TTTTTTGGAATATGTTTCAACTCCAATGAAGCTCTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAAAAGGAGTCGATCCAACCGTGGAATCATACCAATATTGTGTTTATTCCAAAATCTCGACAGCCAAGGTTAGTTTCTGATTATAGACCAATTAGGCTCTGTAA
TATTGCATATAAAACTGTTACAAAGGTTATTGCAAATAGATTGAAGATCATTTTGAACGATATCAAATATGAGTGCCAGTTGGCTTTTATTCCCGGTCGGTCTATAACTG
ATAATATGATTATTGGGCATGAAACTCTTCATTACCTTCAGAATAAAAGGAAAGGGAAGCAGGGCTATGCTGCTCTCAAATTAGATATGAGTAAGGCATATGATAGAGTA
GAATGGCCTTATCTGAGTAAGGTTTTGGAACGATTGGGCTTTCATGAGGACTTGATTCACCTCATTATGGATTGTATTTCCTCTGCTTCCTTTTCTATCCTGGTTAATGG
GAAGGCAGTGGGAAATATTAAATCATCTCGGGGTATACGACAAGGAGATCCCTTGTCTTCTTATCTGTTCCTGCTATGTACGAAAGGCCTCTCAGTCATGTTACAAACAA
CCTATTCTTGTTCGATCTCAGAACAAGTTTGGCGAGTGTTGGAGAATCCAAACTTAACTTTGTTGAAACTGCTTAAGGCGAGGTATTTCCCTTCATCTACATTGTTTAAC
ACTTCGGCCAAGTCTCACTCCTCGTATTTTTGGAAGGGTTTCCTTTGGGGCATGGACTTACTTCAAAAGGGCACTCGTAGAAATCTTGGGAATGCAAAACTTCAACAACA
TTTGTGTGATAAGGACGTTCAATTGATTTGTAGTCTCCCTATTAGTCAAACGACTCCAGATTTATGGACATGGCATAATGATAGGACTGGATCGTACTCTGTTCGAAGTG
GATATAAGCTTAGCATGATGAATCAACAGGAAACTTCTTCGGCTCAACCAGAAGAAATCACCAACCAAACGGGCCCTAAACATTTTGTTTGGAATTCGTTTCATAACTCT
ATTCCAACGATGATGAATCTTTGGCGCCACCATATTCCCGTGGAGGGGATTTGTCCTATTTTCAAATTCAATCCAGAAACGACAGAGCACGCCTTGTTCTTTTGCAAGCG
TGCTAAGAAGAACGATAGAAATCACATGATTCACAACCGTCCTATTCCAAATATTGAGGTTCAATGTGTGTGGATTCTTGATTACTTGGCAGATTACCATAAGGCTAACC
AGGTGGTATCCAAATCTATTTTGTCATTGGAAGATTTTTATGAAATGATTGTTAAAGGGGAGGACATAATCATGCACACGGATGCGGCGTTTGACCCAGTAAATCGGAAA
TCTGGTCTAGGAATTGTTTTTAGAGACAAACATGGTGTTCTGCAGGCAGGGCTAGCTTGGTGTTGCAATTCTGAAGTGTCTTATCATCGTGATCCTCCCGGAACTCCAGA
TTTTTTGGAATATGTTTCAACTCCAATGAAGCTCTACTGA
Protein sequenceShow/hide protein sequence
MNKKESIQPWNHTNIVFIPKSRQPRLVSDYRPIRLCNIAYKTVTKVIANRLKIILNDIKYECQLAFIPGRSITDNMIIGHETLHYLQNKRKGKQGYAALKLDMSKAYDRV
EWPYLSKVLERLGFHEDLIHLIMDCISSASFSILVNGKAVGNIKSSRGIRQGDPLSSYLFLLCTKGLSVMLQTTYSCSISEQVWRVLENPNLTLLKLLKARYFPSSTLFN
TSAKSHSSYFWKGFLWGMDLLQKGTRRNLGNAKLQQHLCDKDVQLICSLPISQTTPDLWTWHNDRTGSYSVRSGYKLSMMNQQETSSAQPEEITNQTGPKHFVWNSFHNS
IPTMMNLWRHHIPVEGICPIFKFNPETTEHALFFCKRAKKNDRNHMIHNRPIPNIEVQCVWILDYLADYHKANQVVSKSILSLEDFYEMIVKGEDIIMHTDAAFDPVNRK
SGLGIVFRDKHGVLQAGLAWCCNSEVSYHRDPPGTPDFLEYVSTPMKLY