; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007902 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007902
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:7620676..7624446
RNA-Seq ExpressionLag0007902
SyntenyLag0007902
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.3e-6931.11Show/hide
Query:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSY
        ++T E+N +LL P+++ EIE AI QM  +KALGPDGFP +FYQ +W+ VG  T+  C+  L     +  WN T++ALIPK+KQ + ISD+RPISLCNVSY
Subjt:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSY

Query:  KIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQTPR
        KI+ K + NR+K V+  V+S  QS FVP R+I DNVI+                                                     I++C+ T R
Subjt:  KIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQTPR

Query:  FSILLNGVPTGKIIPQRGAVA-------------------------RGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKV
        FSI LNG P G   P RG                             G L+ I   +    I+HL FADDSL+F ++   +    R  L SY +ASGQ +
Subjt:  FSILLNGVPTGKIIPQRGAVA-------------------------RGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKV

Query:  SIGKSALFFSHNVNVDFRLVISNLLGGK-----------------------------------------------------------------------E
        +  KSAL FS NV+ + +  +  +L  K                                                                       +
Subjt:  SIGKSALFFSHNVNVDFRLVISNLLGGK-----------------------------------------------------------------------E

Query:  VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQ
        V+K +Y    SL+ A   S  S FW+GF+W R                          + +TF+P+      +  D TV+  IT    W+++S+      
Subjt:  VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQ

Query:  EDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT
        ED ++I ++PIS  +  D W+WHY   G YSVRSGYKL   +      +S +  R   W S+WKL +  K+K FIWR+ HE +PT
Subjt:  EDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis]3.5e-6728.61Show/hide
Query:  FRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPILLD------------------------------
        F+  L DC L D+ F G  FTW N R  +  ++E LDR  AN+A++  F  + V HL  T SDH PIL+D                              
Subjt:  FRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPILLD------------------------------

Query:  --------------------------------------------------------------GTLVDLQAM--------------ADPGFYTTGAQAKVT
                                                                       TL DL+ +               D G         V 
Subjt:  --------------------------------------------------------------GTLVDLQAM--------------ADPGFYTTGAQAKVT

Query:  PEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIV
        PE N +L  P++  E+  A+ QMH +KA GPDG PT+FYQ+FW  VG       +G+L   ++VT  NDT + LIPKVK  K +S +RPISLCNV YK+V
Subjt:  PEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIV

Query:  VKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRFSI
         K+L NRM+ +L D++S NQS FV GR I DN++                                                      I+ C+ +  +S+
Subjt:  VKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRFSI

Query:  LLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIG
        L+NG P  K +P RG                         A   G L+ +   +  P +SHL FADDSL+F  A+++++   +  L  YE  SGQK+++ 
Subjt:  LLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIG

Query:  KSALFFSHNVNVDFRLVISNLLG-GKEVIKGRYAHHMSLMFAPIQSNCSVF--WRGFVWAR-------------QESTFRPIPLAGTT-----------I
        KSA+ FS NV +D +  + N +G G  V  G+Y   + L +   +S   +F   +  VW R             +E   + +  A  T           I
Subjt:  KSALFFSHNVNVDFRLVISNLLG-GKEVIKGRYAHHMSLMFAPIQSNCSVF--WRGFVWAR-------------QESTFRPIPLAGTT-----------I

Query:  RED-------ATVSELITPSMG-WNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDL--SSNDIQR-RWTWVSL
         ED         VS LI      W+++ L+E + + DV  I  IP+      D+ +WHY  NG++SVR  Y L  +IS G+++  S +D+Q   W W  +
Subjt:  RED-------ATVSELITPSMG-WNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDL--SSNDIQR-RWTWVSL

Query:  WKLKIHQKVKHFIWRAYHECLPTNYCL
        W L+I  K+K F W+     LP    L
Subjt:  WKLKIHQKVKHFIWRAYHECLPTNYCL

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]2.5e-5727.23Show/hide
Query:  QAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNV
        Q KVT  MN  L+  F+  E+ +A+ +M+ +KA G DG P +FYQ+FW+++ +  I  C+ +L     +   NDT +ALIPKV + + I ++RPISLCNV
Subjt:  QAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNV

Query:  SYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQT
         YKIV K L NRM+  L  VVS +QS FV GR I DN I+                                                     IM C+ +
Subjt:  SYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQT

Query:  PRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQ
         +FS ++NG   G++ PQRG                         A  RG +  +  G+    +SHLFFADDSLVF +A+  +   F+  L  Y  ASGQ
Subjt:  PRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQ

Query:  KVSIGKSALFFSHNVNVDFRLVISNLLGGK----------------------------------------------------------------------
         V+  KS + F  NV+   +  ++ L+G K                                                                      
Subjt:  KVSIGKSALFFSHNVNVDFRLVISNLLGGK----------------------------------------------------------------------

Query:  -----------------------------EVIKGRYAHHMSLMFAPIQSNCSVFWRGFVWARQ-------------------ESTFRPIPLA-----GTT
                                     +V+K  Y  +  ++ A   ++ S  WR  VW ++                   E ++ P P+         
Subjt:  -----------------------------EVIKGRYAHHMSLMFAPIQSNCSVFWRGFVWARQ-------------------ESTFRPIPLA-----GTT

Query:  IREDATVSELITPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVK
        + +   V +L  P   W+   +R V +  D ++I ++P S  D  D+ +WHY  +G YSVRSGY++A ++ V    S  +   +W W  LWKLKI  KVK
Subjt:  IREDATVSELITPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVK

Query:  HFIWRAYHECLPTNYCL
        HF+W+  H  +PTN  L
Subjt:  HFIWRAYHECLPTNYCL

XP_030502823.1 uncharacterized protein LOC115717993 [Cannabis sativa]4.3e-5730.84Show/hide
Query:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK
        VT +MN +L+ PF+  E+E A++ M    + G DG   +FYQ  W+ VG +     + +L      T  N T + LIPK+K+ + ++DYRPISLCNV  K
Subjt:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK

Query:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRF
        ++ K+LVNR K VL  V+S+ QS F+P R I DN++                                                     LIM C++T  F
Subjt:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRF

Query:  SILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVS
        S LLNG P G +IP RG                             GNL   K  +  P ISHLFFADDSL+FC+A+       + +L  Y++ASGQ  S
Subjt:  SILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVS

Query:  IGKSALFFSHNVNVDFRLVISNLLGGKE----------VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QEST
        +  +  +  +           NLL  ++          V+K RY  + S++ A    + S+ W+G  W +                            S+
Subjt:  IGKSALFFSHNVNVDFRLVISNLLGGKE----------VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QEST

Query:  FRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSL
        F P   +      +A VS LIT    W+L  L E     DV+ I TIP+S     D  IW+  S+G+Y+V++GY  A S+    + SS+     W W  L
Subjt:  FRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSL

Query:  WKLKIHQKVKHFIWRAYHECLP
        W LK+ +K+K F+WR ++E LP
Subjt:  WKLKIHQKVKHFIWRAYHECLP

XP_030930869.1 uncharacterized protein LOC115956706 [Quercus lobata]3.2e-6027.87Show/hide
Query:  MFRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPILLDGTLVDLQA-----------MADPGF----
        +FR  LD+C L D+ F G  FTWS   +    I E LDR   +  +      + V H+D T SDH+ + ++ +++D              + D G     
Subjt:  MFRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPILLDGTLVDLQA-----------MADPGF----

Query:  -------YTTGAQAK-----------------------VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKM
               Y      +                       VT EMN  L   F   E+E A+ QM   KA GPDG   +FYQ FW+ V    I + +G L  
Subjt:  -------YTTGAQAK-----------------------VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKM

Query:  IRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI--------------------------
               N T + LIPK+K  +F++ YRPISLCNV YKI  K+L NR+K VL ++ S++QS F+ GR I DN++                          
Subjt:  IRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI--------------------------

Query:  ---------------------------LIMECVQTPRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEIS
                                   L+MEC+ +  +SIL+NG P G I P RG                         A+ +G++  +   +  P+++
Subjt:  ---------------------------LIMECVQTPRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEIS

Query:  HLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIGKSALFFSHNVNVDFRLVISNLLGGKEVIKGRYAHHMSLMFAPIQSNCSVF--WRGFVWAR
        HLFFADDSL+FC+A+ ++       L+SYEK SG+K++  K+ALFFS +   + +  I   LG  E+   +Y  ++ L     ++  + F   +  +W R
Subjt:  HLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIGKSALFFSHNVNVDFRLVISNLLGGKEVIKGRYAHHMSLMFAPIQSNCSVF--WRGFVWAR

Query:  QESTFRPIPLAGTTIREDATVSELITP-SMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRW
         +     + L+     E + V  LI P +  W    +  V ++++   I+ IP+  ++  D  +W +  +G YSV+SGY+     S     +  D     
Subjt:  QESTFRPIPLAGTTIREDATVSELITP-SMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRW

Query:  TWVSLWKLKI
         W  +W L++
Subjt:  TWVSLWKLKI

TrEMBL top hitse value%identityAlignment
A0A2N9ESP5 Reverse transcriptase domain-containing protein9.9e-6030.54Show/hide
Query:  FRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPIL--LDGT------------------LVDLQAMA
        FR  L DC LQD+ + G  FTWSNRR     +   LDR  AN  +I  F N+ V H+    SDH  +L  +D T                  + + Q   
Subjt:  FRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPIL--LDGT------------------LVDLQAMA

Query:  DPGFYTT--------------------------------------GAQAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGD
         P    T                                         + VTP+MN  LL  FS  EI RA+ QM  SKA GPDG   +F+Q++W+ VGD
Subjt:  DPGFYTT--------------------------------------GAQAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGD

Query:  ---ITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDN-------------
             IL+     +M+ SV   N T++ LIPKVK  + ++ +RPISLCNV YKIV K+LVNRMK +L  + +K        R  +               
Subjt:  ---ITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDN-------------

Query:  -VILIMECVQTPRFSILLNGVPTGKIIPQRG------------AVARGNLSSIKPGK-------------FWPEISHLFFADDSLVFCKASIEQVWTFRS
         V LIM  V +  +S+L+NG P G I P RG             +    LS++   K               P ISHL FAD+S++FC+AS+       +
Subjt:  -VILIMECVQTPRFSILLNGVPTGKIIPQRG------------AVARGNLSSIKPGK-------------FWPEISHLFFADDSLVFCKASIEQVWTFRS

Query:  ALASYEKASGQKVSIGKSALFFSHNVNVDFRLVISNLLGG---------KEVIKGRYAHHMSLMFAPIQSNCSVFWRGF---------------------
         LA YE+ASGQK++  K+ALFFS N  +  R  I  + GG          ++IK +    M      + +   +  +G+                     
Subjt:  ALASYEKASGQKVSIGKSALFFSHNVNVDFRLVISNLLGG---------KEVIKGRYAHHMSLMFAPIQSNCSVFWRGF---------------------

Query:  --------VWARQESTFRPIPLAGTTIREDATVSELITPS-MGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKL---AQSI
                 W    STF+ I L    +   ATV  LI  + M WN++ L+E+    DV +I+ IP+S+    D+ IW    NG +SVRS Y L     S 
Subjt:  --------VWARQESTFRPIPLAGTTIREDATVSELITPS-MGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKL---AQSI

Query:  SVGQDLSSNDIQRRW-TWVSLWKLKIHQKVKHFIWRAYHECLPTN
        S+G   S   +   W  W ++W  +     K  +W+A    +P++
Subjt:  SVGQDLSSNDIQRRW-TWVSLWKLKIHQKVKHFIWRAYHECLPTN

A0A2N9FJ03 Reverse transcriptase domain-containing protein4.9e-5930Show/hide
Query:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGD---ITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNV
        VT  MN  L+  F   E+E+AI QM  SKA GPDG P IFYQ++W+ +G      +L+C+    +++S+   N T + LIPKVK  + ++D+RPISLCNV
Subjt:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGD---ITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNV

Query:  SYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQT
         YK+V K+L NR+K +L  V+S++QS FVPGR I DNV+                                                     L+ EC+ T
Subjt:  SYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQT

Query:  PRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQ
          +SIL+NG P G I+P RG                         A + G++  +   +  P+I+HLFFADDSL+F KA+I      +  L  YE+ASGQ
Subjt:  PRFSILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQ

Query:  KVSIGKSALFFSHNV----------NVDFRLV--------ISNLLG-GK-----------------------EVIKGRYAHHMSLMFAPIQSNCSVFWRG
        +V+  K+ +FFS  V           +D  ++        + +L+G G+                        V KG++  H S++    ++  S  W+ 
Subjt:  KVSIGKSALFFSHNV----------NVDFRLV--------ISNLLG-GK-----------------------EVIKGRYAHHMSLMFAPIQSNCSVFWRG

Query:  FVWARQ-------------------------ESTFRPIPLAGTTIREDATVSELI-TPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSN
         + AR                          E   R I   G  +  ++TV +LI  P M W+ + + ++    D   I+ IP+S     D++ W   +N
Subjt:  FVWARQ-------------------------ESTFRPIPLAGTTIREDATVSELI-TPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSN

Query:  GMYSVRSGYKLAQSISVGQ--DLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT
        G YSV+SGY+    +   Q    S+N++     W ++W L+I +K +HF WRA  + LPT
Subjt:  GMYSVRSGYKLAQSISVGQ--DLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT

A0A2N9I4W7 Reverse transcriptase domain-containing protein1.6e-6232.27Show/hide
Query:  WSNRRDRTTQINECLDRFRA---NEAFIQTFSNSSVRHLDWTQSDHRPILLDGTLVDLQAMADPGFYTTGAQAKVTPEMNAKLLTPFSRGEIERAINQMH
        ++N+R RT +I    D   A    EA ++   N   R++  T +   P  +D   V + AM             V+ +MN  LL P +  E++ A+ QM+
Subjt:  WSNRRDRTTQINECLDRFRA---NEAFIQTFSNSSVRHLDWTQSDHRPILLDGTLVDLQAMADPGFYTTGAQAKVTPEMNAKLLTPFSRGEIERAINQMH

Query:  SSKALGPDGFPTIFYQQFWNEVG-DIT--ILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQS
         SKA GPDG   +F+Q++W+ +G DIT  +L+C+   K+++SV   N  H+ALIPKV   + +  +RPISLCNV YK+V K+LVNR+K VL  ++S +QS
Subjt:  SSKALGPDGFPTIFYQQFWNEVG-DIT--ILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQS

Query:  GFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRFSILLNGVPTGKIIPQRG------
         FVPGR I DNV+                                                     L+MEC+    +SILLNG PTG IIP RG      
Subjt:  GFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRFSILLNGVPTGKIIPQRG------

Query:  -------AVARGNLSSIKP------------GKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIGKSALFFSHNV-----------
                 A G  S I+              +  P  SHLFFADDS++FC+A IE+    R  L  YE ASGQK+++ K++LFFS N            
Subjt:  -------AVARGNLSSIKP------------GKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIGKSALFFSHNV-----------

Query:  -------NVDFRLVISNLLGGKEVIKGRYA-----HHMSLMFAP------IQSNCSVFWRGFVWARQESTFRPIPLAGTTIREDATVSELITPSMGWNLS
                ++  L +  ++ GKEV+    A     + MS+   P      I S  S +W G     Q+   R I   G          + +  S  WN+ 
Subjt:  -------NVDFRLVISNLLGGKEVIKGRYA-----HHMSLMFAP------IQSNCSVFWRGFVWARQESTFRPIPLAGTTIREDATVSELITPSMGWNLS

Query:  SLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDL--SSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT
         + E+    +  II++IP+S   + D  IW    NGMYSV+S Y L        +   SSN  + R  W  +W+  + QK+K FIW+A    LPT
Subjt:  SLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDL--SSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT

A0A6J1DX30 uncharacterized protein LOC1110248743.1e-6931.11Show/hide
Query:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSY
        ++T E+N +LL P+++ EIE AI QM  +KALGPDGFP +FYQ +W+ VG  T+  C+  L     +  WN T++ALIPK+KQ + ISD+RPISLCNVSY
Subjt:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSY

Query:  KIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQTPR
        KI+ K + NR+K V+  V+S  QS FVP R+I DNVI+                                                     I++C+ T R
Subjt:  KIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVIL-----------------------------------------------------IMECVQTPR

Query:  FSILLNGVPTGKIIPQRGAVA-------------------------RGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKV
        FSI LNG P G   P RG                             G L+ I   +    I+HL FADDSL+F ++   +    R  L SY +ASGQ +
Subjt:  FSILLNGVPTGKIIPQRGAVA-------------------------RGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKV

Query:  SIGKSALFFSHNVNVDFRLVISNLLGGK-----------------------------------------------------------------------E
        +  KSAL FS NV+ + +  +  +L  K                                                                       +
Subjt:  SIGKSALFFSHNVNVDFRLVISNLLGGK-----------------------------------------------------------------------E

Query:  VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQ
        V+K +Y    SL+ A   S  S FW+GF+W R                          + +TF+P+      +  D TV+  IT    W+++S+      
Subjt:  VIKGRYAHHMSLMFAPIQSNCSVFWRGFVWAR--------------------------QESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQ

Query:  EDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT
        ED ++I ++PIS  +  D W+WHY   G YSVRSGYKL   +      +S +  R   W S+WKL +  K+K FIWR+ HE +PT
Subjt:  EDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPT

A0A803PM52 Uncharacterized protein4.6e-5732.1Show/hide
Query:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK
        VT +MN +L+ PF+  E+E A++ M    + G DG   +FYQ  W+ VG +     + +L      T  N T + LIPK+K+ + ++DYRPISLCNV  K
Subjt:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK

Query:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRF
        ++ K+LVNR K VL  V+S+ QS F+P R I DN++                                                     LIM C++T  F
Subjt:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVI-----------------------------------------------------LIMECVQTPRF

Query:  SILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVS
        S LLNG P G +IP RG                             GNL   K  +  P ISHLFFADDSL+FC+A+       + +L  Y++ASGQ + 
Subjt:  SILLNGVPTGKIIPQRG-------------------------AVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVS

Query:  IGKSALFFSHNVNVDFRLVISNLLGGKEVIKGRYAHHMSLMFAPIQSNCSVFWRGFVWARQESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVV
            + ++ +N  ++     S  L  + +  G+      L +  I   C V      W    S+F P   +      +A VS LIT    W+L  L E  
Subjt:  IGKSALFFSHNVNVDFRLVISNLLGGKEVIKGRYAHHMSLMFAPIQSNCSVFWRGFVWARQESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVV

Query:  HQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLP
           DV+ I TIP+S     D  IW+  S+G+Y+V++GY  A S+    + SS+     W W  LW LK+ +K+K F+WR ++E LP
Subjt:  HQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.4e-1028.12Show/hide
Query:  FYTTGAQAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSV-------TDWNDTHLALIPKV-KQQ
        F  T    ++  E    L  P +  EI   IN + + K+ GPDGF   FYQ++  E+          +LK+ +S+         + +  + LIPK  +  
Subjt:  FYTTGAQAKVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSV-------TDWNDTHLALIPKV-KQQ

Query:  KFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQ
            ++RPISL N+  KI+ K+L NR++  +  ++  +Q GF+PG   + N+   +  +Q
Subjt:  KFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQ

P08548 LINE-1 reverse transcriptase homolog9.7e-1232.33Show/hide
Query:  PFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKV-KQQKFISDYRPISLCNVSYKIVVKLLVNRM
        P S  EI   I  +   K+ GPDGF + FYQ F  E+  I +     + K       + + ++ LIPK  K      +YRPISL N+  KI+ K+L NR+
Subjt:  PFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKV-KQQKFISDYRPISLCNVSYKIVVKLLVNRM

Query:  KWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQ
        +  +  ++  +Q GF+PG   + N+   +  +Q
Subjt:  KWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQ

P11369 LINE-1 retrotransposable element ORF2 protein9.7e-1233.09Show/hide
Query:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSV--TDWNDTHLALIPK-VKQQKFISDYRPISLCN
        K+  +    L +P S  EIE  IN + + K+ GPDGF   FYQ F  ++  I IL+ +     +       + +  + LIPK  K    I ++RPISL N
Subjt:  KVTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSV--TDWNDTHLALIPK-VKQQKFISDYRPISLCN

Query:  VSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNV
        +  KI+ K+L NR++  +  ++  +Q GF+PG   + N+
Subjt:  VSYKIVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNV

P14381 Transposon TX1 uncharacterized 149 kDa protein8.5e-1634.25Show/hide
Query:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK
        V+     +L TP +  E+ +A+  M  +K+ G DG    F+Q FW+ +G           K            L+L+PK    + I ++RP+SL +  YK
Subjt:  VTPEMNAKLLTPFSRGEIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYK

Query:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQTPR
        IV K +  R+K VL +V+  +QS  VPGR+IFDNV LI + +   R
Subjt:  IVVKLLVNRMKWVLHDVVSKNQSGFVPGRSIFDNVILIMECVQTPR

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.3e-0837.21Show/hide
Query:  EIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIV
        EI  A+  M  +KA GPD F   F+ + W  V D TI       +    +  +N T + LIPKV     +S +RP+S C V YKI+
Subjt:  EIERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIV

AT3G09510.1 Ribonuclease H-like superfamily protein4.6e-0930.77Show/hide
Query:  PIPLAGTTIREDATVSELITPSMG---WNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKL------AQSISVGQDLSSNDIQR
        P PL      ++ T++ L         W+ S + + V Q D   I  I ++ S   D+ IW+Y + G Y+VRSGY L          ++     S D++ 
Subjt:  PIPLAGTTIREDATVSELITPSMG---WNLSSLREVVHQEDVNIIETIPISISDNVDEWIWHYCSNGMYSVRSGYKL------AQSISVGQDLSSNDIQR

Query:  RWTWVSLWKLKIHQKVKHFIWRAYHECLPT
        R     +W L I  K+KHF+WRA  + L T
Subjt:  RWTWVSLWKLKIHQKVKHFIWRAYHECLPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCGCACGCCTTTGGATGATTGTAGGTTGCAAGATATGGAGTTTGTAGGTGACCTTTTCACATGGTCAAACAGACGGGACCGAACCACTCAGATAAATGAGTGTCT
GGATCGTTTCCGGGCAAATGAAGCTTTTATTCAGACTTTCTCGAATAGTTCTGTTCGGCACCTAGATTGGACTCAGTCGGATCACCGTCCCATTCTTCTTGATGGAACTC
TCGTGGACTTGCAGGCAATGGCGGATCCAGGATTTTATACTACGGGGGCACAAGCCAAGGTGACACCAGAAATGAATGCAAAGTTACTTACTCCCTTCTCAAGGGGTGAG
ATCGAAAGAGCAATTAACCAAATGCACTCATCCAAGGCTCTTGGACCTGATGGCTTTCCTACTATTTTCTATCAGCAATTTTGGAATGAGGTTGGTGACATTACTATTTT
GAATTGTATGGGTATGTTAAAAATGATTCGCTCTGTCACAGACTGGAATGATACTCATCTTGCTTTGATTCCCAAGGTCAAGCAACAGAAATTCATTTCTGATTATAGAC
CAATTAGTTTGTGCAATGTATCTTATAAAATAGTAGTCAAATTGTTGGTCAATCGCATGAAATGGGTTTTGCATGATGTTGTCTCTAAGAATCAATCAGGTTTTGTTCCT
GGAAGGTCGATTTTTGATAATGTGATTCTTATTATGGAGTGTGTTCAAACACCGCGGTTCTCCATTTTGCTAAATGGGGTGCCAACGGGAAAAATTATACCCCAGAGAGG
AGCAGTAGCTAGAGGTAATCTCTCTAGTATTAAACCAGGGAAATTTTGGCCAGAGATCTCTCACTTGTTCTTTGCAGATGATAGTCTCGTTTTCTGCAAAGCATCAATTG
AACAAGTATGGACTTTTCGTTCTGCATTGGCAAGTTACGAGAAGGCCTCAGGTCAGAAGGTCAGCATTGGCAAATCAGCTCTATTTTTCTCCCATAATGTGAATGTAGAT
TTCAGATTGGTGATCTCGAACTTGTTGGGGGGCAAGGAGGTTATCAAGGGAAGATATGCGCATCACATGTCTTTAATGTTTGCTCCAATCCAATCTAATTGCTCTGTATT
TTGGAGGGGTTTTGTATGGGCCCGTCAGGAGAGCACTTTTAGGCCAATCCCATTGGCAGGGACCACGATTCGAGAGGATGCTACCGTCTCTGAACTCATAACTCCATCAA
TGGGATGGAATTTGAGTAGTTTAAGGGAGGTGGTGCATCAAGAAGATGTGAATATTATTGAAACTATTCCAATCAGCATTTCAGATAATGTTGACGAATGGATATGGCAT
TACTGTTCTAATGGGATGTATTCAGTGCGCAGTGGATATAAACTTGCTCAATCCATATCCGTGGGCCAGGATTTGTCTAGTAATGATATTCAGCGAAGATGGACGTGGGT
TTCACTTTGGAAGTTGAAAATTCACCAAAAGGTGAAACATTTCATTTGGAGGGCTTATCATGAGTGCCTTCCCACTAATTACTGTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCGCACGCCTTTGGATGATTGTAGGTTGCAAGATATGGAGTTTGTAGGTGACCTTTTCACATGGTCAAACAGACGGGACCGAACCACTCAGATAAATGAGTGTCT
GGATCGTTTCCGGGCAAATGAAGCTTTTATTCAGACTTTCTCGAATAGTTCTGTTCGGCACCTAGATTGGACTCAGTCGGATCACCGTCCCATTCTTCTTGATGGAACTC
TCGTGGACTTGCAGGCAATGGCGGATCCAGGATTTTATACTACGGGGGCACAAGCCAAGGTGACACCAGAAATGAATGCAAAGTTACTTACTCCCTTCTCAAGGGGTGAG
ATCGAAAGAGCAATTAACCAAATGCACTCATCCAAGGCTCTTGGACCTGATGGCTTTCCTACTATTTTCTATCAGCAATTTTGGAATGAGGTTGGTGACATTACTATTTT
GAATTGTATGGGTATGTTAAAAATGATTCGCTCTGTCACAGACTGGAATGATACTCATCTTGCTTTGATTCCCAAGGTCAAGCAACAGAAATTCATTTCTGATTATAGAC
CAATTAGTTTGTGCAATGTATCTTATAAAATAGTAGTCAAATTGTTGGTCAATCGCATGAAATGGGTTTTGCATGATGTTGTCTCTAAGAATCAATCAGGTTTTGTTCCT
GGAAGGTCGATTTTTGATAATGTGATTCTTATTATGGAGTGTGTTCAAACACCGCGGTTCTCCATTTTGCTAAATGGGGTGCCAACGGGAAAAATTATACCCCAGAGAGG
AGCAGTAGCTAGAGGTAATCTCTCTAGTATTAAACCAGGGAAATTTTGGCCAGAGATCTCTCACTTGTTCTTTGCAGATGATAGTCTCGTTTTCTGCAAAGCATCAATTG
AACAAGTATGGACTTTTCGTTCTGCATTGGCAAGTTACGAGAAGGCCTCAGGTCAGAAGGTCAGCATTGGCAAATCAGCTCTATTTTTCTCCCATAATGTGAATGTAGAT
TTCAGATTGGTGATCTCGAACTTGTTGGGGGGCAAGGAGGTTATCAAGGGAAGATATGCGCATCACATGTCTTTAATGTTTGCTCCAATCCAATCTAATTGCTCTGTATT
TTGGAGGGGTTTTGTATGGGCCCGTCAGGAGAGCACTTTTAGGCCAATCCCATTGGCAGGGACCACGATTCGAGAGGATGCTACCGTCTCTGAACTCATAACTCCATCAA
TGGGATGGAATTTGAGTAGTTTAAGGGAGGTGGTGCATCAAGAAGATGTGAATATTATTGAAACTATTCCAATCAGCATTTCAGATAATGTTGACGAATGGATATGGCAT
TACTGTTCTAATGGGATGTATTCAGTGCGCAGTGGATATAAACTTGCTCAATCCATATCCGTGGGCCAGGATTTGTCTAGTAATGATATTCAGCGAAGATGGACGTGGGT
TTCACTTTGGAAGTTGAAAATTCACCAAAAGGTGAAACATTTCATTTGGAGGGCTTATCATGAGTGCCTTCCCACTAATTACTGTCTTTGA
Protein sequenceShow/hide protein sequence
MFRTPLDDCRLQDMEFVGDLFTWSNRRDRTTQINECLDRFRANEAFIQTFSNSSVRHLDWTQSDHRPILLDGTLVDLQAMADPGFYTTGAQAKVTPEMNAKLLTPFSRGE
IERAINQMHSSKALGPDGFPTIFYQQFWNEVGDITILNCMGMLKMIRSVTDWNDTHLALIPKVKQQKFISDYRPISLCNVSYKIVVKLLVNRMKWVLHDVVSKNQSGFVP
GRSIFDNVILIMECVQTPRFSILLNGVPTGKIIPQRGAVARGNLSSIKPGKFWPEISHLFFADDSLVFCKASIEQVWTFRSALASYEKASGQKVSIGKSALFFSHNVNVD
FRLVISNLLGGKEVIKGRYAHHMSLMFAPIQSNCSVFWRGFVWARQESTFRPIPLAGTTIREDATVSELITPSMGWNLSSLREVVHQEDVNIIETIPISISDNVDEWIWH
YCSNGMYSVRSGYKLAQSISVGQDLSSNDIQRRWTWVSLWKLKIHQKVKHFIWRAYHECLPTNYCL