CuGenDBv2

Lag0038341 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038341
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:15753517..15755012
RNA-Seq Expression: Lag0038341
Synteny: Lag0038341
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
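
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it apart follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr2:15753517..15755012")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr2 15753517 15755012 1496
```

Under that assumption the gene spans 1496 bp on chr2.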


Homology
GenBank top hits
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris] (e value: 1.5e-45; %identity: 29.5)
Query:  FNNIMSVSSEGKGGGLS---LLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNE
        FN ++S+ SE +GG +S    +     ++    L D+GFSG  YTW + KN      ERLD+F  +    D    + VEH   ++SDH  I++ + +  +
Subjt:  FNNIMSVSSEGKGGGLS---LLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNE

Query:  TLQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERTTNLNFERKVQCGLKAMKEWNRQRL---------------------------------------
          +  RK+   +   +W   +  +    ++W   + ++ L FE ++    + +  W++  L                                       
Subjt:  TLQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERTTNLNFERKVQCGLKAMKEWNRQRL---------------------------------------

Query:  -------------------------WFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDR
                                 +FH KASQR++RN ING+F+    W + +++I     +++ NL +++ PS  ++  V  A+   ISE+  + L R
Subjt:  -------------------------WFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDR

Query:  PFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
            EE+   L+ M+P+KAPGPDG HA+ YQ+FW ++G+D T V   I++     + LN+T I+ IPKVK P  +S+FRPISLCNV++KL+ K LANR+K
Subjt:  PFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e value: 5.5e-48; %identity: 32.42)
Query:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET
        FN I+S++ +  G   S    D    I++ CG  D+G+ G  YTW+  +  ++    RLD+     +   K   +KV H      DH A+++    DN  
Subjt:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET

Query:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSW------------------------------------RIPERTTNLNF-----------------------
           PR +     E  WT+ E  K    +SW                                    +I ++ + LN                        
Subjt:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSW------------------------------------RIPERTTNLNF-----------------------

Query:  ---ERKVQCGLKA----MKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLD
           + +   G +A    +KE +R   +FH++AS+RRK+N I GI++  G+W ++E+ I + A S+F+N+ S++ PS+  I EVT+AI  +++E+    L 
Subjt:  ---ERKVQCGLKA----MKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLD

Query:  RPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRM
        R F+ EE+   LK ++P KAPGPDG  A+ +QK+W ++G + T + L++LN+N  +  LN T IS IPK  +P  M+DFRPISLCNVVYKLI+K LANR+
Subjt:  RPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRM

Query:  K
        K
Subjt:  K

XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber] (e value: 5.7e-45; %identity: 32.29)
Query:  SLLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTE
        SL+     ++  CGL+D+GF G ++TW + K       ERLD+   +          KV H   H SDH+AIV++++        PR     K E  W +
Subjt:  SLLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTE

Query:  HEGSKEAFTSSWRIPERTTNL-------------------------------------NFERKVQCGLKAMKE------------------W--------
         EG  E   S+W        +                                       E  V  GLK   E                  W        
Subjt:  HEGSKEAFTSSWRIPERTTNL-------------------------------------NFERKVQCGLKAMKE------------------W--------

Query:  ----NRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNP
            +R   +FHSKAS R +RN+I G+ NS   W   EK++ + A ++F +L +T++PS+ S+  V +A+   ++++   +L  PF  EE+   L  M  
Subjt:  ----NRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNP

Query:  TKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
          APGPDG   + Y KFW V+G++ T   LD LNN      +N T I+ IPKVK P  MSD+RPISLCNVVYKL++K LANR K
Subjt:  TKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e value: 1.2e-47; %identity: 32.8)
Query:  ISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFT
        +S+C L D+GF G  YTW+  +  +  T+ RLD+   N    D+ +  +V H   H SDH  ++L ++  ++  QH  +    K E SW   +       
Subjt:  ISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFT

Query:  SSW-----------------------------------------------RIPE-------RTTNLNFERKV-------------QCGLKAMKEWNRQRL
         +W                                               R+ E       +   LN  +K+             +  +  ++  +R   
Subjt:  SSW-----------------------------------------------RIPE-------RTTNLNFERKV-------------QCGLKAMKEWNRQRL

Query:  WFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGA
        +FH+KASQRR++N I GI NS G+WVE+ +E+G+ AA +FDNL       +    E   A+ T+++ED +  L   F+ EE++  L  M PTKAPGPDG 
Subjt:  WFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGA

Query:  HAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
        +A+ YQKFW ++G+      LD LNN   L  +N T I  IPKV++P  MS+FRPISLCNV+YK+I+K LANR+K
Subjt:  HAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

XP_030939839.1 uncharacterized protein LOC115964720 [Quercus lobata] (e value: 1.9e-48; %identity: 34.5)
Query:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET
        F  I+S+  +  G   S    D   ++I+ C  +D+G++G  +TW   +   D     LD+     +     ++ +V H     SDH A+++     N+ 
Subjt:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET

Query:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSWR------IPE-RTTNLNFERKVQCGLKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIG
        +  P++R     E  WT  E  KE    +W        PE    +L      +  ++ +KE +R   +FH +AS+R+K+N I G+++ NG W ++   I 
Subjt:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSWR------IPE-RTTNLNFERKVQCGLKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIG

Query:  ETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPL
        E A  +F N+ ++++P  AS  +VT AI TR++++    L + FS +EI   LK M+PTKAPGPDG  A+ + K+W ++G   T + L++LN+N  +  L
Subjt:  ETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPL

Query:  NSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
        N T I+ IPK K P  M+DFRPISL NV YK+IAK L+NR+K
Subjt:  NSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

TrEMBL top hits
A0A2N9EX83 Reverse transcriptase domain-containing protein (e value: 3.3e-46; %identity: 33.85)
Query:  IISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAF
        ++ +CG VD+GF G  YTW   +       ERLD+     + + K  + +V H Q   SDH+ + +++ W    L+  RKR   + E  WT H G ++  
Subjt:  IISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAF

Query:  TSSWRIPERTTNLNFER----KVQCGLKAMKEWNR--QRLW----------FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPS
          +W    ++ +L F++          K  + W +  + LW          FH +A+ R++RN I+GI +  G W    +E+  T   ++ +L +T++P 
Subjt:  TSSWRIPERTTNLNFER----KVQCGLKAMKEWNR--QRLW----------FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPS

Query:  KASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWM
             E+   +   I+ D   +LD  F+  E+E  L  M P KAPGPDG   + YQK+W ++G D T   L  L +   L+ +N T I  IPKV++P  +
Subjt:  KASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWM

Query:  SDFRPISLCNVVYKLIAKALANRMK
         DFRPISLCNV+YK+IAK LANR+K
Subjt:  SDFRPISLCNVVYKLIAKALANRMK

A0A2N9H567 Reverse transcriptase domain-containing protein (e value: 6.6e-47; %identity: 31.41)
Query:  EEASKLKNKLGFNNIMSVSSEGKGGGLSLLWQDH-HII------------------------------------------------------------SN
        +    L+ KLG  + M V   G GGGL+LLW+D  HI+                                                            S+
Subjt:  EEASKLKNKLGFNNIMSVSSEGKGGGLSLLWQDH-HII------------------------------------------------------------SN

Query:  CGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFTSSW
        C L D+GF G ++TW   +   D   ERLD+     +      + ++ H  F  SDH A+VL++         P KR   + E+ W + +  +E   ++W
Subjt:  CGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFTSSW

Query:  RIP--------------ERTTNLNFERKVQCGLKAMKE----------W----NRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDN
          P              E   N N  RK    L AM+E          W    +R   +FH+ ASQRR++NEI+ + + NG+   S+ EI   A  +F  
Subjt:  RIP--------------ERTTNLNFERKVQCGLKAMKE----------W----NRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDN

Query:  LLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIP
        +  T+ P      +VT+ +  +++      L + F+ EEI T +  M PTKAPGPDG +A+ YQKFW ++G D T   LD   +   L+ +N T IS IP
Subjt:  LLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIP

Query:  KVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
        K K P  MS FRPISLCNV+YK+I+K LANR+K
Subjt:  KVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

A0A2N9ITS3 Reverse transcriptase domain-containing protein (e value: 4.7e-45; %identity: 30.16)
Query:  IISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRK-RNLVKMEHSWTEHEGSKEA
        ++ +CG++D+GF G  +TW  N++  +TT  RLD+     + ++K    ++EH     SDH+ + L++    E    P K R   + E  WT   G +  
Subjt:  IISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNETLQHPRK-RNLVKMEHSWTEHEGSKEA

Query:  FTSSW-RIPERTTNLNFERKVQCGLKAMKEWNRQ-----------------------------------------------RLW----------------
          + W +  + T       K++   K +  W+RQ                                               R+W                
Subjt:  FTSSW-RIPERTTNLNFERKVQCGLKAMKEWNRQ-----------------------------------------------RLW----------------

Query:  --FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPS--KASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGP
          FH +ASQRR+RN I GI N+ G W + ++E+      +++++  T+ PS  + ++  V + +++ ++E     L R F+  E+E  LK M P KAPGP
Subjt:  --FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPS--KASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGP

Query:  DGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
        DG   + YQ+FW V+G+D TR  L  LN+ + L  +N T I+ IPKVK+P  +++FRPISLCNV+YKL++K +ANR+K
Subjt:  DGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

A0A7N2L6Z9 Reverse transcriptase domain-containing protein (e value: 2.7e-48; %identity: 31.19)
Query:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET
        FN I+S+  +  G   + +  D   ++I++CG  D+G+SG  YTW   +        RLD+     + +++   +KV H     SDH A+ +    +++ 
Subjt:  FNNIMSVSSEGKGGGLSLLWQD--HHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNET

Query:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERTTNLNFERKVQCGLKA-------------------------------------------------
        ++ PR R     E  WT+ E  +    S W     T+++N    +  GLK                                                  
Subjt:  LQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERTTNLNFERKVQCGLKA-------------------------------------------------

Query:  --------------------MKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKI
                            +KE +R   +FH++AS+RRK+N I+G+++  G+W E    I   A ++F+++ ST+ PS     EVT AI T I+E+   
Subjt:  --------------------MKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKI

Query:  RLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALA
         L R F+ EEI T LK ++PTK+PGPDG  A+ +QK+W ++G + + + L++LN   SL+ +N T I  IPK  +P  M+DFRPISLCNV+YKLI+K LA
Subjt:  RLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALA

Query:  NRMK
        NR+K
Subjt:  NRMK

A0A803P3V6 Uncharacterized protein (e value: 2.1e-45; %identity: 31.09)
Query:  FNNIMSVSSEGKGGGL---SLLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNE
        FN I+  +SE KGGG+   S + +    IS C   ++   GG YTW  N    +   E+LD+   N +  +K R+ KV    +  SDHR ++++I  D +
Subjt:  FNNIMSVSSEGKGGGL---SLLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDNE

Query:  -TLQHPRKRNLVKMEHSWTEHEGSKEAFTSSW-----------------RIPERTTNLNFERKVQCGLKAMK---------------EWNRQR-------
         + + P+ R+    E +W E E   +   S+W                 +  E+    N ++K +   K  K               +W  +R       
Subjt:  -TLQHPRKRNLVKMEHSWTEHEGSKEAFTSSW-----------------RIPERTTNLNFERKVQCGLKAMK---------------EWNRQR-------

Query:  ----------------LW----------FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRL
                        LW          FH KASQR+K+N I G+F+   KW  S  EI E   +++ NL +++RPS      + +++   +S      L
Subjt:  ----------------LW----------FHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRL

Query:  DRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANR
           F+ E+++T +  ++P KAPG DG   + Y   W  +G++    CL +LNNNE  + +N TL+  IPKVKDP  +SDFRP+SLCNV+YK+I+K LANR
Subjt:  DRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANR

Query:  MK
        MK
Subjt:  MK

SwissProt top hits
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 3.6e-10; %identity: 25.69)
Query:  ERTTNLNFERKVQCGLKAMKEWNRQRLWFHSKAS-----------QRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS
        +  T +  E K     K +++ N  R WF  + +           ++R++N+I+ I N  G       EI  T   ++ +L +       ++ E+   + 
Subjt:  ERTTNLNFERKVQCGLKAMKEWNRQRLWFHSKAS-----------QRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS

Query:  T----RISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKV-KDPLWMSDFRPIS
        T    R+++++   L+RP +  EI   + S+   K+PGPDG  A  YQ++   L     ++   I              I  IPK  +D     +FRPIS
Subjt:  T----RISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKV-KDPLWMSDFRPIS

Query:  LCNVVYKLIAKALANRMK
        L N+  K++ K LANR++
Subjt:  LCNVVYKLIAKALANRMK

P08548 LINE-1 reverse transcriptase homolog (e value: 1.1e-11; %identity: 25.63)
Query:  TWAKNKNKQDTT---EERLDQFFVNGNLMDKARSIK----VEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERT
        TW  ++ K++ T   E+  +Q     NL D A+++     +    F +   R  V      N  + H   + L K EHS  +    KE            
Subjt:  TWAKNKNKQDTT---EERLDQFFVNGNLMDKARSIK----VEHSQFHQSDHRAIVLDIKWDNETLQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERT

Query:  TNLNFERKVQCGLKAMKEWNRQRLWFHSKAS-----------QRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS-TR
        T +  E       + +++ N+ + WF  K +           ++R ++ I+ I N N +      EI +    ++  L S    +   I +  +A    R
Subjt:  TNLNFERKVQCGLKAMKEWNRQRLWFHSKAS-----------QRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS-TR

Query:  ISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTL----ISFIPKV-KDPLWMSDFRPISLC
        +S+ +   L+RP S+ EI +T++++   K+PGPDG  +  YQ F     E+   + L++  N E    L +T     I+ IPK  KDP    ++RPISL 
Subjt:  ISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTL----ISFIPKV-KDPLWMSDFRPISLC

Query:  NVVYKLIAKALANRMK
        N+  K++ K L NR++
Subjt:  NVVYKLIAKALANRMK

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 6.0e-13; %identity: 31.07)
Query:  KAMKEWNRQRLWFHSKASQ-----------RRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS----TRISEDQKIRLD
        + ++  N+ R WF  K ++            R +  IN I N  G      +EI  T  SF+  L ST      ++ E+ K +      ++++DQ   L+
Subjt:  KAMKEWNRQRLWFHSKASQ-----------RRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAIS----TRISEDQKIRLD

Query:  RPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTL----ISFIPK-VKDPLWMSDFRPISLCNVVYKLIAKA
         P S +EIE  + S+   K+PGPDG  A  YQ F     ED   +   + +  E    L ++     I+ IPK  KDP  + +FRPISL N+  K++ K 
Subjt:  RPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTL----ISFIPK-VKDPLWMSDFRPISLCNVVYKLIAKA

Query:  LANRMK
        LANR++
Subjt:  LANRMK

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 6.9e-17; %identity: 28.8)
Query:  VQCGLKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIET
        V+  ++ + + +R   +F++   ++  R +I  +F  +G  +E  + I + A SF+ NL S +  S  +  E+   +   +SE +K RL+ P + +E+  
Subjt:  VQCGLKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRPSKASILEVTKAISTRISEDQKIRLDRPFSNEEIET

Query:  TLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK
         L+ M   K+PG DG     +Q FW  LG D  RV  +     E        ++S +PK  D   + ++RP+SL +  YK++AKA++ R+K
Subjt:  TLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLIAKALANRMK

Arabidopsis top hits
AT1G43760.1 DNAse I-like superfamily protein (e value: 5.2e-12; %identity: 26.11)
Query:  LKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRP--SKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTL
        +K +++ +    +FH      + +N I  +   +   VE+  ++ E   +++ +LL ++    +  S+  +      R ++    RL    S++EI   +
Subjt:  LKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNRP--SKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTL

Query:  KSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLI
         +M   KAPGPD   A  + + W V+ + T     +       L+  N+T I+ IPKV     +S FRP+S C VVYK+I
Subjt:  KSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISLCNVVYKLI


Sequences
CDS sequence
ATGAGAAGCGAGGAAGCAAGCAAGCTGAAGAACAAACTCGGTTTCAACAACATCATGAGCGTTAGTAGCGAAGGAAAAGGAGGGGGTTTGAGCCTCCTTTGGCAGGATCA
TCACATTATTAGTAATTGTGGCCTTGTTGATGTTGGCTTTTCTGGTGGCAGGTACACGTGGGCGAAAAATAAAAATAAGCAGGACACAACCGAGGAAAGGCTGGACCAGT
TCTTTGTGAATGGGAACTTGATGGACAAAGCCAGGAGCATCAAGGTGGAACATTCTCAATTTCACCAATCTGACCATAGAGCAATTGTGTTGGATATTAAGTGGGATAAT
GAGACTCTCCAACATCCAAGGAAAAGAAACTTGGTCAAAATGGAGCATAGCTGGACAGAGCATGAAGGGAGCAAGGAGGCTTTTACAAGCTCGTGGAGGATCCCTGAAAG
AACTACTAACCTCAACTTTGAGAGGAAGGTTCAGTGTGGTTTGAAAGCCATGAAAGAGTGGAACAGGCAGAGGCTGTGGTTCCATTCTAAGGCCTCCCAAAGGAGGAAAA
GAAACGAGATCAATGGTATTTTCAACTCTAATGGAAAGTGGGTGGAAAGTGAGAAGGAAATAGGGGAGACAGCAGCCAGCTTCTTCGATAATTTACTCAGCACCAACCGC
CCAAGCAAGGCTAGCATTTTGGAGGTGACAAAAGCTATTTCTACAAGAATCTCTGAGGACCAAAAAATAAGGCTGGATAGGCCTTTCTCAAACGAAGAAATTGAAACAAC
CTTAAAAAGCATGAATCCTACTAAGGCACCGGGTCCTGACGGTGCTCATGCAATGCTTTATCAGAAATTCTGGGTCGTTCTAGGGGAAGACACTACTCGGGTTTGCTTGG
ACATCCTTAACAACAATGAAAGCTTAGAGCCTCTCAATAGCACTTTGATATCCTTCATCCCAAAGGTGAAAGATCCCCTATGGATGAGTGATTTTAGACCTATAAGTCTA
TGCAATGTGGTGTATAAGCTCATAGCTAAAGCGCTTGCGAATAGAATGAAATAA
mRNA sequence
ATGAGAAGCGAGGAAGCAAGCAAGCTGAAGAACAAACTCGGTTTCAACAACATCATGAGCGTTAGTAGCGAAGGAAAAGGAGGGGGTTTGAGCCTCCTTTGGCAGGATCA
TCACATTATTAGTAATTGTGGCCTTGTTGATGTTGGCTTTTCTGGTGGCAGGTACACGTGGGCGAAAAATAAAAATAAGCAGGACACAACCGAGGAAAGGCTGGACCAGT
TCTTTGTGAATGGGAACTTGATGGACAAAGCCAGGAGCATCAAGGTGGAACATTCTCAATTTCACCAATCTGACCATAGAGCAATTGTGTTGGATATTAAGTGGGATAAT
GAGACTCTCCAACATCCAAGGAAAAGAAACTTGGTCAAAATGGAGCATAGCTGGACAGAGCATGAAGGGAGCAAGGAGGCTTTTACAAGCTCGTGGAGGATCCCTGAAAG
AACTACTAACCTCAACTTTGAGAGGAAGGTTCAGTGTGGTTTGAAAGCCATGAAAGAGTGGAACAGGCAGAGGCTGTGGTTCCATTCTAAGGCCTCCCAAAGGAGGAAAA
GAAACGAGATCAATGGTATTTTCAACTCTAATGGAAAGTGGGTGGAAAGTGAGAAGGAAATAGGGGAGACAGCAGCCAGCTTCTTCGATAATTTACTCAGCACCAACCGC
CCAAGCAAGGCTAGCATTTTGGAGGTGACAAAAGCTATTTCTACAAGAATCTCTGAGGACCAAAAAATAAGGCTGGATAGGCCTTTCTCAAACGAAGAAATTGAAACAAC
CTTAAAAAGCATGAATCCTACTAAGGCACCGGGTCCTGACGGTGCTCATGCAATGCTTTATCAGAAATTCTGGGTCGTTCTAGGGGAAGACACTACTCGGGTTTGCTTGG
ACATCCTTAACAACAATGAAAGCTTAGAGCCTCTCAATAGCACTTTGATATCCTTCATCCCAAAGGTGAAAGATCCCCTATGGATGAGTGATTTTAGACCTATAAGTCTA
TGCAATGTGGTGTATAAGCTCATAGCTAAAGCGCTTGCGAATAGAATGAAATAA
Protein sequence
MRSEEASKLKNKLGFNNIMSVSSEGKGGGLSLLWQDHHIISNCGLVDVGFSGGRYTWAKNKNKQDTTEERLDQFFVNGNLMDKARSIKVEHSQFHQSDHRAIVLDIKWDN
ETLQHPRKRNLVKMEHSWTEHEGSKEAFTSSWRIPERTTNLNFERKVQCGLKAMKEWNRQRLWFHSKASQRRKRNEINGIFNSNGKWVESEKEIGETAASFFDNLLSTNR
PSKASILEVTKAISTRISEDQKIRLDRPFSNEEIETTLKSMNPTKAPGPDGAHAMLYQKFWVVLGEDTTRVCLDILNNNESLEPLNSTLISFIPKVKDPLWMSDFRPISL
CNVVYKLIAKALANRMK
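
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal illustration, not the database's own method: it assumes the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and for brevity it translates only the final line of the CDS, on the assumption that this fragment starts on a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one terminal stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# Final line of the CDS sequence and final line of the protein sequence.
cds_tail = "TGCAATGTGGTGTATAAGCTCATAGCTAAAGCGCTTGCGAATAGAATGAAATAA"
protein_tail = "CNVVYKLIAKALANRMK"
print(translate(cds_tail) == protein_tail)  # True
```

The same function can be applied to the full CDS after joining its lines into a single string.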