; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006085 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006085
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:37113559..37114878
RNA-Seq ExpressionLag0006085
SyntenyLag0006085
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RYR04437.1 hypothetical protein Ahy_B06g084164 [Arachis hypogaea]3.6e-8440.68Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        MKGFQEA+    LLD GF+G  FTW   +  +  ++E LDR    +   +      V HL  Y SDH P++  +     K   R + N+ RFEE WL+ +
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLKSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMET-EGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQS
           +++  SW   SG  EH +  +L+ C + L QW K    G +   I++K++ I+ + T        M +++ + +L+ LL +EE  W  RSR  WL++
Subjt:  LAMKIVQNSWLKSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMET-EGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQS

Query:  GDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKA
        GD+NTK+FH KA QR KRNRIE I D      E +E+I ++  ++++ LFK+    +E  E+VA  V  R+S +  + L+  +++ EV  A+K ++P+KA
Subjt:  GDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKA

Query:  PGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRL
        PG+DG  A F+Q   DI+GE+T +  L  LN E D    N T L LIPK+K PK  +E+RPISLCNV +K+  K IANRLK++L  I+   Q AF PGRL
Subjt:  PGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRL

Query:  ISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        I+DN LV F+  H +     G +G V IKLDM+KAYD +E
Subjt:  ISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber]8.8e-8342.12Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        M+ FQ A+ RC L D GF G +FTW   +      +E LDR   N    +K  +  + H   ++SDH PI+  +   + + F        RFEE WL ++
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLKSSG--RGEHYINTRLKGCIKNLHQWSKVRLNGSI--ISAIRKKEEEIKRME-TEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREV
            +V +SW  S G   G      ++KGC  +L  W   +   ++  I  + KK E++ + E TE  R    E+  A   L+ LL ++E +W   SR  
Subjt:  LAMKIVQNSWLKSSG--RGEHYINTRLKGCIKNLHQWSKVRLNGSI--ISAIRKKEEEIKRME-TEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREV

Query:  WLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLN
        WL+ GDRNTK+FH+KA+QRRKRN I GI +++  W E   E+A+VA +YF+ +F S     E +EE    V QR++   K EL +PY+  EV+ A+  + 
Subjt:  WLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLN

Query:  PSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFV
        P+KAPG DG +A F+Q +  IVG D     L  LN+   +   N T + LIPK K P+ M +FRPISLCNV YKII+K +ANRLK +L  +ISPTQ AFV
Subjt:  PSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFV

Query:  PGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        PGRLI+DNVL+ +E +HA++  KKGK   +A+KLD+SKAYD VE
Subjt:  PGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]1.2e-8441.12Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATI---VMPSAKAFNRIKGNLLRFEEGWL
        M+ F+  +++C L +    GD+FTW R +     +KE LD  F+N          K+ HL+YY SDHR ++A I   + P A+A  + +    RFE+ WL
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATI---VMPSAKAFNRIKGNLLRFEEGWL

Query:  KFKLAMKIVQNSWLKSSGRG-EHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDM--EIEKAEMDLESLLEEEEYYWRSRSRE
        K K   +I+ NSWL SS       + + L+ C  NLH W   R  G +   I+  ++ +  + T      D   ++  AE  L+ LL  EE YW+ RSR 
Subjt:  KFKLAMKIVQNSWLKSSGRG-EHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDM--EIEKAEMDLESLLEEEEYYWRSRSRE

Query:  VWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSL
         WLQSGDRNTK+FH+KA+ R   NRI+ + D +       E I+ V  DYF+ LF +SN     +  V   +   +SD+Q   L + ++ +EV  A+K++
Subjt:  VWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSL

Query:  NPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAF
           K+PG+DG  A F+    +IVGE   +  L +LNN  +   FN+TL++LIPK K PKTM++FRPISLCNVTYKII+K +A R K+VL S+IS TQ AF
Subjt:  NPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAF

Query:  VPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        +  RLI+DN+LV FE +H++ +  +G +G  A+KLDMSKA+D VE
Subjt:  VPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]3.9e-8339.82Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        M+ F+  +++C L +   +GD+FTW + + +   +KE LD  F+N      +   K+ HL+YY SDHR ++A I     +     +    RFE+ WLK K
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWL-KSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDME--IEKAEMDLESLLEEEEYYWRSRSREVWL
           +I+ +SW   S       + + L  C  NL QW   R  G +   I+  ++ +  + T      D +  I  AE  L+ LL  EE YW+ RSR  WL
Subjt:  LAMKIVQNSWL-KSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDME--IEKAEMDLESLLEEEEYYWRSRSREVWL

Query:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS
        QSGDRNTK+FH+KA+ R   NRI+ + D +       E I+ + TDYF+ LF +SN     +  V   +   +SD+Q   L + ++ ++V   +K++   
Subjt:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS

Query:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG
        K+PG+DG  A F+    DIVG    +  L +LN+ A+   FN+TL++LIPK K PKTM++FRPISLCNVTYKII+K +A R K+VL S+IS TQ AF+  
Subjt:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG

Query:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        RLI+DN+LV FE +H++ +  +G +G VA KLDMSKA+D VE
Subjt:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]1.0e-8341.18Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        M+ F++A++ C  +D GFSG +FTW  G+ R +++ E LDR   N     +  + +V HLN Y+SDHRP++  + + S     R +    RFE  W+   
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLKSSGRGEHYIN--TRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGC-RWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWL
             V  +W     RG   +N  T++K C K L +WSK    G++   I+  +E++   E E   R +   ++  + +L  LLE+EE  W  RSR  WL
Subjt:  LAMKIVQNSWLKSSGRGEHYIN--TRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGC-RWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWL

Query:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS
        Q GD+NT++FH  AT R+++N I+G+ D+N  W+ +++  + + TD+++ LFKSSNP  +NI+ V   V + +++   ++L +PYS +EVE A+K + P 
Subjt:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS

Query:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG
        KAPG DG    F+Q Y   V  D  +  L  LN+ + +   N T ++LIPK K+P+ + EFRPISLCNV YKI++K IANRLK +L SIIS TQ AF+  
Subjt:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG

Query:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        RLI+DNVL+ FE +H + N   GK G +A+KLDMSKAYD VE
Subjt:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

TrEMBL top hitse value%identityAlignment
A0A2N9EDY7 Reverse transcriptase domain-containing protein1.3e-8741.95Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        M+GF++A++ C L+D GF G  FTW   +         LDR   NL   QK     V HL+  +SDH+ ++ T    +   F R      RFEE W    
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLKS-SGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDME-IEKAEMDLESLLEEEEYYWRSRSREVWLQ
           + +Q +W  +  G     +  +LK C + L  WS+ +  GSI   +R+K EE    E E  +   ++ + K + ++  LLE+EE  WR RSR  WL+
Subjt:  LAMKIVQNSWLKS-SGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDME-IEKAEMDLESLLEEEEYYWRSRSREVWLQ

Query:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK
         GDRNT++FH +A+QRR+RNRI G+ D    WRE+  E   +   +F+ +F++S P  ENIEE    V   +S +  + L   ++  EVE+A+K + P K
Subjt:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK

Query:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR
        APG DG    FFQ Y  +VG +  +  L  LN+   ++  N T ++LIPK K+P+ + EFRPISLCNVTYK+I+K IANRLK +L SIIS  Q AFVPGR
Subjt:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR

Query:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        LI+DNVL+ FE +H +++ K GK+G +A+KLDMSKAYD VE
Subjt:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

A0A2N9GPZ7 Reverse transcriptase domain-containing protein2.2e-8740.82Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        ++ F+EA+  C L D G+ G+ +TW R +     V   LDR   +++         V HL   +SDH PI+  I  P      R K  L RFE  W+K +
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLK--SSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ
           +++ ++W    + G     +  ++K C  +L  WS+ R  GS+ S+I++K E+++ +  E        I + + DL  LLE+EE +WR RSR  W+ 
Subjt:  LAMKIVQNSWLK--SSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ

Query:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK
         GD+NTK+FHA+  +RR+ N I G+ D++  W+ +  +IA++A DYF+ +F SSNP  E+I  V   +   +++    +L   ++K+EV +A+K + P+K
Subjt:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK

Query:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR
        APG DG  A F+Q Y DIVG +  +  L IL++   +   N T ++LIPK K+P+ + +FRPISLCNV YKI++K +ANRLKKVL  +IS  Q AFVPGR
Subjt:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR

Query:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        LI+DNVLV FE +H+++  +KGK+GQ+A+KLDMSKAYD VE
Subjt:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

A0A2N9IPS8 Reverse transcriptase domain-containing protein2.2e-8740.82Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        ++ F+EA+  C L D G+ G+ +TW R +     V   LDR   +++         V HL   +SDH PI+  I  P      R K  L RFE  W+K +
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLK--SSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ
           +++ ++W    + G     +  ++K C  +L  WS+ R  GS+ S+I++K E+++ +  E        I + + DL  LLE+EE +WR RSR  W+ 
Subjt:  LAMKIVQNSWLK--SSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ

Query:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK
         GD+NTK+FHA+  +RR+ N I G+ D++  W+ +  +IA++A DYF+ +F SSNP  E+I  V   +   +++    +L   ++K+EV +A+K + P+K
Subjt:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK

Query:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR
        APG DG  A F+Q Y DIVG +  +  L IL++   +   N T ++LIPK K+P+ + +FRPISLCNV YKI++K +ANRLKKVL  +IS  Q AFVPGR
Subjt:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR

Query:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        LI+DNVLV FE +H+++  +KGK+GQ+A+KLDMSKAYD VE
Subjt:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

A0A803P4U9 Uncharacterized protein3.5e-8539Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRI-KGNLLRFEEGWLKF
        M  F++ I+ C L +    G  FTW  G+     + E LDR   N   +   +   V  L++ +SDHRP+  T  + +    NRI +G+   FE+ W + 
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRI-KGNLLRFEEGWLKF

Query:  KLAMKIVQNSWL-KSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ
        +   +I+Q  W  K+SG     +   L+GC + LH+W+K R    +   I++ +++I  +    C+ + + ++K E DL  + E+ E YW+ RSR +WL+
Subjt:  KLAMKIVQNSWL-KSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQ

Query:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK
         GDRNTK+FH KA+QR+++N IEG++D    W+    +I+++A +YF+ LF  SN   E  + + GCV  R+S ++   L +P+ + EV  AM  ++P K
Subjt:  SGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSK

Query:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR
        APG DG    FFQ   ++VG++    CL +LNN+AD S  N TL+ LIPK+K P  + EFRPISLCNV YK+++K +ANR+K  L + IS  Q AF+ GR
Subjt:  APGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGR

Query:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        +I DN ++GFE +H +   + G   ++A+KLDMSKAYD VE
Subjt:  LISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

A0A803P5H2 Uncharacterized protein1.6e-8540.72Show/hide
Query:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK
        M+ F+  +++C   +   +GD+FTW R +     +KE LD  F+N        S K+ HL+YY SDHR ++A I   S       +    RFE+ WLK K
Subjt:  MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFK

Query:  LAMKIVQNSWLKSSGRG-EHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDM--EIEKAEMDLESLLEEEEYYWRSRSREVWL
           +I+ N WL SS       + + L+ C  NLH W   R  G +   I+  ++ +  + T      D   ++  AE  L+ LL  EE YW+ RSR  WL
Subjt:  LAMKIVQNSWLKSSGRG-EHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDM--EIEKAEMDLESLLEEEEYYWRSRSREVWL

Query:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS
        QSGDRNTK+FH+KA+ R   NRI+ + D +       E I+ V  DYF+ LF +SN     +  V   +   +SD+Q   L + ++ +EV  A+K++   
Subjt:  QSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPS

Query:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG
        K+PG+DG  A F+    +IVGE   +  L +LNN  +   FN+TL++LIPK K PKTM++FRPISLCNVTYKII+K +A R K+VL S+IS TQ AF+  
Subjt:  KAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPG

Query:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        RLI+DN+LV FE +H++ +  +G +G  A+KLDMSKA+D VE
Subjt:  RLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.5e-1825.24Show/hide
Query:  LHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWR
        L+ + + +    I +   + +E  K+ +T        EI K   +L+ +  ++     + SR  + +  ++  +       ++R++N+I+ I +      
Subjt:  LHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWR

Query:  EKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVS----QRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQ
            EI     +Y+KHL+ +   + EN+EE+   +      RL+ ++   L+RP + +E+   + SL   K+PG DG  A F+Q Y + +    ++    
Subjt:  EKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVS----QRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQ

Query:  ILNNEADISPFNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVA
        I       + F    + LIPK  +D    + FRPISL N+  KI+ K +ANR+++ ++ +I   QV F+PG     N+      I  IN  K   +  V 
Subjt:  ILNNEADISPFNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVA

Query:  IKLDMSKAYDMVE
        I +D  KA+D ++
Subjt:  IKLDMSKAYDMVE

P08548 LINE-1 reverse transcriptase homolog3.5e-1825.98Show/hide
Query:  KNLHQWSKVRLNGSIISA---IRKKEEE--------IKRMETEGCR----WEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQR
        +NL   +K  L G  I+    ++K E E        +K++E E           EI K   +L  +  +      ++S+  + +  ++  K       ++
Subjt:  KNLHQWSKVRLNGSIISA---IRKKEEE--------IKRMETEGCR----WEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQR

Query:  RKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEE-VAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAY
        R ++ I  I + N +      EI  +  +Y+K L+       + I++ +  C   RLS ++   L+RP S +E+   +++L   K+PG DG  + F+Q +
Subjt:  RKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEE-VAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAY

Query:  LDIVGEDTVRECLQILNN--EADISP--FNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGF
             E+ V   L +  N  +  I P  F    ++LIPK  KDP   + +RPISL N+  KI+ K + NR+++ ++ II   QV F+PG     N+    
Subjt:  LDIVGEDTVRECLQILNN--EADISP--FNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGF

Query:  ECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
          I  IN +K   +  + + +D  KA+D ++
Subjt:  ECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-1927.6Show/hide
Query:  NLHQWSKVRLNGSII--SAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYY-WRSRSREV----WLQSGDRNTKWFHAKATQ----------
        NL    K  L G +I  SA +KK E      T         +EK E +       +E    R    +V     +Q  ++   WF  K  +          
Subjt:  NLHQWSKVRLNGSII--SAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYY-WRSRSREV----WLQSGDRNTKWFHAKATQ----------

Query:  -RRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQ----RLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAF
          R +  I  I ++        EEI +    ++K L+ +   + EN++E+   + +    +L+  Q   L+ P S  E+E  + SL   K+PG DG  A 
Subjt:  -RRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQ----RLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAF

Query:  FFQAYLDIVGEDTVRECLQILNNEADI-----SPFNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISD
        F+Q +     ++ +   L  L ++ ++     + F    ++LIPK  KDP  ++ FRPISL N+  KI+ K +ANR+++ +++II P QV F+PG     
Subjt:  FFQAYLDIVGEDTVRECLQILNNEADI-----SPFNRTLLSLIPK-SKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISD

Query:  NVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        N+      IH IN +K   +  + I LD  KA+D ++
Subjt:  NVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

P14381 Transposon TX1 uncharacterized 149 kDa protein3.9e-2529Show/hide
Query:  LNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIAD
        LNG ++      E+ +   E +  + E +E ++A  ++E       +    RSR   L   DR +++F+A   ++  R +I  +F ++    E  E I D
Subjt:  LNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIAD

Query:  VATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFN
         A  ++++LF       +  EE+   +   +S+++K  L+ P + +E+  A++ +  +K+PG+DG    FFQ + D +G D  R   +            
Subjt:  VATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFN

Query:  RTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE
        R +LSL+PK  D + ++ +RP+SL +  YKI+AK I+ RLK VL  +I P Q   VPGR I DNV +  + +H     ++       + LD  KA+D V+
Subjt:  RTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.8e-1628.57Show/hide
Query:  EYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNP--RKENIEEVAGCVSQRLSDQQKSELDRPY
        E ++R +SR  WLQ GD NT++FH      + +N I+ +   +    E   ++ ++   Y+ HL  S +     ++++ +      R +D   S L    
Subjt:  EYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRIEGIFDKNSKWREKDEEIADVATDYFKHLFKSSNP--RKENIEEVAGCVSQRLSDQQKSELDRPY

Query:  SKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKII
        S  E+  A+ ++  +KAPG D   A FF     +V + T+    +       +  FN T ++LIPK      +  FRP+S C V YKII
Subjt:  SKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILNNEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-0839.06Show/hide
Query:  IANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMV
        +  RLK ++ ++I P Q +F+PGR+ +DN++   E +H++   KKG +G + +KLD+ KAYD +
Subjt:  IANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGGTTCCAAGAGGCTATCAACCGGTGTGTGCTCTTGGATGCTGGGTTCAGTGGTGATAAGTTCACTTGGATAAGGGGAAAGATTAGGAAAAAGCAAGTTAAAGA
GTGGTTGGATAGGTACTTTGTTAATTTAGCTTTATCTCAGAAAGTCCAGAGTATTAAAGTTGGTCATCTCAACTACTATTCTTCTGATCACAGGCCCATTGTTGCTACTA
TTGTGATGCCATCTGCCAAGGCTTTCAATAGGATAAAAGGTAATTTGCTGAGATTTGAGGAGGGTTGGTTAAAATTCAAGCTTGCAATGAAAATTGTTCAAAACTCTTGG
TTAAAGAGTTCTGGCAGGGGAGAGCATTATATTAATACCAGATTGAAAGGGTGCATCAAAAACCTCCATCAATGGAGCAAAGTAAGGCTTAATGGCAGTATCATTTCTGC
TATTAGGAAAAAAGAAGAAGAAATAAAGCGAATGGAAACCGAGGGTTGCAGGTGGGAAGATATGGAAATCGAAAAGGCTGAAATGGATTTAGAATCGCTTTTGGAGGAAG
AGGAGTACTATTGGAGAAGTAGATCGAGGGAAGTGTGGTTACAAAGTGGGGATAGGAACACCAAATGGTTCCATGCTAAAGCAACCCAAAGAAGGAAGAGAAATAGAATT
GAGGGCATCTTTGATAAAAACAGTAAATGGAGGGAGAAAGATGAGGAGATAGCAGATGTAGCCACCGATTATTTTAAGCACCTTTTCAAATCCTCCAATCCGAGGAAAGA
AAATATTGAAGAAGTTGCTGGATGTGTTAGTCAGAGACTCTCGGATCAACAGAAAAGTGAGCTAGATCGTCCGTATAGCAAAAATGAGGTGGAGGTAGCAATGAAAAGTC
TTAACCCAAGCAAGGCCCCGGGAATAGATGGTACTCACGCCTTCTTTTTCCAAGCTTATTTGGATATAGTGGGGGAAGATACAGTCAGAGAGTGTCTCCAAATTCTGAAC
AATGAGGCGGATATTTCTCCGTTCAATAGGACCTTACTATCCCTTATCCCTAAAAGTAAGGACCCTAAAACGATGCAAGAATTCAGACCGATCAGTCTTTGTAACGTTAC
ATATAAGATTATAGCAAAGACCATAGCAAATAGGCTTAAAAAGGTCCTTGAGTCGATCATCTCTCCAACTCAAGTGGCCTTTGTTCCAGGAAGGCTCATTTCTGACAATG
TCTTGGTTGGGTTTGAATGTATTCACGCTATCAACAATATAAAGAAAGGAAAGGAAGGTCAAGTGGCGATCAAGCTTGATATGAGCAAGGCGTATGATATGGTCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGGTTCCAAGAGGCTATCAACCGGTGTGTGCTCTTGGATGCTGGGTTCAGTGGTGATAAGTTCACTTGGATAAGGGGAAAGATTAGGAAAAAGCAAGTTAAAGA
GTGGTTGGATAGGTACTTTGTTAATTTAGCTTTATCTCAGAAAGTCCAGAGTATTAAAGTTGGTCATCTCAACTACTATTCTTCTGATCACAGGCCCATTGTTGCTACTA
TTGTGATGCCATCTGCCAAGGCTTTCAATAGGATAAAAGGTAATTTGCTGAGATTTGAGGAGGGTTGGTTAAAATTCAAGCTTGCAATGAAAATTGTTCAAAACTCTTGG
TTAAAGAGTTCTGGCAGGGGAGAGCATTATATTAATACCAGATTGAAAGGGTGCATCAAAAACCTCCATCAATGGAGCAAAGTAAGGCTTAATGGCAGTATCATTTCTGC
TATTAGGAAAAAAGAAGAAGAAATAAAGCGAATGGAAACCGAGGGTTGCAGGTGGGAAGATATGGAAATCGAAAAGGCTGAAATGGATTTAGAATCGCTTTTGGAGGAAG
AGGAGTACTATTGGAGAAGTAGATCGAGGGAAGTGTGGTTACAAAGTGGGGATAGGAACACCAAATGGTTCCATGCTAAAGCAACCCAAAGAAGGAAGAGAAATAGAATT
GAGGGCATCTTTGATAAAAACAGTAAATGGAGGGAGAAAGATGAGGAGATAGCAGATGTAGCCACCGATTATTTTAAGCACCTTTTCAAATCCTCCAATCCGAGGAAAGA
AAATATTGAAGAAGTTGCTGGATGTGTTAGTCAGAGACTCTCGGATCAACAGAAAAGTGAGCTAGATCGTCCGTATAGCAAAAATGAGGTGGAGGTAGCAATGAAAAGTC
TTAACCCAAGCAAGGCCCCGGGAATAGATGGTACTCACGCCTTCTTTTTCCAAGCTTATTTGGATATAGTGGGGGAAGATACAGTCAGAGAGTGTCTCCAAATTCTGAAC
AATGAGGCGGATATTTCTCCGTTCAATAGGACCTTACTATCCCTTATCCCTAAAAGTAAGGACCCTAAAACGATGCAAGAATTCAGACCGATCAGTCTTTGTAACGTTAC
ATATAAGATTATAGCAAAGACCATAGCAAATAGGCTTAAAAAGGTCCTTGAGTCGATCATCTCTCCAACTCAAGTGGCCTTTGTTCCAGGAAGGCTCATTTCTGACAATG
TCTTGGTTGGGTTTGAATGTATTCACGCTATCAACAATATAAAGAAAGGAAAGGAAGGTCAAGTGGCGATCAAGCTTGATATGAGCAAGGCGTATGATATGGTCGAATGA
Protein sequenceShow/hide protein sequence
MKGFQEAINRCVLLDAGFSGDKFTWIRGKIRKKQVKEWLDRYFVNLALSQKVQSIKVGHLNYYSSDHRPIVATIVMPSAKAFNRIKGNLLRFEEGWLKFKLAMKIVQNSW
LKSSGRGEHYINTRLKGCIKNLHQWSKVRLNGSIISAIRKKEEEIKRMETEGCRWEDMEIEKAEMDLESLLEEEEYYWRSRSREVWLQSGDRNTKWFHAKATQRRKRNRI
EGIFDKNSKWREKDEEIADVATDYFKHLFKSSNPRKENIEEVAGCVSQRLSDQQKSELDRPYSKNEVEVAMKSLNPSKAPGIDGTHAFFFQAYLDIVGEDTVRECLQILN
NEADISPFNRTLLSLIPKSKDPKTMQEFRPISLCNVTYKIIAKTIANRLKKVLESIISPTQVAFVPGRLISDNVLVGFECIHAINNIKKGKEGQVAIKLDMSKAYDMVE