CuGenDBv2

Lag0011800 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011800
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:33088977..33090326
RNA-Seq Expression: Lag0011800
Synteny: Lag0011800
Gene Ontology terms:
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
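
The genome location above follows the common `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the span length computed as follows. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a string such as 'chr1:33088977..33090326' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
    return end - start + 1

chrom, start, end = parse_location("chr1:33088977..33090326")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene occupies 1350 bp on chr1.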


Homology
GenBank top hits (columns: e value, % identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]; e value 1.9e-92; identity 40.58%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE E+G  +     I +EIL +F KLY     + + +EG++W PI       LE  F++EEIFKA+  +   K+PGPDG T   +++ W ++K DLVKV 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
         EF + GI+N+ TN ++I L+PK   + ++ D+RPISL+TSLYK+IAKVLA R++ VL  TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  VSW FLD +L +KGFG R                                               D + + +    E+ VL+G+++G N  
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDT+ F  + EE++   K +LL+    SGLK+NL KS++ GIN+++  L+  A    C  S  P  YLG  LGGN +   FWDP+++R  R+
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        LD WQK  +S GGRITL Q+ L  +P Y  SL K P +V   +E++ RDF+
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]; e value 5.6e-92; identity 40.58%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE E G  L     I +EIL +F KLY    ++ + +EG++W PID    S LE  F++EEI+KA+  +   K+PGPDG T   +++ W+++K DLV+V 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
         EF + GI+N+ TN ++I L+PK   + ++ D+RPISL+TSLYK+IAKVLA RL+ VL  TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  VSW FLD +L +KGF  R                                               D + + +    E+ VL+G+++G N  
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDT+ F    EE+L   K +LL+    SGLK+NL KS++ GIN+++  L+  A    C  S  P  YLG  LGGN +   FWDP+++R  R+
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        LD WQK  +S GGRITL Q+ L  +P Y  SL + P +V   +E++ R+F+
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

RVW78846.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]; e value 5.0e-93; identity 43.06%
Query:  MLETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKV
        +LE E G  L   + I +EIL +F KLY     + + +EG++W PI     S LE  F++EEI+KA+  +   K+PGPDG T   +++ W+++K DLV+V
Subjt:  MLETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKV

Query:  LQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLL
          EF + GI+N+ TN ++I L+PK   A K+ DYRPISL+TSLYK+IAKVLA RL+ +L  TI   Q AFVQGRQILDA+L+A+E V++ +   ++GV+ 
Subjt:  LQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLL

Query:  KLDLEKAYGMVSWQFLDEILALKGFGQR-----DAICKSVKFCL---------------EKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWK
        K+D EKAY  VSW FLD ++  KGF  +          SV F +               E+ V +G+++G N   ++ LQ+ADDT+ F    EE+L   K
Subjt:  KLDLEKAYGMVSWQFLDEILALKGFGQR-----DAICKSVKFCL---------------EKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWK

Query:  EILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLP
         +L++    SGLK+NL KS++ GIN+ +  L+  A    C  S  P  YLG  LGGN +  SFWDP+++R   +LD WQK  +S GGRITL Q+ L  +P
Subjt:  EILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLP

Query:  IYQFSLLKAPKAVIKNMEKLIRDFI
         Y  SL K P ++   +E+L RDF+
Subjt:  IYQFSLLKAPKAVIKNMEKLIRDFI

RVW98505.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]; e value 2.3e-93; identity 40.8%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE E G  L     I +EIL +F KLY     + + +EG++W PID    S LE  F++EEI+KA+  +   K+PGPDG T   +++ W+++K DLV+V 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
         EF + GI+N+ TN ++I L+PK  ++ ++ D+RPISL+TSLYK+IAKVLAERL+ VL  TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  VSW FLD +L +KGF  R                                               D + + +    E+ VL+G+++G N  
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDT+ F    EE++   K +LL+    SGLK+NL KS++ GIN+++  L+  A    C  S  P  YLG  LGGN +   FWDP+++R  R+
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        LD WQK  +S GGRITL Q+ L  +P Y  SL K P +V   +E++ R+F+
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

VVA31869.1 Hypothetical predicted protein, partial [Prunus dulcis]; e value 1.0e-93; identity 44.1%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE ED   +  E  I  E++ FF  L+  +    + ++G+NW PI       LE  F  EE+ KAV   G  KSPGPDG +  F+++ W ++K DL+KV+
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
        Q+FFQ GIVN  TNET+ICLIPK   + KV D+RPISLVTSLYKVI+KVLA RL+ VL +TIS  Q AFVQ RQILDA+LVA+E VE+ R + +KG++ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR--------------------DAICKSVKFCLEKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKE
        +D EKAY  V W F+D++LA KGFG R                    D + + ++   +  ++ G   G + ++++ LQ+ADDT+ F    EE      +
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR--------------------DAICKSVKFCLEKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKE

Query:  ILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPI
        +L L  + SG+K+N  KS ++GIN     LN  A  +GC     P  YLG  LGGN R ++FW+P+LD+ +++L +W++  +SKGGR+TL QAVL+S+P 
Subjt:  ILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPI

Query:  YQFSLLKAPKAVIKNMEKLIRDFI
        Y  SL K P  V   +E+L+R+F+
Subjt:  YQFSLLKAPKAVIKNMEKLIRDFI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A438H2J5 Transposon TX1 uncharacterized 149 kDa protein; e value 2.4e-93; identity 43.06%
Query:  MLETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKV
        +LE E G  L   + I +EIL +F KLY     + + +EG++W PI     S LE  F++EEI+KA+  +   K+PGPDG T   +++ W+++K DLV+V
Subjt:  MLETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKV

Query:  LQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLL
          EF + GI+N+ TN ++I L+PK   A K+ DYRPISL+TSLYK+IAKVLA RL+ +L  TI   Q AFVQGRQILDA+L+A+E V++ +   ++GV+ 
Subjt:  LQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLL

Query:  KLDLEKAYGMVSWQFLDEILALKGFGQR-----DAICKSVKFCL---------------EKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWK
        K+D EKAY  VSW FLD ++  KGF  +          SV F +               E+ V +G+++G N   ++ LQ+ADDT+ F    EE+L   K
Subjt:  KLDLEKAYGMVSWQFLDEILALKGFGQR-----DAICKSVKFCL---------------EKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWK

Query:  EILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLP
         +L++    SGLK+NL KS++ GIN+ +  L+  A    C  S  P  YLG  LGGN +  SFWDP+++R   +LD WQK  +S GGRITL Q+ L  +P
Subjt:  EILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLP

Query:  IYQFSLLKAPKAVIKNMEKLIRDFI
         Y  SL K P ++   +E+L RDF+
Subjt:  IYQFSLLKAPKAVIKNMEKLIRDFI

A0A438IPG2 Transposon TX1 uncharacterized 149 kDa protein; e value 1.1e-93; identity 40.8%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE E G  L     I +EIL +F KLY     + + +EG++W PID    S LE  F++EEI+KA+  +   K+PGPDG T   +++ W+++K DLV+V 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
         EF + GI+N+ TN ++I L+PK  ++ ++ D+RPISL+TSLYK+IAKVLAERL+ VL  TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  VSW FLD +L +KGF  R                                               D + + +    E+ VL+G+++G N  
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDT+ F    EE++   K +LL+    SGLK+NL KS++ GIN+++  L+  A    C  S  P  YLG  LGGN +   FWDP+++R  R+
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        LD WQK  +S GGRITL Q+ L  +P Y  SL K P +V   +E++ R+F+
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

A0A5E4FWN6 Reverse transcriptase domain-containing protein (Fragment); e value 4.9e-94; identity 44.1%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE ED   +  E  I  E++ FF  L+  +    + ++G+NW PI       LE  F  EE+ KAV   G  KSPGPDG +  F+++ W ++K DL+KV+
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
        Q+FFQ GIVN  TNET+ICLIPK   + KV D+RPISLVTSLYKVI+KVLA RL+ VL +TIS  Q AFVQ RQILDA+LVA+E VE+ R + +KG++ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR--------------------DAICKSVKFCLEKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKE
        +D EKAY  V W F+D++LA KGFG R                    D + + ++   +  ++ G   G + ++++ LQ+ADDT+ F    EE      +
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR--------------------DAICKSVKFCLEKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKE

Query:  ILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPI
        +L L  + SG+K+N  KS ++GIN     LN  A  +GC     P  YLG  LGGN R ++FW+P+LD+ +++L +W++  +SKGGR+TL QAVL+S+P 
Subjt:  ILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPI

Query:  YQFSLLKAPKAVIKNMEKLIRDFI
        Y  SL K P  V   +E+L+R+F+
Subjt:  YQFSLLKAPKAVIKNMEKLIRDFI

A0A803QQM3 Uncharacterized protein; e value 3.2e-93; identity 41.56%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        +E ++G  +  E EIV+E++ FFSKLY  + +    +EG+ W  I+ +    LE  F +EE+   V      K+PGPDG +    +N+W  +K DL++V 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
        + F ++G +    N+T+ICLIPK   + KV+DYRPISL+TS+YK+IAK LA RL+ VL  TIS+ Q+AFV+GRQILD++L+A+EAVEDYR R KKG++LK
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  V W FLD ++  KGFG+R                                               D + + V   ++ E L G+QIG +++
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDTL F    E  L +  +I+      SGLK+NL KS L+G+ +D   +   A + GC     P  YLG  LGG+ RK SFW+P+LD+   +
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDF
        +D W+   +S+GGR+TL Q+VL+SLPIY  SL KAPK V+K +EK++RDF
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDF

A5BCI7 Reverse transcriptase domain-containing protein; e value 9.3e-93; identity 40.58%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL
        LE E+G  +     I +EIL +F KLY     + + +EG++W PI       LE  F++EEIFKA+  +   K+PGPDG T   +++ W ++K DLVKV 
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVL

Query:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK
         EF + GI+N+ TN ++I L+PK   + ++ D+RPISL+TSLYK+IAKVLA R++ VL  TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K
Subjt:  QEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLK

Query:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL
        +D EKAY  VSW FLD +L +KGFG R                                               D + + +    E+ VL+G+++G N  
Subjt:  LDLEKAYGMVSWQFLDEILALKGFGQR-----------------------------------------------DAICKSVKFCLEKEVLKGWQIGSNNL

Query:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK
         ++ LQ+ADDT+ F  + EE++   K +LL+    SGLK+NL KS++ GIN+++  L+  A    C  S  P  YLG  LGGN +   FWDP+++R  R+
Subjt:  DIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKISFWDPLLDRFKRK

Query:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        LD WQK  +S GGRITL Q+ L  +P Y  SL K P +V   +E++ RDF+
Subjt:  LDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

SwissProt top hits (columns: e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value 5.9e-28; identity 24.25%
Query:  LETEDGRFLTKENEIVDEILTFFSKLY---------KDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNI
        ++ + G   T   EI   I  ++  LY          D     + L  +N + ++     SL    +  EI   ++ L   KSPGPDG T+EF++ +   
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLY---------KDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNI

Query:  LKPDLVKVLQEFFQKGIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDY-
        L P L+K+ Q   ++GI+     E  I LIPK  +  TK  ++RPISL+    K++ K+LA R+++ +   I   Q  F+ G Q    I  +   ++   
Subjt:  LKPDLVKVLQEFFQKGIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDY-

Query:  RIRKKKGVLLKLDLEKAYGMVSWQFLDEILALKGF----------------------GQR-------------------------DAICKSVKFCLEKEV
        R + K  V++ +D EKA+  +   F+ + L   G                       GQ+                         + + ++++   EKE+
Subjt:  RIRKKKGVLLKLDLEKAYGMVSWQFLDEILALKGF----------------------GQR-------------------------DAICKSVKFCLEKEV

Query:  LKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKI--
         KG Q+G   + +++  +ADD +V+  N         +++    + SG K+N+ KS     N +R   +    +     +     YLG QL  + + +  
Subjt:  LKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRKI--

Query:  SFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI
          + PLL   K   +KW+  P S  GRI + +  +    IY+F+   +K P      +EK    FI
Subjt:  SFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI

P08548 LINE-1 reverse transcriptase homolog; e value 1.9e-26; identity 25.55%
Query:  TKENEIVDEILTFFSKLYK---DDCKQ-RFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVLQEFFQ
        T  +EI   +  ++ KLY    ++ K+    LE  +   +       L    S  EI   +  L   KSPGPDG TSEF++     L P L+ + Q   +
Subjt:  TKENEIVDEILTFFSKLYK---DDCKQ-RFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVLQEFFQ

Query:  KGIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDY-RIRKKKGVLLKLDL
        +GI+     E  I LIPK  K  T+  +YRPISL+    K++ K+L  R+++ +   I   Q  F+ G Q    I  +   ++   +++ K  ++L +D 
Subjt:  KGIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDY-RIRKKKGVLLKLDL

Query:  EKAYGMVSWQFLDEILALKGFGQRDAICKSVK-------------------FCL---------------------------EKEVLKGWQIGSNNLDIAI
        EKA+  +   F+  I  LK  G      K ++                   F L                           E++ +KG  IGS  + +++
Subjt:  EKAYGMVSWQFLDEILALKGFGQRDAICKSVK-------------------FCL---------------------------EKEVLKGWQIGSNNLDIAI

Query:  LQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKS-SLIGINVDRLQLNLWANKFGCHCSEVP--FNYLGFQLGGNFRKI--SFWDPLLDRFKR
          +ADD +V+  N+ +  ++  E++      SG K+N  KS + I  N ++ +  +   K     + VP    YLG  L  + + +    ++ L      
Subjt:  LQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKS-SLIGINVDRLQLNLWANKFGCHCSEVP--FNYLGFQLGGNFRKI--SFWDPLLDRFKR

Query:  KLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI
         ++KW+  P S  GRI + +  +    IY F+   +KAP +  K++EK+I  FI
Subjt:  KLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI

P11369 LINE-1 retrotransposable element ORF2 protein; e value 5.7e-23; identity 24.09%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYK---------DDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNI
        +  E G   T   EI + I +F+ +LY          D    R+ +  +N D +D      L    S +EI   ++ L   KSPGPDG ++EF++     
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYK---------DDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNI

Query:  LKPDLVKVLQEFFQK----GIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAV
         K DL+ +L + F K    G +     E  I LIPK  K  TK+ ++RPISL+    K++ K+LA R++  + + I   Q  F+ G Q    I  +   +
Subjt:  LKPDLVKVLQEFFQK----GIVNKRTNETYICLIPKMDK-ATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAV

Query:  EDY-RIRKKKGVLLKLDLEKAYGMVSWQFLDEILALKGF---------------------------------GQRDAICKS-----------VKFCLEKE
            +++ K  +++ LD EKA+  +   F+ ++L   G                                  G R     S            +   +++
Subjt:  EDY-RIRKKKGVLLKLDLEKAYGMVSWQFLDEILALKGF---------------------------------GQRDAICKS-----------VKFCLEKE

Query:  VLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSS--LIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRK
         +KG QIG   + I++L  ADD +V+  + +        ++    E  G K+N  KS   L   N    +       F    + +   YLG  L    + 
Subjt:  VLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSS--LIGINVDRLQLNLWANKFGCHCSEVPFNYLGFQLGGNFRK

Query:  I--SFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI
        +    +  L    K  L +W+  P S  GRI + +  +    IY+F+   +K P      +E  I  F+
Subjt:  I--SFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSL--LKAPKAVIKNMEKLIRDFI

P14381 Transposon TX1 uncharacterized 149 kDa protein; e value 3.2e-26; identity 25.61%
Query:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWD--PIDSNCRSS-LEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLV
        L  EDG  L     I D   +F+  L+  D       E + WD  P+ S  R   LE   + +E+ +A+ ++ + KSPG DG+T EF++  W+ L PD  
Subjt:  LETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWD--PIDSNCRSS-LEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLV

Query:  KVLQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGV
        +VL E F+KG +        + L+PK      ++++RP+SL+++ YK++AK ++ RLK VL   I   Q+  V GR I D + +  + +   R       
Subjt:  KVLQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGV

Query:  LLKLDLEKAYGMVSWQFLDEILALKGFGQR----------DAIC---------------KSVK-----------------FCLEKEVLKGWQIGSNNLDI
         L LD EKA+  V  Q+L   L    FG +           A C               + V+                  CL ++ L G  +   ++ +
Subjt:  LLKLDLEKAYGMVSWQFLDEILALKGFGQR----------DAIC---------------KSVK-----------------FCLEKEVLKGWQIGSNNLDI

Query:  AILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKF-GCHCSEVPFNYLGFQLGGNFRKIS-FWDPLLDRFKRK
         +  YADD ++   +   +L R +E   +    S  ++N  KSS  G+    L+++     F           YLG  L      +S  +  L +    +
Subjt:  AILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKF-GCHCSEVPFNYLGFQLGGNFRKIS-FWDPLLDRFKRK

Query:  LDKWQKFP--ISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        L KW+ F   +S  GR  +   ++ S   Y+   L   +  I  +++ + DF+
Subjt:  LDKWQKFP--ISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 1.1e-13; identity 39.58%
Query:  SSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVLQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVI
        S L    SD+EI  AV  +   K+PGPD  T+EF+   W ++K   +  ++EFF+ G + KR N T I LIPK+    ++  +RP+S  T +YK+I
Subjt:  SSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVLQEFFQKGIVNKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVI

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 2.5e-05; identity 28.4%
Query:  VPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
        +P  YLG  L       S + PL+++ + ++ KW    +S  GR+ L  +V++SL  +  S  + P A IK ++ +   F+
Subjt:  VPFNYLGFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases; e value 1.3e-09; identity 44.3%
Query:  LAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGV----LLKLDLEKAYGMVSWQFLDEILALKGF
        + ERLK ++ + I   QA+F+ GR   D I+   EAV  + +R+KKGV    LLKLDLEKAY  + W +L++ L   GF
Subjt:  LAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGV----LLKLDLEKAYGMVSWQFLDEILALKGF
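
The % identity values in the hit tables above can be related to the alignment midlines (the unlabeled line between Query and Subjt, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap). The sketch below assumes the usual BLAST convention of identical columns divided by total alignment columns; the page does not say how its values were computed, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style midline.

    The midline must cover every alignment column, so trailing spaces
    have to be preserved when it is copied out of the alignment.
    """
    if not midline:
        raise ValueError("midline is empty")
    # Letters mark identical residues; '+' and ' ' are non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# First eight columns of the CAN65484.1 alignment: query 'LETEDGRF'.
print(percent_identity("LE E+G  "))
```

For a full hit, the midline segments of all alignment blocks would be concatenated before calling the function.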


Sequences
CDS sequence
ATGCTTGAGACTGAAGATGGGAGGTTTCTTACCAAGGAGAATGAGATTGTTGATGAAATCTTGACCTTTTTCAGTAAGCTATACAAGGATGATTGCAAGCAAAGATTTGT
GCTTGAAGGCGTGAATTGGGATCCGATTGACAGCAACTGTAGAAGTAGTCTGGAAGGCTCTTTCAGTGATGAAGAAATTTTTAAAGCTGTTAGTGTTCTGGGAAATCTGA
AATCCCCTGGCCCCGATGGTATGACTAGCGAATTTTGGAAAAATCATTGGAACATCTTGAAGCCTGATTTAGTAAAGGTGCTCCAAGAATTTTTCCAAAAGGGCATTGTT
AACAAAAGAACAAATGAGACTTATATTTGCTTGATTCCGAAGATGGATAAAGCTACTAAGGTGAGGGATTACAGACCTATCAGCCTAGTCACTTCCCTATACAAGGTGAT
CGCCAAAGTCTTAGCCGAGAGATTAAAAAGAGTGCTTCCTTCGACAATTAGTGACTGCCAAGCGGCTTTTGTGCAAGGTAGGCAAATTCTTGATGCTATTTTAGTGGCTT
CGGAAGCCGTGGAAGACTATAGAATTAGAAAGAAGAAAGGAGTGTTGCTCAAGTTGGACTTAGAAAAAGCGTATGGCATGGTTAGTTGGCAATTTCTTGATGAGATTCTA
GCTTTGAAGGGCTTTGGCCAAAGAGATGCTATATGTAAGTCTGTAAAATTCTGCCTTGAGAAAGAGGTTCTTAAGGGTTGGCAGATTGGATCCAATAATTTGGATATAGC
CATTTTGCAATACGCAGATGACACCTTGGTCTTTTGTCCGAATAGTGAAGAGGAGTTGTCGAGATGGAAGGAGATTCTTCTGTTGATTATGGAGGGTTCGGGCCTCAAAT
TAAATTTACTGAAATCTTCTCTTATCGGCATAAATGTGGATAGACTTCAGCTTAATCTTTGGGCCAACAAATTTGGGTGCCATTGCAGTGAGGTCCCATTTAACTACCTT
GGTTTTCAATTAGGGGGCAACTTTCGTAAAATATCCTTTTGGGATCCTCTTTTGGACAGATTTAAGCGGAAACTTGACAAATGGCAGAAATTTCCCATCTCCAAAGGGGG
AAGAATTACCTTGGCACAAGCTGTTTTGAACAGCCTCCCTATATATCAATTTTCTCTCCTTAAAGCTCCAAAGGCTGTCATTAAAAATATGGAAAAACTCATCAGGGACT
TCATATGA
mRNA sequence
ATGCTTGAGACTGAAGATGGGAGGTTTCTTACCAAGGAGAATGAGATTGTTGATGAAATCTTGACCTTTTTCAGTAAGCTATACAAGGATGATTGCAAGCAAAGATTTGT
GCTTGAAGGCGTGAATTGGGATCCGATTGACAGCAACTGTAGAAGTAGTCTGGAAGGCTCTTTCAGTGATGAAGAAATTTTTAAAGCTGTTAGTGTTCTGGGAAATCTGA
AATCCCCTGGCCCCGATGGTATGACTAGCGAATTTTGGAAAAATCATTGGAACATCTTGAAGCCTGATTTAGTAAAGGTGCTCCAAGAATTTTTCCAAAAGGGCATTGTT
AACAAAAGAACAAATGAGACTTATATTTGCTTGATTCCGAAGATGGATAAAGCTACTAAGGTGAGGGATTACAGACCTATCAGCCTAGTCACTTCCCTATACAAGGTGAT
CGCCAAAGTCTTAGCCGAGAGATTAAAAAGAGTGCTTCCTTCGACAATTAGTGACTGCCAAGCGGCTTTTGTGCAAGGTAGGCAAATTCTTGATGCTATTTTAGTGGCTT
CGGAAGCCGTGGAAGACTATAGAATTAGAAAGAAGAAAGGAGTGTTGCTCAAGTTGGACTTAGAAAAAGCGTATGGCATGGTTAGTTGGCAATTTCTTGATGAGATTCTA
GCTTTGAAGGGCTTTGGCCAAAGAGATGCTATATGTAAGTCTGTAAAATTCTGCCTTGAGAAAGAGGTTCTTAAGGGTTGGCAGATTGGATCCAATAATTTGGATATAGC
CATTTTGCAATACGCAGATGACACCTTGGTCTTTTGTCCGAATAGTGAAGAGGAGTTGTCGAGATGGAAGGAGATTCTTCTGTTGATTATGGAGGGTTCGGGCCTCAAAT
TAAATTTACTGAAATCTTCTCTTATCGGCATAAATGTGGATAGACTTCAGCTTAATCTTTGGGCCAACAAATTTGGGTGCCATTGCAGTGAGGTCCCATTTAACTACCTT
GGTTTTCAATTAGGGGGCAACTTTCGTAAAATATCCTTTTGGGATCCTCTTTTGGACAGATTTAAGCGGAAACTTGACAAATGGCAGAAATTTCCCATCTCCAAAGGGGG
AAGAATTACCTTGGCACAAGCTGTTTTGAACAGCCTCCCTATATATCAATTTTCTCTCCTTAAAGCTCCAAAGGCTGTCATTAAAAATATGGAAAAACTCATCAGGGACT
TCATATGA
Protein sequence
MLETEDGRFLTKENEIVDEILTFFSKLYKDDCKQRFVLEGVNWDPIDSNCRSSLEGSFSDEEIFKAVSVLGNLKSPGPDGMTSEFWKNHWNILKPDLVKVLQEFFQKGIV
NKRTNETYICLIPKMDKATKVRDYRPISLVTSLYKVIAKVLAERLKRVLPSTISDCQAAFVQGRQILDAILVASEAVEDYRIRKKKGVLLKLDLEKAYGMVSWQFLDEIL
ALKGFGQRDAICKSVKFCLEKEVLKGWQIGSNNLDIAILQYADDTLVFCPNSEEELSRWKEILLLIMEGSGLKLNLLKSSLIGINVDRLQLNLWANKFGCHCSEVPFNYL
GFQLGGNFRKISFWDPLLDRFKRKLDKWQKFPISKGGRITLAQAVLNSLPIYQFSLLKAPKAVIKNMEKLIRDFI
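
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper, and the two inputs are the first 24 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGCTTGAGACTGAAGATGGGAGG"))  # start of the CDS
print(translate("ATCAGGGACTTCATATGA"))        # end of the CDS, including the stop codon
```

The two translated fragments should agree with the start (`MLETEDGR`) and end (`IRDFI`, followed by a stop) of the protein sequence above.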