CuGenDBv2

Lag0011015 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011015
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:12272115..12273653
RNA-Seq Expression: Lag0011015
Synteny: Lag0011015
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
BBN69274.1 TatD related DNase [Prunus dulcis] (e value: 3.0e-66; identity: 42.59%)
Query:  DVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGL
        DV+L H G++       G   + +SHLQF DDTI F    E +  NL  ++K F + SG+ +N  KS  +GI  + +T++ +A  +GC+ G W   YLGL
Subjt:  DVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGL

Query:  PLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGL
        PL G P++ +FW+PV++K+EKRL  W    LSKGGRLTLIQA LS++P Y++SLFK P  V  K+E+L RN+LW G  E K  HL++W RV    E GGL
Subjt:  PLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGL

Query:  GIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLN--SSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLF
        GI  + ++N +L AKW+WRF +E ++LW  ++ +KYG    G   DTK ++  S R PW+ I              VG G+   F +D W+    L+ LF
Subjt:  GIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLN--SSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLF

Query:  PSLYHLSNKKDAPIVDF
        P LY LS +K+  I  F
Subjt:  PSLYHLSNKKDAPIVDF

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.2e-67; identity: 39.51%)
Query:  HLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPL
        +L  K  I G+   K S  L+++H+ F DD ++F    + ++ NL  ++  FE ASGLN+N  KS    I +       +AD +G   G    +YLG+PL
Subjt:  HLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPL

Query:  NGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGI
         G+P S++FWD V++KI+K+L +W  + LSKGGR+TLI +TL +LPIY +S+FK P  +  KIE  +RN+LW G +    I L++W+++ +P E GGLGI
Subjt:  NGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGI

Query:  MGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLY
          +   N +LL KW+W+F  EKD LW  ++ +KY     G  P     +S+  PWK++    S  Y NIS KV  G+  SF  D W GN  L    P L+
Subjt:  MGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLY

Query:  HLSNKKDAPIVDFWCYQSNSWMIY
         LS  K   + +FW   SN W ++
Subjt:  HLSNKKDAPIVDFWCYQSNSWMIY

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.8e-15; identity: 40.19%)
Query:  MWNELILDIDEMVKGNFSLTLNLSLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTEL
        MWN+    I  + KG FS+++ +   +G   W++ IYGP     R  FW+EL +L S+C P WI+GGDFN+ RW  E +   P    M+ FN FI    L
Subjt:  MWNELILDIDEMVKGNFSLTLNLSLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTEL

Query:  IDVHLEH
        ID  L +
Subjt:  IDVHLEH

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.7e-66; identity: 42.04%)
Query:  FNKFIDLTELIDVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCK
        F   +D+   +        LIKGL  G G   + ISHLQF DDTI F + DE   +NL  V+  F   SGL +N  K    GI    + +  +AD +GC+
Subjt:  FNKFIDLTELIDVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCK

Query:  TGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWD
         G W   YLGLPL G+PK+  FWDPVVEK+EKRL SW    LSKGGRLTLIQ  L +LPIY++SLFK P  VI ++EKL + +LW G  E K  HL+KW+
Subjt:  TGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWD

Query:  RVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWM
         V    E GGL +  +  +  +LLAKW+WRF  E ++LW  V+ +KYG    G   +     SSR PWK I             +VG G+   F +D W+
Subjt:  RVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWM

Query:  GNMSLQHLFPSLYHLSNKKDAPIVDFWCYQSNS
            L+  +P L+ LS   +  I  F     NS
Subjt:  GNMSLQHLFPSLYHLSNKKDAPIVDFWCYQSNS

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.7e-69; identity: 39.69%)
Query:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP
        ++L  KG I G++ G     L+++H+ F DD ++F    E ++ NL  ++  FE ASGLN+N  KS    I +     + + D +G   G    TYLG+P
Subjt:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP

Query:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG
        L GKP S++FWD +++KI+K+L SW  + LSKGGR+TLI +TL +LPIY LS+FK P  +  KIE  +RN+LW G +    I L++W++V +P E GGLG
Subjt:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG

Query:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL
        I  +   N +LL KW+W+F  EK+ LW  ++ +KY     G  P     +S+  PWK++ +  S  Y NI  KV  G+  SF  D W GN  L  + P L
Subjt:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL

Query:  YHLSNKKDAPIVDFWCYQSNSWMIY
        + LS  K   + D W      W I+
Subjt:  YHLSNKKDAPIVDFWCYQSNSWMIY

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 6.4e-69; identity: 39.69%)
Query:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP
        ++L  KG I G++ G     L+++H+ F DD ++F    E ++ NL  ++  FE ASGLN+N  KS    I +     + + D +G   G    TYLG+P
Subjt:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP

Query:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG
        L GKP S++FWD +++KI+K+L SW  + LSKGGR+TLI +TL +LPIY LS+FK P  +  KIE  +RN+LW G +    I L++W++V +P E GGLG
Subjt:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG

Query:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL
        I  +   N +LL KW+W+F  EK+ LW  ++ +KY     G  P     +S+  PWK++ +  S  Y NI  KV  G+  SF  D W GN  L    P L
Subjt:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL

Query:  YHLSNKKDAPIVDFWCYQSNSWMIY
        + LS  K   + D W      W I+
Subjt:  YHLSNKKDAPIVDFWCYQSNSWMIY

TrEMBL top hits (e value, % identity, alignment)
A0A2N9IL38 Reverse transcriptase domain-containing protein (e value: 2.5e-74; identity: 26.38%)
Query:  IIISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSIDAIGSSGGILIMWNELILDIDEMVKGNFSLTLNL-SLADG
        +I+SWN+RGL    KR  ++N ++     VV LQE K   +D+  IKSIW   ++ W S+ + G+S GIL++W++ +++  +   G FSL+    ++++ 
Subjt:  IIISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSIDAIGSSGGILIMWNELILDIDEMVKGNFSLTLNL-SLADG

Query:  FDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVHLE----------------------
        F+   TG+YGPN   +R+  W+EL  L S  +  W +GGDFN+ R+  EKS   P    M+ F++FI    L+D  LE                      
Subjt:  FDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVHLE----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------HKGLIKGLHVGK-GSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLAD
                             +GL+ G  VG   + +L ISHL F DDT++FS  +  H+ ++  +   FE  SGL +N  KSE + +G  P  +  LA+
Subjt:  --------------------HKGLIKGLHVGK-GSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLAD

Query:  RFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIH
          GCKT      YLGLPL  K KS + WDP++EK+E++L  W   +LSKGGRLTLI++TLS+LP YFLSLF  P  V ++I+K+ R++LW G  E +  H
Subjt:  RFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIH

Query:  LLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFG
        L+ W++V  P+  GGLGI  +   N +LL KW+W F  E++ALW  V+  KYG+   G         +    WK+I    + V S +S ++G G   SF 
Subjt:  LLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFG

Query:  KDTWMGNMSLQHLFPSLYHLSNKKDAPIVDFWCYQSN--SWMI
         D W G  SL+  +P L+ +S  K A + +   Y +   SW++
Subjt:  KDTWMGNMSLQHLFPSLYHLSNKKDAPIVDFWCYQSN--SWMI

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value: 5.9e-68; identity: 39.51%)
Query:  HLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPL
        +L  K  I G+   K S  L+++H+ F DD ++F    + ++ NL  ++  FE ASGLN+N  KS    I +       +AD +G   G    +YLG+PL
Subjt:  HLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPL

Query:  NGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGI
         G+P S++FWD V++KI+K+L +W  + LSKGGR+TLI +TL +LPIY +S+FK P  +  KIE  +RN+LW G +    I L++W+++ +P E GGLGI
Subjt:  NGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGI

Query:  MGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLY
          +   N +LL KW+W+F  EKD LW  ++ +KY     G  P     +S+  PWK++    S  Y NIS KV  G+  SF  D W GN  L    P L+
Subjt:  MGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLY

Query:  HLSNKKDAPIVDFWCYQSNSWMIY
         LS  K   + +FW   SN W ++
Subjt:  HLSNKKDAPIVDFWCYQSNSWMIY

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value: 8.9e-16; identity: 40.19%)
Query:  MWNELILDIDEMVKGNFSLTLNLSLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTEL
        MWN+    I  + KG FS+++ +   +G   W++ IYGP     R  FW+EL +L S+C P WI+GGDFN+ RW  E +   P    M+ FN FI    L
Subjt:  MWNELILDIDEMVKGNFSLTLNLSLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTEL

Query:  IDVHLEH
        ID  L +
Subjt:  IDVHLEH

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value: 3.8e-67; identity: 40.16%)
Query:  WQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPP--TKDMKSFNKFIDLTELIDVHLEHKGLIKGLHVGK-GSHILSISHLQFVDDTILFSSHDESHL
        W  L YL   C P     G F  SR   +     P      M++F++ ID            G++ G  VG   S  L +SHL F DDT++F   D  HL
Subjt:  WQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPP--TKDMKSFNKFIDLTELIDVHLEHKGLIKGLHVGK-GSHILSISHLQFVDDTILFSSHDESHL

Query:  DNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATL
         +L SV+  FE  SGL +N  KSE + +G  P  +  LAD  GC+T      YLGLPL  K K+   W+P+VEK+E+RL  W   +LSKGGRLTL+++TL
Subjt:  DNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATL

Query:  SNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLK
        SNLP YFLSLF  P  V ++IEK+ RN+LW  + E+   HL+KWD V +P  +GGL I  + + N +LL KW+WRF +E+DA W  V+  KYG+   G  
Subjt:  SNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLK

Query:  PDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLYHLSNKKDAPIVDF
            S     G WK I S        ++ +VG G    F  D W G   L+  +P LY ++  KD  + DF
Subjt:  PDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLYHLSNKKDAPIVDF

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein (e value: 8.2e-70; identity: 39.69%)
Query:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP
        ++L  KG I G++ G     L+++H+ F DD ++F    E ++ NL  ++  FE ASGLN+N  KS    I +     + + D +G   G    TYLG+P
Subjt:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP

Query:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG
        L GKP S++FWD +++KI+K+L SW  + LSKGGR+TLI +TL +LPIY LS+FK P  +  KIE  +RN+LW G +    I L++W++V +P E GGLG
Subjt:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG

Query:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL
        I  +   N +LL KW+W+F  EK+ LW  ++ +KY     G  P     +S+  PWK++ +  S  Y NI  KV  G+  SF  D W GN  L  + P L
Subjt:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL

Query:  YHLSNKKDAPIVDFWCYQSNSWMIY
        + LS  K   + D W      W I+
Subjt:  YHLSNKKDAPIVDFWCYQSNSWMIY

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein (e value: 3.1e-69; identity: 39.69%)
Query:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP
        ++L  KG I G++ G     L+++H+ F DD ++F    E ++ NL  ++  FE ASGLN+N  KS    I +     + + D +G   G    TYLG+P
Subjt:  VHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLP

Query:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG
        L GKP S++FWD +++KI+K+L SW  + LSKGGR+TLI +TL +LPIY LS+FK P  +  KIE  +RN+LW G +    I L++W++V +P E GGLG
Subjt:  LNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLG

Query:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL
        I  +   N +LL KW+W+F  EK+ LW  ++ +KY     G  P     +S+  PWK++ +  S  Y NI  KV  G+  SF  D W GN  L    P L
Subjt:  IMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSL

Query:  YHLSNKKDAPIVDFWCYQSNSWMIY
        + LS  K   + D W      W I+
Subjt:  YHLSNKKDAPIVDFWCYQSNSWMIY

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 2.6e-04; identity: 24.31%)
Query:  IISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSI---DAIGSSGGILIMWNELI----LDIDEMVKGNFSLTLNL
        I++ N+ GL S  KR  + +++KS +P+V  +QET  +  D   +K        GW  I   +      G+ I+ ++        I    +G++ +    
Subjt:  IISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSI---DAIGSSGGILIMWNELI----LDIDEMVKGNFSLTLNL

Query:  SLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVH
           +  +L +  IY PN    R    Q L  L    + + ++ GDFN      ++S  +   KD +  N  +  T+LID++
Subjt:  SLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVH

P08548 LINE-1 reverse transcriptase homolog (e value: 5.9e-09; identity: 21.98%)
Query:  ELIDVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTY
        E++ + +  +  IKG+H+G     LS+    F DD I++  +       L  VIK++   SG  +N HKS         Q    + D            Y
Subjt:  ELIDVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTY

Query:  LGLPLNGKPKS--TSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSL--FKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKA
        LG+ L    K      ++ + ++I + +  W +   S  GR+ +++ ++    IY  +    K P      +EK+  +++W          LL  ++ KA
Subjt:  LGLPLNGKPKS--TSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSL--FKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKA

Query:  PIENGGLGIMGIEQKNTSLLAKWIWRFHIEKD
            GG+ +  +     S++ K  W +H  ++
Subjt:  PIENGGLGIMGIEQKNTSLLAKWIWRFHIEKD

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 5.0e-24; identity: 31.19%)
Query:  LPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGG
        +P+  K  +   +  ++E++  R+  W    LS  GRLTL +A LS++P++ +S    P  ++N++++L R +LW    E K  HL+KW +V +P + GG
Subjt:  LPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGG

Query:  LGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSL---NSSRGPWKSID-STKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQ
        LG+   +  N +L++K  WR   EK++LW  V+  KY   H G   D++ L    S    W+SI    + +V   +    G G+   F  D W+    L 
Subjt:  LGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSL---NSSRGPWKSID-STKSLVYSNISIKVGYGKSTSFGKDTWMGNMSLQ

Query:  HL
         L
Subjt:  HL

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.7e-06; identity: 26.52%)
Query:  IISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSI---DAIGSSGGILIMWNELILDIDEMVK----GNFSLTLNL
        +IS NI GL S  KR  + ++L   +PT   LQET     D+ ++      R  GW +I   + +    G+ I+ ++ I    +++K    G+F L    
Subjt:  IISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSI---DAIGSSGGILIMWNELILDIDEMVK----GNFSLTLNL

Query:  SLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVH
         L +  +L +  IY PNA          L  L +   P+ I+ GDFN    S ++S  +   +D     + +   +L D++
Subjt:  SLADGFDLWVTGIYGPNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVH

Arabidopsis top hits (e value, % identity, alignment)
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 8.5e-19; identity: 30.49%)
Query:  GIAPQTVSCLADRFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNY
        G+     + +   F   +G     YLGLPL  K  +TS + P+VEKI  R+  W + HLS  GRL LI + + +L  +++S F+ P+  I +I+ +  ++
Subjt:  GIAPQTVSCLADRFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKTPTKVINKIEKLFRNY

Query:  LWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKN---------TSLLAKWIWRFHIEKDAL
        LW G   +     + W  V  P + GGLGI  +++ N          + L  W+W+  ++  AL
Subjt:  LWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKN---------TSLLAKWIWRFHIEKDAL

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 4.5e-12; identity: 27.97%)
Query:  LPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPD
        LP Y ++ F  P  V  +I  +  ++ WR   E+KG+H   WD +      GG+G   IE  N +LL K +WR     ++L   V  ++Y      L   
Subjt:  LPIYFLSLFKTPTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPD

Query:  TKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWM
          S  S    WKSI +++ ++       VG G+     +  W+
Subjt:  TKSLNSSRGPWKSIDSTKSLVYSNISIKVGYGKSTSFGKDTWM


Sequences
CDS sequence
ATGATTATTATCTCCTGGAACATCAGAGGCCTTGGCTCATGGAAGAAACGAGCCCTTATTAAGAACTTCCTCAAAAGCTGTAACCCGACCGTTGTTATTCTCCAAGAAAC
TAAATCCTCATCGATTGACAAGAAATTCATCAAGTCTATATGGAGTTCTCGATACATCGGTTGGTCCTCCATTGATGCCATTGGATCATCAGGTGGCATCCTCATCATGT
GGAATGAACTTATCCTCGATATTGATGAAATGGTCAAAGGTAACTTCTCTCTGACTCTAAACCTCTCTTTGGCTGATGGTTTCGATCTTTGGGTTACAGGTATTTATGGT
CCTAATGCTCCCACAGAGAGAGTTCATTTCTGGCAGGAGCTTTTTTATTTATCCTCCCTTTGTGAACCGAATTGGATTATGGGTGGTGACTTTAATATTTCCAGATGGTC
ATGGGAGAAATCCAATCATAAACCTCCCACTAAAGACATGAAGAGCTTTAACAAGTTTATTGACTTGACTGAGTTGATAGATGTTCACTTGGAACACAAAGGTCTTATTA
AGGGCTTACATGTCGGTAAAGGATCCCATATTCTATCCATCTCGCATCTCCAATTTGTCGATGACACTATCCTTTTCTCATCGCACGACGAATCTCACCTCGATAATCTT
TTCAGTGTTATCAAGCAATTCGAGGAAGCTTCTGGGCTGAATGTAAATTGTCATAAATCTGAATTTATGGGTATTGGTATTGCTCCACAAACGGTTTCTTGCCTTGCAGA
TCGATTTGGTTGCAAAACAGGAGGATGGTCGAACACCTACCTTGGCTTGCCGCTGAATGGTAAACCAAAATCCACATCTTTTTGGGATCCAGTGGTCGAGAAGATTGAAA
AAAGGCTCCTTTCATGGGGCTCTACTCATCTTTCCAAAGGAGGGAGGCTCACTCTAATACAGGCTACCTTGTCTAACCTTCCCATATATTTCCTTTCCCTCTTCAAGACT
CCCACAAAGGTTATCAACAAGATTGAGAAGCTCTTTCGGAACTATCTATGGAGAGGCAACAACGAATCTAAAGGCATCCATCTATTGAAATGGGATCGAGTCAAAGCCCC
TATTGAGAATGGTGGCCTAGGTATTATGGGCATCGAACAGAAAAACACATCTCTTCTGGCCAAATGGATATGGCGCTTTCATATAGAGAAAGACGCCCTTTGGTGCCATG
TTGTAGCTACTAAATATGGGACCTCCCACTTTGGCTTAAAGCCCGACACTAAATCTCTTAATTCTTCCAGAGGCCCGTGGAAATCTATTGACAGCACGAAGTCTTTGGTG
TATTCCAATATATCGATTAAGGTGGGTTATGGTAAAAGCACTTCATTCGGGAAGGACACATGGATGGGGAACATGAGCCTTCAACATTTGTTTCCAAGTTTATATCATCT
ATCTAACAAAAAAGATGCCCCTATCGTTGATTTTTGGTGCTATCAATCCAATTCTTGGATGATCTACCAAGGAGAAATCTCACTGATCTCGAGACTGAAGAATGGATAG
mRNA sequence
ATGATTATTATCTCCTGGAACATCAGAGGCCTTGGCTCATGGAAGAAACGAGCCCTTATTAAGAACTTCCTCAAAAGCTGTAACCCGACCGTTGTTATTCTCCAAGAAAC
TAAATCCTCATCGATTGACAAGAAATTCATCAAGTCTATATGGAGTTCTCGATACATCGGTTGGTCCTCCATTGATGCCATTGGATCATCAGGTGGCATCCTCATCATGT
GGAATGAACTTATCCTCGATATTGATGAAATGGTCAAAGGTAACTTCTCTCTGACTCTAAACCTCTCTTTGGCTGATGGTTTCGATCTTTGGGTTACAGGTATTTATGGT
CCTAATGCTCCCACAGAGAGAGTTCATTTCTGGCAGGAGCTTTTTTATTTATCCTCCCTTTGTGAACCGAATTGGATTATGGGTGGTGACTTTAATATTTCCAGATGGTC
ATGGGAGAAATCCAATCATAAACCTCCCACTAAAGACATGAAGAGCTTTAACAAGTTTATTGACTTGACTGAGTTGATAGATGTTCACTTGGAACACAAAGGTCTTATTA
AGGGCTTACATGTCGGTAAAGGATCCCATATTCTATCCATCTCGCATCTCCAATTTGTCGATGACACTATCCTTTTCTCATCGCACGACGAATCTCACCTCGATAATCTT
TTCAGTGTTATCAAGCAATTCGAGGAAGCTTCTGGGCTGAATGTAAATTGTCATAAATCTGAATTTATGGGTATTGGTATTGCTCCACAAACGGTTTCTTGCCTTGCAGA
TCGATTTGGTTGCAAAACAGGAGGATGGTCGAACACCTACCTTGGCTTGCCGCTGAATGGTAAACCAAAATCCACATCTTTTTGGGATCCAGTGGTCGAGAAGATTGAAA
AAAGGCTCCTTTCATGGGGCTCTACTCATCTTTCCAAAGGAGGGAGGCTCACTCTAATACAGGCTACCTTGTCTAACCTTCCCATATATTTCCTTTCCCTCTTCAAGACT
CCCACAAAGGTTATCAACAAGATTGAGAAGCTCTTTCGGAACTATCTATGGAGAGGCAACAACGAATCTAAAGGCATCCATCTATTGAAATGGGATCGAGTCAAAGCCCC
TATTGAGAATGGTGGCCTAGGTATTATGGGCATCGAACAGAAAAACACATCTCTTCTGGCCAAATGGATATGGCGCTTTCATATAGAGAAAGACGCCCTTTGGTGCCATG
TTGTAGCTACTAAATATGGGACCTCCCACTTTGGCTTAAAGCCCGACACTAAATCTCTTAATTCTTCCAGAGGCCCGTGGAAATCTATTGACAGCACGAAGTCTTTGGTG
TATTCCAATATATCGATTAAGGTGGGTTATGGTAAAAGCACTTCATTCGGGAAGGACACATGGATGGGGAACATGAGCCTTCAACATTTGTTTCCAAGTTTATATCATCT
ATCTAACAAAAAAGATGCCCCTATCGTTGATTTTTGGTGCTATCAATCCAATTCTTGGATGATCTACCAAGGAGAAATCTCACTGATCTCGAGACTGAAGAATGGATAG
Protein sequence
MIIISWNIRGLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDKKFIKSIWSSRYIGWSSIDAIGSSGGILIMWNELILDIDEMVKGNFSLTLNLSLADGFDLWVTGIYG
PNAPTERVHFWQELFYLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKDMKSFNKFIDLTELIDVHLEHKGLIKGLHVGKGSHILSISHLQFVDDTILFSSHDESHLDNL
FSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWSNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPIYFLSLFKT
PTKVINKIEKLFRNYLWRGNNESKGIHLLKWDRVKAPIENGGLGIMGIEQKNTSLLAKWIWRFHIEKDALWCHVVATKYGTSHFGLKPDTKSLNSSRGPWKSIDSTKSLV
YSNISIKVGYGKSTSFGKDTWMGNMSLQHLFPSLYHLSNKKDAPIVDFWCYQSNSWMIYQGEISLISRLKNG
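
The genome location, CDS, and protein sequence in this record can be checked against each other by length. The sketch below is illustrative only and is not part of CuGenDB. It assumes that the location string uses 1-based inclusive coordinates, that the gene is a single exon with no introns, and that the CDS ends with a stop codon. The helper names (parse_location, span_length, expected_protein_length) are hypothetical.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered by a 1-based inclusive interval."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length.

    Assumption: the final codon is a stop codon and is not translated.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


seqid, start, end = parse_location("chr1:12272115..12273653")
span = span_length(start, end)
print(seqid, span, expected_protein_length(span))
```

Under these assumptions the span and the implied residue count can be compared with the lengths of the CDS and protein sequences listed above. A mismatch would suggest introns, untranslated regions, or a different coordinate convention.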