; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015890 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015890
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr12:28178233..28180103
RNA-Seq ExpressionLag0015890
SyntenyLag0015890
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]2.8e-7935.96Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN S  R+ FW EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + N      DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGNLTEQE-------------------SYRRTEIEADLLSISA-----------------------
        WGP PFRF N WL H SF     +WW+     GW G+   ++                   S R+ +I +DL++  +                       
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGNLTEQE-------------------SYRRTEIEADLLSISA-----------------------

Query:  ----NVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
              E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  ----NVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]8.2e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

RVW12714.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.0e-8137.84Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEGVYGPNSSRERRLFWKELTDLQALCPPNLILGGDFNVIRWTWEKSSYNV
        ETK  + DR+++ SVWS RN  WA++ A GASGGILI+W+      +E++  VYGPN+S  R+ FW EL+D+  L  P   +GGDFNVIR + EK   + 
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEGVYGPNSSRERRLFWKELTDLQALCPPNLILGGDFNVIRWTWEKSSYNV

Query:  PTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEKWGPPPFRFINAWLNHHSFLR
         T  MK F+ FI D  LID PL +  YTWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     KWGP PFRF N WL H +F  
Subjt:  PTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEKWGPPPFRFINAWLNHHSFLR

Query:  KVDQWWKSVPSLGWSGN---------LTEQESYRRTEI------EADLLSISANVEML-------------------------------WRHSCKVKWLK
           +WW      GW G+           + + + +T        + D+L++ AN + L                               WR   +VKW+K
Subjt:  KVDQWWKSVPSLGWSGN---------LTEQESYRRTEI------EADLLSISANVEML-------------------------------WRHSCKVKWLK

Query:  EGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSG
        EGD NS FFH++    R +  I E+ ++SG  L N   I++E + + EKLY      +   +  +WSPI  E    LE PFTE+EI +A++ +  +K  G
Subjt:  EGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSG

Query:  PDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        PDGFT   F+  W+++K+D+  VFT+F ++GIIN S
Subjt:  PDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]8.2e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]8.2e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

TrEMBL top hitse value%identityAlignment
A0A438BP29 Transposon TX1 uncharacterized 149 kDa protein1.5e-8137.84Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEGVYGPNSSRERRLFWKELTDLQALCPPNLILGGDFNVIRWTWEKSSYNV
        ETK  + DR+++ SVWS RN  WA++ A GASGGILI+W+      +E++  VYGPN+S  R+ FW EL+D+  L  P   +GGDFNVIR + EK   + 
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEGVYGPNSSRERRLFWKELTDLQALCPPNLILGGDFNVIRWTWEKSSYNV

Query:  PTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEKWGPPPFRFINAWLNHHSFLR
         T  MK F+ FI D  LID PL +  YTWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     KWGP PFRF N WL H +F  
Subjt:  PTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEKWGPPPFRFINAWLNHHSFLR

Query:  KVDQWWKSVPSLGWSGN---------LTEQESYRRTEI------EADLLSISANVEML-------------------------------WRHSCKVKWLK
           +WW      GW G+           + + + +T        + D+L++ AN + L                               WR   +VKW+K
Subjt:  KVDQWWKSVPSLGWSGN---------LTEQESYRRTEI------EADLLSISANVEML-------------------------------WRHSCKVKWLK

Query:  EGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSG
        EGD NS FFH++    R +  I E+ ++SG  L N   I++E + + EKLY      +   +  +WSPI  E    LE PFTE+EI +A++ +  +K  G
Subjt:  EGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSG

Query:  PDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        PDGFT   F+  W+++K+D+  VFT+F ++GIIN S
Subjt:  PDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein4.0e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

A0A438JX47 LINE-1 retrotransposable element ORF2 protein4.0e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

A5BCI7 Reverse transcriptase domain-containing protein1.4e-7935.96Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN S  R+ FW EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + N      DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGNLTEQE-------------------SYRRTEIEADLLSISA-----------------------
        WGP PFRF N WL H SF     +WW+     GW G+   ++                   S R+ +I +DL++  +                       
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGNLTEQE-------------------SYRRTEIEADLLSISA-----------------------

Query:  ----NVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
              E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  ----NVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

A5CAA2 Reverse transcriptase domain-containing protein4.0e-7935.53Show/hide
Query:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL
        ETK  + DR+ + SVW++RN  WA++ A GASGGILI+W+      +E++ G                    VYGPN+S  R+  W EL+D+  L  P  
Subjt:  ETKMADIDRKIIKSVWSSRNIAWASIDAIGASGGILILWNESSFVVKEIIEG--------------------VYGPNSSRERRLFWKELTDLQALCPPNL

Query:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK
         +GGDFNVIR + EK   +  T +MK F+ FI D  LID+PL +  +TWS+ + NP     DRFL S+     +  +    L R TSDH+PI L     K
Subjt:  ILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEK

Query:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL
        WGP PFRF N WL H SF     +WW+     GW G+                                              L+ +   +R   + +L 
Subjt:  WGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGN----------------------------------------------LTEQESYRRTEIEADLL

Query:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP
         +    E+ WR   +VKW+KEGD NS FFH++    R +  I E+ +++G  + N   I++E + + EKLYT     +   +  +WSPIS E    LE P
Subjt:  SISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVP

Query:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS
        FTE+EI +A++ +  +K  GPDGFT   F+  W ++K+D+  VFT+F ++GIIN S
Subjt:  FTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNGIINAS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.1e-0729.13Show/hide
Query:  RIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKK-DNIAPLPQ-IDEWS--PISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTA
        R++   R K+ I  I +D G+  T+  +IQ     +++ LY  K +N+  +   +D ++   ++ E+ + L  P T  EI+  +  L T K+ GPDGFTA
Subjt:  RIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKK-DNIAPLPQ-IDEWS--PISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTA

Query:  EFFKKSWNILKDDIKGVFTDFFQNGII
        EF+++    L   +  +F    + GI+
Subjt:  EFFKKSWNILKDDIKGVFTDFFQNGII

P08548 LINE-1 reverse transcriptase homolog1.5e-0629.66Show/hide
Query:  RRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKK-DNIAPLPQIDE---WSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTAEFFKKS
        R KS I  I + +    T+  +IQK    +++KLY+ K +N+  + Q  E      +S ++ ++L  P +  EI   + +L   K+ GPDGFT+EF++  
Subjt:  RRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKK-DNIAPLPQIDE---WSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTAEFFKKS

Query:  WNILKDDIKGVFTDFFQN
            K+++  +  + FQN
Subjt:  WNILKDDIKGVFTDFFQN

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-0732.69Show/hide
Query:  RIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLY-TKKDNIAPLPQ-IDEWS--PISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTA
        R+   HR K  I +I ++ G+  T+  +IQ    +F+++LY TK +N+  + + +D +    ++ +Q D L  P + +EI   +  L T K+ GPDGF+A
Subjt:  RIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLY-TKKDNIAPLPQ-IDEWS--PISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTA

Query:  EFFK
        EF++
Subjt:  EFFK

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.0e-2726.32Show/hide
Query:  LILGGDFNVIRWTWEKSSY---NVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYR-SNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLT
        +IL GDF+ I  T +  S    ++P R +++F   + D+ L+DIP     YTWS+++  NP +   DR + + +  S + +A A       SDH P  + 
Subjt:  LILGGDFNVIRWTWEKSSY---NVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYR-SNPTMTLTDRFLISDNIVSKYLAASAQRLSRITSDHFPISLT

Query:  L-GKEKWGPPPFRFINAWLNHHSFLRKVD-QWWKSVP------SLG----------------WSGNL---TEQESYRRTEIEADLLS-------------
        L    K     FR+ +    H +FL  +   W + +P      SLG                  GN+   T++       I++ LL+             
Subjt:  L-GKEKWGPPPFRFINAWLNHHSFLRKVD-QWWKSVP------SLG----------------WSGNL---TEQESYRRTEIEADLLS-------------

Query:  ------ISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLY-TKKDNIAP--LPQIDEWSPISVE
               +A +E  +R   ++KWL++GD N+ FFH+++ A++ K+ I  +  D    + N   +++  +A++  L  +  D + P  + +I +  P    
Subjt:  ------ISANVEMLWRHSCKVKWLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLY-TKKDNIAP--LPQIDEWSPISVE

Query:  Q--RDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNG
              L    +++EI  AV+ +  NK  GPD FTAEFF +SW ++KD       +FF+ G
Subjt:  Q--RDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTAEFFKKSWNILKDDIKGVFTDFFQNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGCAAGCTTTTCATAGAGGAACACAGTCAGTATCCAATGATTGAGGATCCCACGCCGCTGAGAATAGAAGATCCCAATAACCGATGCACTAGCCTCTCGATAAA
TTTGGAGGACTCTTCCTTAATTGATATTGCCGTAGAGGAAGAAGAACAAGAAAACCAGAATTTGGCTGGAACAAAAACAGACCCAGCGACTTACCTTCCAATCTTGTTCC
CATGGTTAACTGAACATGAAACCAAGATGGCTGATATTGACCGGAAGATCATTAAATCTGTTTGGAGCTCGAGGAACATAGCTTGGGCTTCCATTGATGCTATCGGTGCC
TCTGGAGGGATATTGATCCTTTGGAATGAATCCTCTTTTGTCGTTAAGGAGATTATTGAAGGAGTATATGGCCCCAACTCCTCCAGAGAGAGGCGTTTATTCTGGAAGGA
ATTAACAGATCTCCAGGCTCTTTGTCCCCCTAACTTGATTTTGGGGGGCGACTTTAATGTCATTCGATGGACATGGGAAAAATCCTCTTATAATGTGCCAACCCGAGCCA
TGAAGAAATTCAACCGTTTCATTGTGGACAATGGCCTTATAGACATTCCCCTCGCCAATGGAAAGTACACTTGGTCCAGTTATCGGTCAAACCCCACAATGACCCTCACT
GATAGATTCCTCATCTCTGATAATATTGTATCCAAGTATCTAGCAGCTTCGGCCCAGAGATTGTCCAGAATCACCTCTGACCATTTCCCTATTAGCCTTACGTTGGGGAA
AGAAAAATGGGGGCCTCCTCCTTTCAGATTCATTAATGCGTGGCTCAATCATCATTCCTTTCTCCGAAAGGTTGATCAATGGTGGAAGAGCGTTCCATCTCTTGGATGGT
CGGGAAATCTGACTGAGCAAGAATCCTACAGAAGAACTGAGATTGAAGCCGATTTACTCTCGATATCTGCCAATGTTGAGATGCTGTGGCGACATTCTTGCAAAGTTAAG
TGGCTAAAGGAGGGGGATGTAAACTCGGCCTTTTTCCACCGTATTATGGCAGCCCACAGAAGAAAGAGCACCATAGTGGAGATTGTATCAGATTCGGGCAATAGCCTCAC
AAATGATGGTGATATTCAAAAGGAATTCATTGCTTTTCATGAGAAGCTTTACACGAAAAAGGACAACATTGCCCCTTTACCTCAGATAGATGAATGGAGTCCCATTTCGG
TTGAACAGAGAGATTTGCTTGAAGTTCCCTTCACAGAACAGGAAATTCTCAGGGCAGTGTATGATTTGGGCACAAATAAGACCTCGGGGCCTGATGGTTTTACAGCGGAA
TTCTTCAAAAAATCTTGGAACATTCTCAAGGATGATATAAAGGGAGTGTTCACTGATTTTTTTCAGAATGGGATCATTAATGCGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGGCAAGCTTTTCATAGAGGAACACAGTCAGTATCCAATGATTGAGGATCCCACGCCGCTGAGAATAGAAGATCCCAATAACCGATGCACTAGCCTCTCGATAAA
TTTGGAGGACTCTTCCTTAATTGATATTGCCGTAGAGGAAGAAGAACAAGAAAACCAGAATTTGGCTGGAACAAAAACAGACCCAGCGACTTACCTTCCAATCTTGTTCC
CATGGTTAACTGAACATGAAACCAAGATGGCTGATATTGACCGGAAGATCATTAAATCTGTTTGGAGCTCGAGGAACATAGCTTGGGCTTCCATTGATGCTATCGGTGCC
TCTGGAGGGATATTGATCCTTTGGAATGAATCCTCTTTTGTCGTTAAGGAGATTATTGAAGGAGTATATGGCCCCAACTCCTCCAGAGAGAGGCGTTTATTCTGGAAGGA
ATTAACAGATCTCCAGGCTCTTTGTCCCCCTAACTTGATTTTGGGGGGCGACTTTAATGTCATTCGATGGACATGGGAAAAATCCTCTTATAATGTGCCAACCCGAGCCA
TGAAGAAATTCAACCGTTTCATTGTGGACAATGGCCTTATAGACATTCCCCTCGCCAATGGAAAGTACACTTGGTCCAGTTATCGGTCAAACCCCACAATGACCCTCACT
GATAGATTCCTCATCTCTGATAATATTGTATCCAAGTATCTAGCAGCTTCGGCCCAGAGATTGTCCAGAATCACCTCTGACCATTTCCCTATTAGCCTTACGTTGGGGAA
AGAAAAATGGGGGCCTCCTCCTTTCAGATTCATTAATGCGTGGCTCAATCATCATTCCTTTCTCCGAAAGGTTGATCAATGGTGGAAGAGCGTTCCATCTCTTGGATGGT
CGGGAAATCTGACTGAGCAAGAATCCTACAGAAGAACTGAGATTGAAGCCGATTTACTCTCGATATCTGCCAATGTTGAGATGCTGTGGCGACATTCTTGCAAAGTTAAG
TGGCTAAAGGAGGGGGATGTAAACTCGGCCTTTTTCCACCGTATTATGGCAGCCCACAGAAGAAAGAGCACCATAGTGGAGATTGTATCAGATTCGGGCAATAGCCTCAC
AAATGATGGTGATATTCAAAAGGAATTCATTGCTTTTCATGAGAAGCTTTACACGAAAAAGGACAACATTGCCCCTTTACCTCAGATAGATGAATGGAGTCCCATTTCGG
TTGAACAGAGAGATTTGCTTGAAGTTCCCTTCACAGAACAGGAAATTCTCAGGGCAGTGTATGATTTGGGCACAAATAAGACCTCGGGGCCTGATGGTTTTACAGCGGAA
TTCTTCAAAAAATCTTGGAACATTCTCAAGGATGATATAAAGGGAGTGTTCACTGATTTTTTTCAGAATGGGATCATTAATGCGAGCTGA
Protein sequenceShow/hide protein sequence
MIGKLFIEEHSQYPMIEDPTPLRIEDPNNRCTSLSINLEDSSLIDIAVEEEEQENQNLAGTKTDPATYLPILFPWLTEHETKMADIDRKIIKSVWSSRNIAWASIDAIGA
SGGILILWNESSFVVKEIIEGVYGPNSSRERRLFWKELTDLQALCPPNLILGGDFNVIRWTWEKSSYNVPTRAMKKFNRFIVDNGLIDIPLANGKYTWSSYRSNPTMTLT
DRFLISDNIVSKYLAASAQRLSRITSDHFPISLTLGKEKWGPPPFRFINAWLNHHSFLRKVDQWWKSVPSLGWSGNLTEQESYRRTEIEADLLSISANVEMLWRHSCKVK
WLKEGDVNSAFFHRIMAAHRRKSTIVEIVSDSGNSLTNDGDIQKEFIAFHEKLYTKKDNIAPLPQIDEWSPISVEQRDLLEVPFTEQEILRAVYDLGTNKTSGPDGFTAE
FFKKSWNILKDDIKGVFTDFFQNGIINAS