; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029037 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029037
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold5:16923279..16930513
RNA-Seq ExpressionSpg029037
SyntenySpg029037
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015382608.1 uncharacterized protein LOC107175577 [Citrus sinensis]2.3e-7533.94Show/hide
Query:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG
        L +  CF V C GR GGL L+W ++  V I+S+SK HIDA ++  + +    + +YGHP   Q+ HTW L+  L     S W+  GD N  L   EK GG
Subjt:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG

Query:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRR-PRLFRF
             + I  F  AL DC L D+ Y G  FTWSN +  +  I ERLD F+ N  +   F N +  +LN   SDH P+L  V  +      +RR      +
Subjt:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRR-PRLFRF

Query:  EEVWIQHPECKDLISDMGCWA-----DQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDF-SEIKCVEDQLDHAFEEDEI
        E++W  +  CK+++S+   W+       GN         ++   RL  W +  +      ++ LQ+ LQ    +   ++   EIK VE+Q+ +   ++EI
Subjt:  EEVWIQHPECKDLISDMGCWA-----DQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDF-SEIKCVEDQLDHAFEEDEI

Query:  YWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVE
        YWKQRS  +WL+ GDKNT++FH++A+ RKK+N I  ++  +G+ +   K +E+ F  YFT +F+++ P    ++  L+ I  R++++MN+ L  PF+  E
Subjt:  YWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVE

Query:  IERAIKQMHPSKAHGPDGFSACFYQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLV
        +  A+ QM P+KA GPDG  A FYQK W EV  G           +L     +    +    +L   L AE   +IHG+   + L +S      DSL+
Subjt:  IERAIKQMHPSKAHGPDGFSACFYQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLV

XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]2.0e-7135.58Show/hide
Query:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKWD--SKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEG
        L  S   +V    RSGGL L+W  ++ V+++SYS+ HIDA +  +  S+ W F+  YG+P  S+R  +W L+  L + +   W+  GD N  +   EKEG
Subjt:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKWD--SKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEG

Query:  GIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRF
        G      Q+  F +A++ C L+DL Y+G+ FTWS R      + ERLD  L +  +   FP   + H   + SDH  +L   S  P+     R P+ FRF
Subjt:  GIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRF

Query:  EEVWIQHPECKDLISD---MGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEIYW
        E +W++   C D++S     G  +D G+    L +CL++C+  L  W +  +  +   I +LQ  L+ L   K  P    EI C   +L+   E +E+ W
Subjt:  EEVWIQHPECKDLISD---MGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEIYW

Query:  KQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIE
         QRS  +WL+ GDKNT +FH +A+ R +RN I  +Q  NGE  ++ + + + F+ YF ++F++SNP +   D  L  +  ++T  MN  LL  F   E+E
Subjt:  KQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIE

Query:  RAIKQMHPSKAHGPDGFSACFYQKFWSEVA
        RA+KQM P+ A GPDG    FYQ +W  V+
Subjt:  RAIKQMHPSKAHGPDGFSACFYQKFWSEVA

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]6.3e-7335.71Show/hide
Query:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG
        L Y  CF V   G  GGL L+W +++DV I SYS  HIDA I   D K+W  S IYGHP   Q+ HTW L+  L       W+  GD N  L   EK GG
Subjt:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG

Query:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRR--PRLFR
               +  F + ++DC L DL   G  FTWSNR+     I ERLD FL ++++     N  V +L+   SDH P++  +  +     + +   PR+F 
Subjt:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRR--PRLFR

Query:  FEEVWIQHPECKDLISDM----GCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEI
        +E++W  +  CK+++ +     G W+ QG+     +     C  +LR W R  +      +  L++ L ++ ++     + SEIK  E+Q++    ++E+
Subjt:  FEEVWIQHPECKDLISDM----GCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEI

Query:  YWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVE
        YWKQRS  +WL+ GDKNT++FH++A+ RK++N I  V   +   +D+++ +E+ F  YF ++F++S+P  + I+ AL  +  R+T  MN++L +PF+  E
Subjt:  YWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVE

Query:  IERAIKQMHPSKAHGPDGFSACFYQKFWSEVAIG
        +  A+ QM P+KA GPDG  A F+QK W  V +G
Subjt:  IERAIKQMHPSKAHGPDGFSACFYQKFWSEVAIG

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.9e-0524.82Show/hide
Query:  YQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLVAIKMIRGEMQITSEVHHWVVQIQNM
        +QK   + A+  +N+ A  G ++   DG  R A      +  +   AE   +  G+++ ++  ++      DSL  I +I  +    +E+   +  IQ  
Subjt:  YQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLVAIKMIRGEMQITSEVHHWVVQIQNM

Query:  KLSFQELSFSHIPREANRGADYLARDALTRSQSILWLENLP
          +FQ     H PR+ N  A  LA+ AL + ++++WL+ +P
Subjt:  KLSFQELSFSHIPREANRGADYLARDALTRSQSILWLENLP

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.1e-7237.35Show/hide
Query:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE
        + L +  CF VD  GRSGGL L+W DDI++ I +YS  HI ASI   D   W  + +YGH  + QR+  W L+  L       WI+ GD N  L H EK 
Subjt:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE

Query:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR
        GG    D Q+++F + L DC L+DL YVG  FTWSNR+  E  + ERLD FLAN  +  +FPN  V H   A+SDH P+              R  RLFR
Subjt:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR

Query:  FEEVWIQHPECKDLISDMGCWADQ-GNVK-PHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFSEIKCVED------QLDHAFEE
        FE +W+   EC  +I  +  W  + G +    +   +  C T L RW + ++  +  ++ T +R LQ L         S   C+E+      ++    E 
Subjt:  FEEVWIQHPECKDLISDMGCWADQ-GNVK-PHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFSEIKCVED------QLDHAFEE

Query:  DEIYWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFS
        DE+ WKQRS   WL+ GD N+++FH++A+ R+++N I ++Q  +G +     Q++     YF  +F++++    D+++ L  +  R+T  MN+ LL P+ 
Subjt:  DEIYWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFS

Query:  RVEIERAIKQMHPSKAHGPDGFSACFYQKFW
          E+E A+KQMHPSKA GPDG    F+QK+W
Subjt:  RVEIERAIKQMHPSKAHGPDGFSACFYQKFW

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]2.0e-7135.92Show/hide
Query:  GGLCLMWMDDIDVTIRSYSKFHIDASIKWDSK-LWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIPVRDSQIQQFHDAL
        GGL L+W +D+ + + S+SK+HIDA +   S+  W  +  YG P  S+R+  W ++  L ++    W   GD N  L   +K GG+P   +Q+Q F DAL
Subjt:  GGLCLMWMDDIDVTIRSYSKFHIDASIKWDSK-LWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIPVRDSQIQQFHDAL

Query:  DDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEVWIQHPECKDLISD
        D CG  DL + G  FTW  R+  E +I ERLD  +AN  ++  FP G VQHLN   SDHRP+L S+          R+P  FRFE +W+ +P CK  +++
Subjt:  DDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEVWIQHPECKDLISD

Query:  MGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQWGDKNTQWF
              +G    +    +++CK RL+RW + T+ ++   I  ++  L           D   +  ++ +L    E++E  W QRS   WLQ GD+NT++F
Subjt:  MGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDL-YSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQWGDKNTQWF

Query:  HNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSA
        H  AT RK++N I+ ++  NG     ++        ++  +F SSNP  ++ID  +  +   +T +MN  L  P+S  E+ERAIK M P KA GPDG   
Subjt:  HNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSA

Query:  CFYQKFWSEVAI
         FYQ +WS+V++
Subjt:  CFYQKFWSEVAI

TrEMBL top hitse value%identityAlignment
A0A2N9F5W1 Reverse transcriptase domain-containing protein1.3e-7135.93Show/hide
Query:  YSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKWD-SKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIP
        + G F+V   G+SGGL L W  +I V+I SYS  HIDA + +D    W F+  YG PTA+ ++  W+L+   ++     W  GGD N  L  EEK G + 
Subjt:  YSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKWD-SKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIP

Query:  VRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEV
          + Q+++F   +DDCG  DL +VG  +TW N+Q    ++ ERLD  LA  +++  FPN  V HL    SDHRP+   +     N    R  + FRFEE+
Subjt:  VRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEV

Query:  WIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTL-QDLYSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCE
        W  HP C++ I         G++   +   ++  +  L++W    + SIW  I T  R L Q++   P   + S I+ +  +L     ++E  WKQRS  
Subjt:  WIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTL-QDLYSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCE

Query:  NWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQM
         WLQ GD+NT++FH QAT RK+RN I  ++   G      +++E   + Y+ ++F++S P   + D  L  +   IT +MN +L   F+  E+E A+ QM
Subjt:  NWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQM

Query:  HPSKAHGPDGFSACFYQKFWSEV
         P KA G DG +  FYQK+W+ V
Subjt:  HPSKAHGPDGFSACFYQKFWSEV

A0A2N9GJ35 Uncharacterized protein3.4e-7232.9Show/hide
Query:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDAS-IKWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG
        L   GC  V+  G+ GGL L+W   + + I+SYS+ HID   ++ D   W  +  YG+P A  R+ +W+L+  L++  D  W+I GD N     EEK G 
Subjt:  LNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDAS-IKWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGG

Query:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILF-SVSMKPNNDHHTRRPRLFRF
             +Q+  F +AL DC LQD+ + G  FTWSN +E    +  RLD  +A+  ++ LFP+ S+ HL  A SDH  +L  S + +P N    R+ R+FRF
Subjt:  IPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILF-SVSMKPNNDHHTRRPRLFRF

Query:  EEVWIQHPECKDLISDMGCWADQ--GNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSK-PPPWDFSEIKCVEDQLDHAFEEDEIYWK
        E+ W++   C+++I     W  Q  G     +   +++C+ +L +W +         ID+  + LQ+L  K    +D  +I  ++  L+   E+ EI W+
Subjt:  EEVWIQHPECKDLISDMGCWADQ--GNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSK-PPPWDFSEIKCVEDQLDHAFEEDEIYWK

Query:  QRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIER
        QRS   WL  GD+NT++FH  A++RKK N I  ++           ++E+  + YF+++F+SSNP    ID  L ++   +T  MN+ L+ PF++ EI+R
Subjt:  QRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIER

Query:  AIKQMHPSKAHGPDGFSACFYQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNV----------IIHGI---RLLQRLEVSIA
        A+ QMHPSK+ GPDG SA F+QK+W  V     N       +   ++G + G++ +   +L   +AA  N+          +I+ I    L+ R++  + 
Subjt:  AIKQMHPSKAHGPDGFSACFYQKFWSEVAIGSDNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNV----------IIHGI---RLLQRLEVSIA

Query:  TVFLDSLVAI---KMIRGEMQITSEVHHWVVQIQN
         V  DS  A    +MI   + I  E  H++  +QN
Subjt:  TVFLDSLVAI---KMIRGEMQITSEVHHWVVQIQN

A0A2N9GQE6 Reverse transcriptase domain-containing protein2.8e-7133.72Show/hide
Query:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE
        + L +  CF +   G SGGL L+W D +++ I++Y++ HIDA ++   S+ W F+  YGHP + +   +W L++ L   D   W+  GD N  L  EE+ 
Subjt:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKW-DSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE

Query:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR
        G +P    ++Q F++ ++ CGL D+ + G  FTW NR++    + +RLD  LA  +++  F   SV H+  +HSDH PIL  + +  ++    RRPR  +
Subjt:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR

Query:  FEEVWIQHPECKDLISDMGCWAD---QGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWD-FSEIKCVEDQLDHAFEEDEIY
        FEE W  HPEC+ +I ++  WAD   QG+    +   +++C+ RL  W +         I     +L  L       D  S I   + +++     +E++
Subjt:  FEEVWIQHPECKDLISDMGCWAD---QGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWD-FSEIKCVEDQLDHAFEEDEIY

Query:  WKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEI
        W+QRS   WL  GD NT++FHN A +R++ N +  +     E   + +++E   + YF  +F+SS+P  + I   L  + + ++++ N +LL PF+  E+
Subjt:  WKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEI

Query:  ERAIKQMHPSKAHGPDGFSACFYQKFWSEV
          A+ QMHPSKA GPDG S+ F+QKFW  V
Subjt:  ERAIKQMHPSKAHGPDGFSACFYQKFWSEV

A0A2N9IXK4 RNase H domain-containing protein3.7e-7135.47Show/hide
Query:  IVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIPVRD-S
        +V    + GGL + W  +  V+I+S+S  HIDA I + +   W F+  YG P   +R+ +W+L+  L +Q    W   GD N  L +EEK+GG P+R   
Subjt:  IVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIPVRD-S

Query:  QIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEVWIQH
        Q+Q F DA+D CG +DL + G  FTW N +     + ERLD  LA  ++I LFP   VQHL+   SDH PI  S    P+     R  R+FRFEE+W+ H
Subjt:  QIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEVWIQH

Query:  PECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPW-DFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQ
        P CK+ I+        G     + + L+ C+  LR+W R ++ ++   +    + L++  S+       ++   ++ +++     +E  W+QRS + WL+
Subjt:  PECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPW-DFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQ

Query:  WGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQMHPSK
        WGDKNT +FH+ AT+R++RN I E+Q  +G   ++++ +   F  +F ++FSSS+P   + D+ L  +   +T  MN  L+  F+  E++ A+KQM PS 
Subjt:  WGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQMHPSK

Query:  AHGPDG
        A GPDG
Subjt:  AHGPDG

A0A2N9J109 Uncharacterized protein1.2e-7234.95Show/hide
Query:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE
        +  ++    +V    + GGL L W  D+D+ I SYS  HID  +    S  WCF+  YG P   +R  +WNL+  L+ Q D  W  GGD N  +  EEK+
Subjt:  MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASI-KWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKE

Query:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR
        G I   DSQ+Q F DALD CG  DL Y+G  FTW N + +   + ERLD  +A   ++ +FP   V HL++  SDH+P+  S    P ++H+ R  + F 
Subjt:  GGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFR

Query:  FEEVWIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFS-EIKCVEDQLDHAFEEDEIYWKQ
        FEE+W+    C + I++    +  GN    + + L  C+  L+ W +  + SI   +   +R L+         + S  I  +  ++     ++E  W+Q
Subjt:  FEEVWIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFS-EIKCVEDQLDHAFEEDEIYWKQ

Query:  RSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERA
        RS   WL++GD+NT +FH++AT R++RN I  ++  +G     Q Q++     YF ++F +SNP    ID  LQ +P  +T +MN+ L  P++  E+E A
Subjt:  RSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERA

Query:  IKQMHPSKAHGPDGFSACFYQKFWSEVAIGSD
        ++QM P  A GPDG    FYQ  W    IG D
Subjt:  IKQMHPSKAHGPDGFSACFYQKFWSEVAIGSD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.6e-0520.57Show/hide
Query:  KQRSCENWLQWGDKNTQWFHNQAT-----------RRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMND
        K+   +  LQ  +++  WF  +             +++++N+I  ++   G++  +  +++     Y+ +++++   ++E++D  L    + R+ +   +
Subjt:  KQRSCENWLQWGDKNTQWFHNQAT-----------RRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMND

Query:  RLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFWSEV
         L  P +  EI   I  +   K+ GPDGF+A FYQ++  E+
Subjt:  RLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFWSEV

P08548 LINE-1 reverse transcriptase homolog2.1e-0719.66Show/hide
Query:  RLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRL--FRFEEVWIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLR-
        ++D  L +++ +  F    ++ +    SDH  I   ++   N   HT+  +L     ++ W+     K++   +       N   + +N     K  LR 
Subjt:  RLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRL--FRFEEVWIQHPECKDLISDMGCWADQGNVKPHLENCLQKCKTRLR-

Query:  ----------RWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENW-LQWGDKNTQWFHNQATRRKKRNEIREV
                  +  R   +++  H+  L++   + +S P P    EI  +  +L+   E   I  +    ++W  +  +K  +   N   +++ ++ I  +
Subjt:  ----------RWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENW-LQWGDKNTQWFHNQATRRKKRNEIREV

Query:  QGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFWSEV
        +  N E+  +  ++++    Y+  ++S    ++++ID  L+   + R+++   + L  P S  EI   I+ +   K+ GPDGF++ FYQ F  E+
Subjt:  QGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFWSEV

P11369 LINE-1 retrotransposable element ORF2 protein4.0e-0627.37Show/hide
Query:  IREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKF
        I +++   G++  + ++++     ++  ++S+   +++++D  L    V ++ ++  D L +P S  EIE  I  +   K+ GPDGFSA FYQ F
Subjt:  IREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPV-RITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKF

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.3e-2326.97Show/hide
Query:  DSAWIIGGDLN---ATLLH-EEKEGGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDH
        D   I+ GD +   AT  H    +  IP+R   +++F + L D  L D+   G  +TWSN Q+ +  I  +LD  +AN ++   FP+          SDH
Subjt:  DSAWIIGGDLN---ATLLH-EEKEGGIPVRDSQIQQFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDH

Query:  RPILFSVSMKPNNDHHTRRPRLFRFEEVWIQHPECKDLISDMGCWADQGNVKPHLEN------CLQKCKTRLRRWGRGTYSSIWCH-IDTLQRTLQDLYS
         P +  +   P      R  + FR+      HP    L+S    W +Q  V  H+ +        +KC   L R G G         +D+L+     L +
Subjt:  RPILFSVSMKPNNDHHTRRPRLFRFEEVWIQHPECKDLISDMGCWADQGNVKPHLEN------CLQKCKTRLRRWGRGTYSSIWCH-IDTLQRTLQDLYS

Query:  KPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHM-EDI
         P    F        + +      E +++Q+S   WLQ GD NT++FH      + +N I+ ++  +   ++N  Q++E  + Y+T++  S +  +  D 
Subjt:  KPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIREVQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHM-EDI

Query:  DNALQDI-PVRITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFW
           ++DI P R    +  RL    S  EI  A+  M  +KA GPD F+A F+ + W
Subjt:  DNALQDI-PVRITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFW

AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-0427.35Show/hide
Query:  DNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLVAIKMIRGEMQITSEVHHWVVQIQNMKLSFQELSFSHI
        DN R   G ++  E G V+       P L + L AE+  +   +  L R + +      DS V I+++  + +I   +   +  +Q +   F E+ F  I
Subjt:  DNRRARYGAIILGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLVAIKMIRGEMQITSEVHHWVVQIQNMKLSFQELSFSHI

Query:  PREANRGADYLARDALT
        PRE N  A+ +AR++L+
Subjt:  PREANRGADYLARDALT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTGAATTATTCGGGTTGCTTTATTGTGGATTGTGAGGGTCGGAGTGGAGGCTTATGCTTGATGTGGATGGATGATATTGATGTCACTATACGATCATATTCTAA
ATTTCATATTGATGCATCAATTAAGTGGGATTCTAAGTTGTGGTGCTTTTCTAGGATATATGGTCATCCAACTGCTAGTCAAAGAAATCATACATGGAATTTGATTCATT
GTTTACAGAACCAGGATGATTCTGCATGGATTATTGGGGGCGATTTAAATGCTACACTATTACATGAGGAAAAGGAAGGTGGAATTCCTGTAAGGGATTCCCAAATCCAA
CAGTTTCATGATGCATTAGATGATTGTGGACTACAAGATCTGGATTATGTGGGAGAATCTTTTACGTGGTCCAATAGACAGGAGGCAGAAACTCAAATTAATGAAAGATT
GGACGGGTTCCTAGCAAATGAGAATTTCATTCATCTTTTCCCTAATGGATCAGTTCAACATTTAAATTGGGCTCATTCTGATCATCGCCCAATCCTGTTCAGTGTAAGTA
TGAAGCCCAATAATGATCACCATACAAGGAGGCCCAGGCTGTTCCGATTTGAAGAAGTTTGGATTCAACATCCAGAATGTAAGGATCTAATTTCGGATATGGGTTGTTGG
GCAGACCAAGGTAATGTGAAGCCACACCTGGAGAATTGTCTCCAGAAATGCAAAACACGCTTGAGAAGATGGGGTAGAGGCACGTACTCTTCCATTTGGTGCCATATAGA
TACTCTGCAGCGGACACTACAAGACCTGTACAGCAAGCCTCCACCATGGGATTTCAGTGAAATAAAGTGTGTAGAAGATCAGCTCGACCATGCTTTTGAAGAGGATGAAA
TATATTGGAAACAGAGATCATGTGAAAACTGGCTTCAATGGGGAGACAAGAATACGCAATGGTTCCATAATCAAGCGACAAGGAGGAAAAAGAGGAACGAAATTCGAGAG
GTTCAGGGTCCGAATGGTGAACTGATTGACAACCAAAAGCAACTGGAGGAGGCTTTCTTGTTGTATTTCACTAATATGTTTTCCTCTTCCAATCCACATATGGAAGATAT
TGATAATGCGTTGCAGGATATTCCGGTCAGAATAACGAAAAACATGAATGACAGATTGTTAACACCATTCAGCAGAGTTGAGATTGAAAGAGCTATTAAGCAAATGCATC
CATCCAAGGCTCATGGACCTGATGGTTTCTCTGCGTGTTTCTACCAGAAATTTTGGAGTGAGGTAGCGATTGGATCAGATAATAGAAGGGCAAGATATGGGGCAATTATC
CTTGGAGAGGATGGTTTGGTGCGTGGCGCAATGAAGTATATGGATCCCATACTTCACACACCATTGGCGGCTGAGGTAAATGTCATTATTCATGGTATTCGATTGTTGCA
ACGTTTGGAAGTATCCATCGCTACTGTTTTCTTAGATTCATTAGTAGCCATCAAGATGATTCGAGGTGAAATGCAAATTACATCAGAAGTACATCATTGGGTTGTCCAAA
TCCAGAATATGAAGCTTTCTTTTCAGGAGTTGTCATTCTCCCACATTCCTAGGGAGGCTAATAGGGGAGCAGATTATCTAGCTAGGGATGCATTAACTAGATCGCAATCA
ATTCTTTGGTTGGAAAATTTACCTGATTGGTTGATTTCAATGGGTAGTTTTACCCATCGCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTGAATTATTCGGGTTGCTTTATTGTGGATTGTGAGGGTCGGAGTGGAGGCTTATGCTTGATGTGGATGGATGATATTGATGTCACTATACGATCATATTCTAA
ATTTCATATTGATGCATCAATTAAGTGGGATTCTAAGTTGTGGTGCTTTTCTAGGATATATGGTCATCCAACTGCTAGTCAAAGAAATCATACATGGAATTTGATTCATT
GTTTACAGAACCAGGATGATTCTGCATGGATTATTGGGGGCGATTTAAATGCTACACTATTACATGAGGAAAAGGAAGGTGGAATTCCTGTAAGGGATTCCCAAATCCAA
CAGTTTCATGATGCATTAGATGATTGTGGACTACAAGATCTGGATTATGTGGGAGAATCTTTTACGTGGTCCAATAGACAGGAGGCAGAAACTCAAATTAATGAAAGATT
GGACGGGTTCCTAGCAAATGAGAATTTCATTCATCTTTTCCCTAATGGATCAGTTCAACATTTAAATTGGGCTCATTCTGATCATCGCCCAATCCTGTTCAGTGTAAGTA
TGAAGCCCAATAATGATCACCATACAAGGAGGCCCAGGCTGTTCCGATTTGAAGAAGTTTGGATTCAACATCCAGAATGTAAGGATCTAATTTCGGATATGGGTTGTTGG
GCAGACCAAGGTAATGTGAAGCCACACCTGGAGAATTGTCTCCAGAAATGCAAAACACGCTTGAGAAGATGGGGTAGAGGCACGTACTCTTCCATTTGGTGCCATATAGA
TACTCTGCAGCGGACACTACAAGACCTGTACAGCAAGCCTCCACCATGGGATTTCAGTGAAATAAAGTGTGTAGAAGATCAGCTCGACCATGCTTTTGAAGAGGATGAAA
TATATTGGAAACAGAGATCATGTGAAAACTGGCTTCAATGGGGAGACAAGAATACGCAATGGTTCCATAATCAAGCGACAAGGAGGAAAAAGAGGAACGAAATTCGAGAG
GTTCAGGGTCCGAATGGTGAACTGATTGACAACCAAAAGCAACTGGAGGAGGCTTTCTTGTTGTATTTCACTAATATGTTTTCCTCTTCCAATCCACATATGGAAGATAT
TGATAATGCGTTGCAGGATATTCCGGTCAGAATAACGAAAAACATGAATGACAGATTGTTAACACCATTCAGCAGAGTTGAGATTGAAAGAGCTATTAAGCAAATGCATC
CATCCAAGGCTCATGGACCTGATGGTTTCTCTGCGTGTTTCTACCAGAAATTTTGGAGTGAGGTAGCGATTGGATCAGATAATAGAAGGGCAAGATATGGGGCAATTATC
CTTGGAGAGGATGGTTTGGTGCGTGGCGCAATGAAGTATATGGATCCCATACTTCACACACCATTGGCGGCTGAGGTAAATGTCATTATTCATGGTATTCGATTGTTGCA
ACGTTTGGAAGTATCCATCGCTACTGTTTTCTTAGATTCATTAGTAGCCATCAAGATGATTCGAGGTGAAATGCAAATTACATCAGAAGTACATCATTGGGTTGTCCAAA
TCCAGAATATGAAGCTTTCTTTTCAGGAGTTGTCATTCTCCCACATTCCTAGGGAGGCTAATAGGGGAGCAGATTATCTAGCTAGGGATGCATTAACTAGATCGCAATCA
ATTCTTTGGTTGGAAAATTTACCTGATTGGTTGATTTCAATGGGTAGTTTTACCCATCGCAATTAA
Protein sequenceShow/hide protein sequence
MSLNYSGCFIVDCEGRSGGLCLMWMDDIDVTIRSYSKFHIDASIKWDSKLWCFSRIYGHPTASQRNHTWNLIHCLQNQDDSAWIIGGDLNATLLHEEKEGGIPVRDSQIQ
QFHDALDDCGLQDLDYVGESFTWSNRQEAETQINERLDGFLANENFIHLFPNGSVQHLNWAHSDHRPILFSVSMKPNNDHHTRRPRLFRFEEVWIQHPECKDLISDMGCW
ADQGNVKPHLENCLQKCKTRLRRWGRGTYSSIWCHIDTLQRTLQDLYSKPPPWDFSEIKCVEDQLDHAFEEDEIYWKQRSCENWLQWGDKNTQWFHNQATRRKKRNEIRE
VQGPNGELIDNQKQLEEAFLLYFTNMFSSSNPHMEDIDNALQDIPVRITKNMNDRLLTPFSRVEIERAIKQMHPSKAHGPDGFSACFYQKFWSEVAIGSDNRRARYGAII
LGEDGLVRGAMKYMDPILHTPLAAEVNVIIHGIRLLQRLEVSIATVFLDSLVAIKMIRGEMQITSEVHHWVVQIQNMKLSFQELSFSHIPREANRGADYLARDALTRSQS
ILWLENLPDWLISMGSFTHRN