; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g011120 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g011120
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr05:12604471..12608149
RNA-Seq ExpressionLcy05g011120
SyntenyLcy05g011120
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EPS72636.1 hypothetical protein M569_02121, partial [Genlisea aurea]1.1e-7536.51Show/hide
Query:  CYRTVINLRDKQLSTQMDKDKQNARGLSGGLALFWNDDVELQLLTYSKGHIDTILN--YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAG
        CY  +  +R + LS+   +  +  RG SGGLAL W   + + + ++S  HID +++   G+ ++R TGFYG     +R  SW+L+TRL     LPWL+ G
Subjt:  CYRTVINLRDKQLSTQMDKDKQNARGLSGGLALFWNDDVELQLLTYSKGHIDTILN--YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAG

Query:  DFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQD---
        DFNE+L + E       + +      NAL   +L D G+ G PFTW N R     V+ RLDR V N+    +     V+HL F  SDH P+ +  +D   
Subjt:  DFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQD---

Query:  -HPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILE-DLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLE
         H TL R R+    ++FE+ W   E C  +I G W + + S   +  L+  L+     L  W R   G  + +I   ++R+  L++    +    QIR  
Subjt:  -HPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILE-DLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLE

Query:  EKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEE
        + +L  +L  +EI+WKQR++  W++ GDKN K+FH  A+ R+++NKI +L++++  W++   DI H F+  Y +LFKS   +E+A+N I+    + V +E
Subjt:  EKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEE

Query:  MSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW
        M+ +L +AFT +EI TA+ QM+   APGPDGFP LFYQK+W
Subjt:  MSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]6.8e-7335.7Show/hide
Query:  KRISARQQKKHTSGAEELDNLRLMGKKPCYRTVINLRDKQLSTQMDKDKQ-NARGLSGGLALFWNDDVELQLLTYSKGHIDTIL--NYGTKRFRFTGFYG
        KRI  + +KK TS   E+D+L    ++P  + VI  RD   + +     +  A G SGGLAL W  D+ + L  +S  HID  +  N     +RFTGFYG
Subjt:  KRISARQQKKHTSGAEELDNLRLMGKKPCYRTVINLRDKQLSTQMDKDKQ-NARGLSGGLALFWNDDVELQLLTYSKGHIDTIL--NYGTKRFRFTGFYG

Query:  ELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFL
          +   RH SW L+ +L + S+  WL AGDFN +L   E+ G   A+    +  ++ L  + L D G++G PFTW N R   +  R+RLDR   N++   
Subjt:  ELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFL

Query:  LFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILE
        LF +  V HLD L SDH PL I  +    + +   R   ++FE  W + E+CE++IR +W  +       D   NL+     L  W R   G  + +I +
Subjt:  LFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILE

Query:  SKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLF
         K +I  L +        ++I    + LD +L +EE+ W+QRA+  WM+ GDKNTK+FHAKA+ RR+KN I  L   +G W ++  DIE    D++ ++F
Subjt:  SKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLF

Query:  KSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW
         S     + + E++ A +  V + ++  L   +T DE++ AL  M P K+PGPDGFP +F+Q++W
Subjt:  KSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW

XP_015382608.1 uncharacterized protein LOC107175577 [Citrus sinensis]6.6e-7635.66Show/hide
Query:  GLSGGLALFWNDDVELQLLTYSKGHIDTILNYGT-KRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN
        G  GGLAL WN++  +Q+ ++SK HID  +     ++ R T  YG  +   +  +WTL+ RL   S  PWL  GDFNEILH  E+ GG +  +       
Subjt:  GLSGGLALFWNDDVELQLLTYSKGHIDTILNYGT-KRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN

Query:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDH-PTLGRNRYRQHPYRFEEAWTRYEDCEE
         AL   EL D  Y G PFTW N R+G  F+ +RLDR V N      F +    +L+   SDH P+ + +Q+    +  +R +     +E+ W+ YE C+E
Subjt:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDH-PTLGRNRYRQHPYRFEEAWTRYEDCEE

Query:  MIRGSWGLS---QDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWG
        ++   W L+   Q  N +       K++   L  W + +  G +K++ + + +++   Q      +  +I++ E ++ N++++EEIYWKQR+R DW+K G
Subjt:  MIRGSWGLS---QDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWG

Query:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP
        DKNTK+FH KA+ R+KKN+I  ++   G W+ + +D+E  F +++  LF +   ++N +  ++      V E+M+  L++ FT +E+  AL QM PTKAP
Subjt:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP

Query:  GPDGFPALFYQKYWR
        GPDG PA FYQK+W+
Subjt:  GPDGFPALFYQKYWR

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.8e-8237.83Show/hide
Query:  GLSGGLALFWNDDVELQLLTYSKGHIDTILN-YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN
        GL GGLAL WN++V++ +++YS  HID +++    K +R +G YG  + + +  +WTL+ RL      PWL  GDFNEILH  E+ GG+D  L +     
Subjt:  GLSGGLALFWNDDVELQLLTYSKGHIDTILN-YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN

Query:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRF-EEAWTRYEDCEE
          +N   L D G  G PFTW NRR+G   + +RLDR + + D      +++V++LD  CSDH P+ + +Q+       +    P  F E+ W+ YE C+ 
Subjt:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRF-EEAWTRYEDCEE

Query:  MIRGSWGLSQDSNILEDLMCNLKQTA----TALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKW
        +++  W L   S    D +   K+T+      L  W RN+  G K+K+ + K ++  +  +      +++I+  E++++ +LL+EE+YWKQR+R DW+K 
Subjt:  MIRGSWGLSQDSNILEDLMCNLKQTA----TALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKW

Query:  GDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKA
        GDKNTK+FH+KA+ R++KN+I  +  Q   W+D RE +E  F +++ +LF +   +E  +   +   +  V  EM+++LD  FTE+E+ TAL QM PTKA
Subjt:  GDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKA

Query:  PGPDGFPALFYQKYW
        PGPDG PA F+QK+W
Subjt:  PGPDGFPALFYQKYW

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]1.0e-7336.98Show/hide
Query:  RGLSGGLALFWNDDVELQLLTYSKGHIDTILNYG-TKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGL
        R L GGLAL WN+D+ L + T+S  HID ++N G    +RFTGFYG  +  NR  SW+++  L     LPW+  GDFNEI    E+ GG        +  
Subjt:  RGLSGGLALFWNDDVELQLLTYSKGHIDTILNYG-TKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGL

Query:  NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEE
         + L+V  L+D G+ G PFTW NRRY    V  RLDR V  +D  L F +  ++HL    SDHKPL +   D  T  R    + P+RFE  W   E CE 
Subjt:  NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEE

Query:  MIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKN
        ++   W  +   + +  +M  +++  T L  W +N  G  +K +  ++  +      S      A+++L  + +  ++  EE  W QR++ +W+++GD+N
Subjt:  MIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKN

Query:  TKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPD
        TK+FH +AT+R KKN I+ L+   G+WI+  E I      +Y NLF +  +N   +  ++   Q  V E M+S L + F  +E+  ALKQM P  APGPD
Subjt:  TKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPD

Query:  GFPALFYQKYW
        GFP LFY+ +W
Subjt:  GFPALFYQKYW

TrEMBL top hitse value%identityAlignment
A0A2N9FNH6 Reverse transcriptase domain-containing protein1.2e-7538.65Show/hide
Query:  GLSGGLALFWNDDVELQLLTYSKGHID-TILNYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN
        G  GGLAL W + +++  L++S  HID TI + GT  + FTGFYG  D   RH SWTL+ RLK   D+PWL+ GDFNE+L   E+ G             
Subjt:  GLSGGLALFWNDDVELQLLTYSKGHID-TILNYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN

Query:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEM
         AL+  EL+D GY GN FTW N R+G   V +RLDR V ++  F LF    + H+ F  SDH+ L + L    T  R +Y    +RFE  W + E CEE+
Subjt:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEM

Query:  IRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRI---EALLQHSPDEPAMAQIRLEEKR-LDNVLLEEEIYWKQRAREDWMKWG
        +R +W   Q   +L  +   +K    AL  W R      K ++ +++      E   Q + D       R + +R L+ VL +EE YW+QR+   W++ G
Subjt:  IRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRI---EALLQHSPDEPAMAQIRLEEKR-LDNVLLEEEIYWKQRAREDWMKWG

Query:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP
        D+NT++FHA A+QR+KKN I  L+  +G     R  +      +++N+F++   N +A+ +++ +  +TV + M+  L   FT +EIR+AL QM PTKAP
Subjt:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP

Query:  GPDGFPALFYQKYW
        GPDG  A+FYQK+W
Subjt:  GPDGFPALFYQKYW

A0A2N9J3U0 Reverse transcriptase domain-containing protein1.2e-7538.65Show/hide
Query:  GLSGGLALFWNDDVELQLLTYSKGHID-TILNYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN
        G  GGLAL W + +++  L++S  HID TI + GT  + FTGFYG  D   RH SWTL+ RLK   D+PWL+ GDFNE+L   E+ G             
Subjt:  GLSGGLALFWNDDVELQLLTYSKGHID-TILNYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLN

Query:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEM
         AL+  EL+D GY GN FTW N R+G   V +RLDR V ++  F LF    + H+ F  SDH+ L + L    T  R +Y    +RFE  W + E CEE+
Subjt:  NALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEM

Query:  IRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRI---EALLQHSPDEPAMAQIRLEEKR-LDNVLLEEEIYWKQRAREDWMKWG
        +R +W   Q   +L  +   +K    AL  W R      K ++ +++      E   Q + D       R + +R L+ VL +EE YW+QR+   W++ G
Subjt:  IRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRI---EALLQHSPDEPAMAQIRLEEKR-LDNVLLEEEIYWKQRAREDWMKWG

Query:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP
        D+NT++FHA A+QR+KKN I  L+  +G     R  +      +++N+F++   N +A+ +++ +  +TV + M+  L   FT +EIR+AL QM PTKAP
Subjt:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP

Query:  GPDGFPALFYQKYW
        GPDG  A+FYQK+W
Subjt:  GPDGFPALFYQKYW

A0A803P2K3 Uncharacterized protein2.3e-7436.89Show/hide
Query:  GLSGGLALFWNDDVELQLLTYSKGHIDTILNYGT-KRFRFTGFYGELDPLNRHVSWTLITRLKDCSDL-PWLMAGDFNEILHEKERDGGRDATLTLSRGL
        GLSGGL L W DDV++ LL ++    D  +  G    + FT FYG     NR  +WTL+ RLKD + L PW++ GDFNEIL    + GG           
Subjt:  GLSGGLALFWNDDVELQLLTYSKGHIDTILNYGT-KRFRFTGFYGELDPLNRHVSWTLITRLKDCSDL-PWLMAGDFNEILHEKERDGGRDATLTLSRGL

Query:  NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEE
           L+   L +  Y G+PFTW+  R+ A  +++RLD C VN+    LFQSI  +HLD+  SDH+ + + +    +  + + R   +RFE+ W + E+  +
Subjt:  NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEE

Query:  MIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEP-AMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDK
        +I+ +W      N       NL Q   +L  W R K G +KKKI  ++ ++  L   +   P AM  ++  E  LD++L +EE YW+QR+R DW++ GD+
Subjt:  MIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEP-AMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDK

Query:  NTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGP
        NTK+FHA A+ R++ N I  L   +G  +  +  + +  + ++  LF +  +N++A+N  +     TV  EM+  L R FTEDEI +ALK ++P K+PG 
Subjt:  NTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGP

Query:  DGFPALFYQKYW
        DG  A+FY KYW
Subjt:  DGFPALFYQKYW

M5XHI9 Reverse transcriptase domain-containing protein8.1e-7238.31Show/hide
Query:  NARGLSGGLALFWNDDVELQLLTYSKGHIDTIL--NYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLS
        ++RG SGGLAL W ++V++ +  +S   ID  +  N G  R+R T FYG     +R  SW L+ +L   + LPWL  GDFNEIL   E++GG        
Subjt:  NARGLSGGLALFWNDDVELQLLTYSKGHIDTIL--NYGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLS

Query:  RGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYED
        +G  N ++    RD G+ G  FTW   R+G  FVR RLDR +  +    LF    V HLD   SDH P+ + ++ H T  ++RY  H + FE  WT + D
Subjt:  RGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYED

Query:  CEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWG
        CE+ I+  W    D + +  L   +KQ    L  W ++  G  K++    + ++ +L Q    E      R+ +K LD +L + E+YW QR+RE+W+K G
Subjt:  CEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWG

Query:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP
        DKNT +FH KAT RR++N I  L+  +G W   R+ I    +D++ +LF+S     + + EI+ A +  V  +M   L   F+  EI+ A+ QM P+KAP
Subjt:  DKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAP

Query:  GPDGFPALFYQKYWR
        GPDG P LFYQKYWR
Subjt:  GPDGFPALFYQKYWR

S8D5C6 Uncharacterized protein (Fragment)5.4e-7636.51Show/hide
Query:  CYRTVINLRDKQLSTQMDKDKQNARGLSGGLALFWNDDVELQLLTYSKGHIDTILN--YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAG
        CY  +  +R + LS+   +  +  RG SGGLAL W   + + + ++S  HID +++   G+ ++R TGFYG     +R  SW+L+TRL     LPWL+ G
Subjt:  CYRTVINLRDKQLSTQMDKDKQNARGLSGGLALFWNDDVELQLLTYSKGHIDTILN--YGTKRFRFTGFYGELDPLNRHVSWTLITRLKDCSDLPWLMAG

Query:  DFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQD---
        DFNE+L + E       + +      NAL   +L D G+ G PFTW N R     V+ RLDR V N+    +     V+HL F  SDH P+ +  +D   
Subjt:  DFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDHKPLEICLQD---

Query:  -HPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILE-DLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLE
         H TL R R+    ++FE+ W   E C  +I G W + + S   +  L+  L+     L  W R   G  + +I   ++R+  L++    +    QIR  
Subjt:  -HPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILE-DLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRLE

Query:  EKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEE
        + +L  +L  +EI+WKQR++  W++ GDKN K+FH  A+ R+++NKI +L++++  W++   DI H F+  Y +LFKS   +E+A+N I+    + V +E
Subjt:  EKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEE

Query:  MSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW
        M+ +L +AFT +EI TA+ QM+   APGPDGFP LFYQK+W
Subjt:  MSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.3e-0420.72Show/hide
Query:  LMAGDFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNP----FTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQ--SIVVNHLDFLCSDHK-
        L+ GDFN  L   +R   R      ++ LN+AL+ ++L D     +P    +T+ +  +  Y    ++D  V +  L    +   I+ N+L    SDH  
Subjt:  LMAGDFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNP----FTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQ--SIVVNHLDFLCSDHK-

Query:  -PLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMC-NLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEP
          LE+ +++        ++ +     + W   E     ++    +  ++N  +D    NL     A+          YK+K  + +++I+ L     +  
Subjt:  -PLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMC-NLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEP

Query:  AMAQIRLEEKRLDNVL-LEEEIYWKQRAREDWMKWGDKNTKWFHAKAT-----------QRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFV
           Q   +  R   +  +  E+  K+   +  ++  +++  WF  +             ++R+KN+I  ++   G       +I+    ++Y +L+ + +
Subjt:  AMAQIRLEEKRLDNVL-LEEEIYWKQRAREDWMKWGDKNTKWFHAKAT-----------QRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFV

Query:  LNENAVNEIM--FATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKY
         N   ++  +  +   +   EE+ S L+R  T  EI   +  +   K+PGPDGF A FYQ+Y
Subjt:  LNENAVNEIM--FATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKY

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein7.4e-0924.22Show/hide
Query:  RHVSWTLITRLKDCSDL---PWLMAGDFNEILHEKERDGGRDATLTLS--RGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLL
        R   W  ITRL   S L   PWL+ GDFN+I    E      + ++L     L   +  S+L D    G  +TW N +     +RK LDR +VN      
Subjt:  RHVSWTLITRLKDCSDL---PWLMAGDFNEILHEKERDGGRDATLTLS--RGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLL

Query:  FQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILES
        F +          SDH    + L + P L + +     +++    + + D    I  +W                K+ A     +   +     KK    
Subjt:  FQSIVVNHLDFLCSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILES

Query:  KNR-----IEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGD
         NR     I+A L  +P +       +  K  +      E ++KQ++R  W+K GD
Subjt:  KNR-----IEALLQHSPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGD

AT1G43760.1 DNAse I-like superfamily protein3.2e-2025.56Show/hide
Query:  SDLPWLMAGDFNEILHEKERDGGRDATLTLSRGL---NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDH
        +D   ++ GDF++I    +       ++ + RGL    N L  S+L D    G  +TW N +     +RK LDR + N D F  F S +        SDH
Subjt:  SDLPWLMAGDFNEILHEKERDGGRDATLTLSRGL---NNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFLCSDH

Query:  KPLEICLQDHPTLGRNRYR------QHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQH
         P  I L++ P   +  +R       HP         +E  E++  GS   S     L + +   K+    L+  G        K+ L+S   I++ L  
Subjt:  KPLEICLQDHPTLGRNRYR------QHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQH

Query:  SPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLF--KSFVLNENA
        +P +       +  K+ +      E +++Q++R  W++ GD NT++FH      + KN I  L+  D   ++    ++   + +Y +L    S +L  ++
Subjt:  SPDEPAMAQIRLEEKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLF--KSFVLNENA

Query:  VNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW
        V  I         + ++SRL    ++ EI  A+  M   KAPGPD F A F+ + W
Subjt:  VNEIMFATQQTVMEEMSSRLDRAFTEDEIRTALKQMSPTKAPGPDGFPALFYQKYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGACAGATTTGGGCCAATCTAAGGGGCAAAAGAATTTCAGCCAGACAACAAAAAAAACACACAAGTGGCGCAGAAGAACTCGATAATCTGAGGTTAATGGGGAAGAA
GCCATGCTACAGGACAGTTATCAATTTAAGAGACAAGCAACTGAGTACGCAAATGGACAAGGACAAGCAAAACGCTAGAGGCTTGAGTGGTGGGTTAGCGTTGTTTTGGA
ATGATGATGTTGAACTCCAGCTGTTGACTTACTCGAAAGGTCATATTGATACAATTTTGAATTATGGAACTAAGCGTTTTCGATTCACAGGCTTTTATGGGGAACTTGAT
CCCTTGAATAGGCATGTATCTTGGACGTTAATTACTCGCTTAAAAGATTGCTCGGATCTTCCATGGCTAATGGCAGGAGATTTCAACGAAATTCTACATGAGAAGGAAAG
AGATGGCGGTCGTGATGCAACCTTAACTCTTAGTAGGGGTCTGAATAATGCCCTTAATGTGAGTGAGTTAAGGGACGCAGGCTATATTGGGAACCCTTTCACCTGGATGA
ATAGACGATATGGAGCTTATTTTGTTAGAAAGCGTTTGGATCGCTGCGTGGTGAATTCTGACCTTTTTTTACTTTTCCAATCTATTGTGGTTAATCATCTAGACTTTTTA
TGCTCTGATCATAAACCTCTTGAAATTTGTTTGCAGGATCATCCCACCCTTGGTAGAAACAGATATCGTCAACATCCGTATCGGTTTGAGGAGGCTTGGACACGGTATGA
GGATTGTGAGGAGATGATTCGAGGCTCCTGGGGACTATCACAAGATAGTAATATTTTGGAGGATCTAATGTGTAATCTTAAGCAAACGGCTACTGCTCTTCACACATGGG
GTCGCAACAAAACAGGCGGGTACAAAAAAAAGATATTGGAGTCGAAAAATCGCATCGAGGCACTTTTACAACACTCTCCCGATGAGCCTGCTATGGCGCAAATTCGACTT
GAGGAGAAGAGATTAGATAATGTTCTGCTTGAGGAGGAAATCTACTGGAAGCAAAGAGCTCGGGAAGATTGGATGAAGTGGGGGGATAAGAATACAAAATGGTTTCATGC
AAAGGCTACCCAAAGACGTAAAAAAAATAAAATCACAAAGCTCCAGACCCAAGATGGACAATGGATTGATAAAAGGGAAGATATTGAGCATGCGTTCCTTGATTTCTATT
ATAATCTTTTTAAATCTTTTGTGTTAAATGAAAATGCAGTGAATGAGATCATGTTCGCTACTCAACAAACAGTAATGGAGGAGATGAGTTCCAGATTAGACCGAGCATTC
ACTGAGGATGAGATCAGAACAGCACTTAAGCAAATGAGCCCAACCAAAGCACCAGGTCCTGATGGTTTCCCAGCCCTGTTTTACCAAAAATATTGGCGGGGCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGACAGATTTGGGCCAATCTAAGGGGCAAAAGAATTTCAGCCAGACAACAAAAAAAACACACAAGTGGCGCAGAAGAACTCGATAATCTGAGGTTAATGGGGAAGAA
GCCATGCTACAGGACAGTTATCAATTTAAGAGACAAGCAACTGAGTACGCAAATGGACAAGGACAAGCAAAACGCTAGAGGCTTGAGTGGTGGGTTAGCGTTGTTTTGGA
ATGATGATGTTGAACTCCAGCTGTTGACTTACTCGAAAGGTCATATTGATACAATTTTGAATTATGGAACTAAGCGTTTTCGATTCACAGGCTTTTATGGGGAACTTGAT
CCCTTGAATAGGCATGTATCTTGGACGTTAATTACTCGCTTAAAAGATTGCTCGGATCTTCCATGGCTAATGGCAGGAGATTTCAACGAAATTCTACATGAGAAGGAAAG
AGATGGCGGTCGTGATGCAACCTTAACTCTTAGTAGGGGTCTGAATAATGCCCTTAATGTGAGTGAGTTAAGGGACGCAGGCTATATTGGGAACCCTTTCACCTGGATGA
ATAGACGATATGGAGCTTATTTTGTTAGAAAGCGTTTGGATCGCTGCGTGGTGAATTCTGACCTTTTTTTACTTTTCCAATCTATTGTGGTTAATCATCTAGACTTTTTA
TGCTCTGATCATAAACCTCTTGAAATTTGTTTGCAGGATCATCCCACCCTTGGTAGAAACAGATATCGTCAACATCCGTATCGGTTTGAGGAGGCTTGGACACGGTATGA
GGATTGTGAGGAGATGATTCGAGGCTCCTGGGGACTATCACAAGATAGTAATATTTTGGAGGATCTAATGTGTAATCTTAAGCAAACGGCTACTGCTCTTCACACATGGG
GTCGCAACAAAACAGGCGGGTACAAAAAAAAGATATTGGAGTCGAAAAATCGCATCGAGGCACTTTTACAACACTCTCCCGATGAGCCTGCTATGGCGCAAATTCGACTT
GAGGAGAAGAGATTAGATAATGTTCTGCTTGAGGAGGAAATCTACTGGAAGCAAAGAGCTCGGGAAGATTGGATGAAGTGGGGGGATAAGAATACAAAATGGTTTCATGC
AAAGGCTACCCAAAGACGTAAAAAAAATAAAATCACAAAGCTCCAGACCCAAGATGGACAATGGATTGATAAAAGGGAAGATATTGAGCATGCGTTCCTTGATTTCTATT
ATAATCTTTTTAAATCTTTTGTGTTAAATGAAAATGCAGTGAATGAGATCATGTTCGCTACTCAACAAACAGTAATGGAGGAGATGAGTTCCAGATTAGACCGAGCATTC
ACTGAGGATGAGATCAGAACAGCACTTAAGCAAATGAGCCCAACCAAAGCACCAGGTCCTGATGGTTTCCCAGCCCTGTTTTACCAAAAATATTGGCGGGGCTCGTGA
Protein sequenceShow/hide protein sequence
MRQIWANLRGKRISARQQKKHTSGAEELDNLRLMGKKPCYRTVINLRDKQLSTQMDKDKQNARGLSGGLALFWNDDVELQLLTYSKGHIDTILNYGTKRFRFTGFYGELD
PLNRHVSWTLITRLKDCSDLPWLMAGDFNEILHEKERDGGRDATLTLSRGLNNALNVSELRDAGYIGNPFTWMNRRYGAYFVRKRLDRCVVNSDLFLLFQSIVVNHLDFL
CSDHKPLEICLQDHPTLGRNRYRQHPYRFEEAWTRYEDCEEMIRGSWGLSQDSNILEDLMCNLKQTATALHTWGRNKTGGYKKKILESKNRIEALLQHSPDEPAMAQIRL
EEKRLDNVLLEEEIYWKQRAREDWMKWGDKNTKWFHAKATQRRKKNKITKLQTQDGQWIDKREDIEHAFLDFYYNLFKSFVLNENAVNEIMFATQQTVMEEMSSRLDRAF
TEDEIRTALKQMSPTKAPGPDGFPALFYQKYWRGS