; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000086 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000086
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold6:21064504..21077568
RNA-Seq ExpressionSpg000086
SyntenySpg000086
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONK66393.1 uncharacterized protein A4U43_C06F7380 [Asparagus officinalis]9.0e-4629.74Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGY--
        G+R  +G+G  I++  D WIP+P TF+V+ +     E+  VA+ I+PS  W+V  + +     +  +I  +P+ R    D  +WHY++ G+Y+VK GY  
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGY--

Query:  --KYSMLQQQASSSSSLETES---SWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGED
          +    ++ A+S SS+   S     WK+LW L +PNK+K F+W +    IP    L++ HV V+G C  CK G E+  HAL+ CKR  KVW++    + 
Subjt:  --KYSMLQQQASSSSSLETES---SWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGED

Query:  MMIQRHMDIQDRWLYISSCS-EAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKING--ACVSTSSSGVD---FQALVEEGEETI--
          + R  D    + +++  + +  L+   I +W IW+DRN  ++   +    +       + E F    G   C   S  G++     +L+    +    
Subjt:  MMIQRHMDIQDRWLYISSCS-EAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKING--ACVSTSSSGVD---FQALVEEGEETI--

Query:  ---MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL
           ++ D   D        G V+RD++G    VL+ P    +SPL  EA A++ GL F + +G + V +  DSL+++ AL G ++ D S+   L D    
Subjt:  ---MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL

Query:  QTSFRRNSLTEFSSLNW
          SF       F S +W
Subjt:  QTSFRRNSLTEFSSLNW

ONK66393.1 uncharacterized protein A4U43_C06F7380 [Asparagus officinalis]2.3e-0442.11Show/hide
Query:  VLCHIPRKVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMG
        +L  + RKV+ ++N+ L+ P+ R E+EAA K  P   +PG DG   LF+++YWD +G
Subjt:  VLCHIPRKVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMG

ONK66393.1 uncharacterized protein A4U43_C06F7380 [Asparagus officinalis]9.0e-4629.11Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDV-GVISHLPISRLATDAGIWHYDRKGKYTVKIGYKY
        G+R  +G+G+ + ++ D WIPRP TF+ +  K   +E + VA+ I   N+W V +L Q+   ED+  ++  L  S    D  +WH+D+KG+Y+VK GY+ 
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDV-GVISHLPISRLATDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWE----VLAPGEDMMI
        ++ Q   +   S  + S  WK  W L +P KVK F+W +  N +PT  NLW+     + +C  CK  +ET  H L +CK A K+W+    ++ P +D   
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWE----VLAPGEDMMI

Query:  QRHMDIQDRWLYISSCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKIN--GACVSTSSSGVDFQALVEEGEETI-MHTDATFD
             IQ+ W   S  S A  +L+ +  W IW+ RN  +            A     L+ +++++  G        G+D Q      +  + ++ DA   
Subjt:  QRHMDIQDRWLYISSCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKIN--GACVSTSSSGVDFQALVEEGEETI-MHTDATFD

Query:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFRR
          + K  +G ++RD +G++  V    +Q       AEA A+  GL+ A  +   S+ V SD   +V+ L       + I  +L D+R     F++
Subjt:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFRR

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.2e-0244Show/hide
Query:  KVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMG
        KVS E+N  L +P+T +++  A     P  APGPDG    F+QK+W I+G
Subjt:  KVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.9e-5432.06Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY
        G+R  +GNG  IK F+DPW+PRP TFK L     A  D  VA FI     WDV+ +S   C+ED  +I  +PIS     D+ +WHYD++G Y+V+ GYK 
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG-EDMMIQRH
         M  +  ++S+S     + W  +WKL +P K+K F+W S H  IPT  NL    +     CT C +  E+  HA F CKRA ++W  L P    +  + +
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG-EDMMIQRH

Query:  MDIQDRWLYISSCSEAV-LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVE-----EGEETIMHTDATFD
        +   + W  ++   E   L+L  I  W IWNDRN+ +H + +  +  +C W+  +L+   +   +  S  +   + + +V+           ++TDA   
Subjt:  MDIQDRWLYISSCSEAV-LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVE-----EGEETIMHTDATFD

Query:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSF
             +  G ++RD    L    S+     +SPL AE   +LEGL+FA +     + V SDSL  +Q +R          + + +I+ L   F
Subjt:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]7.0e-1434.52Show/hide
Query:  LRKCASALGGWGFRQNKLVRANIRKIKDQIKPAYESPLPLDLGIIHHLEADL--------------------------------ERVLCHIPRKVSLEIN
        ++  +SAL  WG      +   I+  K  I  AY  PLPLD  IIH LE DL                                E ++  IP +++ E+N
Subjt:  LRKCASALGGWGFRQNKLVRANIRKIKDQIKPAYESPLPLDLGIIHHLEADL--------------------------------ERVLCHIPRKVSLEIN

Query:  RQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGNKTGGIRQN-LGNGQLIKIFTDPWI
         QL+ PYT++E+E A +   P  A GPDGF  LFYQ YW ++G KT     N L NG  IK +   +I
Subjt:  RQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGNKTGGIRQN-LGNGQLIKIFTDPWI

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.3e-4729.51Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY
        G++  +G G+ I   +D W+P   TF     KG  +  ++VA+ I    QWD++ +S      D   I  +P+S   A D  IW+    G YTVK GY++
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM
        ++    +  +++  T  SWW K WKL++P+K++ FVW  FHN++P  + L R H++    C  CK   ET +HALF C RA  VW       +  +    
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM

Query:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET---------------
           D  LY+S + S    +   +  W+IW +RN   HN+P         +   YL +++    +    + S     +     +ET               
Subjt:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET---------------

Query:  --IMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL
           +++DA  ++   K  IG V+RD  G +   LS P Q C  P   EA A+   L++A +LG+    + +DSL +VQ L+    C+S+   +L D+  L
Subjt:  --IMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL

Query:  QTSFRRNSLT
         + F R  +T
Subjt:  QTSFRRNSLT

XP_030483228.1 uncharacterized protein LOC115699823 [Cannabis sativa]1.4e-4632.39Show/hide
Query:  GFSTLFYQK-YWDIMGNKTGGIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPIS-RLATD
        G S+L +Q  YW     K  GIR  +GNG  I   TDPWIP    F  +   G  N +  VAE+I P  +W++SKLS      DVG I  LP+S  +A+D
Subjt:  GFSTLFYQK-YWDIMGNKTGGIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPIS-RLATD

Query:  AGIWHYDRKGKYTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKR
          IWH D  G+Y VK  Y ++      +  S     +SWWK  W+L++P KVK F W + HN++P  A L++        CT C    E+  H +F CK 
Subjt:  AGIWHYDRKGKYTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKR

Query:  AMKVWEVLAPGEDMMIQRHMDIQDRWLYISSC-SEAVLDLICIGAWAIWNDRNNALHNR----PIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQ
        A  VW+++    +      M I+D     S C ++  L+ I    W+IW+DRNN +H +    PI        ++  Y           +S +++    +
Subjt:  AMKVWEVLAPGEDMMIQRHMDIQDRWLYISSC-SEAVLDLICIGAWAIWNDRNNALHNR----PIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQ

Query:  ALVEEGEETI-MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALR
        A        + ++ D  FDE  NK   G ++RD  G++   +S P   C  P   EA  +   L++A     +   V +DSL L  ALR
Subjt:  ALVEEGEETI-MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALR

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248743.3e-5432.06Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY
        G+R  +GNG  IK F+DPW+PRP TFK L     A  D  VA FI     WDV+ +S   C+ED  +I  +PIS     D+ +WHYD++G Y+V+ GYK 
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG-EDMMIQRH
         M  +  ++S+S     + W  +WKL +P K+K F+W S H  IPT  NL    +     CT C +  E+  HA F CKRA ++W  L P    +  + +
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG-EDMMIQRH

Query:  MDIQDRWLYISSCSEAV-LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVE-----EGEETIMHTDATFD
        +   + W  ++   E   L+L  I  W IWNDRN+ +H + +  +  +C W+  +L+   +   +  S  +   + + +V+           ++TDA   
Subjt:  MDIQDRWLYISSCSEAV-LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVE-----EGEETIMHTDATFD

Query:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSF
             +  G ++RD    L    S+     +SPL AE   +LEGL+FA +     + V SDSL  +Q +R          + + +I+ L   F
Subjt:  EPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSF

A0A6J1DX30 uncharacterized protein LOC1110248743.4e-1434.52Show/hide
Query:  LRKCASALGGWGFRQNKLVRANIRKIKDQIKPAYESPLPLDLGIIHHLEADL--------------------------------ERVLCHIPRKVSLEIN
        ++  +SAL  WG      +   I+  K  I  AY  PLPLD  IIH LE DL                                E ++  IP +++ E+N
Subjt:  LRKCASALGGWGFRQNKLVRANIRKIKDQIKPAYESPLPLDLGIIHHLEADL--------------------------------ERVLCHIPRKVSLEIN

Query:  RQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGNKTGGIRQN-LGNGQLIKIFTDPWI
         QL+ PYT++E+E A +   P  A GPDGF  LFYQ YW ++G KT     N L NG  IK +   +I
Subjt:  RQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGNKTGGIRQN-LGNGQLIKIFTDPWI

A0A6J1DX30 uncharacterized protein LOC1110248743.1e-5228.52Show/hide
Query:  EADLERVLCHIPRKVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGN----------KTG-----------------------
        E  L   L  IP  V+   N  L+ P+T  EV++A K+     +PG DG S +FYQ++W I+G+           TG                       
Subjt:  EADLERVLCHIPRKVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTLFYQKYWDIMGN----------KTG-----------------------

Query:  ---------GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGK
                 G+R  +G G+LI+   DPWIPR  +F  L   G +N    VA  I    QW+ + L QY    DV  I  LP+S   + D  IWH+   G 
Subjt:  ---------GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGK

Query:  YTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG
        +TV+  Y  +   +    SS+  +  +WWK  W L++  KVK F W + H+++P   +L R  +  D  C+ CK+  E+T HALF CK A  VW  L   
Subjt:  YTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPG

Query:  EDMMIQRHMDIQDRWLYISSC-SEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKI-NGAC-----VSTSSSGVDFQALVEEGEET
         +      M   D   ++SS  ++  ++ +    WAIW +RNN +H +      +  A+   +L+ FR     +C      + ++  +  Q L       
Subjt:  EDMMIQRHMDIQDRWLYISSC-SEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKI-NGAC-----VSTSSSGVDFQALVEEGEET

Query:  IMH----------TDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISS
          H          TDA  D   +K+ +G VLR+  G +K  LS+P    +     EA ++  GL +A+   +    +  D+L +V AL+     +S  S 
Subjt:  IMH----------TDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISS

Query:  VLWDIRGLQTSF
        ++ D+  L + F
Subjt:  VLWDIRGLQTSF

A0A803Q8J4 Uncharacterized protein1.8e-4731.61Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRLA-TDAGIWHYDRKGKYTVKIGYKY
        GIR+ +GNG  I    DPWIP  + F  +   G  N +  VA++I P  +W+ SKLS      DVG I  LP+S  A  D  IWH    G+Y VK GY +
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRLA-TDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM
        +      ++ S     ++WWK  W+L++P KVK F W + HN++P  A L++        C+ C    E+  HALF CK A  VW+V     +      M
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM

Query:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEE-----TIMHTDATFDE
        +I+D    IS + +++ L+ I    W+IW+DRNN +H +         A    +L  ++      +    S     ++ +           ++ DA FDE
Subjt:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEE-----TIMHTDATFDE

Query:  PNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIR
          NK   G ++RD  G +K  +S P   C  P   EA  +   L++A  L  +   V +DSL L  ALR      SS   +++D++
Subjt:  PNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIR

A0A803QJ22 Uncharacterized protein8.5e-5029.85Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRLA-TDAGIWHYDRKGKYTVKIGYKY
        G+R  +GNG+ I   + PW+P   +FK L  +G  N  M+V++ I+   QW+ S L+      D+ +I  +P++ +   D  IWH++  G Y+VK GY  
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRLA-TDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM
        +   ++    SS      WWKK W L++P+K++ F+W + H+ +P    L   H++    CT C    ET  HALF CKR  KVW+        ++ RHM
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM

Query:  DIQDRWLYISS-CSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGV---------DFQALVEEGEETIMHTDA
        ++++ +L +SS  S   L+      W+IW +RN   H       +    +   Y+ E+  ++G   +TSS+           D   L        ++TDA
Subjt:  DIQDRWLYISS-CSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGV---------DFQALVEEGEETIMHTDA

Query:  TFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFRRNS
          +   NKS  G VLRD  G++   +S P   C +P   EA A++  L++   L +    + +DSLS+V++L    +  S    +L +I  L ++F    
Subjt:  TFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFRRNS

Query:  LT
        +T
Subjt:  LT

A0A803QJN9 Uncharacterized protein3.0e-4729.51Show/hide
Query:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY
        G++  +G G+ I   +D W+P   TF     KG  +  ++VA+ I    QWD++ +S      D   I  +P+S   A D  IW+    G YTVK GY++
Subjt:  GIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRL-ATDAGIWHYDRKGKYTVKIGYKY

Query:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM
        ++    +  +++  T  SWW K WKL++P+K++ FVW  FHN++P  + L R H++    C  CK   ET +HALF C RA  VW       +  +    
Subjt:  SMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHM

Query:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET---------------
           D  LY+S + S    +   +  W+IW +RN   HN+P         +   YL +++    +    + S     +     +ET               
Subjt:  DIQDRWLYIS-SCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET---------------

Query:  --IMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL
           +++DA  ++   K  IG V+RD  G +   LS P Q C  P   EA A+   L++A +LG+    + +DSL +VQ L+    C+S+   +L D+  L
Subjt:  --IMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGL

Query:  QTSFRRNSLT
         + F R  +T
Subjt:  QTSFRRNSLT

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.6e-1622.98Show/hide
Query:  DIMGNKTGGIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFI-IPSNQWDVSKLSQYLCDEDVGVISHLPISRL--ATDAGIWHYDRKG
        D++ +  G I    G+GQ I+ +TD W+      ++   +   + D  VA+ + IP   WD +K+  Y  +     +  + +  +  A D   W + + G
Subjt:  DIMGNKTGGIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFI-IPSNQWDVSKLSQYLCDEDVGVISHLPISRL--ATDAGIWHYDRKG

Query:  KYTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAP
        +++V+  Y+   + +    +      +S++  LWK+R+P +VK F+W   + ++ T     R H+S   VC  CK G+E+  H L  C   + +W  + P
Subjt:  KYTVKIGYKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAP

Query:  GEDMMIQRHMDIQDRWLY-----ISSCSEAVLDLI-CIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET
                   + + WLY      S C +     I  +  W  W  R   +         DR  ++  +  E  + +   V    +    + ++      
Subjt:  GEDMMIQRHMDIQDRWLY-----ISSCSEAVLDLI-CIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEET

Query:  I----MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALR
        +    ++TD         +  G VLRD  G      SL    C +P  AE + V  GL FA    V  V +  DS  +V  L+
Subjt:  I----MHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALR

Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.6e-1125.45Show/hide
Query:  KVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGE------DMMIQRHMDIQDRWLYISSCSEAVLDLICIGA
        K+K F+W +   ++P  A L R H+S    C  C    ET+ H LF C  A +VW  LAP +         I   +++  + + +         L     
Subjt:  KVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGE------DMMIQRHMDIQDRWLYISSCSEAVLDLICIGA

Query:  WAIWNDRNNALHNRP--------IPSINDRCAW--IYRYLEEFRKINGACVSTSSSGVDFQALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELK-
        W IW  RN  +              ++ D  AW      L + R       ST++   DF          + + DA + + ++ +  G V +      K 
Subjt:  WAIWNDRNNALHNRP--------IPSINDRCAW--IYRYLEEFRKINGACVSTSSSGVDFQALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELK-

Query:  -TVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFR
         T  S   +   SPL AEA+A+   +  A+ L    + V+SDS S+V AL   +   + I  +L +IR ++  FR
Subjt:  -TVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFR

AT3G09510.1 Ribonuclease H-like superfamily protein5.3e-2023.66Show/hide
Query:  GIRQNLGNGQLIKI----FTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQ---WDVSKLSQYLCDEDVGVISHLPISR-LATDAGIWHYDRKGKYT
        G R  +G+GQ I+I      D   PRP+  +      +  ++M +            WD SK+SQ++   D G I  + +++    D  IW+Y+  G+YT
Subjt:  GIRQNLGNGQLIKI----FTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQ---WDVSKLSQYLCDEDVGVISHLPISR-LATDAGIWHYDRKGKYT

Query:  VKIGYKYSMLQQQASSS--------SSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVW
        V+ G  Y +L    S++         S++ ++    ++W L I  K+KHF+W +   ++ T   L    + +D  C  C    E+ +HALF C  A   W
Subjt:  VKIGYKYSMLQQQASSS--------SSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVW

Query:  EVLAPGEDMMIQRHMDIQDRWLYISSCSEAVLDLICIG---------AWAIWNDRNNALHNR----PIPSINDRCAWIYRYL---EEFRKINGACVSTSS
         +    +  +I+  +   D    IS+    V D               W IW  RNN + N+    P  ++    A  + +L   +  +K        + 
Subjt:  EVLAPGEDMMIQRHMDIQDRWLYISSCSEAVLDLICIG---------AWAIWNDRNNALHNR----PIPSINDRCAWIYRYL---EEFRKINGACVSTSS

Query:  SGVDFQALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCD
        + ++++           + DA FD    ++  G ++R+  G   +  S+      +PL AE  A+L  L+     G   V +  D  +L+  + G+    
Subjt:  SGVDFQALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCD

Query:  SSISSVLWDI
        SS+++ L DI
Subjt:  SSISSVLWDI

AT3G25270.1 Ribonuclease H-like superfamily protein6.5e-1021.26Show/hide
Query:  KLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHMDIQDRW-LYISSC----SEAV
        K+WKL+   K+KHF+W     ++ T  NL R H+     C  C +  ET+ H  F C  A +VW         +    + ++ +  L +SSC       +
Subjt:  KLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHMDIQDRW-LYISSC----SEAV

Query:  LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEETIM---------------HTDATFDEPNNKSRI
         +L     W +W  RN  +  +   S  +        ++E+   N     T    ++ Q      ++  M               + D  F+     ++ 
Subjt:  LDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEETIM---------------HTDATFDEPNNKSRI

Query:  GVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDS
        G ++RD+ G                L +E  A++  ++ A S G   V    DS
Subjt:  GVVLRDKQGELKTVLSLPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDS

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.0e-0732.84Show/hide
Query:  SSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMK
        ++W   +W L+I  K+K  +W + +N++P  A L   ++S++  CT C++  ET  H LF C  A +
Subjt:  SSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMK

AT4G29090.1 Ribonuclease H-like superfamily protein4.8e-2125.48Show/hide
Query:  GIRQNLGNGQLIKIFTDPWI-PRPVTFKVLMTK------GQANEDMRVAEFIIPS-NQWDVSKLSQYLCDEDVGVISHL-PISRLATDAGIWHYDRKGKY
        G R  +GNG+ I I+   W+  +P +  + M +         +  ++V++ I  S  +W    +     + +  +I  L P  R   D+  W Y   G Y
Subjt:  GIRQNLGNGQLIKIFTDPWI-PRPVTFKVLMTK------GQANEDMRVAEFIIPS-NQWDVSKLSQYLCDEDVGVISHL-PISRLATDAGIWHYDRKGKY

Query:  TVKIGY-KYSMLQQQASSSSSLETES--SWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEV--
        TVK GY   + +  + SS   +   S    ++K+WK +   K++HF+W    NS+P    L   H+S +  C  C    ET +H LF+C  A   W +  
Subjt:  TVKIGY-KYSMLQQQASSSSSLETES--SWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEV--

Query:  --LAPGEDMMIQRHMDIQDRWLYISSCS----EAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGA--C-----VSTSSSGVDF
          +  G +     ++++   W++         E    L+    W +W +RN  +      +  +        LEE+R    A  C     V+ SS G   
Subjt:  --LAPGEDMMIQRHMDIQDRWLYISSCS----EAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGA--C-----VSTSSSGVDF

Query:  QALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELKTV--LSLP--SQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDS
        +      +    +TDAT++  N +  IG VLR+++GE+K +   +LP    V  + L A  +AVL   RF  +     V   SDS  L++ L   D+   
Subjt:  QALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELKTV--LSLP--SQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDS

Query:  SISSVLWDIRGLQTSF
        S+   + D++ L + F
Subjt:  SISSVLWDIRGLQTSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAATGCCCTCTGCAGAAGGACCTTCGAAAGTGTGCCTCTGCGCTTGGTGGATGGGGTTTTCGTCAGAATAAGCTCGTTCGGGCTAATATTAGAAAGATCAAGGA
CCAAATCAAGCCAGCTTATGAAAGTCCTCTACCACTCGATTTGGGCATCATTCACCACTTAGAAGCGGATCTAGAGAGAGTCCTCTGCCATATTCCTAGAAAGGTATCTC
TTGAGATTAATCGTCAACTAATGGACCCTTATACCAGGGACGAGGTGGAAGCGGCAACCAAAAGCTTTCCCCCAATGATAGCGCCCGGGCCTGATGGTTTTTCGACTCTG
TTTTACCAGAAATATTGGGATATTATGGGCAATAAGACGGGGGGTATACGACAGAATCTAGGGAATGGGCAGTTGATTAAAATCTTTACGGATCCATGGATTCCTCGTCC
TGTCACCTTTAAGGTGTTAATGACCAAAGGCCAGGCTAATGAGGATATGCGAGTGGCAGAGTTTATCATTCCATCAAATCAGTGGGATGTATCCAAACTTAGCCAATATT
TATGTGATGAGGATGTTGGAGTGATTTCTCACTTACCAATCAGCAGGTTGGCTACGGATGCGGGGATTTGGCACTACGATAGGAAAGGGAAATATACCGTCAAAATTGGG
TATAAATATAGCATGTTGCAGCAACAGGCCTCATCGTCTTCGTCTCTAGAAACTGAGAGTAGCTGGTGGAAGAAATTATGGAAGTTGAGGATTCCAAATAAGGTCAAACA
CTTTGTCTGGAACTCATTCCATAATTCTATTCCGACGATGGCTAACCTGTGGCGTCACCATGTTTCGGTGGATGGCGTATGTACCAGTTGTAAAGAGGGTCTAGAAACTA
CCGATCATGCTCTGTTTCAGTGTAAGCGTGCGATGAAGGTATGGGAAGTTCTCGCTCCAGGTGAGGACATGATGATACAGAGACATATGGATATACAGGATCGTTGGCTC
TATATCAGTAGTTGCTCTGAGGCGGTTTTGGATCTGATTTGTATTGGAGCCTGGGCAATTTGGAACGATAGAAATAATGCTCTTCATAATCGCCCTATTCCAAGCATCAA
TGATAGGTGTGCATGGATTTATAGATACCTAGAAGAATTTAGGAAGATTAATGGGGCCTGCGTTTCAACTTCATCGTCTGGGGTAGATTTCCAAGCGTTGGTTGAAGAAG
GTGAGGAAACCATTATGCATACTGACGCGACCTTTGATGAACCCAACAATAAGTCGAGAATTGGGGTAGTGCTTCGTGACAAGCAAGGTGAATTAAAGACAGTTCTGTCC
TTACCATCTCAAGTTTGCATCTCCCCTTTATGTGCTGAAGCATTTGCAGTGCTAGAGGGACTTCGTTTCGCTATAAGTTTGGGTGTGGAAAGTGTTTCAGTGATGTCTGA
TTCTCTCTCATTGGTTCAGGCCCTTCGAGGAATGGACCAATGTGATTCGAGTATTTCATCAGTTTTGTGGGACATAAGAGGTCTTCAAACTTCTTTTCGTAGGAATAGCT
TAACAGAATTCAGTTCATTAAATTGGCTAACGAGCTGTGACGTGGCAGTTTTGGTTCCAGCTTTGCTACAACACAGAATTTGCTCCCTTAAGATAATTGAGCTTAGGCTA
TGTCGGGAGCCGATCACGCTAATTGGTGTTAATTCGGGTCAATTACGGAGTTTTGGAGCCATCTCGGTGTCTTGGACGGAGAAGAGGCCTAAGTCTCCATCTTTCCCATC
TTCCTCATCCTCTCCTCTTCATACCCACGGCAGCAACCCCCTCCCTCACACCCTCTTCTGCGGCGGCAACATTCAGAGGCTCCGGCGGACGTGCGTTTCTTCTGGCGGTT
CCACGGTGGTGCATCGGCAGTACGACGGCAGTTCGAGCTTCCGGTCGACGCAGCGAGTAGAGCTCGCAGTTTTGGGCGTGGTTGTCGCTCAATTTTGGTCCACTAAATCA
CAAGTTTTTGGTAATTTAGCGTTTTGTCCAGCAAGCTTAAAGATTTCAAATGATGAAATTCCTCCCAGAAACAGTCCAGTGCAGGCTCGACAGAACGACGTCGTTTTCGT
GCTCGACGGCTTCAAATCTGACGTCCCACAGCGTTTTCGTGCTGACAGCTTCAGATCTAACGTCCCACGACGTTTTCGTGCTCGAAGGCTTCAGATCGATGGCTACGAAT
CCTTTTTTATCTCTCTGAGTGGGGGTTTTAAAGAGAGAGAACAGAGGGTTTTAGAGGGAACAGAACAGAGAAAGAAGAAAGTGAGGTTGAAGAAGAAGAGAAAGAAGAAA
GGAAGGAGAAAAAAGTGTCGTCGGCAGTCGGCCGGCGATGGCCAGTGTGCTGGTTGGTGGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAATGCCCTCTGCAGAAGGACCTTCGAAAGTGTGCCTCTGCGCTTGGTGGATGGGGTTTTCGTCAGAATAAGCTCGTTCGGGCTAATATTAGAAAGATCAAGGA
CCAAATCAAGCCAGCTTATGAAAGTCCTCTACCACTCGATTTGGGCATCATTCACCACTTAGAAGCGGATCTAGAGAGAGTCCTCTGCCATATTCCTAGAAAGGTATCTC
TTGAGATTAATCGTCAACTAATGGACCCTTATACCAGGGACGAGGTGGAAGCGGCAACCAAAAGCTTTCCCCCAATGATAGCGCCCGGGCCTGATGGTTTTTCGACTCTG
TTTTACCAGAAATATTGGGATATTATGGGCAATAAGACGGGGGGTATACGACAGAATCTAGGGAATGGGCAGTTGATTAAAATCTTTACGGATCCATGGATTCCTCGTCC
TGTCACCTTTAAGGTGTTAATGACCAAAGGCCAGGCTAATGAGGATATGCGAGTGGCAGAGTTTATCATTCCATCAAATCAGTGGGATGTATCCAAACTTAGCCAATATT
TATGTGATGAGGATGTTGGAGTGATTTCTCACTTACCAATCAGCAGGTTGGCTACGGATGCGGGGATTTGGCACTACGATAGGAAAGGGAAATATACCGTCAAAATTGGG
TATAAATATAGCATGTTGCAGCAACAGGCCTCATCGTCTTCGTCTCTAGAAACTGAGAGTAGCTGGTGGAAGAAATTATGGAAGTTGAGGATTCCAAATAAGGTCAAACA
CTTTGTCTGGAACTCATTCCATAATTCTATTCCGACGATGGCTAACCTGTGGCGTCACCATGTTTCGGTGGATGGCGTATGTACCAGTTGTAAAGAGGGTCTAGAAACTA
CCGATCATGCTCTGTTTCAGTGTAAGCGTGCGATGAAGGTATGGGAAGTTCTCGCTCCAGGTGAGGACATGATGATACAGAGACATATGGATATACAGGATCGTTGGCTC
TATATCAGTAGTTGCTCTGAGGCGGTTTTGGATCTGATTTGTATTGGAGCCTGGGCAATTTGGAACGATAGAAATAATGCTCTTCATAATCGCCCTATTCCAAGCATCAA
TGATAGGTGTGCATGGATTTATAGATACCTAGAAGAATTTAGGAAGATTAATGGGGCCTGCGTTTCAACTTCATCGTCTGGGGTAGATTTCCAAGCGTTGGTTGAAGAAG
GTGAGGAAACCATTATGCATACTGACGCGACCTTTGATGAACCCAACAATAAGTCGAGAATTGGGGTAGTGCTTCGTGACAAGCAAGGTGAATTAAAGACAGTTCTGTCC
TTACCATCTCAAGTTTGCATCTCCCCTTTATGTGCTGAAGCATTTGCAGTGCTAGAGGGACTTCGTTTCGCTATAAGTTTGGGTGTGGAAAGTGTTTCAGTGATGTCTGA
TTCTCTCTCATTGGTTCAGGCCCTTCGAGGAATGGACCAATGTGATTCGAGTATTTCATCAGTTTTGTGGGACATAAGAGGTCTTCAAACTTCTTTTCGTAGGAATAGCT
TAACAGAATTCAGTTCATTAAATTGGCTAACGAGCTGTGACGTGGCAGTTTTGGTTCCAGCTTTGCTACAACACAGAATTTGCTCCCTTAAGATAATTGAGCTTAGGCTA
TGTCGGGAGCCGATCACGCTAATTGGTGTTAATTCGGGTCAATTACGGAGTTTTGGAGCCATCTCGGTGTCTTGGACGGAGAAGAGGCCTAAGTCTCCATCTTTCCCATC
TTCCTCATCCTCTCCTCTTCATACCCACGGCAGCAACCCCCTCCCTCACACCCTCTTCTGCGGCGGCAACATTCAGAGGCTCCGGCGGACGTGCGTTTCTTCTGGCGGTT
CCACGGTGGTGCATCGGCAGTACGACGGCAGTTCGAGCTTCCGGTCGACGCAGCGAGTAGAGCTCGCAGTTTTGGGCGTGGTTGTCGCTCAATTTTGGTCCACTAAATCA
CAAGTTTTTGGTAATTTAGCGTTTTGTCCAGCAAGCTTAAAGATTTCAAATGATGAAATTCCTCCCAGAAACAGTCCAGTGCAGGCTCGACAGAACGACGTCGTTTTCGT
GCTCGACGGCTTCAAATCTGACGTCCCACAGCGTTTTCGTGCTGACAGCTTCAGATCTAACGTCCCACGACGTTTTCGTGCTCGAAGGCTTCAGATCGATGGCTACGAAT
CCTTTTTTATCTCTCTGAGTGGGGGTTTTAAAGAGAGAGAACAGAGGGTTTTAGAGGGAACAGAACAGAGAAAGAAGAAAGTGAGGTTGAAGAAGAAGAGAAAGAAGAAA
GGAAGGAGAAAAAAGTGTCGTCGGCAGTCGGCCGGCGATGGCCAGTGTGCTGGTTGGTGGTGTTGA
Protein sequenceShow/hide protein sequence
MLECPLQKDLRKCASALGGWGFRQNKLVRANIRKIKDQIKPAYESPLPLDLGIIHHLEADLERVLCHIPRKVSLEINRQLMDPYTRDEVEAATKSFPPMIAPGPDGFSTL
FYQKYWDIMGNKTGGIRQNLGNGQLIKIFTDPWIPRPVTFKVLMTKGQANEDMRVAEFIIPSNQWDVSKLSQYLCDEDVGVISHLPISRLATDAGIWHYDRKGKYTVKIG
YKYSMLQQQASSSSSLETESSWWKKLWKLRIPNKVKHFVWNSFHNSIPTMANLWRHHVSVDGVCTSCKEGLETTDHALFQCKRAMKVWEVLAPGEDMMIQRHMDIQDRWL
YISSCSEAVLDLICIGAWAIWNDRNNALHNRPIPSINDRCAWIYRYLEEFRKINGACVSTSSSGVDFQALVEEGEETIMHTDATFDEPNNKSRIGVVLRDKQGELKTVLS
LPSQVCISPLCAEAFAVLEGLRFAISLGVESVSVMSDSLSLVQALRGMDQCDSSISSVLWDIRGLQTSFRRNSLTEFSSLNWLTSCDVAVLVPALLQHRICSLKIIELRL
CREPITLIGVNSGQLRSFGAISVSWTEKRPKSPSFPSSSSSPLHTHGSNPLPHTLFCGGNIQRLRRTCVSSGGSTVVHRQYDGSSSFRSTQRVELAVLGVVVAQFWSTKS
QVFGNLAFCPASLKISNDEIPPRNSPVQARQNDVVFVLDGFKSDVPQRFRADSFRSNVPRRFRARRLQIDGYESFFISLSGGFKEREQRVLEGTEQRKKKVRLKKKRKKK
GRRKKCRRQSAGDGQCAGWWC