CuGenDBv2

PI0018858 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0018858
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: RNase H domain-containing protein
Genome location: chr09:17122358..17132730
RNA-Seq Expression: PI0018858
Synteny: PI0018858
Gene Ontology terms:
  GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR011320 - Ribonuclease H1, N-terminal
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR037056 - Ribonuclease H1, N-terminal domain superfamily
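The genome location is given as `chromosome:start..end`. As a minimal sketch, the string can be split into its parts and the gene span computed. The names `Locus` and `parse_locus` are hypothetical helpers, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr09:17122358..17132730'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_locus("chr09:17122358..17132730")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 10373 bp. That is much longer than the 1006 nt CDS listed under Sequences, which is what one would expect if the gene model contains introns.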


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
XP_004143798.1 uncharacterized protein LOC101210930 isoform X1 [Cucumis sativus] | e-value: 8.8e-193 | identity: 92.18%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNCFSQ+STYTRVIFRRTNLVFAASTSIHGCSN YWTSSFH VAVK TALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPP+ESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+CQAQIGSSICDLPVSV+KGHSLPKDTEEYLASVGLKNALYTIKAAD+RPDLFGSL PCTFH GDTSL GETSGQD IKKRSRE I+ ENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
        GSTVLTPT +DPTRKH+KLEDSIVS ++SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
Subjt:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH

Query:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF
        VQGDSKLVCMQVQGLWKAK+ENMSELCNEV KLKNKFLSFEVNHVLRHLNSEADAQANLALTLA+GEVQEF
Subjt:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF

XP_008465682.1 PREDICTED: uncharacterized protein LOC103503315 isoform X1 [Cucumis melo] | e-value: 1.5e-189 | identity: 90.67%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGE+QE
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE

XP_008465684.1 PREDICTED: uncharacterized protein LOC103503315 isoform X3 [Cucumis melo] | e-value: 1.1e-176 | identity: 90.34%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLR
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR

XP_011655308.1 uncharacterized protein LOC101210930 isoform X2 [Cucumis sativus] | e-value: 8.5e-180 | identity: 91.69%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNCFSQ+STYTRVIFRRTNLVFAASTSIHGCSN YWTSSFH VAVK TALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPP+ESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+CQAQIGSSICDLPVSV+KGHSLPKDTEEYLASVGLKNALYTIKAAD+RPDLFGSL PCTFH GDTSL GETSGQD IKKRSRE I+ ENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
        GSTVLTPT +DPTRKH+KLEDSIVS ++SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
Subjt:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH

Query:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHL
        VQGDSKLVCMQVQGLWKAK+ENMSELCNEV KLKNKFLSFEVNHVLR L
Subjt:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHL

XP_038889960.1 uncharacterized protein LOC120079705 [Benincasa hispida] | e-value: 6.7e-177 | identity: 86.33%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSF--HYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRK
        MNCFSQ+STYTR IFRRT LV  ASTSI+G SN YWTSSF  H VAVKATA+DSL SRF LRCYSS   RK RK  SPS  LDSEPP ESEMGDFFVVRK
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSF--HYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRK

Query:  GDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSE
        GD++GVYKSFS+CQAQIGSSICDLPVS+YKGHSLPKDT+EYLASVGLKNALYTIKAAD+RPDLFGSLVPCTFHDGDTS+KGE SGQD IKKR RE I+SE
Subjt:  GDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSE

Query:  NVGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTR
        N+GS+VLTPTS+DP+RKHVKLEDSIVS A+SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGFTR
Subjt:  NVGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTR

Query:  IHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF
        IHVQGDSKLVCMQVQGLWK KNEN+SELCNEV+KLK+KFLSFE+NHVLR+LNSEADAQANLA+TLADGEVQEF
Subjt:  IHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0KTZ9 RNase H domain-containing protein | e-value: 4.2e-193 | identity: 92.18%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNCFSQ+STYTRVIFRRTNLVFAASTSIHGCSN YWTSSFH VAVK TALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPP+ESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+CQAQIGSSICDLPVSV+KGHSLPKDTEEYLASVGLKNALYTIKAAD+RPDLFGSL PCTFH GDTSL GETSGQD IKKRSRE I+ ENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
        GSTVLTPT +DPTRKH+KLEDSIVS ++SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH
Subjt:  GSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIH

Query:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF
        VQGDSKLVCMQVQGLWKAK+ENMSELCNEV KLKNKFLSFEVNHVLRHLNSEADAQANLALTLA+GEVQEF
Subjt:  VQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF

A0A1S3CPG2 uncharacterized protein LOC103503315 isoform X3 | e-value: 5.6e-177 | identity: 90.34%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLR
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X1 | e-value: 7.5e-190 | identity: 90.67%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGE+QE
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE

A0A1S3CQX0 uncharacterized protein LOC103503315 isoform X2 | e-value: 5.6e-177 | identity: 90.34%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLR
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLR

A0A5A7TCE2 RNase H family protein, putative isoform 2 | e-value: 7.5e-190 | identity: 90.67%
Query:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD
        MNC SQ+STYTRVIFRRTNLVFAASTSIHGCSNPYW+S+FH VAVKATALDSL SRFGLRCYS+RKPRKPRKPTSPS  LDSEPPMESEMGDFFVVRKGD
Subjt:  MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGD

Query:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV
        VVGVYKSFS+C AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAAD+RPDLFGSLVPCTFHDGD SL GETSGQD IKKRSRE I+SENV
Subjt:  VVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENV

Query:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
        GS+VL     TPTSEDPTRKH+KLEDSIVS  +SSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK ALKKG
Subjt:  GSTVL-----TPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE
        FTRIHVQGDSKLVCMQVQGLWKAKNEN+SELCNEV+KLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGE+QE
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQE

SwissProt top hits (columns: hit, e-value, % identity, alignment)
P54162 14.7 kDa ribonuclease H-like protein | e-value: 2.0e-06 | identity: 30.65%
Query:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLS
        DGAS GNPG +G G  ++ H+G         +G+ TN  AE+ A++ G+K    +G+  +  + DS +V  +   L   KN        E+++LK  F  
Subjt:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLS

Query:  FEVNHVLRHLNSEADAQANLALTL
        F +  +    N +AD  A  A+ L
Subjt:  FEVNHVLRHLNSEADAQANLALTL

P64956 Uncharacterized protein Mb2253c | e-value: 1.5e-17 | identity: 42.64%
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A+K G T   V  DSKLV  Q+ G WK K+ ++ +L  +   L +
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN

Query:  KFLSFEVNHVLRHLNSEADAQANLALTLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFEVNHVLRHLNSEADAQANLALTLA

P9WLH4 Uncharacterized protein MT2287 | e-value: 1.5e-17 | identity: 42.64%
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A+K G T   V  DSKLV  Q+ G WK K+ ++ +L  +   L +
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN

Query:  KFLSFEVNHVLRHLNSEADAQANLALTLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFEVNHVLRHLNSEADAQANLALTLA

P9WLH5 Bifunctional protein Rv2228c | e-value: 1.5e-17 | identity: 42.64%
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A+K G T   V  DSKLV  Q+ G WK K+ ++ +L  +   L +
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKN

Query:  KFLSFEVNHVLRHLNSEADAQANLALTLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFEVNHVLRHLNSEADAQANLALTLA

Q9HSF6 Ribonuclease HI | e-value: 1.3e-16 | identity: 40.65%
Query:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFL
        FDGAS+GNPG A  G VL + DG ++    + +G ATNN AEY A++  L++A   GF  I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFL

Query:  SFEVNHVLRHLNSEADAQANLAL
         + + HV R  N  ADA AN AL
Subjt:  SFEVNHVLRHLNSEADAQANLAL

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G24090.1 RNase H family protein | e-value: 2.5e-84 | identity: 46.52%
Query:  MNCFSQISTYTRV-IFRRTNLVFAASTSIHGCSNPYWTSSFHYV----AVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFV
        MNC S   +Y  + + +R++ V          S+  W   F Y+     +K  A+ S+     +  YSSR      K  S +        ++ E   FFV
Subjt:  MNCFSQISTYTRV-IFRRTNLVFAASTSIHGCSNPYWTSSFHYV----AVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFV

Query:  VRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDI
        VRKGDV+G+YK  S+CQAQ+GSS+ DLPVSVYKG+SLPKDTEEYL+SVGLK  LY+++A+DL+ D+FG+L PC F +        +  + T + +S++D 
Subjt:  VRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDI

Query:  LSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG
          +   +++    S DP  K  K+E S    A  S+ E+CF+EFDGASKGNPG +GA AVL+  DGS+ICR+R+GLGIATNN AEY A++LGLK A++KG
Subjt:  LSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKG

Query:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
        +  I V+GDSKLVCMQ++G WK  +E +++L  E   L NK +SFE++HVLR+LN++AD QANLA+ L +GEV+
Subjt:  FTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e-value: 1.4e-76 | identity: 50.34%
Query:  MESEMGDFFVVRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQD
        ME E   F++VRKGD++GVY+S SECQ Q GSS+    +SVYKG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC      +S +GE+  + 
Subjt:  MESEMGDFFVVRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQD

Query:  TIKKRSREDILSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISS--------NCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAVLRA D SV+  LREG+G ATNN
Subjt:  TIKKRSREDILSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISS--------NCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN

Query:  VAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
        VAEYRA+LLGL+SAL KGF  +HV GDS LVCMQVQG WK  +  M+ELC +  +L N F +F++ H+ R  NSEAD QAN A+ LADG+ Q
Subjt:  VAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e-value: 1.4e-76 | identity: 50.34%
Query:  MESEMGDFFVVRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQD
        ME E   F++VRKGD++GVY+S SECQ Q GSS+    +SVYKG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC      +S +GE+  + 
Subjt:  MESEMGDFFVVRKGDVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQD

Query:  TIKKRSREDILSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISS--------NCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAVLRA D SV+  LREG+G ATNN
Subjt:  TIKKRSREDILSENVGSTVLTPTSEDPTRKHVKLEDSIVSRAISS--------NCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN

Query:  VAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
        VAEYRA+LLGL+SAL KGF  +HV GDS LVCMQVQG WK  +  M+ELC +  +L N F +F++ H+ R  NSEAD QAN A+ LADG+ Q
Subjt:  VAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ

AT5G51080.1 RNase H family protein | e-value: 2.5e-76 | identity: 43.78%
Query:  MNCFSQISTY-TRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKG
        MN FS+  +Y + V+FR+++ V          S+  W   F Y ++K++   +  S   + CYSSR      K +  S ++      + E   FFVVRKG
Subjt:  MNCFSQISTY-TRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKG

Query:  DVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSEN
        D+VG+YK   +CQAQ+GSS+ D PVSVYKG+SL KDTEE L++VGLK  LY  +A DL+ D+FG+L PC F D                           
Subjt:  DVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSEN

Query:  VGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI
               P++     K  +LE S       ++ E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK A++KG+T+I
Subjt:  VGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI

Query:  HVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
         V+ DSKLVCMQ++G WK  +E +S+L  E  +L +K LSFE++HVLR LNS+AD QAN+A  L++GEV+
Subjt:  HVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ

AT5G51080.2 RNase H family protein | e-value: 2.5e-76 | identity: 43.78%
Query:  MNCFSQISTY-TRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKG
        MN FS+  +Y + V+FR+++ V          S+  W   F Y ++K++   +  S   + CYSSR      K +  S ++      + E   FFVVRKG
Subjt:  MNCFSQISTY-TRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKG

Query:  DVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSEN
        D+VG+YK   +CQAQ+GSS+ D PVSVYKG+SL KDTEE L++VGLK  LY  +A DL+ D+FG+L PC F D                           
Subjt:  DVVGVYKSFSECQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSEN

Query:  VGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI
               P++     K  +LE S       ++ E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK A++KG+T+I
Subjt:  VGSTVLTPTSEDPTRKHVKLEDSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRI

Query:  HVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
         V+ DSKLVCMQ++G WK  +E +S+L  E  +L +K LSFE++HVLR LNS+AD QAN+A  L++GEV+
Subjt:  HVQGDSKLVCMQVQGLWKAKNENMSELCNEVMKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQ
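The alignment blocks above use the usual BLAST pairwise layout: the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. As a rough sketch, per-block statistics can be recovered from a query line and its midline alone. The function name `midline_stats` is hypothetical, and the example uses only the first ten columns of the Cucumis melo isoform X1 alignment, so its percentage is not comparable to the whole-alignment % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one block of a BLAST-style alignment.

    `query` is the aligned query segment (may contain '-' gap characters) and
    `midline` is the match line printed beneath it, padded to the same width.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same width")
    # A letter on the midline marks an identical residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    length = len(query)  # alignment columns, gap columns included
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100 * identities / length, 2),
    }


# First ten alignment columns of the XP_008465682.1 hit, copied from the record.
stats = midline_stats("MNCFSQISTY", "MNC SQ+STY")
print(stats)
```

Summing `identities` and `length` over every block of a hit before dividing would give a figure closer in spirit to the per-hit identity column, although the exact definition used by the site is not stated in the record.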


Sequences
CDS sequence
ATGAACTGCTTCTCCCAAATCTCTACCTATACTCGCGTCATTTTCAGAAGGACCAATCTTGTTTTTGCGGCTTCAACCTCCATTCATGGCTGCTCTAATCCCTACTGGAC
CTCATCCTTTCACTATGTCGCTGTTAAGGCTACTGCTTTAGACTCATTGCCTTCCAGATTCGGTCTACGGTGCTATTCCTCTCGAAAACCCCGAAAACCCCGAAAGCCCA
CTTCTCCTTCACGCAACTTGGATTCTGAACCTCCCATGGAATCAGAGATGGGCGACTTCTTTGTCGTTCGAAAAGGGGATGTTGTTGGAGTTTATAAAAGTTTTAGTGAG
TGTCAGGCGCAAATTGGATCTTCGATATGTGATCTTCCTGTTAGTGTGTATAAAGGACACTCATTACCAAAAGACACTGAAGAATATCTTGCTTCCGTTGGGCTTAAGAA
CGCTCTGTACACTATTAAAGCTGCAGATTTGAGACCTGATCTTTTCGGTTCGCTCGTGCCTTGCACTTTTCATGATGGAGATACCTCTCTTAAAGGTGAGACTTCTGGCC
AGGATACCATAAAGAAGAGATCAAGAGAGGATATTCTATCAGAAAATGTTGGGTCAACTGTTTTAACTCCTACATCAGAAGATCCCACTAGGAAACATGTCAAGTTGGAA
GATTCCATTGTGTCCCGCGCAATATCCTCCAACTGCGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAAGGAAATCCTGGACAAGCCGGGGCAGGAGCTGTTCTGCG
AGCTCATGATGGGAGTGTGATATGTCGACTGCGTGAAGGTCTAGGTATAGCAACCAATAACGTTGCTGAATATCGAGCTATTCTTTTAGGGTTGAAGTCTGCACTTAAGA
AAGGGTTCACTAGGATTCACGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGTTTATGGAAGGCAAAAAATGAAAATATGTCTGAGTTATGTAATGAAGTT
ATGAAGCTGAAGAATAAATTTCTCTCTTTCGAGGTCAATCATGTACTAAGGCATCTAAACTCAGAAGCCGATGCTCAAGCGAACTTGGCTCTCACTTTAGCTGACGGTGA
AGTCCAGGAGTTTTGA
mRNA sequence
AAGAAAATTACTTAGGAGAGAGGGAAAGAAGAAGAAGAAGAAGGTGGAGTCCTTGGAGCAGCTTTGAATGGACAAACAAACCTACTGAACGACATTGATTTAAGCTCGGA
ATCTTCATAGTCGCTGGAAAACTACCCTCTGATGAACTGCTTCTCCCAAATCTCTACCTATACTCGCGTCATTTTCAGAAGGACCAATCTTGTTTTTGCGGCTTCAACCT
CCATTCATGGCTGCTCTAATCCCTACTGGACCTCATCCTTTCACTATGTCGCTGTTAAGGCTACTGCTTTAGACTCATTGCCTTCCAGATTCGGTCTACGGTGCTATTCC
TCTCGAAAACCCCGAAAACCCCGAAAGCCCACTTCTCCTTCACGCAACTTGGATTCTGAACCTCCCATGGAATCAGAGATGGGCGACTTCTTTGTCGTTCGAAAAGGGGA
TGTTGTTGGAGTTTATAAAAGTTTTAGTGAGTGTCAGGCGCAAATTGGATCTTCGATATGTGATCTTCCTGTTAGTGTGTATAAAGGACACTCATTACCAAAAGACACTG
AAGAATATCTTGCTTCCGTTGGGCTTAAGAACGCTCTGTACACTATTAAAGCTGCAGATTTGAGACCTGATCTTTTCGGTTCGCTCGTGCCTTGCACTTTTCATGATGGA
GATACCTCTCTTAAAGGTGAGACTTCTGGCCAGGATACCATAAAGAAGAGATCAAGAGAGGATATTCTATCAGAAAATGTTGGGTCAACTGTTTTAACTCCTACATCAGA
AGATCCCACTAGGAAACATGTCAAGTTGGAAGATTCCATTGTGTCCCGCGCAATATCCTCCAACTGCGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAAGGAAATC
CTGGACAAGCCGGGGCAGGAGCTGTTCTGCGAGCTCATGATGGGAGTGTGATATGTCGACTGCGTGAAGGTCTAGGTATAGCAACCAATAACGTTGCTGAATATCGAGCT
ATTCTTTTAGGGTTGAAGTCTGCACTTAAGAAAGGGTTCACTAGGATTCACGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGTTTATGGAAGGCAAAAAA
TGAAAATATGTCTGAGTTATGTAATGAAGTTATGAAGCTGAAGAATAAATTTCTCTCTTTCGAGGTCAATCATGTACTAAGGCATCTAAACTCAGAAGCCGATGCTCAAG
CGAACTTGGCTCTCACTTTAGCTGACGGTGAAGTCCAGGAGTTTTGAAGATTAATGGTTAAAATGCACGGCAGGATATATCTTACAGTATAGCAAGATTTTCTGAGGAGT
ACATTGCCTTTGGAGGCCAATGCTTTGCATGGTACTATTCTTTTCCCAAGATTGCCAAGATCTTTGGGCATTGTTTTGGTTTCACCTACATTTCCGTGGCCCTGACTTAG
CAAACTGGAATTCTAGTTGCGACAATGAATTAGCATTTATTTATTTTTCATTATTCAATAGCATTTTTGGGAAGTTTCAAGCAAATTGAGATTAACAAACTTGGTGTCAT
TTGGCATTTGGATGAAAGTTCATTTGAGGACTAGATTTCAGAATTTGATTTTTTTTTTTTTTATAATTTTTTTTAAATGTTGGAGATGAACTTCAACCTCAAGGAAGATG
GTAGATGCCTTATCCCTTGAATTTACAGAAAGGAAGATCTATTGCTCTTTCATCATATTATTCTGTAAATCTTTCTGTTATTTTTGCTTGAAATAATTTATGTAATAAAC
ACATTGCTCAGTCAAATTTTTTCTCAATTAAGAAAAGAGAAATTTTGCATA
Protein sequence
MNCFSQISTYTRVIFRRTNLVFAASTSIHGCSNPYWTSSFHYVAVKATALDSLPSRFGLRCYSSRKPRKPRKPTSPSRNLDSEPPMESEMGDFFVVRKGDVVGVYKSFSE
CQAQIGSSICDLPVSVYKGHSLPKDTEEYLASVGLKNALYTIKAADLRPDLFGSLVPCTFHDGDTSLKGETSGQDTIKKRSREDILSENVGSTVLTPTSEDPTRKHVKLE
DSIVSRAISSNCESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKSALKKGFTRIHVQGDSKLVCMQVQGLWKAKNENMSELCNEV
MKLKNKFLSFEVNHVLRHLNSEADAQANLALTLADGEVQEF
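The protein sequence should correspond to the CDS translated codon by codon. The sketch below checks this on the first and last few codons of the CDS listed above, assuming the standard nuclear genetic code (NCBI translation table 1). The `translate` function is an illustrative helper written for this check, not a CuGenDB tool, and unknown or ambiguous codons are reported as `X`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nt and last 21 nt of the CDS shown in this record.
print(translate("ATGAACTGCTTCTCCCAAATCTCT"))  # start of the protein
print(translate("GGTGAAGTCCAGGAGTTTTGA"))     # end of the protein, stop codon dropped
```

The two fragments translate to `MNCFSQIS` and `GEVQEF`, which match the beginning and end of the protein sequence listed above. Passing the full CDS to `translate` in the same way would check the whole sequence.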