CuGenDBv2

Lag0004001 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0004001
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr6:318387..324029
RNA-Seq Expression: Lag0004001
Synteny: Lag0004001
Gene Ontology terms:
  GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR037056 - Ribonuclease H1, N-terminal domain superfamily
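As a small worked example, the genome location string in this record can be split into its parts to get the span of the gene model. This is a sketch only: the names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumes 1-based inclusive coordinates (an assumption, not stated in the record).
    return end - start + 1


chrom, start, end = parse_location("chr6:318387..324029")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene model spans 5643 bp on chr6, which is longer than the coding sequence listed further down.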


Homology
GenBank top hits | e value | %identity
XP_004143798.1 uncharacterized protein LOC101210930 isoform X1 [Cucumis sativus] | 6.1e-176 | 84.68
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD
        MNCFSQV TYTR IFRRT LVFAA TS HGCSN YWTSSF NVAVK TALDSLCSRF LRCYS+   RK RK TS SPK DSEPP+E E GDFFVVRKGD
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD

Query:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI
        VVGVYKSFSDCQAQIGSSICDLPVSV+KGHSLPKDTEEYLAS+GLKNALYTIKAADMR DLFGSL PCTFH G TSL GE SGQDAIKKRSREAIV EN+
Subjt:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI

Query:  GSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIH
        GST+LTPT KDP+RKH+KLEDSIVS ++SSNRESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLK AL+KGFTRIH
Subjt:  GSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIH

Query:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        VQGDSKLVCMQVQGLWK K+EN+SELCNEV KLK+KFLSFE++HVLR+LNSEADAQANLA+TLA+GE    +
Subjt:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

XP_022946833.1 uncharacterized protein LOC111450776 [Cucurbita moschata] | 3.0e-175 | 85.41
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MNCFSQ  TYTRAIFR T L FAA TS HGC  N YWTSSF +V +K T LDSLCSRF LRCYSSRK+RKG S SP  DS+PPMEP+ GDFFVVRKGDVV
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        GVYKSF+DCQAQIGSSICDLPVSVYKGHSLPKDT EYL+S+GLKNALYTIKAADMR DLF SLVPCTFHD  T+LKGEASGQDAIKKRSRE IVSENIGS
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
        T+LTPTSKDP RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYAL+KGFTRIHVQ
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSFEISHVLRNLNSEADAQANLA+TL DGEA   +
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

XP_022999715.1 uncharacterized protein LOC111493983 [Cucurbita maxima] | 2.3e-175 | 85.41
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MNC SQ  TY RAIFR T L FAA TS HGC  NPYWTSS  +V +K T LDSLCSRF LRCYSSRK+RKG S SP  DSEPPMEP+ GDFFVVRKGDVV
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        GVY+SF+DCQAQIGSSICDLPVSVYKGHSLPKDT+EYLAS+GLKNALYTI+AADMR DLF SLVPCTFHD  TSLKGEASGQDAIKKRSRE IVS+NIGS
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
        T+LTPTSKDP RKHVKLEDS+VSRA  SN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYAL+KGFTRIHVQ
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        GDSKLVCMQVQGLWKVKNENISELCNEV+KLKDKFLSFEISHVLRNLNSEADAQANLAI+L DGEA   +
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

XP_023547239.1 uncharacterized protein LOC111806115 [Cucurbita pepo subsp. pepo] | 4.2e-177 | 85.95
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MNCFSQ  TYTRAIFR T L FAA TS HGC  NPYWTSSF +V +K T LDSLCSRF LRCYSSRK+RKG S SP  DS+PPMEP+ GDFFVVRKGDVV
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        GVYKSF+DCQAQIGSSICDLPVSVYKGHSLPKDT EYLAS+GLKNALYTIKAADMR DLF SLVPCTFHD  TSLKGEASGQDAIKKRSRE IVSENIGS
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
        T+LTPTSKDP RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYAL+KGFTRIHVQ
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSFEISHVLRNLNSEADA+ANLA+TL DGEA   +
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

XP_038889960.1 uncharacterized protein LOC120079705 [Benincasa hispida] | 1.3e-181 | 88.68
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSF--QNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDV
        MNCFSQV TYTRAIFRRTTLV  A TS +G SN YWTSSF   NVAVKATA+DSLCSRF LRCYSSRK+RK  S SPK DSEPP E E GDFFVVRKGD+
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSF--QNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDV

Query:  VGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIG
        +GVYKSFSDCQAQIGSSICDLPVS+YKGHSLPKDT+EYLAS+GLKNALYTIKAADMR DLFGSLVPCTFHDG TS+KGEASGQDAIKKR REAIVSENIG
Subjt:  VGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIG

Query:  STLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHV
        S++LTPTSKDPSRKHVKLEDSIVS ALSSNRESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYALQKGFTRIHV
Subjt:  STLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        QGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEI+HVLRNLNSEADAQANLAITLADGE    +
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

TrEMBL top hits | e value | %identity
A0A0A0KTZ9 RNase H domain-containing protein | 2.9e-176 | 84.68
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD
        MNCFSQV TYTR IFRRT LVFAA TS HGCSN YWTSSF NVAVK TALDSLCSRF LRCYS+   RK RK TS SPK DSEPP+E E GDFFVVRKGD
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD

Query:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI
        VVGVYKSFSDCQAQIGSSICDLPVSV+KGHSLPKDTEEYLAS+GLKNALYTIKAADMR DLFGSL PCTFH G TSL GE SGQDAIKKRSREAIV EN+
Subjt:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI

Query:  GSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIH
        GST+LTPT KDP+RKH+KLEDSIVS ++SSNRESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLK AL+KGFTRIH
Subjt:  GSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIH

Query:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        VQGDSKLVCMQVQGLWK K+EN+SELCNEV KLK+KFLSFE++HVLR+LNSEADAQANLA+TLA+GE    +
Subjt:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X1 | 1.9e-175 | 85.22
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD
        MNC SQV TYTR IFRRT LVFAA TS HGCSNPYW+S+F NVAVKATALDSLCSRF LRCYS+   RK RK TS SPK DSEPPME E GDFFVVRKGD
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD

Query:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI
        VVGVYKSFSDC AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAADMR DLFGSLVPCTFHDG  SL GE SGQDAIKKRSREAIVSEN+
Subjt:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI

Query:  GSTLL-----TPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKG
        GS++L     TPTS+DP+RKH+KLEDSIVS  LSSN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLK+AL+KG
Subjt:  GSTLL-----TPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKG

Query:  FTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE
        FTRIHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK+KFLSFE++HVLR+LNSEADAQANLA+TLADGE
Subjt:  FTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE

A0A5A7TCE2 RNase H family protein, putative isoform 2 | 1.9e-175 | 85.22
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD
        MNC SQV TYTR IFRRT LVFAA TS HGCSNPYW+S+F NVAVKATALDSLCSRF LRCYS+   RK RK TS SPK DSEPPME E GDFFVVRKGD
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSS---RKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGD

Query:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI
        VVGVYKSFSDC AQIGSSICDLPVSV+KGHSLPKD+EEYLAS+GLKNALYTIKAADMR DLFGSLVPCTFHDG  SL GE SGQDAIKKRSREAIVSEN+
Subjt:  VVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENI

Query:  GSTLL-----TPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKG
        GS++L     TPTS+DP+RKH+KLEDSIVS  LSSN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLK+AL+KG
Subjt:  GSTLL-----TPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKG

Query:  FTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE
        FTRIHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK+KFLSFE++HVLR+LNSEADAQANLA+TLADGE
Subjt:  FTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE

A0A6J1G4Y7 uncharacterized protein LOC111450776 | 1.5e-175 | 85.41
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MNCFSQ  TYTRAIFR T L FAA TS HGC  N YWTSSF +V +K T LDSLCSRF LRCYSSRK+RKG S SP  DS+PPMEP+ GDFFVVRKGDVV
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        GVYKSF+DCQAQIGSSICDLPVSVYKGHSLPKDT EYL+S+GLKNALYTIKAADMR DLF SLVPCTFHD  T+LKGEASGQDAIKKRSRE IVSENIGS
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
        T+LTPTSKDP RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYAL+KGFTRIHVQ
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSFEISHVLRNLNSEADAQANLA+TL DGEA   +
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

A0A6J1KGD7 uncharacterized protein LOC111493983 | 1.1e-175 | 85.41
Query:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MNC SQ  TY RAIFR T L FAA TS HGC  NPYWTSS  +V +K T LDSLCSRF LRCYSSRK+RKG S SP  DSEPPMEP+ GDFFVVRKGDVV
Subjt:  MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCS-NPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        GVY+SF+DCQAQIGSSICDLPVSVYKGHSLPKDT+EYLAS+GLKNALYTI+AADMR DLF SLVPCTFHD  TSLKGEASGQDAIKKRSRE IVS+NIGS
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
        T+LTPTSKDP RKHVKLEDS+VSRA  SN ESCFLEFDGASKGNPGQAGAGAV+RAHDGSVICRLREGLGIAT+NVAEYRA+LLGLKYAL+KGFTRIHVQ
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD
        GDSKLVCMQVQGLWKVKNENISELCNEV+KLKDKFLSFEISHVLRNLNSEADAQANLAI+L DGEA   +
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALD

SwissProt top hits | e value | %identity
P0C2F6 Putative ribonuclease H protein At1g65750 | 3.1e-05 | 32.82
Query:  DGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLS
        DGAS+GNPG A AG V+R   G+        +G  ++  AE   V  GL +A +K   R+ ++ DS+++     G  K    + S   + +++L   FL 
Subjt:  DGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLS

Query:  ----FEISHVLRNLNSEADAQANLAITLADG
              I HV R  N  AD  AN A +L+ G
Subjt:  ----FEISHVLRNLNSEADAQANLAITLADG

P64956 Uncharacterized protein Mb2253c | 3.0e-16 | 42.22
Query:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG S+GNPG AG GAVV   D S V+   ++ +G AT+NVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA
        +F       V R  N+ AD  AN A+  A   A A
Subjt:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA

P9WLH4 Uncharacterized protein MT2287 | 3.0e-16 | 42.22
Query:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG S+GNPG AG GAVV   D S V+   ++ +G AT+NVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA
        +F       V R  N+ AD  AN A+  A   A A
Subjt:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA

P9WLH5 Bifunctional protein Rv2228c | 3.0e-16 | 42.22
Query:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG S+GNPG AG GAVV   D S V+   ++ +G AT+NVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVVRAHDGS-VICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA
        +F       V R  N+ AD  AN A+  A   A A
Subjt:  KFLSFEISHVLRNLNSEADAQANLAITLADGEANA

Q9HSF6 Ribonuclease HI | 7.3e-15 | 39.02
Query:  FDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFL
        FDGAS+GNPG A  G V+ + DG ++    + +G AT+N AEY A++  L+ A   GF  I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFL

Query:  SFEISHVLRNLNSEADAQANLAI
         + I+HV R  N  ADA AN A+
Subjt:  SFEISHVLRNLNSEADAQANLAI

Arabidopsis top hits | e value | %identity
AT1G24090.1 RNase H family protein | 9.9e-84 | 46.79
Query:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSF----QNVAVKATALDSLCSRFHLRCYSSR-KVRKGTSRSPKFDSEPPMEPEKGDFFVVR
        MNC S   +Y    + +R++ V + P          W   F        +K  A+ S+     +  YSSR K  K    S    S   ++ EK  FFVVR
Subjt:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSF----QNVAVKATALDSLCSRFHLRCYSSR-KVRKGTSRSPKFDSEPPMEPEKGDFFVVR

Query:  KGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSL----KGEASGQDAIKKRSRE
        KGDV+G+YK  SDCQAQ+GSS+ DLPVSVYKG+SLPKDTEEYL+S+GLK  LY+++A+D++ D+FG+L PC F +         + E + +   K   ++
Subjt:  KGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSL----KGEASGQDAIKKRSRE

Query:  AIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQ
         + S +I        S DP  K  K+E S        + E+CF+EFDGASKGNPG +GA AV++  DGS+ICR+R+GLGIAT+N AEY A++LGLKYA++
Subjt:  AIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQ

Query:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE
        KG+  I V+GDSKLVCMQ++G WKV +E +++L  E   L +K +SFEISHVLRNLN++AD QANLA+ L +GE
Subjt:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 7.1e-74 | 48.64
Query:  MEPEKGDFFVVRKGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQD
        ME EK  F++VRKGD++GVY+S S+CQ Q GSS+    +SVYKG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC      +S +GE+  + 
Subjt:  MEPEKGDFFVVRKGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQD

Query:  AIKKRSREAIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAV+RA D SV+  LREG+G AT+N
Subjt:  AIKKRSREAIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSN

Query:  VAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANAL
        VAEYRA+LLGL+ AL KGF  +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F+I H+ R  NSEAD QAN AI LADG+   +
Subjt:  VAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANAL

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 7.1e-74 | 48.64
Query:  MEPEKGDFFVVRKGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQD
        ME EK  F++VRKGD++GVY+S S+CQ Q GSS+    +SVYKG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC      +S +GE+  + 
Subjt:  MEPEKGDFFVVRKGDVVGVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQD

Query:  AIKKRSREAIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAV+RA D SV+  LREG+G AT+N
Subjt:  AIKKRSREAIVSENIGSTLLTPTSKDPSRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSN

Query:  VAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANAL
        VAEYRA+LLGL+ AL KGF  +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F+I H+ R  NSEAD QAN AI LADG+   +
Subjt:  VAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGEANAL

AT5G51080.1 RNase H family protein | 7.6e-76 | 44.11
Query:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MN FS+  +Y +  +FR+++ V          S+  W   F   ++K++   +  S   + CYSSR     +  S    S    + EK  FFVVRKGD+V
Subjt:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        G+YK   DCQAQ+GSS+ D PVSVYKG+SL KDTEE L+++GLK  LY  +A D++ D+FG+L PC F D                              
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
            P++     K  +LE S       ++ E+C +EFDGASKGNPG +GA AV++  DGS+I ++R+GLGIAT+N AEY  ++LGLK+A++KG+T+I V+
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE
         DSKLVCMQ++G WKV +E +S+L  E  +L DK LSFEISHVLR+LNS+AD QAN+A  L++GE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE

AT5G51080.2 RNase H family protein | 7.6e-76 | 44.11
Query:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV
        MN FS+  +Y +  +FR+++ V          S+  W   F   ++K++   +  S   + CYSSR     +  S    S    + EK  FFVVRKGD+V
Subjt:  MNCFSQVCTY-TRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVV

Query:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS
        G+YK   DCQAQ+GSS+ D PVSVYKG+SL KDTEE L+++GLK  LY  +A D++ D+FG+L PC F D                              
Subjt:  GVYKSFSDCQAQIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGS

Query:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ
            P++     K  +LE S       ++ E+C +EFDGASKGNPG +GA AV++  DGS+I ++R+GLGIAT+N AEY  ++LGLK+A++KG+T+I V+
Subjt:  TLLTPTSKDPSRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE
         DSKLVCMQ++G WKV +E +S+L  E  +L DK LSFEISHVLR+LNS+AD QAN+A  L++GE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFEISHVLRNLNSEADAQANLAITLADGE


Sequences
CDS sequence
ATGAACTGCTTCTCGCAAGTCTGTACCTATACTCGCGCCATTTTCAGAAGGACCACTCTTGTTTTTGCAGCTCCGACCTCCACTCATGGCTGCTCCAATCCCTACTGGAC
ATCAAGCTTTCAGAATGTCGCTGTTAAGGCTACTGCTTTAGACTCTTTGTGCTCCAGATTCCATCTACGCTGCTATTCCTCTCGAAAAGTCCGCAAGGGCACTTCTCGAT
CGCCCAAGTTTGATTCTGAACCTCCCATGGAACCGGAGAAGGGCGACTTCTTCGTCGTTCGGAAGGGGGATGTTGTTGGAGTCTACAAAAGTTTTAGTGATTGTCAGGCT
CAGATTGGATCTTCGATTTGTGATCTTCCTGTCAGCGTGTATAAAGGACACTCATTGCCAAAAGACACTGAGGAATATCTTGCTTCCCTTGGGCTTAAGAATGCTCTGTA
TACTATTAAAGCTGCAGATATGAGATCTGATCTTTTCGGTTCGCTCGTGCCTTGCACTTTTCATGATGGGGGTACTTCTCTTAAAGGTGAAGCTTCTGGCCAGGATGCCA
TAAAGAAGAGATCAAGAGAGGCTATTGTATCAGAAAATATTGGGTCGACTCTTTTAACTCCTACATCAAAAGATCCCTCGAGGAAACATGTCAAGTTGGAAGATTCCATT
GTGTCCCGGGCACTATCCTCTAACCGTGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGGGCGGGAGCTGTTGTGCGAGCTCATGA
TGGGAGTGTGATATGTAGGCTGCGTGAAGGCCTAGGTATAGCAACCAGTAACGTTGCTGAATATAGAGCTGTTCTGTTAGGGTTGAAGTATGCACTTCAGAAAGGGTTTA
CTAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAACATCTCTGAGCTATGTAATGAAGTTATCAAGCTG
AAGGATAAATTTCTCTCGTTCGAGATTAGTCATGTACTAAGGAATCTAAATTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGATGGCGAAGCCAATGC
TTTGGATGCTACTATTCTTTTCCCCAGGTTGCTGAGATCTTTGGATGTTGTTTCGGTTGCTTGCTACGATTCTGTGGCCCTGACTTAG
mRNA sequence
ATGAACTGCTTCTCGCAAGTCTGTACCTATACTCGCGCCATTTTCAGAAGGACCACTCTTGTTTTTGCAGCTCCGACCTCCACTCATGGCTGCTCCAATCCCTACTGGAC
ATCAAGCTTTCAGAATGTCGCTGTTAAGGCTACTGCTTTAGACTCTTTGTGCTCCAGATTCCATCTACGCTGCTATTCCTCTCGAAAAGTCCGCAAGGGCACTTCTCGAT
CGCCCAAGTTTGATTCTGAACCTCCCATGGAACCGGAGAAGGGCGACTTCTTCGTCGTTCGGAAGGGGGATGTTGTTGGAGTCTACAAAAGTTTTAGTGATTGTCAGGCT
CAGATTGGATCTTCGATTTGTGATCTTCCTGTCAGCGTGTATAAAGGACACTCATTGCCAAAAGACACTGAGGAATATCTTGCTTCCCTTGGGCTTAAGAATGCTCTGTA
TACTATTAAAGCTGCAGATATGAGATCTGATCTTTTCGGTTCGCTCGTGCCTTGCACTTTTCATGATGGGGGTACTTCTCTTAAAGGTGAAGCTTCTGGCCAGGATGCCA
TAAAGAAGAGATCAAGAGAGGCTATTGTATCAGAAAATATTGGGTCGACTCTTTTAACTCCTACATCAAAAGATCCCTCGAGGAAACATGTCAAGTTGGAAGATTCCATT
GTGTCCCGGGCACTATCCTCTAACCGTGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGGGCGGGAGCTGTTGTGCGAGCTCATGA
TGGGAGTGTGATATGTAGGCTGCGTGAAGGCCTAGGTATAGCAACCAGTAACGTTGCTGAATATAGAGCTGTTCTGTTAGGGTTGAAGTATGCACTTCAGAAAGGGTTTA
CTAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAACATCTCTGAGCTATGTAATGAAGTTATCAAGCTG
AAGGATAAATTTCTCTCGTTCGAGATTAGTCATGTACTAAGGAATCTAAATTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGATGGCGAAGCCAATGC
TTTGGATGCTACTATTCTTTTCCCCAGGTTGCTGAGATCTTTGGATGTTGTTTCGGTTGCTTGCTACGATTCTGTGGCCCTGACTTAG
Protein sequence
MNCFSQVCTYTRAIFRRTTLVFAAPTSTHGCSNPYWTSSFQNVAVKATALDSLCSRFHLRCYSSRKVRKGTSRSPKFDSEPPMEPEKGDFFVVRKGDVVGVYKSFSDCQA
QIGSSICDLPVSVYKGHSLPKDTEEYLASLGLKNALYTIKAADMRSDLFGSLVPCTFHDGGTSLKGEASGQDAIKKRSREAIVSENIGSTLLTPTSKDPSRKHVKLEDSI
VSRALSSNRESCFLEFDGASKGNPGQAGAGAVVRAHDGSVICRLREGLGIATSNVAEYRAVLLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKL
KDKFLSFEISHVLRNLNSEADAQANLAITLADGEANALDATILFPRLLRSLDVVSVACYDSVALT
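The relationship between the CDS and the protein sequence can be checked by translating codons. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated in this record. The name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACTGCTTCTCGCAAGTC"))  # first seven codons of the CDS
print(translate("GCCCTGACTTAG"))           # last four codons, ending in TAG
```

The two calls print `MNCFSQV` and `ALT`, matching the start and end of the protein sequence listed above. Passing the full CDS with its line breaks should work the same way, since whitespace is removed before translation.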