CuGenDBv2

CmoCh05G009200 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G009200
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: RNase H domain-containing protein
Genome location: Cmo_Chr05:7133374..7138611
RNA-Seq Expression: CmoCh05G009200
Synteny: CmoCh05G009200
Gene Ontology terms:
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR011320 - Ribonuclease H1, N-terminal
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG6599033.1 hypothetical protein SDJN03_08811, partial [Cucurbita argyrosperma subsp. sororia] | 2.8e-207 | 98.92
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNCFSQFSTYTRAIFRTTNLAFAASTSI+GCRFNLYWTSSFHSVTLKPTPLDSLCSRF LRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEAT+LKGEASGQDAIKKRSRETIVSENIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESC LEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

KAG7029985.1 hypothetical protein SDJN02_08329 [Cucurbita argyrosperma subsp. argyrosperma] | 8.4e-204 | 95.55
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNCFSQFSTYTRAIFRTTNLAFAASTSI+GCR NLYWTSSFHSVTLKPTPLDSLCSRF LRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEAT+LKGEASGQDAIKKRSRETIVSENIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSV------------ICRLREGLGIATNNVAEYRAILLGLKYA
        TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESC LEFDGASKGNPGQAGAGAVLRAHDGSV            ICRLREGLGIATNNVAEYRAILLGLKYA
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSV------------ICRLREGLGIATNNVAEYRAILLGLKYA

Query:  LEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        LEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
Subjt:  LEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

XP_022946833.1 uncharacterized protein LOC111450776 [Cucurbita moschata] | 1.2e-210 | 100
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

XP_022999715.1 uncharacterized protein LOC111493983 [Cucurbita maxima] | 1.5e-200 | 95.41
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNC SQFSTY RAIFRTTNLAFAASTSIHGC FN YWTSS HSVTLKPTPLDSLCSRF LRCYSSRKLRKGASPSPNLDS+PPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVY+SFTDCQAQIGSSICDLPVSVYKGHSLPKDT EYL+SVGLKNALYTI+AADMRPDLFSSLVPCTFHDEAT+LKGEASGQDAIKKRSRETIVS+NIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVS+APSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENI+ELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLA++LPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

XP_023547239.1 uncharacterized protein LOC111806115 [Cucurbita pepo subsp. pepo] | 1.1e-206 | 98.38
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNCFSQFSTYTRAIFR TNLAFAASTSIHGCRFN YWTSSFHSVTLKPTPLDSLCSRF LRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYL+SVGLKNALYTIKAADMRPDLFSSLVPCTFHDEAT+LKGEASGQDAIKKRSRETIVSENIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADA+ANLAVTLPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KTZ9 RNase H domain-containing protein | 2.0e-171 | 82.62
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG
        MNCFSQ STYTR IFR TNL FAASTSIHGC  N YWTSSFH+V +K T LDSLCSRF LRCYS+   RK RK  SPSP LDS+PP+E +MGDFFVVRKG
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG

Query:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN
        DVVGVYKSF+DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+SVGLKNALYTIKAADMRPDLF SL PCTFH   T+L GE SGQDAIKKRSRE IV EN
Subjt:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN

Query:  IGSTVLTPTSKDPLRKHVKLEDSVVSQA-PSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI
        +GSTVLTPT KDP RKH+KLEDS+VS +  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGFTRI
Subjt:  IGSTVLTPTSKDPLRKHVKLEDSVVSQA-PSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRI

Query:  HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        HVQGDSKLVCMQVQGLWK K+EN++ELCNEV KLK+KFLSFE++HVLR+LNSEADAQANLA+TL +GE QEFE+
Subjt:  HVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X1 | 3.9e-170 | 81.48
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG
        MNC SQ STYTR IFR TNL FAASTSIHGC  N YW+S+FH+V +K T LDSLCSRF LRCYS+   RK RK  SPSP LDS+PPME +MGDFFVVRKG
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG

Query:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN
        DVVGVYKSF+DC AQIGSSICDLPVSV+KGHSLPKD+ EYL+S+GLKNALYTIKAADMRPDLF SLVPCTFHD   +L GE SGQDAIKKRSRE IVSEN
Subjt:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN

Query:  IGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG
        +GS+VL     TPTS+DP RKH+KLEDS+VS + SNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+KG
Subjt:  IGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG

Query:  FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        FTRIHVQGDSKLVCMQVQGLWK KNENI+ELCNEV+KLK+KFLSFE++HVLR+LNSEADAQANLA+TL DGE QE E+
Subjt:  FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

A0A5A7TCE2 RNase H family protein, putative isoform 2 | 3.9e-170 | 81.48
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG
        MNC SQ STYTR IFR TNL FAASTSIHGC  N YW+S+FH+V +K T LDSLCSRF LRCYS+   RK RK  SPSP LDS+PPME +MGDFFVVRKG
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSS---RKLRKGASPSPNLDSKPPMEPDMGDFFVVRKG

Query:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN
        DVVGVYKSF+DC AQIGSSICDLPVSV+KGHSLPKD+ EYL+S+GLKNALYTIKAADMRPDLF SLVPCTFHD   +L GE SGQDAIKKRSRE IVSEN
Subjt:  DVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSEN

Query:  IGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG
        +GS+VL     TPTS+DP RKH+KLEDS+VS + SNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+KG
Subjt:  IGSTVL-----TPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKG

Query:  FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        FTRIHVQGDSKLVCMQVQGLWK KNENI+ELCNEV+KLK+KFLSFE++HVLR+LNSEADAQANLA+TL DGE QE E+
Subjt:  FTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

A0A6J1G4Y7 uncharacterized protein LOC111450776 | 5.9e-211 | 100
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

A0A6J1KGD7 uncharacterized protein LOC111493983 | 7.2e-201 | 95.41
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV
        MNC SQFSTY RAIFRTTNLAFAASTSIHGC FN YWTSS HSVTLKPTPLDSLCSRF LRCYSSRKLRKGASPSPNLDS+PPMEPDMGDFFVVRKGDVV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVV

Query:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS
        GVY+SFTDCQAQIGSSICDLPVSVYKGHSLPKDT EYL+SVGLKNALYTI+AADMRPDLFSSLVPCTFHDEAT+LKGEASGQDAIKKRSRETIVS+NIGS
Subjt:  GVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGS

Query:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
        TVLTPTSKDPLRKHVKLEDSVVS+APSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG
Subjt:  TVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE
        DSKLVCMQVQGLWKVKNENI+ELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLA++LPDGEAQEFEE
Subjt:  DSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE

SwissProt top hits | e value | % identity | Alignment
P54162 14.7 kDa ribonuclease H-like protein | 1.4e-07 | 32.26
Query:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLS
        DGAS GNPG +G G  ++ H+G         +G+ TN  AE+ A++ G+K    +G+  +  + DS +V  +   L  VKN        E+++LK  F  
Subjt:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLS

Query:  FEISHVLRNLNSEADAQANLAVTL
        F I  +    N +AD  A  A+ L
Subjt:  FEISHVLRNLNSEADAQANLAVTL

P64956 Uncharacterized protein Mb2253c | 4.3e-17 | 42.86
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAV
        +F       V R  N+ AD  AN A+
Subjt:  KFLSFEISHVLRNLNSEADAQANLAV

P9WLH4 Uncharacterized protein MT2287 | 4.3e-17 | 42.86
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAV
        +F       V R  N+ AD  AN A+
Subjt:  KFLSFEISHVLRNLNSEADAQANLAV

P9WLH5 Bifunctional protein Rv2228c | 4.3e-17 | 42.86
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKD

Query:  KFLSFEISHVLRNLNSEADAQANLAV
        +F       V R  N+ AD  AN A+
Subjt:  KFLSFEISHVLRNLNSEADAQANLAV

Q9HSF6 Ribonuclease HI | 9.5e-17 | 40.65
Query:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFL
        FDGAS+GNPG A  G VL + DG ++    + +G ATNN AEY A++  L+ A + GF  I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFL

Query:  SFEISHVLRNLNSEADAQANLAV
         + I+HV R  N  ADA AN A+
Subjt:  SFEISHVLRNLNSEADAQANLAV

Arabidopsis top hits | e value | % identity | Alignment
AT1G24090.1 RNase H family protein | 4.8e-88 | 50.13
Query:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSR-KLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV
        MNC S   +Y  A+      ++ +S   + C F +   S      LKP  + S+     +  YSSR K  K    S  + S    E D   FFVVRKGDV
Subjt:  MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSR-KLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV

Query:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEA-TTLK---GEASGQDAIKKRSRETIVS
        +G+YK  +DCQAQ+GSS+ DLPVSVYKG+SLPKDT EYLSSVGLK  LY+++A+D++ D+F +L PC F + A  T+K    E + +   K   ++ + S
Subjt:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEA-TTLK---GEASGQDAIKKRSRETIVS

Query:  ENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTR
         +I        S DPL K  K+E S    A  + E+CF+EFDGASKGNPG +GA AVL+  DGS+ICR+R+GLGIATNN AEY A++LGLKYA+EKG+  
Subjt:  ENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTR

Query:  IHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ
        I V+GDSKLVCMQ++G WKV +E +A+L  E   L +K +SFEISHVLRNLN++AD QANLAV LP+GE +
Subjt:  IHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 1.5e-73 | 47.6
Query:  MEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQD
        ME +   F++VRKGD++GVY+S ++CQ Q GSS+    +SVYKG+  PK   + LSS G+KNAL+++ A+ ++ D F  L+PC     +++ +GE+  + 
Subjt:  MEPDMGDFFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQD

Query:  AIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSN---------HESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN
        +  KR       +++GS      S  P +K +K+E+ ++ + PS+         ++SC +EFDGASKGNPG+AGAGAVLRA D SV+  LREG+G ATNN
Subjt:  AIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSN---------HESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN

Query:  VAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ
        VAEYRA+LLGL+ AL+KGF  +HV GDS LVCMQVQG WK  +  +AELC +  +L + F +F+I H+ R  NSEAD QAN A+ L DG+ Q
Subjt:  VAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ

AT5G51080.1 RNase H family protein | 2.9e-77 | 43.87
Query:  MNCFSQFSTY-TRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV
        MN FS+  +Y +  +FR ++   +          ++ W   F++ +LK +   +  S   + CYSSR     +  S +  S    + +   FFVVRKGD+
Subjt:  MNCFSQFSTY-TRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV

Query:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIG
        VG+YK   DCQAQ+GSS+ D PVSVYKG+SL KDT E LS+VGLK  LY  +A D++ D+F +L PC F D+                            
Subjt:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIG

Query:  STVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQ
             P++   + K  +LE S    A +++E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK+A+EKG+T+I V+
Subjt:  STVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ
         DSKLVCMQ++G WKV +E +++L  E  +L DK LSFEISHVLR+LNS+AD QAN+A  L +GE +
Subjt:  GDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ

AT5G51080.2 RNase H family protein | 2.9e-77 | 43.87
Query:  MNCFSQFSTY-TRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV
        MN FS+  +Y +  +FR ++   +          ++ W   F++ +LK +   +  S   + CYSSR     +  S +  S    + +   FFVVRKGD+
Subjt:  MNCFSQFSTY-TRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDV

Query:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIG
        VG+YK   DCQAQ+GSS+ D PVSVYKG+SL KDT E LS+VGLK  LY  +A D++ D+F +L PC F D+                            
Subjt:  VGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIG

Query:  STVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQ
             P++   + K  +LE S    A +++E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK+A+EKG+T+I V+
Subjt:  STVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQ

Query:  GDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ
         DSKLVCMQ++G WKV +E +++L  E  +L DK LSFEISHVLR+LNS+AD QAN+A  L +GE +
Subjt:  GDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ

AT5G51080.3 RNase H family protein | 1.0e-74 | 51.45
Query:  FFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSR
        FFVVRKGD+VG+YK   DCQAQ+GSS+ D PVSVYKG+SL KDT E LS+VGLK  LY  +A D++ D+F +L PC F D+                   
Subjt:  FFVVRKGDVVGVYKSFTDCQAQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSR

Query:  ETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALE
                      P++   + K  +LE S    A +++E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK+A+E
Subjt:  ETIVSENIGSTVLTPTSKDPLRKHVKLEDSVVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALE

Query:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ
        KG+T+I V+ DSKLVCMQ++G WKV +E +++L  E  +L DK LSFEISHVLR+LNS+AD QAN+A  L +GE +
Subjt:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKLKDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQ


Sequences
CDS sequence
ATGAACTGCTTCTCCCAATTCTCTACCTACACTCGCGCCATTTTCAGAACCACCAATCTTGCTTTTGCAGCTTCCACCTCCATTCATGGCTGCCGCTTTAATCTCTACTG
GACCTCGAGCTTTCACAGCGTCACTCTTAAACCTACTCCTTTAGACTCCTTGTGTTCCAGATTTTGTCTACGTTGCTACTCCTCTCGAAAACTCCGCAAGGGCGCTTCTC
CTTCTCCCAACTTAGATTCTAAACCTCCCATGGAACCAGACATGGGCGACTTCTTTGTCGTTCGGAAGGGGGACGTTGTTGGAGTCTATAAAAGTTTTACTGACTGTCAG
GCGCAAATTGGATCTTCGATATGCGATCTTCCTGTTAGCGTGTATAAGGGACACTCATTGCCGAAAGACACAGGGGAATATCTTTCTTCCGTCGGGCTTAAGAATGCTCT
GTACACTATTAAAGCTGCAGATATGAGACCTGATCTTTTCAGTTCGCTCGTTCCTTGCACTTTTCATGATGAAGCTACTACTCTTAAAGGTGAGGCTTCTGGCCAGGATG
CCATAAAGAAGAGATCAAGAGAGACCATTGTATCAGAAAATATTGGGTCAACTGTTTTAACTCCTACTTCAAAAGATCCCTTGAGGAAACATGTCAAGTTGGAAGATTCC
GTTGTGTCCCAAGCACCCTCTAACCATGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAGGGAAACCCTGGGCAAGCTGGTGCAGGAGCTGTTCTACGAGCTCATGA
TGGGAGTGTGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCAATTCTTTTAGGGTTGAAGTATGCACTTGAGAAAGGGTTTA
CTAGGATCCATGTCCAAGGCGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAATATCGCTGAGTTATGTAATGAAGTTATGAAGCTG
AAGGATAAATTTCTCTCCTTCGAGATCAGTCATGTACTAAGGAATCTAAACTCTGAAGCCGATGCTCAAGCAAACTTGGCTGTCACTCTACCAGACGGCGAAGCCCAAGA
ATTTGAAGAATAA
mRNA sequence
GGACAAACCTATTTGGGTGGATGGATGACATTGATTTGATTTGGAAACCCTGAAATCCCCATAGTTCCTGGAAAATTCCCCTCTGATGAACTGCTTCTCCCAATTCTCTA
CCTACACTCGCGCCATTTTCAGAACCACCAATCTTGCTTTTGCAGCTTCCACCTCCATTCATGGCTGCCGCTTTAATCTCTACTGGACCTCGAGCTTTCACAGCGTCACT
CTTAAACCTACTCCTTTAGACTCCTTGTGTTCCAGATTTTGTCTACGTTGCTACTCCTCTCGAAAACTCCGCAAGGGCGCTTCTCCTTCTCCCAACTTAGATTCTAAACC
TCCCATGGAACCAGACATGGGCGACTTCTTTGTCGTTCGGAAGGGGGACGTTGTTGGAGTCTATAAAAGTTTTACTGACTGTCAGGCGCAAATTGGATCTTCGATATGCG
ATCTTCCTGTTAGCGTGTATAAGGGACACTCATTGCCGAAAGACACAGGGGAATATCTTTCTTCCGTCGGGCTTAAGAATGCTCTGTACACTATTAAAGCTGCAGATATG
AGACCTGATCTTTTCAGTTCGCTCGTTCCTTGCACTTTTCATGATGAAGCTACTACTCTTAAAGGTGAGGCTTCTGGCCAGGATGCCATAAAGAAGAGATCAAGAGAGAC
CATTGTATCAGAAAATATTGGGTCAACTGTTTTAACTCCTACTTCAAAAGATCCCTTGAGGAAACATGTCAAGTTGGAAGATTCCGTTGTGTCCCAAGCACCCTCTAACC
ATGAATCTTGCTTTCTAGAATTCGATGGTGCCTCAAAGGGAAACCCTGGGCAAGCTGGTGCAGGAGCTGTTCTACGAGCTCATGATGGGAGTGTGATATGTAGACTGCGT
GAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCAATTCTTTTAGGGTTGAAGTATGCACTTGAGAAAGGGTTTACTAGGATCCATGTCCAAGGCGACTC
CAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAATATCGCTGAGTTATGTAATGAAGTTATGAAGCTGAAGGATAAATTTCTCTCCTTCGAGA
TCAGTCATGTACTAAGGAATCTAAACTCTGAAGCCGATGCTCAAGCAAACTTGGCTGTCACTCTACCAGACGGCGAAGCCCAAGAATTTGAAGAATAATAGTTAGAATGC
ACGGCAGGATACATCTTACAGCATAGAATTATTTTCTGAGGAATGCATTTATAGTCCAATGCTTTGTATGCTACTATTCTTTTCCCAAGGTTGACACGATCTTTGGGCAT
TGTTTCGGTTGCTCGCTTATGCTTCATGGCCCTGACTTAGCGAACTGAAGTTCTAGTGGCAATGGACTGGACAGTGCACAAGTGCGTTTGTCCTGCATATATGAGATAAG
AGTCCTAGTTCAATTTATAACTGGAATGTTGCATTTATTTTCTTATTCGTGTTACCAGTTGATTAGGCAATGATGCAGACTATAATCGAGAATTAAAGAACACAAATTTT
GGATTGTTCAATGAAGATCTTTGAGAAAGACGTTTTTTTTTTTCTGCTTCAGCCATCGAGGATTGTCAAACTTTGTAGCTTTTTCTCTTGTTTAAGTCTCGATTAGATCT
TCTATTCCTTCAGTTGGCTACTGG
Protein sequence
MNCFSQFSTYTRAIFRTTNLAFAASTSIHGCRFNLYWTSSFHSVTLKPTPLDSLCSRFCLRCYSSRKLRKGASPSPNLDSKPPMEPDMGDFFVVRKGDVVGVYKSFTDCQ
AQIGSSICDLPVSVYKGHSLPKDTGEYLSSVGLKNALYTIKAADMRPDLFSSLVPCTFHDEATTLKGEASGQDAIKKRSRETIVSENIGSTVLTPTSKDPLRKHVKLEDS
VVSQAPSNHESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALEKGFTRIHVQGDSKLVCMQVQGLWKVKNENIAELCNEVMKL
KDKFLSFEISHVLRNLNSEADAQANLAVTLPDGEAQEFEE