CuGenDBv2

Sed0007819 (gene) of Chayote v1 genome

Gene ID: Sed0007819
Organism: Sechium edule (Chayote v1)
Description: RNase H domain-containing protein
Genome location: LG08:32266053..32274682
RNA-Seq Expression: Sed0007819
Synteny: Sed0007819
Gene Ontology terms:
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: NA
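The genome location field follows a `sequence:start..end` pattern. A minimal sketch of splitting it into its parts and deriving the gene span is shown below. The helper name `parse_location` is hypothetical, and the span assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG08:32266053..32274682")
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```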


Homology
GenBank top hits: e value, % identity, alignment
KAG6599033.1 hypothetical protein SDJN03_08811, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.6e-168; % identity: 82.21)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+CFSQ STYTRA+FR T L F ASTSI+GC  N YWTSSFH++ +K T LDSLCSRFRLRCYSSRK+RKG S  P LDS+P PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVYK+F DCQAQIGSSICDLPVSV+KGHSLPKDT EYLSS+GLKNALYTIKAADMR DLF SL PCTFHD ATS KGEASGQDAIKKRSRE IVSE+ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+VS      ESC L FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKD+F+SFEISHVLRNLNSEADAQANLA+TL DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

XP_022946833.1 uncharacterized protein LOC111450776 [Cucurbita moschata] (e value: 5.6e-168; % identity: 82.21)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+CFSQ STYTRA+FR T L F ASTSIHGC  N YWTSSFH++ +K T LDSLCSRF LRCYSSRK+RKG S  P LDS+P PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVYK+F DCQAQIGSSICDLPVSV+KGHSLPKDT EYLSS+GLKNALYTIKAADMR DLF SL PCTFHD AT+ KGEASGQDAIKKRSRE IVSE+ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+VS      ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKD+F+SFEISHVLRNLNSEADAQANLA+TL DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

XP_022999715.1 uncharacterized protein LOC111493983 [Cucurbita maxima] (e value: 6.2e-167; % identity: 81.67)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGC-TNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+C SQ STY RA+FR T L F ASTSIHGC  NPYWTSS H++ +K T LDSLCSRFRLRCYSSRK+RKG S  P LDSEP PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGC-TNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVY++F DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+S+GLKNALYTI+AADMR DLF SL PCTFHD ATS KGEASGQDAIKKRSRE IVS++ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIV----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+V    S+ ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIV----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENISELCNEV+KLKD+F+SFEISHVLRNLNSEADAQANLAI+L DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

XP_023547239.1 uncharacterized protein LOC111806115 [Cucurbita pepo subsp. pepo] (e value: 1.0e-169; % identity: 82.48)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+CFSQ STYTRA+FR T L F ASTSIHGC  NPYWTSSFH++ +K T LDSLCSRFRLRCYSSRK+RKG S  P LDS+P PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVYK+F DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+S+GLKNALYTIKAADMR DLF SL PCTFHD ATS KGEASGQDAIKKRSRE IVSE+ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+VS      ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKD+F+SFEISHVLRNLNSEADA+ANLA+TL DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

XP_038889960.1 uncharacterized protein LOC120079705 [Benincasa hispida] (e value: 4.2e-171; % identity: 84.18)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSF--HNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGD
        M+CFSQVSTYTRA+FRRTTLV  ASTSI+G +N YWTSSF  HN+AVKATA+DSLCSRFRLRCYSSRK+RK  S  PKLDSEP P E E  DFFVVRKGD
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSF--HNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGD

Query:  VVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSEST
        ++GVYK+F+DCQAQIGSSICDLPVS++KGHSLPKDT EYL+S+GLKNALYTIKAADMR DLFGSL PCTFHDG TS KGEASGQDAIKKR REAIVSE+ 
Subjt:  VVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSEST

Query:  GSTVLTPPSEDPSRKHVKLEGSIVSDC-----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIH
        GS+VLTP S+DPSRKHVKLE SIVS       ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+ALQKGFT+IH
Subjt:  GSTVLTPPSEDPSRKHVKLEGSIVSDC-----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIH

Query:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD+F+SFEI+HVLRNLNSEADAQANLAITLADGEVQEFE+
Subjt:  VQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

TrEMBL top hits: e value, % identity, alignment
A0A0A0KTZ9 RNase H domain-containing protein (e value: 1.1e-166; % identity: 81.02)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG
        M+CFSQVSTYTR +FRRT LVF ASTSIHGC+N YWTSSFHN+AVK TALDSLCSRF LRCYS+   RK RK TS  PKLDSEP P+E E  DFFVVRKG
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG

Query:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES
        DVVGVYK+F+DCQAQIGSSICDLPVSVFKGHSLPKDT EYL+S+GLKNALYTIKAADMR DLFGSL PCTFH G TS  GE SGQDAIKKRSREAIV E+
Subjt:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES

Query:  TGSTVLTPPSEDPSRKHVKLEGSIV-----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKI
         GSTVLTP  +DP+RKH+KLE SIV     S+ ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK AL+KGFT+I
Subjt:  TGSTVLTPPSEDPSRKHVKLEGSIV-----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKI

Query:  HVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        HVQGDSKLVCMQVQGLWK K+EN+SELCNEV KLK++F+SFE++HVLR+LNSEADAQANLA+TLA+GEVQEFE+
Subjt:  HVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X1 (e value: 1.1e-166; % identity: 80.9)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG
        M+C SQVSTYTR +FRRT LVF ASTSIHGC+NPYW+S+FHN+AVKATALDSLCSRF LRCYS+   RK RK TS  PKLDSEP PME E  DFFVVRKG
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG

Query:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES
        DVVGVYK+F+DC AQIGSSICDLPVSVFKGHSLPKD+ EYL+SIGLKNALYTIKAADMR DLFGSL PCTFHDG  S  GE SGQDAIKKRSREAIVSE+
Subjt:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES

Query:  TGSTVL-----TPPSEDPSRKHVKLEGSIV---SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGF
         GS+VL     TP SEDP+RKH+KLE SIV   S+ ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK AL+KGF
Subjt:  TGSTVL-----TPPSEDPSRKHVKLEGSIV---SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGF

Query:  TKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        T+IHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK++F+SFE++HVLR+LNSEADAQANLA+TLADGE+QE E+
Subjt:  TKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

A0A5A7TCE2 RNase H family protein, putative isoform 2 (e value: 1.1e-166; % identity: 80.9)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG
        M+C SQVSTYTR +FRRT LVF ASTSIHGC+NPYW+S+FHN+AVKATALDSLCSRF LRCYS+   RK RK TS  PKLDSEP PME E  DFFVVRKG
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSS---RKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKG

Query:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES
        DVVGVYK+F+DC AQIGSSICDLPVSVFKGHSLPKD+ EYL+SIGLKNALYTIKAADMR DLFGSL PCTFHDG  S  GE SGQDAIKKRSREAIVSE+
Subjt:  DVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSES

Query:  TGSTVL-----TPPSEDPSRKHVKLEGSIV---SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGF
         GS+VL     TP SEDP+RKH+KLE SIV   S+ ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK AL+KGF
Subjt:  TGSTVL-----TPPSEDPSRKHVKLEGSIV---SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGF

Query:  TKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        T+IHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK++F+SFE++HVLR+LNSEADAQANLA+TLADGE+QE E+
Subjt:  TKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

A0A6J1G4Y7 uncharacterized protein LOC111450776 (e value: 2.7e-168; % identity: 82.21)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+CFSQ STYTRA+FR T L F ASTSIHGC  N YWTSSFH++ +K T LDSLCSRF LRCYSSRK+RKG S  P LDS+P PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCT-NPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVYK+F DCQAQIGSSICDLPVSV+KGHSLPKDT EYLSS+GLKNALYTIKAADMR DLF SL PCTFHD AT+ KGEASGQDAIKKRSRE IVSE+ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+VS      ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIVSDC----ESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENI+ELCNEV+KLKD+F+SFEISHVLRNLNSEADAQANLA+TL DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

A0A6J1KGD7 uncharacterized protein LOC111493983 (e value: 3.0e-167; % identity: 81.67)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGC-TNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV
        M+C SQ STY RA+FR T L F ASTSIHGC  NPYWTSS H++ +K T LDSLCSRFRLRCYSSRK+RKG S  P LDSEP PMEP+  DFFVVRKGDV
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGC-TNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDV

Query:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG
        VGVY++F DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+S+GLKNALYTI+AADMR DLF SL PCTFHD ATS KGEASGQDAIKKRSRE IVS++ G
Subjt:  VGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTG

Query:  STVLTPPSEDPSRKHVKLEGSIV----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        STVLTP S+DP RKHVKLE S+V    S+ ESCFL FDGASKGNPGQAGAGAVLR HDGS++CRLREGLGIATNNVAEYRAILLGLK+AL+KGFT+IHVQ
Subjt:  STVLTPPSEDPSRKHVKLEGSIV----SDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
        GDSKLVCMQVQGLWKVKNENISELCNEV+KLKD+F+SFEISHVLRNLNSEADAQANLAI+L DGE QEFEE
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE

SwissProt top hits: e value, % identity, alignment
P54162 14.7 kDa ribonuclease H-like protein (e value: 3.6e-08; % identity: 33.87)
Query:  DGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFIS
        DGAS GNPG +G G  ++ H+G         +G+ TN  AE+ A++ G+K    +G+  +  + DS +V  +   L  VKN        E+I+LK  F  
Subjt:  DGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFIS

Query:  FEISHVLRNLNSEADAQANLAITL
        F I  +    N +AD  A  AI L
Subjt:  FEISHVLRNLNSEADAQANLAITL

P64956 Uncharacterized protein Mb2253c (e value: 2.5e-17; % identity: 44.44)
Query:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI
        DG S+GNPG AG GAV+ T D S V    ++ +G ATNNVAEYR ++ GL  A++ G T+  V  DSKLV  Q+ G WKVK+ ++ +L  +   L  QF 
Subjt:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI

Query:  SFEISHVLRNLNSEADAQANLAITLA
              V R  N+ AD  AN A+  A
Subjt:  SFEISHVLRNLNSEADAQANLAITLA

P9WLH4 Uncharacterized protein MT2287 (e value: 2.5e-17; % identity: 44.44)
Query:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI
        DG S+GNPG AG GAV+ T D S V    ++ +G ATNNVAEYR ++ GL  A++ G T+  V  DSKLV  Q+ G WKVK+ ++ +L  +   L  QF 
Subjt:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI

Query:  SFEISHVLRNLNSEADAQANLAITLA
              V R  N+ AD  AN A+  A
Subjt:  SFEISHVLRNLNSEADAQANLAITLA

P9WLH5 Bifunctional protein Rv2228c (e value: 2.5e-17; % identity: 44.44)
Query:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI
        DG S+GNPG AG GAV+ T D S V    ++ +G ATNNVAEYR ++ GL  A++ G T+  V  DSKLV  Q+ G WKVK+ ++ +L  +   L  QF 
Subjt:  DGASKGNPGQAGAGAVLRTHDGSLV-CRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI

Query:  SFEISHVLRNLNSEADAQANLAITLA
              V R  N+ AD  AN A+  A
Subjt:  SFEISHVLRNLNSEADAQANLAITLA

Q9HSF6 Ribonuclease HI (e value: 9.4e-17; % identity: 41.46)
Query:  FDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI
        FDGAS+GNPG A  G VL + DG +V    + +G ATNN AEY A++  L+ A   GF  I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFI

Query:  SFEISHVLRNLNSEADAQANLAI
         + I+HV R  N  ADA AN A+
Subjt:  SFEISHVLRNLNSEADAQANLAI

Arabidopsis top hits: e value, % identity, alignment
AT1G24090.1 RNase H family protein (e value: 2.2e-85; % identity: 46.05)
Query:  MSCFSQVSTY-TRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNL----AVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVR
        M+C S   +Y    + +R++ V +            W   F  +     +K  A+ S+     +  YSSR   K         +  S ++ EKD FFVVR
Subjt:  MSCFSQVSTY-TRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNL----AVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVR

Query:  KGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVS
        KGDV+G+YK+ +DCQAQ+GSS+ DLPVSV+KG+SLPKDT EYLSS+GLK  LY+++A+D++ D+FG+L PC F + A      +  +   + +S++    
Subjt:  KGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVS

Query:  ESTGSTVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ
        +   +++    S DP  K  K+E S     E+CF+ FDGASKGNPG +GA AVL+T DGSL+CR+R+GLGIATNN AEY A++LGLK+A++KG+  I V+
Subjt:  ESTGSTVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQ

Query:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
        GDSKLVCMQ++G WKV +E +++L  E   L ++ +SFEISHVLRNLN++AD QANLA+ L +GEV+
Subjt:  GDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 4.0e-71; % identity: 47.96)
Query:  SPMEPEKDDFFVVRKGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASG
        S ME EKD F++VRKGD++GVY++ ++CQ Q GSS+    +SV+KG+  PK   + LSS G+KNAL+++ A+ ++ D FG L PC     ++S +GE+  
Subjt:  SPMEPEKDDFFVVRKGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASG

Query:  QDAIKKRSREAIVSESTGSTVLTPPSEDPSRKHVKLEGSI-------------VSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIAT
        + +  KR ++    ES GS   +PP     +K +K+E  +             +   +SC + FDGASKGNPG+AGAGAVLR  D S++  LREG+G AT
Subjt:  QDAIKKRSREAIVSESTGSTVLTPPSEDPSRKHVKLEGSI-------------VSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIAT

Query:  NNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
        NNVAEYRA+LLGL+ AL KGF  +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F+I H+ R  NSEAD QAN AI LADG+ Q
Subjt:  NNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ

AT5G51080.1 RNase H family protein (e value: 1.8e-79; % identity: 45.3)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDVV
        M+ FS+  +Y   V  R +   T+    + C   ++TS   +L   + ++ S      + CYSSR  +   S++ K     S  + EKD FFVVRKGD+V
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDVV

Query:  GVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTGS
        G+YK+  DCQAQ+GSS+ D PVSV+KG+SL KDT E LS++GLK  LY  +A D++ D+FG+L PC F D                              
Subjt:  GVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTGS

Query:  TVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKL
            P +     K  +LE S  +  E+C + FDGASKGNPG +GA AVL+T DGSL+ ++R+GLGIATNN AEY  ++LGLKHA++KG+TKI V+ DSKL
Subjt:  TVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKL

Query:  VCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
        VCMQ++G WKV +E +S+L  E  +L D+ +SFEISHVLR+LNS+AD QAN+A  L++GEV+
Subjt:  VCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ

AT5G51080.2 RNase H family protein (e value: 1.8e-79; % identity: 45.3)
Query:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDVV
        M+ FS+  +Y   V  R +   T+    + C   ++TS   +L   + ++ S      + CYSSR  +   S++ K     S  + EKD FFVVRKGD+V
Subjt:  MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDVV

Query:  GVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTGS
        G+YK+  DCQAQ+GSS+ D PVSV+KG+SL KDT E LS++GLK  LY  +A D++ D+FG+L PC F D                              
Subjt:  GVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTGS

Query:  TVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKL
            P +     K  +LE S  +  E+C + FDGASKGNPG +GA AVL+T DGSL+ ++R+GLGIATNN AEY  ++LGLKHA++KG+TKI V+ DSKL
Subjt:  TVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKL

Query:  VCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
        VCMQ++G WKV +E +S+L  E  +L D+ +SFEISHVLR+LNS+AD QAN+A  L++GEV+
Subjt:  VCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ

AT5G51080.3 RNase H family protein (e value: 2.4e-76; % identity: 51.6)
Query:  SPMEPEKDDFFVVRKGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASG
        S  + EKD FFVVRKGD+VG+YK+  DCQAQ+GSS+ D PVSV+KG+SL KDT E LS++GLK  LY  +A D++ D+FG+L PC F D           
Subjt:  SPMEPEKDDFFVVRKGDVVGVYKNFNDCQAQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASG

Query:  QDAIKKRSREAIVSESTGSTVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGL
                               P +     K  +LE S  +  E+C + FDGASKGNPG +GA AVL+T DGSL+ ++R+GLGIATNN AEY  ++LGL
Subjt:  QDAIKKRSREAIVSESTGSTVLTPPSEDPSRKHVKLEGSIVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGL

Query:  KHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
        KHA++KG+TKI V+ DSKLVCMQ++G WKV +E +S+L  E  +L D+ +SFEISHVLR+LNS+AD QAN+A  L++GEV+
Subjt:  KHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQFISFEISHVLRNLNSEADAQANLAITLADGEVQ
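
In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A minimal sketch of tallying one block from that middle line follows. The function name `midline_stats` is hypothetical, and the result covers a single block only, so it is not expected to reproduce the whole-alignment % identity values reported for each hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, block_length) for one alignment block.

    `query` is the text after 'Query:' and `midline` is the match line
    beneath it, both with the same leading offset removed.
    """
    # Trailing spaces are often trimmed from the match line; pad it back.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First ten columns of the first GenBank alignment block.
identities, positives, length = midline_stats("MSCFSQVSTY", "M+CFSQ STY")
print(f"{identities}/{length} identical, {positives}/{length} positive")
```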


Sequences
CDS sequence
ATGAGCTGCTTTTCGCAAGTCTCTACCTATACTCGTGCCGTTTTCAGAAGGACCACTCTTGTTTTTACGGCTTCCACCTCCATTCATGGCTGTACGAATCCCTACTGGAC
CTCAAGCTTTCACAACCTTGCTGTTAAGGCTACAGCTTTAGACTCCTTGTGCTCCAGATTCCGTCTACGATGCTATTCCTCTCGAAAAGTCCGCAAGGGCACTTCTCAAT
TGCCCAAGTTAGATTCTGAACCTTCTCCCATGGAACCAGAGAAGGATGACTTCTTTGTCGTCCGCAAGGGGGATGTTGTTGGAGTCTATAAGAATTTTAATGATTGCCAG
GCGCAGATTGGATCTTCGATATGTGATCTTCCTGTTAGCGTGTTTAAAGGACACTCATTGCCGAAAGACACCATGGAATATCTTTCTTCCATCGGGCTTAAGAATGCTCT
GTACACTATTAAAGCTGCAGATATGAGGCTTGATCTTTTTGGTTCGCTCGAGCCTTGCACTTTTCATGATGGAGCGACTTCTTGTAAAGGTGAAGCTTCTGGCCAGGATG
CCATAAAGAAGAGATCAAGAGAGGCTATTGTATCAGAGAGTACTGGGTCAACTGTTTTAACTCCCCCATCAGAAGATCCCTCGAGGAAACATGTCAAGTTGGAAGGTTCC
ATTGTGTCTGACTGTGAATCTTGCTTTCTAGTATTTGATGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGGGCAGGAGCTGTTCTTCGAACTCATGATGGGAGTTTGGT
ATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAACGTTGCTGAATATCGAGCTATTCTTTTAGGGCTGAAGCATGCACTTCAGAAAGGGTTCACTAAGATCCATG
TTCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAACATCTCTGAGTTATGCAATGAAGTTATCAAGCTGAAGGATCAATTT
ATCTCGTTCGAGATTAGTCATGTACTCAGGAATTTAAATTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGATGGTGAAGTTCAGGAGTTCGAAGAATA
A
mRNA sequence
CGCGTGAAGTTGCAGCCTTTCACCCTCCTTTATATTCTTCTTCACGCCGTCGCCCACAGACCTTCGACGACGGCACCACTCAACATATCGCCGGCAACTGGTTCAGCGTT
CAGAATCCGGCGACGCGAAACCCACTCATCGGAATCCATCTCACTCGCGGTTTCGACCTCCAGTAACGGCACCCACACGTTTCAATACCGAATCGGTTCCGGCCCTTGAA
TCACCTTTTCTGGTGATTATTTCTCTTGAGCTTGATGATGTGGGCAAGGAGGCAAGACAAGAACACCATTAGTAGCTTGAAGTTTCCTTACAGTAATTCTTTTGTTTAAG
TAGTTAGTAATTCTTTTGTTTAAGTCAGGGTTCAATATTCTGGAACCCCGATTATGTATAGGTTGTCTTCTTTGATTCTTAAGGTTGTCTCTGGCTACATTAAAGTCTAG
TTTGTTTTGTTTTGAAGCTTTAGTTAGGTTTTTTTGCTAGAACATAGATCCTTGAATAAAAAAATTGCATGTTGTTAGAGTTTAACCTTGTTCCTAGTAGCTTGTGTTGT
TTAGAATCGTTGAGACTTTATTGTGCTAGGAGGTGGAGTTGTGGAGTGGTCTTGGTTGTTGTACTGGCTTGGAAGTGATTGAAACTAGTCATAAAAGATGTTCAAATTTT
GGTTATCTTGTAGCTATGTTTCCAAAAATTTTGGGGCATAATTGTTGTAAGTTGGTAACGTTCCTTAGATTATGCTAAATTTTTGGGGCGTTAGAATTATCCTCTGATGA
GCTGCTTTTCGCAAGTCTCTACCTATACTCGTGCCGTTTTCAGAAGGACCACTCTTGTTTTTACGGCTTCCACCTCCATTCATGGCTGTACGAATCCCTACTGGACCTCA
AGCTTTCACAACCTTGCTGTTAAGGCTACAGCTTTAGACTCCTTGTGCTCCAGATTCCGTCTACGATGCTATTCCTCTCGAAAAGTCCGCAAGGGCACTTCTCAATTGCC
CAAGTTAGATTCTGAACCTTCTCCCATGGAACCAGAGAAGGATGACTTCTTTGTCGTCCGCAAGGGGGATGTTGTTGGAGTCTATAAGAATTTTAATGATTGCCAGGCGC
AGATTGGATCTTCGATATGTGATCTTCCTGTTAGCGTGTTTAAAGGACACTCATTGCCGAAAGACACCATGGAATATCTTTCTTCCATCGGGCTTAAGAATGCTCTGTAC
ACTATTAAAGCTGCAGATATGAGGCTTGATCTTTTTGGTTCGCTCGAGCCTTGCACTTTTCATGATGGAGCGACTTCTTGTAAAGGTGAAGCTTCTGGCCAGGATGCCAT
AAAGAAGAGATCAAGAGAGGCTATTGTATCAGAGAGTACTGGGTCAACTGTTTTAACTCCCCCATCAGAAGATCCCTCGAGGAAACATGTCAAGTTGGAAGGTTCCATTG
TGTCTGACTGTGAATCTTGCTTTCTAGTATTTGATGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGGGCAGGAGCTGTTCTTCGAACTCATGATGGGAGTTTGGTATGT
AGACTGCGTGAAGGCCTAGGTATAGCAACCAATAACGTTGCTGAATATCGAGCTATTCTTTTAGGGCTGAAGCATGCACTTCAGAAAGGGTTCACTAAGATCCATGTTCA
AGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAAAATGAGAACATCTCTGAGTTATGCAATGAAGTTATCAAGCTGAAGGATCAATTTATCT
CGTTCGAGATTAGTCATGTACTCAGGAATTTAAATTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGATGGTGAAGTTCAGGAGTTCGAAGAATAATTG
TTAGAAATGCACAGCAGGATAATATAACTTACAGAATAGCAAGTTTCTCTGAGGAATGCGTTTATAGACCAATACTTTGTATACTCTTATTCTTTTTCGGCGCTGCCAAG
ATCTGTGAGCGTTATTTGGTTGCTAGTTCAATACCAATTCTGGGAAGTCTCAAGAAAAGTGAGATGAACAAACTCTGGATGTCTGCTTCAGAGCTCATTTGAAACCAGAT
TTGGAACACCCCAAAAATTTAAAGCATTGTTCGCAATGACCCTCCGGAGCTTGTGGCGCACCAACGCAAGCTTTTATTTACTAAAAGCACACTATATAAGAAAACATAAT
ATGTAGCATATGCACAATAAAAACACTTTCATAGAAGGAAACAAAGTTTTGACTAGAATAATATATAAAACACATAAGATTCAC
Protein sequence
MSCFSQVSTYTRAVFRRTTLVFTASTSIHGCTNPYWTSSFHNLAVKATALDSLCSRFRLRCYSSRKVRKGTSQLPKLDSEPSPMEPEKDDFFVVRKGDVVGVYKNFNDCQ
AQIGSSICDLPVSVFKGHSLPKDTMEYLSSIGLKNALYTIKAADMRLDLFGSLEPCTFHDGATSCKGEASGQDAIKKRSREAIVSESTGSTVLTPPSEDPSRKHVKLEGS
IVSDCESCFLVFDGASKGNPGQAGAGAVLRTHDGSLVCRLREGLGIATNNVAEYRAILLGLKHALQKGFTKIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDQF
ISFEISHVLRNLNSEADAQANLAITLADGEVQEFEE
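
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and are not part of any database tool. Only the first 27 and last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGAGCTGCTTTTCGCAAGTCTCTACC"  # first 27 nt of the CDS above
cds_end = "GAGTTCGAAGAATAA"                # last 15 nt of the CDS above
print(translate(cds_start))  # should match the start of the protein sequence
print(translate(cds_end))    # should end in '*', the stop codon
```

Passing the full CDS, with its line breaks left in, to `translate` and stripping the trailing `*` gives a string that can be compared directly with the protein sequence.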