; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001007 (gene) of Snake gourd v1 genome

Gene IDTan0001007
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG01:58857970..58865394
RNA-Seq ExpressionTan0001007
SyntenyTan0001007
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143798.1 uncharacterized protein LOC101210930 isoform X1 [Cucumis sativus]5.7e-17685.29Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG
        MNCFSQVSTYTR IFRRT LVFAASTSIHGCSN YWTS   NVAV  TALDSLCSRF LRCYS+   RK RK  SPSPK DSE PP+E E  DFFVVRKG
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG

Query:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN
        DV+GVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSL PCTFH G TSL GE S QDAIKKRSRE+IVPEN
Subjt:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN

Query:  IGSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRI
        +GSTVLTPT K P RKH+KLEDSIVS ++SSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGF RI
Subjt:  IGSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRI

Query:  HVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        HVQGDSKLVCMQVQGLWK K+EN+SELC+EV KLK++FLSFE++HVLR+LNSEADAQANLA+TLA+GEV+EFE+
Subjt:  HVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

XP_022946833.1 uncharacterized protein LOC111450776 [Cucurbita moschata]3.1e-17485.22Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNCFSQ STYTRAIFR T L FAASTSIHGC  N YWTS   +V +  T LDSLCSRF LRCYSSRK+RK ASPSP  DS+ PPMEP+  DFFVVRKGDV
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +GVYKSF+DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+SVGLKNALYTIKAADMRPDLF SL+PCTFHD AT+LKGEAS QDAIKKRSRE+IV ENIG
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        STVLTPTSK P+RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYAL+KGF RIHV
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        QGDSKLVCMQVQGLWKVKNENI+ELC+EV+KLKD+FLSFEISHVLRNLNSEADAQANLA+TL DGE +EFEE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

XP_022999715.1 uncharacterized protein LOC111493983 [Cucurbita maxima]1.3e-17585.75Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNC SQ STY RAIFR T L FAASTSIHGC  NPYWTS   +V +  T LDSLCSRFRLRCYSSRK+RK ASPSP  DSE PPMEP+  DFFVVRKGDV
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +GVY+SF+DCQAQIGSSICDLPVSV+KGHSLPKDT+EYLASVGLKNALYTI+AADMRPDLF SL+PCTFHD ATSLKGEAS QDAIKKRSRE+IV +NIG
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        STVLTPTSK P+RKHVKLEDS+VSRA  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYAL+KGF RIHV
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        QGDSKLVCMQVQGLWKVKNENISELC+EV+KLKD+FLSFEISHVLRNLNSEADAQANLAI+L DGE +EFEE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

XP_023547239.1 uncharacterized protein LOC111806115 [Cucurbita pepo subsp. pepo]1.1e-17686.02Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNCFSQ STYTRAIFR T L FAASTSIHGC  NPYWTS   +V +  T LDSLCSRFRLRCYSSRK+RK ASPSP  DS+ PPMEP+  DFFVVRKGDV
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +GVYKSF+DCQAQIGSSICDLPVSV+KGHSLPKDT EYLASVGLKNALYTIKAADMRPDLF SL+PCTFHD ATSLKGEAS QDAIKKRSRE+IV ENIG
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        STVLTPTSK P+RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYAL+KGF RIHV
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        QGDSKLVCMQVQGLWKVKNENI+ELC+EV+KLKD+FLSFEISHVLRNLNSEADA+ANLA+TL DGE +EFEE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

XP_038889960.1 uncharacterized protein LOC120079705 [Benincasa hispida]2.0e-18189.01Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSI--SPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGD
        MNCFSQVSTYTRAIFRRTTLV  ASTSI+G SN YWTS   + NVAV ATA+DSLCSRFRLRCYSSRK+RKAASPSPK DSE PP E E  DFFVVRKGD
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSI--SPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGD

Query:  VLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENI
        ++GVYKSFSDCQAQIGSSICDLPVS++KGHSLPKDT+EYLASVGLKNALYTIKAADMRPDLFGSL+PCTFHDG TS+KGEAS QDAIKKR RE+IV ENI
Subjt:  VLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENI

Query:  GSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIH
        GS+VLTPTSK P RKHVKLEDSIVS ALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGF RIH
Subjt:  GSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIH

Query:  VQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        VQGDSKLVCMQVQGLWKVKNENISELC+EVIKLKD+FLSFEI+HVLRNLNSEADAQANLAITLADGEV+EFE+
Subjt:  VQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

TrEMBL top hitse value%identityAlignment
A0A0A0KTZ9 RNase H domain-containing protein2.8e-17685.29Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG
        MNCFSQVSTYTR IFRRT LVFAASTSIHGCSN YWTS   NVAV  TALDSLCSRF LRCYS+   RK RK  SPSPK DSE PP+E E  DFFVVRKG
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG

Query:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN
        DV+GVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSL PCTFH G TSL GE S QDAIKKRSRE+IVPEN
Subjt:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN

Query:  IGSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRI
        +GSTVLTPT K P RKH+KLEDSIVS ++SSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK AL+KGF RI
Subjt:  IGSTVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRI

Query:  HVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        HVQGDSKLVCMQVQGLWK K+EN+SELC+EV KLK++FLSFE++HVLR+LNSEADAQANLA+TLA+GEV+EFE+
Subjt:  HVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X16.4e-17383.38Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG
        MNC SQVSTYTR IFRRT LVFAASTSIHGCSNPYW+S   NVAV ATALDSLCSRF LRCYS+   RK RK  SPSPK DSE PPME E  DFFVVRKG
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG

Query:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN
        DV+GVYKSFSDC AQIGSSICDLPVSVFKGHSLPKD+EEYLAS+GLKNALYTIKAADMRPDLFGSL+PCTFHDG  SL GE S QDAIKKRSRE+IV EN
Subjt:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN

Query:  IGSTVL-----TPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQK
        +GS+VL     TPTS+ P RKH+KLEDSIVS  LSSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+K
Subjt:  IGSTVL-----TPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQK

Query:  GFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        GF RIHVQGDSKLVCMQVQGLWK KNENISELC+EV+KLK++FLSFE++HVLR+LNSEADAQANLA+TLADGE++E E+
Subjt:  GFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

A0A5A7TCE2 RNase H family protein, putative isoform 26.4e-17383.38Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG
        MNC SQVSTYTR IFRRT LVFAASTSIHGCSNPYW+S   NVAV ATALDSLCSRF LRCYS+   RK RK  SPSPK DSE PPME E  DFFVVRKG
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSS---RKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKG

Query:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN
        DV+GVYKSFSDC AQIGSSICDLPVSVFKGHSLPKD+EEYLAS+GLKNALYTIKAADMRPDLFGSL+PCTFHDG  SL GE S QDAIKKRSRE+IV EN
Subjt:  DVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPEN

Query:  IGSTVL-----TPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQK
        +GS+VL     TPTS+ P RKH+KLEDSIVS  LSSN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLK+AL+K
Subjt:  IGSTVL-----TPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQK

Query:  GFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        GF RIHVQGDSKLVCMQVQGLWK KNENISELC+EV+KLK++FLSFE++HVLR+LNSEADAQANLA+TLADGE++E E+
Subjt:  GFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

A0A6J1G4Y7 uncharacterized protein LOC1114507761.5e-17485.22Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNCFSQ STYTRAIFR T L FAASTSIHGC  N YWTS   +V +  T LDSLCSRF LRCYSSRK+RK ASPSP  DS+ PPMEP+  DFFVVRKGDV
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +GVYKSF+DCQAQIGSSICDLPVSV+KGHSLPKDT EYL+SVGLKNALYTIKAADMRPDLF SL+PCTFHD AT+LKGEAS QDAIKKRSRE+IV ENIG
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        STVLTPTSK P+RKHVKLEDS+VS+A  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYAL+KGF RIHV
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        QGDSKLVCMQVQGLWKVKNENI+ELC+EV+KLKD+FLSFEISHVLRNLNSEADAQANLA+TL DGE +EFEE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

A0A6J1KGD7 uncharacterized protein LOC1114939836.1e-17685.75Show/hide
Query:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNC SQ STY RAIFR T L FAASTSIHGC  NPYWTS   +V +  T LDSLCSRFRLRCYSSRK+RK ASPSP  DSE PPMEP+  DFFVVRKGDV
Subjt:  MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCS-NPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +GVY+SF+DCQAQIGSSICDLPVSV+KGHSLPKDT+EYLASVGLKNALYTI+AADMRPDLF SL+PCTFHD ATSLKGEAS QDAIKKRSRE+IV +NIG
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        STVLTPTSK P+RKHVKLEDS+VSRA  SN ESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYAL+KGF RIHV
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE
        QGDSKLVCMQVQGLWKVKNENISELC+EV+KLKD+FLSFEISHVLRNLNSEADAQANLAI+L DGE +EFEE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE

SwissProt top hitse value%identityAlignment
P54162 14.7 kDa ribonuclease H-like protein1.3e-0833.87Show/hide
Query:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLS
        DGAS GNPG +G G  ++ H+G         +G+ TN  AE+ A++ G+K    +G+  +  + DS +V  +   L  VKN       +E+I+LK  F  
Subjt:  DGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLS

Query:  FEISHVLRNLNSEADAQANLAITL
        F I  +    N +AD  A  AI L
Subjt:  FEISHVLRNLNSEADAQANLAITL

P64956 Uncharacterized protein Mb2253c2.1e-1641.86Show/hide
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G     V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD

Query:  RFLSFEISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  RFLSFEISHVLRNLNSEADAQANLAITLA

P9WLH4 Uncharacterized protein MT22872.1e-1641.86Show/hide
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G     V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD

Query:  RFLSFEISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  RFLSFEISHVLRNLNSEADAQANLAITLA

P9WLH5 Bifunctional protein Rv2228c2.1e-1641.86Show/hide
Query:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD
        +E DG S+GNPG AG GAV+   D S V+   ++ +G ATNNVAEYR ++ GL  A++ G     V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGASKGNPGQAGAGAVLRAHDGS-VICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKD

Query:  RFLSFEISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  RFLSFEISHVLRNLNSEADAQANLAITLA

Q9HSF6 Ribonuclease HI1.3e-1640.65Show/hide
Query:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFL
        FDGAS+GNPG A  G VL + DG ++    + +G ATNN AEY A++  L+ A   GF+ I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFL

Query:  SFEISHVLRNLNSEADAQANLAI
         + I+HV R  N  ADA AN A+
Subjt:  SFEISHVLRNLNSEADAQANLAI

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein1.0e-8547.55Show/hide
Query:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MNC S   +Y    + +R++  + +S   + C   Y  S S    V  +++  +CS   +  YSSR   KA        +    ++ EKD FFVVRKGDV
Subjt:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +G+YK  SDCQAQ+GSS+ DLPVSV+KG+SLPKDTEEYL+SVGLK  LY+++A+D++ D+FG+L PC F + A      +  +   + +S++    +   
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
        +++    S  P+ K  K+E S        + E+CF+EFDGASKGNPG +GA AVL+  DGS+ICR+R+GLGIATNN AEY A++LGLKYA++KG+  I V
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE
        +GDSKLVCMQ++G WKV +E +++L  E   L ++ +SFEISHVLRNLN++AD QANLA+ L +GEVE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-7549.66Show/hide
Query:  MEPEKDDFFVVRKGDVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQD
        ME EKD F++VRKGD++GVY+S S+CQ Q GSS+    +SV+KG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC     ++S +GE+ ++ 
Subjt:  MEPEKDDFFVVRKGDVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQD

Query:  AIKKRSRESIVPENIGSTVLTPTSKGPMRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAVLRA D SV+  LREG+G ATNN
Subjt:  AIKKRSRESIVPENIGSTVLTPTSKGPMRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN

Query:  VAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE
        VAEYRA+LLGL+ AL KGF  +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F+I H+ R  NSEAD QAN AI LADG+ +
Subjt:  VAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-7549.66Show/hide
Query:  MEPEKDDFFVVRKGDVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQD
        ME EKD F++VRKGD++GVY+S S+CQ Q GSS+    +SV+KG+  PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC     ++S +GE+ ++ 
Subjt:  MEPEKDDFFVVRKGDVLGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQD

Query:  AIKKRSRESIVPENIGSTVLTPTSKGPMRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN
        +  KR       +++GS      S  P +K +K+E+ ++ R  SS          +SC +EFDGASKGNPG+AGAGAVLRA D SV+  LREG+G ATNN
Subjt:  AIKKRSRESIVPENIGSTVLTPTSKGPMRKHVKLEDSIVSRALSS--------NRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNN

Query:  VAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE
        VAEYRA+LLGL+ AL KGF  +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F+I H+ R  NSEAD QAN AI LADG+ +
Subjt:  VAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE

AT5G51080.1 RNase H family protein1.0e-7744.57Show/hide
Query:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MN FS+  +Y +  +FR+++ V    TS    +  ++TS+  ++   + ++ S      + CYSSR  + A S   K        + EKD FFVVRKGD+
Subjt:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +G+YK   DCQAQ+GSS+ D PVSV+KG+SL KDTEE L++VGLK  LY  +A D++ D+FG+L PC F D                             
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
             P++   + K  +LE S       ++ E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK+A++KG+ +I V
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE
        + DSKLVCMQ++G WKV +E +S+L  E  +L D+ LSFEISHVLR+LNS+AD QAN+A  L++GEVE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE

AT5G51080.2 RNase H family protein1.0e-7744.57Show/hide
Query:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV
        MN FS+  +Y +  +FR+++ V    TS    +  ++TS+  ++   + ++ S      + CYSSR  + A S   K        + EKD FFVVRKGD+
Subjt:  MNCFSQVSTY-TRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDV

Query:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG
        +G+YK   DCQAQ+GSS+ D PVSV+KG+SL KDTEE L++VGLK  LY  +A D++ D+FG+L PC F D                             
Subjt:  LGVYKSFSDCQAQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIG

Query:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV
             P++   + K  +LE S       ++ E+C +EFDGASKGNPG +GA AVL+  DGS+I ++R+GLGIATNN AEY  ++LGLK+A++KG+ +I V
Subjt:  STVLTPTSKGPMRKHVKLEDSIVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE
        + DSKLVCMQ++G WKV +E +S+L  E  +L D+ LSFEISHVLR+LNS+AD QAN+A  L++GEVE
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCDEVIKLKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGCTTCTCGCAAGTCTCTACCTATACTCGCGCCATTTTCAGAAGGACCACTCTTGTTTTTGCGGCTTCCACCTCCATTCATGGCTGCTCTAATCCCTACTGGAC
CTCAATCTCTCCCAACGTCGCTGTTAACGCTACTGCTTTAGATTCCTTGTGCTCCAGATTCCGTCTACGGTGCTATTCATCTCGAAAAGTCCGCAAGGCCGCTTCTCCAT
CGCCCAAGTTTGATTCTGAACCTCCTCCCATGGAACCGGAGAAGGACGACTTCTTTGTCGTTCGCAAGGGGGATGTTCTTGGAGTCTATAAAAGTTTTAGTGATTGTCAG
GCGCAAATTGGATCTTCGATATGTGATCTTCCCGTTAGCGTGTTTAAAGGACACTCATTGCCAAAAGACACTGAGGAATATCTTGCCTCTGTTGGGCTTAAGAATGCTCT
GTACACTATTAAAGCTGCTGATATGAGACCTGATCTTTTCGGTTCACTCATGCCTTGCACTTTTCATGATGGAGCTACTTCTCTTAAAGGTGAAGCTTCTAGCCAGGATG
CCATAAAGAAGAGATCAAGAGAGTCTATTGTACCAGAAAATATTGGGTCAACTGTTTTAACTCCTACGTCAAAAGGTCCCATGAGGAAACATGTCAAGTTGGAAGATTCC
ATTGTGTCTAGAGCACTATCCTCTAACCGTGAATCTTGCTTTCTAGAATTTGATGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGAGCAGGAGCTGTTCTACGAGCTCA
TGATGGGAGTGTGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATAATGTTGCTGAATATCGAGCTATTCTCTTAGGGTTGAAGTATGCACTTCAGAAAGGGT
TCAATAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAAGGATTATGGAAGGTAAAGAATGAGAACATCTCTGAGCTATGTGATGAAGTTATCAAG
CTGAAGGATAGATTTCTTTCGTTCGAGATCAGTCATGTACTAAGGAATCTAAATTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGACGGCGAAGTTGA
GGAGTTTGAAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATTGATTTAAATTGGGAATTCTGGAATGCTTAATAGTTCGTCAGAAATTGCCCTCTGATGAACTGCTTCTCGCAAGTCTCTACCTATACTCGCGCCATTTTCAGAAGGAC
CACTCTTGTTTTTGCGGCTTCCACCTCCATTCATGGCTGCTCTAATCCCTACTGGACCTCAATCTCTCCCAACGTCGCTGTTAACGCTACTGCTTTAGATTCCTTGTGCT
CCAGATTCCGTCTACGGTGCTATTCATCTCGAAAAGTCCGCAAGGCCGCTTCTCCATCGCCCAAGTTTGATTCTGAACCTCCTCCCATGGAACCGGAGAAGGACGACTTC
TTTGTCGTTCGCAAGGGGGATGTTCTTGGAGTCTATAAAAGTTTTAGTGATTGTCAGGCGCAAATTGGATCTTCGATATGTGATCTTCCCGTTAGCGTGTTTAAAGGACA
CTCATTGCCAAAAGACACTGAGGAATATCTTGCCTCTGTTGGGCTTAAGAATGCTCTGTACACTATTAAAGCTGCTGATATGAGACCTGATCTTTTCGGTTCACTCATGC
CTTGCACTTTTCATGATGGAGCTACTTCTCTTAAAGGTGAAGCTTCTAGCCAGGATGCCATAAAGAAGAGATCAAGAGAGTCTATTGTACCAGAAAATATTGGGTCAACT
GTTTTAACTCCTACGTCAAAAGGTCCCATGAGGAAACATGTCAAGTTGGAAGATTCCATTGTGTCTAGAGCACTATCCTCTAACCGTGAATCTTGCTTTCTAGAATTTGA
TGGTGCCTCAAAAGGAAATCCTGGACAAGCTGGAGCAGGAGCTGTTCTACGAGCTCATGATGGGAGTGTGATATGTAGACTGCGTGAAGGCCTAGGTATAGCAACCAATA
ATGTTGCTGAATATCGAGCTATTCTCTTAGGGTTGAAGTATGCACTTCAGAAAGGGTTCAATAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTTCAA
GGATTATGGAAGGTAAAGAATGAGAACATCTCTGAGCTATGTGATGAAGTTATCAAGCTGAAGGATAGATTTCTTTCGTTCGAGATCAGTCATGTACTAAGGAATCTAAA
TTCTGAAGCCGATGCTCAAGCGAACTTGGCTATCACTCTAGCTGACGGCGAAGTTGAGGAGTTTGAAGAATAATAGCTATAGAATGCACAGCAGGATATATATTACACCA
TAGCAAGTTTTTCTGAGGAATGCATTTATAGGGCAATCCTTTGTATGCTACTATTCTTTACCCGGGGTTGCCAAGATCTTTGGGCGTTGTTTCGGGTTCTAGCTACGTTT
CTGCGTGGCCTTGCAGTGGACTCGACAATGCAGAAGTGCATTTTTCCTGCATATCTGACTGAGATAAGAATTGGGCTGATTGTTTTCTGTAATTCAATTTCAGTTGAAGG
ATTATCACTAAAATGTTGCATTCATTTTCTTTTCAAATAAAAAATGCAACACTCTTTGAGCAGATATGCTTGTATCTATTGTGGAACCTCAGTTCAATACCAATTTTGGG
TAGTCTCAAGAAAAGTGAGATCAACGAACTATGGGCTTTCTGGTTTAAAGCTCATTTGAGGACAAGGTTTCTAGAAATTAGTGTGTTTTTTCCTTTTCTAATGAGTTTGA
TCTTGTACAAGAATGTTATTGAGAATTTTGGACATTTGACTGAGAGAATGTATTGCTGTTTCACAATATTTTTGCTTAATATAAATTATTTAATAAACTTATGGTATTCG
TCAG
Protein sequenceShow/hide protein sequence
MNCFSQVSTYTRAIFRRTTLVFAASTSIHGCSNPYWTSISPNVAVNATALDSLCSRFRLRCYSSRKVRKAASPSPKFDSEPPPMEPEKDDFFVVRKGDVLGVYKSFSDCQ
AQIGSSICDLPVSVFKGHSLPKDTEEYLASVGLKNALYTIKAADMRPDLFGSLMPCTFHDGATSLKGEASSQDAIKKRSRESIVPENIGSTVLTPTSKGPMRKHVKLEDS
IVSRALSSNRESCFLEFDGASKGNPGQAGAGAVLRAHDGSVICRLREGLGIATNNVAEYRAILLGLKYALQKGFNRIHVQGDSKLVCMQVQGLWKVKNENISELCDEVIK
LKDRFLSFEISHVLRNLNSEADAQANLAITLADGEVEEFEE