CuGenDBv2

CSPI07G05470 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G05470
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: 3',5'-nucleoside bisphosphate phosphatase
Genome location: Chr7:4079356..4083454
RNA-Seq Expression: CSPI07G05470
Synteny: CSPI07G05470
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004534 - 5'-3' exoribonuclease activity (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
IPR003141 - Polymerase/histidinol phosphatase, N-terminal
IPR004013 - PHP domain
IPR016195 - Polymerase/histidinol phosphatase-like
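The genome location field above uses a "chromosome:start..end" notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span can be parsed and its length computed as follows. The function names parse_location and span_length are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr7:4079356..4083454")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the genomic span covers both exons and introns, so it is expected to be longer than the CDS listed further down.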


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031491.1 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo var. makuwa] (e value: 1.9e-242; % identity: 88.98)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSS NSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE
        ERAHGNG                                               VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE

Query:  SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF
        TGSEPCAAEAIQLI DTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF
Subjt:  TGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF

Query:  LKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        LKAARPVWC AIRDILE YVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  LKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

KAG6588776.1 hypothetical protein SDJN03_17341, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.9e-227; % identity: 90.56)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHF Q+ PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEW YLDQSNSLAS+AAASVVDDFGVQKT+GKGGEKVVF+LHSHSK SDGFL+PSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGG+AVLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP
        EVYRSDG+LAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAMHDFLK ARP+WCSAIRDIL+SYVEEPS+SNLAKITRFGRTRVLKGGS  P
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP

Query:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP
        S  ND+I+ CLT WLTNEEKQ+ EFEAIRLKLSHIS+  QEVQVP
Subjt:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP

XP_004136869.1 uncharacterized protein LOC101218042 [Cucumis sativus] (e value: 5.3e-253; % identity: 99.77)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS
        EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSP 
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS

Query:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

XP_008455216.1 PREDICTED: 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo] (e value: 3.5e-249; % identity: 98.42)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSS NSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS
        EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWC AIRDILE YVEEPSESNLAKITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS

Query:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

XP_038886806.1 3',5'-nucleoside bisphosphate phosphatase [Benincasa hispida] (e value: 1.0e-240; % identity: 94.58)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQS+P+SKKSK KKKKRGG+KKKMTSEQ AAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYS GSEPCAAEAIQLIHDTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS
        EVYRSDG+LAAY DLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARP+WCSAIRDILE YVEEPSESNLAKITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS

Query:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        S ND+I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K205 POLIIIAc domain-containing protein (e value: 2.6e-253; % identity: 99.77)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS
        EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSP 
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS

Query:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

A0A1S3C0E5 3',5'-nucleoside bisphosphate phosphatase (e value: 1.7e-249; % identity: 98.42)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSS NSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS
        EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWC AIRDILE YVEEPSESNLAKITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPS

Query:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  SGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

A0A5A7SPM7 3',5'-nucleoside bisphosphate phosphatase (e value: 9.1e-243; % identity: 88.98)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHFPQSS NSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSKCSDGFLTPSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE
        ERAHGNG                                               VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSE

Query:  SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF
        TGSEPCAAEAIQLI DTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF
Subjt:  TGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDF

Query:  LKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
        LKAARPVWC AIRDILE YVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP
Subjt:  LKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEVQVP

A0A6J1EMA5 uncharacterized protein LOC111434646 (e value: 4.5e-226; % identity: 90.11)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGDAHF Q+ PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEW YLDQSNSLAS+AAASVVDDFGVQKT+GKGGEKVVF+LHSHSK SDGFL+PSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKLE FLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGG+AVLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP
        EVYRSDG+LA YSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLP LAMHDFLK ARP+WCSAIRDILESYVEEPS+SNLAKITRFGRTRVLKGGS  P
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP

Query:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP
        S  ND+I+ CLT WLTNEEKQ+ EFEAIRLKLSHIS+  QEVQVP
Subjt:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP

A0A6J1JMJ7 uncharacterized protein LOC111486022 (e value: 3.8e-225; % identity: 89.89)
Query:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV
        MVGD HF Q+ PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEW YLDQSNSLAS+AAASVVDDFGVQKT+GKGGEKVVFELHSHSK SDGFL+PSKLV
Subjt:  MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGG+AVLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGL

Query:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP
        EVYRSDG+LAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAMHDFLK AR +WCSAIRDILESYVEEPS SNLAKITRFGRTRVLKGGS  P
Subjt:  EVYRSDGRLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGS-SP

Query:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP
        S  ND+I+ CL  WLTNEEKQ+ EFEAIRLKLSHIS+  QEV+VP
Subjt:  SSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISI-NQEVQVP

SwissProt top hits (e value, % identity, alignment)
C8WJZ5 Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase (e value: 2.6e-21; % identity: 29.62)
Query:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKF
        ++ +LH HS  SDG  T  +++E+A   GV+ LA T+HDT +G+  A E   R G++++ G+E+S       D E    VHIL      G   +  L   
Subjt:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKF

Query:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGMAVLA
          +  E R   +   + +L E    +  +   ++        + H+  AL    Y     +   R LF +GG          A +A++++ + GG+AVLA
Subjt:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGMAVLA

Query:  HPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAY---SDLADNYGLLKLGGSDFHGRGG
        HP  L +   ++  L + GL G+E +  D  LA +   ++LA  Y L+  GGSD+HG+ G
Subjt:  HPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAY---SDLADNYGLLKLGGSDFHGRGG

O54453 5'-3' exoribonuclease (e value: 3.8e-28; % identity: 34.73)
Query:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKL
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT + IP A E   R G  + +IPGVEIST++ N         +HI+        PA    +
Subjt:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAV
          FL    E R  R + +  +L +  +P  W+   ++   G A  R H AR LVE G    +   F +YL  G   Y         +AI +IH +GG AV
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAV

Query:  LAHPWALKNPVAVIRRL--KDAGLHG-----LEVYRSDGRLAAYSDLADNYGLLKLGGSDFH
        LAHP         ++RL    A  HG      +  +S       + LA  + L    GSDFH
Subjt:  LAHPWALKNPVAVIRRL--KDAGLHG-----LEVYRSDGRLAAYSDLADNYGLLKLGGSDFH

P44176 5'-3' exoribonuclease (e value: 2.6e-29; % identity: 35.66)
Query:  FELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKLEKFL
        ++LH HS  SDG L+P++LV RA+  GV VLAL DHDT++GI EA  AA+  GI++I GVEIST +   G       +HI+   +    P    K+   L
Subjt:  FELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKLEKFL

Query:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHP
        ++ +  R  RA  +  KL +  +P  +D    +    V   R H AR LV+ G V N  QAF RYL  G  A+          AI+ IH  GG+A++AHP
Subjt:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHP

Query:  WALKNPVAVIRRL----KDAGLHGLEVY---RSDGRLAAYSDLADNYGLLKLGGSDFH
                 +R+L    K  G  G+E+    ++  +    +  A  + L    GSDFH
Subjt:  WALKNPVAVIRRL----KDAGLHGLEVY---RSDGRLAAYSDLADNYGLLKLGGSDFH

P77766 5'-3' exoribonuclease (e value: 6.1e-26; % identity: 34.22)
Query:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKL
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT + I  A E   R G  + +IPGVEIST++ N         +HI+        P   E  
Subjt:  VVFELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPAKIEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAV
          FL    E R  RA+ +  +L + ++P   +   ++  +G A  R H AR LVE G   ++   F +YL  G   Y         +AI +IH +GG AV
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAV

Query:  LAHP--------WALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFH
        LAHP        W LK  VA         +   +  +S       + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFH

Q7NXD4 3',5'-nucleoside bisphosphate phosphatase (e value: 4.8e-31; % identity: 34.6)
Query:  ELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLEN
        +LH HS+ SDG LTP+++++RA      +LALTDHD   G+ EA  AA R GI  + GVE+S        S     VHI+       PA+   L   L++
Subjt:  ELHSHSKCSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLEN

Query:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWA
        IREGR  RA+ M + L    +   +D   +         R H AR LV++G V++++  F +YL  G P Y +       +A+  I   GGMAV+AHP  
Subjt:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWA

Query:  LKNPVAVIRRL----KDAGLHGLEVYRSDGRL---AAYSDLADNYGLLKLGGSDFH--GRGGHSESEVGSVNLPVLAMHDFLKAARPVW
              +I RL    + AG  G+EV      L     ++  AD +GL    GSDFH  G GG    +VG          D     RP+W
Subjt:  LKNPVAVIRRL----KDAGLHGLEVYRSDGRL---AAYSDLADNYGLLKLGGSDFH--GRGGHSESEVGSVNLPVLAMHDFLKAARPVW

Arabidopsis top hits (e value, % identity, alignment)
AT2G13840.1 Polymerase/histidinol phosphatase-like (e value: 1.1e-158; % identity: 63.93)
Query:  NSKKSKPKKKKRG-GTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVG--KGGEKVVFELHSHSKCSDGFLTPSKLVERAHGNGVK
        + KK + KKKKR  G K+KMT+EQ  AFK +T+W  L  S SL+SS+     DDF V    G  + GEKVVFELHSHS  SDGFL+PSK+VERA+ NGVK
Subjt:  NSKKSKPKKKKRG-GTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVG--KGGEKVVFELHSHSKCSDGFLTPSKLVERAHGNGVK

Query:  VLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHV
        VL+LTDHDTM+G+PEAVEA RRFGIKIIPG+EIST+F    DS SEEPVHILAYY + GPA  ++LE FL  IR+GRF+R + MV KLN+LK+PLKW+HV
Subjt:  VLALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHV

Query:  AKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRL
         +I GK VAPGR+HVARAL+EAGYVENL+QAF++YL DGGPAY+TG+EP A EA++LI  TGG+AVLAHPWALKN V +IRRLKDAGLHG+EVYRSDG+L
Subjt:  AKITGKGVAPGRLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRL

Query:  AAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERC
          +S+LAD Y LLKLGGSD+HG+GG +ESE+GSVNLPV A+ DFL   RP+WC AI+  + +++++PS+SNL+ I RF + R+LKG S+ S G +L++RC
Subjt:  AAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERC

Query:  LTLWLTNEEKQNDEFEAIRLKLSHISI
        L +WLT++E+ +++FEA+RLKLS + I
Subjt:  LTLWLTNEEKQNDEFEAIRLKLSHISI
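In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how identity and positive counts could be tallied from one such row. It is only an illustration under that assumption; the function name identity_stats and the short example strings are hypothetical, and the % identity values reported on this page were not necessarily computed this way.

```python
def identity_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one alignment row.

    The alignment length is taken from the query row, because trailing
    spaces of the midline are often stripped when a page is copied.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment row")
    # Pad or trim the midline so it covers exactly the query row.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


# Hypothetical five-column example, not taken from the record above.
identities, positives, length = identity_stats("MKVAL", "MK+ L")
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```

For a multi-block alignment, the counts from each block would be summed before dividing by the total alignment length.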


Sequences
CDS sequence
ATGGTGGGTGACGCTCACTTTCCTCAGTCTTCCCCCAATTCCAAGAAATCCAAGCCCAAGAAGAAGAAACGAGGTGGCACCAAGAAGAAGATGACTTCCGAACAGATTGC
CGCTTTTAAGTATGTCACGGAATGGGCTTATTTGGATCAATCTAATTCTCTTGCCTCCTCTGCTGCTGCCTCTGTTGTGGATGATTTTGGAGTTCAGAAGACTGTTGGCA
AAGGTGGGGAGAAGGTGGTCTTTGAGTTGCATTCCCATTCCAAATGCAGTGATGGGTTTCTTACCCCTTCTAAGCTTGTTGAGAGAGCTCACGGAAATGGGGTGAAAGTT
CTTGCTTTGACAGATCATGACACAATGTCTGGCATCCCTGAAGCTGTCGAGGCAGCTCGGAGATTTGGTATCAAAATAATTCCAGGTGTTGAAATCAGTACTATATTCTC
TAACGGTGGAGACTCAGAATCCGAAGAACCAGTACACATCCTTGCATACTACAGCAGCTGTGGACCAGCAAAGATTGAGAAGCTGGAAAAATTCTTAGAAAATATAAGGG
AGGGGCGTTTTTTGCGAGCAAAGAACATGGTGTCGAAACTGAATGAGCTAAAGCTGCCTCTTAAGTGGGATCATGTAGCTAAGATTACTGGTAAAGGAGTTGCTCCTGGG
AGACTCCATGTGGCCCGAGCCTTGGTTGAAGCAGGCTATGTGGAAAATTTAAAACAAGCGTTTTCTCGGTACCTTTTTGATGGTGGACCGGCTTACTCAACGGGATCAGA
GCCTTGTGCAGCGGAAGCAATACAATTGATACACGATACAGGTGGTATGGCCGTACTAGCTCATCCATGGGCCTTGAAGAATCCCGTTGCTGTCATTAGAAGATTGAAAG
ACGCTGGTCTTCATGGGCTTGAGGTTTACAGGAGTGATGGAAGATTGGCAGCATACAGTGACCTAGCGGACAATTATGGGCTTCTTAAACTCGGAGGATCAGATTTTCAT
GGAAGAGGTGGACATAGTGAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCATGATTTCCTCAAGGCCGCTCGGCCAGTTTGGTGTAGTGCCATTCGAGA
TATTCTTGAGAGTTACGTCGAAGAGCCTTCGGAATCAAATCTAGCAAAGATCACTAGATTTGGAAGGACCCGGGTTTTGAAGGGTGGCTCCTCACCAAGCAGCGGAAATG
ACTTAATTGAACGTTGTTTAACTTTGTGGCTGACAAATGAAGAAAAGCAAAATGATGAGTTTGAGGCCATCAGATTAAAACTCTCCCATATTTCAATTAATCAAGAAGTT
CAAGTGCCTTAA
mRNA sequence
GAAATTAATCCAAATGAAATTAATTTAGGTATGAAGAAACAAGTTGGTTTTAGTTTTGTGTGATTGATTGATTGGGAAAAGGAGTATTCAGAGAAAAAAGCAAAGAAGAG
AGAGAGGAAAATCCAGAAAAAGGGTGGAAAATAGAGGACACCTGCGCTTTGGAAGATATCCTTTCCCCAAAAACCCCTCTGTCTCACTCTTTTTTCCTCCTTTTGTTTGT
TATCACTTTCTCTCAATTTCCATTCCCCACTCTCTATTGACCTAAATCAACCCCTTCTCTCAAATCTCCACATCTCTTTCCATGGTGGGTGACGCTCACTTTCCTCAGTC
TTCCCCCAATTCCAAGAAATCCAAGCCCAAGAAGAAGAAACGAGGTGGCACCAAGAAGAAGATGACTTCCGAACAGATTGCCGCTTTTAAGTATGTCACGGAATGGGCTT
ATTTGGATCAATCTAATTCTCTTGCCTCCTCTGCTGCTGCCTCTGTTGTGGATGATTTTGGAGTTCAGAAGACTGTTGGCAAAGGTGGGGAGAAGGTGGTCTTTGAGTTG
CATTCCCATTCCAAATGCAGTGATGGGTTTCTTACCCCTTCTAAGCTTGTTGAGAGAGCTCACGGAAATGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACAATGTC
TGGCATCCCTGAAGCTGTCGAGGCAGCTCGGAGATTTGGTATCAAAATAATTCCAGGTGTTGAAATCAGTACTATATTCTCTAACGGTGGAGACTCAGAATCCGAAGAAC
CAGTACACATCCTTGCATACTACAGCAGCTGTGGACCAGCAAAGATTGAGAAGCTGGAAAAATTCTTAGAAAATATAAGGGAGGGGCGTTTTTTGCGAGCAAAGAACATG
GTGTCGAAACTGAATGAGCTAAAGCTGCCTCTTAAGTGGGATCATGTAGCTAAGATTACTGGTAAAGGAGTTGCTCCTGGGAGACTCCATGTGGCCCGAGCCTTGGTTGA
AGCAGGCTATGTGGAAAATTTAAAACAAGCGTTTTCTCGGTACCTTTTTGATGGTGGACCGGCTTACTCAACGGGATCAGAGCCTTGTGCAGCGGAAGCAATACAATTGA
TACACGATACAGGTGGTATGGCCGTACTAGCTCATCCATGGGCCTTGAAGAATCCCGTTGCTGTCATTAGAAGATTGAAAGACGCTGGTCTTCATGGGCTTGAGGTTTAC
AGGAGTGATGGAAGATTGGCAGCATACAGTGACCTAGCGGACAATTATGGGCTTCTTAAACTCGGAGGATCAGATTTTCATGGAAGAGGTGGACATAGTGAATCTGAAGT
TGGAAGTGTAAACCTTCCTGTTCTTGCTATGCATGATTTCCTCAAGGCCGCTCGGCCAGTTTGGTGTAGTGCCATTCGAGATATTCTTGAGAGTTACGTCGAAGAGCCTT
CGGAATCAAATCTAGCAAAGATCACTAGATTTGGAAGGACCCGGGTTTTGAAGGGTGGCTCCTCACCAAGCAGCGGAAATGACTTAATTGAACGTTGTTTAACTTTGTGG
CTGACAAATGAAGAAAAGCAAAATGATGAGTTTGAGGCCATCAGATTAAAACTCTCCCATATTTCAATTAATCAAGAAGTTCAAGTGCCTTAAGACTGAATTACTCCAAC
CAAGTTTGTCAATACTTGCTCGTCGTGCTACTAACATAGTCGACATACCGATCAGATTTATTCAGGTTTTTGTACTTCTCATTGTTACTCTTTTTCTTTACTGTTAGGGA
AGGGTGGGTTTCAGTAAATGATTCGCTGAGGTACAACAAAAGGCTTTCATTGTCTTAAAAGTTATACTTTTTGGTCAAATCAATAATACTTCATTCAATCTTTTTCAGGT
TTTTCTCTTTGTTTTATAGGTTCTAGCAATATTTATTGATCACATGCGTAATGACTTGTACTGGATCATTTCTTTTTT
Protein sequence
MVGDAHFPQSSPNSKKSKPKKKKRGGTKKKMTSEQIAAFKYVTEWAYLDQSNSLASSAAASVVDDFGVQKTVGKGGEKVVFELHSHSKCSDGFLTPSKLVERAHGNGVKV
LALTDHDTMSGIPEAVEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPG
RLHVARALVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGMAVLAHPWALKNPVAVIRRLKDAGLHGLEVYRSDGRLAAYSDLADNYGLLKLGGSDFH
GRGGHSESEVGSVNLPVLAMHDFLKAARPVWCSAIRDILESYVEEPSESNLAKITRFGRTRVLKGGSSPSSGNDLIERCLTLWLTNEEKQNDEFEAIRLKLSHISINQEV
QVP
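The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard nuclear genetic code applies (the record does not say which table was used), a CDS can be translated and compared with the listed protein as shown below. The names CODON_TABLE and translate are hypothetical and introduced only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGTGGGTGACGCTCACTTTCCTCAGTCT"))
print(translate("CAAGTGCCTTAA"))
```

Passing the full CDS (with its line breaks) to translate and comparing the result with the protein sequence is a quick consistency check for a copied record.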