; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G015330 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G015330
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description3',5'-nucleoside bisphosphate phosphatase
Genome locationCG_Chr09:29305530..29309599
RNA-Seq ExpressionClCG09G015330
SyntenyClCG09G015330
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004534 - 5'-3' exoribonuclease activity (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR003141 - Polymerase/histidinol phosphatase, N-terminal
IPR004013 - PHP domain
IPR016195 - Polymerase/histidinol phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031491.1 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo var. makuwa]5.1e-22483.47Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSA
        ERAHGNG                                               VKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS 
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSA

Query:  SEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNF
        TGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GLEVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+F
Subjt:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNF

Query:  LKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        LKAARP+WC AIRDIL+ YVEEPSE+NLA ITRFGRTRVLKGGSSPSSGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  LKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

KAG6588776.1 hypothetical protein SDJN03_17341, partial [Cucurbita argyrosperma subsp. sororia]2.0e-22090.11Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLAS+AAASVVDDFGVQK+LGKGGEKVVF+LHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEAIEAA RFGIKIIPGVEISTIFS SG+S SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP
        EVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAMH+FLK ARPIWCSAIRDIL SYVEEPS++NLA ITRFGRTRVLKGGS  P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP

Query:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        S  ND ID CLT WLTNEEKQ+AEFEAIRLKLSHIS+  QEVQVP
Subjt:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

XP_004136869.1 uncharacterized protein LOC101218042 [Cucumis sativus]3.9e-23292.33Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEW YLD SNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+FLKAARP+WCSAIRDIL+SYVEEPSE+NLA ITRFGRTRVLKGGSSP 
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS

Query:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        SGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

XP_008455216.1 PREDICTED: 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo]9.5e-23192.33Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+FLKAARP+WC AIRDIL+ YVEEPSE+NLA ITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS

Query:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        SGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

XP_038886806.1 3',5'-nucleoside bisphosphate phosphatase [Benincasa hispida]1.4e-23494.36Show/hide
Query:  MVGD------APNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGD      AP+SKKSKTKKKKRGG+KKKMTSEQAAAFKYVTEWVYLD SNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
Subjt:  MVGD------APNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEAIEAA RFGIKIIPGVEISTIFSN GDS SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS GSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS
        EVYRSDGKLAAY DLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+FLKAARPIWCSAIRDIL+ YVEEPSE+NLA ITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS

Query:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        S ND IDRCLT WLTNEEKQNAEFEAIRLKLSHISINQEVQVP
Subjt:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

TrEMBL top hitse value%identityAlignment
A0A0A0K205 POLIIIAc domain-containing protein1.9e-23292.33Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEW YLD SNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+FLKAARP+WCSAIRDIL+SYVEEPSE+NLA ITRFGRTRVLKGGSSP 
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS

Query:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        SGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A1S3C0E5 3',5'-nucleoside bisphosphate phosphatase4.6e-23192.33Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+FLKAARP+WC AIRDIL+ YVEEPSE+NLA ITRFGRTRVLKGGSSPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPS

Query:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        SGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  SGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A5A7SPM7 3',5'-nucleoside bisphosphate phosphatase2.5e-22483.47Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSA
        ERAHGNG                                               VKVLALTDHDTM+GIPEA+EAA RFGIKIIPGVEISTIFSN GDS 
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSA

Query:  SEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNF
        TGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GLEVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMH+F
Subjt:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNF

Query:  LKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        LKAARP+WC AIRDIL+ YVEEPSE+NLA ITRFGRTRVLKGGSSPSSGND I+RCLTLWLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  LKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A6J1EMA5 uncharacterized protein LOC1114346464.1e-21989.44Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGGTKKKMTSEQ AAFKYVTEWVYLD SNSLAS+AAASVVDDFGVQK+LGKGGEKVVF+LHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEAIEAA RFGIKIIPGVEISTIFS SG+S SEEPVHILAYYSSCGPA +EKLE FLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP
        EVYRSDGKLA YSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLP LAMH+FLK ARPIWCSAIRDIL+SYVEEPS++NLA ITRFGRTRVLKGGS  P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP

Query:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        S  ND ID CLT WLTNEEKQ+AEFEAIRLKLSHIS+  QEVQVP
Subjt:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

A0A6J1JMJ7 uncharacterized protein LOC1114860223.4e-21889.21Show/hide
Query:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGD       PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLD SNSLAS+AAASVVDDFGVQK+LGKGGEKVVFELHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTM+GIPEAIEAA RFGIKIIPGVEISTIFS SG+S SEEPVHILAYYSSCGPA +EKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP
        EVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAMH+FLK AR IWCSAIRDIL+SYVEEPS +NLA ITRFGRTRVLKGGS  P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGS-SP

Query:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        S  ND ID CL  WLTNEEKQ+AEFEAIRLKLSHIS+  QEV+VP
Subjt:  SSGNDFIDRCLTLWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

SwissProt top hitse value%identityAlignment
C8WJZ5 Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase4.9e-2029.62Show/hide
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKF
        ++ +LH HS  SDG  T  +++E+A   GV+ LA T+HDT AG+  A E   R G++++ G+E+S       D      VHIL      G   L  L   
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKF

Query:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGVAVLA
          +  E R   +   + +L E    +  +   ++        + H+  A+    Y     +   R LF +GG          A +A++++ + GG+AVLA
Subjt:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGVAVLA

Query:  HPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAY---SDLADNYGLLKLGGSDFHGRGG
        HP  L +   ++  L + GL G+E +  D  LA +   ++LA  Y L+  GGSD+HG+ G
Subjt:  HPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAY---SDLADNYGLLKLGGSDFHGRGG

O54453 5'-3' exoribonuclease8.3e-2834.6Show/hide
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFG--IKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKL
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT A IP A E   R G  + +IPGVEIST++ N         +HI+        PA    +
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFG--IKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV
          FL    E R  R + +  +L +  +P  W+   ++   G A  R H AR +VE G    +   F +YL  G   Y         +AI +IH +GG AV
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV

Query:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH
        LAHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH

P44176 5'-3' exoribonuclease1.7e-2835.66Show/hide
Query:  FELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKLEKFL
        ++LH HS  SDG L+P++LV RA+  GV VLAL DHDT+AGI EA  AA   GI++I GVEIST +   G       +HI+   +    P    K+   L
Subjt:  FELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKLEKFL

Query:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHP
        ++ +  R  RA  +  KL +  +P  +D    +    V   R H AR +V+ G V N  QAF RYL  G  A+          AI+ IH  GG+A++AHP
Subjt:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHP

Query:  WALKNPVAIIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADNYGLLKLGGSDFH
                 +R+L    K  G  G+E+    ++  +    +  A  + L    GSDFH
Subjt:  WALKNPVAIIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADNYGLLKLGGSDFH

P77766 5'-3' exoribonuclease1.7e-2534.22Show/hide
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFG--IKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKL
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT A I  A E   R G  + +IPGVEIST++ N         +HI+        P   E  
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFG--IKIIPGVEISTIFSNSGDSASEEPVHILAY-YSSCGPANLEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV
          FL    E R  RA+ +  +L + ++P   +   ++  +G A  R H AR +VE G   ++   F +YL  G   Y         +AI +IH +GG AV
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV

Query:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH
        LAHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH

Q7NXD4 3',5'-nucleoside bisphosphate phosphatase8.0e-3133.8Show/hide
Query:  ELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLEN
        +LH HS+ SDG LTP+++++RA      +LALTDHD   G+ EA  AA R GI  + GVE+S        S     VHI+       PA    L   L++
Subjt:  ELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLEN

Query:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWA
        IREGR  RA+ M + L    +   +D   +         R H AR +V++G V++++  F +YL  G P Y +       +A+  I   GG+AV+AHP  
Subjt:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWA

Query:  LKNPVAIIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIW
              +I RL    + AG QG+EV      L     ++  AD +GL    GSDFH  G        + +LP +         RPIW
Subjt:  LKNPVAIIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIW

Arabidopsis top hitse value%identityAlignment
AT2G13840.1 Polymerase/histidinol phosphatase-like4.4e-15763.32Show/hide
Query:  APNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLG--KGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGV
        A + KK   KKK+  G K+KMT+EQ+ AFK +T+W+ L  S SL+SS+     DDF V  + G  + GEKVVFELHSHS  SDGFL+PSK+VERA+ NGV
Subjt:  APNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLG--KGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGV

Query:  KVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDH
        KVL+LTDHDTMAG+PEA+EA  RFGIKIIPG+EIST+F    DS SEEPVHILAYY + GPA  ++LE FL  IR+GRF+R + MV KLN+LK+PLKW+H
Subjt:  KVLALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDH

Query:  VAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGK
        V +I GK VAPGR+HVARA++EAGYVENL+QAF++YL DGGPAY+TG+EP A EA++LI  TGGVAVLAHPWALKN V IIRRLKDAGL G+EVYRSDGK
Subjt:  VAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGK

Query:  LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDR
        L  +S+LAD Y LLKLGGSD+HG+GG +ESE+GSVNLPV A+ +FL   RPIWC AI+  +++++++PS++NL+ I RF + R+LKG S+ S G + +DR
Subjt:  LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDR

Query:  CLTLWLTNEEKQNAEFEAIRLKLSHISI
        CL +WLT++E+ + +FEA+RLKLS + I
Subjt:  CLTLWLTNEEKQNAEFEAIRLKLSHISI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGTGATGCCCCAAATTCCAAGAAATCTAAGACCAAGAAGAAGAAACGGGGCGGCACCAAGAAGAAGATGACTTCCGAACAGGCTGCCGCTTTTAAGTAT
GTCACCGAATGGGTTTATTTGGATCATTCTAATTCTCTTGCCTCCTCTGCTGCTGCTTCTGTTGTGGATGATTTTGGAGTTCAGAAGAGTCTTGGCAAAGGTGGG
GAGAAGGTGGTCTTTGAGTTGCATTCCCATTCCAAATTTAGTGATGGGTTTCTCACCCCTTCCAAGCTCGTTGAGAGAGCTCATGGAAATGGGGTGAAAGTTCTT
GCTTTGACAGATCATGACACAATGGCTGGCATACCTGAGGCTATAGAGGCAGCTTGTAGATTTGGTATCAAAATAATTCCAGGTGTTGAAATCAGTACGATATTC
TCTAACAGTGGAGACTCAGCATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGCTGTGGACCAGCAAATCTTGAGAAGCTGGAAAAGTTTTTAGAAAAT
ATAAGGGAGGGGCGTTTTTTGCGTGCGAAGAACATGGTGTCAAAACTGAATGAGCTAAAGCTGCCCCTTAAATGGGATCATGTGGCTAAGATTACTGGTAAAGGA
GTTGCTCCTGGGAGACTCCATGTGGCCCGTGCCATGGTTGAAGCAGGCTATGTGGAAAATTTAAAACAAGCATTTTCTCGATACCTTTTTGATGGTGGACCGGCT
TACTCAACGGGATCAGAGCCTTGTGCAGCGGAAGCAATACAATTGATACACGATACAGGTGGTGTGGCCGTACTAGCTCATCCATGGGCTTTGAAGAATCCTGTT
GCTATCATTAGAAGATTGAAAGATGCTGGTCTTCAGGGGCTGGAGGTTTACAGGAGTGATGGAAAATTGGCAGCATACAGTGACCTAGCAGACAATTATGGGCTT
CTGAAACTTGGAGGATCAGATTTTCATGGCAGAGGTGGACATAGTGAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCACAATTTCCTCAAGGCT
GCTCGACCAATTTGGTGCAGTGCCATTCGAGATATCCTCAAGAGTTATGTTGAAGAGCCTTCGGAAACAAATCTAGCAACGATTACTAGATTTGGAAGGACCCGG
GTTTTGAAGGGTGGCTCCTCACCAAGCAGCGGAAATGACTTCATTGATCGTTGTTTAACTTTGTGGCTGACAAATGAAGAGAAGCAAAATGCTGAGTTCGAGGCT
ATCAGATTAAAGCTTTCCCATATTTCAATTAATCAAGAAGTTCAAGTGCCTTAA
mRNA sequenceShow/hide mRNA sequence
GAGAGAGAGAGAGAGAGAGGGAAATCCAGAAAAGTGGGGGAAAAAGAGAGGACACCTGCGTTTTGGAAGATATCCTTTCCTCAAAAAGCAATCTGTCTCTCTCTT
TTCCTCCTTCTGTTTGTTATCACTTTCCCTCAATTCTCATTCGCTTTCTTCTACTTCCCCCCTTCCTCATTCCATTGACCTAAATCCACCCCTTCTCTCAAATCT
TCACCTTCCTTATCCTTACCATGGTGGGTGATGCCCCAAATTCCAAGAAATCTAAGACCAAGAAGAAGAAACGGGGCGGCACCAAGAAGAAGATGACTTCCGAAC
AGGCTGCCGCTTTTAAGTATGTCACCGAATGGGTTTATTTGGATCATTCTAATTCTCTTGCCTCCTCTGCTGCTGCTTCTGTTGTGGATGATTTTGGAGTTCAGA
AGAGTCTTGGCAAAGGTGGGGAGAAGGTGGTCTTTGAGTTGCATTCCCATTCCAAATTTAGTGATGGGTTTCTCACCCCTTCCAAGCTCGTTGAGAGAGCTCATG
GAAATGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACAATGGCTGGCATACCTGAGGCTATAGAGGCAGCTTGTAGATTTGGTATCAAAATAATTCCAGGTG
TTGAAATCAGTACGATATTCTCTAACAGTGGAGACTCAGCATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGCTGTGGACCAGCAAATCTTGAGAAGC
TGGAAAAGTTTTTAGAAAATATAAGGGAGGGGCGTTTTTTGCGTGCGAAGAACATGGTGTCAAAACTGAATGAGCTAAAGCTGCCCCTTAAATGGGATCATGTGG
CTAAGATTACTGGTAAAGGAGTTGCTCCTGGGAGACTCCATGTGGCCCGTGCCATGGTTGAAGCAGGCTATGTGGAAAATTTAAAACAAGCATTTTCTCGATACC
TTTTTGATGGTGGACCGGCTTACTCAACGGGATCAGAGCCTTGTGCAGCGGAAGCAATACAATTGATACACGATACAGGTGGTGTGGCCGTACTAGCTCATCCAT
GGGCTTTGAAGAATCCTGTTGCTATCATTAGAAGATTGAAAGATGCTGGTCTTCAGGGGCTGGAGGTTTACAGGAGTGATGGAAAATTGGCAGCATACAGTGACC
TAGCAGACAATTATGGGCTTCTGAAACTTGGAGGATCAGATTTTCATGGCAGAGGTGGACATAGTGAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTA
TGCACAATTTCCTCAAGGCTGCTCGACCAATTTGGTGCAGTGCCATTCGAGATATCCTCAAGAGTTATGTTGAAGAGCCTTCGGAAACAAATCTAGCAACGATTA
CTAGATTTGGAAGGACCCGGGTTTTGAAGGGTGGCTCCTCACCAAGCAGCGGAAATGACTTCATTGATCGTTGTTTAACTTTGTGGCTGACAAATGAAGAGAAGC
AAAATGCTGAGTTCGAGGCTATCAGATTAAAGCTTTCCCATATTTCAATTAATCAAGAAGTTCAAGTGCCTTAAGACTAAATAACTCCAACAAAGGTCGTTGATA
CTTGCTCGTTGAGCTAACATAGTCAACATACCCATCAGATTTAGTCAGGTTTTTGTACTTTCCTTTGTTACTCCTTTTCTTTTATGTTAGGGAAAAGTGGGTTTC
AGTAAATGATTCGTTGAGGTACAACAAAAAGCTCTCTGTCTTAAAGTTATACATTTTGGTCAAATCAATAGTACTTCATTCAATCTTTTTCAAGTTTTTGACCAT
TGTTTTATAGGATCTAACAATCTTTATCCATTTAATTGGATCACATGCATGACTGTTGAAATGGA
Protein sequenceShow/hide protein sequence
MVGDAPNSKKSKTKKKKRGGTKKKMTSEQAAAFKYVTEWVYLDHSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGVKVL
ALTDHDTMAGIPEAIEAACRFGIKIIPGVEISTIFSNSGDSASEEPVHILAYYSSCGPANLEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKG
VAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGL
LKLGGSDFHGRGGHSESEVGSVNLPVLAMHNFLKAARPIWCSAIRDILKSYVEEPSETNLATITRFGRTRVLKGGSSPSSGNDFIDRCLTLWLTNEEKQNAEFEA
IRLKLSHISINQEVQVP