CuGenDBv2

Lsi03G007710 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G007710
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: 3',5'-nucleoside bisphosphate phosphatase
Genome location: chr03:10920854..10924922
RNA-Seq Expression: Lsi03G007710
Synteny: Lsi03G007710
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004534 - 5'-3' exoribonuclease activity (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
IPR003141 - Polymerase/histidinol phosphatase, N-terminal
IPR004013 - PHP domain
IPR016195 - Polymerase/histidinol phosphatase-like
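
The genome location in the record above is given as `chromosome:start..end`. The sketch below shows one way to split such a string and compute the span of the locus. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `parse_location` is hypothetical rather than part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr03:10920854..10924922")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

The span describes the whole locus on the chromosome, so it need not equal the length of the mRNA or CDS sequences listed for this gene.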


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0031491.1 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo var. makuwa]  e value: 6.0e-225  % identity: 83.88
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSE
        ERAHGNG                                               VKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSE
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSE

Query:  SEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDF
        TGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GLEVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DF
Subjt:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDF

Query:  LNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        L  ARP+WC AIRD LE YVEEPSESNLAKITRFGRTRVLKGG SPS GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  LNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

KAG6588776.1 hypothetical protein SDJN03_17341, partial [Cucurbita argyrosperma subsp. sororia]  e value: 3.0e-224  % identity: 91.24
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLAS+AAASVVDDFGVQK+LGKGGEKVVF+LHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP
        EVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAM DFL VARPIWCSAIRD L+SYVEEPS+SNLAKITRFGRTRVLKGG   P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP

Query:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        SC ND ID CLTSWLTNEEKQ+AEFEAIRLKLSHIS+  QEVQVP
Subjt:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

XP_004136869.1 uncharacterized protein LOC101218042 [Cucumis sativus]  e value: 4.6e-233  % identity: 92.78
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DFL  ARP+WCSAIRD LESYVEEPSESNLAKITRFGRTRVLKGG SP 
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS

Query:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
         GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

XP_008455216.1 PREDICTED: 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo]  e value: 1.1e-231  % identity: 92.78
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DFL  ARP+WC AIRD LE YVEEPSESNLAKITRFGRTRVLKGG SPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS

Query:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
         GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

XP_038886806.1 3',5'-nucleoside bisphosphate phosphatase [Benincasa hispida]  e value: 1.1e-237  % identity: 95.71
Query:  MVGD------APNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGD      AP+SKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
Subjt:  MVGD------APNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS GSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS
        EVYRSDGKLAAY DLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DFL  ARPIWCSAIRD LE YVEEPSESNLAKITRFGRTRVLKGG SPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS

Query:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
          ND IDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
Subjt:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K205 POLIIIAc domain-containing protein  e value: 2.2e-233  % identity: 92.78
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEW YLDQSNSLASSAAASVVDDFGVQK++GKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DFL  ARP+WCSAIRD LESYVEEPSESNLAKITRFGRTRVLKGG SP 
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS

Query:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
         GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A1S3C0E5 3',5'-nucleoside bisphosphate phosphatase  e value: 5.5e-232  % identity: 92.78
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS
        EVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DFL  ARP+WC AIRD LE YVEEPSESNLAKITRFGRTRVLKGG SPS
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPS

Query:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
         GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  CGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A5A7SPM7 3',5'-nucleoside bisphosphate phosphatase  e value: 2.9e-225  % identity: 83.88
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA       NSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSK SDGFLTPSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSE
        ERAHGNG                                               VKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFSNGGDSE
Subjt:  ERAHGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSE

Query:  SEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS
        SEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARA+VEAGYVENLKQAFSRYLFDGGPAYS
Subjt:  SEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYS

Query:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDF
        TGSEPCAAEAIQLI DTGG+AVLAHPWALKNPVA+IRRLKDAGL GLEVYRSDG+LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAM DF
Subjt:  TGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDF

Query:  LNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
        L  ARP+WC AIRD LE YVEEPSESNLAKITRFGRTRVLKGG SPS GND I+RCLT WLTNEEKQN EFEAIRLKLSHISINQEVQVP
Subjt:  LNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP

A0A6J1EMA5 uncharacterized protein LOC111434646  e value: 4.6e-223  % identity: 90.79
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGDA      PNSKKSK KKKKRGG+KKKMTSEQ AAFKYVTEWVYLDQSNSLAS+AAASVVDDFGVQK+LGKGGEKVVF+LHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGP KIEKLE FLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP
        EVYRSDGKLA YSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLP LAM DFL VARPIWCSAIRD LESYVEEPS+SNLAKITRFGRTRVLKGG   P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP

Query:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        SC ND ID CLTSWLTNEEKQ+AEFEAIRLKLSHIS+  QEVQVP
Subjt:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

A0A6J1JMJ7 uncharacterized protein LOC111486022  e value: 6.0e-223  % identity: 91.01
Query:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV
        MVGD       PNSKKSK KKKKRGGSKKKMTSEQ AAFKYVTEWVYLDQSNSLAS+AAASVVDDFGVQK+LGKGGEKVVFELHSHSKFSDGFL+PSKLV
Subjt:  MVGDA------PNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLV

Query:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL
        ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGP KIEKLEKFLENIREGRFLRAKNMVSKLNEL
Subjt:  ERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL
        KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCA +AIQLIH+TGGVAVLAHPWALKNPVAIIRRLKDAGL GL
Subjt:  KLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGL

Query:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP
        EVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESEVGSVNLPVLAM DFL VAR IWCSAIRD LESYVEEPS SNLAKITRFGRTRVLKGG   P
Subjt:  EVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGF-SP

Query:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP
        SC ND ID CL SWLTNEEKQ+AEFEAIRLKLSHIS+  QEV+VP
Subjt:  SCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISI-NQEVQVP

SwissProt top hits (columns: e value, % identity, alignment)
C8WJZ5 Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase  e value: 4.4e-21  % identity: 29.23
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKF
        ++ +LH HS  SDG  T  +++E+A   GV+ LA T+HDT +G+  A E   R G++++ G+E+S       D E    VHIL      G   +  L   
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKF

Query:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGVAVLA
          +  E R   +   + +L E    +  +   ++        + H+  A+    Y     +   R LF +GG          A +A++++ + GG+AVLA
Subjt:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLF-DGGPAYSTGSEPCAAEAIQLIHDTGGVAVLA

Query:  HPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAY---SDLADNYGLLKLGGSDFHGRGG
        HP  L +   ++  L + GL G+E +  D  LA +   ++LA  Y L+  GGSD+HG+ G
Subjt:  HPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAY---SDLADNYGLLKLGGSDFHGRGG

O54453 5'-3' exoribonuclease  e value: 1.9e-27  % identity: 33.59
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLE
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT + IP A E   R G  + +IPGVEIST++ N         +HI+             + 
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLE

Query:  KFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVL
         FL    E R  R + +  +L +  +P  W+   ++   G A  R H AR +VE G    +   F +YL  G   Y         +AI +IH +GG AVL
Subjt:  KFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVL

Query:  AHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH
        AHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  AHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH

P44176 5'-3' exoribonuclease  e value: 5.8e-29  % identity: 35.27
Query:  FELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPTKIEKLEKFL
        ++LH HS  SDG L+P++LV RA+  GV VLAL DHDT++GI EA  AA+  GI++I GVEIST +   G       +HI+   +    P    K+   L
Subjt:  FELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPTKIEKLEKFL

Query:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHP
        ++ +  R  RA  +  KL +  +P  +D    +    V   R H AR +V+ G V N  QAF RYL  G  A+          AI+ IH  GG+A++AHP
Subjt:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHP

Query:  WALKNPVAIIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADNYGLLKLGGSDFH
                 +R+L    K  G  G+E+    ++  +    +  A  + L    GSDFH
Subjt:  WALKNPVAIIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADNYGLLKLGGSDFH

P77766 5'-3' exoribonuclease  e value: 1.3e-25  % identity: 33.84
Query:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPTKIEKL
        V+++LHSH+  SDG LTP  LV RA    V  LA+TDHDT + I  A E   R G  + +IPGVEIST++ N         +HI+        P   E  
Subjt:  VVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSNGGDSESEEPVHILAY-YSSCGPTKIEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV
          FL    E R  RA+ +  +L + ++P   +   ++  +G A  R H AR +VE G   ++   F +YL  G   Y         +AI +IH +GG AV
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAV

Query:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH
        LAHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFH

Q7NXD4 3',5'-nucleoside bisphosphate phosphatase  e value: 6.2e-31  % identity: 34.26
Query:  ELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLEN
        +LH HS+ SDG LTP+++++RA      +LALTDHD   G+ EA  AA R GI  + GVE+S        S     VHI+       P +   L   L++
Subjt:  ELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLEN

Query:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWA
        IREGR  RA+ M + L    +   +D   +         R H AR +V++G V++++  F +YL  G P Y +       +A+  I   GG+AV+AHP  
Subjt:  IREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWA

Query:  LKNPVAIIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADNYGLLKLGGSDFH--GRGGHSESEVGSVNLPVLAMRDFLNVARPIW
              +I RL    + AG QG+EV      L     ++  AD +GL    GSDFH  G GG    +VG          D   + RPIW
Subjt:  LKNPVAIIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADNYGLLKLGGSDFH--GRGGHSESEVGSVNLPVLAMRDFLNVARPIW

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G13840.1 Polymerase/histidinol phosphatase-like  e value: 1.2e-159  % identity: 64.02
Query:  APNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLG--KGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGV
        A + KK   KKK+  G+K+KMT+EQ+ AFK +T+W+ L  S SL+SS+     DDF V  + G  + GEKVVFELHSHS  SDGFL+PSK+VERA+ NGV
Subjt:  APNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLG--KGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGV

Query:  KVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDH
        KVL+LTDHDTM+G+PEA+EA RRFGIKIIPG+EIST+F    DS SEEPVHILAYY + GP   ++LE FL  IR+GRF+R + MV KLN+LK+PLKW+H
Subjt:  KVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDH

Query:  VAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGK
        V +I GK VAPGR+HVARA++EAGYVENL+QAF++YL DGGPAY+TG+EP A EA++LI  TGGVAVLAHPWALKN V IIRRLKDAGL G+EVYRSDGK
Subjt:  VAKITGKGVAPGRLHVARAMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGK

Query:  LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDR
        L  +S+LAD Y LLKLGGSD+HG+GG +ESE+GSVNLPV A++DFLNV RPIWC AI+  + +++++PS+SNL+ I RF + R+LKG  + SCG + +DR
Subjt:  LAAYSDLADNYGLLKLGGSDFHGRGGHSESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDR

Query:  CLTSWLTNEEKQNAEFEAIRLKLSHISI
        CL  WLT++E+ + +FEA+RLKLS + I
Subjt:  CLTSWLTNEEKQNAEFEAIRLKLSHISI


Sequences
CDS sequence
ATGGTGGGTGATGCCCCAAATTCCAAGAAATCCAAGACCAAGAAGAAGAAAAGGGGTGGCTCCAAGAAGAAGATGACTTCCGAACAGGCTGCCGCCTTTAAGTATGTCAC
GGAATGGGTTTATTTGGATCAATCTAATTCTCTTGCCTCCTCTGCTGCTGCGTCTGTTGTCGATGATTTTGGAGTTCAGAAGAGTCTTGGCAAAGGTGGGGAGAAGGTGG
TCTTTGAGTTGCATTCCCATTCCAAATTCAGTGATGGGTTTCTCACCCCTTCCAAGCTCGTTGAGAGAGCTCATGGAAATGGGGTGAAAGTTCTTGCTTTGACAGATCAT
GACACAATGTCTGGAATCCCTGAGGCTATAGAGGCAGCTCGTAGATTTGGTATCAAAATAATTCCCGGTGTTGAAATCAGTACAATATTCTCCAACGGTGGAGACTCAGA
ATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGCTGTGGACCAACAAAGATTGAGAAGCTGGAAAAGTTTTTAGAAAACATAAGGGAGGGGCGTTTTTTGCGTG
CAAAGAACATGGTGTCAAAACTGAATGAGCTAAAGCTGCCTCTTAAATGGGATCATGTAGCTAAGATTACTGGTAAAGGAGTTGCTCCTGGGAGACTCCATGTGGCCCGT
GCCATGGTTGAAGCAGGCTATGTGGAAAATTTAAAACAAGCATTTTCTCGATACCTTTTTGATGGTGGACCGGCTTACTCAACGGGATCAGAGCCTTGTGCAGCAGAAGC
AATACAATTGATACACGACACAGGTGGTGTGGCCGTTCTAGCTCATCCATGGGCCTTGAAAAATCCCGTTGCTATCATTAGAAGATTGAAAGATGCTGGTCTTCAGGGGC
TGGAGGTTTACAGGAGTGATGGGAAATTGGCAGCATACAGTGACCTAGCAGACAATTATGGGCTTCTTAAACTTGGAGGATCAGATTTTCATGGAAGAGGTGGACATAGT
GAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCGCGATTTCCTCAACGTTGCTCGACCAATTTGGTGCAGTGCCATTCGAGATAATCTCGAGAGTTATGT
CGAAGAGCCTTCAGAATCAAATCTAGCAAAGATTACTAGATTTGGAAGGACCCGTGTTTTGAAGGGTGGATTCTCACCAAGCTGCGGAAATGACTTCATTGATCGCTGTT
TAACTTCGTGGCTGACAAATGAAGAGAAGCAAAATGCCGAGTTTGAGGCTATCAGATTAAAGCTCTCCCACATTTCAATTAATCAAGAAGTTCAGGTGCCTTAA
mRNA sequence
GGGAAAAGGAGTATTCAGAGAAAGGAAGCAGAGAGAGGGAAATTCAGAAAGGGGGGGAAAAAAGAGGACACCTGCGTTTTGGAAGATATCCTTTCCTCAAAAAGCCGTCT
GTCTCACTCTTTTCCTCCCTTTGTTTGTTATCACTTTCCCTCAATTCTCATTCTCTTCCCCTCCCCTTCCCCTTCCCCTTCCCCACTCCATTAACCTAAATCCACCTCTT
CTCTCAAATCTTCACCTTCCTTATCCTTACCATGGTGGGTGATGCCCCAAATTCCAAGAAATCCAAGACCAAGAAGAAGAAAAGGGGTGGCTCCAAGAAGAAGATGACTT
CCGAACAGGCTGCCGCCTTTAAGTATGTCACGGAATGGGTTTATTTGGATCAATCTAATTCTCTTGCCTCCTCTGCTGCTGCGTCTGTTGTCGATGATTTTGGAGTTCAG
AAGAGTCTTGGCAAAGGTGGGGAGAAGGTGGTCTTTGAGTTGCATTCCCATTCCAAATTCAGTGATGGGTTTCTCACCCCTTCCAAGCTCGTTGAGAGAGCTCATGGAAA
TGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACAATGTCTGGAATCCCTGAGGCTATAGAGGCAGCTCGTAGATTTGGTATCAAAATAATTCCCGGTGTTGAAATCA
GTACAATATTCTCCAACGGTGGAGACTCAGAATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGCTGTGGACCAACAAAGATTGAGAAGCTGGAAAAGTTTTTA
GAAAACATAAGGGAGGGGCGTTTTTTGCGTGCAAAGAACATGGTGTCAAAACTGAATGAGCTAAAGCTGCCTCTTAAATGGGATCATGTAGCTAAGATTACTGGTAAAGG
AGTTGCTCCTGGGAGACTCCATGTGGCCCGTGCCATGGTTGAAGCAGGCTATGTGGAAAATTTAAAACAAGCATTTTCTCGATACCTTTTTGATGGTGGACCGGCTTACT
CAACGGGATCAGAGCCTTGTGCAGCAGAAGCAATACAATTGATACACGACACAGGTGGTGTGGCCGTTCTAGCTCATCCATGGGCCTTGAAAAATCCCGTTGCTATCATT
AGAAGATTGAAAGATGCTGGTCTTCAGGGGCTGGAGGTTTACAGGAGTGATGGGAAATTGGCAGCATACAGTGACCTAGCAGACAATTATGGGCTTCTTAAACTTGGAGG
ATCAGATTTTCATGGAAGAGGTGGACATAGTGAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCGCGATTTCCTCAACGTTGCTCGACCAATTTGGTGCA
GTGCCATTCGAGATAATCTCGAGAGTTATGTCGAAGAGCCTTCAGAATCAAATCTAGCAAAGATTACTAGATTTGGAAGGACCCGTGTTTTGAAGGGTGGATTCTCACCA
AGCTGCGGAAATGACTTCATTGATCGCTGTTTAACTTCGTGGCTGACAAATGAAGAGAAGCAAAATGCCGAGTTTGAGGCTATCAGATTAAAGCTCTCCCACATTTCAAT
TAATCAAGAAGTTCAGGTGCCTTAAGACTAAACAACTCCAACCAAGGTCGTCGATACTTGCTCATCATGCTAAGATAGTCGACATACCAATCAGATTCAGTCAGGTTTTG
TACTTTCCTTTGTTACTCTTCTTTGATGTTAGGGAAAAAGTGGGTTTCAGTAAATGATTCGTTGAGGTACAACAAAAAGCTCCCATTGGCTTATACTTTTTGGTCAAATC
AATAATACTTCATTCAATCTCTTCCAAGTTTTTTACCATTGTTTTATAGGATTTAGCAATATTTATCCATTTAATTGGATCACATGCATATGACTGTTGAAATGGATCTG
TTTTTTTGTTTTCTTGGATGAGCTCAATAATATGGGAGTGAGAGATTTGATCTT
Protein sequence
MVGDAPNSKKSKTKKKKRGGSKKKMTSEQAAAFKYVTEWVYLDQSNSLASSAAASVVDDFGVQKSLGKGGEKVVFELHSHSKFSDGFLTPSKLVERAHGNGVKVLALTDH
DTMSGIPEAIEAARRFGIKIIPGVEISTIFSNGGDSESEEPVHILAYYSSCGPTKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAKITGKGVAPGRLHVAR
AMVEAGYVENLKQAFSRYLFDGGPAYSTGSEPCAAEAIQLIHDTGGVAVLAHPWALKNPVAIIRRLKDAGLQGLEVYRSDGKLAAYSDLADNYGLLKLGGSDFHGRGGHS
ESEVGSVNLPVLAMRDFLNVARPIWCSAIRDNLESYVEEPSESNLAKITRFGRTRVLKGGFSPSCGNDFIDRCLTSWLTNEEKQNAEFEAIRLKLSHISINQEVQVP
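
The protein sequence should be the translation of the CDS sequence listed on this page. The following is a minimal sketch of that check, assuming the standard genetic code (the page does not say which translation table was used). The `translate` helper is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS sequence listed above.
cds_start = "ATGGTGGGTGATGCCCCAAATTCCAAGAAA"
cds_end = "GAAGTTCAGGTGCCTTAA"

print(translate(cds_start))  # MVGDAPNSKK, the first ten residues of the protein
print(translate(cds_end))    # EVQVP, the last five residues before the stop codon
```

Passing the full CDS, with its lines joined, to `translate` would allow the same comparison against the complete protein sequence.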