CuGenDBv2

Sed0015493 (gene) of Chayote v1 genome

Gene ID: Sed0015493
Organism: Sechium edule (Chayote v1)
Description: POLIIIAc domain-containing protein
Genome location: LG10:35381577..35386860
RNA-Seq Expression: Sed0015493
Synteny: Sed0015493
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004534 - 5'-3' exoribonuclease activity (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
IPR003141 - Polymerase/histidinol phosphatase, N-terminal
IPR004013 - PHP domain
IPR016195 - Polymerase/histidinol phosphatase-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588776.1 hypothetical protein SDJN03_17341, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.8e-222; % identity: 88.06)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HFV + PNSKKSKKKKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLAS+AAASVVDDFGVQKTLGKGGEKVVFDLHSHSK SDGFLSPSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK
        ERA+GNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNELK
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK

Query:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE
        LPLKWDHVA+IT KGVAPGRLHVARAMVEAG+VENLKQAF+RYL+DGGPAY+ GSEPCAE+AI+LIH+TGGV+VLAHPWALKNPVAIIRRLKDAGLHGLE
Subjt:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE

Query:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS
        VYRSDGKLAAYSDLAD  GLLKLGGSDFHG GGNS+SEVGSVNLPVLAMHDFLK ARPIWC AIRD L+SYV+EPSD+NLAKITRFGRT   KGGS PPS
Subjt:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP
          ND+I+HCL  WLTNEEKQ+AEFEAIRLKLSH+S+  QE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP

XP_004136869.1 uncharacterized protein LOC101218042 [Cucumis sativus] (e value: 4.8e-222; % identity: 86.68)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HF  S PNSKKSK KKKKRGGTKKKMTSEQ AAFKYV +W YLD+ NSLASSAAASVVDDFGVQKT+GKGGEKVVF+LHSHSKCSDGFL+PSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL
        ERA+GNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNEL
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL
        KLPLKWDHVA+IT KGVAPGRLHVARA+VEAG+VENLKQAF+RYL+DGGPAY+ GSEPCA EAI+LIHDTGG++VLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL

Query:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS
        EVYRSDG+LAAYSDLAD +GLLKLGGSDFHG GG+S+SEVGSVNLPVLAMHDFLKAARP+WC AIRD LESYV+EPS++NLAKITRFGRT   KGGS P 
Subjt:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP
        SGND+IE CL LWLTNEEKQN EFEAIRLKLSH+SINQE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP

XP_008455216.1 PREDICTED: 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo] (e value: 3.4e-220; % identity: 86.46)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HF  S  NSKKSK KKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLASSAAASVVDDFGVQK+LGKGGEKVVF+LHSHSKCSDGFL+PSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL
        ERA+GNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNEL
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL
        KLPLKWDHVA+IT KGVAPGRLHVARA+VEAG+VENLKQAF+RYL+DGGPAY+ GSEPCA EAI+LI DTGG++VLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL

Query:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS
        EVYRSDG+LAAYSDLAD +GLLKLGGSDFHG GG+S+SEVGSVNLPVLAMHDFLKAARP+WC AIRD LE YV+EPS++NLAKITRFGRT   KGGS PS
Subjt:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP
        SGND+IE CL LWLTNEEKQN EFEAIRLKLSH+SINQE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP

XP_022927883.1 uncharacterized protein LOC111434646 [Cucurbita moschata] (e value: 3.1e-221; % identity: 87.84)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HFV + PNSKKSKKKKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLAS+AAASVVDDFGVQKTLGKGGEKVVFDLHSHSK SDGFLSPSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK
        ERA+GNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKL+  LENIREGRFLRAKNMVSKLNELK
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK

Query:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE
        LPLKWDHVA+IT KGVAPGRLHVARAMVEAG+VENLKQAF+RYL+DGGPAY+ GSEPCAE+AI+LIH+TGGV+VLAHPWALKNPVAIIRRLKDAGLHGLE
Subjt:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE

Query:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS
        VYRSDGKLA YSDLAD  GLLKLGGSDFHG GGNS+SEVGSVNLP LAMHDFLK ARPIWC AIRD LESYV+EPSD+NLAKITRFGRT   KGGS PPS
Subjt:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP
          ND+I+HCL  WLTNEEKQ+AEFEAIRLKLSH+S+  QE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP

XP_023531739.1 uncharacterized protein LOC111793902 [Cucurbita pepo subsp. pepo] (e value: 1.2e-220; % identity: 87.61)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HFV + PNSKKSKKKKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLAS+AAASVVDDFGVQKTLGKGGEKVVFDLHSHSK SDGFLSPSKL+
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK
        ERA+GNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNELK
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK

Query:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE
        LPLKWDHVA+IT KGVAPGRLHVARAMVEAG+VENLKQAF+RYL+DGGPAY+ GSEPCAE+AI+LIH+TGGV+VLAHPWALKNPVAIIRRLKDAGLHGLE
Subjt:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE

Query:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS
        VYRSDGKLAAYSDLAD  GLLKLGGSDFHG GGNS+SEVGSVNLPVLAM DFLK ARPIWC AIRD LESYV+EP+D+NLAKITRFGRT   KGGS PPS
Subjt:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP
          ND+I+HCL  WLTNEEKQ+AEFEAIRLKLSH+S+  QE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K205 POLIIIAc domain-containing protein (e value: 2.3e-222; % identity: 86.68)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HF  S PNSKKSK KKKKRGGTKKKMTSEQ AAFKYV +W YLD+ NSLASSAAASVVDDFGVQKT+GKGGEKVVF+LHSHSKCSDGFL+PSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL
        ERA+GNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNEL
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL
        KLPLKWDHVA+IT KGVAPGRLHVARA+VEAG+VENLKQAF+RYL+DGGPAY+ GSEPCA EAI+LIHDTGG++VLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL

Query:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS
        EVYRSDG+LAAYSDLAD +GLLKLGGSDFHG GG+S+SEVGSVNLPVLAMHDFLKAARP+WC AIRD LESYV+EPS++NLAKITRFGRT   KGGS P 
Subjt:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP
        SGND+IE CL LWLTNEEKQN EFEAIRLKLSH+SINQE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP

A0A1S3C0E5 3',5'-nucleoside bisphosphate phosphatase (e value: 1.7e-220; % identity: 86.46)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HF  S  NSKKSK KKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLASSAAASVVDDFGVQK+LGKGGEKVVF+LHSHSKCSDGFL+PSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL
        ERA+GNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNEL
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNEL

Query:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL
        KLPLKWDHVA+IT KGVAPGRLHVARA+VEAG+VENLKQAF+RYL+DGGPAY+ GSEPCA EAI+LI DTGG++VLAHPWALKNPVA+IRRLKDAGLHGL
Subjt:  KLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGL

Query:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS
        EVYRSDG+LAAYSDLAD +GLLKLGGSDFHG GG+S+SEVGSVNLPVLAMHDFLKAARP+WC AIRD LE YV+EPS++NLAKITRFGRT   KGGS PS
Subjt:  EVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP
        SGND+IE CL LWLTNEEKQN EFEAIRLKLSH+SINQE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP

A0A5A7SPM7 3',5'-nucleoside bisphosphate phosphatase (e value: 8.9e-214; % identity: 78.16)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HF  S  NSKKSK KKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLASSAAASVVDDFGVQK+LGKGGEKVVF+LHSHSKCSDGFL+PSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSE
        ERA+GNG                                               VKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS+ G+SE
Subjt:  ERANGNG-----------------------------------------------VKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSS-GNSE

Query:  SEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYA
        SEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNELKLPLKWDHVA+IT KGVAPGRLHVARA+VEAG+VENLKQAF+RYL+DGGPAY+
Subjt:  SEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYA

Query:  KGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDF
         GSEPCA EAI+LI DTGG++VLAHPWALKNPVA+IRRLKDAGLHGLEVYRSDG+LAAYSDLAD +GLLKLGGSDFHG GG+S+SEVGSVNLPVLAMHDF
Subjt:  KGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDF

Query:  LKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPSSGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP
        LKAARP+WC AIRD LE YV+EPS++NLAKITRFGRT   KGGS PSSGND+IE CL LWLTNEEKQN EFEAIRLKLSH+SINQE QVP
Subjt:  LKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPSSGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQVP

A0A6J1EMA5 uncharacterized protein LOC111434646 (e value: 1.5e-221; % identity: 87.84)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GD HFV + PNSKKSKKKKKKRGGTKKKMTSEQ AAFKYV +WVYLD+ NSLAS+AAASVVDDFGVQKTLGKGGEKVVFDLHSHSK SDGFLSPSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK
        ERA+GNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKL+  LENIREGRFLRAKNMVSKLNELK
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK

Query:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE
        LPLKWDHVA+IT KGVAPGRLHVARAMVEAG+VENLKQAF+RYL+DGGPAY+ GSEPCAE+AI+LIH+TGGV+VLAHPWALKNPVAIIRRLKDAGLHGLE
Subjt:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE

Query:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS
        VYRSDGKLA YSDLAD  GLLKLGGSDFHG GGNS+SEVGSVNLP LAMHDFLK ARPIWC AIRD LESYV+EPSD+NLAKITRFGRT   KGGS PPS
Subjt:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP
          ND+I+HCL  WLTNEEKQ+AEFEAIRLKLSH+S+  QE QVP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP

A0A6J1JMJ7 uncharacterized protein LOC111486022 (e value: 1.7e-220; % identity: 87.39)
Query:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV
        M GDGHFV + PNSKKSKKKKKKRGG+KKKMTSEQ AAFKYV +WVYLD+ NSLAS+AAASVVDDFGVQKTLGKGGEKVVF+LHSHSK SDGFLSPSKLV
Subjt:  MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLV

Query:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK
        ERA+GNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKL++ LENIREGRFLRAKNMVSKLNELK
Subjt:  ERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELK

Query:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE
        LPLKWDHVA+IT KGVAPGRLHVARAMVEAG+VENLKQAF+RYL+DGGPAY+ GSEPCAE+AI+LIH+TGGV+VLAHPWALKNPVAIIRRLKDAGLHGLE
Subjt:  LPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLE

Query:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS
        VYRSDGKLAAYSDLAD  GLLKLGGSDFHG GGNS+SEVGSVNLPVLAMHDFLK AR IWC AIRD LESYV+EPS++NLAKITRFGRT   KGGS PPS
Subjt:  VYRSDGKLAAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGS-PPS

Query:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP
          ND+I+HCL  WLTNEEKQ+AEFEAIRLKLSH+S+  QE +VP
Subjt:  SGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSI-NQEPQVP

SwissProt top hits (e value, % identity, alignment)
C8WJZ5 Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase (e value: 1.1e-19; % identity: 28.4)
Query:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCG-PAKIEKLDQL
        ++ DLH HS  SDG  +  +++E+A   GV+ LA T+HDT +G+  A E   R G++++ G+E+S      + E    VHIL      G PA    L  L
Subjt:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCG-PAKIEKLDQL

Query:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPC-AEEAIRLIHDTGGVSVLA
          +  E R   +   + +L E    +  +    +        + H+  A+    +     +   R L+  G    +  +   A +A+R++ + GG++VLA
Subjt:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPC-AEEAIRLIHDTGGVSVLA

Query:  HPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAY---SDLADKFGLLKLGGSDFHG
        HP  L +   ++  L + GL G+E +  D  LA +   ++LA ++ L+  GGSD+HG
Subjt:  HPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAY---SDLADKFGLLKLGGSDFHG

O54453 5'-3' exoribonuclease (e value: 1.4e-27; % identity: 34.48)
Query:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLD
        V++DLHSH+  SDG L+P  LV RA    V  LA+TDHDT + IP A E   R G  + +IPGVEIST++ +        +HI+        PA  + L 
Subjt:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLD

Query:  QLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVL
        Q  E     R  R + +  +L +  +P  W+   R+ + G A  R H AR +VE G    +   F +YL  G   Y        E+AI +IH +GG +VL
Subjt:  QLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVL

Query:  AHPWALKNPVAIIRRL--KDAGLHG-----LEVYRSDGKLAAYSDLADKFGLLKLGGSDFH
        AHP         ++RL    A  HG      +  +S  +    + LA +  L    GSDFH
Subjt:  AHPWALKNPVAIIRRL--KDAGLHG-----LEVYRSDGKLAAYSDLADKFGLLKLGGSDFH

P44176 5'-3' exoribonuclease (e value: 3.7e-31; % identity: 36.96)
Query:  FDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLDQLLE
        +DLH HS  SDG LSP++LV RA   GV VLAL DHDT++GI EA  AA+  GI++I GVEIST +          +HI+   +    P    K+  LL+
Subjt:  FDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLDQLLE

Query:  NIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPW
        + +  R  RA  +  KL +  +P  +D    + D  V   R H AR +V+ G V N  QAF RYL  G  A+ K        AI  IH  GG++++AHP 
Subjt:  NIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPW

Query:  ALKNPVAIIRRL----KDAGLHGLEVY---RSDGKLAAYSDLADKFGLLKLGGSDFH
                +R+L    K  G  G+E+    ++  +    +  A +F L    GSDFH
Subjt:  ALKNPVAIIRRL----KDAGLHGLEVY---RSDGKLAAYSDLADKFGLLKLGGSDFH

P77766 5'-3' exoribonuclease (e value: 1.8e-25; % identity: 33.97)
Query:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLD
        V++DLHSH+  SDG L+P  LV RA    V  LA+TDHDT + I  A E   R G  + +IPGVEIST++ +        +HI+        P   E L 
Subjt:  VVFDLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSSGNSESEEPVHILAY-YSSCGPAKIEKLD

Query:  QLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVL
        Q  E     R  RA+ +  +L + ++P   +   R+  +G A  R H AR +VE G   ++   F +YL  G   Y        E+AI +IH +GG +VL
Subjt:  QLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVL

Query:  AHP--------WALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAYSDLADKFGLLKLGGSDFH
        AHP        W LK  VA         +   +  +S  +    + LA +  L    GSDFH
Subjt:  AHP--------WALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAYSDLADKFGLLKLGGSDFH

Q7NXD4 3',5'-nucleoside bisphosphate phosphatase (e value: 2.3e-33; % identity: 33.66)
Query:  DLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENI
        DLH HS+ SDG L+P+++++RA      +LALTDHD   G+ EA  AA R GI  + GVE+S       S     VHI+       PA+   L   L++I
Subjt:  DLHSHSKCSDGFLSPSKLVERANGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENI

Query:  REGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWAL
        REGR  RA+ M + L    +   +D   R  D      R H AR +V++G V++++  F +YL  G P Y        E+A+  I   GG++V+AHP   
Subjt:  REGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWAL

Query:  KNPVAIIRRL----KDAGLHGLEVYRSDGKL---AAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDE
             +I RL    + AG  G+EV      L     ++  AD+ GL    GSDFH  G   +    + +LP +         RPIW       LE+ +  
Subjt:  KNPVAIIRRL----KDAGLHGLEVYRSDGKL---AAYSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDE

Query:  PSD
        P+D
Subjt:  PSD

Arabidopsis top hits (e value, % identity, alignment)
AT2G13840.1 Polymerase/histidinol phosphatase-like (e value: 5.4e-155; % identity: 62.59)
Query:  NSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLG--KGGEKVVFDLHSHSKCSDGFLSPSKLVERANGNGVKV
        + KK  KKKK+  G K+KMT+EQ+ AFK + DW+ L    SL+SS+     DDF V    G  + GEKVVF+LHSHS  SDGFLSPSK+VERA  NGVKV
Subjt:  NSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLG--KGGEKVVFDLHSHSKCSDGFLSPSKLVERANGNGVKV

Query:  LALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELKLPLKWDHVAR
        L+LTDHDTM+G+PEA+EA RRFGIKIIPG+EIST+F   +S SEEPVHILAYY + GPA  ++L+  L  IR+GRF+R + MV KLN+LK+PLKW+HV R
Subjt:  LALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELKLPLKWDHVAR

Query:  ITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAA
        I  K VAPGR+HVARA++EAG+VENL+QAF +YL+DGGPAYA G+EP AEEA++LI  TGGV+VLAHPWALKN V IIRRLKDAGLHG+EVYRSDGKL  
Subjt:  ITDKGVAPGRLHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAA

Query:  YSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPSSGNDVIEHCLA
        +S+LAD + LLKLGGSD+HG GG ++SE+GSVNLPV A+ DFL   RPIWC AI+  + +++D+PSD+NL+ I RF +    KG S  S G ++++ CLA
Subjt:  YSDLADKFGLLKLGGSDFHGIGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPSSGNDVIEHCLA

Query:  LWLTNEEKQNAEFEAIRLKLSHVSI
        +WLT++E+ + +FEA+RLKLS V I
Subjt:  LWLTNEEKQNAEFEAIRLKLSHVSI


Sequences
CDS sequence
ATGGCGGGCGACGGGCATTTCGTCAATTCGGTGCCCAATTCGAAGAAATCGAAGAAGAAGAAGAAGAAGCGAGGCGGCACCAAGAAGAAGATGACTTCCGAACAGGCGGC
GGCGTTCAAGTACGTCGTGGATTGGGTTTATTTGGATCGCTGTAATTCTCTTGCCTCTTCGGCGGCGGCGTCTGTTGTTGACGATTTTGGCGTTCAGAAGACGCTTGGGA
AAGGTGGGGAGAAGGTTGTGTTCGATTTGCATTCGCACTCCAAGTGTAGCGATGGGTTTCTTTCCCCTTCGAAGCTCGTCGAGCGTGCCAATGGAAATGGGGTGAAAGTT
CTTGCTTTGACAGATCATGACACGATGTCTGGGATCCCCGAGGCTATCGAAGCAGCTCGTAGATTTGGTATCAAAATAATTCCAGGTGTAGAAATCAGTACCATATTCTC
TTCAGGAAATTCAGAATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGCTGTGGACCAGCAAAGATTGAGAAGCTGGACCAACTCTTGGAAAATATAAGGGAGG
GCCGTTTTTTGCGTGCGAAGAACATGGTCTCAAAACTGAATGAGCTAAAGCTGCCTCTTAAATGGGATCATGTGGCGAGGATTACTGATAAAGGAGTTGCTCCTGGAAGA
CTTCATGTGGCTCGTGCCATGGTTGAAGCAGGCCATGTCGAAAATCTAAAACAAGCATTTGCTCGTTACCTTTATGATGGTGGACCTGCTTATGCAAAGGGATCAGAGCC
GTGCGCAGAGGAAGCAATTCGATTGATACACGATACGGGTGGTGTTTCCGTACTAGCTCATCCATGGGCCTTGAAGAATCCTGTTGCTATCATTAGAAGATTGAAAGATG
CCGGTCTTCATGGGCTGGAGGTTTACAGGAGTGATGGAAAATTGGCAGCATACAGTGATCTAGCAGACAAGTTTGGGCTTCTGAAACTTGGAGGATCAGATTTTCATGGA
ATAGGTGGAAACAGTAAATCTGAAGTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCACGACTTCCTCAAGGCTGCTCGACCTATTTGGTGTGGCGCCATTCGAGACAA
TCTTGAGAGCTATGTCGATGAGCCTTCAGACACAAATCTAGCAAAAATTACTAGATTTGGTAGGACGCACGGTTCGAAAGGTGGCTCTCCGCCGAGCAGCGGAAATGACG
TCATCGAGCATTGTTTAGCTTTGTGGCTTACAAATGAAGAGAAGCAAAATGCTGAGTTTGAGGCTATTAGATTGAAGCTCTCTCATGTTTCAATTAATCAAGAACCTCAG
GTGCCTTAA
mRNA sequence
AGAAAATCCAGAAAAAAAGGGGCAAAAAAAAGAAAAGAAAAAAAGGACACTGCAATTTGAGAAGAAATCCTTTCCCCAAAAAGCTGCCTGTCTCAAACGTTCCTCCTTTT
CTTTGTTCATCACTTTCCCCCAATTCCCAATTCTGACCTAAAAACCCTTTTCTCCAGTAAAACCTCCACCCTCAAATCCATGGCGGGCGACGGGCATTTCGTCAATTCGG
TGCCCAATTCGAAGAAATCGAAGAAGAAGAAGAAGAAGCGAGGCGGCACCAAGAAGAAGATGACTTCCGAACAGGCGGCGGCGTTCAAGTACGTCGTGGATTGGGTTTAT
TTGGATCGCTGTAATTCTCTTGCCTCTTCGGCGGCGGCGTCTGTTGTTGACGATTTTGGCGTTCAGAAGACGCTTGGGAAAGGTGGGGAGAAGGTTGTGTTCGATTTGCA
TTCGCACTCCAAGTGTAGCGATGGGTTTCTTTCCCCTTCGAAGCTCGTCGAGCGTGCCAATGGAAATGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACGATGTCTG
GGATCCCCGAGGCTATCGAAGCAGCTCGTAGATTTGGTATCAAAATAATTCCAGGTGTAGAAATCAGTACCATATTCTCTTCAGGAAATTCAGAATCAGAAGAACCAGTA
CACATCCTTGCATATTACAGCAGCTGTGGACCAGCAAAGATTGAGAAGCTGGACCAACTCTTGGAAAATATAAGGGAGGGCCGTTTTTTGCGTGCGAAGAACATGGTCTC
AAAACTGAATGAGCTAAAGCTGCCTCTTAAATGGGATCATGTGGCGAGGATTACTGATAAAGGAGTTGCTCCTGGAAGACTTCATGTGGCTCGTGCCATGGTTGAAGCAG
GCCATGTCGAAAATCTAAAACAAGCATTTGCTCGTTACCTTTATGATGGTGGACCTGCTTATGCAAAGGGATCAGAGCCGTGCGCAGAGGAAGCAATTCGATTGATACAC
GATACGGGTGGTGTTTCCGTACTAGCTCATCCATGGGCCTTGAAGAATCCTGTTGCTATCATTAGAAGATTGAAAGATGCCGGTCTTCATGGGCTGGAGGTTTACAGGAG
TGATGGAAAATTGGCAGCATACAGTGATCTAGCAGACAAGTTTGGGCTTCTGAAACTTGGAGGATCAGATTTTCATGGAATAGGTGGAAACAGTAAATCTGAAGTTGGAA
GTGTAAACCTTCCTGTTCTTGCTATGCACGACTTCCTCAAGGCTGCTCGACCTATTTGGTGTGGCGCCATTCGAGACAATCTTGAGAGCTATGTCGATGAGCCTTCAGAC
ACAAATCTAGCAAAAATTACTAGATTTGGTAGGACGCACGGTTCGAAAGGTGGCTCTCCGCCGAGCAGCGGAAATGACGTCATCGAGCATTGTTTAGCTTTGTGGCTTAC
AAATGAAGAGAAGCAAAATGCTGAGTTTGAGGCTATTAGATTGAAGCTCTCTCATGTTTCAATTAATCAAGAACCTCAGGTGCCTTAACACTCAGCTTGTCGTGCCGACG
TAGTCGCATATCGATCAGATTTAGTCAGGTTTTTTCACTTGCTTTTGTTCCTTGATTTAGTCAGGTTCCAGTAAACAATTCGTTGAAAGTTGATACTTTTTGTTCAAACC
AATAAACTTCATTCAATCTCATGTTAGTTTTTTGTTCATCGTTTTATAGGACATTGTAATATTTATCCATACCTATTTAAATTGGATCACAAGCATAGGACTGTTGTAAT
AGTTCATCTTTTGGTAATGTTGAAAATGAACAAGGATATGTTGAAAGGTTTTAGTGTTTTGGCCTCTTGACTTTTGTTAGCTTTTATTTGTTTATTATAATTTAAC
Protein sequence
MAGDGHFVNSVPNSKKSKKKKKKRGGTKKKMTSEQAAAFKYVVDWVYLDRCNSLASSAAASVVDDFGVQKTLGKGGEKVVFDLHSHSKCSDGFLSPSKLVERANGNGVKV
LALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSSGNSESEEPVHILAYYSSCGPAKIEKLDQLLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITDKGVAPGR
LHVARAMVEAGHVENLKQAFARYLYDGGPAYAKGSEPCAEEAIRLIHDTGGVSVLAHPWALKNPVAIIRRLKDAGLHGLEVYRSDGKLAAYSDLADKFGLLKLGGSDFHG
IGGNSKSEVGSVNLPVLAMHDFLKAARPIWCGAIRDNLESYVDEPSDTNLAKITRFGRTHGSKGGSPPSSGNDVIEHCLALWLTNEEKQNAEFEAIRLKLSHVSINQEPQ
VP