; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023298 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023298
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHistidine phosphatase superfamily, clade-1
Genome locationtig00000892:1984959..1995775
RNA-Seq ExpressionSgr023298
SyntenySgr023298
Gene Ontology termsGO:0016311 - dephosphorylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016791 - phosphatase activity (molecular function)
GO:0016868 - intramolecular transferase activity, phosphotransferases (molecular function)
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010362.1 putative 2-carboxy-D-arabinitol-1-phosphatase, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-20676.97Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        M+ FSL PTAH+SH LLL +SSG FP  I  SSFTVRSSSSLQEVEKFSESS++RK+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +         +  R            S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF+IDGHYPVRELW
Subjt:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
         RARNCWN+ILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQ EGGSP+I LNRLNQTPNSPVAS SSGGRKT +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQ-----D
        GV E+N ASSSFLEDKPMN+LG IQSQKVAELLLDLKVS VISSPKKAC+ETA A+SRVQEAADCLGADCVPRYVEMKQTNKLD+D+I DHFKQ     D
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQ-----D

Query:  VANTNVFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        V +TNVFEPGW N L+ GVITEVWNQSGEAWKSLLNEM DE+  EKI+VVVGHPAILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VANTNVFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

XP_022140576.1 probable 2-carboxy-D-arabinitol-1-phosphatase [Momordica charantia]6.2e-21480.7Show/hide
Query:  FSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFSI--
        FSL P  HYSHLL L KSSGFFPARIP+SSFTVRSSSSLQEVEKFSESSSERKELSSELYAS+PLPP++                   R +    FS+  
Subjt:  FSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFSI--

Query:  ------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARA
              + T  + +I        S P IRSKRTAEIIWGDRE+ IIT+SELREIDLYSFQGLLK EGK KFGAAYRQWQVDAANF IDGHYPVRELWARA
Subjt:  ------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARA

Query:  RNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVY
        RN WNKILAHESRSVLVVAHNAVNQALVATA+GLGAEYFRVLLQSNCGVSVLDF+ QAEGGSP+I LNRLNQTPNSP+ASGSS GRKTA+RIILVCHGV 
Subjt:  RNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVY

Query:  ENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFE
        ENN ASSSFLEDKPMNILG IQSQKVAELLLDLKVSTVISSPKK C+ETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDI+NI DHFKQDVANTNVFE
Subjt:  ENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFE

Query:  PGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        PGW NNLDDG+IT VWNQSGEAWK LL+EMADE   EKI VVVGHPA+LLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPSGRG
Subjt:  PGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

XP_022943446.1 probable 2-carboxy-D-arabinitol-1-phosphatase [Cucurbita moschata]1.1e-20777.55Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        M+ FSL PTAH+SH LLL +SSG FP  I  SSFTVRSSSSLQEVEKFSESS++RK+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +         +  R            S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF+IDGHYPVRELW
Subjt:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
         RARNCWN+ILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQ EGGSP+I LNRLNQTPNSPVAS SSGGRKT +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMN+LG IQSQKVAELLLDLKVS VISSPKKAC+ETA A+SRVQEAADCLGADCVPRYVEMKQTNKLD+D+I DHFKQDV +TN
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+ GVITEVWNQSGEAWKSLLNEM DE+  EKI+VVVGHP+ILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

XP_023512319.1 probable 2-carboxy-D-arabinitol-1-phosphatase [Cucurbita pepo subsp. pepo]3.3e-20777.35Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        MF FSL PT H+SH LLL +SSG  P RI  S FTVRSSSSLQEVEKFSESS++RK+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +         +  R            S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF+IDGHYPVRELW
Subjt:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
         RARNCWN+ILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQ EGGSP+I LNRLNQTPNSPVAS SSGGRKT +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMN+LG IQSQKVAELLLDLKVS VISSPKKAC+ETA A+SRVQEAADCLGADCVPRYVEMKQTNKLD+D+I DHFKQD+ +TN
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+ GVITEVWNQSGEAWKSLLNEM DE+  EKI+VVVGHPAILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

XP_038901330.1 probable 2-carboxy-D-arabinitol-1-phosphatase [Benincasa hispida]4.6e-20977.96Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        MF  SL P AH    LLLQK SG FPARIP SSFTVRSSSSLQEVEKFSESSS+RK+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +        + T  + +I        S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
Subjt:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
        ARARNCW++ILAHESRSVLVVAHNAVNQALVATAIGLG+EYFRVLLQSNCGVSVLDFTP AEGGSP+I LNRLNQTPNSPVASGSSGGRK  +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMN+LG+IQSQKVAELLLDLKVS VISS KKAC+ETA AISRVQEAADCLGADCVPRYVEM+QTNKLD+DNI DHFKQD+ +TN
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+ GVITEVWNQSGEAWKSLLNE+ADE+  EKI+VVVGHPAILLGLVGQCLN+TK+WIGSFHLDAGS+SV D+PDGPSGRG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

TrEMBL top hitse value%identityAlignment
A0A0A0KDK7 Uncharacterized protein8.8e-20677.55Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        MF  SL P AH+ HLL    SSG+FPARI  SSFTVRSSSSLQEVEK SESS + K+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +        + T  + +I        S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF IDGHYPVRELW
Subjt:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
        ARARNCW++ILAHESRSVLVVAHNAVNQALVATAIGLG+EYFRVLLQSNCGVSVLDFTP AEGGSP+I LNRLNQTPNSPVASGSSGGRK  +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMNILG+IQSQKVAELLLDLKVS VISSPKKAC+ETA AISRVQEAADCLGADCVPRYVEMKQTNKLD++NI DHF QDV + N
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+DGVITEVWNQSGEAWKSLLNEMADE+  EKIVVVVGHPAILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS +G
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

A0A1S3C9H8 probable 2-carboxy-D-arabinitol-1-phosphatase2.6e-20576.94Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        MF  SL P A++ HLL    SS +FP RIP SSFT+RSSSS+QEVEK SESSS+RKELSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +        + T  + +I        S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF IDGHYPVRELW
Subjt:  I--------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
        ARARNCW++ILAHESRSVLVVAHNAVNQALVATAIGLG+EYFRVLLQSNCGVSVLDFTP A+GGSP+I LNRLNQTPNSPVASGSSGGRK  +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMNILG+IQSQKVAELLLDLKV+ VISSPKKAC+ETA AISRVQEAADCLGADCVPRYVEMKQTNKLD++NI DHFKQDV + N
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+DGVITEVWNQS EAWKSLLNEMADE+  EKIVVVVGHPAILLGL+GQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

A0A6J1CGG9 probable 2-carboxy-D-arabinitol-1-phosphatase3.0e-21480.7Show/hide
Query:  FSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFSI--
        FSL P  HYSHLL L KSSGFFPARIP+SSFTVRSSSSLQEVEKFSESSSERKELSSELYAS+PLPP++                   R +    FS+  
Subjt:  FSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFSI--

Query:  ------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARA
              + T  + +I        S P IRSKRTAEIIWGDRE+ IIT+SELREIDLYSFQGLLK EGK KFGAAYRQWQVDAANF IDGHYPVRELWARA
Subjt:  ------SLTPCRFIIK---QWRLSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARA

Query:  RNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVY
        RN WNKILAHESRSVLVVAHNAVNQALVATA+GLGAEYFRVLLQSNCGVSVLDF+ QAEGGSP+I LNRLNQTPNSP+ASGSS GRKTA+RIILVCHGV 
Subjt:  RNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVY

Query:  ENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFE
        ENN ASSSFLEDKPMNILG IQSQKVAELLLDLKVSTVISSPKK C+ETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDI+NI DHFKQDVANTNVFE
Subjt:  ENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFE

Query:  PGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        PGW NNLDDG+IT VWNQSGEAWK LL+EMADE   EKI VVVGHPA+LLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPSGRG
Subjt:  PGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

A0A6J1FRQ6 probable 2-carboxy-D-arabinitol-1-phosphatase5.5e-20877.55Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        M+ FSL PTAH+SH LLL +SSG FP  I  SSFTVRSSSSLQEVEKFSESS++RK+LSSELYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +         +  R            S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAYRQWQVDAANF+IDGHYPVRELW
Subjt:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
         RARNCWN+ILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQ EGGSP+I LNRLNQTPNSPVAS SSGGRKT +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMN+LG IQSQKVAELLLDLKVS VISSPKKAC+ETA A+SRVQEAADCLGADCVPRYVEMKQTNKLD+D+I DHFKQDV +TN
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+ GVITEVWNQSGEAWKSLLNEM DE+  EKI+VVVGHP+ILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

A0A6J1J6A1 probable 2-carboxy-D-arabinitol-1-phosphatase6.7e-20676.73Show/hide
Query:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS
        M  FSL PTAH+SH LLL +SSG FP  I +SSFTVRS SSLQEVEKFSESS++RK+LSS+LYASVPLPP++                   R +    FS
Subjt:  MFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRL------------------RREWFCRFS

Query:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW
        +         +  R            S P +RSKRTAEIIWGDRE+VI+T+SELREIDLYSFQGLLKHEGK KFGAAY QWQVDAANF+ID HYPVRELW
Subjt:  ISLTPCRFIIKQWR-----------LSEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELW

Query:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH
         RARNCWN+ILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQ EGGSP+I LNRLNQTPNSPVAS SSGGRKT +RIILVCH
Subjt:  ARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCH

Query:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN
        GV E+N ASSSFLEDKPMN+LG IQSQKVAELLLDLKVS VISSPKKAC+ETA A+SRVQEAADCLGADCVPRYVEMKQTNKLD+D+I DHFKQD+ +TN
Subjt:  GVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTN

Query:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        VFEPGW N L+ GVITEVWNQSGEAWKSLLNEM DE+  EKI+VVVGHPAILLGLVGQCLNLTK+WIGSFHLDAGSISVLD+PDGPS RG
Subjt:  VFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

SwissProt top hitse value%identityAlignment
Q9FNJ9 Probable 2-carboxy-D-arabinitol-1-phosphatase8.1e-12461.14Show/hide
Query:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV
        + P  RSK+TAEIIWG RE  +I + +LREIDLYSFQGLLK EGK KFG A++QWQ D ANF IDGHYPVRELW+RAR+CW  ILAHES+SVLVVAHNAV
Subjt:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV

Query:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI
        NQAL+ATAIGLG EYFR LLQSNCGVSVLDF P+A+GGSP + LNRLNQTPNSP+A GSSGGRK +++IILVCHG   N  +   + +   D+ MN+LG+
Subjt:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI

Query:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG
        I SQK AELLLDL+VS+++ SPK A +E++  ISRVQEAA CLG D VP YV+ KQ N+LD++++       +  +N       + LD+   + +WN+S 
Subjt:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG

Query:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        +AW+SLL+E++DE+    +I+VVVG     + L+ QCLNLTKE +G FHLDAGSISV+D+PDGPS +G
Subjt:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

W5EP13 2-carboxy-D-arabinitol-1-phosphatase6.0e-11957.22Show/hide
Query:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV
        + P  RS+RTAEIIW  R+  +I + +LREIDLYSFQGLLKHEGK K+GA ++QWQ + ++  IDGHYPVRELW RA+ CW +IL HE +SVLVVAHNAV
Subjt:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV

Query:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGASS-SFLEDKPMNILGIIQ
        NQALVAT++GLG EYFR LLQSNCG SVLDFTPQ  G  P + LNRLNQTP+SP+++ SS GRK+++RIILVC G  +++   S   +   P+N+LG+IQ
Subjt:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGASS-SFLEDKPMNILGIIQ

Query:  SQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSGEA
        +QK AELLLDLKV+++I SP+ A ++TA AI  VQEAA CLGADCVPRYVEMK    L+ID+      +  +   + + GW   ++   +  +W QS +A
Subjt:  SQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSGEA

Query:  WKSLLNEMADEEGTE--KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        W++L+NE+ +++G E  ++VV +GHPAI LGL+ +CLNLT +++ SFHLD GSISV+D+PDGP G G
Subjt:  WKSLLNEMADEEGTE--KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

Arabidopsis top hitse value%identityAlignment
AT5G22620.1 phosphoglycerate/bisphosphoglycerate mutase family protein5.8e-12561.14Show/hide
Query:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV
        + P  RSK+TAEIIWG RE  +I + +LREIDLYSFQGLLK EGK KFG A++QWQ D ANF IDGHYPVRELW+RAR+CW  ILAHES+SVLVVAHNAV
Subjt:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV

Query:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI
        NQAL+ATAIGLG EYFR LLQSNCGVSVLDF P+A+GGSP + LNRLNQTPNSP+A GSSGGRK +++IILVCHG   N  +   + +   D+ MN+LG+
Subjt:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI

Query:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG
        I SQK AELLLDL+VS+++ SPK A +E++  ISRVQEAA CLG D VP YV+ KQ N+LD++++       +  +N       + LD+   + +WN+S 
Subjt:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG

Query:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        +AW+SLL+E++DE+    +I+VVVG     + L+ QCLNLTKE +G FHLDAGSISV+D+PDGPS +G
Subjt:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

AT5G22620.2 phosphoglycerate/bisphosphoglycerate mutase family protein5.8e-12561.14Show/hide
Query:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV
        + P  RSK+TAEIIWG RE  +I + +LREIDLYSFQGLLK EGK KFG A++QWQ D ANF IDGHYPVRELW+RAR+CW  ILAHES+SVLVVAHNAV
Subjt:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV

Query:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI
        NQAL+ATAIGLG EYFR LLQSNCGVSVLDF P+A+GGSP + LNRLNQTPNSP+A GSSGGRK +++IILVCHG   N  +   + +   D+ MN+LG+
Subjt:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGA---SSSFLEDKPMNILGI

Query:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG
        I SQK AELLLDL+VS+++ SPK A +E++  ISRVQEAA CLG D VP YV+ KQ N+LD++++       +  +N       + LD+   + +WN+S 
Subjt:  IQSQKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSG

Query:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        +AW+SLL+E++DE+    +I+VVVG     + L+ QCLNLTKE +G FHLDAGSISV+D+PDGPS +G
Subjt:  EAWKSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG

AT5G22620.3 phosphoglycerate/bisphosphoglycerate mutase family protein1.6e-11960Show/hide
Query:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV
        + P  RSK+TAEIIWG RE  +I + +LREIDLYSFQGLLK EGK KFG A++QWQ D ANF IDGHYPVRELW+RAR+CW  ILAHES+SVLVVAHNAV
Subjt:  SEPSIRSKRTAEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAV

Query:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGASSSFLEDKPMNILGIIQS
        NQAL+ATAIGLG EYFR LLQSNCGVSVLDF P+A+GGSP + LNRLNQTPNSP+A GSSGGRK +++IILVCHG   N                   +S
Subjt:  NQALVATAIGLGAEYFRVLLQSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGASSSFLEDKPMNILGIIQS

Query:  QKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSGEAW
        QK AELLLDL+VS+++ SPK A +E++  ISRVQEAA CLG D VP YV+ KQ N+LD++++       +  +N       + LD+   + +WN+S +AW
Subjt:  QKVAELLLDLKVSTVISSPKKACLETAAAISRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSGEAW

Query:  KSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG
        +SLL+E++DE+    +I+VVVG     + L+ QCLNLTKE +G FHLDAGSISV+D+PDGPS +G
Subjt:  KSLLNEMADEEGTE-KIVVVVGHPAILLGLVGQCLNLTKEWIGSFHLDAGSISVLDYPDGPSGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCCAATGTTCTGCTTCTCTCTTATCCCCACAGCTCACTACTCGCATCTCCTCCTCCTCCAGAAAAGCTCCGGCTTCTTTCCGGCGAGAATTCCGAGCAGTTCCTT
TACTGTTCGATCGTCTTCCAGTCTTCAGGAGGTCGAGAAGTTTTCCGAGTCATCGTCTGAACGGAAGGAGCTCAGCTCGGAGCTCTACGCTTCGGTTCCGTTGCCTCCAT
TAAGACTGCGAAGAGAGTGGTTCTGCAGGTTTTCTATCTCTTTAACTCCATGTCGCTTCATAATTAAGCAGTGGCGACTTTCGGAACCCTCTATACGATCAAAGAGAACT
GCTGAAATCATATGGGGTGATCGTGAGGATGTGATAATTACGGAATCTGAACTAAGGGAAATAGATTTATATTCATTTCAAGGTCTACTCAAGCACGAAGGGAAGGCAAA
GTTTGGTGCCGCTTATCGTCAGTGGCAGGTAGATGCTGCAAATTTTCACATTGATGGTCACTATCCAGTGAGAGAGTTGTGGGCACGTGCTCGAAATTGTTGGAATAAAA
TTCTAGCCCACGAGAGCAGGTCTGTGCTAGTGGTTGCTCACAATGCTGTCAACCAGGCTCTTGTTGCTACAGCTATTGGACTAGGAGCGGAGTACTTCAGGGTTTTACTT
CAGAGCAACTGTGGTGTTAGTGTTCTTGATTTCACTCCCCAAGCTGAGGGTGGATCTCCAGTTATTAGTTTAAACCGTTTAAATCAGACCCCAAACTCACCCGTTGCTTC
TGGTAGTTCTGGAGGCAGAAAAACAGCAAGAAGAATCATACTTGTTTGCCATGGAGTTTATGAGAATAATGGGGCGAGTTCTTCCTTCCTGGAGGATAAGCCAATGAACA
TTCTTGGGATTATACAGTCCCAGAAAGTTGCAGAGCTACTTCTTGATCTAAAAGTGAGCACTGTAATTAGCAGTCCCAAGAAAGCTTGTTTAGAAACGGCTGCAGCAATT
TCCAGAGTACAAGAAGCTGCAGATTGCTTGGGTGCTGATTGCGTGCCCCGCTATGTGGAGATGAAGCAGACGAATAAGCTTGATATAGACAATATTTCTGATCATTTCAA
GCAGGATGTAGCTAACACCAATGTCTTCGAACCTGGTTGGTTCAACAATTTGGACGATGGGGTGATTACAGAAGTGTGGAATCAGAGTGGTGAAGCCTGGAAGTCTCTGT
TGAATGAGATGGCTGATGAAGAGGGGACGGAAAAGATTGTTGTTGTAGTTGGCCATCCTGCCATTCTTCTAGGATTGGTGGGGCAGTGCCTAAATCTCACAAAAGAATGG
ATTGGATCATTCCATCTGGATGCTGGAAGCATTAGTGTGCTTGACTACCCTGATGGGCCGAGTGGCCGAGGAAAAAGACGGCTTCAAGTTCATCGGTGGGAAGAGATCGA
CATCTATGCTCGTGCTGTGCAGCGGATCCACGAATTGCAATCACAGAACATTTATCCAACGAATAGCTGGGAAATCAAGCAGCACAATCTTGTCAATGGCTGCCGAGAAA
GTTGCAGAGAACTCAACATGCTTGGGATGCCCCACAAACGCAGCGTGAGCTTCCTTGTTCTCGAATGTCATCAAGAAGGCATGCGTGAAGCCCTGTGTAAGCATCTCTGG
TCCCTCCATCTCCTGCCCCCTGTTTACACCTTAAAACATTCAGCTTTCTGCTTCGATTTCAGAGACGAGGTTCTCCATGGCTTTGAGAATCTCTTCCACCGCTGCTCCTT
CCTTAAACTTTGCGAGCACCAAGTGCTTGAACTCTCCTCCCATGGCTGCTCCACCCTTGAACTCGGAGGCCTCTCCCTCTTTATATATAAAAAATCACATCGAAACATTA
CATTAGATTGGATGTCACTGTTTCTGTGCACCGACACAGCTCACGTGTCATTTCTTTTCTCTGCTGACTCAAATACTATAAAAGGATTGTTGTCTCCTATCTTCCTCGAC
AGCATCTCTTGCTCGATATCTGGAAAGTTACGAGAAAAAAAAAACCTCACAAGGATAGCGTCGTTATCGCTGTGGCACAACGGGGTGGAGCAGCCGAGCGGGCACGGGCA
GGCAGCAGGACATTTCTCGCACGTGCCGAACCGTCGTGCAAATTCTGGTACCCGCGTCTTTCGATTCCCTTGGACCCCACCCACCCAATCAAATTCTGCCGCTCAACATG
TTACCGTGTACCGTCTCGCACTTGACCACTACCCGATTGTTGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCCAATGTTCTGCTTCTCTCTTATCCCCACAGCTCACTACTCGCATCTCCTCCTCCTCCAGAAAAGCTCCGGCTTCTTTCCGGCGAGAATTCCGAGCAGTTCCTT
TACTGTTCGATCGTCTTCCAGTCTTCAGGAGGTCGAGAAGTTTTCCGAGTCATCGTCTGAACGGAAGGAGCTCAGCTCGGAGCTCTACGCTTCGGTTCCGTTGCCTCCAT
TAAGACTGCGAAGAGAGTGGTTCTGCAGGTTTTCTATCTCTTTAACTCCATGTCGCTTCATAATTAAGCAGTGGCGACTTTCGGAACCCTCTATACGATCAAAGAGAACT
GCTGAAATCATATGGGGTGATCGTGAGGATGTGATAATTACGGAATCTGAACTAAGGGAAATAGATTTATATTCATTTCAAGGTCTACTCAAGCACGAAGGGAAGGCAAA
GTTTGGTGCCGCTTATCGTCAGTGGCAGGTAGATGCTGCAAATTTTCACATTGATGGTCACTATCCAGTGAGAGAGTTGTGGGCACGTGCTCGAAATTGTTGGAATAAAA
TTCTAGCCCACGAGAGCAGGTCTGTGCTAGTGGTTGCTCACAATGCTGTCAACCAGGCTCTTGTTGCTACAGCTATTGGACTAGGAGCGGAGTACTTCAGGGTTTTACTT
CAGAGCAACTGTGGTGTTAGTGTTCTTGATTTCACTCCCCAAGCTGAGGGTGGATCTCCAGTTATTAGTTTAAACCGTTTAAATCAGACCCCAAACTCACCCGTTGCTTC
TGGTAGTTCTGGAGGCAGAAAAACAGCAAGAAGAATCATACTTGTTTGCCATGGAGTTTATGAGAATAATGGGGCGAGTTCTTCCTTCCTGGAGGATAAGCCAATGAACA
TTCTTGGGATTATACAGTCCCAGAAAGTTGCAGAGCTACTTCTTGATCTAAAAGTGAGCACTGTAATTAGCAGTCCCAAGAAAGCTTGTTTAGAAACGGCTGCAGCAATT
TCCAGAGTACAAGAAGCTGCAGATTGCTTGGGTGCTGATTGCGTGCCCCGCTATGTGGAGATGAAGCAGACGAATAAGCTTGATATAGACAATATTTCTGATCATTTCAA
GCAGGATGTAGCTAACACCAATGTCTTCGAACCTGGTTGGTTCAACAATTTGGACGATGGGGTGATTACAGAAGTGTGGAATCAGAGTGGTGAAGCCTGGAAGTCTCTGT
TGAATGAGATGGCTGATGAAGAGGGGACGGAAAAGATTGTTGTTGTAGTTGGCCATCCTGCCATTCTTCTAGGATTGGTGGGGCAGTGCCTAAATCTCACAAAAGAATGG
ATTGGATCATTCCATCTGGATGCTGGAAGCATTAGTGTGCTTGACTACCCTGATGGGCCGAGTGGCCGAGGAAAAAGACGGCTTCAAGTTCATCGGTGGGAAGAGATCGA
CATCTATGCTCGTGCTGTGCAGCGGATCCACGAATTGCAATCACAGAACATTTATCCAACGAATAGCTGGGAAATCAAGCAGCACAATCTTGTCAATGGCTGCCGAGAAA
GTTGCAGAGAACTCAACATGCTTGGGATGCCCCACAAACGCAGCGTGAGCTTCCTTGTTCTCGAATGTCATCAAGAAGGCATGCGTGAAGCCCTGTGTAAGCATCTCTGG
TCCCTCCATCTCCTGCCCCCTGTTTACACCTTAAAACATTCAGCTTTCTGCTTCGATTTCAGAGACGAGGTTCTCCATGGCTTTGAGAATCTCTTCCACCGCTGCTCCTT
CCTTAAACTTTGCGAGCACCAAGTGCTTGAACTCTCCTCCCATGGCTGCTCCACCCTTGAACTCGGAGGCCTCTCCCTCTTTATATATAAAAAATCACATCGAAACATTA
CATTAGATTGGATGTCACTGTTTCTGTGCACCGACACAGCTCACGTGTCATTTCTTTTCTCTGCTGACTCAAATACTATAAAAGGATTGTTGTCTCCTATCTTCCTCGAC
AGCATCTCTTGCTCGATATCTGGAAAGTTACGAGAAAAAAAAAACCTCACAAGGATAGCGTCGTTATCGCTGTGGCACAACGGGGTGGAGCAGCCGAGCGGGCACGGGCA
GGCAGCAGGACATTTCTCGCACGTGCCGAACCGTCGTGCAAATTCTGGTACCCGCGTCTTTCGATTCCCTTGGACCCCACCCACCCAATCAAATTCTGCCGCTCAACATG
TTACCGTGTACCGTCTCGCACTTGACCACTACCCGATTGTTGCGTAA
Protein sequenceShow/hide protein sequence
MSPMFCFSLIPTAHYSHLLLLQKSSGFFPARIPSSSFTVRSSSSLQEVEKFSESSSERKELSSELYASVPLPPLRLRREWFCRFSISLTPCRFIIKQWRLSEPSIRSKRT
AEIIWGDREDVIITESELREIDLYSFQGLLKHEGKAKFGAAYRQWQVDAANFHIDGHYPVRELWARARNCWNKILAHESRSVLVVAHNAVNQALVATAIGLGAEYFRVLL
QSNCGVSVLDFTPQAEGGSPVISLNRLNQTPNSPVASGSSGGRKTARRIILVCHGVYENNGASSSFLEDKPMNILGIIQSQKVAELLLDLKVSTVISSPKKACLETAAAI
SRVQEAADCLGADCVPRYVEMKQTNKLDIDNISDHFKQDVANTNVFEPGWFNNLDDGVITEVWNQSGEAWKSLLNEMADEEGTEKIVVVVGHPAILLGLVGQCLNLTKEW
IGSFHLDAGSISVLDYPDGPSGRGKRRLQVHRWEEIDIYARAVQRIHELQSQNIYPTNSWEIKQHNLVNGCRESCRELNMLGMPHKRSVSFLVLECHQEGMREALCKHLW
SLHLLPPVYTLKHSAFCFDFRDEVLHGFENLFHRCSFLKLCEHQVLELSSHGCSTLELGGLSLFIYKKSHRNITLDWMSLFLCTDTAHVSFLFSADSNTIKGLLSPIFLD
SISCSISGKLREKKNLTRIASLSLWHNGVEQPSGHGQAAGHFSHVPNRRANSGTRVFRFPWTPPTQSNSAAQHVTVYRLALDHYPIVA