CuGenDBv2

CmoCh20G008370 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G008370
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Histidinol-phosphatase
Genome location: Cmo_Chr20:4178136..4187836
RNA-Seq Expression: CmoCh20G008370
Synteny: CmoCh20G008370
Gene Ontology terms:
GO:0000105 - histidine biosynthetic process (biological process)
GO:0009415 - response to water (biological process)
GO:0046855 - inositol phosphate dephosphorylation (biological process)
GO:0000287 - magnesium ion binding (molecular function)
GO:0004401 - histidinol-phosphatase activity (molecular function)
InterPro domains:
IPR000167 - Dehydrin
IPR000760 - Inositol monophosphatase-like
IPR001841 - Zinc finger, RING-type
IPR011809 - Histidinol-phosphate phosphatase, putative, inositol monophosphatase
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR020583 - Inositol monophosphatase, metal-binding site
IPR036376 - Ribulose bisphosphate carboxylase, large subunit, C-terminal domain superfamily
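
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr20:4178136..4187836")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene region spans 9701 bp on Cmo_Chr20, introns included.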


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010909.1 Bifunctional phosphatase IMPL2, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.0e-189 | % identity: 96.63
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRR QFGSSVMSSNSQFSNIV TSIDMGFDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFIT---GKPLFGTLIALLHRGKPDGVN
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFIT   GKPLFGTLIALLHRGKP    
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFIT---GKPLFGTLIALLHRGKPDGVN

Query:  LQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDF
          ILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFS EADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDF
Subjt:  LQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDF

Query:  LSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        LSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  LSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

XP_022944230.1 uncharacterized protein LOC111448743 isoform X1 [Cucurbita moschata] | e value: 1.3e-176 | % identity: 99.08
Query:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
        +GQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
Subjt:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP

Query:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
        RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
Subjt:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
        CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH

Query:  FSFGSKGSSRVMSDNHHSTRRKALF
        FSFGSKGSSRVMSDNHHSTRRK  F
Subjt:  FSFGSKGSSRVMSDNHHSTRRKALF

XP_022944579.1 bifunctional phosphatase IMPL2, chloroplastic [Cucurbita moschata] | e value: 6.8e-194 | % identity: 98.3
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKP      I
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI

Query:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
        LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
Subjt:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL

Query:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

XP_022986560.1 bifunctional phosphatase IMPL2, chloroplastic [Cucurbita maxima] | e value: 6.2e-187 | % identity: 95.47
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLS SISQSATNF SLSTPAIP KP SLLSHFST SLQSSPSLSVS+PCRR QFGSSVMSSNSQFSNIV TS+DMGFDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALL+RGKP      I
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI

Query:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
        LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
Subjt:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL

Query:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

XP_023512300.1 bifunctional phosphatase IMPL2, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 2.4e-186 | % identity: 95.47
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLS SISQSATNF SLSTPAIP KPT LLSHFST SLQSSPSLSVSLPCRR QFGSSVMSSNSQFSNIV TSIDM FDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALL+RGKP      I
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI

Query:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
        LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEA+EAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
Subjt:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL

Query:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1FTV1 uncharacterized protein LOC111448743 isoform X1 | e value: 6.3e-177 | % identity: 99.08
Query:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
        +GQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
Subjt:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP

Query:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
        RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
Subjt:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
        CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH

Query:  FSFGSKGSSRVMSDNHHSTRRKALF
        FSFGSKGSSRVMSDNHHSTRRK  F
Subjt:  FSFGSKGSSRVMSDNHHSTRRKALF

A0A6J1FV76 uncharacterized protein LOC111448743 isoform X2 | e value: 6.3e-177 | % identity: 99.08
Query:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
        +GQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
Subjt:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP

Query:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
        RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
Subjt:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
        CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH

Query:  FSFGSKGSSRVMSDNHHSTRRKALF
        FSFGSKGSSRVMSDNHHSTRRK  F
Subjt:  FSFGSKGSSRVMSDNHHSTRRKALF

A0A6J1FWY7 Histidinol-phosphatase | e value: 3.3e-194 | % identity: 98.3
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKP      I
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI

Query:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
        LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
Subjt:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL

Query:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

A0A6J1JEC0 uncharacterized protein LOC111484250 | e value: 1.2e-175 | % identity: 98.15
Query:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
        +GQSNSRNVSTDVSLEQVKEAMDSPT SYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTH+SPGHQLLRQGSDNRIRGLKSPGSYLASEDRP
Subjt:  AGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRP

Query:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
        RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
Subjt:  RLPSWSNESVRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
        CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFE+NNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRH

Query:  FSFGSKGSSRVMSDNHHSTRRKALF
        FSFGSKGSSRVMSDNHHSTRRK  F
Subjt:  FSFGSKGSSRVMSDNHHSTRRKALF

A0A6J1JEE0 Histidinol-phosphatase | e value: 3.0e-187 | % identity: 95.47
Query:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE
        MFSLS SISQSATNF SLSTPAIP KP SLLSHFST SLQSSPSLSVS+PCRR QFGSSVMSSNSQFSNIV TS+DMGFDDHDLDRFAEVANRVADAAGE
Subjt:  MFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGE

Query:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI
        VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALL+RGKP      I
Subjt:  VILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQI

Query:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
        LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
Subjt:  LGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL

Query:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
        IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
Subjt:  IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD

SwissProt top hits (e value, % identity, alignment)
P56160 Histidinol-phosphatase | e value: 1.4e-27 | % identity: 31.47
Query:  EVANRVADAAGEVILKYF-RKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIAL
        ++A  +A+ AG++ L YF R+  ++  K D +PVT AD+ AEE +   +   FP   ++GEE       N     W++DPIDGT+SFI G PL+G +IAL
Subjt:  EVANRVADAAGEVILKYF-RKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIAL

Query:  LHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDIS-QAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDL
           G         LG+I+ P L E +    G    +NG  V   + ++ S    ++T   +L    ++    ++R    +     DCY + L+ASG  ++
Subjt:  LHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDIS-QAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDL

Query:  VIESGLKPYDFLSLIPIIKGAGGVITDWKGEE
         ++  + P+D  ++IPI++ AGG   D++G +
Subjt:  VIESGLKPYDFLSLIPIIKGAGGVITDWKGEE

Q6NPM8 Bifunctional phosphatase IMPL2, chloroplastic | e value: 4.5e-124 | % identity: 66.57
Query:  TPAIPSKPTSL--LSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGEVILKYFRKKFEIIDKAD
        +PA+ S   SL   S +S   L    S ++++P  R +F    M+SNS+  NI + S      D +LDRFA V N +ADA+GEVI KYFRKKF+I+DK D
Subjt:  TPAIPSKPTSL--LSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVADAAGEVILKYFRKKFEIIDKAD

Query:  FSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQILGIIDQPVLRERWIGIS
         SPVT+ADQ AEE+MVS++ +N PSHAIYGEEKGWRCKE SAD+VWVLDPIDGTKSFITGKP+FGTLIALL++GKP      ILG+IDQP+L+ERWIG++
Subjt:  FSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDGVNLQILGIIDQPVLRERWIGIS

Query:  GRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGE
        GRRT LNG+++STRSC  +SQAYLYTTSPHLFS EA++A++RVR+KVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFL+L+P+I+GAGG ITDW G+
Subjt:  GRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGE

Query:  ELYWEALPNSPATSFNVLAAGDKVIHQQALESLRW
           WEA  ++ ATSFNV+AAGD  IHQQALESL W
Subjt:  ELYWEALPNSPATSFNVLAAGDKVIHQQALESLRW

Q8NS80 Histidinol-phosphatase | e value: 1.9e-21 | % identity: 31.49
Query:  VANRVADAAGEVILKYFR-KKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALL
        +A  +A+ A  + L  F     E+  K D +PV+ AD A EE++   +    P+ +I GEE G   + +     W++DPIDGTK+++ G P++ TLIALL
Subjt:  VANRVADAAGEVILKYFR-KKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALL

Query:  HRGKPDGVNLQILGIIDQPVLRERWIGISGRRT--ALNGQNVSTRSCSDISQAYLYTTSPHLFSGEA-----DEAFARVRNKVKVPLYGCDCYAYALLAS
          GKP      + G+I  P L  RW    G       NG +    S S +S+    + S    SG A     D+  +      ++  YG D ++Y L+A 
Subjt:  HRGKPDGVNLQILGIIDQPVLRERWIGISGRRT--ALNGQNVSTRSCSDISQAYLYTTSPHLFSGEA-----DEAFARVRNKVKVPLYGCDCYAYALLAS

Query:  GFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKG
        G VD+  E  +  +D   L  ++  AGG  T   G
Subjt:  GFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKG

Q9CNV8 Nus factor SuhB | e value: 2.9e-22 | % identity: 26.44
Query:  VANRVADAAGEVILKYFRKKFEIID--KADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIAL
        +A R A  AG VI K + ++ ++    K+    VT  D+A+EE+++ V+ +++P H I  EE G   +   +D  WV+DP+DGT +F+ G P F   IA+
Subjt:  VANRVADAAGEVILKYFRKKFEIID--KADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIAL

Query:  LHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVK----VPLYGCDCYAYALLASGF
          +G+ +      +G++  P+  E +  + G    +N   +   +  D++   L T  P   +      FA + N ++        G        +A+G 
Subjt:  LHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVK----VPLYGCDCYAYALLASGF

Query:  VDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQ
        VD   E G+K +D  +   I++ AGG++ D+ G   Y         TS +++AA  +++ +
Subjt:  VDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQ

Q9HXI4 Nus factor SuhB | e value: 1.5e-21 | % identity: 29.75
Query:  VANRVADAAGEVILKYFRK--KFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGW-RCKENSADFVWVLDPIDGTKSFITGKPLFGTLIA
        +A R A +AGE+I +   +     + +K     VT  D+AAE+++V+ L + +P+HAI GEE G+       AD++WV+DP+DGT +FI G P F   IA
Subjt:  VANRVADAAGEVILKYFRK--KFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGW-RCKENSADFVWVLDPIDGTKSFITGKPLFGTLIA

Query:  LLHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFS--GEADEAFARVRNKV----KVPLYGCDCYAYALLA
          ++G+ +        ++  PV +E +    GR  ALNG+ +       +  A L T  P   +     D      R+ V     +   G      A +A
Subjt:  LLHRGKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFS--GEADEAFARVRNKV----KVPLYGCDCYAYALLA

Query:  SGFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGEELYWE
        +G  D   E GL  +D  +   +++ AGG+++D+ G   + E
Subjt:  SGFVDLVIESGLKPYDFLSLIPIIKGAGGVITDWKGEELYWE

Arabidopsis top hits (e value, % identity, alignment)
AT4G39120.1 myo-inositol monophosphatase like 2 | e value: 7.2e-125 | % identity: 63.59
Query:  SRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSS---------PSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVA
        S+ ++QS  +F S S   IP +  +L S   +  + SS          S ++++P  R +F    M+SNS+  NI + S      D +LDRFA V N +A
Subjt:  SRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSS---------PSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFDDHDLDRFAEVANRVA

Query:  DAAGEVILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDG
        DA+GEVI KYFRKKF+I+DK D SPVT+ADQ AEE+MVS++ +N PSHAIYGEEKGWRCKE SAD+VWVLDPIDGTKSFITGKP+FGTLIALL++GKP  
Subjt:  DAAGEVILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHRGKPDG

Query:  VNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPY
            ILG+IDQP+L+ERWIG++GRRT LNG+++STRSC  +SQAYLYTTSPHLFS EA++A++RVR+KVKVPLYGCDCYAYALLASGFVDLVIESGLKPY
Subjt:  VNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPY

Query:  DFLSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRW
        DFL+L+P+I+GAGG ITDW G+   WEA  ++ ATSFNV+AAGD  IHQQALESL W
Subjt:  DFLSLIPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRW

AT4G39140.1 RING/U-box superfamily protein | e value: 1.3e-57 | % identity: 46.27
Query:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS
        RN S +   EQ +  + +S   SY SPA+LSLSL S  SS  TSPLSSQSYL P +SS  + T Q P  +L +Q SD +I G  S     A+E+R   P+
Subjt:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS

Query:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
                S  G S+ WS+ AFSE+M++S   +     ++D+D FG   +KI    +R+S       TCG CS+ L+EKS WSSQ+I   NELSV+A+L 
Subjt:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG
        CGHVYH +CLE MTPEI K+DP+CP+CT GEK T KLSEKALK EMD K+ + KR +N   DS F+       ++  +  A   K  R+ +SSS +S S 
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG

Query:  KPFLKRHFSFGSKGSSRVMSDN
        KPFL RHFSFGS+ + +   +N
Subjt:  KPFLKRHFSFGSKGSSRVMSDN

AT4G39140.2 RING/U-box superfamily protein | e value: 1.3e-57 | % identity: 46.27
Query:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS
        RN S +   EQ +  + +S   SY SPA+LSLSL S  SS  TSPLSSQSYL P +SS  + T Q P  +L +Q SD +I G  S     A+E+R   P+
Subjt:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS

Query:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
                S  G S+ WS+ AFSE+M++S   +     ++D+D FG   +KI    +R+S       TCG CS+ L+EKS WSSQ+I   NELSV+A+L 
Subjt:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG
        CGHVYH +CLE MTPEI K+DP+CP+CT GEK T KLSEKALK EMD K+ + KR +N   DS F+       ++  +  A   K  R+ +SSS +S S 
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG

Query:  KPFLKRHFSFGSKGSSRVMSDN
        KPFL RHFSFGS+ + +   +N
Subjt:  KPFLKRHFSFGSKGSSRVMSDN

AT4G39140.3 RING/U-box superfamily protein | e value: 1.3e-57 | % identity: 46.27
Query:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS
        RN S +   EQ +  + +S   SY SPA+LSLSL S  SS  TSPLSSQSYL P +SS  + T Q P  +L +Q SD +I G  S     A+E+R   P+
Subjt:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS

Query:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
                S  G S+ WS+ AFSE+M++S   +     ++D+D FG   +KI    +R+S       TCG CS+ L+EKS WSSQ+I   NELSV+A+L 
Subjt:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG
        CGHVYH +CLE MTPEI K+DP+CP+CT GEK T KLSEKALK EMD K+ + KR +N   DS F+       ++  +  A   K  R+ +SSS +S S 
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG

Query:  KPFLKRHFSFGSKGSSRVMSDN
        KPFL RHFSFGS+ + +   +N
Subjt:  KPFLKRHFSFGSKGSSRVMSDN

AT4G39140.4 RING/U-box superfamily protein | e value: 1.3e-57 | % identity: 46.27
Query:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS
        RN S +   EQ +  + +S   SY SPA+LSLSL S  SS  TSPLSSQSYL P +SS  + T Q P  +L +Q SD +I G  S     A+E+R   P+
Subjt:  RNVSTDVSLEQVKE-AMDSPTVSYKSPAKLSLSLPS-TSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPS

Query:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT
                S  G S+ WS+ AFSE+M++S   +     ++D+D FG   +KI    +R+S       TCG CS+ L+EKS WSSQ+I   NELSV+A+L 
Subjt:  WSNESVRDSHGGSSDCWSVHAFSELMATSHRERW----SFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLT

Query:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG
        CGHVYH +CLE MTPEI K+DP+CP+CT GEK T KLSEKALK EMD K+ + KR +N   DS F+       ++  +  A   K  R+ +SSS +S S 
Subjt:  CGHVYHADCLESMTPEIYKYDPACPVCTFGEKHTQKLSEKALKAEMDWKSLY-KRSKNCNADSHFEA------NNPFKNHARLEKGSRMSASSSTRSSSG

Query:  KPFLKRHFSFGSKGSSRVMSDN
        KPFL RHFSFGS+ + +   +N
Subjt:  KPFLKRHFSFGSKGSSRVMSDN
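
The middle line of each alignment block above can be read to count identical positions. The sketch below assumes the usual BLAST display convention (a letter on the middle line marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function name `block_identity` is hypothetical. It scores a single displayed block, so it will not necessarily reproduce the whole-alignment % identity reported for a hit.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, alignment columns) for one block.

    ASSUMPTION: letters on the middle line mark identical residues.
    The middle line is padded because trailing spaces are often lost.
    """
    columns = len(query)
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, columns


# Last block of the XP_022944230.1 alignment shown earlier on this page.
query = "FSFGSKGSSRVMSDNHHSTRRKALF"
midline = "FSFGSKGSSRVMSDNHHSTRRK  F"
identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.1f}%)")
```

Gap characters in the query line count as alignment columns here, which is one of several possible denominators for a percentage.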


Sequences
CDS sequence
ATGGCAGGTCAATCAAATTCAAGGAATGTATCCACAGATGTGAGTTTAGAACAGGTGAAGGAGGCTATGGATTCTCCCACAGTCTCATATAAGTCTCCTGCAAAATTGTC
ACTTTCATTGCCTTCCACTTCATCTTTGTCAACCTCTCCTTTGTCATCACAGAGCTATCTGCCTCCAACCAATTCATCCCTAACAAGATGTACCCATCAATCTCCAGGAC
ATCAGCTGTTGCGTCAAGGTTCTGACAACCGAATCCGAGGATTAAAATCACCAGGTAGTTATTTAGCTTCTGAAGATAGACCAAGGCTTCCCTCTTGGAGCAACGAATCG
GTCCGTGACTCACATGGAGGGTCTTCAGATTGTTGGTCTGTCCATGCCTTCTCTGAGCTCATGGCCACTTCTCATCGAGAAAGATGGTCTTTTGATAGCGACTCTTTCGG
CTTTAACGGTGAGAAAATAGCCAGGTCTAGTAGTCGAATTTCAACCTCCTCAGTTGATTTGCAAACTTGCGGAGTTTGCTCAAAACTATTGACCGAGAAATCCTCATGGA
GTAGCCAAAGGATCATTGCTAACAACGAGCTTTCTGTTGCTGCTGTACTAACCTGTGGACATGTTTATCATGCCGACTGCCTGGAGAGTATGACACCCGAGATTTACAAG
TACGACCCCGCTTGTCCCGTTTGTACCTTCGGGGAGAAGCACACGCAGAAGCTGTCCGAAAAGGCGCTTAAAGCTGAAATGGACTGGAAGAGTCTATACAAGAGATCCAA
AAACTGTAATGCAGATAGCCATTTTGAAGCAAACAATCCTTTCAAAAACCATGCCCGTCTCGAAAAGGGCTCTAGAATGTCTGCGAGTTCTAGCACGAGAAGCTCCTCAG
GAAAGCCGTTCTTGAAACGGCACTTCTCCTTCGGCTCGAAGGGATCTTCTAGAGTGATGTCTGATAATCATCACTCCACAAGAAGGAAAGCTCTGTTCCGTGGTCCATTG
CAGGCAGCGAAATGGTGCTGCTCCCTCGATCCGGGTGCCTTCCAGGAATTGTTTGTAGTGTTTGAATGTTTCATGAAACAGGAATTCTGCGGTATTTATTTCACTCAAGA
TTGGGTCTCTTTACCATGTGTTCTGCCAGTGGCTTCCGGTGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGAGGGACGTGATCTTGCTC
GTGAGGAACCTTCATCAATGGCGAATGTACGTGACGAGTTCGGCAACCCGGTCCAGCTGACGGACGAACGTGGGAACCCGGTTCAGCTGACTGACGAATTCGGCAACCCT
GTCCAACTCACTGGAGTTGCCACCACTGGGTCCCAGCCGGTTGGGAACCCGGTTCAGATGAGTGAGGAATTGGGCAACCCCATGCACCCGACCGGAGTTTCCACCACTGG
GTTCAGCCCCGTTGGGAACCCGGTTCAGATGAGTGAGGAATTGGGCAACCCCATGCACCCTACTGGAGTTGCCACCACTGGGTCCAAGCCCGTGTCGAGTGGCGGGGACT
CAGAGGACGACGGGCAAGGCGGGAGGAGGAAGAAGAAGAAGGGGTTGACGCAGAAGATAAAGGAGAAGTTAACAGGAAAACACAAGGAGGCCGGGACGACGACGACGACA
ACGGGGGCGGGGCACCACCAGAGTACCACCGTGACATCGACGGTGGACGTCGAACACCAAGAACATGTCCCCCCTCTCGTGCTCGCTACGATGTTCTCTCTTTCGCGTTC
AATTTCTCAATCCGCCACCAATTTCCTCTCTCTTTCGACCCCTGCAATTCCCTCTAAACCCACCTCTCTTTTGTCCCATTTCTCCACTTTCTCACTACAATCGTCTCCTT
CCCTCTCCGTTTCTCTCCCTTGCCGCCGTTACCAGTTCGGTTCTTCAGTAATGTCGTCCAATTCCCAGTTCTCTAACATTGTGCACACTTCCATTGACATGGGTTTTGAT
GATCATGACCTCGATCGATTTGCCGAAGTTGCGAACAGGGTCGCGGATGCTGCTGGGGAAGTGATTCTGAAGTACTTTCGCAAGAAATTTGAGATCATTGATAAGGCGGA
TTTCAGTCCTGTGACTGTGGCTGATCAAGCAGCGGAGGAATCTATGGTTTCAGTTTTAATGGAGAACTTTCCTTCTCATGCTATATATGGTGAGGAAAAAGGATGGAGAT
GCAAAGAGAATTCAGCTGATTTTGTTTGGGTTTTAGATCCAATAGATGGCACAAAGAGTTTTATTACTGGAAAACCCCTATTTGGCACTCTGATTGCATTGTTACATAGA
GGGAAACCAGATGGTGTTAATCTTCAGATTCTGGGTATCATTGATCAGCCTGTTCTGAGAGAAAGATGGATAGGAATAAGTGGGAGGAGAACTGCATTGAATGGGCAAAA
TGTATCTACAAGATCTTGTTCAGACATATCACAGGCCTATTTATATACTACAAGCCCTCATCTATTTAGTGGAGAAGCCGATGAGGCATTTGCTCGTGTGAGGAACAAGG
TAAAAGTTCCCCTGTATGGTTGTGACTGCTATGCGTATGCCCTTTTAGCATCTGGGTTTGTAGATCTTGTCATTGAATCTGGGCTCAAGCCCTATGATTTTCTCTCTCTA
ATACCCATAATCAAAGGGGCTGGTGGTGTGATAACCGATTGGAAAGGAGAGGAGCTTTATTGGGAAGCCTTGCCTAATTCCCCTGCAACAAGTTTTAATGTGTTAGCAGC
TGGGGATAAAGTTATCCATCAACAAGCCCTTGAATCATTAAGATGGGACTGA
mRNA sequence
ATGGCAGGTCAATCAAATTCAAGGAATGTATCCACAGATGTGAGTTTAGAACAGGTGAAGGAGGCTATGGATTCTCCCACAGTCTCATATAAGTCTCCTGCAAAATTGTC
ACTTTCATTGCCTTCCACTTCATCTTTGTCAACCTCTCCTTTGTCATCACAGAGCTATCTGCCTCCAACCAATTCATCCCTAACAAGATGTACCCATCAATCTCCAGGAC
ATCAGCTGTTGCGTCAAGGTTCTGACAACCGAATCCGAGGATTAAAATCACCAGGTAGTTATTTAGCTTCTGAAGATAGACCAAGGCTTCCCTCTTGGAGCAACGAATCG
GTCCGTGACTCACATGGAGGGTCTTCAGATTGTTGGTCTGTCCATGCCTTCTCTGAGCTCATGGCCACTTCTCATCGAGAAAGATGGTCTTTTGATAGCGACTCTTTCGG
CTTTAACGGTGAGAAAATAGCCAGGTCTAGTAGTCGAATTTCAACCTCCTCAGTTGATTTGCAAACTTGCGGAGTTTGCTCAAAACTATTGACCGAGAAATCCTCATGGA
GTAGCCAAAGGATCATTGCTAACAACGAGCTTTCTGTTGCTGCTGTACTAACCTGTGGACATGTTTATCATGCCGACTGCCTGGAGAGTATGACACCCGAGATTTACAAG
TACGACCCCGCTTGTCCCGTTTGTACCTTCGGGGAGAAGCACACGCAGAAGCTGTCCGAAAAGGCGCTTAAAGCTGAAATGGACTGGAAGAGTCTATACAAGAGATCCAA
AAACTGTAATGCAGATAGCCATTTTGAAGCAAACAATCCTTTCAAAAACCATGCCCGTCTCGAAAAGGGCTCTAGAATGTCTGCGAGTTCTAGCACGAGAAGCTCCTCAG
GAAAGCCGTTCTTGAAACGGCACTTCTCCTTCGGCTCGAAGGGATCTTCTAGAGTGATGTCTGATAATCATCACTCCACAAGAAGGAAAGCTCTGTTCCGTGGTCCATTG
CAGGCAGCGAAATGGTGCTGCTCCCTCGATCCGGGTGCCTTCCAGGAATTGTTTGTAGTGTTTGAATGTTTCATGAAACAGGAATTCTGCGGTATTTATTTCACTCAAGA
TTGGGTCTCTTTACCATGTGTTCTGCCAGTGGCTTCCGGTGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGAGGGACGTGATCTTGCTC
GTGAGGAACCTTCATCAATGGCGAATGTACGTGACGAGTTCGGCAACCCGGTCCAGCTGACGGACGAACGTGGGAACCCGGTTCAGCTGACTGACGAATTCGGCAACCCT
GTCCAACTCACTGGAGTTGCCACCACTGGGTCCCAGCCGGTTGGGAACCCGGTTCAGATGAGTGAGGAATTGGGCAACCCCATGCACCCGACCGGAGTTTCCACCACTGG
GTTCAGCCCCGTTGGGAACCCGGTTCAGATGAGTGAGGAATTGGGCAACCCCATGCACCCTACTGGAGTTGCCACCACTGGGTCCAAGCCCGTGTCGAGTGGCGGGGACT
CAGAGGACGACGGGCAAGGCGGGAGGAGGAAGAAGAAGAAGGGGTTGACGCAGAAGATAAAGGAGAAGTTAACAGGAAAACACAAGGAGGCCGGGACGACGACGACGACA
ACGGGGGCGGGGCACCACCAGAGTACCACCGTGACATCGACGGTGGACGTCGAACACCAAGAACATGTCCCCCCTCTCGTGCTCGCTACGATGTTCTCTCTTTCGCGTTC
AATTTCTCAATCCGCCACCAATTTCCTCTCTCTTTCGACCCCTGCAATTCCCTCTAAACCCACCTCTCTTTTGTCCCATTTCTCCACTTTCTCACTACAATCGTCTCCTT
CCCTCTCCGTTTCTCTCCCTTGCCGCCGTTACCAGTTCGGTTCTTCAGTAATGTCGTCCAATTCCCAGTTCTCTAACATTGTGCACACTTCCATTGACATGGGTTTTGAT
GATCATGACCTCGATCGATTTGCCGAAGTTGCGAACAGGGTCGCGGATGCTGCTGGGGAAGTGATTCTGAAGTACTTTCGCAAGAAATTTGAGATCATTGATAAGGCGGA
TTTCAGTCCTGTGACTGTGGCTGATCAAGCAGCGGAGGAATCTATGGTTTCAGTTTTAATGGAGAACTTTCCTTCTCATGCTATATATGGTGAGGAAAAAGGATGGAGAT
GCAAAGAGAATTCAGCTGATTTTGTTTGGGTTTTAGATCCAATAGATGGCACAAAGAGTTTTATTACTGGAAAACCCCTATTTGGCACTCTGATTGCATTGTTACATAGA
GGGAAACCAGATGGTGTTAATCTTCAGATTCTGGGTATCATTGATCAGCCTGTTCTGAGAGAAAGATGGATAGGAATAAGTGGGAGGAGAACTGCATTGAATGGGCAAAA
TGTATCTACAAGATCTTGTTCAGACATATCACAGGCCTATTTATATACTACAAGCCCTCATCTATTTAGTGGAGAAGCCGATGAGGCATTTGCTCGTGTGAGGAACAAGG
TAAAAGTTCCCCTGTATGGTTGTGACTGCTATGCGTATGCCCTTTTAGCATCTGGGTTTGTAGATCTTGTCATTGAATCTGGGCTCAAGCCCTATGATTTTCTCTCTCTA
ATACCCATAATCAAAGGGGCTGGTGGTGTGATAACCGATTGGAAAGGAGAGGAGCTTTATTGGGAAGCCTTGCCTAATTCCCCTGCAACAAGTTTTAATGTGTTAGCAGC
TGGGGATAAAGTTATCCATCAACAAGCCCTTGAATCATTAAGATGGGACTGATAATTCTAGACACTCACAAGAGAGAATGGTTTGTGACTGCAACTTTTGGCCTCTTGGA
CTGAGTAACACAGAATAAATAAGCAAATGCAAAGCATATCTAGTCTAGGTTCATTTTTTAGAAGAAACAGATCCTTCACTCTTCCCTCACTGTTCATGTTAACGTAGGGG
TCTCATGTTAAGGGAGGGGCAATAGCTCTCTTATGACGGAGTTTAGTGCTTTTGACATAGCCATTATATGTTAATCAGTGATTAAAATTGCCCGCCAATCGTAGATCTTA
TGGCTTATGGTTTCTTTGCATTACCTTAATAGGTTAAGAATATATTATTCATGAGATCATTTCTTGAATACTGTCTATGAATGTATAATCTTCATTATGTGTACCTTGAG
AAATTTCCAGGGGCATCTTTTTCCTTGCAATTTTCTATGCTG
Protein sequence
MAGQSNSRNVSTDVSLEQVKEAMDSPTVSYKSPAKLSLSLPSTSSLSTSPLSSQSYLPPTNSSLTRCTHQSPGHQLLRQGSDNRIRGLKSPGSYLASEDRPRLPSWSNES
VRDSHGGSSDCWSVHAFSELMATSHRERWSFDSDSFGFNGEKIARSSSRISTSSVDLQTCGVCSKLLTEKSSWSSQRIIANNELSVAAVLTCGHVYHADCLESMTPEIYK
YDPACPVCTFGEKHTQKLSEKALKAEMDWKSLYKRSKNCNADSHFEANNPFKNHARLEKGSRMSASSSTRSSSGKPFLKRHFSFGSKGSSRVMSDNHHSTRRKALFRGPL
QAAKWCCSLDPGAFQELFVVFECFMKQEFCGIYFTQDWVSLPCVLPVASGGAVANRVALEACVQARNEGRDLAREEPSSMANVRDEFGNPVQLTDERGNPVQLTDEFGNP
VQLTGVATTGSQPVGNPVQMSEELGNPMHPTGVSTTGFSPVGNPVQMSEELGNPMHPTGVATTGSKPVSSGGDSEDDGQGGRRKKKKGLTQKIKEKLTGKHKEAGTTTTT
TGAGHHQSTTVTSTVDVEHQEHVPPLVLATMFSLSRSISQSATNFLSLSTPAIPSKPTSLLSHFSTFSLQSSPSLSVSLPCRRYQFGSSVMSSNSQFSNIVHTSIDMGFD
DHDLDRFAEVANRVADAAGEVILKYFRKKFEIIDKADFSPVTVADQAAEESMVSVLMENFPSHAIYGEEKGWRCKENSADFVWVLDPIDGTKSFITGKPLFGTLIALLHR
GKPDGVNLQILGIIDQPVLRERWIGISGRRTALNGQNVSTRSCSDISQAYLYTTSPHLFSGEADEAFARVRNKVKVPLYGCDCYAYALLASGFVDLVIESGLKPYDFLSL
IPIIKGAGGVITDWKGEELYWEALPNSPATSFNVLAAGDKVIHQQALESLRWD
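
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 12 nt of the CDS listed on this page.
print(translate("ATGGCAGGTCAATCAAATTCAAGG"))
print(translate("AGATGGGACTGA"))
```

The two fragments translate to MAGQSNSR and RWD, matching the first and last residues of the protein sequence listed above.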