; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G005930 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G005930
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description2-keto-3-deoxy-L-rhamnonate aldolase
Genome locationCmo_Chr06:2927680..2930507
RNA-Seq ExpressionCmoCh06G005930
SyntenyCmoCh06G005930
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0009055 - electron transfer activity (molecular function)
GO:0016832 - aldehyde-lyase activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR005000 - HpcH/HpaI aldolase/citrate lyase domain
IPR008972 - Cupredoxin
IPR015813 - Pyruvate/Phosphoenolpyruvate kinase-like domain superfamily
IPR040442 - Pyruvate kinase-like domain superfamily
IPR041846 - Early nodulin-like protein domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596679.1 hypothetical protein SDJN03_09859, partial [Cucurbita argyrosperma subsp. sororia]3.0e-19799.16Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFLSSSRLHPTAKS SFSFAPPFPKSSSPFRTLFP+ASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

KAG7028215.1 rhmA, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-19798.89Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFLSSSRLHPTAKS SFSFAPPFPKSSSPFRTLFP+ASNSSSKPSIPSPIDSSSSFAAPS+AFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

XP_022950895.1 uncharacterized protein LOC111453851 [Cucurbita moschata]7.8e-19899.72Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

XP_023005809.1 uncharacterized protein LOC111498702 [Cucurbita maxima]1.9e-19698.89Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFL+SSRLHPTAK LSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

XP_023540559.1 uncharacterized protein LOC111800889 [Cucurbita pepo subsp. pepo]1.7e-19297.49Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSI FTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSL+ NPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPES+AAWAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLE  SK+KNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

TrEMBL top hitse value%identityAlignment
A0A0A0L3Q7 HpcH_HpaI domain-containing protein5.7e-16282.83Show/hide
Query:  SFTLTSPFLSSSRLHPTAKSLSF--------SFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFS
        SF +T+PF+SSS+LHP +KSLSF        SF+ PFPKSSS FRTL P+   SSS PS PSPIDSS SFA    A N TLKSRLRNG+TLYG+FLLSFS
Subjt:  SFTLTSPFLSSSRLHPTAKSLSF--------SFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFS

Query:  PTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVR
        P+LAEIAG SGYDFVVVDMEHGYGGISDALPCL AL+A QTPAILRIPE+SA WAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVR
Subjt:  PTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVR

Query:  ASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMP
        ASKYGIDEGYLTNYEDELLIMCQ+ESEQAVKKID+IMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKV+E+MRKAE AVLE  S+ +NGEKG+FLCGFSMP
Subjt:  ASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMP

Query:  HDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        HDGPIDM++RGYQMISGAVD+GLFR+AAVEDVRKFRMSEM+ SED +QPLTHKEEDEEDK+
Subjt:  HDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

A0A5A7UCF1 2-keto-3-deoxy-L-rhamnonate aldolase1.3e-16183.1Show/hide
Query:  SFTLTSPFLSSSRLHPTAKSLSF--------SFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFS
        SF +T+PF+SSS+LHP +KSLSF        SF+ PFPKSSS  RTL PI   SSS PS PSPIDSS SFAA   A N TLKSRLRNG+TLYGLFLLSFS
Subjt:  SFTLTSPFLSSSRLHPTAKSLSF--------SFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFS

Query:  PTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVR
        P+LAEIAG +GYDFVVVDMEHGYGGISDALPCL AL+ATQTPAILRIPE+SA WAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVR
Subjt:  PTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVR

Query:  ASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMP
        ASKYGIDEGYLT YEDELLIMCQ+ESEQAVKKID+IMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKV+E+MRKAE AVLE  S+ +NG+KG+FLCGFSMP
Subjt:  ASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMP

Query:  HDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        HDGPIDM++RGYQMISGAVD+GLFR+AAVEDVRKFRMSEM+ SED DQPLTHKEEDEEDK+
Subjt:  HDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

A0A6J1CW99 uncharacterized protein LOC1110148121.1e-16285.55Show/hide
Query:  SFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAG
        SFT++  FLSSS+L PT KSLSFS + PF    SPFRTLFPI+SNSSS PSIPSPIDSS SFAAPS A N  LKSRLRNG+TLYGLFLLSFSP+LAEIAG
Subjt:  SFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAG

Query:  LSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDE
        L+GYDFVVVDMEHGYGGISDALPCL AL+A QT AILR+PESSAAWAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVRASKYGIDE
Subjt:  LSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDE

Query:  GYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMR
        GYL+NYEDELLIMCQ+ESEQAVKKI++IMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMR+AEKAVL+  SK  NGE+GAFL GFSMPHDGPIDMR
Subjt:  GYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMR

Query:  KRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        KRGY+MISGAVD+GLFRTAAVEDVRKF+MSE+  SED DQPLTH EEDEEDK+
Subjt:  KRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

A0A6J1GH15 uncharacterized protein LOC1114538513.8e-19899.72Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

A0A6J1L0B5 uncharacterized protein LOC1114987029.3e-19798.89Show/hide
Query:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
        MAASSISFTLTSPFL+SSRLHPTAK LSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT
Subjt:  MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPT

Query:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS
        LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSK+AKKAVSYCRFPPAGVRGSAHPVVRAS
Subjt:  LAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRAS

Query:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
        KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD
Subjt:  KYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHD

Query:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF
        GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDK+
Subjt:  GPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEEDKF

SwissProt top hitse value%identityAlignment
A6TBU6 2-keto-3-deoxy-L-rhamnonate aldolase1.1e-3437.56Show/hide
Query:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS
        +L  NP  +  LR GET  GL+L S S  +AEIA  SGYD++++D EH    I D    L+A++   +  ++R  E + +  K+ LD+G + ++ PM+D+
Subjt:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS

Query:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK
        ++ A++ VS  R+PP G RG    V RA+++G  E Y+    DEL ++ Q+ES  A++ +D I+EVDG+D + +GP D+S S+GY  D GH  V+ ++ +
Subjt:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK

Query:  AEKAV
        + + +
Subjt:  AEKAV

A8A2B1 2-keto-3-deoxy-L-rhamnonate aldolase1.5e-3436.59Show/hide
Query:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS
        +L  NP  K RLR GE   GL+L S +  +AEIA  SGYD++++D EH    I D    L+A++   +  ++R  E S    K+ LD+G Q ++ PM+D+
Subjt:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS

Query:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK
        ++ A++ VS  R+PP G RG    V RA+++G  E Y+    D L ++ Q+ES+ A+  +D+I++V+G+D + +GP D+S S+GY  + GH +V+ ++  
Subjt:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK

Query:  AEKAV
        + + +
Subjt:  AEKAV

B1X8V8 2-keto-3-deoxy-L-rhamnonate aldolase1.5e-3436.59Show/hide
Query:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS
        +L  NP  K RLR GE   GL+L S +  +AEIA  SGYD++++D EH    I D    L+A++   +  ++R  E S    K+ LD+G Q ++ PM+D+
Subjt:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS

Query:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK
        ++ A++ VS  R+PP G RG    V RA+++G  E Y+    D L ++ Q+ES+ A+  +D+I++V+G+D + +GP D+S S+GY  + GH +V+ ++  
Subjt:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK

Query:  AEKAV
        + + +
Subjt:  AEKAV

B7NN63 2-keto-3-deoxy-L-rhamnonate aldolase1.5e-3436.59Show/hide
Query:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS
        +L  NP  K RLR GE   GL+L S +  +AEIA  SGYD++++D EH    I D    L+A++   +  ++R  E+S +  K+ LD+G Q ++ PM+D+
Subjt:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS

Query:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK
        +  A++ VS  R+PP G RG    V RA+++G  E Y+    D L ++ Q+ES+ A+  +D+I++V+G+D + +GP D+S S+GY  + GH +V+ ++  
Subjt:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK

Query:  AEKAV
        + + +
Subjt:  AEKAV

C4ZU87 2-keto-3-deoxy-L-rhamnonate aldolase1.5e-3436.59Show/hide
Query:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS
        +L  NP  K RLR GE   GL+L S +  +AEIA  SGYD++++D EH    I D    L+A++   +  ++R  E S    K+ LD+G Q ++ PM+D+
Subjt:  SLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDS

Query:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK
        ++ A++ VS  R+PP G RG    V RA+++G  E Y+    D L ++ Q+ES+ A+  +D+I++V+G+D + +GP D+S S+GY  + GH +V+ ++  
Subjt:  SKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRK

Query:  AEKAV
        + + +
Subjt:  AEKAV

Arabidopsis top hitse value%identityAlignment
AT1G64640.1 early nodulin-like protein 82.8e-3650.56Show/hide
Query:  VVILL--QIHSVICYQYKVGDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFT
        V+ILL  +I  V    YKVGDLD+WG+P   + K+Y  W K HS KIGDSL+FLYPPS+DS+IQVT  ++ SCN KDPIL M DGNS+FN+T  G L+FT
Subjt:  VVILL--QIHSVICYQYKVGDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFT

Query:  SGVAGHCEKNQKLHISVLSGNGSSASAPSSDGALPEISPSYPTVFGGIPAAPMANSSSSSSSLSTKLSFFPVLVAAFA
        S   GHC K QKL +SV      SA A +   +    +PSY   FG I   P++  SS+SSSL   +S F  + A+ A
Subjt:  SGVAGHCEKNQKLHISVLSGNGSSASAPSSDGALPEISPSYPTVFGGIPAAPMANSSSSSSSLSTKLSFFPVLVAAFA

AT3G20570.1 early nodulin-like protein 97.4e-2134.41Show/hide
Query:  LLQIHSVICYQYKVGDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFTSGVAG
        L+ +      ++ VG    W +P+    ++Y  W++    +IGDSL+F+Y  +QDSV+QVT+++Y+SCN   P     DG +   +   G  +F SG   
Subjt:  LLQIHSVICYQYKVGDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFTSGVAG

Query:  HCEKNQKLHISVL---SGNGSSASAPSSDGALP--EISPSYPT--VFGGIPAAPMANSSSSSSSLSTKLSFFPVLV-AAFAGLLIL
        +C+KN+KL + V+   SGN ++AS+P S    P  E +PS P    F   PA     S  + +S ++ LSF   L+ AA A  L L
Subjt:  HCEKNQKLHISVL---SGNGSSASAPSSDGALP--EISPSYPT--VFGGIPAAPMANSSSSSSSLSTKLSFFPVLV-AAFAGLLIL

AT4G10750.1 Phosphoenolpyruvate carboxylase family protein1.1e-10961.1Show/hide
Query:  SSRLHPTAKSLSFSFAP---PFPKSSSPF--RTLFPIASNSSSKPSIPSPIDSSSSFAAPSLA--FNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSG
        +S L+P++  +  S +P      KSS  F  +TL PI  +S       SP D S + A  ++      +LKSRLR GETLYGLFLLSFSPTLAEIA  +G
Subjt:  SSRLHPTAKSLSFSFAP---PFPKSSSPF--RTLFPIASNSSSKPSIPSPIDSSSSFAAPSLA--FNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSG

Query:  YDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYL
        YD+VVVDMEHG GGI +AL C+RAL+A  T AILR+PE+S  WAKKALDLGPQGIMFPMI+S KDA KAVSYCRFPP G+RGSAH VVRAS YGIDEGYL
Subjt:  YDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYL

Query:  TNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMRKRG
        +NY +E+LIMCQ+ES + VKK D+I  VDGVDC+QMGPLD+S S+GYLWDPGHKKVREMM+KAEK+VL +       + GA+L GF+MPHDG  ++R RG
Subjt:  TNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMRKRG

Query:  YQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEE
        Y M++GAVD+GLFR AAVEDVR+F+M  + +S+  D     K+ D+E
Subjt:  YQMISGAVDIGLFRTAAVEDVRKFRMSEMEDSEDLDQPLTHKEEDEE

AT4G24080.1 aldolase like1.7e-6247.71Show/hide
Query:  LAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSS
        +A   +LKSRL++GE L G FLLSFSP LAEIA  +G+DF+VV +EHG GGI                             KKALDLGP GIMFPM+++ 
Subjt:  LAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGYDFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSS

Query:  KDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKA
        + A +AVS+C + P GVRG A+ VVR S +G +EGYL NY D+L IMCQIESE+ +K + +I+ VDG+DC+ MGP D+S S+G L DPG+ KV+ +MR A
Subjt:  KDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMCQIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKA

Query:  EKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFR
        E AVL   S   NG  GA+L G +   D   D++ RGY ++ G+ D+ L++ A V++V  F+
Subjt:  EKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMRKRGYQMISGAVDIGLFRTAAVEDVRKFR

AT4G30590.1 early nodulin-like protein 122.6e-1834.78Show/hide
Query:  GDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFTSGVAGHCEKNQKLHISVLS
        G + SW +P S N  +  +W++ +  K+GD +++ Y    DSV+QVTKE Y SCN  +P+    DGN+   +   G  FF SG  G+C K +K+ + VL+
Subjt:  GDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPILSMKDGNSVFNITTYGDLFFTSGVAGHCEKNQKLHISVLS

Query:  GNGSSASAPSSDGALPEISPSYPTVFGGIPAAPMANSSSSSSSLSTKLSFFPVLVAAFAGL
           S     SS G  P++SP  PT     PA   A + +++  L     +F  L A   GL
Subjt:  GNGSSASAPSSDGALPEISPSYPTVFGGIPAAPMANSSSSSSSLSTKLSFFPVLVAAFAGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTTCTTCAATTTCCTTCACCCTTACTTCTCCATTCCTCTCTTCTTCCAGGCTTCACCCCACCGCCAAATCCCTCTCCTTCTCCTTCGCTCCTCCATTCCCCAA
ATCCTCTTCTCCTTTCCGGACCCTTTTCCCCATTGCTTCCAATTCCTCCTCTAAACCCTCAATTCCATCTCCCATTGACTCCTCGAGTTCTTTCGCCGCCCCTTCGCTTG
CCTTCAATCCAACATTGAAGTCCCGTCTCCGCAATGGCGAAACCCTCTACGGTCTCTTCCTCCTCTCCTTCTCCCCCACTCTTGCCGAGATCGCCGGTCTCTCCGGCTAC
GATTTCGTCGTCGTCGACATGGAGCACGGCTACGGCGGAATCTCTGATGCCCTCCCCTGTCTCCGCGCCCTCTCCGCCACTCAAACTCCAGCCATTCTCCGCATCCCCGA
GAGCTCCGCCGCCTGGGCCAAAAAGGCCCTAGATTTAGGACCGCAGGGTATCATGTTTCCAATGATCGATTCGTCCAAAGATGCGAAAAAAGCTGTTTCATATTGCAGAT
TCCCACCCGCTGGAGTCCGTGGATCGGCCCATCCAGTGGTTAGAGCATCGAAATACGGCATTGACGAAGGGTATTTGACAAATTACGAGGACGAATTACTGATTATGTGT
CAGATTGAATCGGAGCAAGCGGTGAAGAAGATAGATGACATAATGGAAGTGGATGGGGTTGATTGCATTCAAATGGGGCCATTGGACATGAGTGGAAGCATGGGATATCT
ATGGGACCCTGGACACAAGAAGGTTAGAGAAATGATGAGAAAGGCAGAAAAGGCTGTTTTGGAGAGCAAAAGTAAAAGTAAAAATGGTGAGAAGGGAGCCTTCTTATGTG
GGTTTTCAATGCCTCATGATGGGCCAATTGACATGAGGAAGCGTGGATATCAGATGATTTCTGGAGCTGTTGATATTGGTTTGTTTAGAACTGCTGCTGTTGAGGATGTG
AGGAAGTTTAGAATGAGCGAAATGGAAGACTCTGAGGATTTGGATCAGCCTCTTACTCACAAGGAAGAGGATGAAGAAGATAAGTTTCTTCTTCAGCATGTTGTGATTTT
GTTGCAGATCCATAGTGTAATCTGCTATCAGTACAAAGTTGGAGATTTGGATTCATGGGGTCTTCCCACTTCTGAAAATCCAAAGATCTATATGTATTGGTCTAAATATC
ACTCTCTCAAGATTGGTGATTCTCTGATGTTCTTGTATCCACCAAGCCAAGATTCAGTGATTCAAGTTACAAAGGAATCGTACAACAGCTGCAATCTCAAGGATCCAATC
TTGTCCATGAAAGATGGTAACTCTGTTTTCAATATCACTACTTATGGGGATCTGTTCTTCACCAGTGGAGTTGCAGGCCACTGTGAGAAGAATCAGAAGCTTCATATCTC
TGTGCTTTCTGGAAATGGTTCTTCTGCAAGCGCTCCATCTTCTGATGGTGCACTTCCTGAAATCTCCCCCTCTTACCCCACTGTTTTTGGTGGAATCCCAGCAGCTCCCA
TGGCCAATTCCAGCTCTTCCTCGTCCTCATTGTCAACAAAACTCTCATTTTTTCCTGTCTTGGTTGCTGCTTTTGCTGGGCTTTTGATTCTGAGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTTCTTCAATTTCCTTCACCCTTACTTCTCCATTCCTCTCTTCTTCCAGGCTTCACCCCACCGCCAAATCCCTCTCCTTCTCCTTCGCTCCTCCATTCCCCAA
ATCCTCTTCTCCTTTCCGGACCCTTTTCCCCATTGCTTCCAATTCCTCCTCTAAACCCTCAATTCCATCTCCCATTGACTCCTCGAGTTCTTTCGCCGCCCCTTCGCTTG
CCTTCAATCCAACATTGAAGTCCCGTCTCCGCAATGGCGAAACCCTCTACGGTCTCTTCCTCCTCTCCTTCTCCCCCACTCTTGCCGAGATCGCCGGTCTCTCCGGCTAC
GATTTCGTCGTCGTCGACATGGAGCACGGCTACGGCGGAATCTCTGATGCCCTCCCCTGTCTCCGCGCCCTCTCCGCCACTCAAACTCCAGCCATTCTCCGCATCCCCGA
GAGCTCCGCCGCCTGGGCCAAAAAGGCCCTAGATTTAGGACCGCAGGGTATCATGTTTCCAATGATCGATTCGTCCAAAGATGCGAAAAAAGCTGTTTCATATTGCAGAT
TCCCACCCGCTGGAGTCCGTGGATCGGCCCATCCAGTGGTTAGAGCATCGAAATACGGCATTGACGAAGGGTATTTGACAAATTACGAGGACGAATTACTGATTATGTGT
CAGATTGAATCGGAGCAAGCGGTGAAGAAGATAGATGACATAATGGAAGTGGATGGGGTTGATTGCATTCAAATGGGGCCATTGGACATGAGTGGAAGCATGGGATATCT
ATGGGACCCTGGACACAAGAAGGTTAGAGAAATGATGAGAAAGGCAGAAAAGGCTGTTTTGGAGAGCAAAAGTAAAAGTAAAAATGGTGAGAAGGGAGCCTTCTTATGTG
GGTTTTCAATGCCTCATGATGGGCCAATTGACATGAGGAAGCGTGGATATCAGATGATTTCTGGAGCTGTTGATATTGGTTTGTTTAGAACTGCTGCTGTTGAGGATGTG
AGGAAGTTTAGAATGAGCGAAATGGAAGACTCTGAGGATTTGGATCAGCCTCTTACTCACAAGGAAGAGGATGAAGAAGATAAGTTTCTTCTTCAGCATGTTGTGATTTT
GTTGCAGATCCATAGTGTAATCTGCTATCAGTACAAAGTTGGAGATTTGGATTCATGGGGTCTTCCCACTTCTGAAAATCCAAAGATCTATATGTATTGGTCTAAATATC
ACTCTCTCAAGATTGGTGATTCTCTGATGTTCTTGTATCCACCAAGCCAAGATTCAGTGATTCAAGTTACAAAGGAATCGTACAACAGCTGCAATCTCAAGGATCCAATC
TTGTCCATGAAAGATGGTAACTCTGTTTTCAATATCACTACTTATGGGGATCTGTTCTTCACCAGTGGAGTTGCAGGCCACTGTGAGAAGAATCAGAAGCTTCATATCTC
TGTGCTTTCTGGAAATGGTTCTTCTGCAAGCGCTCCATCTTCTGATGGTGCACTTCCTGAAATCTCCCCCTCTTACCCCACTGTTTTTGGTGGAATCCCAGCAGCTCCCA
TGGCCAATTCCAGCTCTTCCTCGTCCTCATTGTCAACAAAACTCTCATTTTTTCCTGTCTTGGTTGCTGCTTTTGCTGGGCTTTTGATTCTGAGCCAATGAGTTCGTTCA
TACATTGAGATTCATTTGAGTTATACTAGTTTGAGAAATGAATTCTTTTTCTCAATTATTATCCCTGCGTTGTTGTGTTCTTAGATGAGTTTGAATTCATCTGTGTCGTT
TAATGGTGTTGCCTATAAGAAATGGCTCTGATTTTCAGGTGTAAAATTCTCAGGCGGCCCTGTAGTTTTGGTCGTGTCAGTTCTAATTTTCATATGCCAGCCACCTAATA
CTCATAACATGGGAATCTCAACTTGGTTTCTAAATTAACCGACATATTTAATCACTCCATAAATTTGTTTTGCTTGCAATTGTCCCTTTTCTTTGGGGTCGGGAAGGTCA
CTCGTCGCCACATTAACCATCATGACCAAAAATCCCGACCCAACCCAACCCGGAG
Protein sequenceShow/hide protein sequence
MAASSISFTLTSPFLSSSRLHPTAKSLSFSFAPPFPKSSSPFRTLFPIASNSSSKPSIPSPIDSSSSFAAPSLAFNPTLKSRLRNGETLYGLFLLSFSPTLAEIAGLSGY
DFVVVDMEHGYGGISDALPCLRALSATQTPAILRIPESSAAWAKKALDLGPQGIMFPMIDSSKDAKKAVSYCRFPPAGVRGSAHPVVRASKYGIDEGYLTNYEDELLIMC
QIESEQAVKKIDDIMEVDGVDCIQMGPLDMSGSMGYLWDPGHKKVREMMRKAEKAVLESKSKSKNGEKGAFLCGFSMPHDGPIDMRKRGYQMISGAVDIGLFRTAAVEDV
RKFRMSEMEDSEDLDQPLTHKEEDEEDKFLLQHVVILLQIHSVICYQYKVGDLDSWGLPTSENPKIYMYWSKYHSLKIGDSLMFLYPPSQDSVIQVTKESYNSCNLKDPI
LSMKDGNSVFNITTYGDLFFTSGVAGHCEKNQKLHISVLSGNGSSASAPSSDGALPEISPSYPTVFGGIPAAPMANSSSSSSSLSTKLSFFPVLVAAFAGLLILSQ