CuGenDBv2

Moc05g32430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g32430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF604)
Genome location: chr5:24269697..24273512
RNA-Seq Expression: Moc05g32430
Synteny: Moc05g32430
Gene Ontology terms:
  GO:0016020 - membrane (cellular component)
  GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domains: IPR006740 - Protein of unknown function DUF604
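
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the names `GenomeInterval` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeInterval(NamedTuple):
    """A parsed location such as 'chr5:24269697..24273512' (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeInterval:
    """Parse 'seqid:start..end' into a GenomeInterval."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeInterval(seqid, start, end)


loc = parse_location("chr5:24269697..24273512")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span is longer than the coding sequence listed further down, which would be consistent with introns or untranslated regions, although the record itself does not describe the gene structure.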


Homology
GenBank top hits (e-value, % identity, alignment)
KAF8029300.1 hypothetical protein BT93_E1857 [Corymbia citriodora subsp. variegata]; e-value 2.3e-172; identity 62.14%
Query:  SNFFKISLAL--ISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVW
        S F K +L L   +SI+ V  F  S  ++L  C +C  R  Q S  R+I +    AEA    TNVSH+LFGI G+  TW +R  Y  LWWRPNVTRGFVW
Subjt:  SNFFKISLAL--ISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVW

Query:  VDEKPN---ATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDL
        +D++P+     W A SPPYRVSA+TS+FK++CWYGSRSAIR+ARIVKESFELG+  VRWFVMGDDDTVFF+ENLV VL KYDH QMYYIG+NSESVEQD+
Subjt:  VDEKPN---ATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDL

Query:  IHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTK
        IHSY  AYGGGG+A+SYPLAAEL R+LDGC+DRY  +YG DQKVQ C+SEIGVP+T E GFHQVDIRGN YG LAAHP+APLVSLHH+DY+  IFP+M +
Subjt:  IHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTK

Query:  LDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPN
        +D+LK L+T  +LDPGRTLQ SFCYD ARNWSVSVSWGY+V+LYP LVT KE+E   QTF+TW+SW++ PFTFNTR VGP PC+ PLV+FLD V  VG  
Subjt:  LDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPN

Query:  QTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        QT+T Y++     E+EC    +  A  V+ F  V+S       W KAPRRQCC+V  GT + G     +V V IRGC P E+VTPP
Subjt:  QTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

KAG6580525.1 hypothetical protein SDJN03_20527, partial [Cucurbita argyrosperma subsp. sororia]; e-value 3.3e-216; identity 75.56%
Query:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV
        Q+S+KS K  FFKISL L  SI F+SL L SF+ K  NC NCH        RRKI A  + +   +  TNVSHLLFGIAGSTKTWKKR+ YC+LWW PNV
Subjt:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV

Query:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE
        TRGFVWVDEKPNATW A+SPPYRVSADTS F Y+CWYGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLV+VLGKYDH QMYYIGSNSESVE
Subjt:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE

Query:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA
        Q++IH Y TAYGGGGYAISY LA ELVRILDGC+DRY SLYGGDQKVQACV+EIGVPLT E GFHQVDIRG+QYG LAAHPVAPLVSLHHVDYLP IFP 
Subjt:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA

Query:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP
        M ++DALK LKT ++LDPGRTLQQSFCY  ARNWS+SVSWGY+V+LYP L TPK+MEK+FQTF+TWKSW+DGPFTFNTRPV  DPC+MP++FFLD    P
Subjt:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP

Query:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        N+TVT Y + +DVWE+EC RDEFQ AQ VERF+VVT G  SA+ W+KAPRRQCCQVV+GT+     +DSVVNV +RGCNPFETVTPP
Subjt:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

XP_022145686.1 uncharacterized protein LOC111015080 [Momordica charantia]; e-value 2.3e-193; identity 100%
Query:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF
        MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF
Subjt:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF

Query:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ
        HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ
Subjt:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ

Query:  TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG
        TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG
Subjt:  TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG

Query:  GGVDSVVNVHIRGCNPFETVTPP
        GGVDSVVNVHIRGCNPFETVTPP
Subjt:  GGVDSVVNVHIRGCNPFETVTPP

XP_022935184.1 uncharacterized protein LOC111442138 [Cucurbita moschata]; e-value 8.8e-217; identity 75.56%
Query:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV
        Q+S+KS K  FFKISL L  SI F+SL L SF+ K  NC NCH        RRKI A  + +   +  TNVSHLLFGIAGSTKTWKKR+ YC+LWW PNV
Subjt:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV

Query:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE
        TRGFVWVDEKPNATW A+SPPYRVSADTS F Y+CWYGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLV+VLGKYDH QMYYIGSNSESVE
Subjt:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE

Query:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA
        Q++IH Y TAYGGGGYAISY LA ELVRILDGC+DRY SLYGGDQKVQACV+EIGVPLT E GFHQVDIRG+QYG LAAHPVAPLVSLHHVDYLP IFP 
Subjt:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA

Query:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP
        M ++DALK LKT ++LDPGRTLQQSFCYD ARNWS+SVSWGY+V+LYP L TPK++EK+FQTF+TWKSW+DGPFTFNTRPV  DPC+MP++FFLD    P
Subjt:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP

Query:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        N+TVT Y + +DVWE+ECGRDEFQ AQ VERF+VVT G  SA+ W+KAPRRQCCQVV+GT+     +D+VVNV  RGCNPFETVTPP
Subjt:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

XP_022983299.1 uncharacterized protein LOC111481920 [Cucurbita maxima]; e-value 1.4e-217; identity 75.56%
Query:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV
        Q+S+KSRK  FFKISL L  SI F+SL L SF+ K  NC +CH        RRKI A  + +   +  TN+SHLLFGIAGSTKTWKKR+ YC+LWW PNV
Subjt:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV

Query:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE
        TRGFVWVDEKPNATW A+SPPYRVSADTS F Y+CWYGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLV+VLGKYDH QMYYIGSNSESVE
Subjt:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE

Query:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA
        QD+IH Y TAYGGGGYAISY LA ELVRILDGC+DRY SLYGGDQKVQACV+EIGVPLT E GFHQVDIRG+QYG LAAHPVAPLVSLHHVDYLP IFP 
Subjt:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA

Query:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP
        M K+DALK LKT ++LDPGRTLQQSFCYD ARNWS+SVSWGY+V+LYP L TPK+MEK+FQTF+TWKSW+DGPFTFNTRPV  DPCEMP++FFLD V  P
Subjt:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP

Query:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        N+TVT Y + +DVWE+EC +DEFQ  Q VERF+VVTSG  S++ W+KAPRRQCCQVV+ T+     +D+VVNV +RGCNPFETVTPP
Subjt:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

TrEMBL top hits (e-value, % identity, alignment)
A0A2P6R9S4 Uncharacterized protein; e-value 5.4e-172; identity 63.86%
Query:  HHRTLQTSV--RRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEK--PNATWHASSPPYRVSADTSNFKYSCWYG
        HH  +  ++   RKIT        D   TN+SH+LFGI GS K+W KR  Y  LWW+PNVTRGF+W+D+K  PN TW  +SP Y+VSADTS FKYSCWYG
Subjt:  HHRTLQTSV--RRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEK--PNATWHASSPPYRVSADTSNFKYSCWYG

Query:  SRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYD
        SRSAIR+ARIVKESFELG ENVRWFVMGDDDTVFF +NLV+VL KYDH QMYYIG NSESVEQD+IHSY  AYGGGG+AISYPLAAELVR+LDGC+DRYD
Subjt:  SRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYD

Query:  SLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSV
        + YG DQKV+ C+SEIGVPLT E GFHQVDIRG+ YG LAAHPVAPLVSLHH+DY+  +FP +T +D++K L  V+ +DPGRTLQ SFCYD  RNWSVSV
Subjt:  SLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSV

Query:  SWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVT
        SWGY+V+LYP LVT K +E   +TFQTW+SW  GP+TFNTR V PDPC+ PLV+FLD V  VG ++T+T Y ++ +  E++C + ++ AA  V+ F  V+
Subjt:  SWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVT

Query:  SGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        +       W KAPRRQCC+++DG    G GV +VV + +RGCN FE+VTPP
Subjt:  SGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

A0A6J1CX21 uncharacterized protein LOC111015080; e-value 1.1e-193; identity 100%
Query:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF
        MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF
Subjt:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF

Query:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ
        HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ
Subjt:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ

Query:  TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG
        TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG
Subjt:  TWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNG

Query:  GGVDSVVNVHIRGCNPFETVTPP
        GGVDSVVNVHIRGCNPFETVTPP
Subjt:  GGVDSVVNVHIRGCNPFETVTPP

A0A6J1F4P4 uncharacterized protein LOC111442138; e-value 4.3e-217; identity 75.56%
Query:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV
        Q+S+KS K  FFKISL L  SI F+SL L SF+ K  NC NCH        RRKI A  + +   +  TNVSHLLFGIAGSTKTWKKR+ YC+LWW PNV
Subjt:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV

Query:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE
        TRGFVWVDEKPNATW A+SPPYRVSADTS F Y+CWYGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLV+VLGKYDH QMYYIGSNSESVE
Subjt:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE

Query:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA
        Q++IH Y TAYGGGGYAISY LA ELVRILDGC+DRY SLYGGDQKVQACV+EIGVPLT E GFHQVDIRG+QYG LAAHPVAPLVSLHHVDYLP IFP 
Subjt:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA

Query:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP
        M ++DALK LKT ++LDPGRTLQQSFCYD ARNWS+SVSWGY+V+LYP L TPK++EK+FQTF+TWKSW+DGPFTFNTRPV  DPC+MP++FFLD    P
Subjt:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP

Query:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        N+TVT Y + +DVWE+ECGRDEFQ AQ VERF+VVT G  SA+ W+KAPRRQCCQVV+GT+     +D+VVNV  RGCNPFETVTPP
Subjt:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

A0A6J1J6Y4 uncharacterized protein LOC111481920; e-value 6.6e-218; identity 75.56%
Query:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV
        Q+S+KSRK  FFKISL L  SI F+SL L SF+ K  NC +CH        RRKI A  + +   +  TN+SHLLFGIAGSTKTWKKR+ YC+LWW PNV
Subjt:  QDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNV

Query:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE
        TRGFVWVDEKPNATW A+SPPYRVSADTS F Y+CWYGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLV+VLGKYDH QMYYIGSNSESVE
Subjt:  TRGFVWVDEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVE

Query:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA
        QD+IH Y TAYGGGGYAISY LA ELVRILDGC+DRY SLYGGDQKVQACV+EIGVPLT E GFHQVDIRG+QYG LAAHPVAPLVSLHHVDYLP IFP 
Subjt:  QDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPA

Query:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP
        M K+DALK LKT ++LDPGRTLQQSFCYD ARNWS+SVSWGY+V+LYP L TPK+MEK+FQTF+TWKSW+DGPFTFNTRPV  DPCEMP++FFLD V  P
Subjt:  MTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGP

Query:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        N+TVT Y + +DVWE+EC +DEFQ  Q VERF+VVTSG  S++ W+KAPRRQCCQVV+ T+     +D+VVNV +RGCNPFETVTPP
Subjt:  NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

A0A6P5YRG4 uncharacterized protein LOC111294031 isoform X1; e-value 7.9e-171; identity 60.67%
Query:  ISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKP--
        +++  ++SI+    F  S   +L + PN  +  ++T    K  +  I+ +  E+ TN+SH+ FGI GS KTW +R  YC+LWWRPNVTRGFVW+DEKP  
Subjt:  ISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKP--

Query:  NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAY
        +     +SPPY++S DTS FKY+C YGSRSA+R+ARIVKESFELG++NVRWFVMGDDDTVFF+ENLVSVL KYDH QMYYIG NSESVEQD+IHSY  AY
Subjt:  NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAY

Query:  GGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLK
        GGGG+AISYPLAAELV++LDGCVDRY S YG DQKVQAC+SEIG+P+T E GFHQVDIRG+ YG LAAHP+APLVSLHH+DY+  IFP MT++D+LK L 
Subjt:  GGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLK

Query:  TVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMK
        + ++ DP R LQQSFCYD  RNWSVSVSWGY+++LYP LVT K++E  F TFQ+W++W +GPFTFNTRP+G DPCE P+++FLD    V  ++T+T Y +
Subjt:  TVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMK

Query:  HVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
        HV+   +EC R ++  A  V+ F  V+S +L+ A W  APRRQCC+V++G    G GV +VV V IR CN FE+VTPP
Subjt:  HVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G37730.1 Protein of unknown function (DUF604); e-value 1.5e-166; identity 57.59%
Query:  FVSLFLSSFDNKLFNCPNCHHRTLQTSVRR------KITADRIAAEA-------DEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEK
        F+S+ + SF    F C +CHH    T +RR       +T +  A+ +         + T++SH+ FGI GS +TW+ R  Y +LWWRPNVTRGF+W+DE+
Subjt:  FVSLFLSSFDNKLFNCPNCHHRTLQTSVRR------KITADRIAAEA-------DEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEK

Query:  P--NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYA
        P  N TW ++SPPY+VSADTS F Y+CWYGSRSAIR+ARI+KE+FELG+ +VRWF+MGDDDTVFFV+NL++VL KYDH QMYYIG NSESVEQD++HSYA
Subjt:  P--NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYA

Query:  TAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALK
         AYGGGG AISYPLA ELV++LDGC+DRY SLYG DQK++AC+SEIGVPLT E GFHQVDIRGN YG LAAHPVAPLV+LHH+DY+  IFP  T++DAL+
Subjt:  TAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALK

Query:  TLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTG
         L + +  DP R +Q SFC+D  RNW VSVSWGY++++YP LVT KE+E  F TF++W++ +  PF+F+TRP+  DPCE PLV+FLD V  VG  QT+T 
Subjt:  TLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTG

Query:  YMKHVDVWE-RECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTP
        Y KHV+V E  +C   ++  A  VE   V T+  L+   W  APRRQCC++V+    +    +SV+NV IR  NP E+VTP
Subjt:  YMKHVDVWE-RECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTP

AT3G11420.1 Protein of unknown function (DUF604); e-value 2.1e-120; identity 49.66%
Query:  TSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKPNATWHAS----SPPYRVS-ADTSNFKYSCWYGSRSAI
        T+V +K  A  +   A    TN+SH+ F IAG+ +TW  R  Y  LWWR N TRGFVW+DE      + S    S P RVS    + FK+S    SR+A+
Subjt:  TSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKPNATWHAS----SPPYRVS-ADTSNFKYSCWYGSRSAI

Query:  RLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGG
        R+ARI+ +S+ L + NVRWFVMGDDDTVFF ENLV VL KYDH QM+YIG NSESVEQD++H+Y  A+GGGG+A+S PLAA L   +D C+ RY   YG 
Subjt:  RLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGG

Query:  DQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYS
        DQ++ +C+SEIGVP T E GFHQ+DIRG+ YGFLAAHP+APLVSLHH+ YL  +FP    +++L+TL   + LDP R LQQ  C+D  R WS+S+SWGY+
Subjt:  DQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYS

Query:  VRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLS
        +++Y   +T  E+    QTF+TW+S +DGPF FNTRP+ PDPCE P+ +F+D    V  + T T Y    D     CG+ E      V+R  +VTS +  
Subjt:  VRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPV--VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLS

Query:  AATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETV
           W KAPRRQCC+V++G    G   +  + + IR C   E +
Subjt:  AATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETV

AT4G11350.1 Protein of unknown function (DUF604); e-value 3.3e-105; identity 42.7%
Query:  ISLALISSITFVSLFLSSFDNKLFNCPNCHHRTL--QTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDE--
        I L L  S+T++ ++     +    C +    ++  Q   ++ +T    A  A+++ T+++H++FGIA S+K WK+R+ Y ++W++P   RG+VW+DE  
Subjt:  ISLALISSITFVSLFLSSFDNKLFNCPNCHHRTL--QTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDE--

Query:  --KPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESF----ELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDL
          K       S P  R+S DTS+F Y+   G RSAIR++RIV E+         +NVRWFVMGDDDTVF  +NL+ VL KYDH QMYYIGS SES  Q++
Subjt:  --KPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESF----ELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDL

Query:  IHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTK
        I SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++QAC++E+GVPLT E GFHQ D+ GN +G LAAHP+ P VS+HH+D +  IFP MT+
Subjt:  IHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTK

Query:  LDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTD-GPFTFNTRPVGPDPCEMPLVFFL-----DPV
        + A+K L T   +D    LQQS CYD  ++W++SVSWG++V+++    +P+EME   +TF  W    D   + FNTRPV  + C+ P VF +     DP 
Subjt:  LDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTD-GPFTFNTRPVGPDPCEMPLVFFL-----DPV

Query:  VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTN
        +  N TV+ Y +H  V +  C    +  A   E   +V   +     W ++PRR CC+V+     N
Subjt:  VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQVVDGTTTN

AT4G23490.1 Protein of unknown function (DUF604); e-value 1.1e-105; identity 45.21%
Query:  EQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKPNATWHASS-----PPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENV
        ++ T+++H++FGIA S+K WK+R+ Y ++W++P   RG+VW+D++   +          PP ++S  T++F Y+   G RSA+R++RIV E+  LG +NV
Subjt:  EQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKPNATWHASS-----PPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENV

Query:  RWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTN
        RWFVMGDDDTVF ++NL+ VL KYDH QMYYIGS SES  Q++  SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++QAC++E+GVPLT 
Subjt:  RWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTN

Query:  EHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNF
        E GFHQ D+ GN +G LAAHPV P VS+HH+D +  IFP MT++ ALK +     LD    LQQS CYD  ++W++SVSWGY+V+++  + +P+EME   
Subjt:  EHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNF

Query:  QTFQTWKSWTD-GPFTFNTRPVGPDPCEMPLVFFLDPVVGP---NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQV
        +TF  W    D   + FNTRPV  +PC+ P VF++         N TV+ Y  H  V    C    ++     E   +V   +     W ++PRR CC+V
Subjt:  QTFQTWKSWTD-GPFTFNTRPVGPDPCEMPLVFFLDPVVGP---NQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQV

Query:  VDGTTTN
        +     N
Subjt:  VDGTTTN

AT5G41460.1 Protein of unknown function (DUF604); e-value 6.7e-106; identity 48.23%
Query:  TNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKP----NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFV
        T   H++FGIA S + WK+R+ Y ++W++PN  R +VW+ EKP    +     S PP ++S DTS F Y    G RSAIR++RIV E+ +LG+++VRWFV
Subjt:  TNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWVDEKP----NATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFV

Query:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF
        MGDDDTVF  ENL+ VL KYDH QMYYIGS SES  Q++  SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++QAC++E+GVPLT E GF
Subjt:  MGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYAISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGF

Query:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ
        HQ D+ GN +G LAAHPVAPLV+LHH+D +  IFP MT++DALK L+    LD    +QQS CYD  R W+VSVSWG++V+++  + + +E+E   +TF 
Subjt:  HQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFCYDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQ

Query:  TWKSWTD-GPFTFNTRPVGPDPCEMPLVFFLDPV---VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQV
         W    D   + FNTRPV   PC+ P VF++         N TV+ Y  H  V   EC    ++ A   +   V+   +     W ++PRR CC+V
Subjt:  TWKSWTD-GPFTFNTRPVGPDPCEMPLVFFLDPV---VGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTSGQLSAATWMKAPRRQCCQV


Sequences
CDS sequence
ATGGCGATTCAGGATTCAATGAAATCCCGTAAAAGTAATTTCTTCAAAATTTCACTCGCTCTAATTTCTTCCATAACCTTCGTTTCCCTCTTCCTCTCTTCCTTCGACAA
CAAATTATTCAACTGTCCCAACTGCCACCACCGCACCCTCCAAACCTCCGTCCGCAGGAAAATCACCGCCGACCGTATCGCCGCCGAGGCTGATGAACAAACGACAAACG
TATCCCACCTTCTGTTCGGCATCGCGGGGTCCACCAAGACATGGAAAAAGCGTGAAGGCTACTGCCAACTGTGGTGGAGGCCTAACGTCACCCGCGGGTTCGTCTGGGTG
GACGAGAAGCCGAACGCCACGTGGCATGCCAGCTCCCCTCCGTACAGAGTATCGGCCGACACGTCGAATTTTAAGTACAGTTGCTGGTACGGATCCCGATCGGCGATCAG
ACTGGCGAGGATCGTGAAGGAGAGCTTCGAATTGGGGATCGAAAACGTGCGGTGGTTCGTGATGGGAGACGACGACACGGTGTTTTTCGTGGAGAATTTGGTGAGCGTTT
TGGGGAAGTACGATCACACGCAGATGTATTATATTGGGTCCAACTCCGAGAGCGTGGAGCAGGATTTGATCCATTCGTATGCTACGGCGTATGGCGGCGGCGGATATGCC
ATAAGTTATCCGTTGGCGGCGGAGCTGGTGAGGATTTTGGATGGTTGCGTTGATCGGTACGACAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCTTGTGTTAGTGAAAT
TGGTGTCCCCTTGACCAACGAGCATGGGTTTCATCAGGTTGATATTCGAGGGAACCAATATGGGTTCTTGGCAGCGCATCCGGTGGCTCCGTTGGTGTCGCTCCACCACG
TCGACTACCTTCCGGCCATTTTTCCGGCGATGACCAAACTCGACGCCCTGAAGACTTTAAAAACCGTACACGACCTCGATCCGGGTCGGACCCTTCAGCAAAGTTTCTGT
TACGACTCGGCTCGTAACTGGTCCGTTTCGGTCTCGTGGGGCTACAGCGTCCGGCTCTACCCGCGGCTCGTCACGCCCAAAGAAATGGAGAAGAATTTTCAGACGTTTCA
GACATGGAAGAGTTGGACCGACGGTCCGTTCACGTTCAACACCCGACCGGTCGGCCCGGACCCATGTGAGATGCCTTTGGTTTTCTTTTTGGACCCGGTCGTCGGTCCGA
ACCAGACAGTAACCGGTTACATGAAGCACGTTGACGTGTGGGAGAGAGAGTGTGGCCGAGACGAGTTCCAGGCAGCGCAGGATGTGGAGCGGTTCCAAGTGGTGACTTCT
GGCCAGTTGAGTGCTGCCACGTGGATGAAGGCCCCACGTAGACAATGTTGCCAAGTCGTGGATGGTACAACTACCAATGGCGGTGGAGTTGATAGTGTGGTGAATGTCCA
TATCAGAGGCTGCAATCCCTTTGAGACCGTAACTCCCCCATAG
mRNA sequence
ATGGCGATTCAGGATTCAATGAAATCCCGTAAAAGTAATTTCTTCAAAATTTCACTCGCTCTAATTTCTTCCATAACCTTCGTTTCCCTCTTCCTCTCTTCCTTCGACAA
CAAATTATTCAACTGTCCCAACTGCCACCACCGCACCCTCCAAACCTCCGTCCGCAGGAAAATCACCGCCGACCGTATCGCCGCCGAGGCTGATGAACAAACGACAAACG
TATCCCACCTTCTGTTCGGCATCGCGGGGTCCACCAAGACATGGAAAAAGCGTGAAGGCTACTGCCAACTGTGGTGGAGGCCTAACGTCACCCGCGGGTTCGTCTGGGTG
GACGAGAAGCCGAACGCCACGTGGCATGCCAGCTCCCCTCCGTACAGAGTATCGGCCGACACGTCGAATTTTAAGTACAGTTGCTGGTACGGATCCCGATCGGCGATCAG
ACTGGCGAGGATCGTGAAGGAGAGCTTCGAATTGGGGATCGAAAACGTGCGGTGGTTCGTGATGGGAGACGACGACACGGTGTTTTTCGTGGAGAATTTGGTGAGCGTTT
TGGGGAAGTACGATCACACGCAGATGTATTATATTGGGTCCAACTCCGAGAGCGTGGAGCAGGATTTGATCCATTCGTATGCTACGGCGTATGGCGGCGGCGGATATGCC
ATAAGTTATCCGTTGGCGGCGGAGCTGGTGAGGATTTTGGATGGTTGCGTTGATCGGTACGACAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCTTGTGTTAGTGAAAT
TGGTGTCCCCTTGACCAACGAGCATGGGTTTCATCAGGTTGATATTCGAGGGAACCAATATGGGTTCTTGGCAGCGCATCCGGTGGCTCCGTTGGTGTCGCTCCACCACG
TCGACTACCTTCCGGCCATTTTTCCGGCGATGACCAAACTCGACGCCCTGAAGACTTTAAAAACCGTACACGACCTCGATCCGGGTCGGACCCTTCAGCAAAGTTTCTGT
TACGACTCGGCTCGTAACTGGTCCGTTTCGGTCTCGTGGGGCTACAGCGTCCGGCTCTACCCGCGGCTCGTCACGCCCAAAGAAATGGAGAAGAATTTTCAGACGTTTCA
GACATGGAAGAGTTGGACCGACGGTCCGTTCACGTTCAACACCCGACCGGTCGGCCCGGACCCATGTGAGATGCCTTTGGTTTTCTTTTTGGACCCGGTCGTCGGTCCGA
ACCAGACAGTAACCGGTTACATGAAGCACGTTGACGTGTGGGAGAGAGAGTGTGGCCGAGACGAGTTCCAGGCAGCGCAGGATGTGGAGCGGTTCCAAGTGGTGACTTCT
GGCCAGTTGAGTGCTGCCACGTGGATGAAGGCCCCACGTAGACAATGTTGCCAAGTCGTGGATGGTACAACTACCAATGGCGGTGGAGTTGATAGTGTGGTGAATGTCCA
TATCAGAGGCTGCAATCCCTTTGAGACCGTAACTCCCCCATAG
Protein sequence
MAIQDSMKSRKSNFFKISLALISSITFVSLFLSSFDNKLFNCPNCHHRTLQTSVRRKITADRIAAEADEQTTNVSHLLFGIAGSTKTWKKREGYCQLWWRPNVTRGFVWV
DEKPNATWHASSPPYRVSADTSNFKYSCWYGSRSAIRLARIVKESFELGIENVRWFVMGDDDTVFFVENLVSVLGKYDHTQMYYIGSNSESVEQDLIHSYATAYGGGGYA
ISYPLAAELVRILDGCVDRYDSLYGGDQKVQACVSEIGVPLTNEHGFHQVDIRGNQYGFLAAHPVAPLVSLHHVDYLPAIFPAMTKLDALKTLKTVHDLDPGRTLQQSFC
YDSARNWSVSVSWGYSVRLYPRLVTPKEMEKNFQTFQTWKSWTDGPFTFNTRPVGPDPCEMPLVFFLDPVVGPNQTVTGYMKHVDVWERECGRDEFQAAQDVERFQVVTS
GQLSAATWMKAPRRQCCQVVDGTTTNGGGVDSVVNVHIRGCNPFETVTPP
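
The CDS and protein sequences above can be cross-checked by translating the nucleotides. The sketch below is illustrative only and assumes the standard genetic code (the record does not name a translation table); the `translate` function and the `CODON_TABLE` name are hypothetical helpers, and only the first ten codons and the last four codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides and last 12 nucleotides of the CDS listed above.
cds_start = "ATGGCGATTCAGGATTCAATGAAATCCCGT"
cds_end = "ACTCCCCCATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the beginning and end of the protein sequence shown above, with the final `TAG` codon read as a stop. Running the same function over the full CDS would be the natural next step for a complete check.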