CuGenDBv2

Moc05g32420 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g32420
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF604)
Genome location: chr5:24266557..24268627
RNA-Seq Expression: Moc05g32420
Synteny: Moc05g32420
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domains: IPR006740 - Protein of unknown function DUF604
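
The genome location above can be split into chromosome, start and end with a few lines of Python. This is only a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr5:24266557..24268627")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 2071 positions on chr5.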


Homology
GenBank top hits    e value    % identity    Alignment
XP_008442515.1 PREDICTED: uncharacterized protein LOC103486366 [Cucumis melo]    3.6e-212    76.14
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K  VLRP D  SP +RA+ V+  VASFSLFFYLTFSDQNP  C GC+ A RYSNHRK+KA   D GE + TNISH+VFGIGGSV TWNERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKNVTRGFVWLEEKP + WPE+SPPYRIS DTS+FNYTCWYGFRSAIRVARIIKET+E+GLENVRWFVMGDDDTVFF +NL+++LGRYDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
        ANSESVEQD+VHSY MAYGG GFAISYPLA  LV+ILDGCI+RYA M  SDQKIQGC++EIGV +TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        VQ+IFPAM+QPDSLK+LY AY+TDPSRALQH+FCYD  RNWSVSVSWGY+VQLYPWL TAK++ET FLT+QTWKT+SNEPF FDT+PVSSDPC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        L++ +RL  R  RT+T Y+RY EE    C+R DY PAL V  F+VSAPEFDRRLW Q  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

XP_011652840.2 uncharacterized protein LOC101203954 [Cucumis sativus]    3.1e-211    76.36
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K  VLRP D  SP ++A+ V+S VASFSLFFYLTFSDQN + C GC+ A RYSNHRK+KA   D GE + TNISH+VFGIGGSV TWNERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKNVTRGFVW+EEKP F WPE+SPPYR+SDDTS+FNYTCWYGFRSAIRVARIIKET+E+GLENVRWFVMGDDDTVFF ENL+++LGRYDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
        ANSESVEQD+VHSYTMAYGG GFAISYPLA  LV+ILDGCINRYA M  SDQKIQGC+SEIGV +TKE GFHQ+DIRGNPYG+LAAHP+APLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        VQTIFP M+QPDSLK+L+ AY+TDPSRALQHTFCYD   NWSVS+SWGY+VQLYP L TAK++ET FLT+QTW+T+SNEPF FDT+PVSSDPCQRPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGRR--TVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        L++A+RL  RR  T+T Y+RYVEE    C+R DY PAL V FF+VSA EFDRRLW Q  ++
Subjt:  LDAADRLDGRR--TVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

XP_022145771.1 uncharacterized protein LOC111015146 [Momordica charantia]    5.5e-277    99.14
Query:  MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE
        MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE
Subjt:  MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE

Query:  RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN
        RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN
Subjt:  RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN

Query:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH
        QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH
Subjt:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH

Query:  HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP
        HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP
Subjt:  HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP

Query:  ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQ  ++
Subjt:  ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

XP_022983991.1 uncharacterized protein LOC111482442 [Cucurbita maxima]    4.7e-212    76.14
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K +V RP D  + F+R + VIS VASFSLFFYLT  D+ P  C GC+GALR SNHR++KA  + E   + TNISH+VFGIGGSV TW+ERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKN+TRGFVWLEEKP F W ++SPPYRISDDTS+FNYTCWYGFRSAIRVARI+KET+ELGL+NVRWFVMGDDDTVFFTENLVE+LG+YDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
         NSESVEQD VHSY MAYGGAGFAISYPLA  LV+ILDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        +Q IFPAM++PDS+K+L++AYKTDPSRALQH+FCYD ARNWSVSVSWGY+VQLYPWLATAK+L+TPFLTFQTWKT +NE F FDTRPVSS+PC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        LD A+R  GR  RT+T YR+YVE   KEC++ DY  AL V +F+VSAPEFDRRLWRQ  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

XP_038904437.1 uncharacterized protein LOC120090799 [Benincasa hispida]    2.1e-215    78.94
Query:  RPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRG
        RP   L PF+RA  +ISAVASFSLF +LTF+DQ P  C GC+ A RYSNHRK+KA ++ E   + TNISH+VFGIGGSV TWNERRHYCELWWKKNVTRG
Subjt:  RPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRG

Query:  FVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDI
        FVWLEEKP FPWPE+SPPYRISDDTSRFNYTCWYGFRSAIRVARIIKET+ELGLENVRWFVMGDDDTVFFTENLV+LLG+YDHNQM+YIG NSESVEQD+
Subjt:  FVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDI

Query:  VHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQ
        VHSYTMAYGG GFAISYPLA  LV+ILDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDYVQ IFP M+Q
Subjt:  VHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQ

Query:  PDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFFLDAADRLDGR
        PD+LK+L++AYKTDPSRALQH+FCYD  RNWSVSVSWGY++QLYPWL TAK+LET FLT+QTW+T+SNEPF FDTRPVSSDPC+RPIL+FLD+A+RL GR
Subjt:  PDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFFLDAADRLDGR

Query:  --RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
          RT+T YRR++EE    C+R DY PAL V  F+VSAPEFDRRLWRQ  ++
Subjt:  --RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

TrEMBL top hits    e value    % identity    Alignment
A0A1S3B6M6 uncharacterized protein LOC103486366    1.8e-212    76.14
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K  VLRP D  SP +RA+ V+  VASFSLFFYLTFSDQNP  C GC+ A RYSNHRK+KA   D GE + TNISH+VFGIGGSV TWNERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKNVTRGFVWLEEKP + WPE+SPPYRIS DTS+FNYTCWYGFRSAIRVARIIKET+E+GLENVRWFVMGDDDTVFF +NL+++LGRYDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
        ANSESVEQD+VHSY MAYGG GFAISYPLA  LV+ILDGCI+RYA M  SDQKIQGC++EIGV +TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        VQ+IFPAM+QPDSLK+LY AY+TDPSRALQH+FCYD  RNWSVSVSWGY+VQLYPWL TAK++ET FLT+QTWKT+SNEPF FDT+PVSSDPC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        L++ +RL  R  RT+T Y+RY EE    C+R DY PAL V  F+VSAPEFDRRLW Q  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

A0A5D3DN33 Uncharacterized protein    1.8e-212    76.14
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K  VLRP D  SP +RA+ V+  VASFSLFFYLTFSDQNP  C GC+ A RYSNHRK+KA   D GE + TNISH+VFGIGGSV TWNERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKNVTRGFVWLEEKP + WPE+SPPYRIS DTS+FNYTCWYGFRSAIRVARIIKET+E+GLENVRWFVMGDDDTVFF +NL+++LGRYDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
        ANSESVEQD+VHSY MAYGG GFAISYPLA  LV+ILDGCI+RYA M  SDQKIQGC++EIGV +TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        VQ+IFPAM+QPDSLK+LY AY+TDPSRALQH+FCYD  RNWSVSVSWGY+VQLYPWL TAK++ET FLT+QTWKT+SNEPF FDT+PVSSDPC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        L++ +RL  R  RT+T Y+RY EE    C+R DY PAL V  F+VSAPEFDRRLW Q  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

A0A6J1CXD9 uncharacterized protein LOC111015146    2.7e-277    99.14
Query:  MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE
        MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE
Subjt:  MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNE

Query:  RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN
        RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN
Subjt:  RRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN

Query:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH
        QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH
Subjt:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLH

Query:  HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP
        HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP
Subjt:  HLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRP

Query:  ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQ  ++
Subjt:  ILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

A0A6J1FA50 uncharacterized protein LOC111442198    3.3e-211    75.49
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K +V RP D  + F+R   VIS VASFSLF YLT  D+ P  C GC+GALR SNHR++KA  + E   + TNISH+VFGIGGSV TWNERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWW KNVTRGFVWLEEKP FPWP++SPPYRISDDTS+FNYTCWYGFRSAIRVARIIKET++LGL+NVRWFVMGDDDTVFFTENLV+LLG+YDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
        +NSESVEQD VHSY MAYGGAGFAISYPLA  LV+ILDGCINRYADM  SDQKIQGC+S+IGV +TKELGFHQVDIRG+ YG+LAAHPVAPLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        ++ IFPAM++PDS+K+L++AYKTDP RALQH+FCYD ARNWSVSVSWGY+VQLYPWLATAK+L+TPFLTFQTWKT +NE F FDTRPVSS+PC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        LD A+R  GR  RT+TRYR+YVE    EC + DY  AL V +F+VSAPEFDRRLWRQ  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

A0A6J1J3X9 uncharacterized protein LOC111482442    2.3e-212    76.14
Query:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE
        +++ +K +V RP D  + F+R + VIS VASFSLFFYLT  D+ P  C GC+GALR SNHR++KA  + E   + TNISH+VFGIGGSV TW+ERRHYCE
Subjt:  AIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCE

Query:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG
        LWWKKN+TRGFVWLEEKP F W ++SPPYRISDDTS+FNYTCWYGFRSAIRVARI+KET+ELGL+NVRWFVMGDDDTVFFTENLVE+LG+YDHNQMYYIG
Subjt:  LWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIG

Query:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY
         NSESVEQD VHSY MAYGGAGFAISYPLA  LV+ILDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLDY
Subjt:  ANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDY

Query:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF
        +Q IFPAM++PDS+K+L++AYKTDPSRALQH+FCYD ARNWSVSVSWGY+VQLYPWLATAK+L+TPFLTFQTWKT +NE F FDTRPVSS+PC+RPIL+F
Subjt:  VQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFF

Query:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        LD A+R  GR  RT+T YR+YVE   KEC++ DY  AL V +F+VSAPEFDRRLWRQ  ++
Subjt:  LDAADRLDGR--RTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT1G01570.1 Protein of unknown function (DUF604)    2.1e-96    47.53
Query:  TNISHVVFGIGGSVMTWNERRHYCELWWKKN-VTRGFVWLEE--KPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFEL--GLE---NV
        T + HVVFGI  S   W  R+ Y +LWWK N    G VWL++         +  PP RIS DTSRF Y    G RSAIR+ RI+ ET  L  G E   NV
Subjt:  TNISHVVFGIGGSVMTWNERRHYCELWWKKN-VTRGFVWLEE--KPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFEL--GLE---NV

Query:  RWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTK
        RW VMGDDDTVFF ENLV++L +YDHNQ YYIG++SES  Q++  SY MAYGG GFAISYPLA  L ++ D CI RY+++  SD +I  C+SE+GV +TK
Subjt:  RWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTK

Query:  ELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPF
        E+GFHQ+D+ G   GLL+AHP+APLVS+HHLD V  +FP M + ++++R     K D     Q + CYDA   W+VSVSWGYTVQ+   + +A+++  P 
Subjt:  ELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPF

Query:  LTFQTW-KTSSNEPFAFDTRPVSSDPCQRPILFFL-DAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDR
         TF  W K +    +AF+TRP++   CQRP +++L +A   L  RRT + Y R+ +  + EC+     P+   R      P+ DR
Subjt:  LTFQTW-KTSSNEPFAFDTRPVSSDPCQRPILFFL-DAADRLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDR

AT1G07850.1 Protein of unknown function (DUF604)    4.0e-100    48.31
Query:  KIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFE
        ++ A       +  T + H+VFGI  S + W  R+ Y + WW+   TRG VW++++      +  P  RIS DTSRF YT   G RSA+R++R++ ET  
Subjt:  KIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFE

Query:  LGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEI
        LG + VRWFVMGDDDTVF  +N+V +L +YDH Q YY+G++SE+  Q+I  SY+MA+GG GFAISY LA+EL+R+ D CI RY  +  SD +IQ C++E+
Subjt:  LGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEI

Query:  GVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAK
        GV +TKE GFHQ D+ G+  GLL AHPVAPLVSLHH+D VQ IFP M +  +L+ L S+   DP+   Q + CYD  R WS+SVSWG+ VQ+   + + +
Subjt:  GVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAK

Query:  DLETPFLTFQTW-KTSSNEPFAFDTRPVSSDPCQRPILFFLDAADRLDGRRTVTRY
        +LE P  TF  W + +    +AF+TRPVS  PCQRP +F+L++A   +GRR V  Y
Subjt:  DLETPFLTFQTW-KTSSNEPFAFDTRPVSSDPCQRPILFFLDAADRLDGRRTVTRY

AT2G37730.1 Protein of unknown function (DUF604)    1.3e-162    59.01
Query:  LRPKDALSPFIRAIFVISAVASFSLF-FYLTFSDQNPAGCVGCH-----GALRYSNHRKIKAASSDEGELR-------GTNISHVVFGIGGSVMTWNERR
        ++P   LS   +  F I    S ++  FY+ F      GC  CH       L   N      +S+    +R        T+ISH+ FGIGGS+ TW +R 
Subjt:  LRPKDALSPFIRAIFVISAVASFSLF-FYLTFSDQNPAGCVGCH-----GALRYSNHRKIKAASSDEGELR-------GTNISHVVFGIGGSVMTWNERR

Query:  HYCELWWKKNVTRGFVWLEEKPA--FPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN
         Y ELWW+ NVTRGF+WL+E+P     W   SPPY++S DTSRF+YTCWYG RSAIR+ARIIKETFELGL +VRWF+MGDDDTVFF +NL+ +L +YDHN
Subjt:  HYCELWWKKNVTRGFVWLEEKPA--FPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHN

Query:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVS
        QMYYIG NSESVEQDIVHSY MAYGG G AISYPLA+ELV++LDGCI+RYA +  SDQKI+ CLSEIGV +TKELGFHQVDIRGNPYGLLAAHPVAPLV+
Subjt:  QMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVS

Query:  LHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQ
        LHHLDYV  IFP  +Q D+L+RL SAYKTDPSR +QH+FC+D  RNW VSVSWGYT+Q+YP L TAK+LETPFLTF++W+TSS+EPF+FDTRP+S DPC+
Subjt:  LHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQ

Query:  RPILFFLDAADRLDGRRTVTRYRRYVEEVDK-ECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
        RP+++FLD    +   +T+T YR++VE  +  +C   DY  A  V F DVS       LW+   ++
Subjt:  RPILFFLDAADRLDGRRTVTRYRRYVEEVDK-ECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

AT3G11420.1 Protein of unknown function (DUF604)    2.9e-111    45.15
Query:  RPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRG
        RP+D L  F R   +   + S SL    TF   +       +G L+ +   +   A      +  TNISH+ F I G+  TW +R  Y  LWW +N TRG
Subjt:  RPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRG

Query:  FVWLEEKPAFPWPEN----SPPYRISD-DTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSES
        FVWL+E    P   +    S P R+SD   +RF ++     R+A+R+ARII +++ L L NVRWFVMGDDDTVFFTENLV++L +YDH QM+YIG NSES
Subjt:  FVWLEEKPAFPWPEN----SPPYRISD-DTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSES

Query:  VEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIF
        VEQD++H+Y MA+GG GFA+S PLA  L   +D C+ RY     SDQ+I  C+SEIGV  T+E GFHQ+DIRG+PYG LAAHP+APLVSLHHL Y+  +F
Subjt:  VEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIF

Query:  PAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFFLDAAD
        P  +  +SL+ L   Y  DP+R LQ   C+D  R WS+S+SWGYT+Q+Y +  TA +L TP  TF+TW++SS+ PF F+TRP+  DPC+RP+ +F+D A+
Subjt:  PAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFFLDAAD

Query:  RLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK
         +    T T Y    ++    C + ++     V+   V++ + D   W +  ++
Subjt:  RLDGRRTVTRYRRYVEEVDKECERSDYVPALGVRFFDVSAPEFDRRLWRQVKKK

AT5G41460.1 Protein of unknown function (DUF604)    3.6e-101    53.17
Query:  TNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRGFVWLEEKPAFPWPENS----PPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFV
        T   HVVFGI  S   W +R+ Y ++W+K N  R +VWL EKP     E      PP +IS DTS+F Y    G RSAIR++RI+ ET +LGL++VRWFV
Subjt:  TNISHVVFGIGGSVMTWNERRHYCELWWKKNVTRGFVWLEEKPAFPWPENS----PPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFV

Query:  MGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGF
        MGDDDTVF  ENL+ +L +YDHNQMYYIG+ SES  Q+I  SY MAYGG GFAISYPLA+ L ++ D CI RY  +  SD ++Q C++E+GV +TKELGF
Subjt:  MGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSYTMAYGGAGFAISYPLAIELVRILDGCINRYADM--SDQKIQGCLSEIGVSVTKELGF

Query:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQ
        HQ D+ GN +GLLAAHPVAPLV+LHHLD V+ IFP M++ D+LK L    K D +  +Q + CYD  R W+VSVSWG+ VQ++  + +A+++E P  TF 
Subjt:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPSRALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQ

Query:  TW-KTSSNEPFAFDTRPVSSDPCQRPILFFL
         W + +    +AF+TRPVS  PCQ+P +F++
Subjt:  TW-KTSSNEPFAFDTRPVSSDPCQRPILFFL
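
The alignments above follow the usual BLAST-style layout, where the middle line repeats a residue letter at identical positions, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how a % identity figure relates to that middle line, the hypothetical helper below counts the letters and divides by the alignment length. It assumes the whole middle line is available and that gap columns count towards the length; the database's exact method is not stated in this record, so values computed this way may differ from the reported ones.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose midline character is a residue letter."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example, not taken from the hits above.
toy_query = "ACDEFG"
toy_midline = "AC+ FG"
toy_identity = percent_identity(toy_midline, len(toy_query))
print(round(toy_identity, 2))
```

For a multi-block alignment, the middle lines of all blocks would be concatenated (without the leading indentation) before calling the helper.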


Sequences
CDS sequence
ATGTCTCTTCCCGGAGAAGCGATTCAAGGCTGGAAAGTTTTGGTGTTACGGCCAAAGGATGCCTTATCTCCTTTCATTAGAGCCATTTTTGTTATCTCCGCCGTTGCTTC
GTTTTCTCTCTTCTTCTACCTAACTTTTTCCGACCAAAACCCCGCCGGCTGCGTCGGCTGCCACGGCGCACTCCGGTATTCTAATCACCGGAAAATCAAGGCGGCTTCTT
CCGATGAAGGCGAATTACGTGGCACGAATATATCCCATGTCGTGTTCGGCATTGGTGGGTCCGTGATGACGTGGAACGAGCGTCGCCATTACTGCGAGCTGTGGTGGAAG
AAGAACGTTACTCGTGGGTTTGTTTGGCTCGAAGAGAAACCTGCCTTTCCCTGGCCGGAAAACTCCCCGCCCTACCGGATCTCCGATGACACCTCCAGATTCAACTACAC
TTGTTGGTACGGTTTTCGATCTGCTATTCGGGTGGCTAGGATTATCAAAGAGACATTTGAACTTGGGTTGGAGAATGTGCGATGGTTCGTGATGGGGGACGACGATACGG
TCTTCTTCACGGAGAATTTGGTGGAGTTATTGGGTAGATACGATCACAACCAAATGTACTACATCGGAGCTAATTCTGAGAGTGTGGAACAAGACATTGTTCATTCTTAC
ACCATGGCTTATGGTGGTGCCGGATTCGCTATTAGCTACCCGCTCGCGATAGAACTGGTCAGAATTTTAGACGGTTGCATAAATCGTTATGCCGATATGTCCGACCAAAA
AATTCAAGGCTGCCTGAGTGAGATTGGTGTCTCCGTCACCAAAGAGCTTGGATTCCACCAGGTGGATATTAGAGGAAACCCATACGGGCTATTAGCAGCCCATCCAGTAG
CGCCATTAGTGTCGCTCCACCACCTGGATTACGTGCAGACCATATTCCCGGCCATGTCGCAGCCCGACTCGCTCAAGAGGCTCTACAGCGCCTACAAAACGGACCCGAGT
CGAGCCCTTCAGCACACCTTCTGCTACGACGCGGCTCGTAACTGGTCCGTTTCGGTCTCGTGGGGCTACACGGTTCAGCTATATCCATGGCTCGCCACCGCCAAGGACCT
GGAGACGCCATTTCTCACGTTCCAAACGTGGAAGACGTCCAGCAATGAGCCCTTCGCATTCGATACCCGACCCGTCAGTTCTGACCCGTGCCAAAGACCCATTTTGTTTT
TCTTGGACGCGGCGGACCGATTGGACGGCCGGCGGACGGTGACGAGGTACCGGAGATACGTTGAGGAGGTTGACAAGGAGTGTGAGCGGTCGGATTACGTTCCGGCGTTG
GGTGTTCGGTTTTTCGACGTCTCCGCCCCGGAGTTCGACCGCCGTCTCTGGAGACAGGTAAAGAAAAAAGTTGTTTAA
mRNA sequence
ATGTCTCTTCCCGGAGAAGCGATTCAAGGCTGGAAAGTTTTGGTGTTACGGCCAAAGGATGCCTTATCTCCTTTCATTAGAGCCATTTTTGTTATCTCCGCCGTTGCTTC
GTTTTCTCTCTTCTTCTACCTAACTTTTTCCGACCAAAACCCCGCCGGCTGCGTCGGCTGCCACGGCGCACTCCGGTATTCTAATCACCGGAAAATCAAGGCGGCTTCTT
CCGATGAAGGCGAATTACGTGGCACGAATATATCCCATGTCGTGTTCGGCATTGGTGGGTCCGTGATGACGTGGAACGAGCGTCGCCATTACTGCGAGCTGTGGTGGAAG
AAGAACGTTACTCGTGGGTTTGTTTGGCTCGAAGAGAAACCTGCCTTTCCCTGGCCGGAAAACTCCCCGCCCTACCGGATCTCCGATGACACCTCCAGATTCAACTACAC
TTGTTGGTACGGTTTTCGATCTGCTATTCGGGTGGCTAGGATTATCAAAGAGACATTTGAACTTGGGTTGGAGAATGTGCGATGGTTCGTGATGGGGGACGACGATACGG
TCTTCTTCACGGAGAATTTGGTGGAGTTATTGGGTAGATACGATCACAACCAAATGTACTACATCGGAGCTAATTCTGAGAGTGTGGAACAAGACATTGTTCATTCTTAC
ACCATGGCTTATGGTGGTGCCGGATTCGCTATTAGCTACCCGCTCGCGATAGAACTGGTCAGAATTTTAGACGGTTGCATAAATCGTTATGCCGATATGTCCGACCAAAA
AATTCAAGGCTGCCTGAGTGAGATTGGTGTCTCCGTCACCAAAGAGCTTGGATTCCACCAGGTGGATATTAGAGGAAACCCATACGGGCTATTAGCAGCCCATCCAGTAG
CGCCATTAGTGTCGCTCCACCACCTGGATTACGTGCAGACCATATTCCCGGCCATGTCGCAGCCCGACTCGCTCAAGAGGCTCTACAGCGCCTACAAAACGGACCCGAGT
CGAGCCCTTCAGCACACCTTCTGCTACGACGCGGCTCGTAACTGGTCCGTTTCGGTCTCGTGGGGCTACACGGTTCAGCTATATCCATGGCTCGCCACCGCCAAGGACCT
GGAGACGCCATTTCTCACGTTCCAAACGTGGAAGACGTCCAGCAATGAGCCCTTCGCATTCGATACCCGACCCGTCAGTTCTGACCCGTGCCAAAGACCCATTTTGTTTT
TCTTGGACGCGGCGGACCGATTGGACGGCCGGCGGACGGTGACGAGGTACCGGAGATACGTTGAGGAGGTTGACAAGGAGTGTGAGCGGTCGGATTACGTTCCGGCGTTG
GGTGTTCGGTTTTTCGACGTCTCCGCCCCGGAGTTCGACCGCCGTCTCTGGAGACAGGTAAAGAAAAAAGTTGTTTAA
Protein sequence
MSLPGEAIQGWKVLVLRPKDALSPFIRAIFVISAVASFSLFFYLTFSDQNPAGCVGCHGALRYSNHRKIKAASSDEGELRGTNISHVVFGIGGSVMTWNERRHYCELWWK
KNVTRGFVWLEEKPAFPWPENSPPYRISDDTSRFNYTCWYGFRSAIRVARIIKETFELGLENVRWFVMGDDDTVFFTENLVELLGRYDHNQMYYIGANSESVEQDIVHSY
TMAYGGAGFAISYPLAIELVRILDGCINRYADMSDQKIQGCLSEIGVSVTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKRLYSAYKTDPS
RALQHTFCYDAARNWSVSVSWGYTVQLYPWLATAKDLETPFLTFQTWKTSSNEPFAFDTRPVSSDPCQRPILFFLDAADRLDGRRTVTRYRRYVEEVDKECERSDYVPAL
GVRFFDVSAPEFDRRLWRQVKKKVV