CuGenDBv2

Tan0009310 (gene) of Snake gourd v1 genome

Gene ID: Tan0009310
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG01:108033973..108037027
RNA-Seq Expression: Tan0009310
Synteny: Tan0009310
Gene Ontology terms: NA
InterPro domains: NA
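
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention) and using a hypothetical helper named `parse_location`, the field can be split and the gene span computed as follows:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper, not a CuGenDB API)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG01:108033973..108037027")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 3055 bp.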


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]; e value 2.5e-250; identity 77.96%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        M+PYSE+RL EEVLHLH+LWRRGPPRNPKP HNHSST VAAAANRNP NKRP D KNR +KKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWPPIEP A
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A PVSSEER NL A QLQYK  +ACRGFFARNADSGSDEE EEEE   + EM+ESEEYKFFLKLFVE+DELRGYYEKN E G FCCLVCGGM K+K 
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GK+FKNCVGLVQHSISISRT KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVA E DSGVQNEN+ IS D+ +KKNEVV
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI
         +D  +QKLEEE++AED TSN+KDLIS +N+DACK NDV +QAEN DNSV GM ESNAEM+NLP     VPESILKACKEF  AF TSMS++DVSE+NLI
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI

Query:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC
        DG+G+EEREEFKFF KLF +NESLRRYYENN+DDGEFFCL C  AGKKMLK FKTCGRLLQHTTSL KNK  KKPVQKPHIAKM KMKM+AHRA S VIC
Subjt:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC

Query:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNV
        KVLGWDIEKLPAVV+ GEPLG+SLTK+  ++  D+ VGN+VDNT       ED STKINK+Q+E VG+A    DDIV DDS K N+LQG+S GN A GN+
Subjt:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNV

Query:  NDLDGVKE
        NDLDGVKE
Subjt:  NDLDGVKE

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]; e value 7.9e-252; identity 78.22%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        M+PYSE+RL EEVLHLH+LWRRGPPRNPKP HNHSST VAAAANRNP NKRP D KNR +KKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWPPIEP A
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A PVSSEER NL A QLQYK  +ACRGFFARNADSGSDEE EEEE   + EM+ESEEYKFFLKLFVE+DELRGYYEKN E G FCCLVCGGM K+K 
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GK+FKNCVGLVQHSISISRT KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVA E DSGVQNEN+ IS D+ +KKNEVV
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI
         +D  +QKLEEE++AED TSN+KDLIS +N+DACK NDV +QAEN DNSV GM ESNAEM+NLP     VPESILKACKEF  AF TSMS++DVSE+NLI
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI

Query:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC
        DG+G+EEREEFKFF KLF +NESLRRYYENN+DDGEFFCL C  AGKKMLK FKTCGRLLQHTTSL KNK  KKPVQKPHIAKM KMKM+AHRA S VIC
Subjt:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC

Query:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVND
        KVLGWDIEKLPAVV+ GEPLG+SLTK+  ++D+ VGN+VDNT       ED STKINK+Q+E VG+A    DDIV DDS K N+LQG+S GN A GN+ND
Subjt:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVND

Query:  LDGVKE
        LDGVKE
Subjt:  LDGVKE

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]; e value 1.2e-247; identity 77.63%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        M+PYSE+RL EEVLHLH+LWRRGPPRNPKP HNHSST VAAAANRNP NKRP D KNR +KKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWPPIEP A
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A PVSSEER NL A QLQYK  +ACRGFFARNADSGSDEE EEEE   + EM+ESEEYKFFLKLFVE+DELRGYYEKN E G FCCLVCGGM K+K 
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GK+FKNCVGLVQHSISISRT KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLADSG+LK   EENHVA E DSGVQNEN+ IS D+ +KKNEVV
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI
         +D  +QKLEEE++AED TSN+KDLIS +N+DACK NDV +QAEN DNSV GM ESNAEM+NLP     VPESILKACKEF  AF TSMS++DVSE+NLI
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLP-----VPESILKACKEFFTAFSTSMSEDDVSEDNLI

Query:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC
        DG+G+EEREEFKFF KLF +NESLRRYYENN+DDGEFFCL C  AGKKMLK FKTCGRLLQHTTSL KNK  KKPVQKPHIAKM KMKM+AHRA S VIC
Subjt:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC

Query:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNV
        KVLGWDIEKLPAVV+ GEPLG+SLTK+  ++  D+ VGN+VDNT       ED STKINK+Q+E VG+A    DDIV DDS K N+LQG+S GN A GN+
Subjt:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNV

Query:  NDLDGVKE
        NDLDGVKE
Subjt:  NDLDGVKE

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]; e value 3.5e-252; identity 78.61%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        M+PYSE+RL EEVLHLH+LWRRGPPRNPKP HNHSST VAAAANRNP NKRP D KNR +KKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWPPIEP A
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A PVSSEER NL A QLQYK  +ACRGFFARNADSGSDEE EEEE   + EM+ESEEYKFFLKLFVE+DELRGYYEKN E G FCCLVCGGM K+K 
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GK+FKNCVGLVQHSISISRT KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVA E DSGVQNEN+ IS D+ +KKNEVV
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL
         +D  +QKLEEE++AED TSN+KDLIS +N+DACK NDV +QAEN DNSV GM ESNAEM+NLPVPESILKACKEF  AF TSMS++DVSE+NLIDG+G+
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL

Query:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW
        EEREEFKFF KLF +NESLRRYYENN+DDGEFFCL C  AGKKMLK FKTCGRLLQHTTSL KNK  KKPVQKPHIAKM KMKM+AHRA S VICKVLGW
Subjt:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW

Query:  DIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVNDLDG
        DIEKLPAVV+ GEPLG+SLTK+  ++  D+ VGN+VDNT       ED STKINK+Q+E VG+A    DDIV DDS K N+LQG+S GN A GN+NDLDG
Subjt:  DIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVNDLDG

Query:  VKE
        VKE
Subjt:  VKE

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]; e value 6.9e-232; identity 74.13%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        M+PYSE+RL EEVLHLH+LWRRGPPRNPKP HNHSST VAAAANRNP NKRP D KNR +KKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWPPIEP A
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A PVSSEER NL A QLQYK  +ACRGFFARNADSGSDEE EEEE   + EM+ESEEYKFFLKLFVE+DELRGYYEKN E G FCCLVCGGM K+K 
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GK+FKNCVGLVQHSISISRT KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVA E DSGVQNEN+ IS D+ +KKNEVV
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL
         +D  +QKLEEE++AED TSN+KDLIS +                                   VPESILKACKEF  AF TSMS++DVSE+NLIDG+G+
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL

Query:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW
        EEREEFKFF KLF +NESLRRYYENN+DDGEFFCL C  AGKKMLK FKTCGRLLQHTTSL KNK  KKPVQKPHIAKM KMKM+AHRA S VICKVLGW
Subjt:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW

Query:  DIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVNDLDG
        DIEKLPAVV+ GEPLG+SLTK+  ++  D+ VGN+VDNT       ED STKINK+Q+E VG+A    DDIV DDS K N+LQG+S GN A GN+NDLDG
Subjt:  DIEKLPAVVITGEPLGQSLTKSGVSR--DKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDA----DDIVGDDSIKGNELQGESVGNAAAGNVNDLDG

Query:  VKE
        VKE
Subjt:  VKE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X1; e value 3.0e-188; identity 67.14%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGD---SKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIE
        M+PYS++RL +EVL+LHSLW RGPPRNPKPTH+HSSTAV   A+ NP NKRP D    KN+  KKKKPR +PPQDSGPEWPCPEPVQNQPSTSSGWPPI+
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGD---SKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIE

Query:  PCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGK
        P ATP AQ VSSEER+NL A QLQYK  +ACR FFARNADSGSDEEEEEEEE  D EM+ES+EY FFLK+FVE++ELR YYEKN E G FCCLVC GMGK
Subjt:  PCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGK

Query:  KKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKN
        KK GK+FKNC+ LVQHSISIS T KKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLA+SGDLKVQ EE HV                    D KN
Subjt:  KKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKN

Query:  EV--VSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLI
        EV  VS++E+EQKLEE K+AED TSN+KDLIS EN+DA K+ DV +Q ENADNS+SGMGESN EM+NL V  +IL+ACKEF  AF  SM++DDVSE    
Subjt:  EV--VSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLI

Query:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC
          DG EEREEFKFF KLF +NE+LRRYYEN++ DGEF CL CE AG+K +K FKTC RLLQH+T L KN   +K  QKP   K+ KM MLAHRAY+ V+C
Subjt:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC

Query:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQ
        KVLG DI+ LPA+V+ GE LG SLTKS VS+ +   +    ++ +DD  ED ST++N+L+
Subjt:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQ

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X2; e value 1.3e-188; identity 67.5%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGD---SKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIE
        M+PYS++RL +EVL+LHSLW RGPPRNPKPTH+HSSTAV   A+ NP NKRP D    KN+  KKKKPR +PPQDSGPEWPCPEPVQNQPSTSSGWPPI+
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGD---SKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIE

Query:  PCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGK
        P ATP AQ VSSEER+NL A QLQYK  +ACR FFARNADSGSDEEEEEEEE  D EM+ES+EY FFLK+FVE++ELR YYEKN E G FCCLVC GMGK
Subjt:  PCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGK

Query:  KKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKN
        KK GK+FKNC+ LVQHSISIS T KKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLA+SGDLKVQ EE HV                    D KN
Subjt:  KKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKN

Query:  EV--VSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLI
        EV  VS++E+EQKLEE K+AED TSN+KDLIS EN+DA K+ DV +Q ENADNS+SGMGESN EM+NL V  +IL+ACKEF  AF  SM++DDVSE    
Subjt:  EV--VSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLI

Query:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC
          DG EEREEFKFF KLF +NE+LRRYYEN++ DGEF CL CE AG+K +K FKTC RLLQH+T L KN   +K  QKP   K+ KM MLAHRAY+ V+C
Subjt:  DGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVIC

Query:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQ
        KVLG DI+ LPA+V+ GE LG SLTKS VS+DK   +    ++ +DD  ED ST++N+L+
Subjt:  KVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQ

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X2; e value 1.6e-194; identity 65.58%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSK--NRKHKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPPI
        M+PY E+RL EEVLHLHSLWRRGPP+N K   NHS+ AVA  ANR P NKRPG  +    K KKKKPRP P  PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSK--NRKHKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPPI

Query:  EPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGS--DEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGG
        +PCATP AQPVSSEER  L A QLQYK  +ACRGFFARNADSGS  +EEEEEEEE  D  + + EEYKFFLK+FVE+ EL  YYEKN E GSFCCLVCGG
Subjt:  EPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGS--DEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDEND
        MGKKKSGKRFK+CVGLVQHSISISRT KKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLADSG+ +VQ E+NHVA E   GV++EN D       
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDEND

Query:  KKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNL
                 +NE+KLEE+K+AED  SNAK+  S EN + CKENDVNMQ EN DNS+ GMG    EM+NLPV + I KACKEFF  FS S S      D L
Subjt:  KKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNL

Query:  IDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVI
         DGDGLEEREEFKFF KLF +N+ LR YYE+N++DGEF CL CE AGKK  KGFKTCGRLLQH+TSLAKN+ G+        AKM KMK LAHRAYS  +
Subjt:  IDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVI

Query:  CKVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDADDIVGDDSIKGNELQG
        CKVLGWD+E+LP+VV+ GEPLG+SLTK GVS+D+ +GN   N + S DP E+GS + +KL+D+ V   +D+VG+ S +  ++ G
Subjt:  CKVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDADDIVGDDSIKGNELQG

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X1; e value 1.1e-193; identity 65.03%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSK--NRKHKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPPI
        M+PY E+RL EEVLHLHSLWRRGPP+N K   NHS+ AVA  ANR P NKRPG  +    K KKKKPRP P  PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSK--NRKHKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPPI

Query:  EPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGS--DEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGG
        +PCATP AQPVSSEER  L A QLQYK  +ACRGFFARNADSGS  +EEEEEEEE  D  + + EEYKFFLK+FVE+ EL  YYEKN E GSFCCLVCGG
Subjt:  EPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGS--DEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDEND
        MGKKKSGKRFK+CVGLVQHSISISRT KKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLADSG+ +VQ E+NHVA E   GV++EN D       
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDEND

Query:  KKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNL
                 +NE+KLEE+K+AED  SNAK+  S EN + CKENDVNMQ EN DNS+ GMG    EM+NLPV + I KACKEFF  FS S S      D L
Subjt:  KKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNL

Query:  IDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVI
         DGDGLEEREEFKFF KLF +N+ LR YYE+N++DGEF CL CE AGKK  KGFKTCGRLLQH+TSLAKN+ G+        AKM KMK LAHRAYS  +
Subjt:  IDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVI

Query:  CKVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKP-----VGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDADDIVGDDSIKGNELQG
        CKVLGWD+E+LP+VV+ GEPLG+SLTK GVS+  P     +GN   N + S DP E+GS + +KL+D+ V   +D+VG+ S +  ++ G
Subjt:  CKVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKP-----VGNAVDNTNKSDDPGEDGSTKINKLQDE-VGDADDIVGDDSIKGNELQG

A0A6J1FFD4 uncharacterized protein LOC111443568 isoform X1; e value 8.6e-188; identity 68.48%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA
        MNPYSE+RL EEVL+LHSLW+RGPPR PKPT  + STAVAAA      NKRP D+KNRK KKKKPR EP QD+GPEWPCPEPVQNQPSTSSGWPP+ PCA
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCA

Query:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS
        TP A+ VSSEER N VA QLQYK +EACR F  RNADSGSD EE EEEEG D E++ESEEYKFFL LF+E+DELRGYYEKN E G FCCLVCGGMGKKKS
Subjt:  TPTAQPVSSEERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKS

Query:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV
        GKRFKNC+GLV HS SISRT KK AHRAFGQ VCRVFGWDIDRLPTIVL GEPL RSLA SGD K Q EEN VA E DS V NEN+ I NDE D K    
Subjt:  GKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVV

Query:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL
            NEQK EEEK+AEDL S                                 GE         VPESI +AC+EFF AF TSM++DDVSE+N I     
Subjt:  SMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGL

Query:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW
        EEREEFKFF KLFI+NESLRRYY+N +DDGEF CLVCE AGKK L+ FKTC RLL+HTT   KNKTGKK V KPHIAKM K+KMLAHRAYSLVIC+VLGW
Subjt:  EEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAAGKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGW

Query:  DIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKIN
        DIEKLPA+V+ GE  G SLTK  V +D PVGNA DNTN+ DDP  D ST+I+
Subjt:  DIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTKIN

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G78810.1 unknown protein; e value 1.4e-52; identity 31.62%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPP-RNPKPTHNHS--------------------STAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPC
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                      +   A     +++ P + +N  +  K+PRP    DSG EWP 
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPP-RNPKPTHNHS--------------------STAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPC

Query:  PEPVQNQPSTSSGWPPIEPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNAD------SGSDE---EEEEEEEGID-EEMIESEEYKFFLKLFV
         + V   PST SGWP   PC     +P+S+EE+E L A+ LQ  +   CR FF R +       +G DE   +E +E++ ++ EE   S+E++F  ++F 
Subjt:  PEPVQNQPSTSSGWPPIEPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNAD------SGSDE---EEEEEEEGID-EEMIESEEYKFFLKLFV

Query:  ESDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSE
        E+ +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T+ K  HRA  QVVC V GWD++          P+  S  DS  +   + 
Subjt:  ESDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSE

Query:  ENHVANELDSGVQNENLDISNDENDKKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESI
        E       DS +  E   + + E   K  V+ M +N  +  ++   +D T  A     D  E            EN D ++S                  
Subjt:  ENHVANELDSGVQNENLDISNDENDKKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESI

Query:  LKACKEFFTAFSTSMSEDDVSEDNLIDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAA-GKKMLKGFKTCGRLLQHTTSLAKNKTGK
                                          EE +   K+F +N  L+ YYE N++ G F CLVC AA  KKMLK FK C  ++QH T         
Subjt:  LKACKEFFTAFSTSMSEDDVSEDNLIDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAA-GKKMLKGFKTCGRLLQHTTSLAKNKTGK

Query:  KPVQKPHIAKMFKMKMLAHRAYSLVICKVLGWDIEKLPAVVITG
                 K+ KMK+ AH+ ++  +C++LGWD E LP  V+ G
Subjt:  KPVQKPHIAKMFKMKMLAHRAYSLVICKVLGWDIEKLPAVVITG

AT1G78810.2 unknown protein; e value 1.4e-52; identity 31.62%
Query:  MNPYSEQRLAEEVLHLHSLWRRGPP-RNPKPTHNHS--------------------STAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPC
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                      +   A     +++ P + +N  +  K+PRP    DSG EWP 
Subjt:  MNPYSEQRLAEEVLHLHSLWRRGPP-RNPKPTHNHS--------------------STAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPC

Query:  PEPVQNQPSTSSGWPPIEPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNAD------SGSDE---EEEEEEEGID-EEMIESEEYKFFLKLFV
         + V   PST SGWP   PC     +P+S+EE+E L A+ LQ  +   CR FF R +       +G DE   +E +E++ ++ EE   S+E++F  ++F 
Subjt:  PEPVQNQPSTSSGWPPIEPCATPTAQPVSSEERENLVASQLQYKVVEACRGFFARNAD------SGSDE---EEEEEEEGID-EEMIESEEYKFFLKLFV

Query:  ESDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSE
        E+ +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T+ K  HRA  QVVC V GWD++          P+  S  DS  +   + 
Subjt:  ESDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTNKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSE

Query:  ENHVANELDSGVQNENLDISNDENDKKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESI
        E       DS +  E   + + E   K  V+ M +N  +  ++   +D T  A     D  E            EN D ++S                  
Subjt:  ENHVANELDSGVQNENLDISNDENDKKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDENEDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESI

Query:  LKACKEFFTAFSTSMSEDDVSEDNLIDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAA-GKKMLKGFKTCGRLLQHTTSLAKNKTGK
                                          EE +   K+F +N  L+ YYE N++ G F CLVC AA  KKMLK FK C  ++QH T         
Subjt:  LKACKEFFTAFSTSMSEDDVSEDNLIDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAA-GKKMLKGFKTCGRLLQHTTSLAKNKTGK

Query:  KPVQKPHIAKMFKMKMLAHRAYSLVICKVLGWDIEKLPAVVITG
                 K+ KMK+ AH+ ++  +C++LGWD E LP  V+ G
Subjt:  KPVQKPHIAKMFKMKMLAHRAYSLVICKVLGWDIEKLPAVVITG
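
In the alignment blocks above, the line between `Query:` and `Subjt:` appears to follow the usual BLAST midline convention: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. Taking that convention as an assumption, the following minimal sketch (the function name `alignment_stats` is hypothetical) computes the identity and positive fractions for a single block:

```python
def alignment_stats(query: str, midline: str) -> tuple[float, float]:
    """Return (identity, positives) fractions for one aligned block.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if not query:
        raise ValueError("empty query row")
    # Pad the midline, since trailing spaces are often stripped in text dumps.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    columns = len(query)
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical / columns, positive / columns


# First 20 columns of the first GenBank alignment block.
identity, positives = alignment_stats(
    "MNPYSEQRLAEEVLHLHSLW",
    "M+PYSE+RL EEVLHLH+LW",
)
print(f"identity={identity:.2f} positives={positives:.2f}")
```

The % identity reported for each hit covers the whole alignment, so a value computed from one block or a fragment of it will generally differ from the reported figure.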


Sequences
CDS sequence
ATGAATCCCTACTCCGAGCAAAGGCTCGCCGAAGAGGTCCTCCATCTCCACTCTTTATGGCGGCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAATCATTCATCCAC
TGCCGTCGCCGCTGCCGCGAATCGCAACCCTTTGAACAAGAGACCTGGAGACTCAAAGAATCGGAAGCACAAGAAAAAGAAACCACGCCCCGAGCCACCGCAAGACTCCG
GCCCCGAATGGCCCTGCCCGGAGCCGGTTCAAAATCAGCCCTCCACGTCATCTGGGTGGCCGCCGATCGAGCCCTGTGCCACTCCGACGGCTCAGCCGGTGTCGTCTGAA
GAGCGGGAAAATCTTGTGGCGTCGCAATTGCAGTACAAGGTAGTCGAGGCCTGCCGGGGATTCTTCGCTAGAAATGCCGATTCGGGGAGTGACGAAGAGGAGGAGGAAGA
AGAAGAGGGTATTGATGAGGAAATGATCGAAAGTGAAGAATATAAGTTCTTTTTGAAGCTTTTTGTGGAGAGTGATGAACTTAGGGGTTATTACGAGAAGAATTCCGAAG
GTGGGTCGTTTTGTTGCTTGGTTTGCGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGTGTTGGGCTTGTTCAACATTCGATTTCAATATCGAGGACA
AACAAGAAGCGGGCTCACAGGGCTTTTGGACAGGTCGTGTGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATC
ATTAGCCGATTCTGGAGACTTGAAGGTTCAGTCTGAGGAAAATCATGTGGCTAACGAGCTTGACTCTGGGGTTCAGAATGAAAATTTAGACATTTCGAATGATGAAAATG
ATAAGAAGAATGAAGTGGTTTCTATGGATGAGAATGAACAGAAATTAGAGGAAGAAAAGTCAGCTGAAGATCTCACTTCTAATGCTAAAGATTTGATTTCTGACGAGAAT
GAAGATGCTTGCAAAGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAGTTTCAGGCATGGGAGAAAGCAATGCAGAAATGGAAAATTTGCCTGTACCGGA
GTCAATTTTGAAAGCCTGCAAAGAATTTTTTACAGCCTTCTCCACATCTATGAGTGAGGATGATGTTAGTGAGGATAACTTAATTGATGGAGATGGACTTGAGGAACGCG
AAGAGTTTAAGTTCTTTTTTAAATTGTTTATCAAGAACGAGAGCTTGAGAAGATATTACGAGAACAACCATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGCAGCA
GGAAAGAAAATGCTGAAGGGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACTTCTCTAGCAAAGAACAAAACAGGGAAAAAACCAGTCCAGAAGCCTCACATTGC
TAAAATGTTCAAAATGAAGATGCTAGCTCATAGGGCATACAGTTTAGTTATATGCAAGGTTCTTGGTTGGGACATCGAAAAGCTTCCTGCAGTCGTGATAACAGGCGAAC
CTCTTGGTCAATCCTTAACAAAATCAGGCGTGTCGAGGGACAAACCCGTTGGCAATGCAGTCGATAATACAAACAAATCAGATGATCCTGGAGAAGATGGCTCTACAAAG
ATTAACAAATTGCAGGACGAGGTTGGCGATGCAGATGATATCGTAGGAGATGACTCGATAAAGGGTAACGAATTGCAGGGTGAGTCTGTTGGCAATGCAGCAGCTGGTAA
TGTGAATGATTTAGATGGCGTAAAGGAAAATTAA
mRNA sequence
CGGATGATGGGAACTGCGAAAGCGATGATGGTTCAATAGGTATAAACAGAACCCGACCTCAATTACTCTGCTTCTTGATTCCGCCATTTTCACACCAATGAATCCCTACT
CCGAGCAAAGGCTCGCCGAAGAGGTCCTCCATCTCCACTCTTTATGGCGGCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAATCATTCATCCACTGCCGTCGCCGCT
GCCGCGAATCGCAACCCTTTGAACAAGAGACCTGGAGACTCAAAGAATCGGAAGCACAAGAAAAAGAAACCACGCCCCGAGCCACCGCAAGACTCCGGCCCCGAATGGCC
CTGCCCGGAGCCGGTTCAAAATCAGCCCTCCACGTCATCTGGGTGGCCGCCGATCGAGCCCTGTGCCACTCCGACGGCTCAGCCGGTGTCGTCTGAAGAGCGGGAAAATC
TTGTGGCGTCGCAATTGCAGTACAAGGTAGTCGAGGCCTGCCGGGGATTCTTCGCTAGAAATGCCGATTCGGGGAGTGACGAAGAGGAGGAGGAAGAAGAAGAGGGTATT
GATGAGGAAATGATCGAAAGTGAAGAATATAAGTTCTTTTTGAAGCTTTTTGTGGAGAGTGATGAACTTAGGGGTTATTACGAGAAGAATTCCGAAGGTGGGTCGTTTTG
TTGCTTGGTTTGCGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGTGTTGGGCTTGTTCAACATTCGATTTCAATATCGAGGACAAACAAGAAGCGGG
CTCACAGGGCTTTTGGACAGGTCGTGTGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATTAGCCGATTCT
GGAGACTTGAAGGTTCAGTCTGAGGAAAATCATGTGGCTAACGAGCTTGACTCTGGGGTTCAGAATGAAAATTTAGACATTTCGAATGATGAAAATGATAAGAAGAATGA
AGTGGTTTCTATGGATGAGAATGAACAGAAATTAGAGGAAGAAAAGTCAGCTGAAGATCTCACTTCTAATGCTAAAGATTTGATTTCTGACGAGAATGAAGATGCTTGCA
AAGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAGTTTCAGGCATGGGAGAAAGCAATGCAGAAATGGAAAATTTGCCTGTACCGGAGTCAATTTTGAAA
GCCTGCAAAGAATTTTTTACAGCCTTCTCCACATCTATGAGTGAGGATGATGTTAGTGAGGATAACTTAATTGATGGAGATGGACTTGAGGAACGCGAAGAGTTTAAGTT
CTTTTTTAAATTGTTTATCAAGAACGAGAGCTTGAGAAGATATTACGAGAACAACCATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGCAGCAGGAAAGAAAATGC
TGAAGGGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACTTCTCTAGCAAAGAACAAAACAGGGAAAAAACCAGTCCAGAAGCCTCACATTGCTAAAATGTTCAAA
ATGAAGATGCTAGCTCATAGGGCATACAGTTTAGTTATATGCAAGGTTCTTGGTTGGGACATCGAAAAGCTTCCTGCAGTCGTGATAACAGGCGAACCTCTTGGTCAATC
CTTAACAAAATCAGGCGTGTCGAGGGACAAACCCGTTGGCAATGCAGTCGATAATACAAACAAATCAGATGATCCTGGAGAAGATGGCTCTACAAAGATTAACAAATTGC
AGGACGAGGTTGGCGATGCAGATGATATCGTAGGAGATGACTCGATAAAGGGTAACGAATTGCAGGGTGAGTCTGTTGGCAATGCAGCAGCTGGTAATGTGAATGATTTA
GATGGCGTAAAGGAAAATTAATCCATGAAGGTTGATAGCAGTGGTGAAGCAGTTTTGAAGGATGATGCTCTGATGGGACTTTAGCAGTTAATCAAATATAGATGGGACTT
TAGCAGGCAGTAGCTTAAGTAGTAAAATCTGCCTTAAAAAATTTCTTCTCAATTATTTCTGTTGTTTTCTCTGTCTCAGATGAAAGCCAGTTTAAGAGAGAGTCAATGTT
AAGAAAAGATGCAAAGATGAAAGTATTATTAGAATGGAAACTGCTCTTTGTTAATCTATTTATTGTTGCTAAAGATGAAAAAACAACATAAATTTTGTATGGCCTCTGGA
GTCTGGTACACAAATATTGTGAATCTTCTAATTTTTTTAGATTTTGATCAA
Protein sequence
MNPYSEQRLAEEVLHLHSLWRRGPPRNPKPTHNHSSTAVAAAANRNPLNKRPGDSKNRKHKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPPIEPCATPTAQPVSSE
ERENLVASQLQYKVVEACRGFFARNADSGSDEEEEEEEEGIDEEMIESEEYKFFLKLFVESDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRT
NKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVANELDSGVQNENLDISNDENDKKNEVVSMDENEQKLEEEKSAEDLTSNAKDLISDEN
EDACKENDVNMQAENADNSVSGMGESNAEMENLPVPESILKACKEFFTAFSTSMSEDDVSEDNLIDGDGLEEREEFKFFFKLFIKNESLRRYYENNHDDGEFFCLVCEAA
GKKMLKGFKTCGRLLQHTTSLAKNKTGKKPVQKPHIAKMFKMKMLAHRAYSLVICKVLGWDIEKLPAVVITGEPLGQSLTKSGVSRDKPVGNAVDNTNKSDDPGEDGSTK
INKLQDEVGDADDIVGDDSIKGNELQGESVGNAAAGNVNDLDGVKEN
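
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and reading frame 0; the `translate` helper is illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATCCCTACTCCGAGCAA"))  # first seven codons of the CDS above
print(translate("GTAAAGGAAAATTAA"))        # last five codons, ending in the stop
```

These two fragments translate to `MNPYSEQ` and `VKEN`, matching the start and end of the protein sequence listed above. Passing the full CDS text, line breaks included, translates the entire protein for comparison.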