CuGenDBv2

Spg016146 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg016146
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold9:38414187..38417527
RNA-Seq Expression: Spg016146
Synteny: Spg016146
Gene Ontology terms: NA
InterPro domains: NA
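The genome location above is written as a sequence name followed by a start..end range. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be parsed and the genomic span measured as follows. `Location` and `parse_location` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as scaffold9:38414187..38417527."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("scaffold9:38414187..38417527")
print(loc.seqid, loc.length)  # scaffold9 3341
```

Under that assumption the gene spans 3341 bp on scaffold9.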


Homology
GenBank top hits (e value, % identity, alignment)
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida] (e value: 8.4e-232; % identity: 78.27)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWP IEP A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AA PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS DD +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +QKLEEE+T EDPTS  KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        LIDG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAGKKMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV
        ICKVLGWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES  +AV
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida] (e value: 7.6e-233; % identity: 78.37)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWP IEP A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AA PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS DD +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +QKLEEE+T EDPTS  KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        LIDG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAGKKMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV
        ICKVLGWD+EKLPAVVLKGEPLG SLTK    +D    N  D+  EDDS KINK+Q+ES  +AV
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida] (e value: 2.3e-229; % identity: 77.92)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWP IEP A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AA PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ K  PEENHVAKEHDS VQ+ENVAIS DD +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +QKLEEE+T EDPTS  KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        LIDG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAGKKMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV
        ICKVLGWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES  +AV
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida] (e value: 2.0e-233; % identity: 78.97)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWP IEP A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AA PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS DD +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD
        VV +D  +QKLEEE+T EDPTS  KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+NL V ESILKACKEF AAF TS SD+DVSENNLIDG+
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL
        GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAGKKMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S VICKVL
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL

Query:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV
        GWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES  +AV
Subjt:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida] (e value: 5.3e-218; % identity: 75.4)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR EP QDSGPEWPCPEPVQNQPSTSSGWP IEP A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AA PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS DD +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD
        VV +D  +QKLEEE+T EDPTS  KDLISG+                                   V ESILKACKEF AAF TS SD+DVSENNLIDG+
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL
        GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAGKKMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S VICKVL
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL

Query:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV
        GWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES  +AV
Subjt:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKSDDPVEDDSAKINKVQDESTVDAV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X1 (e value: 2.0e-191; % identity: 68.35)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR +PPQDSGPEWPCPEPVQNQPSTSSGWP I+P A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AAQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD KVQPEE HV                  DN  E  
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD
         VSV+E+EQKLEE KT EDPTS  KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE      D
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG
        G EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AG+K +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKVLG
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG

Query:  WDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKS---------DDPVEDDSAKINKVQ
         D++ LPA+VL GE LG SLTK  V +  +KS         DD VEDDS ++N+++
Subjt:  WDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKS---------DDPVEDDSAKINKVQ

A0A1S3CJZ1 uncharacterized protein LOC103501816 isoform X3 (e value: 7.3e-189; % identity: 67.99)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR +PPQDSGPEWPCPEPVQNQPSTSSGWP I+P A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AAQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD K  PEE HV                  DN  E  
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD
         VSV+E+EQKLEE KT EDPTS  KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE      D
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG
        G EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AG+K +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKVLG
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG

Query:  WDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKS---------DDPVEDDSAKINKVQ
         D++ LPA+VL GE LG SLTK  V +  +KS         DD VEDDS ++N+++
Subjt:  WDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKS---------DDPVEDDSAKINKVQ

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X2 (e value: 4.1e-192; % identity: 68.41)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR +PPQDSGPEWPCPEPVQNQPSTSSGWP I+P A
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCA

Query:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T AAQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD KVQPEE HV                  DN  E  
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENE

Query:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD
         VSV+E+EQKLEE KT EDPTS  KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE      D
Subjt:  VVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG
        G EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AG+K +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKVLG
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLG

Query:  WDMEKLPAVVLKGEPLGHSLTKPGVPQD-------TNKSDDPVEDDSAKINKVQ
         D++ LPA+VL GE LG SLTK  V +D       ++ +DD VEDDS ++N+++
Subjt:  WDMEKLPAVVLKGEPLGHSLTKPGVPQD-------TNKSDDPVEDDSAKINKVQ

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X2 (e value: 2.6e-194; % identity: 67.74)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPK--DRKNKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPTI
        M+PY E+ LTEEVLHLHSLWRRGPP+N K   NHS+ A   VA+R PSNKRP  P+    K KKKKPRP P  PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPK--DRKNKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPTI

Query:  EPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG
        +PCAT AAQPVSSEER   +ALQLQYK  +ACRGFF R ADSGS+ EEEEEEEEE N+GG+ + EEYKFFLK+FVEN EL  YYEKN E GSFCCLVCGG
Subjt:  EPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDND
        MGKKKSGKRFK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLA+SG+ +VQPE+NHVAKE    V+SE     NDD  
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDND

Query:  EENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL
                 +NE+KLEE+K  EDP S  K+  SGEN    KENDVNMQ EN DNSI GMG  K EM+NL V + I KACKEFFA FS STSD+      L
Subjt:  EENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL

Query:  IDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI
         DGDG+EEREEFKFF KLF EN+ LR YYE+NY+DGEF CL CEGAGKK  K FKTCGRLLQH+TSLAK + G+  P     AKM+KMK LAHRAYS  +
Subjt:  IDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI

Query:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT------NKSDDPVEDDSAKINKVQDES
        CKVLGWD+E+LP+VVLKGEPLG SLTKPGV +D       + S DP+E+ S + +K++D++
Subjt:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT------NKSDDPVEDDSAKINKVQDES

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X1 (e value: 2.2e-193; % identity: 67.02)
Query:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPK--DRKNKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPTI
        M+PY E+ LTEEVLHLHSLWRRGPP+N K   NHS+ A   VA+R PSNKRP  P+    K KKKKPRP P  PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPK--DRKNKKKKPRPEP--PQDSGPEWPCPEPVQNQPSTSSGWPTI

Query:  EPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG
        +PCAT AAQPVSSEER   +ALQLQYK  +ACRGFF R ADSGS+ EEEEEEEEE N+GG+ + EEYKFFLK+FVEN EL  YYEKN E GSFCCLVCGG
Subjt:  EPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDND
        MGKKKSGKRFK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLA+SG+ +VQPE+NHVAKE    V+SE     NDD  
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDND

Query:  EENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL
                 +NE+KLEE+K  EDP S  K+  SGEN    KENDVNMQ EN DNSI GMG  K EM+NL V + I KACKEFFA FS STSD+      L
Subjt:  EENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL

Query:  IDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI
         DGDG+EEREEFKFF KLF EN+ LR YYE+NY+DGEF CL CEGAGKK  K FKTCGRLLQH+TSLAK + G+  P     AKM+KMK LAHRAYS  +
Subjt:  IDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGKKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI

Query:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNK------------SDDPVEDDSAKINKVQDES
        CKVLGWD+E+LP+VVLKGEPLG SLTKPGV + + K            S DP+E+ S + +K++D++
Subjt:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNK------------SDDPVEDDSAKINKVQDES

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G78810.1 unknown protein (e value: 1.0e-54; % identity: 31.7)
Query:  MNPYSEKILTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSG
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                            +  + SRNP+N     P++  N  K+PRP    DSG
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSG

Query:  PEWPCPEPVQNQPSTSSGWPTIEPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF
         EWP  + V   PST SGWP   PC     +P+S+EE+E  AA  LQ      CR FFGRK+       +G DE E  E +E++         S+E++F 
Subjt:  PEWPCPEPVQNQPSTSSGWPTIEPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF

Query:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS
         ++F EN +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+                       
Subjt:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS

Query:  KVQPEENHVAKEHDSAVQSENVAISNDDNDEENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH
                          +  V  S  D+    E  S   ++ K+ +EK                         V    E+A  ++  M ++ SE     
Subjt:  KVQPEENHVAKEHDSAVQSENVAISNDDNDEENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH

Query:  VSESILKACKEFFAAFSTSTSD--DDVSENNLIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GKKMLKSFKTCGRLLQHTTSL
               A K+ F    T  +D  ++  + NL         EE +   K+F+EN  L+ YYE NY+ G F CLVC  A  KKMLK FK C  ++QH T  
Subjt:  VSESILKACKEFFAAFSTSTSD--DDVSENNLIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GKKMLKSFKTCGRLLQHTTSL

Query:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKSDDPVEDDSA--KINKVQDESTVDA
                       K+ KMKI AH+ ++  +C++LGWD E LP  V+KG     SL      ++   +   VE+     K    QD +  +A
Subjt:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKSDDPVEDDSA--KINKVQDESTVDA

AT1G78810.2 unknown protein (e value: 1.0e-54; % identity: 31.7)
Query:  MNPYSEKILTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSG
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                            +  + SRNP+N     P++  N  K+PRP    DSG
Subjt:  MNPYSEKILTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSG

Query:  PEWPCPEPVQNQPSTSSGWPTIEPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF
         EWP  + V   PST SGWP   PC     +P+S+EE+E  AA  LQ      CR FFGRK+       +G DE E  E +E++         S+E++F 
Subjt:  PEWPCPEPVQNQPSTSSGWPTIEPCATLAAQPVSSEERENHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF

Query:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS
         ++F EN +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+                       
Subjt:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS

Query:  KVQPEENHVAKEHDSAVQSENVAISNDDNDEENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH
                          +  V  S  D+    E  S   ++ K+ +EK                         V    E+A  ++  M ++ SE     
Subjt:  KVQPEENHVAKEHDSAVQSENVAISNDDNDEENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH

Query:  VSESILKACKEFFAAFSTSTSD--DDVSENNLIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GKKMLKSFKTCGRLLQHTTSL
               A K+ F    T  +D  ++  + NL         EE +   K+F+EN  L+ YYE NY+ G F CLVC  A  KKMLK FK C  ++QH T  
Subjt:  VSESILKACKEFFAAFSTSTSD--DDVSENNLIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GKKMLKSFKTCGRLLQHTTSL

Query:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKSDDPVEDDSA--KINKVQDESTVDA
                       K+ KMKI AH+ ++  +C++LGWD E LP  V+KG     SL      ++   +   VE+     K    QD +  +A
Subjt:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKSDDPVEDDSA--KINKVQDESTVDA
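
The alignment blocks above use the usual BLASTP layout, in which the unlabeled middle line carries a letter where the two residues are identical, a `+` where the substitution is conservative, and a space for a mismatch or gap. As a minimal sketch, the counts for one block can be recovered from that middle line alone. The function name `midline_stats` is hypothetical, and the input is assumed to be the midline cut to exactly the same columns as its Query line, with leading and internal spaces preserved.

```python
def midline_stats(midline: str) -> dict:
    """Summarize one BLASTP midline: letters are identities, '+' are conservative."""
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    length = len(midline)
    return {
        "length": length,
        "identities": identities,
        # "Positives" in BLAST terms: identities plus conservative substitutions.
        "positives": identities + conservative,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Toy midline: three identical residues, one conservative change, one mismatch.
stats = midline_stats("MK+ L")
print(stats["identities"], stats["positives"], stats["percent_identity"])  # 3 4 60.0
```

Per-block figures computed this way need not equal the % identity reported for a hit, which covers the whole alignment.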


Sequences
CDS sequence
ATGAATCCCTACTCCGAGAAAATACTCACCGAAGAGGTCCTCCATCTCCACTCTCTATGGCGGCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGCCGTCGCGAGTCGGAACCCCTCGAACAAGAGACCCAGAGACCCAAAGGATCGAAAGAACAAGAAGAAGAAACCACGCCCAGAGCCACCGCAAGACTCCGGCCCCGAGT
GGCCCTGCCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGACGATCGAGCCCTGTGCAACTCTGGCGGCTCAGCCGGTGTCGTCTGAAGAGCGGGAA
AATCATGCGGCGTTGCAATTGCAGTACAAGGGACTCGAGGCCTGCCGGGGATTTTTCGGTAGAAAGGCCGATTCGGGAAGTGACGAAGAGGAAGAGGAGGAGGAGGAAGA
AGAGGGGAATAATGGTGGGATGATGGAAAGTGAAGAGTATAAGTTCTTTTTGAAGCTGTTTGTGGAGAACGATGAACTTAGGGGTTACTACGAGAAGAATTCTGAAGGTG
GGTCGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGCGTTGGGCTTGTTCAACATTCGATTTCCATATCGAGGACAAAG
AAGAAGCGGGCTCATAGGGCTTTTGGACAGGTTGTATGCAGGGTTTTTGGATGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATT
AGCCAATTCTGGAGACTCCAAGGTTCAGCCTGAGGAAAATCATGTGGCTAAAGAACATGACTCTGCGGTTCAGAGTGAAAACGTAGCCATTTCCAATGATGACAATGATG
AGGAGAATGAAGTGGTCTCGGTGGATGAGAATGAACAGAAATTGGAGGAAGAAAAGACAGTTGAAGATCCCACTTCTATTGTTAAAGATTTGATTTCTGGTGAGAATGAA
ACTATTGGCAAGGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAATTTCAGGCATGGGAGAAAGCAAATCTGAAATGGAAAACTTGCATGTGTCGGAGTC
GATTTTGAAAGCCTGTAAAGAATTTTTTGCAGCCTTCTCCACATCTACGAGTGACGATGATGTTAGTGAAAATAACTTAATAGATGGAGATGGAGTTGAGGAACGCGAAG
AGTTCAAGTTCTTTTTTAAGTTGTTCGCCGAGAATGAAAGCTTGAGAAGGTATTACGAGAACAACTATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGGAGCAGGA
AAGAAAATGTTGAAGAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACATCTCTAGCGAAAGGGAAAACAGGAAAAAAACCAGTCAAGCCTCACATTGCTAAAAT
GATGAAAATGAAGATACTGGCTCATAGGGCATACAGTATAGTTATATGCAAGGTTCTTGGTTGGGACATGGAAAAGCTTCCCGCAGTCGTGTTAAAAGGCGAACCTCTCG
GTCATTCCTTAACAAAGCCGGGCGTGCCACAGGATACGAACAAATCGGATGATCCTGTAGAAGATGACTCTGCAAAGATTAACAAAGTGCAGGATGAATCGACTGTCGAT
GCAGTTCGCATAAGATGA
mRNA sequence
ATGAATCCCTACTCCGAGAAAATACTCACCGAAGAGGTCCTCCATCTCCACTCTCTATGGCGGCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGCCGTCGCGAGTCGGAACCCCTCGAACAAGAGACCCAGAGACCCAAAGGATCGAAAGAACAAGAAGAAGAAACCACGCCCAGAGCCACCGCAAGACTCCGGCCCCGAGT
GGCCCTGCCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGACGATCGAGCCCTGTGCAACTCTGGCGGCTCAGCCGGTGTCGTCTGAAGAGCGGGAA
AATCATGCGGCGTTGCAATTGCAGTACAAGGGACTCGAGGCCTGCCGGGGATTTTTCGGTAGAAAGGCCGATTCGGGAAGTGACGAAGAGGAAGAGGAGGAGGAGGAAGA
AGAGGGGAATAATGGTGGGATGATGGAAAGTGAAGAGTATAAGTTCTTTTTGAAGCTGTTTGTGGAGAACGATGAACTTAGGGGTTACTACGAGAAGAATTCTGAAGGTG
GGTCGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGCGTTGGGCTTGTTCAACATTCGATTTCCATATCGAGGACAAAG
AAGAAGCGGGCTCATAGGGCTTTTGGACAGGTTGTATGCAGGGTTTTTGGATGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATT
AGCCAATTCTGGAGACTCCAAGGTTCAGCCTGAGGAAAATCATGTGGCTAAAGAACATGACTCTGCGGTTCAGAGTGAAAACGTAGCCATTTCCAATGATGACAATGATG
AGGAGAATGAAGTGGTCTCGGTGGATGAGAATGAACAGAAATTGGAGGAAGAAAAGACAGTTGAAGATCCCACTTCTATTGTTAAAGATTTGATTTCTGGTGAGAATGAA
ACTATTGGCAAGGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAATTTCAGGCATGGGAGAAAGCAAATCTGAAATGGAAAACTTGCATGTGTCGGAGTC
GATTTTGAAAGCCTGTAAAGAATTTTTTGCAGCCTTCTCCACATCTACGAGTGACGATGATGTTAGTGAAAATAACTTAATAGATGGAGATGGAGTTGAGGAACGCGAAG
AGTTCAAGTTCTTTTTTAAGTTGTTCGCCGAGAATGAAAGCTTGAGAAGGTATTACGAGAACAACTATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGGAGCAGGA
AAGAAAATGTTGAAGAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACATCTCTAGCGAAAGGGAAAACAGGAAAAAAACCAGTCAAGCCTCACATTGCTAAAAT
GATGAAAATGAAGATACTGGCTCATAGGGCATACAGTATAGTTATATGCAAGGTTCTTGGTTGGGACATGGAAAAGCTTCCCGCAGTCGTGTTAAAAGGCGAACCTCTCG
GTCATTCCTTAACAAAGCCGGGCGTGCCACAGGATACGAACAAATCGGATGATCCTGTAGAAGATGACTCTGCAAAGATTAACAAAGTGCAGGATGAATCGACTGTCGAT
GCAGTTCGCATAAGATGA
Protein sequence
MNPYSEKILTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKNKKKKPRPEPPQDSGPEWPCPEPVQNQPSTSSGWPTIEPCATLAAQPVSSEERE
NHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTK
KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDDNDEENEVVSVDENEQKLEEEKTVEDPTSIVKDLISGENE
TIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLIDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAG
KKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKSDDPVEDDSAKINKVQDESTVD
AVRIR
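
The protein sequence above appears to be the CDS read codon by codon with the standard genetic code, ending at the terminal TGA stop codon. As a minimal sketch of that relationship, the snippet below builds the standard codon table and translates the first and last stretches of the CDS shown on this page. `translate` is a hypothetical helper, and translating unknown codons to `X` is a choice made here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATCCCTACTCCGAGAAAATACTCACC"))  # MNPYSEKILT (start of the CDS)
print(translate("GCAGTTCGCATAAGATGA"))              # AVRIR (end of the CDS, before TGA)
```

Passing the full CDS as one string (line breaks are ignored) would let the result be compared against the listed protein sequence.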