; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g00200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g00200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr2:186588..188986
RNA-Seq ExpressionMoc02g00200
SyntenyMoc02g00200
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605833.1 hypothetical protein SDJN03_03150, partial [Cucurbita argyrosperma subsp. sororia]2.6e-24195.01Show/hide
Query:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD
        M PA  GRGT AGRA VLVF LW LI LS AARLSPSM  LEVQKHLRR NKPPLKTI+SPDGDIIDCVHISNQPAFDHPFLKDHKI TRPTYHPEGLFD
Subjt:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD

Query:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENKVSEKPKERTNPI QLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR++PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE
        QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDP+E
Subjt:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

TYK21736.1 uncharacterized protein E5676_scaffold859G001080 [Cucumis melo var. makuwa]1.3e-24094.52Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPA   R TAGRAL+L+F LW +ISLS AARLSPSM NLEVQKHLRR NKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKE TNPI QLWHANGRCPENTIPIRRTKEDDVLRASS KRYGKKRHR +PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNS+IAMGASISPVS FRS QYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

XP_022153819.1 uncharacterized protein LOC111021244 [Momordica charantia]4.8e-256100Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

XP_022958627.1 uncharacterized protein LOC111459796 [Cucurbita moschata]2.6e-24195.01Show/hide
Query:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD
        M PA  GRGT AGRA VLVF LW LI LS AARLSPSM  LEVQKHLRR NKPPLKTI+SPDGDIIDCVHISNQPAFDHPFLKDHKI TRPTYHPEGLFD
Subjt:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD

Query:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENKVSEKPKERTNPI QLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR++PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE
        QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDP+E
Subjt:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

XP_038901449.1 uncharacterized protein LOC120088312 [Benincasa hispida]5.2e-24294.52Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        M P+  GRGTAGR L+ +F LW LI LS++ARLSPSM NLEVQKHLRR NKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKI TRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKERTNPI QLWH+NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRT+PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVS FRSPQYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

TrEMBL top hitse value%identityAlignment
A0A0A0KWZ7 Uncharacterized protein4.0e-24093.81Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPA   R TAGRAL+L+F LW LISLS A+RLSPSM NLEVQKHLRR NKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKE +NPI QLWHANGRCPENTIP+RRTKEDDVLRASS KRYGKKRHRT+PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQV+S+IAMGASISPVS FR+ QYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A1S3BN34 uncharacterized protein LOC1034916322.3e-24094.52Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPA   R TAG AL+L+F LW LISLS AARLSPSM NLEVQKHLRR NKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKE TNPI QLWHANGRCPENTIPIRRTKEDDVLRASS KRYGKKRHR +PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNS+IAMGASISPVS FRS QYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A5D3DDU5 Uncharacterized protein6.2e-24194.52Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPA   R TAGRAL+L+F LW +ISLS AARLSPSM NLEVQKHLRR NKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKE TNPI QLWHANGRCPENTIPIRRTKEDDVLRASS KRYGKKRHR +PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNS+IAMGASISPVS FRS QYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A6J1DK76 uncharacterized protein LOC1110212442.3e-256100Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
        PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A6J1H2C7 uncharacterized protein LOC1114597961.2e-24195.01Show/hide
Query:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD
        M PA  GRGT AGRA VLVF LW LI LS AARLSPSM  LEVQKHLRR NKPPLKTI+SPDGDIIDCVHISNQPAFDHPFLKDHKI TRPTYHPEGLFD
Subjt:  MGPAHSGRGT-AGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFD

Query:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENKVSEKPKERTNPI QLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR++PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE
        QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDP+E
Subjt:  QPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)9.8e-20779.29Show/hide
Query:  GPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDEN
        G  H       R  ++  CLWG  SLS AAR   S    EV+KHL R NKP +K+IQS DGD+IDCV IS QPAFDHPFLKDHKIQ +P YHPEGLFD+N
Subjt:  GPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDEN

Query:  KVS-EKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        KVS  K  E+   I QLWH  G+C E TIP+RRTKEDDVLRASS KRYGKK+ R++P P+SA+PDLINQSGHQHAIAYVEGDK+YGAKATINVWEPKIQQ
Subjt:  KVS-EKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
         NEFSLSQ+W+LGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS +R+ QYDISILIWKDP EG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFGN YVLGYWPSFLFSYL +SASMIEWGGEVVNS+ DG HTSTQMGSG FPEEGF KASYFRNIQVVD SNNLKAPKG+GTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        N DWGHYFYYGGPG+NQ CP
Subjt:  NGDWGHYFYYGGPGRNQNCP

AT2G44210.1 Protein of Unknown Function (DUF239)2.8e-16164.94Show/hide
Query:  NLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDENKVSEKPK-ERTNPITQLWHANGRCPENTIPIRRTKED
        +L+++ HL+R NKP LK+I+SPDGD+IDCV I++QPAF HP L +H +Q  P+ +PE +F E+KVS K K +++N I QLWH NG+CP+NTIPIRRT+  
Subjt:  NLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDENKVSEKPK-ERTNPITQLWHANGRCPENTIPIRRTKED

Query:  DVLRASSAKRYGKKRHRTLPQPRSAD-PDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGD
        D+ RASS + YG K  +++P+P+S++ P+++ Q+GHQHAI YVE   FYGAKA INVW+P ++ PNEFSL+Q+W+LGG+F  DLNSIEAGWQVSP LYGD
Subjt:  DVLRASSAKRYGKKRHRTLPQPRSAD-PDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGD

Query:  NNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEV
        N TRLFTYWTSDAYQ TGCYNLLCSGF+Q+N EIAMG SISP+S + + QYDI+ILIWKDP EGHWW+QFG  Y++GYWP+ LFSYL++SASMIEWGGEV
Subjt:  NNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEV

Query:  VNSE-PDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP
        VNS+  +G HT+TQMGSG F EEG+GKASYF+N+QVVD SN L+ P+ +  FT++ NCY+V++G+ G WG YFYYGGPGRN NCP
Subjt:  VNSE-PDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP

AT3G13510.1 Protein of Unknown Function (DUF239)5.8e-20781.84Show/hide
Query:  CLWGLISLS-RAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDENKVSEKPKERTNPITQLW
        CLW ++SLS  AA    S    EV+KHL R NKPP+KTIQSPDGDIIDC+ IS QPAFDHPFLKDHKIQ RP+YHPEGLFD+NKVS +PK +   I QLW
Subjt:  CLWGLISLS-RAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDENKVSEKPKERTNPITQLW

Query:  HANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGQ
        H  G+C E TIP+RRT+EDDVLRASS KRYGKK+HR++P P+SA+PDLINQ+GHQHAIAYVEGDK+YGAKAT+NVWEPKIQ  NEFSLSQ+W+LGGSFGQ
Subjt:  HANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGQ

Query:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEGHWWMQFGNDYVLGYWPSF
        DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS +R+ QYDISILIWKDP EGHWWMQFGN YVLGYWPSF
Subjt:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEGHWWMQFGNDYVLGYWPSF

Query:  LFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGSNGDWGHYFYYGGPGRNQN
        LFSYL +SASMIEWGGEVVNS+ +G HT TQMGSGHFPEEGF KASYFRNIQVVD SNNLKAPKG+GTFTEKSNCYDVQTGSN DWGHYFYYGGPG+N+N
Subjt:  LFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGSNGDWGHYFYYGGPGRNQN

Query:  CP
        CP
Subjt:  CP

AT5G56530.1 Protein of Unknown Function (DUF239)1.2e-20980.43Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        M  AH  +    R  ++ FC WGL+SL+ A RLS S  N EV KHL R NKP +K+IQSPDGDIIDCVHIS QPAFDHPFLKDHKIQ  P+Y PE LF E
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        +KVSEKPKE  NPITQLWH NG C E TIP+RRTK++DVLRASS KRYGKK+H ++P PRSADPDLINQSGHQHAIAYVEG KFYGAKATINVWEPK+Q 
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS F +PQYDISI IWKDP EG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFG+ YVLGYWPSFLFSYLADSAS++EWGGEVVN E DG HT+TQMGSG FP+EGF KASYFRNIQVVDSSNNLK PKG+ TFTEKSNCYDV+ G 
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNC
        N DWGHYFYYGGPGRN NC
Subjt:  NGDWGHYFYYGGPGRNQNC

AT5G56530.2 Protein of Unknown Function (DUF239)1.2e-20980.43Show/hide
Query:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE
        M  AH  +    R  ++ FC WGL+SL+ A RLS S  N EV KHL R NKP +K+IQSPDGDIIDCVHIS QPAFDHPFLKDHKIQ  P+Y PE LF E
Subjt:  MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDE

Query:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        +KVSEKPKE  NPITQLWH NG C E TIP+RRTK++DVLRASS KRYGKK+H ++P PRSADPDLINQSGHQHAIAYVEG KFYGAKATINVWEPK+Q 
Subjt:  NKVSEKPKERTNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS F +PQYDISI IWKDP EG
Subjt:  PNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS
        HWWMQFG+ YVLGYWPSFLFSYLADSAS++EWGGEVVN E DG HT+TQMGSG FP+EGF KASYFRNIQVVDSSNNLK PKG+ TFTEKSNCYDV+ G 
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNC
        N DWGHYFYYGGPGRN NC
Subjt:  NGDWGHYFYYGGPGRNQNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCCTGCTCACAGCGGCAGGGGAACGGCCGGGAGGGCTTTGGTACTGGTTTTCTGTTTATGGGGTCTCATCTCTCTGTCCCGCGCCGCTCGATTATCCCCTTCCAT
GCACAACTTGGAGGTTCAGAAGCACCTCAGGCGCTTCAACAAGCCCCCATTGAAGACAATCCAGAGTCCAGATGGGGATATCATAGACTGTGTCCACATTTCTAATCAAC
CAGCTTTTGATCATCCTTTCCTCAAAGATCACAAAATTCAGACGAGGCCTACTTACCACCCAGAAGGGCTATTTGACGAGAACAAGGTGTCTGAGAAACCTAAAGAAAGA
ACAAACCCCATCACTCAGCTGTGGCATGCAAATGGAAGGTGCCCAGAAAACACCATTCCTATTAGGAGAACTAAGGAAGATGATGTTCTGAGAGCAAGCTCTGCCAAAAG
ATATGGAAAGAAAAGGCACAGAACTCTTCCTCAACCGAGGTCTGCAGATCCTGATCTCATTAATCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAGGAGATAAGT
TCTATGGAGCAAAGGCAACTATCAACGTATGGGAACCCAAAATACAACAGCCTAATGAGTTTAGCTTGTCACAGTTATGGATACTAGGAGGTTCTTTTGGTCAAGATCTA
AATAGTATTGAAGCTGGCTGGCAGGTCAGCCCAGATCTATATGGCGACAACAACACTAGACTTTTCACTTACTGGACTAGTGATGCATATCAAGCCACAGGTTGTTATAA
CCTCCTCTGCTCAGGCTTTATTCAAGTTAACAGTGAAATAGCAATGGGGGCGAGCATCTCACCTGTGTCTGCATTCCGCAGTCCCCAGTATGATATCAGTATACTTATCT
GGAAGGATCCAAATGAGGGACACTGGTGGATGCAGTTTGGCAATGACTATGTGTTGGGATATTGGCCTTCTTTCTTATTCTCGTACCTGGCTGACAGTGCCTCCATGATT
GAGTGGGGAGGGGAGGTTGTGAATTCAGAGCCTGATGGCCTGCACACCTCAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAAGGGTTTGGGAAGGCAAGCTATTTCAG
GAACATTCAGGTTGTTGACAGTTCCAACAATCTCAAGGCTCCCAAAGGGATTGGTACTTTCACAGAGAAATCCAACTGCTATGATGTCCAAACTGGCAGCAATGGGGATT
GGGGCCATTACTTTTACTATGGAGGCCCTGGCAGAAACCAAAATTGCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCCTGCTCACAGCGGCAGGGGAACGGCCGGGAGGGCTTTGGTACTGGTTTTCTGTTTATGGGGTCTCATCTCTCTGTCCCGCGCCGCTCGATTATCCCCTTCCAT
GCACAACTTGGAGGTTCAGAAGCACCTCAGGCGCTTCAACAAGCCCCCATTGAAGACAATCCAGAGTCCAGATGGGGATATCATAGACTGTGTCCACATTTCTAATCAAC
CAGCTTTTGATCATCCTTTCCTCAAAGATCACAAAATTCAGACGAGGCCTACTTACCACCCAGAAGGGCTATTTGACGAGAACAAGGTGTCTGAGAAACCTAAAGAAAGA
ACAAACCCCATCACTCAGCTGTGGCATGCAAATGGAAGGTGCCCAGAAAACACCATTCCTATTAGGAGAACTAAGGAAGATGATGTTCTGAGAGCAAGCTCTGCCAAAAG
ATATGGAAAGAAAAGGCACAGAACTCTTCCTCAACCGAGGTCTGCAGATCCTGATCTCATTAATCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAGGAGATAAGT
TCTATGGAGCAAAGGCAACTATCAACGTATGGGAACCCAAAATACAACAGCCTAATGAGTTTAGCTTGTCACAGTTATGGATACTAGGAGGTTCTTTTGGTCAAGATCTA
AATAGTATTGAAGCTGGCTGGCAGGTCAGCCCAGATCTATATGGCGACAACAACACTAGACTTTTCACTTACTGGACTAGTGATGCATATCAAGCCACAGGTTGTTATAA
CCTCCTCTGCTCAGGCTTTATTCAAGTTAACAGTGAAATAGCAATGGGGGCGAGCATCTCACCTGTGTCTGCATTCCGCAGTCCCCAGTATGATATCAGTATACTTATCT
GGAAGGATCCAAATGAGGGACACTGGTGGATGCAGTTTGGCAATGACTATGTGTTGGGATATTGGCCTTCTTTCTTATTCTCGTACCTGGCTGACAGTGCCTCCATGATT
GAGTGGGGAGGGGAGGTTGTGAATTCAGAGCCTGATGGCCTGCACACCTCAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAAGGGTTTGGGAAGGCAAGCTATTTCAG
GAACATTCAGGTTGTTGACAGTTCCAACAATCTCAAGGCTCCCAAAGGGATTGGTACTTTCACAGAGAAATCCAACTGCTATGATGTCCAAACTGGCAGCAATGGGGATT
GGGGCCATTACTTTTACTATGGAGGCCCTGGCAGAAACCAAAATTGCCCATGA
Protein sequenceShow/hide protein sequence
MGPAHSGRGTAGRALVLVFCLWGLISLSRAARLSPSMHNLEVQKHLRRFNKPPLKTIQSPDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTYHPEGLFDENKVSEKPKER
TNPITQLWHANGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRTLPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGQDL
NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRSPQYDISILIWKDPNEGHWWMQFGNDYVLGYWPSFLFSYLADSASMI
EWGGEVVNSEPDGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEKSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP