; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001443 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001443
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold36:4173513..4175692
RNA-Seq ExpressionMS001443
SyntenyMS001443
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598306.1 hypothetical protein SDJN03_08084, partial [Cucurbita argyrosperma subsp. sororia]2.1e-20680.99Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         MLV+TL VIVCG VEGGSLSKQKSL    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPALRNHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRI+K VLLKA+S+ERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYSTK+G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

KAG7029277.1 hypothetical protein SDJN02_07615, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-20781.22Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         MLV+TL VIVCG VEGGSLSKQKSL    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPALRNHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRIRK VLLKA+S+ERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYSTK+G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

XP_022997646.1 uncharacterized protein LOC111492515 isoform X1 [Cucurbita maxima]1.2e-20681.46Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         MLV+TLTVIVCG VEGGSLSKQKSL    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPAL NHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRIRK VLLKA+SVERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYSTK+G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

XP_023546232.1 uncharacterized protein LOC111805388 isoform X2 [Cucurbita pepo subsp. pepo]1.9e-20781.46Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         MLV+TLTVIVCG VEGGSLSKQKSL    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPALRNHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRIRK VLLKA+S+ERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYSTK+G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

XP_038886336.1 uncharacterized protein LOC120076548 [Benincasa hispida]8.7e-21382.78Show/hide
Query:  KRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKA
        KRFSC+ L  FKMLV+TLTVIVCG+VEGGSL+KQKSL V KK+NSLRKQA KSIQS+DGDIIDC+ +Y+QPAFDHPALRNHTIQMAPTYDPTM+EHSKKA
Subjt:  KRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKA

Query:  TEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVCNP
        T +REGME KDSM VKQTWRKSGSCPKGTIP+RRI+K +L KADS+E YGRKRP  S +IAQLSN+++SH LL N SKA L   G NYN AKGDIKVCNP
Subjt:  TEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVCNP

Query:  KVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDL
        KVE DDEYSTSQVALL G YY++E VESGWAVNP VYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST  GLPYEIT+FLFRDL
Subjt:  KVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDL

Query:  QTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDV
        +TNNWWVQYGESI+IGYWP+ELF+AL +TAETVQWGGEVYSTKLGGPPHT TGMGNG+FPD++ G+SGWVKR+R+RDNSMVLKFP WVEHYSDEYDCYD+
Subjt:  QTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDV

Query:  DFVRDYLEDPELYYGGPGKNPKCP
        DF+RDYL+DPELYYGGPGKNPKCP
Subjt:  DFVRDYLEDPELYYGGPGKNPKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ6 Uncharacterized protein2.5e-19778.4Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGK   C+GL  FKMLV  LTVIVCGVVE GSLSK K+    KK++SLRKQA KSIQSEDGDIIDCV +Y+QPAFDHPALRNHTIQMAPTYDPTM++HSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT + EGM EK SM VKQ WRKSGSCPK TIP+RRIRK V LKA+SV  YG+KRPT   +IAQLSN+++SH LL N SKA L   G N+N AKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP VE DDEYSTSQVALL G YY++E +ESGWAVNP VYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST   LP+EIT+FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +TNNWWVQYGESINIGYWP+ELF AL +TAETVQWGGEVYSTKLGGPPHT TGMGNG+FPD++ G+SGWVKR+R+RDNSM+LKFP +VEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+R+YL+DPELYYGGPGKN +CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

A0A5A7V3A1 Uncharacterized protein9.8e-19477.46Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGK     GL  FKMLV+ LTVIVCGVVE GSLSK +S    KK+NSLRKQA KSIQSEDGDIIDCV +Y+QPAFDHPALRNHTIQMAPTYDPTM++HSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT ++EGM EK+SM VKQ WR SGSCPK TIP+RRIRK     A+SV  YG+KRPT   +IAQLSN+++SH LL N SKA L   G N+N AKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP VE DDEYSTSQVALL G YY++E +ESGWAVNP VYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST   LP+EIT+FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +TNNWWVQYGESINIGYWP+ELF AL +TAETVQWGGEVYSTKLGGPPHT TGMGNG+FPD++ G+SGWVKR+R+RDNSM+LKFP +VEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+R+YL+DPELYYGGPGKN +CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

A0A6J1BST2 uncharacterized protein LOC1110050246.5e-206100Show/hide
Query:  MAPTYDPTMEEHSKKATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTE
        MAPTYDPTMEEHSKKATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTE
Subjt:  MAPTYDPTMEEHSKKATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTE

Query:  GYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST
        GYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST
Subjt:  GYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST

Query:  PNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKF
        PNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKF
Subjt:  PNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKF

Query:  PGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP
        PGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP
Subjt:  PGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP

A0A6J1HDC5 uncharacterized protein LOC111462577 isoform X11.3e-20680.75Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         ML++TLTVIVCG VEGGSLSKQKS+    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPALRNHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRIRK VLLKA S+ERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYST +G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

A0A6J1K820 uncharacterized protein LOC111492515 isoform X15.9e-20781.46Show/hide
Query:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK
        MGKRF         MLV+TLTVIVCG VEGGSLSKQKSL    ++NSLRKQAIKSI+SEDGDIIDCV +Y+QPAFDHPAL NHTIQ+APTYDPT++EHSK
Subjt:  MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSK

Query:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC
        KAT +REG EEK+SM VKQTWRKSGSCP+GTIP+RRIRK VLLKA+SVERYGRK+P +S + AQL   ++SH LL NRSKAFL T G NYNAAKGDIKVC
Subjt:  KATEKREGMEEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVC

Query:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR
        NP+VE DDEYSTSQVALL G YY+FE +ESGWAVNP VYGDRQTRLFVYWTADAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEI +FLFR
Subjt:  NPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFR

Query:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY
        D +T+NWWVQYGESINIGYWP ELF ALSHTAETVQWGGEVYSTK+G PPHT T MG+GRFPDF+ G SGWVKRMR+RDNSMVL FPGWVEHYSDEYDCY
Subjt:  DLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCY

Query:  DVDFVRDYLEDPELYYGGPGKNPKCP
        DVDF+RDYL+DPELYYGGPGKNP+CP
Subjt:  DVDFVRDYLEDPELYYGGPGKNPKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)7.2e-8840.45Show/hide
Query:  LSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSMAVKQTWRKSGSCPKGTI
        +SKQK  EV K +N L K A+KSIQS DGD+IDCV + +QPAFDHP L++H IQM P Y P       K +  +   +E     + Q W + G C +GTI
Subjt:  LSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSMAVKQTWRKSGSCPKGTI

Query:  PLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFECV
        P+RR ++D +L+A SV+RYG+K      K   +   +++   L+N+S    A  + EG  Y  AK  I V  PK++  +E+S SQ+ LL G++      +
Subjt:  PLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFECV

Query:  ESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHAL
        E+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q +++IA+G++I P+S      Y+I++ +++D +  +WW+Q+G    +GYWP+ LF  L
Subjt:  ESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHAL

Query:  SHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP
        + +A  ++WGGEV +++  G  HT+T MG+G+FP+  F  + + + +++ D S  LK P  +  ++++ +CYDV    +       YYGGPGKN KCP
Subjt:  SHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP

AT2G44210.1 Protein of Unknown Function (DUF239)1.2e-8739.57Show/hide
Query:  FFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEE
        FF  LV+T+ ++   VV G   +    L++   +  L K A+KSI+S DGD+IDCV + +QPAF HP L NHT+QM P+ +P       K + K +  + 
Subjt:  FFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEE

Query:  KDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVN-RSKAFLHTEGYNYNAAKGDIKVCNPKVESDDEY
          S A+ Q W  +G CPK TIP+RR R+  L +A SVE YG K       I +  +++  + L  N    A ++ E   +  AK  I V  P VE  +E+
Subjt:  KDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVN-RSKAFLHTEGYNYNAAKGDIKVCNPKVESDDEY

Query:  STSQVALLNGAY-YHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWV
        S +Q+ +L G +      +E+GW V+P +YGD +TRLF YWT+DA   TGC++L C GFVQ + EIA+G +I P+S      Y+IT+ +++D +  +WW+
Subjt:  STSQVALLNGAY-YHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWV

Query:  QYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYL
        Q+GE   IGYWPA LF  LS +A  ++WGGEV +++     HTTT MG+GRF +  +G + + K +++ D S  L+ P  ++ ++D+ +CY+V       
Subjt:  QYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYL

Query:  EDPELYYGGPGKNPKCP
             YYGGPG+NP CP
Subjt:  EDPELYYGGPGKNPKCP

AT5G25950.1 Protein of Unknown Function (DUF239)1.5e-9341.76Show/hide
Query:  LRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGM
        +RFF      + VI+CG         + SL++  K+ +L K A+K+I+SEDGDIIDC+ +Y+Q AFDHPAL+NH IQM     P+++  +KK T    G 
Subjt:  LRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGM

Query:  EEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSN------NQTSHALLVN------RSKAFLHTEGYNYNAAKGDI
         E       Q W KSG CP GTIP+RR+ ++ + +A S   +GRK P   HK + L N      N       +N      RS+AF+   G+N+  A+ DI
Subjt:  EEKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSN------NQTSHALLVN------RSKAFLHTEGYNYNAAKGDI

Query:  KVCNPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLF
         + NP      +YST+Q+ L+ G   +FE VE GW VNP+V+GD +TRLF+ WT D   KTGC +L C GFVQTS + ALG+ + P+S+ +   Y IT+ 
Subjt:  KVCNPKVESDDEYSTSQVALLNGAYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLF

Query:  LFRDLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLG-GPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDE
        +F D  + NWW+    ++ +GYWP  LF+ L H+A  VQWGGEV+S  +    PHTTT MG+G++  +++  + +   +RI+D SM LK+P ++  Y+DE
Subjt:  LFRDLQTNNWWVQYGESINIGYWPAELFHALSHTAETVQWGGEVYSTKLG-GPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDE

Query:  YDCYDVDFVR-DYLEDPELYYGGPGKNPKCP
        Y+CY     R  Y+ +P  Y+GGPG+N +CP
Subjt:  YDCYDVDFVR-DYLEDPELYYGGPGKNPKCP

AT5G56530.1 Protein of Unknown Function (DUF239)2.5e-8841.6Show/hide
Query:  SLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSM-AVKQTWRKSGSCPKG
        S+S+Q + EV K +N L K A+KSIQS DGDIIDCV + +QPAFDHP L++H IQM P+Y P       K +EK      K+S+  + Q W ++G C +G
Subjt:  SLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSM-AVKQTWRKSGSCPKG

Query:  TIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFE
        TIP+RR +K+ +L+A SV+RYG+K+    H    L   +++   L+N+S    A  + EG  +  AK  I V  PKV+S +E+S SQ+ +L G++     
Subjt:  TIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFE

Query:  CVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFH
         +E+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q +++IA+G++I P+S  +   Y+I++ +++D +  +WW+Q+G+   +GYWP+ LF 
Subjt:  CVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFH

Query:  ALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKC
         L+ +A  V+WGGEV + +  G  HTTT MG+G+FPD  F  + + + +++ D+S  LK P  +  ++++ +CYDV+  ++       YYGGPG+NP C
Subjt:  ALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKC

AT5G56530.2 Protein of Unknown Function (DUF239)2.5e-8841.6Show/hide
Query:  SLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSM-AVKQTWRKSGSCPKG
        S+S+Q + EV K +N L K A+KSIQS DGDIIDCV + +QPAFDHP L++H IQM P+Y P       K +EK      K+S+  + Q W ++G C +G
Subjt:  SLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGMEEKDSM-AVKQTWRKSGSCPKG

Query:  TIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFE
        TIP+RR +K+ +L+A SV+RYG+K+    H    L   +++   L+N+S    A  + EG  +  AK  I V  PKV+S +E+S SQ+ +L G++     
Subjt:  TIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRS---KAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNGAY-YHFE

Query:  CVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFH
         +E+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q +++IA+G++I P+S  +   Y+I++ +++D +  +WW+Q+G+   +GYWP+ LF 
Subjt:  CVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFH

Query:  ALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKC
         L+ +A  V+WGGEV + +  G  HTTT MG+G+FPD  F  + + + +++ D+S  LK P  +  ++++ +CYDV+  ++       YYGGPG+NP C
Subjt:  ALSHTAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAGATTCTCTTGCAGTGGTTTGAGGTTCTTCAAAATGTTGGTACTGACATTGACAGTCATAGTTTGTGGAGTTGTGGAGGGTGGTTCACTTTCTAAACAGAA
GAGTTTGGAAGTTGTGAAGAAAATTAACTCTCTTAGAAAGCAAGCAATAAAAAGCATTCAGAGCGAAGACGGCGACATCATAGATTGCGTTAGAGTTTACGAGCAGCCTG
CTTTTGATCATCCTGCTCTAAGAAATCATACCATTCAGATGGCACCTACTTATGATCCCACCATGGAAGAGCATTCAAAGAAAGCAACAGAAAAAAGGGAAGGGATGGAG
GAAAAGGACTCCATGGCTGTGAAACAAACATGGAGGAAAAGCGGCAGTTGTCCCAAAGGAACAATACCGCTTCGAAGGATCCGAAAAGATGTCCTGCTCAAAGCTGACTC
GGTAGAACGCTATGGAAGAAAGAGACCGACGTATTCGCATAAAATCGCTCAGCTTTCCAACAACCAAACCTCGCATGCTCTGCTAGTGAATCGTTCGAAGGCATTTCTGC
ATACTGAAGGATACAACTACAATGCAGCCAAAGGAGACATTAAAGTCTGTAACCCCAAGGTCGAGAGCGATGACGAGTATAGTACTTCCCAAGTGGCTCTACTAAATGGT
GCTTACTATCATTTCGAGTGCGTCGAATCCGGATGGGCAGTAAATCCAAGTGTTTACGGAGATCGACAGACTCGACTGTTCGTGTATTGGACGGCTGATGCTTCCCATAA
AACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAAACCAGCAACGAAATTGCTCTTGGTTCTGCTATATATCCCATCTCAACTCCAAATGGGCTTCCATATGAAA
TAACTCTGTTCCTTTTCAGAGATTTACAGACAAATAATTGGTGGGTGCAATATGGGGAAAGCATTAACATAGGTTATTGGCCCGCTGAGTTATTCCATGCTCTAAGTCAT
ACTGCAGAGACTGTCCAGTGGGGAGGGGAAGTTTACAGCACAAAGTTAGGAGGGCCTCCCCACACTACAACAGGTATGGGCAATGGAAGGTTTCCTGACTTTGTTTTCGG
CAATTCGGGTTGGGTAAAACGGATGCGAATTCGGGACAACTCGATGGTGTTGAAGTTTCCTGGATGGGTTGAGCATTACTCTGATGAATATGATTGTTATGACGTCGATT
TTGTCCGAGATTATTTAGAAGATCCTGAACTATACTATGGGGGGCCTGGTAAAAATCCAAAGTGTCCC
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAGATTCTCTTGCAGTGGTTTGAGGTTCTTCAAAATGTTGGTACTGACATTGACAGTCATAGTTTGTGGAGTTGTGGAGGGTGGTTCACTTTCTAAACAGAA
GAGTTTGGAAGTTGTGAAGAAAATTAACTCTCTTAGAAAGCAAGCAATAAAAAGCATTCAGAGCGAAGACGGCGACATCATAGATTGCGTTAGAGTTTACGAGCAGCCTG
CTTTTGATCATCCTGCTCTAAGAAATCATACCATTCAGATGGCACCTACTTATGATCCCACCATGGAAGAGCATTCAAAGAAAGCAACAGAAAAAAGGGAAGGGATGGAG
GAAAAGGACTCCATGGCTGTGAAACAAACATGGAGGAAAAGCGGCAGTTGTCCCAAAGGAACAATACCGCTTCGAAGGATCCGAAAAGATGTCCTGCTCAAAGCTGACTC
GGTAGAACGCTATGGAAGAAAGAGACCGACGTATTCGCATAAAATCGCTCAGCTTTCCAACAACCAAACCTCGCATGCTCTGCTAGTGAATCGTTCGAAGGCATTTCTGC
ATACTGAAGGATACAACTACAATGCAGCCAAAGGAGACATTAAAGTCTGTAACCCCAAGGTCGAGAGCGATGACGAGTATAGTACTTCCCAAGTGGCTCTACTAAATGGT
GCTTACTATCATTTCGAGTGCGTCGAATCCGGATGGGCAGTAAATCCAAGTGTTTACGGAGATCGACAGACTCGACTGTTCGTGTATTGGACGGCTGATGCTTCCCATAA
AACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAAACCAGCAACGAAATTGCTCTTGGTTCTGCTATATATCCCATCTCAACTCCAAATGGGCTTCCATATGAAA
TAACTCTGTTCCTTTTCAGAGATTTACAGACAAATAATTGGTGGGTGCAATATGGGGAAAGCATTAACATAGGTTATTGGCCCGCTGAGTTATTCCATGCTCTAAGTCAT
ACTGCAGAGACTGTCCAGTGGGGAGGGGAAGTTTACAGCACAAAGTTAGGAGGGCCTCCCCACACTACAACAGGTATGGGCAATGGAAGGTTTCCTGACTTTGTTTTCGG
CAATTCGGGTTGGGTAAAACGGATGCGAATTCGGGACAACTCGATGGTGTTGAAGTTTCCTGGATGGGTTGAGCATTACTCTGATGAATATGATTGTTATGACGTCGATT
TTGTCCGAGATTATTTAGAAGATCCTGAACTATACTATGGGGGGCCTGGTAAAAATCCAAAGTGTCCC
Protein sequenceShow/hide protein sequence
MGKRFSCSGLRFFKMLVLTLTVIVCGVVEGGSLSKQKSLEVVKKINSLRKQAIKSIQSEDGDIIDCVRVYEQPAFDHPALRNHTIQMAPTYDPTMEEHSKKATEKREGME
EKDSMAVKQTWRKSGSCPKGTIPLRRIRKDVLLKADSVERYGRKRPTYSHKIAQLSNNQTSHALLVNRSKAFLHTEGYNYNAAKGDIKVCNPKVESDDEYSTSQVALLNG
AYYHFECVESGWAVNPSVYGDRQTRLFVYWTADASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEITLFLFRDLQTNNWWVQYGESINIGYWPAELFHALSH
TAETVQWGGEVYSTKLGGPPHTTTGMGNGRFPDFVFGNSGWVKRMRIRDNSMVLKFPGWVEHYSDEYDCYDVDFVRDYLEDPELYYGGPGKNPKCP