; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007349 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007349
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr10:4013549..4016643
RNA-Seq ExpressionHG10007349
SyntenyHG10007349
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055720.1 uncharacterized protein E6C27_scaffold181G001030 [Cucumis melo var. makuwa]3.7e-24895.97Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRT+EALV IF VLG++SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPSYHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPNPINQLWH NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNC+
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

XP_004144071.1 uncharacterized protein LOC101217988 [Cucumis sativus]3.7e-24895.97Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRT+EALV +F VLGM+SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPS+HPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWHVNGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRN NCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

XP_008451013.1 PREDICTED: uncharacterized protein LOC103492421 [Cucumis melo]2.0e-24996.45Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEALV IF VLG++SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPSYHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPNPINQLWH NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNC+
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

XP_022147673.1 uncharacterized protein LOC111016541 [Momordica charantia]5.0e-24594.79Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEA+VA+ CVLG+ISLC ATR+E  SRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RP+YHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
         +KV EK  EKPNPINQLWH+NGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFP+EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

XP_038880223.1 uncharacterized protein LOC120071884 [Benincasa hispida]1.0e-25398.58Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPSYHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+G+VMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LZQ5 Uncharacterized protein1.8e-24895.97Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRT+EALV +F VLGM+SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPS+HPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWHVNGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRN NCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

A0A1S3BQI8 uncharacterized protein LOC1034924219.6e-25096.45Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEALV IF VLG++SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPSYHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPNPINQLWH NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNC+
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

A0A5A7UKQ7 Uncharacterized protein1.8e-24895.97Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRT+EALV IF VLG++SLCC TR+ESGSRQKLEVQKHLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPSYHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPNPINQLWH NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNC+
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

A0A6J1D1Y4 uncharacterized protein LOC1110165412.4e-24594.79Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEA+VA+ CVLG+ISLC ATR+E  SRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RP+YHPEGLFD
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
         +KV EK  EKPNPINQLWH+NGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFP+EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G+NGDWGHFFYYGGPGRNPNCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

A0A6J1HUM5 uncharacterized protein LOC1114669879.3e-24595.25Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        MGSAR SRCRTMEALVAIFCVLG++S+CCA RMES SRQKLEV+KHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRP++HPE L D
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKVAEKASEKPNPI QLWHVNGKCP+GTIPIRRTKHEDVLRASSVKRYGRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSP+LYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVD SNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNC
        G+NGDWGHFFYYGGPGRNPNC
Subjt:  GNNGDWGHFFYYGGPGRNPNC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)7.3e-20278.38Show/hide
Query:  GSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDE
        G   LS  +     +   C+ G  SL  A R    S+QK EV+KHL RLNKPAVK+I+S DGD+IDCV +S QPAFDHPFLKDHKIQ +P+YHPEGLFD+
Subjt:  GSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDE

Query:  SKV-AEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        +KV A K++EK   I QLWH  GKC +GTIP+RRTK +DVLRASSVKRYG+KK RS P+ P+SAEPDLINQSGHQHAIAYVEGDKYYGAKAT+NVWEP I
Subjt:  SKV-AEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQ NEFSLSQ+W+LGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYV+GYWPSFLFSYL +SASMIEWGGEVVNS+ +GQHTSTQMGSG FP+EGF KASYFRNIQVVDGSNNLK PKG+GTFTEQ +CYDVQT
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNC
        G+N DWGH+FYYGGPG+N  C
Subjt:  GNNGDWGHFFYYGGPGRNPNC

AT2G44210.1 Protein of Unknown Function (DUF239)3.6e-16463.53Show/hide
Query:  MEALVAIFCVLGMISLCCATRMESGSR--QKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDESKVAEKA-
        M   V+ F  L M  +  A  + SG      L+++ HL+RLNKPA+K+I+SPDGD+IDCV ++ QPAF HP L +H +Q  PS +PE +F ESKV+ K  
Subjt:  MEALVAIFCVLGMISLCCATRMESGSR--QKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDESKVAEKA-

Query:  SEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSL
        +++ N I+QLWHVNGKCP  TIPIRRT+ +D+ RASSV+ YG K  +S P P  S  P+++ Q+GHQHAI YVE   +YGAKA +NVW+P ++ PNEFSL
Subjt:  SEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSL

Query:  SQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQF
        +Q+W+LGG+F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGF+QIN +IAMG SISP+S Y NSQYDI+IL+WKDPKEGHWW+QF
Subjt:  SQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQF

Query:  GSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGNNGDWG
        G  Y++GYWP+ LFSYL++SASMIEWGGEVVNS+   GQHT+TQMGSG F +EG+GKASYF+N+QVVDGSN L+ P+ +  FT+Q +CY+V++GN G WG
Subjt:  GSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGNNGDWG

Query:  HFFYYGGPGRNPNC
         +FYYGGPGRNPNC
Subjt:  HFFYYGGPGRNPNC

AT3G13510.1 Protein of Unknown Function (DUF239)1.7e-20380.85Show/hide
Query:  CVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDESKVAEKASEKPNPINQLW
        C+  M+SL CA      SRQK EV+KHL RLNKP VKTI+SPDGD+IDC+ +S QPAFDHPFLKDHKIQ RPSYHPEGLFD++KV+ +   K   I QLW
Subjt:  CVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDESKVAEKASEKPNPINQLW

Query:  HVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFG
        H  GKC +GTIP+RRT+ +DVLRASSVKRYG+KKHRS PI P+SAEPDLINQ+GHQHAIAYVEGDKYYGAKAT+NVWEP IQ  NEFSLSQ+W+LGGSFG
Subjt:  HVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFG

Query:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQFGSGYVMGYWPS
        +DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPKEGHWWMQFG+GYV+GYWPS
Subjt:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQFGSGYVMGYWPS

Query:  FLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGNNGDWGHFFYYGGPGRNP
        FLFSYL +SASMIEWGGEVVNS+  G HT TQMGSGHFP+EGF KASYFRNIQVVDGSNNLK PKG+GTFTE+ +CYDVQTG+N DWGH+FYYGGPG+N 
Subjt:  FLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGNNGDWGHFFYYGGPGRNP

Query:  NC
        NC
Subjt:  NC

AT5G56530.1 Protein of Unknown Function (DUF239)1.1e-20277.96Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        M +A  S+ R     +  FC  G++SL CA R+ S SRQ  EV KHL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQ  PSY PE LF 
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E  NPI QLWH NG C +GTIP+RRTK EDVLRASSVKRYG+KKH S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        Q  NEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS + N QYDISI +WKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKG+ TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G N DWGH+FYYGGPGRNPNCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ

AT5G56530.2 Protein of Unknown Function (DUF239)1.1e-20277.96Show/hide
Query:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD
        M +A  S+ R     +  FC  G++SL CA R+ S SRQ  EV KHL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQ  PSY PE LF 
Subjt:  MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFD

Query:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E  NPI QLWH NG C +GTIP+RRTK EDVLRASSVKRYG+KKH S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        Q  NEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS + N QYDISI +WKDPK
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKG+ TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGSGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GNNGDWGHFFYYGGPGRNPNCQ
        G N DWGH+FYYGGPGRNPNCQ
Subjt:  GNNGDWGHFFYYGGPGRNPNCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTGCTCGACTTAGCAGATGCAGGACCATGGAAGCTCTCGTCGCCATTTTTTGTGTTTTGGGGATGATTTCTTTGTGCTGTGCGACGAGGATGGAATCAGGCTC
CCGCCAGAAGCTTGAGGTGCAAAAACATCTCAGGCGTTTGAACAAACCTGCTGTTAAAACCATTGAGAGTCCAGATGGGGACCTAATTGACTGTGTTCACATGTCTCATC
AACCTGCATTTGACCATCCTTTCCTCAAGGACCACAAAATCCAGACGAGGCCAAGTTATCATCCAGAAGGGCTATTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAA
AAACCAAATCCAATAAACCAACTGTGGCATGTTAATGGAAAGTGTCCTGATGGAACTATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAA
AAGATATGGAAGAAAGAAGCACAGATCAACCCCAATACCCCCAAGGTCTGCAGAGCCTGATCTCATCAACCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAGGAG
ATAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCCAGCATACAACAGCCTAATGAGTTTAGCTTATCACAGCTTTGGATATTGGGAGGTTCTTTTGGTGAA
GATCTCAATAGCATTGAAGCTGGTTGGCAGGTTAGTCCTGATCTCTACGGCGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTG
TTACAACCTCCTCTGCTCAGGCTTTATTCAAATCAACAGCGATATAGCAATGGGAGCGAGCATTTCTCCAGTCTCGGCATACCGAAATTCACAATACGATATTAGTATAC
TTGTCTGGAAGGATCCAAAAGAAGGGCATTGGTGGATGCAATTTGGAAGTGGTTATGTGATGGGGTATTGGCCTTCATTTCTATTCTCATATTTAGCAGACAGTGCTTCC
ATGATTGAATGGGGAGGTGAAGTAGTAAACTCAGAGCCAAATGGGCAACACACTTCAACTCAAATGGGGAGTGGCCATTTCCCAGATGAAGGATTTGGTAAAGCAAGCTA
TTTTAGAAACATTCAAGTTGTTGATGGATCCAACAATCTCAAACCACCAAAAGGCATTGGCACATTCACAGAACAGCCTGATTGCTATGATGTCCAAACAGGCAACAATG
GGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAATCCTAATTGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTGCTCGACTTAGCAGATGCAGGACCATGGAAGCTCTCGTCGCCATTTTTTGTGTTTTGGGGATGATTTCTTTGTGCTGTGCGACGAGGATGGAATCAGGCTC
CCGCCAGAAGCTTGAGGTGCAAAAACATCTCAGGCGTTTGAACAAACCTGCTGTTAAAACCATTGAGAGTCCAGATGGGGACCTAATTGACTGTGTTCACATGTCTCATC
AACCTGCATTTGACCATCCTTTCCTCAAGGACCACAAAATCCAGACGAGGCCAAGTTATCATCCAGAAGGGCTATTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAA
AAACCAAATCCAATAAACCAACTGTGGCATGTTAATGGAAAGTGTCCTGATGGAACTATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAA
AAGATATGGAAGAAAGAAGCACAGATCAACCCCAATACCCCCAAGGTCTGCAGAGCCTGATCTCATCAACCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAGGAG
ATAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCCAGCATACAACAGCCTAATGAGTTTAGCTTATCACAGCTTTGGATATTGGGAGGTTCTTTTGGTGAA
GATCTCAATAGCATTGAAGCTGGTTGGCAGGTTAGTCCTGATCTCTACGGCGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTG
TTACAACCTCCTCTGCTCAGGCTTTATTCAAATCAACAGCGATATAGCAATGGGAGCGAGCATTTCTCCAGTCTCGGCATACCGAAATTCACAATACGATATTAGTATAC
TTGTCTGGAAGGATCCAAAAGAAGGGCATTGGTGGATGCAATTTGGAAGTGGTTATGTGATGGGGTATTGGCCTTCATTTCTATTCTCATATTTAGCAGACAGTGCTTCC
ATGATTGAATGGGGAGGTGAAGTAGTAAACTCAGAGCCAAATGGGCAACACACTTCAACTCAAATGGGGAGTGGCCATTTCCCAGATGAAGGATTTGGTAAAGCAAGCTA
TTTTAGAAACATTCAAGTTGTTGATGGATCCAACAATCTCAAACCACCAAAAGGCATTGGCACATTCACAGAACAGCCTGATTGCTATGATGTCCAAACAGGCAACAATG
GGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAATCCTAATTGCCAATGA
Protein sequenceShow/hide protein sequence
MGSARLSRCRTMEALVAIFCVLGMISLCCATRMESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPSYHPEGLFDESKVAEKASE
KPNPINQLWHVNGKCPDGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGE
DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQFGSGYVMGYWPSFLFSYLADSAS
MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGNNGDWGHFFYYGGPGRNPNCQ