; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022818 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022818
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold2:10457289..10460977
RNA-Seq ExpressionSpg022818
SyntenySpg022818
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055720.1 uncharacterized protein E6C27_scaffold181G001030 [Cucumis melo var. makuwa]9.8e-24996.21Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRT+EALV IF VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWH NGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

XP_004144071.1 uncharacterized protein LOC101217988 [Cucumis sativus]7.5e-24996.21Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRT+EALV +F VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRN NCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

XP_008451013.1 PREDICTED: uncharacterized protein LOC103492421 [Cucumis melo]5.2e-25096.68Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEALV IF VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWH NGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

XP_022147673.1 uncharacterized protein LOC111016541 [Momordica charantia]1.6e-24695.5Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEA+VA+ CVLGLISLC ATRLE  SRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
         +KV EK  EKP PINQLWH+NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFP+EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

XP_038880223.1 uncharacterized protein LOC120071884 [Benincasa hispida]1.9e-25297.63Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEALVAIFCVLG+ISLCCATR+ESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWHVNGKCP+GTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNG+VMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LZQ5 Uncharacterized protein3.6e-24996.21Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRT+EALV +F VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRN NCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

A0A1S3BQI8 uncharacterized protein LOC1034924212.5e-25096.68Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEALV IF VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWH NGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

A0A5A7UKQ7 Uncharacterized protein4.7e-24996.21Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRT+EALV IF VLG++SLCC TRLESGSRQKLEVQKHLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP PINQLWH NGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

A0A6J1D1Y4 uncharacterized protein LOC1110165417.6e-24795.5Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEA+VA+ CVLGLISLC ATRLE  SRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
         +KV EK  EKP PINQLWH+NGKCP+GTIPIRRTKHEDVLRASSVKRYGRKK RS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFP+EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        GSNGDWGHFFYYGGPGRNPNCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

A0A6J1HUM5 uncharacterized protein LOC1114669878.4e-24696.2Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MGSARFSRCRTMEALVAIFCVLGL+S+CCA R+ES SRQKLEV+KHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ RPT+HPE L D
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKVAEKASEKP PI QLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKK RSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSP+LYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVD SNNLKPPKGIGTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSNGDWGHFFYYGGPGRNPNC
Subjt:  GSNGDWGHFFYYGGPGRNPNC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.0e-20379.33Show/hide
Query:  GSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFDE
        G    S  +     +   C+ G  SL  A R    S+QK EV+KHL RLNKPAVK+I+S DGD+IDCV +S QPAFDHPFLKDHKIQM+P YHPEGLFD+
Subjt:  GSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFDE

Query:  SKV-AEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        +KV A K++EK   I QLWH  GKC EGTIP+RRTK +DVLRASSVKRYG+KK RS P+ P+SAEPDLINQSGHQHAIAYVEGDKYYGAKAT+NVWEP I
Subjt:  SKV-AEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        QQ NEFSLSQIW+LGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+ +GQHTSTQMGSG FP+EGF KASYFRNIQVVDGSNNLK PKG+GTFTEQ +CYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSN DWGH+FYYGGPG+N  C
Subjt:  GSNGDWGHFFYYGGPGRNPNC

AT2G44210.1 Protein of Unknown Function (DUF239)1.0e-16363.04Show/hide
Query:  MEALVAIFCVLGLISLCCATRLESGSR--QKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFDESKVAEKA-
        M   V+ F  L +  +  A  + SG      L+++ HL+RLNKPA+K+I+SPDGD+IDCV ++ QPAF HP L +H +QM P+ +PE +F ESKV+ K  
Subjt:  MEALVAIFCVLGLISLCCATRLESGSR--QKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFDESKVAEKA-

Query:  SEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSL
        +++   I+QLWHVNGKCP+ TIPIRRT+ +D+ RASSV+ YG K  +S P P  S  P+++ Q+GHQHAI YVE   +YGAKA +NVW+P ++ PNEFSL
Subjt:  SEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSL

Query:  SQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQF
        +QIW+LGG+F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGF+QIN +IAMG SISP+S Y NSQYDI+IL+WKDPKEGHWW+QF
Subjt:  SQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQF

Query:  GNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWG
        G  Y++GYWP+ LFSYL++SASMIEWGGEVVNS+   GQHT+TQMGSG F +EG+GKASYF+N+QVVDGSN L+ P+ +  FT+Q +CY+V++G+ G WG
Subjt:  GNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWG

Query:  HFFYYGGPGRNPNC
         +FYYGGPGRNPNC
Subjt:  HFFYYGGPGRNPNC

AT3G13510.1 Protein of Unknown Function (DUF239)7.1e-20578.86Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        MG+  FS  +         C+  ++SL CA      SRQK EV+KHL RLNKP VKTI+SPDGD+IDC+ +S QPAFDHPFLKDHKIQMRP+YHPEGLFD
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ++KV+ +   K T I QLWH  GKC EGTIP+RRT+ +DVLRASSVKRYG+KK RS PI P+SAEPDLINQ+GHQHAIAYVEGDKYYGAKAT+NVWEP I
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        Q  NEFSLSQIW+LGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+  G HT TQMGSGHFP+EGF KASYFRNIQVVDGSNNLK PKG+GTFTE+ +CYDVQT
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSN DWGH+FYYGGPG+N NC
Subjt:  GSNGDWGHFFYYGGPGRNPNC

AT5G56530.1 Protein of Unknown Function (DUF239)3.9e-20378.2Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        M +A FS+ R     +  FC  GL+SL CA RL S SRQ  EV KHL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQM P+Y PE LF 
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E   PI QLWH NG C EGTIP+RRTK EDVLRASSVKRYG+KK  S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        Q  NEFSLSQ+WILGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS + N QYDISI +WKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKG+ TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        G N DWGH+FYYGGPGRNPNCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ

AT5G56530.2 Protein of Unknown Function (DUF239)3.9e-20378.2Show/hide
Query:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD
        M +A FS+ R     +  FC  GL+SL CA RL S SRQ  EV KHL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQM P+Y PE LF 
Subjt:  MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFD

Query:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E   PI QLWH NG C EGTIP+RRTK EDVLRASSVKRYG+KK  S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
        Q  NEFSLSQ+WILGGSFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS + N QYDISI +WKDPK
Subjt:  QQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK

Query:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT
        EGHWWMQFG+GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKG+ TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCQ
        G N DWGH+FYYGGPGRNPNCQ
Subjt:  GSNGDWGHFFYYGGPGRNPNCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTGCTCGATTCAGCAGATGCAGGACCATGGAAGCTCTCGTCGCCATTTTCTGCGTTTTGGGGCTGATTTCTCTGTGCTGTGCGACGAGGTTGGAATCAGGCTC
CCGCCAGAAGCTTGAGGTCCAGAAGCACCTCAGGCGCTTGAACAAGCCTGCTGTTAAAACCATTGAGAGCCCAGATGGAGACCTAATTGACTGTGTTCACATGTCTCATC
AACCTGCATTTGACCATCCTTTCCTCAAAGACCACAAAATCCAGATGAGGCCAACTTACCATCCAGAAGGGCTGTTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAA
AAACCAACTCCAATCAACCAACTGTGGCATGTCAATGGAAAGTGCCCTGAAGGCACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAA
AAGATATGGAAGAAAAAAGCTCAGATCAGCCCCAATACCCCCCAGGTCTGCAGAGCCTGATCTCATCAACCAAAGTGGTCATCAACATGCAATAGCTTATGTGGAAGGAG
ACAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCTAGTATACAACAGCCTAATGAGTTTAGCTTATCACAGATTTGGATATTGGGAGGCTCTTTTGGTGAA
GATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGTCCCGATCTCTATGGCGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTG
TTACAACCTCCTCTGCTCAGGCTTTATTCAAATCAACAGCGATATCGCAATGGGAGCGAGTATCTCTCCGGTCTCGGCATACCGAAATTCGCAATACGATATTAGTATAC
TTGTCTGGAAGGATCCAAAAGAGGGGCATTGGTGGATGCAATTTGGCAATGGCTATGTGATGGGATATTGGCCTTCATTCCTGTTCTCATACTTAGCAGACAGTGCCTCC
ATGATTGAGTGGGGAGGTGAAGTTGTGAACTCAGAGCCAAATGGGCAACACACTTCAACTCAAATGGGAAGTGGGCATTTCCCAGATGAAGGATTTGGGAAAGCAAGCTA
TTTTAGAAACATTCAAGTTGTTGATGGATCCAACAATCTCAAGCCCCCAAAAGGCATTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGCAGCAATG
GGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCTAATTGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTGCTCGATTCAGCAGATGCAGGACCATGGAAGCTCTCGTCGCCATTTTCTGCGTTTTGGGGCTGATTTCTCTGTGCTGTGCGACGAGGTTGGAATCAGGCTC
CCGCCAGAAGCTTGAGGTCCAGAAGCACCTCAGGCGCTTGAACAAGCCTGCTGTTAAAACCATTGAGAGCCCAGATGGAGACCTAATTGACTGTGTTCACATGTCTCATC
AACCTGCATTTGACCATCCTTTCCTCAAAGACCACAAAATCCAGATGAGGCCAACTTACCATCCAGAAGGGCTGTTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAA
AAACCAACTCCAATCAACCAACTGTGGCATGTCAATGGAAAGTGCCCTGAAGGCACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAA
AAGATATGGAAGAAAAAAGCTCAGATCAGCCCCAATACCCCCCAGGTCTGCAGAGCCTGATCTCATCAACCAAAGTGGTCATCAACATGCAATAGCTTATGTGGAAGGAG
ACAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCTAGTATACAACAGCCTAATGAGTTTAGCTTATCACAGATTTGGATATTGGGAGGCTCTTTTGGTGAA
GATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGTCCCGATCTCTATGGCGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTG
TTACAACCTCCTCTGCTCAGGCTTTATTCAAATCAACAGCGATATCGCAATGGGAGCGAGTATCTCTCCGGTCTCGGCATACCGAAATTCGCAATACGATATTAGTATAC
TTGTCTGGAAGGATCCAAAAGAGGGGCATTGGTGGATGCAATTTGGCAATGGCTATGTGATGGGATATTGGCCTTCATTCCTGTTCTCATACTTAGCAGACAGTGCCTCC
ATGATTGAGTGGGGAGGTGAAGTTGTGAACTCAGAGCCAAATGGGCAACACACTTCAACTCAAATGGGAAGTGGGCATTTCCCAGATGAAGGATTTGGGAAAGCAAGCTA
TTTTAGAAACATTCAAGTTGTTGATGGATCCAACAATCTCAAGCCCCCAAAAGGCATTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGCAGCAATG
GGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCTAATTGCCAATGA
Protein sequenceShow/hide protein sequence
MGSARFSRCRTMEALVAIFCVLGLISLCCATRLESGSRQKLEVQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPTYHPEGLFDESKVAEKASE
KPTPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKLRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGE
DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSAS
MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNCQ