CuGenDBv2

Sed0004503 (gene) of Chayote v1 genome

Gene ID: Sed0004503
Organism: Sechium edule (Chayote v1)
Description: Protein of Unknown Function (DUF239)
Genome location: LG08:9211383..9215647
RNA-Seq Expression: Sed0004503
Synteny: Sed0004503
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin
                  IPR025521 - Neprosin activation peptide
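
The genome location value above follows a `sequence:start..end` layout. As a minimal sketch, it can be split into its parts and turned into a span length as below. The names `Locus` and `parse_locus` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as 'LG08:9211383..9215647'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised locus string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


if __name__ == "__main__":
    locus = parse_locus("LG08:9211383..9215647")
    print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 4265 bp on LG08. This is the genomic span, not the length of the spliced transcript.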


Homology
GenBank top hits (e value, % identity, alignment)
KAA0055720.1 uncharacterized protein E6C27_scaffold181G001030 [Cucumis melo var. makuwa] (e value: 5.2e-242, % identity: 93.13)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRT+EALV +  VLG++SL C  R+E GSRQKLEVQ HLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP P+NQLWH NGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNCK
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

XP_004144071.1 uncharacterized protein LOC101217988 [Cucumis sativus] (e value: 2.1e-243, % identity: 94.08)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRT+EALV V  VLGM+SL C  R+E GSRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P+FHPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPKP+NQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRN NC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

XP_008451013.1 PREDICTED: uncharacterized protein LOC103492421 [Cucumis melo] (e value: 2.8e-243, % identity: 93.6)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEALV +  VLG++SL C  R+E GSRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP P+NQLWH NGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNCK
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

XP_023529873.1 uncharacterized protein LOC111792593 [Cucurbita pepo subsp. pepo] (e value: 2.9e-240, % identity: 93.82)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEALVA+LCVLG++SL C+ARME  SRQKLEV+ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ +PTFHPE L D
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKVAEKA+EKP P+ QLWHVNGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSP+LY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVD SNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSNGDWGHFFYYGGPGRNPNC
Subjt:  GSNGDWGHFFYYGGPGRNPNC

XP_038880223.1 uncharacterized protein LOC120071884 [Benincasa hispida] (e value: 5.9e-246, % identity: 94.79)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEALVA+ CVLGMISL C+ RME GSRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP P+NQLWHVNGKCP+GTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LZQ5 Uncharacterized protein (e value: 1.0e-243, % identity: 94.08)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRT+EALV V  VLGM+SL C  R+E GSRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P+FHPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKPKP+NQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRN NC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

A0A1S3BQI8 uncharacterized protein LOC103492421 (e value: 1.3e-243, % identity: 93.6)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEALV +  VLG++SL C  R+E GSRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP P+NQLWH NGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNCK
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

A0A5A7UKQ7 Uncharacterized protein (e value: 2.5e-242, % identity: 93.13)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRT+EALV +  VLG++SL C  R+E GSRQKLEVQ HLRRLNKPAVKTI SPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        E+KVAEKASEKP P+NQLWH NGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNG+HTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNCK
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

A0A6J1D1Y4 uncharacterized protein LOC111016541 (e value: 6.9e-240, % identity: 92.42)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEA+VAVLCVLG+ISL  + R+E  SRQKLEVQ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQM+PT+HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
         +KV EK  EKP P+NQLWH+NGKCP+GTIPIRRTKHEDVLRASSVKR+GRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSA+RNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNGQHTSTQMGSGHFP+EGFGKASYFRNIQVVDGSNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        GSNGDWGHFFYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

A0A6J1HUM5 uncharacterized protein LOC111466987 (e value: 6.9e-240, % identity: 93.35)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MGSARFSRCRTMEALVA+ CVLG++S+ C+ARME  SRQKLEV+ HLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQ +PTFHPE L D
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKVAEKASEKP P+ QLWHVNGKCPEGTIPIRRTKHEDVLRASSVKR+GRKKHRS PIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQPNEFSLSQ+WILGGSFGEDLNSIEAGWQVSP+LY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG VMGYWPSFLFSYLADSA+MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVD SNNLKPPKG+GTFTEQPDCYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSNGDWGHFFYYGGPGRNPNC
Subjt:  GSNGDWGHFFYYGGPGRNPNC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) (e value: 1.1e-200, % identity: 78.15)
Query:  GSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFDE
        G    S  +     +  LC+ G  SLS +AR  V S+QK EV+ HL RLNKPAVK+I+S DGD+IDCV +S QPAFDHPFLKDHKIQMKP +HPEGLFD+
Subjt:  GSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFDE

Query:  SKV-AEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        +KV A K++EK   + QLWH  GKC EGTIP+RRTK +DVLRASSVKR+G+KK RS P+ P+SAEPDLINQSGHQHAIAYVEGDKYYGAKAT+NVWEP I
Subjt:  SKV-AEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        QQ NEFSLSQ+W+LGGSFG+DLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS +RNSQYDISIL+WKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG V+GYWPSFLFSYL +SA+MIEWGGEVVNS+ +GQHTSTQMGSG FP+EGF KASYFRNIQVVDGSNNLK PKGLGTFTEQ +CYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSN DWGH+FYYGGPG+N  C
Subjt:  GSNGDWGHFFYYGGPGRNPNC

AT2G44210.1 Protein of Unknown Function (DUF239) (e value: 9.1e-160, % identity: 64.49)
Query:  LEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFDESKVAEKA-SEKPKPVNQLWHVNGKCPEGTIPIRRTKHED
        L+++ HL+RLNKPA+K+I+SPDGD+IDCV ++ QPAF HP L +H +QM P+ +PE +F ESKV+ K  +++   ++QLWHVNGKCP+ TIPIRRT+ +D
Subjt:  LEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFDESKVAEKA-SEKPKPVNQLWHVNGKCPEGTIPIRRTKHED

Query:  VLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDN
        + RASSV+ +G K  +S P P  S  P+++ Q+GHQHAI YVE   +YGAKA +NVW+P ++ PNEFSL+Q+W+LGG+F  DLNSIEAGWQVSP LY DN
Subjt:  VLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDN

Query:  NTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPNEGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVV
         TRLFTYWTSDAYQ TGCYNLLCSGF+QIN +IAMG SISP+S + NSQYDI+IL+WKDP EGHWW+QFG   ++GYWP+ LFSYL++SA+MIEWGGEVV
Subjt:  NTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPNEGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVV

Query:  NSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNC
        NS+   GQHT+TQMGSG F +EG+GKASYF+N+QVVDGSN L+ P+ L  FT+Q +CY+V++G+ G WG +FYYGGPGRNPNC
Subjt:  NSE-PNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNC

AT3G13510.1 Protein of Unknown Function (DUF239) (e value: 4.8e-201, % identity: 76.96)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        MG+  FS  +        +C+  M+SLSC+A     SRQK EV+ HL RLNKP VKTI+SPDGD+IDC+ +S QPAFDHPFLKDHKIQM+P++HPEGLFD
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ++KV+ +   K   + QLWH  GKC EGTIP+RRT+ +DVLRASSVKR+G+KKHRS PI P+SAEPDLINQ+GHQHAIAYVEGDKYYGAKAT+NVWEP I
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        Q  NEFSLSQ+W+LGGSFG+DLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS +RNSQYDISIL+WKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFGNG V+GYWPSFLFSYL +SA+MIEWGGEVVNS+  G HT TQMGSGHFP+EGF KASYFRNIQVVDGSNNLK PKGLGTFTE+ +CYDVQT
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNC
        GSN DWGH+FYYGGPG+N NC
Subjt:  GSNGDWGHFFYYGGPGRNPNC

AT5G56530.1 Protein of Unknown Function (DUF239) (e value: 2.0e-199, % identity: 76.07)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        M +A FS+ R     +   C  G++SL+C+ R+ V SRQ  EV  HL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQM P++ PE LF 
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E   P+ QLWH NG C EGTIP+RRTK EDVLRASSVKR+G+KKH S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        Q  NEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS F N QYDISI +WKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFG+G V+GYWPSFLFSYLADSA+++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKGL TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        G N DWGH+FYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK

AT5G56530.2 Protein of Unknown Function (DUF239) (e value: 2.0e-199, % identity: 76.07)
Query:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD
        M +A FS+ R     +   C  G++SL+C+ R+ V SRQ  EV  HL RLNKPAVK+I+SPDGD+IDCVH+S QPAFDHPFLKDHKIQM P++ PE LF 
Subjt:  MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFD

Query:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI
        ESKV+EK  E   P+ QLWH NG C EGTIP+RRTK EDVLRASSVKR+G+KKH S P+ PRSA+PDLINQSGHQHAIAYVEG K+YGAKAT+NVWEP +
Subjt:  ESKVAEKASEKPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSI

Query:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN
        Q  NEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLY DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS F N QYDISI +WKDP 
Subjt:  QQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPN

Query:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT
        EGHWWMQFG+G V+GYWPSFLFSYLADSA+++EWGGEVVN E +G HT+TQMGSG FPDEGF KASYFRNIQVVD SNNLK PKGL TFTE+ +CYDV+ 
Subjt:  EGHWWMQFGNGDVMGYWPSFLFSYLADSATMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQT

Query:  GSNGDWGHFFYYGGPGRNPNCK
        G N DWGH+FYYGGPGRNPNC+
Subjt:  GSNGDWGHFFYYGGPGRNPNCK
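
The alignment blocks above follow the usual BLAST-style layout, in which the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. Because the Subjt lines as shown on this page are identical to the Query lines, the middle line is the only place where the differences are visible. The sketch below counts identities and `+` positions for one block; the function name `midline_stats` is hypothetical, and the inputs are the sequence text only, without the `Query:` label or the leading padding.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one alignment block from its query line and middle line.

    ASSUMES the BLAST-style convention: a letter in the middle line marks an
    identical residue, '+' a conservative substitution, and a space a
    mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    conservative = midline.count("+")
    return {
        "length": len(query),
        "identical": identical,
        "conservative": conservative,
        "other": len(query) - identical - conservative,
    }


if __name__ == "__main__":
    # Final block of the AT5G56530.2 alignment shown above.
    stats = midline_stats(
        "GSNGDWGHFFYYGGPGRNPNCK",
        "G N DWGH+FYYGGPGRNPNC+",
    )
    print(stats)
```

Summing `identical` and `length` over all blocks of one hit and dividing gives an identity percentage for the aligned region. Whether that matches the % identity reported on the page depends on how the database defines the denominator, which is not stated here.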


Sequences
CDS sequence
ATGGGTTCTGCTCGATTTAGCAGATGCAGGACCATGGAGGCTCTCGTCGCTGTTCTATGCGTTTTGGGGATGATTTCTCTCAGTTGTTCGGCGAGAATGGAAGTTGGCTC
CCGCCAGAAGCTTGAGGTGCAAAATCATCTCAGGCGATTGAACAAGCCTGCTGTTAAAACCATTGAGAGCCCAGATGGAGACCTAATTGACTGTGTTCACATGTCTCACC
AGCCTGCATTTGATCATCCATTCCTCAAAGATCACAAAATCCAGATGAAACCAACTTTTCATCCAGAAGGGCTCTTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAA
AAACCAAAGCCAGTTAACCAACTGTGGCATGTTAATGGAAAGTGTCCTGAAGGCACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGGGCAAGTTCAGTCAA
AAGATTTGGAAGAAAAAAGCATAGATCAACACCAATACCTCCTAGGTCTGCTGAGCCTGATCTCATCAACCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAGGCG
ACAAGTATTATGGAGCTAAAGCAACTATGAATGTATGGGAACCTAGTATTCAACAGCCTAATGAATTTAGCTTATCACAGCTTTGGATATTGGGTGGTTCTTTTGGTGAA
GATCTTAATAGCATTGAAGCTGGCTGGCAGGTCAGTCCCGATCTCTACAACGATAACAACACGAGACTCTTCACGTACTGGACGAGTGATGCATATCAAGCTACAGGCTG
TTACAACCTCCTCTGCTCGGGCTTTATTCAAATCAACAGTGATATTGCAATGGGAGCGAGCATCTCCCCGGTCTCGGCCTTCAGAAATTCACAATATGATATCAGTATAC
TTGTTTGGAAGGATCCAAATGAGGGGCATTGGTGGATGCAATTTGGCAATGGAGATGTGATGGGGTATTGGCCCTCATTTCTATTCTCATATTTAGCTGACAGTGCCACC
ATGATTGAGTGGGGAGGTGAAGTTGTGAACTCAGAGCCAAATGGGCAACACACTTCAACACAAATGGGGAGTGGGCATTTTCCTGATGAAGGATTTGGGAAAGCAAGCTA
TTTTAGAAACATTCAAGTAGTTGATGGATCCAATAATCTCAAACCCCCAAAAGGCCTTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGCAGCAATG
GGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCTAATTGCAAATGA
mRNA sequence
GAGAGAGAGTGGCCCTCATTTCACTGCACTTAAAAAAAATTCCACCAAATTTGACCATGCAAACCATTTGGTCTTAAATCATCATTTTCCCATTACAAAATTTGGAAGAA
GAAACGAACAGCGGAAAAAGAAAATGATGAAAGCAGCAAAATCCGAATCAGAGGGAAAATGACGAGAAAGTGGGGCACTTCTGCGAGAGAGAGAGAAAACTCAAAGGCCT
TTTTAGAGAGAAGTGGGCGTGTGAGATTCGCCTCTGTGTATGTTTCTACACTTCTCTTCTTCGATTTCAACCCATTTCTGATGTGGCTTTTACTGGTAGCCGTCCGCCAT
TGTTGCCCTTGACCTGACCTTCATCTTCTCGTTCTGGGTTTTTCTCCACTTGCAGTGTAGTGGGGGAAAATTGAAATTGCTCTTGATTTGTGATTTTGAGAGCGTTTTTG
AAGATGGGTTCTGCTCGATTTAGCAGATGCAGGACCATGGAGGCTCTCGTCGCTGTTCTATGCGTTTTGGGGATGATTTCTCTCAGTTGTTCGGCGAGAATGGAAGTTGG
CTCCCGCCAGAAGCTTGAGGTGCAAAATCATCTCAGGCGATTGAACAAGCCTGCTGTTAAAACCATTGAGAGCCCAGATGGAGACCTAATTGACTGTGTTCACATGTCTC
ACCAGCCTGCATTTGATCATCCATTCCTCAAAGATCACAAAATCCAGATGAAACCAACTTTTCATCCAGAAGGGCTCTTTGATGAGAGCAAAGTAGCTGAGAAAGCCAGT
GAAAAACCAAAGCCAGTTAACCAACTGTGGCATGTTAATGGAAAGTGTCCTGAAGGCACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGGGCAAGTTCAGT
CAAAAGATTTGGAAGAAAAAAGCATAGATCAACACCAATACCTCCTAGGTCTGCTGAGCCTGATCTCATCAACCAAAGTGGTCATCAGCATGCAATAGCTTATGTGGAAG
GCGACAAGTATTATGGAGCTAAAGCAACTATGAATGTATGGGAACCTAGTATTCAACAGCCTAATGAATTTAGCTTATCACAGCTTTGGATATTGGGTGGTTCTTTTGGT
GAAGATCTTAATAGCATTGAAGCTGGCTGGCAGGTCAGTCCCGATCTCTACAACGATAACAACACGAGACTCTTCACGTACTGGACGAGTGATGCATATCAAGCTACAGG
CTGTTACAACCTCCTCTGCTCGGGCTTTATTCAAATCAACAGTGATATTGCAATGGGAGCGAGCATCTCCCCGGTCTCGGCCTTCAGAAATTCACAATATGATATCAGTA
TACTTGTTTGGAAGGATCCAAATGAGGGGCATTGGTGGATGCAATTTGGCAATGGAGATGTGATGGGGTATTGGCCCTCATTTCTATTCTCATATTTAGCTGACAGTGCC
ACCATGATTGAGTGGGGAGGTGAAGTTGTGAACTCAGAGCCAAATGGGCAACACACTTCAACACAAATGGGGAGTGGGCATTTTCCTGATGAAGGATTTGGGAAAGCAAG
CTATTTTAGAAACATTCAAGTAGTTGATGGATCCAATAATCTCAAACCCCCAAAAGGCCTTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGCAGCA
ATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCTAATTGCAAATGAAGCTGAAATTCCCATTTTACCCTCCACAGTCTTCTTCTTCTCTCTCTC
TCTCTCTCTCTCTCACTGTAAACTACCTTTTAGGACTAGTGTTAGGGGGCTGGGCTATATGTGACATTTTCTCACTGATGGAAAGTTTTGGCCCTGTGGGTCATTTTCTT
TTTTGAAAATTTTATCTTTCTTTTTATTTTTTAGAGGAAAGAATCTCTCCATGTAAGTGGTGGGGCAGGGGAGGTGGCCCTAGTTTCTTAAAAGCATAAAAGATTAGATG
GGAAATGGCCTTTTGTTATGGTTATGACTTGATGATGATGATTTGTATTTTATTTGAGTTGCTTTATTAATGAAGAATGAATTGGTTTTGGTTTTGT
Protein sequence
MGSARFSRCRTMEALVAVLCVLGMISLSCSARMEVGSRQKLEVQNHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMKPTFHPEGLFDESKVAEKASE
KPKPVNQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRFGRKKHRSTPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGE
DLNSIEAGWQVSPDLYNDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPNEGHWWMQFGNGDVMGYWPSFLFSYLADSAT
MIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGLGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNCK
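
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch that uses no external library. The `translate` helper is hypothetical, and it assumes the CDS is given in frame, starting at the first base of the start codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted as-is. Codons containing letters other than A, C, G, T raise
    KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 33 and last 12 nucleotides of the CDS listed above.
    print(translate("ATGGGTTCTGCTCGATTTAGCAGATGCAGGACC"))  # MGSARFSRCRT
    print(translate("AATTGCAAATGA"))                        # NCK
```

The two fragments translate to the first eleven and last three residues of the listed protein sequence. Passing the full CDS to the same function is the complete check.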