; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1478 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1478
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein ZGRF1 isoform X2
Genome locationMC06:22132951..22139245
RNA-Seq ExpressionMC06g1478
SyntenyMC06g1478
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0035861 - site of double-strand break (cellular component)
InterPro domainsIPR018838 - Domain of unknown function DUF2439


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145925.1 uncharacterized protein LOC111015274 isoform X1 [Momordica charantia]0.098.55Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRI HGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSV SYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_022145926.1 uncharacterized protein LOC111015274 isoform X2 [Momordica charantia]0.098.36Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRI HGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSV SYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874787.1 uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]5.00e-28975.45Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+LHG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VKVEL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL
        RP+ARV LSSGH DE+ISVSV SY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT +  TL EDVEI HSSQL
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL

Query:  LQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        L+SD+ E E  SLRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  LQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874788.1 uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]1.31e-28675.27Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+LHG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VK EL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL
        RP+ARV LSSGH DE+ISVSV SY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT +  TL EDVEI HSSQL
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL

Query:  LQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        L+SD+ E E  SLRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  LQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874789.1 uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]9.65e-29276.55Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+LHG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VKVEL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RP+ARV LSSGH DE+ISVSV SY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ TFT +  TL EDVEI HSSQLL+SD+ E 
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        E  SLRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

TrEMBL top hitse value%identityAlignment
A0A6J1CVV0 uncharacterized protein LOC111015274 isoform X10.098.55Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRI HGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSV SYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1CXU9 uncharacterized protein LOC111015274 isoform X20.098.36Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRI HGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSV SYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X23.89e-28173.79Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLVDIDTP GDHKPESGLN QAG D+I E
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI L+QGS+ GD GTR+L+  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDR ILSSKH+SLSK L MG ILELPKYLVE+GEAC +VKVE++NR++D RKD SF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSGH+D++I VSV S K PE S     LD+P   +DDRSH +PS N+D RDS KNAE NQSI LT ST T       E++EI HS+QLL
Subjt:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        Q+++VE ES SLR++ +  QG    AAC+LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X43.70e-27873.61Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+LH  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VKVEL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSG +D++ISVSV S K PE S     LD+P   +D+RSH +PS N+D R+S KNAE+NQS  LTQST T+        +EI HS+Q  
Subjt:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
         ++ VE ES SLR++ +  QG    AAC LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X23.23e-28274.15Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+LH  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VKVEL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSG +D++ISVSV S K PE S     LD+P   +D+RSH +PS N+D R+S KNAE+NQS  LTQST T+        +EI HS+QLL
Subjt:  ILQRPKARVNLSSGHTDESISVSVWSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        Q++ VE ES SLR++ +  QG    AAC LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10890.1 unknown protein4.5e-2644.24Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E  RW   +TKHLKQKRKVYHDGFLD+H +  K+    +LYDE + LLE R LK  EV+ +GETL F +YLVDI  P+   K  S    +  D + + 
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT
        KP  +   NF+ +S      EK     NK +  +LSPSH +IR FKK  L  YG+      T+ T
Subjt:  KPGVLPGKNFRNNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAAGTGAATCGATGGAAGGTGACCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGATTGG
CACACAGGCGTTGCTCTATGATGAATGCGAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTG
TTGACATCGACACTCCTCAGGGAGATCATAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTC
CGAAATAACTCTGGTTGCTTTGCAAGTGCGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTA
TGGATCGCCACAAAGTAGTCCAGACACAAGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCT
TGAAACTTTCTGTTTGTGGATCCCTTGGGAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGA
GAATCAGTAGCCTTTAATGCTCATTTAGTAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGTGGAAC
CAGGATACTGCATGGACAGAAAAAAAAAAATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATC
AGAATGGAATCATCAGAATTTCTTCCTCTGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGAT
TTGATGATGGGTAATATACTTGAGCTGCCAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGTAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGA
CGCAAGCTTTTACATTCCTGGCAAATATGAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGG
CAAGAGTCAACCTTTCTTCAGGTCATACTGATGAAAGCATTAGTGTATCGGTTTGGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGAT
GATCGATCTCATTACGAACCAAGTGGAAACGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCATGG
CAGAACACTGGCTGAAGATGTTGAAATTGAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTGCTGGGATTCAAG
GTTTGATGGACTCTGCGGCTTGTGACCTTGTCAGTGATGAAGTGAAAATCTATGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATT
TGA
mRNA sequenceShow/hide mRNA sequence
AATCGCAAGAGCGGGTCACTAGCTTAATTTTCGTACGCGAGGATTTCCCGCCATAAGAAATTGCTCGACGAAATTCGACTCCATGGGCGAAGTGAATCGATGGAAGGTGA
CCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGATTGGCACACAGGCGTTGCTCTATGATGAATGC
GAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTGTTGACATCGACACTCCTCAGGGAGATCA
TAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTCCGAAATAACTCTGGTTGCTTTGCAAGTG
CGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTATGGATCGCCACAAAGTAGTCCAGACACA
AGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCTTGAAACTTTCTGTTTGTGGATCCCTTGG
GAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGAGAATCAGTAGCCTTTAATGCTCATTTAG
TAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGTGGAACCAGGATACTGCATGGACAGAAAAAAAAA
AATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATCAGAATGGAATCATCAGAATTTCTTCCTC
TGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGATTTGATGATGGGTAATATACTTGAGCTGC
CAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGTAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGACGCAAGCTTTTACATTCCTGGCAAATAT
GAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCAAGAGTCAACCTTTCTTCAGGTCATAC
TGATGAAAGCATTAGTGTATCGGTTTGGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGATGATCGATCTCATTACGAACCAAGTGGAA
ACGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCATGGCAGAACACTGGCTGAAGATGTTGAAATT
GAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTGCTGGGATTCAAGGTTTGATGGACTCTGCGGCTTGTGACCT
TGTCAGTGATGAAGTGAAAATCTATGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATTTGAACTGGCCGATTCAGATTAACAAATA
TGATGCGCCTACAAAGCAGGTCAGGAAAACTGCATCTAAACTGTTTCCTCCTTATTGTTGGATATTTTATCACATCAGTAATGCATCCCTGTATATGAATCTATTAGGAT
CCCAAATACACTAAACGAAACAAGATAACTATATTGCAATATCAAGAGATAGTTAAAGTATCAATAGCCTTTCGAGAGGGCTATCTCTCCCTCAAATAACTATACTAATT
TCTTCACTTTAGAGTTTAGACTATTTCCAAAGATACCTTCTCCAATCTCCACTGCCTCTAGTTATATCTAACTTACCTCTAACTCATTATCATAACGTCCCTAACTAATA
TACTAATACTATTCTCAAACTAACCCTAATAGGACTCTCACATGATCTAAATTAGCTTGACAATGCATTATTAGGTCTCTAGTTTATGTTTAGACAAGTTTTCTCCTTAT
AAGTCCCAAACTCCTTTGCCTTATGCTCGAGTACCATTTTCCAAATACCCATAAATTTTCACTATTCAGTGGTATTAGCAACAAAAAGATTGATGTATTCAGTTTGTTTG
TGAATAGATTTGGATTAGACTGAAGAAAGTTTAAACATGCTGTGCCAGGTACTTGTTACATTCTAGATTTATGATCGGATGAGAATAGCTCTTTCCTTGTGTAGATGATT
GATTAGTTATATACATCCTTTCAGCTTCCACCTAACATTGATCAGTATTGGTTTTGTGCCAAAATTGACTAAAACCAGCCATTCAGTTTTAGTCGGAAACTAAAAATAAG
CTTTGTACTAAAACCGGCGGTTGTTTTTGTGAACGTCGAGAGGAAGAGATGCCACTGTGCGGATCACACAGGAAAGCTTGAGATGAGAGAGACTTAGAAGAGAGGGAAAA
TGGAGAAAGAAAAATTGAAGAAATGAGTGAAGGATAAAATGGAAAGAAAAATAAGAAAGGGAGATGGGTTGAAGGGAAAGAGAAAAGAGGAGGGGGAGGGGGAGGAAAAG
AAAAAG
Protein sequenceShow/hide protein sequence
MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGVLPGKNF
RNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRKLLDSRFIKKDEIVKSG
ESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRILHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSGSHHIQVTLLNEDRIILSSKHISLSKD
LMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKARVNLSSGHTDESISVSVWSYKAPELSNLDVPNVSLD
DRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI