; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020340 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020340
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein ZGRF1 isoform X2
Genome locationscaffold665:1147369..1152557
RNA-Seq ExpressionMS020340
SyntenyMS020340
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0035861 - site of double-strand break (cellular component)
InterPro domainsIPR018838 - Domain of unknown function DUF2439


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145925.1 uncharacterized protein LOC111015274 isoform X1 [Momordica charantia]1.3e-30698.36Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSG VDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        ESRSLRNST+GIQGLMDSAACDLVSDEVKI EENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

XP_022145926.1 uncharacterized protein LOC111015274 isoform X2 [Momordica charantia]9.6e-30598.18Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSG VDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        ESRSLRNST+GIQGLMDSAACDLVSDEVKI EENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

XP_038874787.1 uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]7.8e-23075.81Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VKVEL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL
        RP+ARV LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT +  TL EDVEI HSSQL
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL

Query:  LQSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        L+SD+ E E  SLRNS S  +   D+AAC LV+DE KICEE T +R++DACPSFDLGI
Subjt:  LQSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

XP_038874788.1 uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]5.6e-22875.63Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VK EL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL
        RP+ARV LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT +  TL EDVEI HSSQL
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQ--------STFTDHGRTLAEDVEIEHSSQL

Query:  LQSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        L+SD+ E E  SLRNS S  +   D+AAC LV+DE KICEE T +R++DACPSFDLGI
Subjt:  LQSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

XP_038874789.1 uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]8.3e-23276.91Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK     +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        K GVL GKNFRNNS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VKVEL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RP+ARV LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ TFT +  TL EDVEI HSSQLL+SD+ E 
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        E  SLRNS S  +   D+AAC LV+DE KICEE T +R++DACPSFDLGI
Subjt:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

TrEMBL top hitse value%identityAlignment
A0A6J1CVV0 uncharacterized protein LOC111015274 isoform X16.5e-30798.36Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSG VDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        ESRSLRNST+GIQGLMDSAACDLVSDEVKI EENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

A0A6J1CXU9 uncharacterized protein LOC111015274 isoform X24.7e-30598.18Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNK    ALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
        KPGVLPGKNFR NSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD
Subjt:  KPGVLPGKNFRNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFD

Query:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
        ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI
Subjt:  ENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRI

Query:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
        SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ
Subjt:  SSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQ

Query:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN
        RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSG VDMRDSIKNAETNQSIVLTQSTFTD GRTLAEDVEIEHSSQLLQSDNVEN
Subjt:  RPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVEN

Query:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        ESRSLRNST+GIQGLMDSAACDLVSDEVKI EENTCKRKIDACPSFDLGI
Subjt:  ESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X27.6e-22373.79Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLVDIDTP GDHKPESGLN QAG D+I E
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI L+QGS+ GD GTR+ +  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDR ILSSKH+SLSK L MG ILELPKYLVE+GEAC +VKVE++NR++D RKD SF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSGH+D++I VSV S K PE S     LD+P   +DDRSH +PS  +D RDS KNAE NQSI LT ST T       E++EI HS+QLL
Subjt:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        Q+++VE ES SLR++ S  QG    AAC+LV+DE K+CEE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X49.3e-22173.79Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+ H  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VKVEL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSG +D++ISVSVSS K PE S     LD+P   +D+RSH +PS  +D R+S KNAE+NQS  LTQST T        ++EI HS+   
Subjt:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        Q++ VE ES SLR++ S  QG    AAC LV+DE K+CEE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X26.9e-22474.33Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK     +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISE
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM
        K GVL GKNFRNNS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVM
Subjt:  KPGVLPGKNFRNNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVM

Query:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI
        LFDENRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+ H  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGI
Subjt:  LFDENRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGI

Query:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS
        I+ISSSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VKVEL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILS
Subjt:  IRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILS

Query:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL
        ILQRPKARV+LSSG +D++ISVSVSS K PE S     LD+P   +D+RSH +PS  +D R+S KNAE+NQS  LTQST T        ++EI HS+QLL
Subjt:  ILQRPKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLL

Query:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI
        Q++ VE ES SLR++ S  QG    AAC LV+DE K+CEE T +R+   CPSFDLGI
Subjt:  QSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10890.1 unknown protein4.5e-2644.24Show/hide
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE
        M E  RW   +TKHLKQKRKVYHDGFLD+H +  K+    +LYDE + LLE R LK  EV+ +GETL F +YLVDI  P+   K  S    +  D + + 
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISE

Query:  KPGVLPGKNFRNNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT
        KP  +   NF+ +S      EK     NK +  +LSPSH +IR FKK  L  YG+      T+ T
Subjt:  KPGVLPGKNFRNNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAAGTGAATCGATGGAAGGTGACCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGATTGG
CACACAGGCGTTGCTCTATGATGAATGCGAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTG
TTGACATCGACACTCCTCAGGGAGATCATAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTC
CGAAATAACTCTGGTTGCTTTGCAAGTGCGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTA
TGGATCGCCACAAAGTAGTCCAGACACAAGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCT
TGAAACTTTCTGTTTGTGGATCCCTTGGGAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGA
GAATCAGTAGCCTTTAATGCTCATTTAGTAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGGGGAAC
CAGGATACCGCATGGACAGAAAAAAAAAAATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATC
AGAATGGAATCATCAGAATTTCTTCCTCTGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGAT
TTGATGATGGGTAATATACTTGAGCTGCCAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGTAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGA
CGCAAGCTTTTACATTCCTGGCAAATATGAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGG
CAAGAGTCAACCTTTCTTCAGGTCATACTGATGAAAGCATTAGTGTATCGGTTTCGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGAT
GATCGATCTCATTACGAACCAAGTGGAATCGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCATGG
CAGAACATTGGCTGAAGATGTTGAAATTGAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTTCTGGGATTCAAG
GTTTGATGGACTCTGCGGCTTGTGACCTTGTCAGTGATGAAGTGAAAATCTGTGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATT
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAAGTGAATCGATGGAAGGTGACCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGATTGG
CACACAGGCGTTGCTCTATGATGAATGCGAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTG
TTGACATCGACACTCCTCAGGGAGATCATAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTC
CGAAATAACTCTGGTTGCTTTGCAAGTGCGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTA
TGGATCGCCACAAAGTAGTCCAGACACAAGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCT
TGAAACTTTCTGTTTGTGGATCCCTTGGGAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGA
GAATCAGTAGCCTTTAATGCTCATTTAGTAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGGGGAAC
CAGGATACCGCATGGACAGAAAAAAAAAAATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATC
AGAATGGAATCATCAGAATTTCTTCCTCTGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGAT
TTGATGATGGGTAATATACTTGAGCTGCCAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGTAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGA
CGCAAGCTTTTACATTCCTGGCAAATATGAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGG
CAAGAGTCAACCTTTCTTCAGGTCATACTGATGAAAGCATTAGTGTATCGGTTTCGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGAT
GATCGATCTCATTACGAACCAAGTGGAATCGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCATGG
CAGAACATTGGCTGAAGATGTTGAAATTGAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTTCTGGGATTCAAG
GTTTGATGGACTCTGCGGCTTGTGACCTTGTCAGTGATGAAGTGAAAATCTGTGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATT
Protein sequenceShow/hide protein sequence
MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKIGTQALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGVLPGKNF
RNNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRKLLDSRFIKKDEIVKSG
ESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSGSHHIQVTLLNEDRIILSSKHISLSKD
LMMGNILELPKYLVEVGEACESVKVELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLD
DRSHYEPSGIVDMRDSIKNAETNQSIVLTQSTFTDHGRTLAEDVEIEHSSQLLQSDNVENESRSLRNSTSGIQGLMDSAACDLVSDEVKICEENTCKRKIDACPSFDLGI