CuGenDBv2

Moc06g33760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g33760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein ZGRF1 isoform X2
Genome location: chr6:25615880..25621094
RNA-Seq Expression: Moc06g33760
Synteny: Moc06g33760
Gene Ontology terms:
    GO:0006302 - double-strand break repair (biological process)
    GO:0035861 - site of double-strand break (cellular component)
InterPro domains: IPR018838 - Domain of unknown function DUF2439
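
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string might be split for downstream use, the hypothetical helper `parse_location` below extracts the parts and computes the span. It assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into its parts and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr6:25615880..25621094"))
```

Under that assumption the gene locus spans 5215 bp on chr6.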


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145925.1 uncharacterized protein LOC111015274 isoform X1 [Momordica charantia]; e value 0.0e+00; % identity 99.82
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
        SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA

Query:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS
        RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS
Subjt:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS

Query:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_022145926.1 uncharacterized protein LOC111015274 isoform X2 [Momordica charantia]; e value 0.0e+00; % identity 100
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR
        SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR

Query:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL
        VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL
Subjt:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL

Query:  RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874787.1 uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]; e value 1.2e-227; % identity 75.63
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISEK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        L GKNFR NS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++SSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
        SH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VK EL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQRP+A
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA

Query:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDPGRTLAEDVEIEHSSQLLQSD
        RV LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT    TL EDVEI HSSQLL+SD
Subjt:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDPGRTLAEDVEIEHSSQLLQSD

Query:  NVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        + E E  SLRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  NVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874788.1 uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]; e value 5.0e-229; % identity 75.77
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISEK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        L GKNFR NS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++SSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR
        SH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VKEL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQRP+AR
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR

Query:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDPGRTLAEDVEIEHSSQLLQSDN
        V LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ        +TFT    TL EDVEI HSSQLL+SD+
Subjt:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQ--------STFTDPGRTLAEDVEIEHSSQLLQSDN

Query:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
         E E  SLRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

XP_038874789.1 uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]; e value 1.3e-229; % identity 76.74
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGE+NRWKVT+TKHLKQKRKVYHDGFLDIHRS+NK +LYDECEKLLECR+LKQDEVI SGETLIFNSYLVDIDTP GDHKPES LN Q GDD+ISEK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        L GKNFR NS  FAS EKNK R +LSPSH+IIREFKK RLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLS+CGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDE VKSGES+AF+AHLVEIGECE+D KPPKI  NQGS+SG+GGTR+ HG+K   +ENEISTGKEW+VLYTSQ+TQKSKKY NGII++SSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
        SH  QVTLLNEDR ILSSKH SLSK++ +G ILELPKYLVE+GEACE+VK EL NRN+D RKDASF I G  E+G GRET+KK LR+AH+ILSILQRP+A
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA

Query:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS
        RV LSSGH DE+ISVSVSSY  PE S  +  ++S+DD+SH +PS   D R+S KNAE NQSIVLTQ TFT    TL EDVEI HSSQLL+SD+ E E  S
Subjt:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS

Query:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        LRNS +  +   D+AAC LV+DE KI EE T +R++DACPSFDLGI
Subjt:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CVV0 uncharacterized protein LOC111015274 isoform X1; e value 0.0e+00; % identity 99.82
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
        SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKA

Query:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS
        RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS
Subjt:  RVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRS

Query:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  LRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1CXU9 uncharacterized protein LOC111015274 isoform X2; e value 0.0e+00; % identity 100
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
        LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK
Subjt:  LPGKNFRKNSGCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRK

Query:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
        LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG
Subjt:  LLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSG

Query:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR
        SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR
Subjt:  SHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKAR

Query:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL
        VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL
Subjt:  VNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSL

Query:  RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
Subjt:  RNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X2; e value 1.4e-221; % identity 73.78
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLVDIDTP GDHKPESGLN QAG D+I EK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE
        L GKNFR NS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAKKYHDGFLKL +CGSLGRQVMLFDE
Subjt:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE

Query:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS
        NRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI L+QGS+ GD GTR+ +  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGII+IS
Subjt:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS

Query:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR
        SSGSHH+QVTLLNEDR ILSSKH+SLSK L MG ILELPKYLVE+GEAC +VK E++NR++D RKD SF I G+ E+G  R T+KK LRDAHEILSILQR
Subjt:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR

Query:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN
        PKARV+LSSGH+D++I VSV S K PE S     LD+P   +DDRSH +PS N+D RDS KNAE NQSI LT ST T       E++EI HS+QLLQ+++
Subjt:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN

Query:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        VE ES SLR++ +  QG    AAC+LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X4; e value 1.7e-219; % identity 73.78
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISEK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE
        L GKNFR NS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVMLFDE
Subjt:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE

Query:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS
        NRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+ H  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGII+IS
Subjt:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS

Query:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR
        SSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VK EL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILSILQR
Subjt:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR

Query:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN
        PKARV+LSSG +D++ISVSVSS K PE S     LD+P   +D+RSH +PS N+D R+S KNAE+NQS  LTQST T        ++EI HS+   Q++ 
Subjt:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN

Query:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        VE ES SLR++ +  QG    AAC LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X2; e value 1.3e-222; % identity 74.32
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        M E+NRWKVT+TKHLKQ+RKVYHDGFLD+HRS+NK +LYDECEKLLECR+LKQ+EV+CSGETLIFNSYLV+IDTP GD+KPESGLN QAG DEISEK GV
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE
        L GKNFR NS CF   ASAEKNKTR TLSPS KIIREFKKSRLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL +CGSLGRQVMLFDE
Subjt:  LPGKNFRKNSGCF---ASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDE

Query:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS
        NRKLLDSRF+KKDE VKSGES+AF+AHLV+IGECER+ KPPKI ++QGS+ GD GTR+ H  KK  +ENEISTGKEWHVLYTSQITQKSKKY NGII+IS
Subjt:  NRKLLDSRFIKKDEIVKSGESVAFNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRIS

Query:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR
        SSGSHH+QVTLLNEDRIILSSKHISLSK L MG ILELPKYLVE+GEACE+VK EL+NR++D RKDASF I G+ E+G  R T+KK LRDAHEILSILQR
Subjt:  SSGSHHIQVTLLNEDRIILSSKHISLSKDLMMGNILELPKYLVEVGEACESVK-ELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQR

Query:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN
        PKARV+LSSG +D++ISVSVSS K PE S     LD+P   +D+RSH +PS N+D R+S KNAE+NQS  LTQST T        ++EI HS+QLLQ++ 
Subjt:  PKARVNLSSGHTDESISVSVSSYKAPELS----NLDVPNVSLDDRSHYEPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDN

Query:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
        VE ES SLR++ +  QG    AAC LV+DE K+ EE T +R+   CPSFDLGI
Subjt:  VENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G10890.1 unknown protein; e value 4.8e-28; % identity 45.96
Query:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV
        M E  RW   +TKHLKQKRKVYHDGFLD+H +  K +LYDE + LLE R LK  EV+ +GETL F +YLVDI  P+   K  S    +  D + + KP  
Subjt:  MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGV

Query:  LPGKNFRKNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT
        +   NF+K+S      EK     NK +  +LSPSH +IR FKK  L  YG+      T+ T
Subjt:  LPGKNFRKNSGCFASAEK-----NK-TRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKT
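
The % identity values reported for each hit can be approximated from an alignment block. The sketch below is an illustration, not the database's own method: it assumes the unlabeled middle line repeats the residue letter wherever query and subject are identical, that `+` or a space marks a non-identical column, and that every line carries an 8-character label prefix such as `Query:  `. The helper name `percent_identity` is hypothetical.

```python
def percent_identity(query_line, mid_line, prefix_len=8):
    """Estimate % identity for one alignment block from its match line."""
    query = query_line[prefix_len:].rstrip()
    if not query:
        raise ValueError("empty query segment")
    # Trailing mismatches may have been stripped as whitespace; pad them back.
    mid = mid_line[prefix_len:].rstrip().ljust(len(query))
    # Assumption: a letter on the match line marks an identical column.
    matches = sum(1 for ch in mid if ch.isalpha())
    return 100.0 * matches / len(query)


if __name__ == "__main__":
    # First 20 columns of the Benincasa hispida isoform X1 alignment.
    print(percent_identity("Query:  MGEVNRWKVTFTKHLKQKRK",
                           "        MGE+NRWKVT+TKHLKQKRK"))
```

For a whole hit, the match and column counts would be summed over all blocks before dividing. Gap columns in the query (`-`) count toward the alignment length here.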


Sequences
CDS sequence
ATGGGCGAAGTGAATCGATGGAAGGTGACCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGGCGTT
GCTCTATGATGAATGCGAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTGTTGACATCGACA
CTCCTCAGGGAGATCATAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTCCGAAAGAACTCT
GGTTGCTTTGCAAGTGCGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTATGGATCGCCACA
AAGTAGTCCAGACACAAGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCTTGAAACTTTCTG
TTTGTGGATCCCTTGGGAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGAGAATCAGTAGCC
TTTAATGCTCATTTAGTAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGGGGAACCAGGATACCGCA
TGGACAGAAAAAAAAAAATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATCAGAATGGAATCA
TCAGAATTTCTTCCTCTGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGATTTGATGATGGGT
AATATACTTGAGCTGCCAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGACGCAAGCTTTTACAT
TCCTGGCAAATATGAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCAAGAGTCAACCTTT
CTTCAGGTCATACTGATGAAAGCATTAGTGTATCGGTTTCGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGATGATCGATCTCATTAC
GAACCAAGTGGAAACGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCCTGGCAGAACATTGGCTGA
AGATGTTGAAATTGAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTGCTGGGATTCAAGGTTTGATGGACTCTG
CGGCTTGTGACCTTGTCAGTGATGAAGTGAAAATCTATGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATTTGA
mRNA sequence
ATGGGCGAAGTGAATCGATGGAAGGTGACCTTTACGAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGACGGTTTCTTAGACATCCACCGCTCCACCAATAAGGCGTT
GCTCTATGATGAATGCGAGAAGCTCCTTGAATGCAGGCTATTAAAGCAAGATGAAGTAATTTGCTCCGGCGAAACGCTTATATTCAACAGTTACCTTGTTGACATCGACA
CTCCTCAGGGAGATCATAAGCCCGAGTCTGGTCTGAACACCCAAGCAGGTGATGATGAGATATCCGAGAAACCTGGTGTGTTGCCAGGAAAAAACTTCCGAAAGAACTCT
GGTTGCTTTGCAAGTGCGGAGAAGAATAAAACTCGGACGACTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGCAGATTAAAGTGCTATGGATCGCCACA
AAGTAGTCCAGACACAAGAAAAACCGAGGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGACGGTTTCTTGAAACTTTCTG
TTTGTGGATCCCTTGGGAGGCAGGTCATGCTCTTTGATGAAAACAGGAAATTACTGGATAGCAGATTTATTAAGAAAGATGAAATAGTAAAATCTGGAGAATCAGTAGCC
TTTAATGCTCATTTAGTAGAAATTGGAGAATGTGAAAGAGACCAGAAGCCTCCTAAAATTGCTTTAAATCAAGGCAGCAATTCTGGAGATGGGGGAACCAGGATACCGCA
TGGACAGAAAAAAAAAAATAACGAAAATGAAATATCTACTGGAAAAGAATGGCATGTTCTGTACACTAGTCAGATAACTCAAAAGTCTAAGAAATATCAGAATGGAATCA
TCAGAATTTCTTCCTCTGGTTCTCACCATATTCAGGTTACTTTACTAAATGAAGATAGAATTATATTAAGCAGCAAACACATTAGTTTATCTAAAGATTTGATGATGGGT
AATATACTTGAGCTGCCAAAATACTTGGTTGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGAGCTCAGCAACAGAAATTATGATTCAAGAAAAGACGCAAGCTTTTACAT
TCCTGGCAAATATGAACAAGGATTGGGCAGAGAGACTATTAAAAAGCCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCAAGAGTCAACCTTT
CTTCAGGTCATACTGATGAAAGCATTAGTGTATCGGTTTCGTCATACAAGGCTCCTGAACTTTCCAATTTGGACGTACCGAATGTTTCACTAGATGATCGATCTCATTAC
GAACCAAGTGGAAACGTGGACATGAGGGATTCAATTAAGAATGCAGAAACCAACCAATCCATTGTTCTTACTCAATCAACATTCACTGATCCTGGCAGAACATTGGCTGA
AGATGTTGAAATTGAACACTCCAGCCAGCTTCTTCAGTCAGACAACGTGGAGAATGAAAGTAGATCTCTCAGAAATTCAACTGCTGGGATTCAAGGTTTGATGGACTCTG
CGGCTTGTGACCTTGTCAGTGATGAAGTGAAAATCTATGAGGAGAACACATGTAAAAGAAAAATAGATGCATGCCCGAGTTTTGATCTTGGAATTTGA
Protein sequence
MGEVNRWKVTFTKHLKQKRKVYHDGFLDIHRSTNKALLYDECEKLLECRLLKQDEVICSGETLIFNSYLVDIDTPQGDHKPESGLNTQAGDDEISEKPGVLPGKNFRKNS
GCFASAEKNKTRTTLSPSHKIIREFKKSRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSVCGSLGRQVMLFDENRKLLDSRFIKKDEIVKSGESVA
FNAHLVEIGECERDQKPPKIALNQGSNSGDGGTRIPHGQKKKNNENEISTGKEWHVLYTSQITQKSKKYQNGIIRISSSGSHHIQVTLLNEDRIILSSKHISLSKDLMMG
NILELPKYLVEVGEACESVKELSNRNYDSRKDASFYIPGKYEQGLGRETIKKPLRDAHEILSILQRPKARVNLSSGHTDESISVSVSSYKAPELSNLDVPNVSLDDRSHY
EPSGNVDMRDSIKNAETNQSIVLTQSTFTDPGRTLAEDVEIEHSSQLLQSDNVENESRSLRNSTAGIQGLMDSAACDLVSDEVKIYEENTCKRKIDACPSFDLGI
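
The protein sequence above corresponds to the CDS translated codon by codon. As a minimal sketch of that relationship, the code below builds the standard genetic code and translates a DNA string. It assumes the standard nuclear code applies and that the CDS starts in frame; the function name `translate` and the `X` placeholder for unrecognized codons are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate an in-frame coding sequence into one-letter amino acids."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the CDS listed on this page.
    print(translate("ATGGGCGAAGTGAATCGATGGAAGGTGACC"))
```

Applied to the first ten codons of the CDS, this yields `MGEVNRWKVT`, matching the start of the listed protein. The final codons `GAT CTT GGA ATT TGA` give `DLGI` followed by a stop.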