; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009044 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009044
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein ZGRF1 isoform X2
Genome locationChr06:1961948..1966584
RNA-Seq ExpressionHG10009044
SyntenyHG10009044
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0035861 - site of double-strand break (cellular component)
InterPro domainsIPR018838 - Domain of unknown function DUF2439


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011654696.1 uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus]8.3e-23780.62Show/hide
Query:  MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKIS
        MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECR+LKQDEVICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKIS
Subjt:  MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKIS

Query:  EKSGVLRGKNFRNNSVCF-------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD
        E SGV+RGK+  NNSVC                    EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LG QV LFD
Subjt:  EKSGVLRGKNFRNNSVCF-------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD

Query:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK
        ENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHKP KIP+N+G+SS   GG  VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Subjt:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK

Query:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL
        ISSSGS  MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+R  DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSIL
Subjt:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL

Query:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE
        QRPR RV+LSSGHTDENISVSV S   P+PSLAEALHL  D QSH+KPSE  +TRES KN EN+QSIALTQSTFTGNA TLTED E G SS+L RSDHVE
Subjt:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE

Query:  AESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI
        AESISLRN+I R    SDS AC LVN DEGKICEEITYERE+ A PSFDLGI
Subjt:  AESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI

XP_022928770.1 uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata]4.4e-23878.38Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV+CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI E
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT
        KSGVLRGKNFRNNSVCF                       EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAKKYHDGFLKL ICG LGRQV 
Subjt:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT

Query:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG
        LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE +HKPPKIP++QG SS G+ GTRVL+  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Subjt:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG

Query:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL
        IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+  GEILELPKYLVEIGEAC +VKVE+ NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+IL
Subjt:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL

Query:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS
        SILQRP+ARVSLSSGH+D+NI VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S       TLTE++EIGHS+QL ++
Subjt:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS

Query:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        +HVEAES SLR+TISRT+ +S  AAC LVNDEGK+CEEITYERE   CPSFDLGI
Subjt:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

XP_038874787.1 uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]3.5e-25984.97Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEVI SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISE
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD
        KSGVLRGKNFRNNSV F                    EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LGRQV LFD
Subjt:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD

Query:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK
        ENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHKPPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Subjt:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK

Query:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL
        +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGNRNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSIL
Subjt:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL

Query:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------STFTGNAGTLTEDVEIGHSSQ
        QRPRARV LSSGH DENISVSV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ        +TFTGNAGTLTEDVEIGHSSQ
Subjt:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------STFTGNAGTLTEDVEIGHSSQ

Query:  LFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        L RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKICEEITYERE+DACPSFDLGI
Subjt:  LFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

XP_038874788.1 uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]2.5e-25784.79Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEVI SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISE
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD
        KSGVLRGKNFRNNSV F                    EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LGRQV LFD
Subjt:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD

Query:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK
        ENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHKPPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Subjt:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK

Query:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL
        +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VK ELGNRNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSIL
Subjt:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL

Query:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------STFTGNAGTLTEDVEIGHSSQ
        QRPRARV LSSGH DENISVSV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ        +TFTGNAGTLTEDVEIGHSSQ
Subjt:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------STFTGNAGTLTEDVEIGHSSQ

Query:  LFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        L RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKICEEITYERE+DACPSFDLGI
Subjt:  LFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

XP_038874789.1 uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]3.7e-26186.21Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEVI SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISE
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD
        KSGVLRGKNFRNNSV F                    EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LGRQV LFD
Subjt:  KSGVLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD

Query:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK
        ENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHKPPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Subjt:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK

Query:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL
        +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGNRNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSIL
Subjt:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL

Query:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE
        QRPRARV LSSGH DENISVSV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ TFTGNAGTLTEDVEIGHSSQL RSDH E
Subjt:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE

Query:  AESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        AE ISLRN+ISRTR SSD+AAC LVNDEGKICEEITYERE+DACPSFDLGI
Subjt:  AESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

TrEMBL top hitse value%identityAlignment
A0A0A0KQR3 Uncharacterized protein4.0e-23780.62Show/hide
Query:  MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKIS
        MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECR+LKQDEVICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKIS
Subjt:  MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKIS

Query:  EKSGVLRGKNFRNNSVCF-------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD
        E SGV+RGK+  NNSVC                    EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LG QV LFD
Subjt:  EKSGVLRGKNFRNNSVCF-------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFD

Query:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK
        ENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHKP KIP+N+G+SS   GG  VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Subjt:  ENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK

Query:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL
        ISSSGS  MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+R  DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSIL
Subjt:  ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSIL

Query:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE
        QRPR RV+LSSGHTDENISVSV S   P+PSLAEALHL  D QSH+KPSE  +TRES KN EN+QSIALTQSTFTGNA TLTED E G SS+L RSDHVE
Subjt:  QRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVE

Query:  AESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI
        AESISLRN+I R    SDS AC LVN DEGKICEEITYERE+ A PSFDLGI
Subjt:  AESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI

A0A1S4E617 uncharacterized protein LOC103482830 isoform X45.2e-23780.51Show/hide
Query:  MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSG
        MNRWKVTYT HLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRIL++DEVICSGETLIFNS+LVDIDTPLGDHKPE GLNFQ GDDKISEKSG
Subjt:  MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSG

Query:  VLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFDENR
        VLRGK+ RNNSVCF                    EFKKRRLK YGSPQ+SPDTRKTEETEWQVLYTTNITQKAKK+HDGFLKLSICG LG QV LFDENR
Subjt:  VLRGKNFRNNSVCF--------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFDENR

Query:  KLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISS
        KLL+SRFIKK ETVK GES+AFDAHLVEIGECE DHKP KIP+N+G+SS   GG RVLHGQKSCFSENEIS GKEW+ LYTSQITQKSKKY NGIIKISS
Subjt:  KLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISS

Query:  SGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRD-----AHQILS
        SGS  MQV LLNEDR ILS KH SLSK V+ GE LELPKYLVEIGEACESVKVELGNRNFDIRKDASFC+SGG+E+GSGRET +KSLRD     AHQILS
Subjt:  SGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRD-----AHQILS

Query:  ILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDH
        ILQRPRARV+LSSGHTDENISVSV S   P+PS+AEALHL +DDQSH+KPSE  +TRES KNAEN+QSIALTQSTFTGNA TLTED EIG SS+L RSDH
Subjt:  ILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDH

Query:  VEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI
        VEAESISLRN+I R    SDSAA  LVN DEGKIC+EITYERE+ A PSFDLGI
Subjt:  VEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYERELDACPSFDLGI

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X22.1e-23878.38Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV+CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI E
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT
        KSGVLRGKNFRNNSVCF                       EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAKKYHDGFLKL ICG LGRQV 
Subjt:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT

Query:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG
        LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE +HKPPKIP++QG SS G+ GTRVL+  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Subjt:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG

Query:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL
        IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+  GEILELPKYLVEIGEAC +VKVE+ NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+IL
Subjt:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL

Query:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS
        SILQRP+ARVSLSSGH+D+NI VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S       TLTE++EIGHS+QL ++
Subjt:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS

Query:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        +HVEAES SLR+TISRT+ +S  AAC LVNDEGK+CEEITYERE   CPSFDLGI
Subjt:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X49.3e-23477.48Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV+CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISE
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT
        KSGVLRGKNFRNNSVCF                       EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL ICG LGRQV 
Subjt:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT

Query:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG
        LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE +HKPPKIP++QG SS G+ GTRVLH  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Subjt:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG

Query:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL
        IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+  GEILELPKYLVEIGEACE+VKVEL NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+IL
Subjt:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL

Query:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS
        SILQRP+ARVSLSSG +D+NISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T        ++EIGHS+Q   +
Subjt:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS

Query:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        ++VEAES SLR+TIS T+ +S  AAC LVNDEGK+CEEITYERE   CPSFDLGI
Subjt:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X24.4e-23677.66Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV+CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISE
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT
        KSGVLRGKNFRNNSVCF                       EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAKKYHDGFLKL ICG LGRQV 
Subjt:  KSGVLRGKNFRNNSVCF-----------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVT

Query:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG
        LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE +HKPPKIP++QG SS G+ GTRVLH  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Subjt:  LFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG

Query:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL
        IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+  GEILELPKYLVEIGEACE+VKVEL NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+IL
Subjt:  IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQIL

Query:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS
        SILQRP+ARVSLSSG +D+NISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T        ++EIGHS+QL ++
Subjt:  SILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRS

Query:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
        ++VEAES SLR+TIS T+ +S  AAC LVNDEGK+CEEITYERE   CPSFDLGI
Subjt:  DHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10890.1 unknown protein7.7e-2345.93Show/hide
Query:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE
        M E  RW   YTKHLKQKRKVYHDGFLD+H +  K+    MLYDE + LLE R LK  EV+ +GETL F +YLVDI  P    K  S    +  D K + 
Subjt:  MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISE

Query:  KSGVLRGKNFRNNSV-CFEFKKRRLKCYGSPQSSP
        K   +   NF+ +S+ C E K   +  + S   SP
Subjt:  KSGVLRGKNFRNNSV-CFEFKKRRLKCYGSPQSSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGA
CACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTG
TTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTC
CGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCT
TTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGA
AACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAG
CCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAA
AGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTAC
TGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCT
TGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAA
AAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGT
CATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAAT
GCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGA
CCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGG
AGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGA
CACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTG
TTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTC
CGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCT
TTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGA
AACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAG
CCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAA
AGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTAC
TGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCT
TGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAA
AAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGT
CATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAAT
GCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGA
CCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGG
AGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
Protein sequenceShow/hide protein sequence
MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNF
RNNSVCFEFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK
PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEA
CESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKN
AENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI