; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G026590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G026590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationCmo_Chr04:19343768..19349998
RNA-Seq ExpressionCmoCh04G026590
SyntenyCmoCh04G026590
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia]1.6e-23097.89Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        L+ENPLHSIAIEPQSPLTS SEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVD SDKDSDE IQ EL VNEHKKIEEVLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQ IELESDV LFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGT+MK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
        SSKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]7.9e-23097.65Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        L+ENPLHSIAIEPQSPLTS SEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVD SDKDSDE IQ EL VNEHKKIEEVLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQ IELESDV LFNSEDNNSTKASGRADEKALSET SDLVEVAQIVEVTNGT+MK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
        SSKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

XP_022921506.1 uncharacterized protein LOC111429750 [Cucurbita moschata]2.5e-236100Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
        SSKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

XP_022990369.1 uncharacterized protein LOC111487247 [Cucurbita maxima]2.7e-23097.65Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHA+KGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDE IQTEL VNEHKKIE+VLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQ IELESDV LFNSEDNNSTKASGRADEKA SETMSDLVEVAQIVEVTNGT+MK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNAS SGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
         SKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

XP_023534676.1 uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo]5.5e-23197.89Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESN+GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDE IQTEL VNEHKKIEEVLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQ IELESDV LFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVE T+GT+MK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEV+GPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
        SSKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X11.5e-15768.78Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHSTDH
        MHAIKGGW G PLALAK NE+EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKL L EEH+TDH
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHK------------
         L++NPLHSIAIEPQSPLT  S+E  FP+N+N  INEEPI VSD EQ T+ NIQGSQN  IINGSLVD S++DSDE IQ+ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHK------------

Query:  ------------------KIEEVLKEESGMPINHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNS-TKA
                          K+EEV+KEESGMPINHVTPLA DV V TFPLD   W  NGSDV SE LIST+ASEK+VSQ IELESDV L N   ++S  + 
Subjt:  ------------------KIEEVLKEESGMPINHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNS-TKA

Query:  SGRADEKALSETMSDLVEVAQIVEVTNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKN
        +G      LSET SDLVEVAQIVE++NG+ +K+G +HEV GP LE+C+DTPISV FEQGQKSS++K+P AS    +NLN + +N  DQASKI    E++N
Subjt:  SGRADEKALSETMSDLVEVAQIVEVTNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKN

Query:  KVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE
        KV+  QTGGSQKES+PTLNR+NL+SW G SK+SSKPENNPLLEI+ +FIAAFVKFWSE
Subjt:  KVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein3.3e-15768.56Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHSTDH
        MHAIKGGW G PLALAK NE+EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKL L EEH+TDH
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHK------------
         L++NPLHSIAIEPQSPLT  S+E  FP+N+N  INEEPI VSD EQ T+ NIQGSQN  IINGSLVD S++DSDE IQ+ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHK------------

Query:  ------------------KIEEVLKEESGMPINHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNS-TKA
                          K+EEV+KEESGMPINHVTPLA DV V TFPLD   W  NGSDV SE LIST+ASEK+VSQ IELESDV L N   ++S  + 
Subjt:  ------------------KIEEVLKEESGMPINHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNS-TKA

Query:  SGRADEKALSETMSDLVEVAQIVEVTNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKN
        +G      LSET SDLVEVAQIVE++NG+ +K+G +HEV GP LE+C+DTPISV FEQGQKSS++K+P AS    +NLN + +N  DQASKI    E++N
Subjt:  SGRADEKALSETMSDLVEVAQIVEVTNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKN

Query:  KVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE
        KV+  QTGGSQKES+PTLNR+NL+SW G SK+SSKPENNPLLEI+ +FIAAFVKFWS+
Subjt:  KVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE

A0A6J1C1R0 uncharacterized protein LOC111006625 isoform X13.0e-15871.36Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVD SDKDSD++I++ELLVNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKA-------LSETMSDLVEVAQIVEV
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQ +ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKA-------LSETMSDLVEVAQIVEV

Query:  TNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKS
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  TNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKS

Query:  WHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297501.2e-236100Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
        SSKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC1114872471.3e-23097.65Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MHA+KGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP
        LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDE IQTEL VNEHKKIE+VLKEESGMP
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMP

Query:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK
        INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQ IELESDV LFNSEDNNSTKASGRADEKA SETMSDLVEVAQIVEVTNGT+MK
Subjt:  INHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMK

Query:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
        DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNAS SGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD
Subjt:  DGRIHEVEGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKD

Query:  SSKPENNPLLEILNAFIAAFVKFWSE
         SKPENNPLLEILNAFIAAFVKFWSE
Subjt:  SSKPENNPLLEILNAFIAAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding5.0e-4132.36Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ N GSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG L LE + +  +
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQ----NGVIINGSLVDASD-------KDSDEVIQTELLVNEHKKI
         +++   SI ++P  PL+     F       H  + + +  S E      N+ GSQ    N   ++GS +   D        DS ++  T+L  +  +  
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQ----NGVIINGSLVDASD-------KDSDEVIQTELLVNEHKKI

Query:  EEVLKEESGMPINHVTPL-AVDVKV-----------------------GTFPL---DSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSED
        +  +K  +G+     T   +VD K                        GT P+   D  + A      I   L + D S + V +   L+S     +S D
Subjt:  EEVLKEESGMPINHVTPL-AVDVKV-----------------------GTFPL---DSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSED

Query:  NNSTKAS-----GRADEKALSETMSDL--VEVAQIVEVTNGTVMKD-------GRI-HEVEGP-----GLEI---------CTDTPISVT----------
           T+ +     G+  E  +    S +  V++ +I   T+  V++D       G+I + +  P     G EI         C D   +V           
Subjt:  NNSTKAS-----GRADEKALSETMSDL--VEVAQIVEVTNGTVMKD-------GRI-HEVEGP-----GLEI---------CTDTPISVT----------

Query:  ---FEQGQKSSEIKAPNASPSGTKNLND-------SRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEIL
           F  G  ++E K P +S       ND       S   G + AS  K+ T  K K++A  +  SQKE+  TLNR+  +SW G S +  + E NPLL +L
Subjt:  ---FEQGQKSSEIKAPNASPSGTKNLND-------SRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEIL

Query:  NAFIAAFVKFWSE
         +F+ AFVKFWSE
Subjt:  NAFIAAFVKFWSE

AT3G52170.2 DNA binding5.0e-4132.36Show/hide
Query:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ N GSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG L LE + +  +
Subjt:  MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQ----NGVIINGSLVDASD-------KDSDEVIQTELLVNEHKKI
         +++   SI ++P  PL+     F       H  + + +  S E      N+ GSQ    N   ++GS +   D        DS ++  T+L  +  +  
Subjt:  LEENPLHSIAIEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQ----NGVIINGSLVDASD-------KDSDEVIQTELLVNEHKKI

Query:  EEVLKEESGMPINHVTPL-AVDVKV-----------------------GTFPL---DSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSED
        +  +K  +G+     T   +VD K                        GT P+   D  + A      I   L + D S + V +   L+S     +S D
Subjt:  EEVLKEESGMPINHVTPL-AVDVKV-----------------------GTFPL---DSFSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSED

Query:  NNSTKAS-----GRADEKALSETMSDL--VEVAQIVEVTNGTVMKD-------GRI-HEVEGP-----GLEI---------CTDTPISVT----------
           T+ +     G+  E  +    S +  V++ +I   T+  V++D       G+I + +  P     G EI         C D   +V           
Subjt:  NNSTKAS-----GRADEKALSETMSDL--VEVAQIVEVTNGTVMKD-------GRI-HEVEGP-----GLEI---------CTDTPISVT----------

Query:  ---FEQGQKSSEIKAPNASPSGTKNLND-------SRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEIL
           F  G  ++E K P +S       ND       S   G + AS  K+ T  K K++A  +  SQKE+  TLNR+  +SW G S +  + E NPLL +L
Subjt:  ---FEQGQKSSEIKAPNASPSGTKNLND-------SRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEIL

Query:  NAFIAAFVKFWSE
         +F+ AFVKFWSE
Subjt:  NAFIAAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.3e-0956.6Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.3e-0956.6Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.3e-0956.6Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCCATAAAGGGTGGGTGGCCAGGGTGTCCTCTTGCCCTAGCCAAGTACAATGAGTCTGAAGGGAGGAAGACCAGAATTCGGCGTTCGAAGGAGGAAAGGAAGGC
AATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAAGGGAAGTTTCCCTTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTGCGGGAGA
TCGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCAGGAAAGTTGTCACTAGAAGAGCACAGCACGGATCATTTACTTGAAGAGAATCCACTGCACTCAATTGCT
ATTGAACCTCAATCTCCTTTAACGTCGCCCTCAGAGGAATTTGATTTTCCAATCAACCACAACCATTGTATAAATGAAGAACCAATCATTGTTTCAGATGAGGAGCAATA
CACTTCAAAAAATATTCAGGGATCACAGAATGGGGTCATAATTAACGGCAGCCTGGTGGATGCCAGTGACAAGGATTCTGATGAAGTTATCCAGACAGAGTTGCTAGTAA
ATGAACACAAGAAAATAGAGGAAGTGTTGAAAGAGGAATCAGGAATGCCTATTAATCATGTAACTCCTTTGGCTGTAGATGTCAAGGTAGGGACATTCCCGTTGGATTCA
TTTTCTTGGGCTGCTAATGGTTCAGATGTAATATCTGAGACTTTGATTTCAACCGATGCCTCGGAAAAGAAAGTTAGTCAACCCATCGAGTTAGAATCAGATGTTATCTT
GTTTAACAGTGAAGATAATAATTCCACGAAAGCTTCTGGTCGTGCAGATGAAAAGGCATTATCAGAAACAATGTCTGATTTGGTGGAGGTAGCACAAATTGTTGAAGTCA
CTAATGGAACTGTGATGAAAGATGGTCGCATACATGAAGTTGAGGGTCCTGGGTTGGAAATTTGCACTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCT
AGTGAAATAAAGGCTCCGAATGCTTCTCCGAGCGGTACCAAGAATCTCAACGATTCACGCAACAATGGCATCGATCAGGCTTCAAAAATCAAAGAGGAGACAGAGGTTAA
AAATAAAGTAGAGGCAGAACAGACTGGTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGACTTAATCTCAAATCATGGCATGGGACGTCAAAAGATTCTTCAAAAC
CCGAAAACAACCCGCTTTTGGAAATCCTCAACGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
GGAAGCTTGCAGAAGGGCACCCTGGCTTGCTCCCTGTAACTCGTCTGTAGGGTTTTAGACCTCTTCCATCCGCCATTCTTCTTCTACGCGCGCTTTACCAGACTCGAGGT
TTGTACTCTAAGAAGGGTTGAACTTTTTGGATTTCATGCATGCCATAAAGGGTGGGTGGCCAGGGTGTCCTCTTGCCCTAGCCAAGTACAATGAGTCTGAAGGGAGGAAG
ACCAGAATTCGGCGTTCGAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAAGGGAAGTTTCCCTTCACTCAACCTTACTCACAA
GGAAGTTGGTGGATCTTTCTATACAGTGCGGGAGATCGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCAGGAAAGTTGTCACTAGAAGAGCACAGCACGGATC
ATTTACTTGAAGAGAATCCACTGCACTCAATTGCTATTGAACCTCAATCTCCTTTAACGTCGCCCTCAGAGGAATTTGATTTTCCAATCAACCACAACCATTGTATAAAT
GAAGAACCAATCATTGTTTCAGATGAGGAGCAATACACTTCAAAAAATATTCAGGGATCACAGAATGGGGTCATAATTAACGGCAGCCTGGTGGATGCCAGTGACAAGGA
TTCTGATGAAGTTATCCAGACAGAGTTGCTAGTAAATGAACACAAGAAAATAGAGGAAGTGTTGAAAGAGGAATCAGGAATGCCTATTAATCATGTAACTCCTTTGGCTG
TAGATGTCAAGGTAGGGACATTCCCGTTGGATTCATTTTCTTGGGCTGCTAATGGTTCAGATGTAATATCTGAGACTTTGATTTCAACCGATGCCTCGGAAAAGAAAGTT
AGTCAACCCATCGAGTTAGAATCAGATGTTATCTTGTTTAACAGTGAAGATAATAATTCCACGAAAGCTTCTGGTCGTGCAGATGAAAAGGCATTATCAGAAACAATGTC
TGATTTGGTGGAGGTAGCACAAATTGTTGAAGTCACTAATGGAACTGTGATGAAAGATGGTCGCATACATGAAGTTGAGGGTCCTGGGTTGGAAATTTGCACTGATACTC
CAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATAAAGGCTCCGAATGCTTCTCCGAGCGGTACCAAGAATCTCAACGATTCACGCAACAATGGCATCGAT
CAGGCTTCAAAAATCAAAGAGGAGACAGAGGTTAAAAATAAAGTAGAGGCAGAACAGACTGGTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGACTTAATCTCAA
ATCATGGCATGGGACGTCAAAAGATTCTTCAAAACCCGAAAACAACCCGCTTTTGGAAATCCTCAACGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGAGTAAGTTC
TATGACTGTCAATCGAATAGATAGAGAGTAGTAGTTAATTTTTCCTGCCACAGAACCTGTCTGTCTGTGTACCAAACCTGCAATCGGTTACCCCGTCGCCTCCACTCGGT
CCCATTGTCGATATTTATGAAGAGAAAACTGGAGGTGGGTTGGCATTTCTGTACCCTGCAGTGTGAAAGAAAGCTTAGGAGTAGCCTAAAAGCCTGTAGAAGGGATTTTC
TTCTCTTTTTTTACTGTAAGCATGAGATGATGAGATACATGTCGTCCCCACCGCCATGGTTTTCTTCTAGAAATGAAGATTTTGCTCTAATTAATGCATTAGCATATGAT
GAAAAAAGGGTGAAAAGAGAAGGAAAAGTTGAAGTAACGTGAGGGAGGAGCCTTTGTGTGTTAGGAAAACATGGAAACAAGAGAGAGAGGCATATACAGATTGGAGTTTT
AATAGGCACATCTTCTTTGCCCTTTTTCTCTCTGACCCGACATCTTATATCATATGGATCATGTAGTCACATGATTTTCTGCTTCCCAATTTTCTAATAGTACACTCTCT
TGCAGCCTTTTTGTT
Protein sequenceShow/hide protein sequence
MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHLLEENPLHSIA
IEPQSPLTSPSEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEVIQTELLVNEHKKIEEVLKEESGMPINHVTPLAVDVKVGTFPLDS
FSWAANGSDVISETLISTDASEKKVSQPIELESDVILFNSEDNNSTKASGRADEKALSETMSDLVEVAQIVEVTNGTVMKDGRIHEVEGPGLEICTDTPISVTFEQGQKS
SEIKAPNASPSGTKNLNDSRNNGIDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEILNAFIAAFVKFWSE