; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039808 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039808
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationscaffold10:41469134..41474704
RNA-Seq ExpressionSpg039808
SyntenySpg039808
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia]6.5e-18279.21Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAIKGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        L+ENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVDVSDKDS+EFI+ EL VNEHKK+EEV+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+TIELESDV LFN E N+ TKASG   EKA       LSET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K G I+EVEGPGLE+CTDTP SV  EQGQKS+E+KAPN SPSGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+SSKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]6.5e-18279.21Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAIKGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        L+ENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVDVSDKDS+EFI+ EL VNEHKK+EEV+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+TIELESDV LFN E N+ TKASG   EKA       LSET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K G I+EVEGPGLE+CTDTP SV  EQGQKS+E+KAPN SPSGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+SSKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

XP_022921506.1 uncharacterized protein LOC111429750 [Cucurbita moschata]2.5e-18179.21Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAIKGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        LEENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVD SDKDS+E I+TELLVNEHKK+EEV+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+ IELESDV LFN E N+ TKASG   EKA       LSET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K G I+EVEGPGLE+CTDTP SV  EQGQKS+E+KAPN SPSGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+SSKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

XP_023534676.1 uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo]2.5e-18178.98Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAIKGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        LEENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVD SDKDS+EFI+TEL VNEHKK+EEV+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+TIELESDV LFN E N+ TKASG   EKA       LSET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        ++G+ +K G I+EV+GPGLE+CTDTP SV  EQGQKS+E+KAPN SPSGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+SSKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]9.7e-19484.53Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGK-LSLEEHTTDH
        MHAIKGG TG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGK L  EEH  DH
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGK-LSLEEHTTDH

Query:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
         LEENPLHSIAIEPQSPLTLS+KEV FP+NYDQ INEEPIFVSDEQCT+TNI+GSQNGPIINGSLVD++DK+  EFI++ELLVNEHKKVEEVVKEE GMP
Subjt:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DVVVETFPLDS SW V  SDVRSE LISTSASEKQVS+TIELESDVGLFN+      KASGCVVEKAEENFA  LSE  SD+V+AAQIVET
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        SNGSTVK+G IYEV GP LEVC+DTP SV  EQGQKS+EMKAPN SPS  ENLNKTFSNG DQASKIKEET++ENKVDA QTGGSQKESIPTLNRINLES
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        WEGMSKNSSK ENNP+LE+ KAFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X11.5e-17674.95Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHTTDH
        MHAIKGG TGRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKL L EEH TDH
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHTTDH

Query:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHK-------------
         L++NPLHSIAIEPQSPLTLSSKEV FP+NY++ INEEPIFVSDEQCT+TNI+GSQN  IINGSLVDVS++DS+EFI++ELLVNEHK             
Subjt:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHK-------------

Query:  -----------------KVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASG
                         KVEEVVKEE GMPINHVTPLA DVVVETFPLD + W V  SDVRSE LIST+ASEKQVS++IELESDVGL N+       AS 
Subjt:  -----------------KVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASG

Query:  CVVEKAEENFASQLSETKSDLVKAAQIVETSNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEE
         VVEKA ENFA  LSETKSDLV+ AQIVE SNGSTVK+GS++EV GP LEVC+DTP SV  EQGQKS++MK    SP  +ENLNKTFSN  DQASKI   
Subjt:  CVVEKAEENFASQLSETKSDLVKAAQIVETSNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEE

Query:  TKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
         +IENKVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENNPLLE++K+FIAAF KFWSE
Subjt:  TKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein3.4e-17674.73Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHTTDH
        MHAIKGG TGRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKL L EEH TDH
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSL-EEHTTDH

Query:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHK-------------
         L++NPLHSIAIEPQSPLTLSSKEV FP+NY++ INEEPIFVSDEQCT+TNI+GSQN  IINGSLVDVS++DS+EFI++ELLVNEHK             
Subjt:  LLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHK-------------

Query:  -----------------KVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASG
                         KVEEVVKEE GMPINHVTPLA DVVVETFPLD + W V  SDVRSE LIST+ASEKQVS++IELESDVGL N+       AS 
Subjt:  -----------------KVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASG

Query:  CVVEKAEENFASQLSETKSDLVKAAQIVETSNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEE
         VVEKA ENFA  LSETKSDLV+ AQIVE SNGSTVK+GS++EV GP LEVC+DTP SV  EQGQKS++MK    SP  +ENLNKTFSN  DQASKI   
Subjt:  CVVEKAEENFASQLSETKSDLVKAAQIVETSNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEE

Query:  TKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
         +IENKVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENNPLLE++K+FIAAF KFWS+
Subjt:  TKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

A0A6J1C1R0 uncharacterized protein LOC111006625 isoform X18.0e-17878.7Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAI+GG TGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKK+QESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH  DH 
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMPI
        LEENPLHSIAIEPQS L LSS+E DF V Y+QCINEEPI VSDEQCTSTNI+ S NGPIINGSLVDVSDKDS++ I++ELLVNE K+VEEVVKEE GMPI
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMPI

Query:  NHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVETS
         HVTPLAADVVVETFPL  IS A  SS  RSET IST  SEKQVS+T+ELES VGLF  EG++ TK S  VVEKAEENF S LS  K D ++ A IVE+S
Subjt:  NHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVETS

Query:  NGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESW
        NGS +K+G ++EVEGP LEV TDTPT+ A EQ QK++E KAPN SPSGT+N NKTFSNGIDQASKIKEET+IENKVDA+Q  GSQK++IPTLNRINLESW
Subjt:  NGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESW

Query:  EGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        E MSKN S PE+NPLLE+LKAF++AF KFWSE
Subjt:  EGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297501.2e-18179.21Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHAIKGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        LEENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVD SDKDS+E I+TELLVNEHKK+EEV+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+ IELESDV LFN E N+ TKASG   EKA       LSET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K G I+EVEGPGLE+CTDTP SV  EQGQKS+E+KAPN SPSGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+SSKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

A0A6J1JPX5 uncharacterized protein LOC1114872478.6e-18078.29Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MHA+KGG  G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKK+QESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEH+TDHL
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP
        LEENPLHSIAIEPQSPLT  S+E DFP+N++ CINEEPI VSD EQ TS NI+GSQNG IINGSLVD SDKDS+EFI+TEL VNEHKK+E+V+KEE GMP
Subjt:  LEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSD-EQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGMP

Query:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET
        INHVTPLA DV V TFPLDS SWA   SDV SETLIST ASEK+VS+TIELESDV LFN E N+ TKASG   EKA        SET SDLV+ AQIVE 
Subjt:  INHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET

Query:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K G I+EVEGPGLE+CTDTP SV  EQGQKS+E+KAPN S SGT+NLN + +NGIDQASKIKEET+++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKKGSIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE
        W G SK+ SKPENNPLLE+L AFIAAF KFWSE
Subjt:  WEGMSKNSSKPENNPLLEMLKAFIAAFAKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding2.5e-4633.4Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MH++K  C G+  ALAK ++S G++TR R  KEERK +VE FIKKHQ+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG L LE + +  +
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSS--------KEVDF------------PVNYDQC-------INEEPIFVSDEQCTSTNIEGSQ--------NGPIINGSL-
         +++   SI ++P  PL+LS         + +DF             V  D C       + +E I +  +   ST+I  +Q        N    N  L 
Subjt:  LEENPLHSIAIEPQSPLTLSS--------KEVDF------------PVNYDQC-------INEEPIFVSDEQCTSTNIEGSQ--------NGPIINGSL-

Query:  ------------------VDVSDKDS--EEF----------IKTELLVNEHKKVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETL
                          +DV +KD   EE           +  +  VN+       +K  LG        ++A+ VVETFPL S++  + S D +   L
Subjt:  ------------------VDVSDKDS--EEF----------IKTELLVNEHKKVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETL

Query:  ISTSASEKQVSRTIELE---------------------SDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET--SNGSTVKKGSIY
               K     +E +                      D+G   + G  P   S  + +K  E   +  S      V+ A   ET   NG         
Subjt:  ISTSASEKQVSRTIELE---------------------SDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET--SNGSTVKKGSIY

Query:  EVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPE
        E     L      PTS + E G + N+    +T  S   N         + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E
Subjt:  EVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPE

Query:  NNPLLEMLKAFIAAFAKFWSE
         NPLL +LK+F+ AF KFWSE
Subjt:  NNPLLEMLKAFIAAFAKFWSE

AT3G52170.2 DNA binding2.5e-4633.4Show/hide
Query:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL
        MH++K  C G+  ALAK ++S G++TR R  KEERK +VE FIKKHQ+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG L LE + +  +
Subjt:  MHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHTTDHL

Query:  LEENPLHSIAIEPQSPLTLSS--------KEVDF------------PVNYDQC-------INEEPIFVSDEQCTSTNIEGSQ--------NGPIINGSL-
         +++   SI ++P  PL+LS         + +DF             V  D C       + +E I +  +   ST+I  +Q        N    N  L 
Subjt:  LEENPLHSIAIEPQSPLTLSS--------KEVDF------------PVNYDQC-------INEEPIFVSDEQCTSTNIEGSQ--------NGPIINGSL-

Query:  ------------------VDVSDKDS--EEF----------IKTELLVNEHKKVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETL
                          +DV +KD   EE           +  +  VN+       +K  LG        ++A+ VVETFPL S++  + S D +   L
Subjt:  ------------------VDVSDKDS--EEF----------IKTELLVNEHKKVEEVVKEELGMPINHVTPLAADVVVETFPLDSISWAVKSSDVRSETL

Query:  ISTSASEKQVSRTIELE---------------------SDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET--SNGSTVKKGSIY
               K     +E +                      D+G   + G  P   S  + +K  E   +  S      V+ A   ET   NG         
Subjt:  ISTSASEKQVSRTIELE---------------------SDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVET--SNGSTVKKGSIY

Query:  EVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPE
        E     L      PTS + E G + N+    +T  S   N         + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E
Subjt:  EVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPE

Query:  NNPLLEMLKAFIAAFAKFWSE
         NPLL +LK+F+ AF KFWSE
Subjt:  NNPLLEMLKAFIAAFAKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein9.2e-0954.72Show/hide
Query:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein9.2e-0954.72Show/hide
Query:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein9.2e-0954.72Show/hide
Query:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGCTACAGTTTCATGCAGTGTTGGATTTTCGCCTCTGGAAGGGTTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGTACAGGGCGTCCTCTTGCCCT
AGCCAAGCACAATGAGTCTGAAGGGAGGAAGACCAGAATTCGGCGGTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGCATCAGGAATCAAATAATG
GGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGGGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGA
AAGTTGTCACTAGAAGAGCACACCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACATTATCATCAAAGGAAGTTGA
TTTTCCAGTCAACTATGACCAATGTATAAATGAAGAACCCATCTTCGTTTCAGACGAACAATGCACTTCAACAAATATTGAGGGATCACAGAATGGGCCAATAATTAATG
GCAGCCTGGTCGATGTGAGTGACAAGGATTCTGAAGAATTTATCAAGACAGAGTTGCTAGTAAATGAACATAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATTGGGAATG
CCAATTAATCATGTAACTCCTTTGGCAGCAGATGTCGTGGTAGAGACATTCCCATTGGATTCAATTTCTTGGGCTGTTAAAAGTTCAGATGTAAGATCTGAGACATTGAT
TTCAACTAGTGCCTCGGAAAAGCAAGTTAGTCGAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACGTTGAAGGTAATGATCCCACAAAAGCTTCTGGTTGTGTAG
TCGAGAAAGCAGAGGAAAACTTTGCAAGTCAATTATCAGAAACAAAGTCTGATTTGGTGAAGGCAGCTCAAATTGTTGAAACCTCTAATGGATCTACTGTGAAAAAAGGT
AGCATATATGAAGTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATACTCCAACATCTGTGGCCATTGAACAAGGCCAGAAATCTAATGAAATGAAGGCTCCAAATACTTC
TCCTAGTGGTACCGAGAATCTCAACAAGACATTCAGCAATGGCATCGATCAGGCCTCAAAAATCAAAGAGGAGACAAAGATTGAAAATAAAGTAGATGCTGAACAGACTG
GTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGAATTAATCTCGAATCCTGGGAAGGGATGTCTAAAAACTCTTCGAAACCCGAAAACAACCCCCTTTTGGAAATG
CTCAAGGCATTCATTGCTGCCTTCGCGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGCTACAGTTTCATGCAGTGTTGGATTTTCGCCTCTGGAAGGGTTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGTACAGGGCGTCCTCTTGCCCT
AGCCAAGCACAATGAGTCTGAAGGGAGGAAGACCAGAATTCGGCGGTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGCATCAGGAATCAAATAATG
GGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGGGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGA
AAGTTGTCACTAGAAGAGCACACCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACATTATCATCAAAGGAAGTTGA
TTTTCCAGTCAACTATGACCAATGTATAAATGAAGAACCCATCTTCGTTTCAGACGAACAATGCACTTCAACAAATATTGAGGGATCACAGAATGGGCCAATAATTAATG
GCAGCCTGGTCGATGTGAGTGACAAGGATTCTGAAGAATTTATCAAGACAGAGTTGCTAGTAAATGAACATAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATTGGGAATG
CCAATTAATCATGTAACTCCTTTGGCAGCAGATGTCGTGGTAGAGACATTCCCATTGGATTCAATTTCTTGGGCTGTTAAAAGTTCAGATGTAAGATCTGAGACATTGAT
TTCAACTAGTGCCTCGGAAAAGCAAGTTAGTCGAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACGTTGAAGGTAATGATCCCACAAAAGCTTCTGGTTGTGTAG
TCGAGAAAGCAGAGGAAAACTTTGCAAGTCAATTATCAGAAACAAAGTCTGATTTGGTGAAGGCAGCTCAAATTGTTGAAACCTCTAATGGATCTACTGTGAAAAAAGGT
AGCATATATGAAGTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATACTCCAACATCTGTGGCCATTGAACAAGGCCAGAAATCTAATGAAATGAAGGCTCCAAATACTTC
TCCTAGTGGTACCGAGAATCTCAACAAGACATTCAGCAATGGCATCGATCAGGCCTCAAAAATCAAAGAGGAGACAAAGATTGAAAATAAAGTAGATGCTGAACAGACTG
GTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGAATTAATCTCGAATCCTGGGAAGGGATGTCTAAAAACTCTTCGAAACCCGAAAACAACCCCCTTTTGGAAATG
CTCAAGGCATTCATTGCTGCCTTCGCGAAGTTTTGGTCCGAGTAA
Protein sequenceShow/hide protein sequence
MGATVSCSVGFSPLEGLNFVDFMHAIKGGCTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKHQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPG
KLSLEEHTTDHLLEENPLHSIAIEPQSPLTLSSKEVDFPVNYDQCINEEPIFVSDEQCTSTNIEGSQNGPIINGSLVDVSDKDSEEFIKTELLVNEHKKVEEVVKEELGM
PINHVTPLAADVVVETFPLDSISWAVKSSDVRSETLISTSASEKQVSRTIELESDVGLFNVEGNDPTKASGCVVEKAEENFASQLSETKSDLVKAAQIVETSNGSTVKKG
SIYEVEGPGLEVCTDTPTSVAIEQGQKSNEMKAPNTSPSGTENLNKTFSNGIDQASKIKEETKIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEM
LKAFIAAFAKFWSE