CuGenDBv2

Sed0017585 (gene) of Chayote v1 genome

Gene ID: Sed0017585
Organism: Sechium edule (Chayote v1)
Description: Plastid envelope DNA binding protein
Genome location: LG01:24302921..24311113
RNA-Seq Expression: Sed0017585
Synteny: Sed0017585
Gene Ontology terms: NA
InterPro domains: NA
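
The genome location field can be split into a sequence ID and a coordinate pair. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return abs(end - start) + 1

seqid, start, end = parse_location("LG01:24302921..24311113")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 8193 bp on LG01, which is longer than the mRNA listed further down and would be consistent with the presence of introns.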


Homology
GenBank top hits (e value, % identity, alignment)
KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]; e value 1.2e-166; % identity 74.41
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHAIKGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        L+ENPLHSIAIEPQSPLT  SEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VDVSDKDS+EFI+ EL  NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        +LSET SDLVEVAQIVE +NG+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-
        +  +HEVEGPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-

Query:  KRKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  KRKPENNPLFEILKSFVAAFVKFWSE

XP_022921506.1 uncharacterized protein LOC111429750 [Cucurbita moschata]; e value 1.1e-167; % identity 74.88
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHAIKGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        LEENPLHSIAIEPQSPLT PSEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VD SDKDS+E I+TELL NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        +LSET SDLVEVAQIVE +NG+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-
        +  +HEVEGPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-

Query:  KRKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  KRKPENNPLFEILKSFVAAFVKFWSE

XP_022990369.1 uncharacterized protein LOC111487247 [Cucurbita maxima]; e value 2.4e-167; % identity 74.41
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHA+KGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        LEENPLHSIAIEPQSPLT PSEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VD SDKDS+EFI+TEL  NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        + SET SDLVEVAQIVE +NG+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKK
        +  +HEVEGPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKK

Query:  -RKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  -RKPENNPLFEILKSFVAAFVKFWSE

XP_023534676.1 uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo]; e value 4.2e-167; % identity 74.41
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHAIKGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        LEENPLHSIAIEPQSPLT PSEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VD SDKDS+EFI+TEL  NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        +LSET SDLVEVAQIVE ++G+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-
        +  +HEV+GPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-

Query:  KRKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  KRKPENNPLFEILKSFVAAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]; e value 1.4e-167; % identity 76.11
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNS-DH
        MHAIKGGWTG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKLL EE +  DH
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNS-DH

Query:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
         LEENPLHSIAIEPQSPLTL ++EV FP+NY+Q INEEP+ VSDEQCT+T IQ SQNG +INGS VD++DK+  EFI++ELL NE KKV     EESGMP
Subjt:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KTEENFVSSLSETKSDLVEVAQIVETSNGSTV
        +NHVTP A DVVVETFPLDS SW VNGSDVRSE LIST+ASE QV+Q I+LESDVGLFNI+ +  +  K EENF   LSE  SD+VE AQIVETSNGSTV
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KTEENFVSSLSETKSDLVEVAQIVETSNGSTV

Query:  KEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK
        KE  ++EV GP LEVC+D PISVTFEQGQ+S+EMK   AS + IENLN   SNG DQASK KEETE+ENKVDA Q GGSQKESIPTL+RINLESWEG SK
Subjt:  KEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK

Query:  -KRKPENNPLFEILKSFVAAFVKFWSE
           K ENNP+ EI K+F+AAFVKFWSE
Subjt:  -KRKPENNPLFEILKSFVAAFVKFWSE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C344 uncharacterized protein LOC103496473 isoform X2; e value 1.5e-157; % identity 68.49
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH
        MHAIKGGWTG PLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNL HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EE N+DH
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH

Query:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------
         L++NPLHSIAIEPQSPLTL S+EV FP+NYN+YINEEP+ VSDEQCT+T IQ SQN ++INGS VDVS++DS+EFI++ELL NE K+V           
Subjt:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------

Query:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT
                                EESGMP+NHVTP A DVVVETFPLD   W VNGSDVRSE LISTNASE QV+Q+I+LESDVGL NI  ++S+  K 
Subjt:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT

Query:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK
         ENF   LSETKSDLVEVAQIVE SNGSTVKE SMHEV GP LEVC+D PISV FEQGQ+S++MK  K                           EIENK
Subjt:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK

Query:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE
        VD  Q GGSQKES+PTL+RINLESWEG SK   KPENNPL EI+KSF+AAFVKFWSE
Subjt:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X1; e value 2.3e-163; % identity 71.12
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH
        MHAIKGGWTG PLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNL HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EE N+DH
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH

Query:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------
         L++NPLHSIAIEPQSPLTL S+EV FP+NYN+YINEEP+ VSDEQCT+T IQ SQN ++INGS VDVS++DS+EFI++ELL NE K+V           
Subjt:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------

Query:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT
                                EESGMP+NHVTP A DVVVETFPLD   W VNGSDVRSE LISTNASE QV+Q+I+LESDVGL NI  ++S+  K 
Subjt:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT

Query:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK
         ENF   LSETKSDLVEVAQIVE SNGSTVKE SMHEV GP LEVC+D PISV FEQGQ+S++MK+  AS    ENLN   SN  DQASK     EIENK
Subjt:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK

Query:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE
        VD  Q GGSQKES+PTL+RINLESWEG SK   KPENNPL EI+KSF+AAFVKFWSE
Subjt:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein; e value 5.2e-163; % identity 70.9
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH
        MHAIKGGWTG PLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNL HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EE N+DH
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EELNSDH

Query:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------
         L++NPLHSIAIEPQSPLTL S+EV FP+NYN+YINEEP+ VSDEQCT+T IQ SQN ++INGS VDVS++DS+EFI++ELL NE K+V           
Subjt:  LLEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----------

Query:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT
                                EESGMP+NHVTP A DVVVETFPLD   W VNGSDVRSE LISTNASE QV+Q+I+LESDVGL NI  ++S+  K 
Subjt:  ------------------------EESGMPMNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSI--KT

Query:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK
         ENF   LSETKSDLVEVAQIVE SNGSTVKE SMHEV GP LEVC+D PISV FEQGQ+S++MK+  AS    ENLN   SN  DQASK     EIENK
Subjt:  EENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENK

Query:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE
        VD  Q GGSQKES+PTL+RINLESWEG SK   KPENNPL EI+KSF+AAFVKFWS+
Subjt:  VDAQQIGGSQKESIPTLDRINLESWEGSSK-KRKPENNPLFEILKSFVAAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC111429750; e value 5.3e-168; % identity 74.88
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHAIKGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        LEENPLHSIAIEPQSPLT PSEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VD SDKDS+E I+TELL NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        +LSET SDLVEVAQIVE +NG+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-
        +  +HEVEGPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSK-

Query:  KRKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  KRKPENNPLFEILKSFVAAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC111487247; e value 1.2e-167; % identity 74.41
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MHA+KGGW GCPLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNL HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEE ++DHL
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP
        LEENPLHSIAIEPQSPLT PSEE DFP+N+N  INEEP++VSD EQ TS  IQ SQNG +INGS VD SDKDS+EFI+TEL  NE KK+     EESGMP
Subjt:  LEENPLHSIAIEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSD-EQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKV-----EESGMP

Query:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK
        +NHVTP A DV V TFPLDSFSW  NGSDV SETLIST+ASE +V+Q I+LESDV LFN E NNS K        + SET SDLVEVAQIVE +NG+ +K
Subjt:  MNHVTPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENF-VSSLSETKSDLVEVAQIVETSNGSTVK

Query:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKK
        +  +HEVEGPGLE+CTD PISVTFEQGQ+S+E+K   AS +G +NLN+  +NGIDQASK KEETE++NKV+A+Q GGSQKESIPTL+R+NL+SW G+SK 
Subjt:  EDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQKASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKK

Query:  -RKPENNPLFEILKSFVAAFVKFWSE
          KPENNPL EIL +F+AAFVKFWSE
Subjt:  -RKPENNPLFEILKSFVAAFVKFWSE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G52170.1 DNA binding; e value 1.9e-45; % identity 33.66
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+L HKEVGGSFYT+REIVR+IIQENRVLGPG LLLE   +  +
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTL---------------PSEEVDFPVNYNQ------------YINEEPVLVSDEQCTSTIIQESQ-------------NGAVI
         +++   SI ++P  PL+L                SE  +  VN +Q             + +E + +  +   ST I  +Q             N  + 
Subjt:  LEENPLHSIAIEPQSPLTL---------------PSEEVDFPVNYNQ------------YINEEPVLVSDEQCTSTIIQESQ-------------NGAVI

Query:  N--------------GSRVDVSDKDSN----EFIKTE--LLANECKKVEESGMPMNHV------TPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNA
        N                R+DV +KD       F++++     N   +V ++G  M  +         +A+ VVETFPL S + T++  D +   L     
Subjt:  N--------------GSRVDVSDKDSN----EFIKTE--LLANECKKVEESGMPMNHV------TPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNA

Query:  SEMQVNQAIKLESDVGLFNIEGNNSIKTEENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEV-CTDAPISVT-------------FEQG
         E       ++E+D    N      I +  +  S++ E     V V QI    +    K+     V    ++V C DA  +V              F  G
Subjt:  SEMQVNQAIKLESDVGLFNIEGNNSIKTEENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEV-CTDAPISVT-------------FEQG

Query:  QESNEMK-----TQKASR-NGIENLNNISS-NGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKKRKPENNPLFEILKSFVAAFV
          + E K     T+  SR N    ++ +SS  G + AS  K+ T  + K+DA     SQKE+  TL+RI  ESW+G S   + E NPL  +LKSFV AFV
Subjt:  QESNEMK-----TQKASR-NGIENLNNISS-NGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKKRKPENNPLFEILKSFVAAFV

Query:  KFWSE
        KFWSE
Subjt:  KFWSE

AT3G52170.2 DNA binding; e value 1.9e-45; % identity 33.66
Query:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+L HKEVGGSFYT+REIVR+IIQENRVLGPG LLLE   +  +
Subjt:  MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHL

Query:  LEENPLHSIAIEPQSPLTL---------------PSEEVDFPVNYNQ------------YINEEPVLVSDEQCTSTIIQESQ-------------NGAVI
         +++   SI ++P  PL+L                SE  +  VN +Q             + +E + +  +   ST I  +Q             N  + 
Subjt:  LEENPLHSIAIEPQSPLTL---------------PSEEVDFPVNYNQ------------YINEEPVLVSDEQCTSTIIQESQ-------------NGAVI

Query:  N--------------GSRVDVSDKDSN----EFIKTE--LLANECKKVEESGMPMNHV------TPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNA
        N                R+DV +KD       F++++     N   +V ++G  M  +         +A+ VVETFPL S + T++  D +   L     
Subjt:  N--------------GSRVDVSDKDSN----EFIKTE--LLANECKKVEESGMPMNHV------TPFAADVVVETFPLDSFSWTVNGSDVRSETLISTNA

Query:  SEMQVNQAIKLESDVGLFNIEGNNSIKTEENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEV-CTDAPISVT-------------FEQG
         E       ++E+D    N      I +  +  S++ E     V V QI    +    K+     V    ++V C DA  +V              F  G
Subjt:  SEMQVNQAIKLESDVGLFNIEGNNSIKTEENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEV-CTDAPISVT-------------FEQG

Query:  QESNEMK-----TQKASR-NGIENLNNISS-NGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKKRKPENNPLFEILKSFVAAFV
          + E K     T+  SR N    ++ +SS  G + AS  K+ T  + K+DA     SQKE+  TL+RI  ESW+G S   + E NPL  +LKSFV AFV
Subjt:  QESNEMK-----TQKASR-NGIENLNNISS-NGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKKRKPENNPLFEILKSFVAAFV

Query:  KFWSE
        KFWSE
Subjt:  KFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein; e value 6.5e-09; % identity 29.92
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+  HK+VGGS+Y VR+I +++  + +   P   K L E  +S   D     +P     +E ++   L    
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE

Query:  VDFPVNYNQYINEEPVLVSDEQCTSTI
           P + + +++  PV + + +  S +
Subjt:  VDFPVNYNQYINEEPVLVSDEQCTSTI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein; e value 6.5e-09; % identity 29.92
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+  HK+VGGS+Y VR+I +++  + +   P   K L E  +S   D     +P     +E ++   L    
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE

Query:  VDFPVNYNQYINEEPVLVSDEQCTSTI
           P + + +++  PV + + +  S +
Subjt:  VDFPVNYNQYINEEPVLVSDEQCTSTI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein; e value 6.5e-09; % identity 29.92
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+  HK+VGGS+Y VR+I +++  + +   P   K L E  +S   D     +P     +E ++   L    
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLLEELNS---DHLLEENPLHSIAIEPQSPLTLPSEE

Query:  VDFPVNYNQYINEEPVLVSDEQCTSTI
           P + + +++  PV + + +  S +
Subjt:  VDFPVNYNQYINEEPVLVSDEQCTSTI
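
In the alignments above, the unlabelled middle line shows a letter where the aligned residues are identical, `+` for a conservative substitution, and a blank for a mismatch or gap. The sketch below shows one way to turn such a midline into a percent identity. The function name `percent_identity` is hypothetical, and it assumes identity is defined as identical columns divided by alignment length. Figures computed on a single displayed block need not match the % identity reported for the whole hit.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a residue letter."""
    if not query_line:
        raise ValueError("empty alignment block")
    # Trailing blanks are often lost in copied text, so pad to the query length.
    midline = midline.ljust(len(query_line))[:len(query_line)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_line)

# First ten columns of the first GenBank alignment on this page.
query = "MHAIKGGWTG"
middle = "MHAIKGGW G"
print(round(percent_identity(query, middle), 2))
```

For these ten columns nine positions are identical, so the function reports 90.0.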


Sequences
CDS sequence
ATGCATGCTATAAAGGGTGGGTGGACAGGGTGTCCTCTTGCCCTAGCCAAGCACAATGAGTCTGAAGGGAGGAAGACTAGAATCCGGCGATCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAACAATGGGAGTTTCCCTTCGCTCAATCTCGCACACAAGGAAGTTGGTGGATCTTTCTACACTGTGCGGGAAA
TCGTACGAGATATAATCCAAGAAAATAGAGTTCTTGGTCCAGGAAAGTTGTTACTAGAAGAGCTCAACTCCGATCATTTACTTGAAGAGAATCCACTGCACTCAATTGCT
ATTGAACCTCAATCTCCTTTAACCTTACCATCAGAGGAAGTTGATTTTCCAGTCAACTACAACCAATATATAAATGAAGAACCAGTCCTTGTTTCAGATGAGCAATGCAC
TTCAACCATTATTCAGGAATCACAGAATGGGGCAGTAATTAACGGTAGCCGGGTGGATGTGAGTGACAAGGATTCTAATGAATTTATTAAGACAGAGTTGCTTGCAAATG
AATGCAAGAAAGTTGAGGAATCCGGAATGCCAATGAATCATGTAACTCCTTTTGCAGCGGATGTTGTGGTAGAGACCTTCCCATTGGATTCGTTTTCTTGGACTGTTAAT
GGTTCAGATGTAAGATCTGAGACATTGATTTCAACTAATGCCTCAGAAATGCAAGTTAATCAAGCCATTAAGTTAGAATCAGATGTTGGCTTGTTTAACATTGAAGGTAA
TAATTCCATAAAAACAGAGGAAAACTTTGTAAGTTCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTTGCACAAATTGTTGAAACCTCTAATGGATCTACTGTGAAAG
AAGATAGCATGCATGAAGTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATGCTCCAATATCTGTGACGTTTGAGCAAGGCCAGGAATCTAATGAAATGAAGACTCAAAAA
GCTTCTCGGAACGGTATCGAGAATCTCAACAACATATCCAGCAATGGCATTGATCAGGCCTCAAAAACCAAAGAGGAGACAGAGATTGAAAATAAAGTAGATGCACAACA
GATTGGTGGCTCCCAGAAGGAAAGCATTCCAACTCTTGATAGAATCAATCTAGAATCATGGGAAGGGAGTTCAAAGAAAAGGAAACCTGAAAACAACCCGCTCTTTGAAA
TTCTCAAGTCGTTCGTTGCTGCCTTTGTGAAGTTTTGGTCCGAGTAA
mRNA sequence
GCTCGCCATTCCCCGCCCCCACTGGGCTGTTGCGTTGGAAGTTTGCATAATGTTGAACTGATCAGTAGGGTTTTAGACCTCTTTTTATCTCCAATTCTTCTTCTTCCACT
CCATTTTCCCGCTCTCTCTCCAAATCTAGGTTTGTACGCTTAAGAGTTGCGGGCTTCATGCATGCTATAAAGGGTGGGTGGACAGGGTGTCCTCTTGCCCTAGCCAAGCA
CAATGAGTCTGAAGGGAGGAAGACTAGAATCCGGCGATCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAACAATGGGAGTTTCC
CTTCGCTCAATCTCGCACACAAGGAAGTTGGTGGATCTTTCTACACTGTGCGGGAAATCGTACGAGATATAATCCAAGAAAATAGAGTTCTTGGTCCAGGAAAGTTGTTA
CTAGAAGAGCTCAACTCCGATCATTTACTTGAAGAGAATCCACTGCACTCAATTGCTATTGAACCTCAATCTCCTTTAACCTTACCATCAGAGGAAGTTGATTTTCCAGT
CAACTACAACCAATATATAAATGAAGAACCAGTCCTTGTTTCAGATGAGCAATGCACTTCAACCATTATTCAGGAATCACAGAATGGGGCAGTAATTAACGGTAGCCGGG
TGGATGTGAGTGACAAGGATTCTAATGAATTTATTAAGACAGAGTTGCTTGCAAATGAATGCAAGAAAGTTGAGGAATCCGGAATGCCAATGAATCATGTAACTCCTTTT
GCAGCGGATGTTGTGGTAGAGACCTTCCCATTGGATTCGTTTTCTTGGACTGTTAATGGTTCAGATGTAAGATCTGAGACATTGATTTCAACTAATGCCTCAGAAATGCA
AGTTAATCAAGCCATTAAGTTAGAATCAGATGTTGGCTTGTTTAACATTGAAGGTAATAATTCCATAAAAACAGAGGAAAACTTTGTAAGTTCATTATCAGAAACAAAGT
CTGATTTGGTGGAGGTTGCACAAATTGTTGAAACCTCTAATGGATCTACTGTGAAAGAAGATAGCATGCATGAAGTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATGCT
CCAATATCTGTGACGTTTGAGCAAGGCCAGGAATCTAATGAAATGAAGACTCAAAAAGCTTCTCGGAACGGTATCGAGAATCTCAACAACATATCCAGCAATGGCATTGA
TCAGGCCTCAAAAACCAAAGAGGAGACAGAGATTGAAAATAAAGTAGATGCACAACAGATTGGTGGCTCCCAGAAGGAAAGCATTCCAACTCTTGATAGAATCAATCTAG
AATCATGGGAAGGGAGTTCAAAGAAAAGGAAACCTGAAAACAACCCGCTCTTTGAAATTCTCAAGTCGTTCGTTGCTGCCTTTGTGAAGTTTTGGTCCGAGTAAGTTCTA
TGATTATCAAGTAGATGAATATAGAGTAGTTAATTTTCCTGCCACAGAACCTGTCTTTGTACCAAACCTGCAGTCCGTTACCCTGTTTCCTCCGGTCGGTCCCATTGTCG
ATATTTATGAAGAGAAAACTGGACATGGGCTGGCATTTCTGTACTCCGTAGTGTGAAAGAAAGTTTAAGAGTAGCAATTCCACCCTCATTTACAAGCTTAAGCTACTAGA
AGGAATGACTTTTTTTACTGTAACCATAAAATGAGATATATGTCCCACCCCCATGATTTTCTTCTAGATATGGAGGTTCTCTTTTATTTTTCTCCTTACCATATGATGAA
AAGAGGGAAATTTGCACCAACACACCTAAATTATGGGATTTGTTACCATCACACTCTTCAATTTTTAATTTGATCAATCATTTCCAGAACTTTGTTAAGTGTTGCAATTA
TATCCTTGAACTTTTAATTTAATCAGCCTCCTAAACTTGTTAAGTATTGTAACTAGCCTTAGCCTTTTGTACTTTAGTAATGGTTGCAATCAACCTCATACTAGTTTTGT
TAAGTAAGTTCATGGGATTGATTGCAACATTACACTTAACAAAGTTCAAGAGTTGATTACAATACTTAACAAAACTATACGGATTGATTGCAACACTTAGTACCTAGTGT
AAGAGTGTGATTGCGATACTTAGCAAAGTTTATGAAGTTGATTGATCAAATTATATGTTGAATGGTGTAATATTACAAATTTCATAATTTATGAGTGATTTGTGCAAATT
TTCGAATTAAAAATCGTGTCGAAGTGAAGAAAAGTTGGAGTAACGTGAAAAGTAGCCTTCGTGTGTAAGGTGTTTGTGTGTTAGGAGGAGGACAGGGAAACAAGGGAGAA
AGGGATATACAGATTGGAATTTTAAAACTTTGCACCTTTTTGCTCTGACCTAGACATCTTAATATCTTATGGATCATATAGTCACATGATTTTCTGCTTCCACAATTTTC
TAA
Protein sequence
MHAIKGGWTGCPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLAHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEELNSDHLLEENPLHSIA
IEPQSPLTLPSEEVDFPVNYNQYINEEPVLVSDEQCTSTIIQESQNGAVINGSRVDVSDKDSNEFIKTELLANECKKVEESGMPMNHVTPFAADVVVETFPLDSFSWTVN
GSDVRSETLISTNASEMQVNQAIKLESDVGLFNIEGNNSIKTEENFVSSLSETKSDLVEVAQIVETSNGSTVKEDSMHEVEGPGLEVCTDAPISVTFEQGQESNEMKTQK
ASRNGIENLNNISSNGIDQASKTKEETEIENKVDAQQIGGSQKESIPTLDRINLESWEGSSKKRKPENNPLFEILKSFVAAFVKFWSE
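
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling, and the code does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) and drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein

# First eight and last four codons of the CDS listed above.
print(translate("ATGCATGCTATAAAGGGTGGGTGG"))
print(translate("TGGTCCGAGTAA"))
```

The two calls print MHAIKGGW and WSE, matching the start and end of the protein sequence shown above. Passing the whole CDS through `translate` is a quick consistency check between the two records.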