CuGenDBv2

Tan0003684 (gene) of Snake gourd v1 genome

Gene ID: Tan0003684
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Plastid envelope DNA binding protein
Genome location: LG11:7623502..7628976
RNA-Seq Expression: Tan0003684
Synteny: Tan0003684
Gene Ontology terms: NA
InterPro domains: NA
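
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG11:7623502..7628976' into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("LG11:7623502..7628976")
print(loc.seqid, loc.start, loc.end, loc.length)  # LG11 7623502 7628976 5475
```

Under the inclusive-coordinate assumption the gene spans 5475 bp on LG11, which is longer than the mRNA and CDS listed further down, as expected for a gene model with introns.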


Homology
GenBank top hits | e-value | % identity
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia] | 1.0e-187 | 81.06
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         +ENPLHSIAIEPQSPLT  S+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVDVSDKDSDEFI+ EL VNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQTIELESDV LFN EDNNSTKASG   EK        LSET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K+G +HE EGPGLE+CTDTPISVTFEQGQKS+E+KAPNASPSGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK SSKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma] | 1.0e-187 | 81.06
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         +ENPLHSIAIEPQSPLT  S+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVDVSDKDSDEFI+ EL VNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQTIELESDV LFN EDNNSTKASG   EK        LSET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K+G +HE EGPGLE+CTDTPISVTFEQGQKS+E+KAPNASPSGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK SSKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

XP_022921506.1 uncharacterized protein LOC111429750 [Cucurbita moschata] | 3.6e-188 | 81.29
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         EENPLHSIAIEPQSPLT PS+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVD SDKDSDE I+TELLVNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQ IELESDV LFN EDNNSTKASG   EK        LSET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K+G +HE EGPGLE+CTDTPISVTFEQGQKS+E+KAPNASPSGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK SSKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

XP_023534676.1 uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo] | 3.6e-188 | 81.06
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         EENPLHSIAIEPQSPLT PS+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVD SDKDSDEFI+TEL VNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQTIELESDV LFN EDNNSTKASG   EK        LSET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        ++G+ +K+G +HE +GPGLE+CTDTPISVTFEQGQKS+E+KAPNASPSGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK SSKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida] | 1.8e-192 | 83.6
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSTDH
        MHAIKGGWTG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL  EEH  DH
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSTDH

Query:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
          EENPLHSIAIEPQSPLTL +KEV FP+NY+Q INEEPIFVSDEQCT+TN QG QNG IINGSLVD++DK+  EFI++ELLVNEHKKV     EESGMP
Subjt:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DVVVETFPLDS SW VNGSDVRSE LIS +ASE QVSQTIELESDVGLFNI      KASGCVVEK EENF  PLSE  SD+VE AQIVET
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        SNGSTVKEG ++E  GP LEVC+DTPISVTFEQGQKS+EMKAPNASPS  ENLNKTFSNG DQASKIKE+TE+ENKVDA QTGGSQKESIPTLNRINLES
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        WEGMSK SSK ENNP+LEI KAFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

TrEMBL top hits | e-value | % identity
A0A1S3C473 uncharacterized protein LOC103496473 isoform X1 | 1.2e-181 | 76.03
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTG PLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----------
          ++NPLHSIAIEPQSPLTL SKEV FP+NYN+ INEEPIFVSDEQCT+TN QG QN +IINGSLVDVS++DSDEFI++ELLVNEHK+V           
Subjt:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----------

Query:  ------------------------EESGMPINHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASG
                                EESGMPINHVTP A DVVVETFPLD + W VNGSDVRSE LIS NASE QVSQ+IELESDVGL NI       AS 
Subjt:  ------------------------EESGMPINHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASG

Query:  CVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQ
         VVEK  ENF  PLSETKSDLVEVAQIVE SNGSTVKEGS+HE  GP LEVC+DTPISV FEQGQKS++MK+P AS    ENLNKTFSN  DQASKI   
Subjt:  CVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQ

Query:  TEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
         EIENKVD  QTGGSQKES+PTLNRINLESWEGMSK SSKPENNPLLEI+K+FIAAFVKFWSE
Subjt:  TEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein | 2.7e-181 | 75.81
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTG PLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----------
          ++NPLHSIAIEPQSPLTL SKEV FP+NYN+ INEEPIFVSDEQCT+TN QG QN +IINGSLVDVS++DSDEFI++ELLVNEHK+V           
Subjt:  LFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----------

Query:  ------------------------EESGMPINHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASG
                                EESGMPINHVTP A DVVVETFPLD + W VNGSDVRSE LIS NASE QVSQ+IELESDVGL NI       AS 
Subjt:  ------------------------EESGMPINHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASG

Query:  CVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQ
         VVEK  ENF  PLSETKSDLVEVAQIVE SNGSTVKEGS+HE  GP LEVC+DTPISV FEQGQKS++MK+P AS    ENLNKTFSN  DQASKI   
Subjt:  CVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQ

Query:  TEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
         EIENKVD  QTGGSQKES+PTLNRINLESWEGMSK SSKPENNPLLEI+K+FIAAFVKFWS+
Subjt:  TEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC111429750 | 1.7e-188 | 81.29
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         EENPLHSIAIEPQSPLT PS+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVD SDKDSDE I+TELLVNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQ IELESDV LFN EDNNSTKASG   EK        LSET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K+G +HE EGPGLE+CTDTPISVTFEQGQKS+E+KAPNASPSGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK SSKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC111487247 | 5.6e-187 | 80.6
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHA+KGGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHSTDHL
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP
         EENPLHSIAIEPQSPLT PS+E DFP+N+N CINEEPI VSD EQ TS N QG QNG IINGSLVD SDKDSDEFI+TEL VNEHKK+     EESGMP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSD-EQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKV-----EESGMP

Query:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET
        INHVTP A DV V TFPLDS SWA NGSDV SETLIS +ASE +VSQTIELESDV LFN EDNNSTKASG   EK         SET SDLVEVAQIVE 
Subjt:  INHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVET

Query:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES
        +NG+ +K+G +HE EGPGLE+CTDTPISVTFEQGQKS+E+KAPNAS SGT+NLN + +NGIDQASKIKE+TE++NKV+AEQTGGSQKESIPTLNR+NL+S
Subjt:  SNGSTVKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLES

Query:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE
        W G SK  SKPENNPLLEIL AFIAAFVKFWSE
Subjt:  WEGMSKKSSKPENNPLLEILKAFIAAFVKFWSE

A0A6J1JX82 uncharacterized protein LOC111489141 | 4.5e-176 | 79.16
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGW G PLALA+ NESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLN+THKEVGGSFYTVREIVRDIIQENRVLGPGK LLEEHSTDH 
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKVEESGMPINHVTP
         EENPLHSIAIEPQSPLT+ SKEVD PVNYNQ INEEPIFVSDEQCTSTN QG QN  +INGS  D+SDKDSDE  K E +V      EESGMP NHVTP
Subjt:  FEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKVEESGMPINHVTP

Query:  WAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTV
        +  DVVVETFPLDSISWAVN SDVRSETLIS +ASE Q SQT+EL SDVGL NI+ NNST ASG VV+K +EN  +PLSETKSDLVEVAQIVETSNGST 
Subjt:  WAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTV

Query:  KEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSK
        KEGS++E EGP      DTPI V  EQGQKS+E KAPNASPS T+NLNK FSN  D+ SKIKE+TE+EN+V+AEQ GGS      TLNRINLESWEGMSK
Subjt:  KEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSK

Query:  KSSKPENNPLLEILKAFIAAFVKFWSE
         SSKPENNPLLEI KAFI AFVKFWSE
Subjt:  KSSKPENNPLLEILKAFIAAFVKFWSE

SwissProt top hits | e-value | % identity
No hits found

Arabidopsis top hits | e-value | % identity
AT3G52170.1 DNA binding | 8.6e-47 | 33.27
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE + +  +
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPS--------KEVDFP-------VNYNQ-CIN-----------EEPIFVSDEQCTST------------------NSQGLQ
         +++   SI ++P  PL+L          + +DF        VN +Q C++           +E I +  +   ST                  ++ GLQ
Subjt:  FEENPLHSIAIEPQSPLTLPS--------KEVDFP-------VNYNQ-CIN-----------EEPIFVSDEQCTST------------------NSQGLQ

Query:  N---------GAIINGSLVDVSDKDSD----EFIKTE--LLVNEHKKVEESGMPINHV------TPWAADVVVETFPLDSISWAVNGSDVRSETLISANA
        N                 +DV +KD       F++++    VN   +V ++G  +  +         +A+ VVETFPL S++  ++  D +   L     
Subjt:  N---------GAIINGSLVDVSDKDSD----EFIKTE--LLVNEHKKVEESGMPINHV------TPWAADVVVETFPLDSISWAVNGSDVRSETLISANA

Query:  SENQVSQTIELE-SDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLE-VCTDTPISVTFEQGQKSN
                +E + S V   ++ + +S+ +S  + + G E  V  +    S  +E     E  N ++V        E   +  V  +   +  F  G  + 
Subjt:  SENQVSQTIELE-SDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLE-VCTDTPISVTFEQGQKSN

Query:  EMKAPNASPSGTENLN-----KTFSN--GIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFW
        E K P +S       N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S    + E NPLL +LK+F+ AFVKFW
Subjt:  EMKAPNASPSGTENLN-----KTFSN--GIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFW

Query:  SE
        SE
Subjt:  SE

AT3G52170.2 DNA binding | 8.6e-47 | 33.27
Query:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MH++K    G   ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE + +  +
Subjt:  MHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  FEENPLHSIAIEPQSPLTLPS--------KEVDFP-------VNYNQ-CIN-----------EEPIFVSDEQCTST------------------NSQGLQ
         +++   SI ++P  PL+L          + +DF        VN +Q C++           +E I +  +   ST                  ++ GLQ
Subjt:  FEENPLHSIAIEPQSPLTLPS--------KEVDFP-------VNYNQ-CIN-----------EEPIFVSDEQCTST------------------NSQGLQ

Query:  N---------GAIINGSLVDVSDKDSD----EFIKTE--LLVNEHKKVEESGMPINHV------TPWAADVVVETFPLDSISWAVNGSDVRSETLISANA
        N                 +DV +KD       F++++    VN   +V ++G  +  +         +A+ VVETFPL S++  ++  D +   L     
Subjt:  N---------GAIINGSLVDVSDKDSD----EFIKTE--LLVNEHKKVEESGMPINHV------TPWAADVVVETFPLDSISWAVNGSDVRSETLISANA

Query:  SENQVSQTIELE-SDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLE-VCTDTPISVTFEQGQKSN
                +E + S V   ++ + +S+ +S  + + G E  V  +    S  +E     E  N ++V        E   +  V  +   +  F  G  + 
Subjt:  SENQVSQTIELE-SDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGSTVKEGSVHEFEGPGLE-VCTDTPISVTFEQGQKSN

Query:  EMKAPNASPSGTENLN-----KTFSN--GIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFW
        E K P +S       N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S    + E NPLL +LK+F+ AFVKFW
Subjt:  EMKAPNASPSGTENLN-----KTFSN--GIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNPLLEILKAFIAAFVKFW

Query:  SE
        SE
Subjt:  SE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein | 1.9e-09 | 56.6
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein | 1.9e-09 | 56.6
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein | 1.9e-09 | 56.6
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE


Sequences
CDS sequence
ATGGTGGCCAACTACCTAGGCCTTCGAGTTTACTTGACAACCAAATATAGTAGGGTTAGGTTTGTACTAAGAGGGTTGAACTTTGTGGATTTCATGCATGCTATAAAGGG
TGGGTGGACAGGGCCTCCTCTTGCCCTAGCCAAGCATAATGAGTCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCA
TAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAAGAAGTTGGTGGATCTTTCTATACGGTGCGGGAGATTGTACGTGATATAATC
CAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTACTAGAAGAGCACAGCACTGATCATTTATTTGAAGAGAATCCACTGCACTCGATTGCTATTGAACCTCAATCTCC
TTTAACGTTACCATCAAAGGAAGTTGATTTTCCAGTCAACTACAACCAATGTATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATGCACTTCAACAAATAGTCAGG
GATTACAGAATGGGGCAATAATTAACGGCAGCCTGGTGGATGTAAGTGACAAGGATTCTGATGAATTTATCAAGACAGAGTTGCTTGTAAATGAACACAAGAAAGTAGAG
GAATCAGGAATGCCAATTAATCATGTAACTCCTTGGGCAGCAGATGTTGTGGTAGAGACATTCCCATTGGATTCAATTTCTTGGGCTGTTAATGGTTCAGATGTAAGATC
TGAGACATTGATTTCAGCCAATGCCTCGGAAAATCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATCGAAGATAATAATTCCACAAAAGCTT
CTGGTTGTGTAGTTGAAAAGGGAGAGGAAAACTTTGTAAGTCCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTAGCACAGATTGTTGAAACCTCTAATGGATCTACT
GTGAAAGAGGGTAGCGTACATGAATTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATACTCCAATATCCGTGACCTTTGAACAAGGCCAGAAATCTAATGAAATGAAGGC
TCCAAATGCTTCACCGAGTGGTACTGAGAATCTCAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAATCAAAGAGCAGACAGAGATTGAAAATAAAGTAGATG
CTGAACAGACTGGTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGAATTAATCTCGAATCATGGGAAGGGATGTCAAAGAAGTCTTCAAAACCCGAAAACAACCCG
CTTTTGGAAATCCTCAAGGCATTCATTGCCGCCTTCGTGAAATTTTGGTCCGAGTAA
mRNA sequence
CATTTTCATCCCCTACTGGCAATTTTGTTGGAAGTTTGCAGAATGTCACACTGGCTCGCTCCCTGTAACTCGTCAGTAGGGTTTTAGACCTCTTTTATCTCCCATTCTCC
TTCTATCAATTCACTTCTCACTCTCTCTCGGACTCCAGGTGAGTTGTTCAAGCCTTTTCATTTGGTAAAATGGTAGTGGTGTGAATGGGTTGGCGCAATGGTCATTGAAT
TGCTGAAAAAGTAAAGGGATTTAGATGGAATGAGTTAAAACCCCATGGTGGCCAACTACCTAGGCCTTCGAGTTTACTTGACAACCAAATATAGTAGGGTTAGGTTTGTA
CTAAGAGGGTTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGGACAGGGCCTCCTCTTGCCCTAGCCAAGCATAATGAGTCTGAAGGGAGGAAGACCAGAAT
TCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAAGAAGTTG
GTGGATCTTTCTATACGGTGCGGGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTACTAGAAGAGCACAGCACTGATCATTTATTT
GAAGAGAATCCACTGCACTCGATTGCTATTGAACCTCAATCTCCTTTAACGTTACCATCAAAGGAAGTTGATTTTCCAGTCAACTACAACCAATGTATAAATGAAGAACC
AATCTTTGTTTCAGATGAGCAATGCACTTCAACAAATAGTCAGGGATTACAGAATGGGGCAATAATTAACGGCAGCCTGGTGGATGTAAGTGACAAGGATTCTGATGAAT
TTATCAAGACAGAGTTGCTTGTAAATGAACACAAGAAAGTAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTGGGCAGCAGATGTTGTGGTAGAGACATTCCCA
TTGGATTCAATTTCTTGGGCTGTTAATGGTTCAGATGTAAGATCTGAGACATTGATTTCAGCCAATGCCTCGGAAAATCAAGTTAGTCAAACCATTGAGTTAGAATCAGA
TGTTGGCTTGTTTAACATCGAAGATAATAATTCCACAAAAGCTTCTGGTTGTGTAGTTGAAAAGGGAGAGGAAAACTTTGTAAGTCCATTATCAGAAACAAAGTCTGATT
TGGTGGAGGTAGCACAGATTGTTGAAACCTCTAATGGATCTACTGTGAAAGAGGGTAGCGTACATGAATTTGAGGGTCCTGGGTTGGAAGTTTGCACTGATACTCCAATA
TCCGTGACCTTTGAACAAGGCCAGAAATCTAATGAAATGAAGGCTCCAAATGCTTCACCGAGTGGTACTGAGAATCTCAACAAGACATTCAGCAATGGCATTGATCAGGC
CTCAAAAATCAAAGAGCAGACAGAGATTGAAAATAAAGTAGATGCTGAACAGACTGGTGGCTCCCAGAAAGAAAGCATTCCAACACTAAATAGAATTAATCTCGAATCAT
GGGAAGGGATGTCAAAGAAGTCTTCAAAACCCGAAAACAACCCGCTTTTGGAAATCCTCAAGGCATTCATTGCCGCCTTCGTGAAATTTTGGTCCGAGTAAGTTCTATGA
TTGTCAAATAGATGAATAGAGAGTAGTTAATTTTCCTGCCACAAAACCTGTCTGTCTTTGTACCAAACCTGCAGTCAGTTACCCCGTTTACTCCAGTCGGCCCCATCGTC
GATATTTATGAAGAGAAAACTGGACGTGGGTTGGCATTTCTGTACTCTGCAGTGTGAAAGAAAATTTAGGAGTAGCAAGCTTAAACTAGTAGAAGGGTTTTTTTTTTTTT
ACTGTAACCATGAGATGAGATCTATGTCCCACCCCCATGATTTTCCTCCAGATTTGAAGGTTTTCTATTATTTTTTTCATTACCATATGATAAAAAAGAAAGGGTCAAAA
AAAGAGAAGGAAAAGTTGCAGTAACGTGAGAAGGAGCCTTTATGTTAAGGTGCTTGTGTGTTAGAAGGAGGACAGGGAAACAAGAGAGAAAGGCATATACAGATTGGAGT
TTTAATAGGCATCTTCTTTGCCCTTTTTGCTCTGACCCGACATCTTATATCATATGGATCATGTAGTCACATGATTTTCTGCTGCCCCTATTTTCTAATAGTACTCTCTT
GCAGCCTTTTTTAAAAATATATTCCAAATAATACACTTCTTTTTTCTTCTGTA
Protein sequence
MVANYLGLRVYLTTKYSRVRFVLRGLNFVDFMHAIKGGWTGPPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDII
QENRVLGPGKLLLEEHSTDHLFEENPLHSIAIEPQSPLTLPSKEVDFPVNYNQCINEEPIFVSDEQCTSTNSQGLQNGAIINGSLVDVSDKDSDEFIKTELLVNEHKKVE
ESGMPINHVTPWAADVVVETFPLDSISWAVNGSDVRSETLISANASENQVSQTIELESDVGLFNIEDNNSTKASGCVVEKGEENFVSPLSETKSDLVEVAQIVETSNGST
VKEGSVHEFEGPGLEVCTDTPISVTFEQGQKSNEMKAPNASPSGTENLNKTFSNGIDQASKIKEQTEIENKVDAEQTGGSQKESIPTLNRINLESWEGMSKKSSKPENNP
LLEILKAFIAAFVKFWSE
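
The CDS and protein sequences listed for this gene can be cross-checked by translating one into the other. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: the gene is translated with this table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First five codons and last four codons of the CDS listed above.
print(translate("ATGGTGGCCAACTAC"))  # MVANY
print(translate("TGGTCCGAGTAA"))     # WSE
```

Both fragments agree with the start (`MVANY...`) and end (`...WSE`) of the protein sequence. Passing the full CDS text to `translate` and comparing the result with the protein sequence, after joining its lines, would extend the same check to the whole gene.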