; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001153 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001153
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationChr09:14470651..14473826
RNA-Seq ExpressionHG10001153
SyntenyHG10001153
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96146.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]3.0e-22089.95Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM++ FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIAS CIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDSDSG++DNEDDHSP ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        KLDNE+RVLQLK+KEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

XP_004142119.1 uncharacterized protein LOC101205501 [Cucumis sativus]2.6e-21990.8Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM+N FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIASGCIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILPVANFSKGNNES   EDSDSDSDSG++DNEDDHSPVENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTD
        KLDNE+RVLQLK+KEMEL+LK++D
Subjt:  KLDNERRVLQLKQKEMELDLKKTD

XP_008449727.1 PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo]6.7e-22089.72Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM++ FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIAS CIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDSDSG++DNEDDHSP ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        KLDNE+RVLQLK+KEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]4.0e-21285.98Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRR Q+TQL+N SLTHRHHL M+NT EGDH+S+GI+DTK LG KDL M F KGKAIASG +TNN+ TSEEDEPSFTE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG GSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDSD  ++DNEDDH P ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LWP+ESRGRDK SADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQEQC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        K+DNERRVLQLKQKEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]2.2e-23193.49Show/hide
Query:  MKMDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSF
        MKMDSSGLG GFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNM++TFEGDH S+G VDTKSLGQKDLLMAFNKGKAIASG ITNNNYTSEEDEPSF
Subjt:  MKMDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSF

Query:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGR
        TEDGEC EFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKT+SKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+
Subjt:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGR

Query:  GTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVE
        GTSC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILPVANFSKGNNES+EAEDSDSDSDSG++DNEDDHSPVE
Subjt:  GTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVE

Query:  NRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENE
        NRLWPSESRGRDKVSADDGPLWSNSVAKNEFEG+IDVFLSDPTKSQWERRDWV+ QMLQLQEQC NFQAQSVELEKQRFKWLRYCSKKNRDLER RLENE
Subjt:  NRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENE

Query:  RMKLDNERRVLQLKQKEMELDLKKTDKKNG
        RMKLDNERRVLQLKQKEMEL+LK+ D   G
Subjt:  RMKLDNERRVLQLKQKEMELDLKKTDKKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KX12 Uncharacterized protein1.2e-21990.8Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM+N FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIASGCIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILPVANFSKGNNES   EDSDSDSDSG++DNEDDHSPVENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTD
        KLDNE+RVLQLK+KEMEL+LK++D
Subjt:  KLDNERRVLQLKQKEMELDLKKTD

A0A1S3BM36 uncharacterized protein LOC1034915223.3e-22089.72Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM++ FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIAS CIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDSDSG++DNEDDHSP ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        KLDNE+RVLQLK+KEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

A0A5A7TE21 Stress response protein nst1 isoform X13.3e-22089.72Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM++ FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIAS CIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDSDSG++DNEDDHSP ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        KLDNE+RVLQLK+KEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

A0A5D3BB81 Stress response protein nst1 isoform X11.5e-22089.95Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNM++ FEGDH+SIGI+D+KSLGQKDLLMAFN+GKAIAS CIT NNYTSEEDEPS+TE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDSDSG++DNEDDHSP ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        KLDNE+RVLQLK+KEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

A0A6J1H0P0 uncharacterized protein LOC1114593381.9e-21285.98Show/hide
Query:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE
        MDSSGLG GFLSGNGGL+DLESPIRR Q+TQL+N SLTHRHHL M+NT EGDH+S+GI+DTK LG KDL M F KGKAIASG +TNN+ TSEEDEPSFTE
Subjt:  MDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG GSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR
        SC+VVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDSD  ++DNEDDH P ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENR

Query:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LWP+ESRGRDK SADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQEQC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM
Subjt:  LWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELDLKKTDKKNG
        K+DNERRVLQLKQKEMEL+ K++D   G
Subjt:  KLDNERRVLQLKQKEMELDLKKTDKKNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors6.6e-8046.41Show/hide
Query:  NYTSEEDEPSFTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFND
        N  S++DEPSFTE   DG  +E  +  KGSPWQR+KWTD++V+LLI  V+ +GDD    + S+RK  +LQKKGKWK+VSK+M  +G HVSPQQCEDKFND
Subjt:  NYTSEEDEPSFTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFND

Query:  LNKRYKRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEA-----EDS
        LNKRYK+LND+LGRGTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+ +Q + L +A  S+ +++++++     ED 
Subjt:  LNKRYKRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEA-----EDS

Query:  DSDSDSGDTDNEDDHSPVENRLWPSE-----------SRGRDKVSADDG--PLWSNSVAKNEFE-GQIDVFLSDPTKSQWE-------RRDWVKIQMLQL
        D +   GD D  D++                       + R  +S +DG  P   NS+  N+    QI    +D  +   E       ++ W++ + LQL
Subjt:  DSDSDSGDTDNEDDHSPVENRLWPSE-----------SRGRDKVSADDG--PLWSNSVAKNEFE-GQIDVFLSDPTKSQWE-------RRDWVKIQMLQL

Query:  QEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELDL
        +EQ +  Q + +ELEKQRF+W R+  K++++LER+R+ENERMKL+N+R  L+LKQ+E+ ++L
Subjt:  QEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELDL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.2e-6241.14Show/hide
Query:  SEEDEPS-FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRY
        SE+DE    + DG+     K K+ SPWQR+KW D++V+L+I  ++ +G+D    +GS +K  +LQKKGKW++VSK+M  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPS-FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRY

Query:  KRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTD
        K+LN++LGRGTSC+VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D  +Q  +  +   S+ +++++E     ++    D D
Subjt:  KRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTD

Query:  NEDDHS------PVENRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYC
         E+DH       P++ RL   +S+  + V   +       + +++ +    + L D  K+   +R  ++ + L+L+ + +  QA+ +ELE+Q+FKW  + 
Subjt:  NEDDHS------PVENRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYC

Query:  SKKNRDLERVRLENERMKLDNERRVLQLKQKEM
         ++ + L ++R+ENERMKL+NER  L+LK+ E+
Subjt:  SKKNRDLERVRLENERMKLDNERRVLQLKQKEM

AT3G10040.1 sequence-specific DNA binding transcription factors1.2e-6239.89Show/hide
Query:  NKGKAIASGCITNNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDG------------EAGTGSKRKSGILQKKGKWKT
        ++G    SGC           E S   DG+       +K S W RMKWTD +VRLLI  V  +GD+               G G     G+LQKKGKWK+
Subjt:  NKGKAIASGCITNNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDG------------EAGTGSKRKSGILQKKGKWKT

Query:  VSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVD--IQGK
        VS+ M+ KG  VSPQQCEDKFNDLNKRYKR+NDILG+G +C+VVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN     G  D     Q  
Subjt:  VSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCKVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVD--IQGK

Query:  I-LPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQL
        I +P+ +  +    + EA      ++  + + E +    E+    SES   +    +       S A      +    + D  KS WE+++W++ +ML++
Subjt:  I-LPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQL

Query:  QEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELD
        +E+ I ++ + VE+EKQR KW+RY SKK R++E+ +L+N+R +L+ ER +L L++ E+EL+
Subjt:  QEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGATAGTTCAGGTTTGGGAAGTGGATTTCTGTCAGGAAATGGGGGGCTAATAGATCTGGAGTCTCCTATCCGAAGACCTCAAAAAACCCAATTGGTCAATCC
CTCGTTGACACACCGCCATCACCTGAACATGGTAAATACTTTTGAAGGCGATCACCGGTCCATTGGGATTGTGGACACGAAAAGCTTGGGACAGAAGGACTTATTGATGG
CGTTCAATAAAGGGAAAGCTATTGCCTCTGGTTGCATCACAAACAACAACTACACGAGTGAAGAAGACGAGCCAAGTTTTACTGAGGATGGAGAGTGCTCTGAGTTTTTG
AAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAGCCGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAACGGG
TTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACGGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAAT
TTAATGATTTGAACAAAAGATACAAGAGATTGAACGATATACTTGGGAGGGGAACCAGTTGTAAAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCA
AGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAGCAATTCCTGGTTGCCAGGATGTTGA
TATCCAAGGTAAAATTTTGCCTGTTGCAAATTTCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGACAGTGGTGATACAGATAATGAAG
ATGATCACTCTCCTGTGGAAAATAGATTATGGCCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAA
TTTGAAGGTCAAATTGATGTTTTTCTGTCGGATCCAACGAAGTCCCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATCAACTT
TCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAAC
TAGATAATGAGCGGAGGGTACTGCAACTGAAGCAGAAGGAAATGGAACTAGACTTAAAAAAGACAGATAAGAAAAATGGAGAATATGATTCATGGTTTCTTTCCTTGCAC
GCCATGTTTTCTAGTTCCATGCTCTCACTCATGTATTGTTACTTTTGGCCATTTTGTGGACAGATCACAGACCTTATTAAAGTTATCAACGAGAGAAGACTCCCTTTTCA
AAGTGAGTTGAACGACAAAATCATGGCCATGGATGAACTCTCCACTCTTCTTACAATGTCAGTAGCGGTTTTGCTGAGTAAACCAGTTTCTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGGATAGTTCAGGTTTGGGAAGTGGATTTCTGTCAGGAAATGGGGGGCTAATAGATCTGGAGTCTCCTATCCGAAGACCTCAAAAAACCCAATTGGTCAATCC
CTCGTTGACACACCGCCATCACCTGAACATGGTAAATACTTTTGAAGGCGATCACCGGTCCATTGGGATTGTGGACACGAAAAGCTTGGGACAGAAGGACTTATTGATGG
CGTTCAATAAAGGGAAAGCTATTGCCTCTGGTTGCATCACAAACAACAACTACACGAGTGAAGAAGACGAGCCAAGTTTTACTGAGGATGGAGAGTGCTCTGAGTTTTTG
AAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAGCCGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAACGGG
TTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACGGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAAT
TTAATGATTTGAACAAAAGATACAAGAGATTGAACGATATACTTGGGAGGGGAACCAGTTGTAAAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCA
AGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAGCAATTCCTGGTTGCCAGGATGTTGA
TATCCAAGGTAAAATTTTGCCTGTTGCAAATTTCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGACAGTGGTGATACAGATAATGAAG
ATGATCACTCTCCTGTGGAAAATAGATTATGGCCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAA
TTTGAAGGTCAAATTGATGTTTTTCTGTCGGATCCAACGAAGTCCCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATCAACTT
TCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAAC
TAGATAATGAGCGGAGGGTACTGCAACTGAAGCAGAAGGAAATGGAACTAGACTTAAAAAAGACAGATAAGAAAAATGGAGAATATGATTCATGGTTTCTTTCCTTGCAC
GCCATGTTTTCTAGTTCCATGCTCTCACTCATGTATTGTTACTTTTGGCCATTTTGTGGACAGATCACAGACCTTATTAAAGTTATCAACGAGAGAAGACTCCCTTTTCA
AAGTGAGTTGAACGACAAAATCATGGCCATGGATGAACTCTCCACTCTTCTTACAATGTCAGTAGCGGTTTTGCTGAGTAAACCAGTTTCTTCTTGA
Protein sequenceShow/hide protein sequence
MKMDSSGLGSGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMVNTFEGDHRSIGIVDTKSLGQKDLLMAFNKGKAIASGCITNNNYTSEEDEPSFTEDGECSEFL
KGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCKVVENPALMDSMPHLS
SKAKDDVRKILSSKHLFYKEMCAYHNGQAIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSDSGDTDNEDDHSPVENRLWPSESRGRDKVSADDGPLWSNSVAKNE
FEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCINFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELDLKKTDKKNGEYDSWFLSLH
AMFSSSMLSLMYCYFWPFCGQITDLIKVINERRLPFQSELNDKIMAMDELSTLLTMSVAVLLSKPVSS