; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G048600 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G048600
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationCiama_Chr02:36439074..36443409
RNA-Seq ExpressionCaUC02G048600
SyntenyCaUC02G048600
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96146.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]1.8e-21789.7Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKADKKNG
        LDNE+RVLQLK+KEMELE K++D   G
Subjt:  LDNERRVLQLKQKEMELELKKADKKNG

XP_004142119.1 uncharacterized protein LOC101205501 [Cucumis sativus]1.0e-21590.07Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMM+ FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIASGC TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPVANFSKGNNES   EDSDSDS+SGES +EDDHSPVEN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKAD
        LDNE+RVLQLK+KEMELELK++D
Subjt:  LDNERRVLQLKQKEMELELKKAD

XP_008449727.1 PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo]4.1e-21789.46Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKADKKNG
        LDNE+RVLQLK+KEMELE K++D   G
Subjt:  LDNERRVLQLKQKEMELELKKADKKNG

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]7.3e-20685.75Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPI R Q+TQL N SLT RHHL MM+T EGDHQS+GI+DTK LG KDL M F+K KAIASG  TNN  TSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLI VVACVGDDGEAGMGSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCEDKFNDLNKRYK LNDI+GRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDS+  ES +EDDH P EN 
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS

Query:  LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LWP+ESRGRDKASADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQEQC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM
Subjt:  LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELELKKADKKNG
        K+DNERRVLQLKQKEMELE K++D   G
Subjt:  KLDNERRVLQLKQKEMELELKKADKKNG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]2.7e-22191.16Show/hide
Query:  MKMDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIAS-GCNTNNYTSEEDEPSF
        MKMDSSGLGGGFLS NGGL+DLESPI RPQKTQL NPSLT RHHLNMMSTFEGDH S+G VDTKSLGQKDLLMAF+K KAIAS G   NNYTSEEDEPSF
Subjt:  MKMDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIAS-GCNTNNYTSEEDEPSF

Query:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGR
        TEDGEC EFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAG GSKRKSGILQKKGKWKT+SKIMLSKGCHVSPQQCEDKFNDLNKRYK LNDIIG+
Subjt:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGR

Query:  GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVE
        GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPVANFSKGNNES+EAEDSDSDS+SGES +EDDHSPVE
Subjt:  GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVE

Query:  NSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENE
        N LWPSESRGRDK SADDGPLWSNSVAKNEFEG+IDVFLSDPTKSQWERRDWV+ QMLQLQEQC  FQAQSVELEKQRFKWLRYCSKKNRDLER RLENE
Subjt:  NSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENE

Query:  RMKLDNERRVLQLKQKEMELELKKADKKNG
        RMKLDNERRVLQLKQKEMELELK+ D   G
Subjt:  RMKLDNERRVLQLKQKEMELELKKADKKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KX12 Uncharacterized protein4.9e-21690.07Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMM+ FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIASGC TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPVANFSKGNNES   EDSDSDS+SGES +EDDHSPVEN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKAD
        LDNE+RVLQLK+KEMELELK++D
Subjt:  LDNERRVLQLKQKEMELELKKAD

A0A1S3BM36 uncharacterized protein LOC1034915222.0e-21789.46Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKADKKNG
        LDNE+RVLQLK+KEMELE K++D   G
Subjt:  LDNERRVLQLKQKEMELELKKADKKNG

A0A5A7TE21 Stress response protein nst1 isoform X12.0e-21789.46Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKADKKNG
        LDNE+RVLQLK+KEMELE K++D   G
Subjt:  LDNERRVLQLKQKEMELELKKADKKNG

A0A5D3BB81 Stress response protein nst1 isoform X18.9e-21889.7Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPI RPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D+KSLGQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLI VVACVGDDGEAGMGSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYK LNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL

Query:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK
        W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMK
Subjt:  WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELELKKADKKNG
        LDNE+RVLQLK+KEMELE K++D   G
Subjt:  LDNERRVLQLKQKEMELELKKADKKNG

A0A6J1H0P0 uncharacterized protein LOC1114593383.5e-20685.75Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPI R Q+TQL N SLT RHHL MM+T EGDHQS+GI+DTK LG KDL M F+K KAIASG  TNN  TSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLI VVACVGDDGEAGMGSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCEDKFNDLNKRYK LNDI+GRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDIIGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDS+  ES +EDDH P EN 
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS

Query:  LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LWP+ESRGRDKASADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQEQC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM
Subjt:  LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELELKKADKKNG
        K+DNERRVLQLKQKEMELE K++D   G
Subjt:  KLDNERRVLQLKQKEMELELKKADKKNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors2.1e-7846.13Show/hide
Query:  NYTSEEDEPSFTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFND
        N  S++DEPSFTE   DG  +E  +  KGSPWQR+KWTD++V+LLIT V+ +GDD      S+RK  +LQKKGKWK+VSK+M  +G HVSPQQCEDKFND
Subjt:  NYTSEEDEPSFTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFND

Query:  LNKRYKSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEA-----EDS
        LNKRYK LND++GRGTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+ +Q + L +A  S+ +++++++     ED 
Subjt:  LNKRYKSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEA-----EDS

Query:  DSDSESGESASEDDHSPVENSLWPSE-----------SRGRDKASADDG--PLWSNSVAKNEFE-GQIDVFLSDPTKSQWE-------RRDWVKIQMLQL
        D +   G+    D++     +                 + R   S +DG  P   NS+  N+    QI    +D  +   E       ++ W++ + LQL
Subjt:  DSDSESGESASEDDHSPVENSLWPSE-----------SRGRDKASADDG--PLWSNSVAKNEFE-GQIDVFLSDPTKSQWE-------RRDWVKIQMLQL

Query:  QEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELEL
        +EQ +  Q + +ELEKQRF+W R+  K++++LER+R+ENERMKL+N+R  L+LKQ+E+ +EL
Subjt:  QEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)5.2e-6140.72Show/hide
Query:  SEEDEPS-FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRY
        SE+DE    + DG+     K K+ SPWQR+KW D++V+L+IT ++ +G+D     GS +K  +LQKKGKW++VSK+M  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPS-FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRY

Query:  KSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESA
        K LN+++GRGTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D  +Q  +  +   S+ +++++E     ++    +  
Subjt:  KSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESA

Query:  SEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDV---FLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKK
         E+DH    +   P +   + ++  D G              Q DV      D  K+   +R  ++ + L+L+ + +  QA+ +ELE+Q+FKW  +  ++
Subjt:  SEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDV---FLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKK

Query:  NRDLERVRLENERMKLDNERRVLQLKQKEMELEL
         + L ++R+ENERMKL+NER  L+LK+ E+  +L
Subjt:  NRDLERVRLENERMKLDNERRVLQLKQKEMELEL

AT3G10040.1 sequence-specific DNA binding transcription factors3.1e-6136.85Show/hide
Query:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIA----SGCNTNNYTSEEDEPS
        M+S+ +  GF   +  +L LE P   P      NP  + +       T  GD Q+   +       K L    SK K ++     GC+  +  S      
Subjt:  MDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIA----SGCNTNNYTSEEDEPS

Query:  FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMG----SKRKS----------GILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFN
          ED   ++    +K S W RMKWTD +VRLLI  V  +GD  EAG+     +K+K+          G+LQKKGKWK+VS+ M+ KG  VSPQQCEDKFN
Subjt:  FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMG----SKRKS----------GILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFN

Query:  DLNKRYKSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD----------VDIQGKILPVANFSKGNNES
        DLNKRYK +NDI+G+G +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN     G  D          + I  +     + ++    +
Subjt:  DLNKRYKSLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD----------VDIQGKILPVANFSKGNNES

Query:  EEAEDSDSDSESGESASEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELE
          AE  + + E     +ED  S +E S    E   R K           S A      +    + D  KS WE+++W++ +ML+++E+ + ++ + VE+E
Subjt:  EEAEDSDSDSESGESASEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELE

Query:  KQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMEL
        KQR KW+RY SKK R++E+ +L+N+R +L+ ER +L L++ E+EL
Subjt:  KQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGTTAGTGAAATCCGGCTTTCTGAGATTCTCAGGATGCACTTTATTTGTGGACTAAGAAAAGGCATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTC
AGCAAATGGGGGGCTATTAGATCTGGAGTCTCCTATCGGAAGACCTCAAAAAACCCAATTGTTCAATCCCTCGTTGACACAACGCCATCACTTGAACATGATGAGTACTT
TTGAAGGCGATCACCAGTCCATTGGGATTGTGGACACGAAAAGCTTGGGGCAGAAGGATTTATTGATGGCGTTCAGTAAAAGGAAAGCTATTGCCTCTGGTTGCAACACA
AACAACTACACGAGTGAAGAAGATGAGCCAAGTTTTACCGAGGATGGCGAGTGCTCTGAGTTTTTGAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGA
TGAGATTGTGAGGCTTCTCATAACAGTGGTTGCTTGTGTGGGTGATGACGGGGAGGCTGGAATGGGTTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGA
AAACAGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGAGTTTGAACGATATAATT
GGGAGGGGAACCAGTTGTAGAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACA
CTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATATCCAAGGTAAAATTTTGCCTGTTGCTAATTTCTCCAAAGGAA
ATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGAAAGTGGTGAATCAGCTAGTGAAGACGATCACTCTCCTGTGGAAAATAGTTTATGGCCATCTGAATCT
CGTGGCAGGGATAAAGCGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCGGATCCGACAAAGTC
CCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATGACCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGT
TAAGGTATTGTAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTACTGCAACTAAAACAGAAGGAAATG
GAACTAGAATTAAAAAAGGCAGATAAGAAAAATGGAGAATATGATTCATGGTTTCTTTCCTTGCACGCCATGTTTTCTAGTTCCATGCTCTCTCACTCATGTTCTGTTAC
TTTTGGCCATTTTGTGGACAGAACACAGACCTTATTAAAGTTATCAACGAGGGAAGACTCCCTTTTCAAAGTGGTTGGACGACAAAATCATGGCCATGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGTTAGTGAAATCCGGCTTTCTGAGATTCTCAGGATGCACTTTATTTGTGGACTAAGAAAAGGCATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTC
AGCAAATGGGGGGCTATTAGATCTGGAGTCTCCTATCGGAAGACCTCAAAAAACCCAATTGTTCAATCCCTCGTTGACACAACGCCATCACTTGAACATGATGAGTACTT
TTGAAGGCGATCACCAGTCCATTGGGATTGTGGACACGAAAAGCTTGGGGCAGAAGGATTTATTGATGGCGTTCAGTAAAAGGAAAGCTATTGCCTCTGGTTGCAACACA
AACAACTACACGAGTGAAGAAGATGAGCCAAGTTTTACCGAGGATGGCGAGTGCTCTGAGTTTTTGAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGA
TGAGATTGTGAGGCTTCTCATAACAGTGGTTGCTTGTGTGGGTGATGACGGGGAGGCTGGAATGGGTTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGA
AAACAGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGAGTTTGAACGATATAATT
GGGAGGGGAACCAGTTGTAGAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACA
CTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATATCCAAGGTAAAATTTTGCCTGTTGCTAATTTCTCCAAAGGAA
ATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGAAAGTGGTGAATCAGCTAGTGAAGACGATCACTCTCCTGTGGAAAATAGTTTATGGCCATCTGAATCT
CGTGGCAGGGATAAAGCGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCGGATCCGACAAAGTC
CCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATGACCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGT
TAAGGTATTGTAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTACTGCAACTAAAACAGAAGGAAATG
GAACTAGAATTAAAAAAGGCAGATAAGAAAAATGGAGAATATGATTCATGGTTTCTTTCCTTGCACGCCATGTTTTCTAGTTCCATGCTCTCTCACTCATGTTCTGTTAC
TTTTGGCCATTTTGTGGACAGAACACAGACCTTATTAAAGTTATCAACGAGGGAAGACTCCCTTTTCAAAGTGGTTGGACGACAAAATCATGGCCATGGATGA
Protein sequenceShow/hide protein sequence
MSVSEIRLSEILRMHFICGLRKGMKMDSSGLGGGFLSANGGLLDLESPIGRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSLGQKDLLMAFSKRKAIASGCNT
NNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGMGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKSLNDII
GRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSLWPSES
RGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEM
ELELKKADKKNGEYDSWFLSLHAMFSSSMLSHSCSVTFGHFVDRTQTLLKLSTREDSLFKVVGRQNHGHG