; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017987 (gene) of Snake gourd v1 genome

Gene IDTan0017987
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationLG07:11800749..11802071
RNA-Seq ExpressionTan0017987
SyntenyTan0017987
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605907.1 hypothetical protein SDJN03_03224, partial [Cucurbita argyrosperma subsp. sororia]3.4e-22891.06Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQ +GIMDTK LG KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PAESRGRDK SADDGPLWS T+AQNEFEGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]6.9e-22991.28Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK LG KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PAESRGRDK SADDGPLWS T+AQNEFEGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

XP_022995089.1 uncharacterized protein LOC111490737 [Cucurbita maxima]3.4e-22890.6Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK +G KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFS+GNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PA+SRGRDK SADDGPLWSNT+AQNE EGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

XP_023534092.1 uncharacterized protein LOC111795758 [Cucurbita pepo subsp. pepo]7.6e-22890.83Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T E DHQS+GIMDTK LG KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CT+FLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PAESRGRDK SADDGPLWS T+AQNEFEGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]2.3e-22487.47Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GL+DLESPIRR Q+TQLVNPSLTHRHHLNMMSTFEGDH S+G +DTKSLGQKD+ M F KGK      +TNNN TSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+C EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG GSKRKSGILQKKGKWK +SKIM+SKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENR
        SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPVANFSKGNN+ +EA+DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENR

Query:  LWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM
        LWP+ESRGRDKVSADDGPLWSN+ A+NEFEG+IDVFLSDP KSQWERRDW++KQM+QLQEQC +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM
Subjt:  LWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERM

Query:  RLDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH
        +LDNERRVLQLKQKEMELE KR DSSF PTLGIDRIQGREQ+DLGRH
Subjt:  RLDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH

TrEMBL top hitse value%identityAlignment
A0A1S3BM36 uncharacterized protein LOC1034915224.2e-21685.43Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED
        MDSSGLGGGFL+GN GLLDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKD+ M F +GK + +    NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED

Query:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        G+C+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKW+ VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNN+ EEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL

Query:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR
        W +ESRGRDKVSADDGPLWSN+  +NEFEGQIDVFLSDP KSQWER+ WIKKQM+QLQEQC SFQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM+
Subjt:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR

Query:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+  P L  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH

A0A5A7TE21 Stress response protein nst1 isoform X14.2e-21685.43Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED
        MDSSGLGGGFL+GN GLLDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKD+ M F +GK + +    NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED

Query:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        G+C+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKW+ VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNN+ EEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL

Query:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR
        W +ESRGRDKVSADDGPLWSN+  +NEFEGQIDVFLSDP KSQWER+ WIKKQM+QLQEQC SFQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM+
Subjt:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR

Query:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+  P L  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH

A0A5D3BB81 Stress response protein nst1 isoform X11.9e-21685.65Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED
        MDSSGLGGGFL+GN GLLDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKD+ M F +GK + +    NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTN----NNNTSEEDEPSFTED

Query:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        G+C+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKWK VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILP ANFSKGNN+ EEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSD--ESDNEDDHFPEENRL

Query:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR
        W +ESRGRDKVSADDGPLWSN+  +NEFEGQIDVFLSDP KSQWER+ WIKKQM+QLQEQC SFQAQSVELEKQRFKWLRYCSKKNRDLER RLENERM+
Subjt:  WPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMR

Query:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+  P L  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH

A0A6J1H0P0 uncharacterized protein LOC1114593383.3e-22991.28Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK LG KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFSKGNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PAESRGRDK SADDGPLWS T+AQNEFEGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

A0A6J1K4Q0 uncharacterized protein LOC1114907371.7e-22890.6Show/hide
Query:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE
        MDSSGLGGGFL+GN GLLDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK +G KD+SMTFTKGK      VTNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKT-----VTNNNNTSEEDEPSFTE

Query:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DG+CTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKILPV NFS+GNN+ EEADDSDSDSDESDNEDDH+PEENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLW

Query:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL
        PA+SRGRDK SADDGPLWSNT+AQNE EGQIDVFLSDP K QWERRDWIKKQM+QLQEQC+SFQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERM++
Subjt:  PAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRL

Query:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSF PTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.9e-8141.96Show/hide
Query:  MDSSGLGGGFL---AGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTNNNNTSEEDEPSFTE--
        MD +   GG +   A + G  DL+  +R H Q  +   +  HRH+ N     EG   ++    T    Q        + K     N+ S++DEPSFTE  
Subjt:  MDSSGLGGGFL---AGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTNNNNTSEEDEPSFTE--

Query:  -DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRG
         DG   E  +  KGSPWQR+KWTD +V+LLI  V+ +GDD      S+RK  +LQKKGKWK VSK+M  +G HVSPQQCEDKFNDLNKRYK+LND+LGRG
Subjt:  -DGDCTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRG

Query:  TSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNND--------LEEADDSDSDSDESDNEDD
        TSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+ LQ + L +A  S+ ++D        +E+ DD D D D   +E D
Subjt:  TSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNND--------LEEADDSDSDSDESDNEDD

Query:  HFPEENRLW------------PAESRGRDKVSADDG--PLWSNTAAQNEFE------GQIDVFL--SDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVE
         + E++  +                + R  +S +DG  P   N+   N+         Q DV    ++  ++   ++ W++ + +QL+EQ +  Q + +E
Subjt:  HFPEENRLW------------PAESRGRDKVSADDG--PLWSNTAAQNEFE------GQIDVFL--SDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVE

Query:  LEKQRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEMELE
        LEKQRF+W R+  K++++LER+R+ENERM+L+N+R  L+LKQ+E+ +E
Subjt:  LEKQRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)3.5e-6138.89Show/hide
Query:  TVTNNNNTSEEDEPSFTEDGDCTEFL-----KGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQ
        T+   +N  +  + S +ED +          K K+ SPWQR+KW D +V+L+I  ++ +G+D     GS +K  +LQKKGKW+ VSK+M  +G HVSPQQ
Subjt:  TVTNNNNTSEEDEPSFTEDGDCTEFL-----KGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQ

Query:  CEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEE--
        CEDKFNDLNKRYK+LN++LGRGTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D  +Q  +  +   S+ ++D +E  
Subjt:  CEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEE--

Query:  ---ADDSDSDSDESDNEDDHFPEENRLWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEK
            +D D D D  ++ D    +       +S+  + V   +         +++ +    + L D  K+   +R  I+ + ++L+ + +  QA+ +ELE+
Subjt:  ---ADDSDSDSDESDNEDDHFPEENRLWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEK

Query:  QRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEM
        Q+FKW  +  ++ + L ++R+ENERM+L+NER  L+LK+ E+
Subjt:  QRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEM

AT3G10040.1 sequence-specific DNA binding transcription factors4.7e-6639.95Show/hide
Query:  EEDEPSFTEDGDCTEFLKGKKG----SPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILQKKGKWKMVSKIMISKGCHV
        +ED  S +  G   E   G  G    S W RMKWTD +VRLLI  V  +GD  EAG+     +K+K+          G+LQKKGKWK VS+ M+ KG  V
Subjt:  EEDEPSFTEDGDCTEFLKGKKG----SPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILQKKGKWKMVSKIMISKGCHV

Query:  SPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNG----------------QTIP------GC
        SPQQCEDKFNDLNKRYKR+NDILG+G +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN                  +IP       C
Subjt:  SPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNG----------------QTIP------GC

Query:  QDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKK
              GK+  +A   +   ++E     DS+S+  ++E++   ++ R+  A  R R++ ++                      + D  KS WE+++WI++
Subjt:  QDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFLSDPAKSQWERRDWIKK

Query:  QMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEMEL-EFKRSDSSFEPT
        +M++++E+ I ++ + VE+EKQR KW+RY SKK R++E+ +L+N+R RL+ ER +L L++ E+EL E + S +  +P+
Subjt:  QMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEMEL-EFKRSDSSFEPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAGGTCTGGGAGGTGGATTTCTGGCAGGAAATAGGGGGCTACTAGATCTGGAGTCTCCTATTCGAAGACATCAACAGACCCAATTGGTCAATCCCTCATT
GACACACCGACATCACTTGAACATGATGAGCACTTTTGAAGGCGATCACCAATCCATTGGGATTATGGACACGAAAAGCTTGGGACAGAAAGATATTTCGATGACCTTTA
CTAAAGGGAAAACCGTCACAAACAACAATAACACGAGTGAAGAAGATGAGCCTAGTTTTACTGAGGATGGTGACTGCACTGAGTTTTTGAAGGGCAAAAAGGGTTCCCCA
TGGCAGAGAATGAAGTGGACAGATGACATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAATGGGCTCGAAGAGAAAATCTGGAAT
TTTGCAAAAGAAGGGAAAATGGAAAATGGTGTCAAAGATTATGATAAGTAAGGGGTGTCATGTTTCTCCCCAGCAGTGTGAGGACAAATTTAATGACTTAAACAAAAGAT
ACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTTATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTC
AGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATTTACAAGGTAAGATTTTACC
TGTTGCAAATTTCTCCAAAGGAAATAATGATTTAGAAGAGGCTGATGACAGTGATAGTGACAGTGATGAATCAGATAATGAAGATGATCACTTTCCTGAGGAAAATAGAT
TATGGCCGGCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCACTTTGGTCAAACACTGCTGCACAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTT
TCGGATCCAGCAAAGTCCCAATGGGAGCGTAGAGATTGGATTAAAAAACAGATGGTACAACTTCAGGAGCAATGTATCAGCTTCCAGGCTCAATCTGTTGAACTTGAGAA
ACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAGGCTAGATAATGAGCGAAGAGTACTGCAAC
TGAAGCAGAAGGAAATGGAACTGGAGTTCAAAAGGTCTGATTCATCCTTCGAGCCAACCCTTGGCATTGATAGAATTCAAGGGAGAGAGCAAATTGATTTGGGTAGGCAT
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGATAGTTCAGGTCTGGGAGGTGGATTTCTGGCAGGAAATAGGGGGCTACTAGATCTGGAGTCTCCTATTCGAAGACATCAACAGACCCAATTGGTCAATCCCTCATT
GACACACCGACATCACTTGAACATGATGAGCACTTTTGAAGGCGATCACCAATCCATTGGGATTATGGACACGAAAAGCTTGGGACAGAAAGATATTTCGATGACCTTTA
CTAAAGGGAAAACCGTCACAAACAACAATAACACGAGTGAAGAAGATGAGCCTAGTTTTACTGAGGATGGTGACTGCACTGAGTTTTTGAAGGGCAAAAAGGGTTCCCCA
TGGCAGAGAATGAAGTGGACAGATGACATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAATGGGCTCGAAGAGAAAATCTGGAAT
TTTGCAAAAGAAGGGAAAATGGAAAATGGTGTCAAAGATTATGATAAGTAAGGGGTGTCATGTTTCTCCCCAGCAGTGTGAGGACAAATTTAATGACTTAAACAAAAGAT
ACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTTATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTC
AGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATTTACAAGGTAAGATTTTACC
TGTTGCAAATTTCTCCAAAGGAAATAATGATTTAGAAGAGGCTGATGACAGTGATAGTGACAGTGATGAATCAGATAATGAAGATGATCACTTTCCTGAGGAAAATAGAT
TATGGCCGGCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCACTTTGGTCAAACACTGCTGCACAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTT
TCGGATCCAGCAAAGTCCCAATGGGAGCGTAGAGATTGGATTAAAAAACAGATGGTACAACTTCAGGAGCAATGTATCAGCTTCCAGGCTCAATCTGTTGAACTTGAGAA
ACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAGGCTAGATAATGAGCGAAGAGTACTGCAAC
TGAAGCAGAAGGAAATGGAACTGGAGTTCAAAAGGTCTGATTCATCCTTCGAGCCAACCCTTGGCATTGATAGAATTCAAGGGAGAGAGCAAATTGATTTGGGTAGGCAT
TAA
Protein sequenceShow/hide protein sequence
MDSSGLGGGFLAGNRGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDISMTFTKGKTVTNNNNTSEEDEPSFTEDGDCTEFLKGKKGSP
WQRMKWTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDV
RKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKILPVANFSKGNNDLEEADDSDSDSDESDNEDDHFPEENRLWPAESRGRDKVSADDGPLWSNTAAQNEFEGQIDVFL
SDPAKSQWERRDWIKKQMVQLQEQCISFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMRLDNERRVLQLKQKEMELEFKRSDSSFEPTLGIDRIQGREQIDLGRH