; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024657 (gene) of Chayote v1 genome

Gene IDSed0024657
OrganismSechium edule (Chayote v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationLG05:11426671..11431856
RNA-Seq ExpressionSed0024657
SyntenySed0024657
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605907.1 hypothetical protein SDJN03_03224, partial [Cucurbita argyrosperma subsp. sororia]4.0e-21386.47Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K LGHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFSKGNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P E+RGRDK SADDGPLW   +AQNEF+GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]4.0e-21386.47Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K LGHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFSKGNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P E+RGRDK SADDGPLW   +AQNEF+GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

XP_022995089.1 uncharacterized protein LOC111490737 [Cucurbita maxima]2.0e-21285.78Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K +GHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFS+GNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P ++RGRDK SADDGPLW N +AQNE +GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

XP_023534092.1 uncharacterized protein LOC111795758 [Cucurbita pepo subsp. pepo]6.8e-21386.24Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K LGHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECT+FLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFSKGNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P E+RGRDK SADDGPLW   +AQNEF+GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]4.9e-21183.67Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGL+DLESPIRR Q+TQLVNPSLTHRHHLNMMS+FE      G +D+KSLG KDL M FNKGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGEC EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGD+GEA    KRKSGILQKKGKWK +SKI++SKGCHVSPQQCEDKFNDLNKRYKRLND++G+GT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENR
        SC+VVENPALMDSMPHLSSKAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PVANFSKGNNES+EA+DSDSDSD  ESDNE DH P ENR
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENR

Query:  LWPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERM
        LWP E+RGRDKVSADDGPLW N+ A+NEF+G+IDVFLSD TKSQ ERRDW++KQMLQL+EQC +FQAQSVELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH
        KLDNERRVLQLKQKEMELE KR +SSFGPT+GIDRIQGREQ+DLGRH
Subjt:  KLDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH

TrEMBL top hitse value%identityAlignment
A0A1S3BM36 uncharacterized protein LOC1034915229.6e-20582.51Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED
        MDSSGLGGGFLSG GGLLDLESPIRR Q+TQLVNPSLT RH LNMMS+FE      GILDSKSLG KDL M FN+GK IAS        SEEDEPS+TED
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED

Query:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS
        GEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGD+GEA M  KRKSGIL KKGKW+ VSKI+ SKGCHVSPQQCEDKFNDLNKRYKRLND+LG+GTS
Subjt:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS

Query:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL
        C+VVENPALMDSMPHLSSKAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI P ANFSKGNNESEEA+DSDSDSD  ESDNE DH P ENRL
Subjt:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL

Query:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W  E+RGRDKVSADDGPLW N+  +NEF+GQIDVFLSD TKSQ ER+ WIKKQMLQL+EQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELESKRS+S+ GP +  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH

A0A5A7TE21 Stress response protein nst1 isoform X19.6e-20582.51Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED
        MDSSGLGGGFLSG GGLLDLESPIRR Q+TQLVNPSLT RH LNMMS+FE      GILDSKSLG KDL M FN+GK IAS        SEEDEPS+TED
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED

Query:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS
        GEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGD+GEA M  KRKSGIL KKGKW+ VSKI+ SKGCHVSPQQCEDKFNDLNKRYKRLND+LG+GTS
Subjt:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS

Query:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL
        C+VVENPALMDSMPHLSSKAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI P ANFSKGNNESEEA+DSDSDSD  ESDNE DH P ENRL
Subjt:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL

Query:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W  E+RGRDKVSADDGPLW N+  +NEF+GQIDVFLSD TKSQ ER+ WIKKQMLQL+EQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELESKRS+S+ GP +  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH

A0A5D3BB81 Stress response protein nst1 isoform X14.3e-20582.74Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED
        MDSSGLGGGFLSG GGLLDLESPIRR Q+TQLVNPSLT RH LNMMS+FE      GILDSKSLG KDL M FN+GK IAS        SEEDEPS+TED
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG-------SEEDEPSFTED

Query:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS
        GEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGD+GEA M  KRKSGIL KKGKWK VSKI+ SKGCHVSPQQCEDKFNDLNKRYKRLND+LG+GTS
Subjt:  GECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTS

Query:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL
        C+VVENPALMDSMPHLSSKAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI P ANFSKGNNESEEA+DSDSDSD  ESDNE DH P ENRL
Subjt:  CKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSD--ESDNEGDHYPEENRL

Query:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W  E+RGRDKVSADDGPLW N+  +NEF+GQIDVFLSD TKSQ ER+ WIKKQMLQL+EQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH
        LDNE+RVLQLK+KEMELESKRS+S+ GP +  DRIQGREQ+DLG H
Subjt:  LDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH

A0A6J1H0P0 uncharacterized protein LOC1114593381.9e-21386.47Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K LGHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFSKGNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P E+RGRDK SADDGPLW   +AQNEF+GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

A0A6J1K4Q0 uncharacterized protein LOC1114907379.6e-21385.78Show/hide
Query:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE
        MDSSGLGGGFLSG GGLLDLESPIRRHQQTQL+N SLTHRHHL MM++ E      GI+D+K +GHKDLSMTF KGK IASG        SEEDEPSFTE
Subjt:  MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFED-----GILDSKSLGHKDLSMTFNKGKTIASG--------SEEDEPSFTE

Query:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT
        DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGD+GEA M  KRKSGILQKKGKWK+VSKI+ISKGCHVSPQQCEDKFNDLNKRYKRLND+LGRGT
Subjt:  DGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGT

Query:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW
        SC+VVENPALMDSMPHLSSK K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVD QGKI PV NFS+GNNESEEADDSDSDSDESDNE DHYPEENRLW
Subjt:  SCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLW

Query:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        P ++RGRDK SADDGPLW N +AQNE +GQIDVFLSD TK Q ERRDWIKKQMLQL+EQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG
        DNERRVLQLKQKEMELE KRS+SSFGPT+GIDRIQG
Subjt:  DNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.6e-7941.82Show/hide
Query:  GGGFLSGTG--GLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFEDGILDSKSLG-----HKDLSMTFNKGKTIA----SGSEEDEPSFTE---DGECT
        GG   SG    G  DL+  +R H Q  +   +  HRH+ N      +G+  +   G     H++ +M+ ++ +       S S++DEPSFTE   DG   
Subjt:  GGGFLSGTG--GLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFEDGILDSKSLG-----HKDLSMTFNKGKTIA----SGSEEDEPSFTE---DGECT

Query:  EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVV
        E  +  KGSPWQR+KWTD +V+LLI  V+ +GD+       +RK  +LQKKGKWK VSK++  +G HVSPQQCEDKFNDLNKRYK+LND+LGRGTSC+VV
Subjt:  EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVV

Query:  ENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEA------DDSDSDSDESDNEGDHYPEENRL
        ENPAL+DS+ +L+ K K+DVRKI+SSKHLFY+EMC+YHNG  +    D+ LQ +   +A  S+ +++++++      D  D D D   +E D Y E++  
Subjt:  ENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEA------DDSDSDSDESDNEGDHYPEENRL

Query:  W------------PPEARGRDKVSADDG--PLWPNNAAQNEFD------GQIDVFL--SDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKW
        +             P  + R  +S +DG  P   N+   N+         Q DV    ++S ++   ++ W++ + LQLEEQ +  Q + +ELEKQRF+W
Subjt:  W------------PPEARGRDKVSADDG--PLWPNNAAQNEFD------GQIDVFL--SDSTKSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKW

Query:  LRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELE
         R+  K+ ++LER+R+ENERMKL+N+R  L+LKQ+E+ +E
Subjt:  LRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)7.6e-6139.47Show/hide
Query:  RHHLNMMSSFEDGILDSKSLGHKDLSMTFNKGKTIASGSEEDEPS-FTEDGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSG
        +HH N  S  + G  ++    H ++    ++GK   S SE+DE    + DG+     K K+ SPWQR+KW D +V+L+I  ++ +G++  +     +K  
Subjt:  RHHLNMMSSFEDGILDSKSLGHKDLSMTFNKGKTIASGSEEDEPS-FTEDGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEASMWPKRKSG

Query:  ILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGC
        +LQKKGKW+ VSK++  +G HVSPQQCEDKFNDLNKRYK+LN++LGRGTSC+VVENP+L+D + +L+ K K++VR+I+SSKHLFY+EMC+YHNG  +   
Subjt:  ILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNGQTIPGC

Query:  QDVDLQGKIFPVA--------NFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLWPPEARGRDKV----SADDGPLWPNNAAQNEFDGQIDVFLSDST
         D  +Q  +  +         N   G +++E+ DD D   ++ D      P + RL   +++  + V       D P  P + A    D    + L DS 
Subjt:  QDVDLQGKIFPVA--------NFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLWPPEARGRDKV----SADDGPLWPNNAAQNEFDGQIDVFLSDST

Query:  KSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEM
        K+   +R  I+ + L+LE + +  QA+ +ELE+Q+FKW  +  ++ + L ++R+ENERMKL+NER  L+LK+ E+
Subjt:  KSQCERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEM

AT3G10040.1 sequence-specific DNA binding transcription factors3.1e-6238.79Show/hide
Query:  NKGKTIASGSEEDEPSFTEDGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEAS--MWPKRKS----------GILQKKGKWKVVSKILISK
        ++G    SG   ++ + T DG+       +K S W RMKWTD +VRLLI  V  +GDE   +  +  K+K+          G+LQKKGKWK VS+ ++ K
Subjt:  NKGKTIASGSEEDEPSFTEDGECTEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDEGEAS--MWPKRKS----------GILQKKGKWKVVSKILISK

Query:  GCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNG----------------QTIP----
        G  VSPQQCEDKFNDLNKRYKR+ND+LG+G +C+VVEN  L++SM HL+ K K++V+K+L+SKHLF++EMCAYHN                  +IP    
Subjt:  GCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVVENPALMDSMPHLSSKAKEDVRKILSSKHLFYKEMCAYHNG----------------QTIP----

Query:  --GCQDVDLQGKIFPVANFSKGNN--ESEEADDSDSDSDESDNEGDHYPEENRLWPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCER
           C      GK+  +A   +     ES+ A+DS+S+ +ES+ E             E R + ++S           A      +    + D  KS  E+
Subjt:  --GCQDVDLQGKIFPVANFSKGNN--ESEEADDSDSDSDESDNEGDHYPEENRLWPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQCER

Query:  RDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELESKRSNSS
        ++WI+++ML++EE+ I ++ + VE+EKQR KW+RY SKK R++E+ +L+N+R +L+ ER +L L++ E+EL   +S+ +
Subjt:  RDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELESKRSNSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAGGTCTGGGAGGTGGATTTCTGTCAGGAACTGGGGGGTTATTAGATCTGGAGTCTCCTATTCGAAGACATCAACAAACCCAATTGGTCAATCCCTCATT
GACTCACCGCCATCACTTGAACATGATGAGTTCTTTCGAAGATGGAATTTTGGATTCCAAAAGTTTGGGACACAAAGATCTATCAATGACCTTCAATAAAGGGAAAACTA
TTGCCTCTGGTAGTGAAGAAGATGAGCCAAGTTTTACTGAGGATGGTGAATGCACTGAGTTTTTGAAGGGAAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGAT
GACATTGTGAGGCTTCTTATAGCAGTGGTTGCTTGTGTCGGTGATGAGGGCGAGGCCAGTATGTGGCCGAAGAGAAAATCTGGGATTTTGCAAAAAAAGGGCAAATGGAA
AGTTGTGTCGAAGATTCTGATAAGTAAGGGGTGTCATGTTTCTCCCCAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGCGATTGAATGATCTTCTTG
GGAGGGGAACCAGTTGTAAGGTTGTGGAGAACCCTGCTCTCATGGATTCAATGCCTCACCTCTCAAGTAAAGCCAAGGAGGATGTCAGAAAAATATTAAGCTCTAAGCAC
TTGTTTTATAAGGAAATGTGTGCTTACCATAATGGGCAAACAATTCCTGGTTGCCAGGATGTTGATTTACAAGGTAAGATTTTTCCGGTTGCAAATTTCTCCAAAGGAAA
TAATGAGTCTGAAGAGGCTGATGACAGTGATAGTGATAGTGATGAATCAGATAATGAAGGTGATCACTATCCTGAGGAAAATAGATTGTGGCCGCCTGAAGCTCGTGGCA
GGGATAAAGTGAGTGCTGATGATGGTCCTCTTTGGCCAAACAATGCTGCACAAAATGAATTCGATGGTCAAATTGATGTTTTTCTTTCGGATTCAACAAAATCCCAATGT
GAGCGTAGGGACTGGATTAAAAAGCAGATGCTCCAACTTGAGGAACAATGTATTAGCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATA
TTGCAGTAAGAAAAGTCGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAACGAGCGGAGGGTACTGCAACTGAAGCAGAAGGAAATGGAACTGG
AATCCAAAAGGTCTAATTCATCCTTTGGGCCAACCATTGGTATTGATAGAATTCAAGGAAGAGAGCAAATTGATTTGGGTAGGCATTGA
mRNA sequenceShow/hide mRNA sequence
GCTGGGTGGAATTTACGAGGCATACGCCGTACAACAACTAGCCAAATGGTCTTCGTTTTTCTCCGTAGCTTTAAATCTCAAGAACAAAAGCGCAAAGTTTTTCCCGGAGT
TATCTTCGATGGTCATCGAGTTGCAGGTCGTAGAGGAGTTAATTTTCTTCAACTTTAGATCATCCAAACTGCAAATTTAATTTCTTCCAGCGATTCCGAATAGTTAACCA
GGTTGTAAGTATGCTTTTTTCCTGTATCCTACACAAACCCTGTCTAGTTTTTCCTTGTTTTTTGTGTATGGTTTAGTGATATCCGCCATGGGGTTTTCTGGAATTCACGG
GTTGCACTTGATTTATGGACTAAGAAAGTGGATTACTATGAAGGAATATGAAAATTGGCTTTTCGTTCATGCTAAATTTGTAACTATTTCTTAGAATGATTTGGATTTGA
TCAAGCTTTAAGTAGTAGAGTTCCTATTTGTACTTGTGAACTCCTTTATCTGCTACAAAAAAAAATTGAGGGTTTCTGGTATTAGAGGCTGGAAAATGGATAGTTCAGGT
CTGGGAGGTGGATTTCTGTCAGGAACTGGGGGGTTATTAGATCTGGAGTCTCCTATTCGAAGACATCAACAAACCCAATTGGTCAATCCCTCATTGACTCACCGCCATCA
CTTGAACATGATGAGTTCTTTCGAAGATGGAATTTTGGATTCCAAAAGTTTGGGACACAAAGATCTATCAATGACCTTCAATAAAGGGAAAACTATTGCCTCTGGTAGTG
AAGAAGATGAGCCAAGTTTTACTGAGGATGGTGAATGCACTGAGTTTTTGAAGGGAAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGACATTGTGAGGCTT
CTTATAGCAGTGGTTGCTTGTGTCGGTGATGAGGGCGAGGCCAGTATGTGGCCGAAGAGAAAATCTGGGATTTTGCAAAAAAAGGGCAAATGGAAAGTTGTGTCGAAGAT
TCTGATAAGTAAGGGGTGTCATGTTTCTCCCCAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGCGATTGAATGATCTTCTTGGGAGGGGAACCAGTT
GTAAGGTTGTGGAGAACCCTGCTCTCATGGATTCAATGCCTCACCTCTCAAGTAAAGCCAAGGAGGATGTCAGAAAAATATTAAGCTCTAAGCACTTGTTTTATAAGGAA
ATGTGTGCTTACCATAATGGGCAAACAATTCCTGGTTGCCAGGATGTTGATTTACAAGGTAAGATTTTTCCGGTTGCAAATTTCTCCAAAGGAAATAATGAGTCTGAAGA
GGCTGATGACAGTGATAGTGATAGTGATGAATCAGATAATGAAGGTGATCACTATCCTGAGGAAAATAGATTGTGGCCGCCTGAAGCTCGTGGCAGGGATAAAGTGAGTG
CTGATGATGGTCCTCTTTGGCCAAACAATGCTGCACAAAATGAATTCGATGGTCAAATTGATGTTTTTCTTTCGGATTCAACAAAATCCCAATGTGAGCGTAGGGACTGG
ATTAAAAAGCAGATGCTCCAACTTGAGGAACAATGTATTAGCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAG
TCGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAACGAGCGGAGGGTACTGCAACTGAAGCAGAAGGAAATGGAACTGGAATCCAAAAGGTCTA
ATTCATCCTTTGGGCCAACCATTGGTATTGATAGAATTCAAGGAAGAGAGCAAATTGATTTGGGTAGGCATTGAAAAGCACATAAAAAACTGAAAAGGCCATCCATGTTC
AGAGCTGTTGTGTTGCACGAGATTTTGCTAATCATTGTGGAACAAAACCTTTTAGCCCAATGCATGTTATCATATTAACCTTGTAAGCATGTTTATACTTTTGTTTCATT
TGAGGATGGCCTTCTTCTTATTGGTCATTGGAGCTTCTGTTGCAATGATTAATGTTAAGTGGTTCAGTTTCCAATAAAAAAGATTCCGTTTCTCGAAGTGCACCCCATTT
CTTCTCAAGGATATAAATTTGTGTGATCATATGGATTTATGTTAATAGTCTCGTTGAACTAGTTTTTCTGAAGAACAATGTTGAAGTTCTTACTCCATCTGTGGACCATC
TCAGACTCTCAGTTTTGGAGGTAACCAATCACCAATGATCGCAGATCTTAAAAGCTATGAGAGAGAGAGAGAGAGGCTCCCTTTGAAAAGTGAAGGGACAACAAAATCAT
GGCAATGGATGAACTCTCCTTCCTTACAATGTCAGTAGCAGTTTAACTGAGGAAACCAGTTTCATGTGGGGTGCTCATTTTGTTATGAGAAAAAAGTTTTCAAATTATGA
GCTCTTAAGTCTGATCTTTTGACTTGGGTTTCCCAACAAATACCATTCTGAGAAACATACTGTTTAGTTGAAATTAATTGGAATTTTCTTCAACTTTGTGGTGAAATTAA
AGAGGCCCATTTCATTTGTTTGTTGATTTTTATACGAAAGATGCCA
Protein sequenceShow/hide protein sequence
MDSSGLGGGFLSGTGGLLDLESPIRRHQQTQLVNPSLTHRHHLNMMSSFEDGILDSKSLGHKDLSMTFNKGKTIASGSEEDEPSFTEDGECTEFLKGKKGSPWQRMKWTD
DIVRLLIAVVACVGDEGEASMWPKRKSGILQKKGKWKVVSKILISKGCHVSPQQCEDKFNDLNKRYKRLNDLLGRGTSCKVVENPALMDSMPHLSSKAKEDVRKILSSKH
LFYKEMCAYHNGQTIPGCQDVDLQGKIFPVANFSKGNNESEEADDSDSDSDESDNEGDHYPEENRLWPPEARGRDKVSADDGPLWPNNAAQNEFDGQIDVFLSDSTKSQC
ERRDWIKKQMLQLEEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELESKRSNSSFGPTIGIDRIQGREQIDLGRH