; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy07g018530 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy07g018530
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionSequence-specific DNA binding transcription factor
Genome locationChr07:47097982..47099313
RNA-Seq ExpressionLcy07g018530
SyntenyLcy07g018530
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605907.1 hypothetical protein SDJN03_03224, partial [Cucurbita argyrosperma subsp. sororia]1.9e-22691.06Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQ +GIMDTK LG KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFSK N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PAESRGRDK SA DGPLWS ++AQNEFEGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]3.8e-22791.28Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK LG KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFSK N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PAESRGRDK SA DGPLWS ++AQNEFEGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_022995089.1 uncharacterized protein LOC111490737 [Cucurbita maxima]1.9e-22690.6Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK +G KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFS+ N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PA+SRGRDK SA DGPLWSN++AQNE EGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_023534092.1 uncharacterized protein LOC111795758 [Cucurbita pepo subsp. pepo]4.2e-22690.83Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T E DHQS+GIMDTK LG KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC++FLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFSK N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PAESRGRDK SA DGPLWS ++AQNEFEGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]3.6e-22588.14Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNN-SSEEDEPCFTE
        MDSSGLGGGFLSG+GG++DLESPIRR Q+TQLVNPSLTHRHHLNMMSTFEGDH S+G +DTKSLGQKDL M+F+KGKAIASG +TNNN +SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNN-SSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG G KRKSGILQKKGKWK +SK+M+SKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENR
        SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PVANFSK N+ES+EA+DSDSDSD  ESDNEDDH PVENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENR

Query:  LWPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERM
        LWP+ESRGRDKVSA DGPLWSNS A+NEFEG+I+VFLSDPTKSQWER+DW++KQMLQLQEQC +FQAQSVELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERM

Query:  KLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH
        KLDNERRVLQLKQKEMELE KR DSSFGPTLGIDRIQGRE++DLGRH
Subjt:  KLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH

TrEMBL top hitse value%identityAlignment
A0A1S3BM36 uncharacterized protein LOC1034915224.5e-21886.55Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED
        MDSSGLGGGFLSG+GG+LDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKDL M+F++GKAIAS  +TNN +SEEDEP +TED
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKW+ VSK+M SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI P ANFSK N+ESEEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL

Query:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W +ESRGRDKVSA DGPLWSNS  +NEFEGQI+VFLSDPTKSQWERK WIKKQMLQLQEQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGRE++DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH

A0A5A7TE21 Stress response protein nst1 isoform X14.5e-21886.55Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED
        MDSSGLGGGFLSG+GG+LDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKDL M+F++GKAIAS  +TNN +SEEDEP +TED
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKW+ VSK+M SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI P ANFSK N+ESEEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL

Query:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W +ESRGRDKVSA DGPLWSNS  +NEFEGQI+VFLSDPTKSQWERK WIKKQMLQLQEQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGRE++DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH

A0A5D3BB81 Stress response protein nst1 isoform X12.0e-21886.77Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED
        MDSSGLGGGFLSG+GG+LDLESPIRR Q+TQLVNPSLT RH LNMMS FEGDHQSIGI+D+KSLGQKDL M+F++GKAIAS  +TNN +SEEDEP +TED
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKWK VSK+M SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI P ANFSK N+ESEEA+DSDSDSD  ESDNEDDH P ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSD--ESDNEDDHYPVENRL

Query:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK
        W +ESRGRDKVSA DGPLWSNS  +NEFEGQI+VFLSDPTKSQWERK WIKKQMLQLQEQC SFQAQSVELEKQRFKWLRYCSKK+RDLER RLENERMK
Subjt:  WPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGRE++DLG H
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDLGRH

A0A6J1H0P0 uncharacterized protein LOC1114593381.8e-22791.28Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK LG KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFSK N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PAESRGRDK SA DGPLWS ++AQNEFEGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

A0A6J1K4Q0 uncharacterized protein LOC1114907379.1e-22790.6Show/hide
Query:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE
        MDSSGLGGGFLSG+GG+LDLESPIRRHQQTQL+N SLTHRHHL MM+T EGDHQS+GIMDTK +G KDLSM+F+KGKAIASG VTNN N+SEEDEP FTE
Subjt:  MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNN-NSSEEDEPCFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWKMVSK+MISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQD+DFQGKI PV NFS+ N+ESEEADDSDSDSDESDNEDDHYP ENRLW
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKI-PVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLW

Query:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL
        PA+SRGRDK SA DGPLWSN++AQNE EGQI+VFLSDPTK QWER+DWIKKQMLQLQEQC+SFQAQS ELEKQRFKWLRYCSKKSRDLER+RLENERMK+
Subjt:  PAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors9.7e-8042.51Show/hide
Query:  MDSSGLGGGFL---SGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQ-----KDLSMSFSKGKAIASGSVTNNNSSEE
        MD +   GG +   + S G  DL+  +R H Q  +   +  HRH+ N     E      G+  T   GQ     ++ +MS S+ +         N+ S++
Subjt:  MDSSGLGGGFL---SGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQ-----KDLSMSFSKGKAIASGSVTNNNSSEE

Query:  DEPCFTE---DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYK
        DEP FTE   DG  +E  +  KGSPWQR+KWTD +V+LLI  V+ +GDD       +RK  +LQKKGKWK VSKVM  +G HVSPQQCEDKFNDLNKRYK
Subjt:  DEPCFTE---DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYK

Query:  RLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKIPVANFSKENDESEEA----------DDSDS
        +LND+LGRGTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+  Q  + +A  S+++ +++++          +D D 
Subjt:  RLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKIPVANFSKENDESEEA----------DDSDS

Query:  DSDESDN-EDDHYPVENRLWPAESRG-------RDKVSAYDG--PLWSNSAAQNEFE-GQIEVFLSDPTKSQWE-------RKDWIKKQMLQLQEQCISF
        D DE D  E+ HY   +        G       R  +S  DG  P   NS   N+    QI    +D  +   E       +K W++ + LQL+EQ +  
Subjt:  DSDESDN-EDDHYPVENRLWPAESRG-------RDKVSAYDG--PLWSNSAAQNEFE-GQIEVFLSDPTKSQWE-------RKDWIKKQMLQLQEQCISF

Query:  QAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELE
        Q + +ELEKQRF+W R+  K+ ++LER+R+ENERMKL+N+R  L+LKQ+E+ +E
Subjt:  QAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.8e-6242.51Show/hide
Query:  SEEDEPC-FTEDGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRY
        SE+DE C  + DG+     K K+ SPWQR+KW D +V+L+I  ++ +G+D     G  +K  +LQKKGKW+ VSKVM  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPC-FTEDGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRY

Query:  KRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKIPVANFSK----ENDESEEADDSDSDSDES
        K+LN++LGRGTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D   Q  + +         +NDE  +  + D D D+ 
Subjt:  KRLNDILGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKIPVANFSK----ENDESEEADDSDSDSDES

Query:  DNEDDHYPVENRLWP--AESRGRDKV----SAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRY
          ED    + +R      +S+  + V      YD P    S  Q +    I +   D  K+   ++  I+ + L+L+ + +  QA+ +ELE+Q+FKW  +
Subjt:  DNEDDHYPVENRLWP--AESRGRDKV----SAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRY

Query:  CSKKSRDLERVRLENERMKLDNERRVLQLKQKEM
          ++ + L ++R+ENERMKL+NER  L+LK+ E+
Subjt:  CSKKSRDLERVRLENERMKLDNERRVLQLKQKEM

AT3G10040.1 sequence-specific DNA binding transcription factors3.0e-6542.02Show/hide
Query:  KKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----LKRKS----------GILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDIL
        +K S W RMKWTD +VRLLI  V  +GD  EAG+      K+K+          G+LQKKGKWK VS+ M+ KG  VSPQQCEDKFNDLNKRYKR+NDIL
Subjt:  KKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----LKRKS----------GILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDIL

Query:  GRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN------------------GQTIPGCQDIDF-------QGKIPVANFSKENDE
        G+G +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN                     IP  Q   F         +I      +E  E
Subjt:  GRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN------------------GQTIPGCQDIDF-------QGKIPVANFSKENDE

Query:  SEEADDSDSDSDESDNEDDHYPVENRLWPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEK
        S+ A+DS+S+ +ES+ E             E+R + ++          S A      +    + D  KS WE+K+WI+++ML+++E+ I ++ + VE+EK
Subjt:  SEEADDSDSDSDESDNEDDHYPVENRLWPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIEVFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEK

Query:  QRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMEL-EFKRSDSSFGPT
        QR KW+RY SKK R++E+ +L+N+R +L+ ER +L L++ E+EL E + S +   P+
Subjt:  QRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMEL-EFKRSDSSFGPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAGGTCTGGGAGGTGGATTTCTGTCAGGAAGTGGGGGGATATTAGATCTGGAATCTCCTATTCGAAGACATCAACAGACCCAATTGGTCAATCCCTCATT
GACACACCGACATCACTTGAACATGATGAGTACTTTTGAAGGTGATCACCAGTCCATTGGGATTATGGACACGAAAAGCTTGGGACAGAAAGATCTATCGATGAGTTTCT
CTAAAGGGAAAGCTATTGCCTCTGGTAGTGTCACAAACAACAACTCGAGTGAAGAAGACGAGCCATGTTTTACTGAGGATGGTGAGTGCTCTGAGTTTTTGAAGGGAAAA
AAGGGCTCTCCGTGGCAGAGAATGAAGTGGACAGATGACATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAATGGGCCTGAAGAG
AAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAATGGTGTCGAAGGTTATGATAAGTAAAGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACT
TAAACAAAAGATACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCGAGTAAAGCC
AAGGATGATGTCAGAAAGATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATATTGATTTCCAAGG
TAAGATTCCTGTTGCGAATTTCTCCAAAGAAAATGACGAGTCAGAAGAGGCTGATGACAGTGACAGTGACAGTGATGAATCAGATAATGAAGATGATCACTATCCTGTGG
AAAATAGATTGTGGCCAGCTGAATCTCGTGGCAGGGATAAAGTGAGTGCATATGATGGTCCCCTTTGGTCAAACTCGGCTGCACAAAATGAATTTGAAGGTCAAATTGAA
GTTTTTCTTTCGGATCCAACGAAGTCACAATGGGAGCGCAAAGATTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTATCAGCTTCCAAGCTCAATCTGTTGA
ACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAGAGTAGGGATTTGGAAAGAGTGAGGCTTGAAAACGAGAGGATGAAACTAGATAATGAGCGGAGAG
TACTGCAACTGAAGCAGAAGGAAATGGAACTGGAATTCAAAAGGTCTGATTCATCCTTTGGGCCAACCCTTGGCATCGACAGAATTCAAGGGAGAGAGAAAATTGATTTG
GGTCGGCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAGTTCAGGTCTGGGAGGTGGATTTCTGTCAGGAAGTGGGGGGATATTAGATCTGGAATCTCCTATTCGAAGACATCAACAGACCCAATTGGTCAATCCCTCATT
GACACACCGACATCACTTGAACATGATGAGTACTTTTGAAGGTGATCACCAGTCCATTGGGATTATGGACACGAAAAGCTTGGGACAGAAAGATCTATCGATGAGTTTCT
CTAAAGGGAAAGCTATTGCCTCTGGTAGTGTCACAAACAACAACTCGAGTGAAGAAGACGAGCCATGTTTTACTGAGGATGGTGAGTGCTCTGAGTTTTTGAAGGGAAAA
AAGGGCTCTCCGTGGCAGAGAATGAAGTGGACAGATGACATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGAATGGGCCTGAAGAG
AAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAATGGTGTCGAAGGTTATGATAAGTAAAGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACT
TAAACAAAAGATACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCGAGTAAAGCC
AAGGATGATGTCAGAAAGATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATATTGATTTCCAAGG
TAAGATTCCTGTTGCGAATTTCTCCAAAGAAAATGACGAGTCAGAAGAGGCTGATGACAGTGACAGTGACAGTGATGAATCAGATAATGAAGATGATCACTATCCTGTGG
AAAATAGATTGTGGCCAGCTGAATCTCGTGGCAGGGATAAAGTGAGTGCATATGATGGTCCCCTTTGGTCAAACTCGGCTGCACAAAATGAATTTGAAGGTCAAATTGAA
GTTTTTCTTTCGGATCCAACGAAGTCACAATGGGAGCGCAAAGATTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTATCAGCTTCCAAGCTCAATCTGTTGA
ACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAGAGTAGGGATTTGGAAAGAGTGAGGCTTGAAAACGAGAGGATGAAACTAGATAATGAGCGGAGAG
TACTGCAACTGAAGCAGAAGGAAATGGAACTGGAATTCAAAAGGTCTGATTCATCCTTTGGGCCAACCCTTGGCATCGACAGAATTCAAGGGAGAGAGAAAATTGATTTG
GGTCGGCATTGA
Protein sequenceShow/hide protein sequence
MDSSGLGGGFLSGSGGILDLESPIRRHQQTQLVNPSLTHRHHLNMMSTFEGDHQSIGIMDTKSLGQKDLSMSFSKGKAIASGSVTNNNSSEEDEPCFTEDGECSEFLKGK
KGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGLKRKSGILQKKGKWKMVSKVMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKA
KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDIDFQGKIPVANFSKENDESEEADDSDSDSDESDNEDDHYPVENRLWPAESRGRDKVSAYDGPLWSNSAAQNEFEGQIE
VFLSDPTKSQWERKDWIKKQMLQLQEQCISFQAQSVELEKQRFKWLRYCSKKSRDLERVRLENERMKLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREKIDL
GRH