; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007530 (gene) of Chayote v1 genome

Gene IDSed0007530
OrganismSechium edule (Chayote v1)
Descriptiontranscription factor EMB1444
Genome locationLG07:3591131..3597880
RNA-Seq ExpressionSed0007530
SyntenySed0007530
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR025610 - Transcription factor MYC/MYB N-terminal
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR043561 - Transcription factor LHW-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025488.1 Transcription factor EMB-like protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.65Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLCCNSEWKYAVFWKLKHRARMVLTWEDGYY NSEQH   +SKFY KT+EKFHDG YSHD LGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSSTLEYCDGW TQFSAGIKTIVV AVVPHGVLQLGSLDKVTEDVNLV RIRNVFLTLQESSAGH      C+SSGY+ADIPSRSLATEKNEVEM+S +
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G I+S + KPDAINVDS KSQVRLLDDRIC GEPSGCKDI VGLKQ INV SQNS M M+N+ G   PAEKIIT+EAYFPMNPH SSV  G+
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
         H GM S+ NPS +YLQNDVEASETI++Y SN+SLKFPAGYELHEVLGPAFLKDALYLDW+ EYVFGG+ F+LSEGM+GS LTSDS  EHLLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HSGSDVKSDTSLCKSGQSLLTTE+ P+PSTNV TSACSEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKLNKCANMKLHQK++GMLGSSSTDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_011659263.1 transcription factor bHLH155 isoform X1 [Cucumis sativus]0.0e+0087.5Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P + KF+RKTLE F+DG+YSHD LGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSST+EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLV RIRNVFLTLQESSAG       CKSSGYMADIPSRSLATEK EV  +S N
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VG+ELSG  + ES  TKPD INV++ KSQVRLLDDR+C GEPSGCKD  VGLKQ INV+SQNS MDM+NICGN+LPAEKI+TN+AYF MNPH SS +DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
        NH+GM+ RTN + +YLQND+EASETI+MY SNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EYV GG+ F+LSEGMSGSQLTSDS  E LLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HSGSDVKSDTSLCKSGQSLLTTE+ P+PSTNV TSACSEGYSMGQSQTSF GEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++SSEPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKL KCANMKLHQK SGMLG+S TDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_016901084.1 PREDICTED: transcription factor EMB1444 [Cucumis melo]0.0e+0086.32Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P + KF+RKTLE F+DG+YSHDPLGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSST+EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLV RIRN FLTLQESSAG       CKSSGY+           K E   +S N
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G  ES KTKPDAINV+S KSQVRLLDDRIC GEPSGCKD  VGLKQ INV+SQ+SAMDM+NICGN+LPAEKI+TN AYFPMNPH SSV+DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
        NH+GM++RTN + +YLQND+EAS+TIDMY SN SLKFPAGYELHEVLGPAFLKDALYLDWQ EYV GG+ F+LSEGMSGSQLTSDS  E LLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HS SDVKSDTSLCKSGQSLLTTE+ P+PSTN  TSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++S EPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKL KCANMKLHQKE+GMLG+S+TDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_022155842.1 transcription factor EMB1444-like isoform X1 [Momordica charantia]0.0e+0086.34Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P +SKF+RKTLEKFHDG++SHDPLGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTED+NLV RIRN+FLTLQESSAGH      C SSGYMADI ++SL TEKNEVEM+S +
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G  ES KTKPDA  V  LKSQVR +DDR+C GEPSGCKD+ VGLK  ++VR QNS MDM+NICGN+LPAEKI+TN+A FPMN H SS  DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASE---TIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVA
        NH+GM  RTNP+ + L+NDVEA E     DMY SNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EY FG + F+LSEGMSGSQLTSDS  E LLEAVVA
Subjt:  NHDGMYSRTNPSVIYLQNDVEASE---TIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVA

Query:  DVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKR
        DVCHSGSDVKS+TSLCKSGQSLLTTE+ P+PSTN+ TSACSEGYSMGQSQ+SFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKN+KR
Subjt:  DVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKR

Query:  RAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCS
        RAKPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKLNKCANMKLHQKE+GMLGSS+ DQGSSWAVEVGGQLKVCS
Subjt:  RAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCS

Query:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSN
        IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNR+IHRMDILWSLVQILQRSN
Subjt:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSN

XP_022959660.1 transcription factor EMB1444-like [Cucurbita moschata]0.0e+0087.5Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH   +SKFY KT+EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSSTLEYCDGWQTQFSAGIKTIVV AVVPHGVLQLGSLDKV EDVNLV RIRNVFLTLQESSAGH      C+SSGY+ADIPSRSLATEKNEVEM+S +
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G I+S + KPDAINVDS KSQVRLLDDRIC GEPSGCKDI VGLKQ INV SQNS M M+N+ G   PAEKIIT+EAYFPMNP  SSV  G+
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
         H GM S+TNPS +YLQNDVEASETI++Y SN+SLKFPAGYELHEVLGPAFLKDALYLDW+ EYVFGG+ F+LSEG +GS LTSDS  EHLLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HSGSDVKSDTSLCKSGQSLLTTE+ P+PSTNV TSACSEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKLNKCANMKLHQK++GMLGSSSTDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHGDKTWICFVVEGENNRN+HRMDILWSLVQILQRSNTM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

TrEMBL top hitse value%identityAlignment
A0A0A0K751 BHLH domain-containing protein0.0e+0087.5Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P + KF+RKTLE F+DG+YSHD LGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSST+EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLV RIRNVFLTLQESSAG       CKSSGYMADIPSRSLATEK EV  +S N
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VG+ELSG  + ES  TKPD INV++ KSQVRLLDDR+C GEPSGCKD  VGLKQ INV+SQNS MDM+NICGN+LPAEKI+TN+AYF MNPH SS +DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
        NH+GM+ RTN + +YLQND+EASETI+MY SNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EYV GG+ F+LSEGMSGSQLTSDS  E LLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HSGSDVKSDTSLCKSGQSLLTTE+ P+PSTNV TSACSEGYSMGQSQTSF GEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++SSEPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKL KCANMKLHQK SGMLG+S TDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

A0A1S4DYM6 transcription factor EMB14440.0e+0086.32Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P + KF+RKTLE F+DG+YSHDPLGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSST+EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLV RIRN FLTLQESSAG       CKSSGY+           K E   +S N
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G  ES KTKPDAINV+S KSQVRLLDDRIC GEPSGCKD  VGLKQ INV+SQ+SAMDM+NICGN+LPAEKI+TN AYFPMNPH SSV+DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
        NH+GM++RTN + +YLQND+EAS+TIDMY SN SLKFPAGYELHEVLGPAFLKDALYLDWQ EYV GG+ F+LSEGMSGSQLTSDS  E LLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HS SDVKSDTSLCKSGQSLLTTE+ P+PSTN  TSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++S EPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKL KCANMKLHQKE+GMLG+S+TDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

A0A5A7V1K9 Transcription factor EMB14440.0e+0083.45Show/hide
Query:  MGTDHLLLPTVGPPIKRRAGLRRKQAGRGS----LCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVA
        MGTD LLLPTVGPPIKRRAGLRRKQAGR      L  N+   +  F+        VLTWEDGYYDNSEQH P + KF+RKTLE F+DG+YSHDPLGLAVA
Subjt:  MGTDHLLLPTVGPPIKRRAGLRRKQAGRGS----LCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVA

Query:  KMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIPNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH-
        KMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIPNFSST+EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLV RIRN FLTLQESSAG  
Subjt:  KMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIPNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH-

Query:  -----CKSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDM
             CKSSGY+           K E   +S NVGIELSG G  ES KTKPDAINV+S KSQVRLLDDRIC GEPSGCKD  VGLKQ INV+SQ+SAMDM
Subjt:  -----CKSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDM

Query:  INICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFG
        +NICGN+LPAEKI+TN AYFPMNPH SSV+DGVNH+GM++RTN + +YLQND+EAS+TIDMY SN SLKFPAGYELHEVLGPAFLKDALYLDWQ EYV G
Subjt:  INICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFG

Query:  GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGV
        G+ F+LSEGMSGSQLTSDS  E LLEAVVADVCHS SDVKSDTSLCKSGQSLLTTE+ P+PSTN  TSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGV
Subjt:  GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGV

Query:  MSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLH
        MSPKGFSSTYSGTGSEHL++S EPAKNSKRRA+PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKL KC+NMKLH
Subjt:  MSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLH

Query:  QKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDIL
        QKE+GMLG+S+TDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNRNIHRMDIL
Subjt:  QKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDIL

Query:  WSLVQILQRSNTM
        WSLVQILQRS+TM
Subjt:  WSLVQILQRSNTM

A0A6J1DQG8 transcription factor EMB1444-like isoform X10.0e+0086.34Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH P +SKF+RKTLEKFHDG++SHDPLGLAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTED+NLV RIRN+FLTLQESSAGH      C SSGYMADI ++SL TEKNEVEM+S +
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G  ES KTKPDA  V  LKSQVR +DDR+C GEPSGCKD+ VGLK  ++VR QNS MDM+NICGN+LPAEKI+TN+A FPMN H SS  DGV
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASE---TIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVA
        NH+GM  RTNP+ + L+NDVEA E     DMY SNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EY FG + F+LSEGMSGSQLTSDS  E LLEAVVA
Subjt:  NHDGMYSRTNPSVIYLQNDVEASE---TIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVA

Query:  DVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKR
        DVCHSGSDVKS+TSLCKSGQSLLTTE+ P+PSTN+ TSACSEGYSMGQSQ+SFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKN+KR
Subjt:  DVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKR

Query:  RAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCS
        RAKPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKLNKCANMKLHQKE+GMLGSS+ DQGSSWAVEVGGQLKVCS
Subjt:  RAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCS

Query:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSN
        IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHG+KTWICFVVEGENNR+IHRMDILWSLVQILQRSN
Subjt:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSN

A0A6J1H6X5 transcription factor EMB1444-like0.0e+0087.5Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH   +SKFY KT+EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV+GKHQWIT DEQIP
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN
        NFSSTLEYCDGWQTQFSAGIKTIVV AVVPHGVLQLGSLDKV EDVNLV RIRNVFLTLQESSAGH      C+SSGY+ADIPSRSLATEKNEVEM+S +
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGH------CKSSGYMADIPSRSLATEKNEVEMLSNN

Query:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV
        VGIELSG G I+S + KPDAINVDS KSQVRLLDDRIC GEPSGCKDI VGLKQ INV SQNS M M+N+ G   PAEKIIT+EAYFPMNP  SSV  G+
Subjt:  VGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGV

Query:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC
         H GM S+TNPS +YLQNDVEASETI++Y SN+SLKFPAGYELHEVLGPAFLKDALYLDW+ EYVFGG+ F+LSEG +GS LTSDS  EHLLEAVVADVC
Subjt:  NHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVC

Query:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK
        HSGSDVKSDTSLCKSGQSLLTTE+ P+PSTNV TSACSEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPAKNSKRRA+
Subjt:  HSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAK

Query:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV
        PGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGI KHADKLNKCANMKLHQK++GMLGSSSTDQGSSWAVEVGGQLKVCSIIV
Subjt:  PGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIV

Query:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHGDKTWICFVVEGENNRN+HRMDILWSLVQILQRSNTM
Subjt:  ENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

SwissProt top hitse value%identityAlignment
K4PW38 Protein RICE SALT SENSITIVE 32.1e-1731.28Show/hide
Query:  KRRAGLRRKQAGRGSLCCNSEWKYAVFWKLKHRAR---------------MVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSY
        K  AG+        ++C NS+W Y+VFW ++ R R               ++L WEDG              F R  + +  +     DP+  A +KMS 
Subjt:  KRRAGLRRKQAGRGSLCCNSEWKYAVFWKLKHRAR---------------MVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSY

Query:  HVYSLGEGIVGQVAVSGKHQWI-----TTDEQIPNF--SSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTL
         +Y+ GEG++G+VA    H+W+       +  I N+  SS       W  QF++GI+TI V+    HG+LQLGS   + ED++ V R+R++F +L
Subjt:  HVYSLGEGIVGQVAVSGKHQWI-----TTDEQIPNF--SSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTL

P0C7P8 Transcription factor EMB14444.3e-13543.51Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S+C N++W YAVFWKL H + MVLT ED Y  N E+    +S          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA+SG+HQWI + E + 
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------
        +  STL+  +GW++Q SAGIKTI++VAV   GV+QLGSL KV ED  LV  IR++FL L +  A H          S      IPS+ L           
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------

Query:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ
          A +   + ++S N                           G+E        S+       +V  +    K+QV + D  ++   E +G       L  
Subjt:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ

Query:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE
             S+N   +        + ++++    +Y  ++   S+V   +  D  YSR N    PS     I++++      E SE+  +     SL   +G E
Subjt:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE

Query:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE
        L E LGPAF K +   D+ +   F      + +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT +    +P  +   +  S 
Subjt:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE

Query:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE
          S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLE
Subjt:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE

Query:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI
         TIKHMLFLQ +++HADKL K A+ K+  K++G LG SST+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G 
Subjt:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI

Query:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        TE  G+KTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

Q58G01 Transcription factor bHLH1557.4e-13543.59Show/hide
Query:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI
        S C N++W YAVFW+L HR +RMVLT ED YYD    H+ ++              + +HDPLGLAVAKMSYHVYSLGEGIVGQVAVSG+HQW+   E  
Subjt:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI

Query:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------
         N +S  E+ + W++Q SAGIKTI+VVAV P GV+QLGSL KV EDVN V  IR++FL L++  A H                                 
Subjt:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------

Query:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE
                            +S     + PS  L  EK   +++     ++ S CGS  S  T    +++   K +      ++ D    G  SGCKD  
Subjt:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE

Query:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------
          L   +++  +N  ++  +     + AE++IT+++Y    P + S     +     S  +  V  L   Q +    ET  M   N  S +F        
Subjt:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------

Query:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT
          AG EL E LG AF +       Q+E +    G T + ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSLLT  +  +PS     
Subjt:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT

Query:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI
        +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSI
Subjt:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI

Query:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT
        DSLLERTIKHMLFLQ + KHA+KL+K AN K+ QKE+GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L 
Subjt:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT

Query:  ILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ
        IL+G TE  G+KTWICFV E +N++ + RMDILWSLVQI Q
Subjt:  ILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ

Q7XJU0 Transcription factor bHLH1574.1e-4526.47Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        SLC +  W YAVFW+      M+L +E+ Y D        D                           M      LG+GIVG+VA SG HQW+       
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTL-EYCDGWQTQFSAGIKTIV-------VVAVVP---HGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVE
         FS TL ++   +Q QF  G K ++        +A++P    GV+QLGS  K+ E   +   +      LQE+    C       D+ +   +    E+ 
Subjt:  NFSSTL-EYCDGWQTQFSAGIKTIV-------VVAVVP---HGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVE

Query:  MLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSS
           +  G        I +    P  ++ + + S+    +  +  G+  G            ++    S  D+  +  +  P E+  +           S 
Subjt:  MLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSS

Query:  VHDGVNHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAV
        V  GV                  D +  + + M +   ++  P      E++  +                      LS     S LT+      +    
Subjt:  VHDGVNHDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAV

Query:  VADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNS
             +     K DTS   S  SL   E+                     S++ +I +D ++S+                     G    +   E  K  
Subjt:  VADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNS

Query:  KRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKV
        K+RAK GES RPRP+DRQ+IQDRIKELR ++PNGAKCSID+LL+ TIKHM+F+Q +AK+A++L +    KL +++             +WA+EVG +  V
Subjt:  KRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKV

Query:  CSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ
        C I+VE LN+ G++ +EM+CEE   FLEI + +R LGL ILKG+ E    + W  F+V+ +    + R+ +L+SLVQ+ Q
Subjt:  CSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ

Q9XIN0 Transcription factor LHW1.7e-7031.17Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKM--SYHVYSLGEGIVGQVAVSGKHQWITTDEQ
        S+C N++W YAVFWK+  +   +L WE+ Y +      P       + L         ++ + L   +M  +  +  +GEG+VG+ A +G HQWI  +  
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKM--SYHVYSLGEGIVGQVAVSGKHQWITTDEQ

Query:  IPNFSSTL---EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVEMLSNNV
          +F+  +   E  +    QFSAGI+T+ V  VVPHGV+QLGS   + E++  V  ++ + L L                +P   L+      E  ++ +
Subjt:  IPNFSSTL---EYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVEMLSNNV

Query:  GIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVN
        G+ +S     + HK          L+S   + +        +G  D ++  +   N+  ++         G       +   E   P NP          
Subjt:  GIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVN

Query:  HDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGY-ELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSD------------SRA
            +   N S +   ++V+A+E   +   + S K   G  +L ++LG           W    +   RT  L+  +S  ++  +            S  
Subjt:  HDGMYSRTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGY-ELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSD------------SRA

Query:  EHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYS--GTGSEHLE
        +HLL+AVV+  C S   +  +TS  +S ++ LT   N        +S  +  +S  Q    F  +  Q  L  S V G          +S    GS  + 
Subjt:  EHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYS--GTGSEHLE

Query:  RSSEPAK--NSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSS
          +E AK  N+++R KPGE+ RPRP+DRQ+IQDR+KELRE++PNGAKCSID+LLERTIKHMLFLQ ++KH+DKL +    K+ +++ G         G++
Subjt:  RSSEPAK--NSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSS

Query:  WAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRS
        WA EVG +  VC I+VE++N      VEMLCE+   FLEIA+ IRSLGLTILKG+ E   DK W  F VE   +R++ RM+I   LV IL+++
Subjt:  WAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRS

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-13643.51Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S+C N++W YAVFWKL H + MVLT ED Y  N E+    +S          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA+SG+HQWI + E + 
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------
        +  STL+  +GW++Q SAGIKTI++VAV   GV+QLGSL KV ED  LV  IR++FL L +  A H          S      IPS+ L           
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------

Query:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ
          A +   + ++S N                           G+E        S+       +V  +    K+QV + D  ++   E +G       L  
Subjt:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ

Query:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE
             S+N   +        + ++++    +Y  ++   S+V   +  D  YSR N    PS     I++++      E SE+  +     SL   +G E
Subjt:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE

Query:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE
        L E LGPAF K +   D+ +   F      + +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT +    +P  +   +  S 
Subjt:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE

Query:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE
          S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLE
Subjt:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE

Query:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI
         TIKHMLFLQ +++HADKL K A+ K+  K++G LG SST+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G 
Subjt:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI

Query:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        TE  G+KTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

AT1G06150.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-13643.51Show/hide
Query:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP
        S+C N++W YAVFWKL H + MVLT ED Y  N E+    +S          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA+SG+HQWI + E + 
Subjt:  SLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIP

Query:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------
        +  STL+  +GW++Q SAGIKTI++VAV   GV+QLGSL KV ED  LV  IR++FL L +  A H          S      IPS+ L           
Subjt:  NFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------KSSGYMADIPSRSL-----------

Query:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ
          A +   + ++S N                           G+E        S+       +V  +    K+QV + D  ++   E +G       L  
Subjt:  --ATEKNEVEMLSNNV--------------------------GIELSGCGSIESHKTKPDAINVDSL----KSQVRLLD-DRICEGEPSGCKDIEVGLKQ

Query:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE
             S+N   +        + ++++    +Y  ++   S+V   +  D  YSR N    PS     I++++      E SE+  +     SL   +G E
Subjt:  TINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTN----PS----VIYLQN----DVEASETIDMYTSNTSLKFPAGYE

Query:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE
        L E LGPAF K +   D+ +   F      + +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT +    +P  +   +  S 
Subjt:  LHEVLGPAFLKDALYLDWQQEYVF-GGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEK--NPKPSTNVATSACSE

Query:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE
          S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLE
Subjt:  GYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLE

Query:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI
         TIKHMLFLQ +++HADKL K A+ K+  K++G LG SST+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G 
Subjt:  RTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI

Query:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        TE  G+KTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  TEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

AT2G31280.1 conserved peptide upstream open reading frame 75.2e-13643.59Show/hide
Query:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI
        S C N++W YAVFW+L HR +RMVLT ED YYD    H+ ++              + +HDPLGLAVAKMSYHVYSLGEGIVGQVAVSG+HQW+   E  
Subjt:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI

Query:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------
         N +S  E+ + W++Q SAGIKTI+VVAV P GV+QLGSL KV EDVN V  IR++FL L++  A H                                 
Subjt:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------

Query:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE
                            +S     + PS  L  EK   +++     ++ S CGS  S  T    +++   K +      ++ D    G  SGCKD  
Subjt:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE

Query:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------
          L   +++  +N  ++  +     + AE++IT+++Y    P + S     +     S  +  V  L   Q +    ET  M   N  S +F        
Subjt:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------

Query:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT
          AG EL E LG AF +       Q+E +    G T + ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSLLT  +  +PS     
Subjt:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT

Query:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI
        +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSI
Subjt:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI

Query:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT
        DSLLERTIKHMLFLQ + KHA+KL+K AN K+ QKE+GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L 
Subjt:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT

Query:  ILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ
        IL+G TE  G+KTWICFV E +N++ + RMDILWSLVQI Q
Subjt:  ILKGITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQ

AT2G31280.2 conserved peptide upstream open reading frame 75.3e-12041.59Show/hide
Query:  MGTDHLLLPTVGPPIKRRAGLRRKQAGRG-------SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLG
        MG   +    VGPPIK RAGLRR+QAGRG       S C N++W YAVFW+L HR +RMVLT ED YYD    H+ ++              + +HDPLG
Subjt:  MGTDHLLLPTVGPPIKRRAGLRRKQAGRG-------SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLG

Query:  LAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIPNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDK---------------VTEDVNL
        LAVAKMSYHVYSLGEGIVGQVAVSG+HQW+        F      C       ++  +TI+VVAV P GV+QLGSL K                + +V+ 
Subjt:  LAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQIPNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDK---------------VTEDVNL

Query:  VGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIEV
           +    +  Q  +    +S     + PS  L  EK   +++     ++ S CGS  S  T    +++   K +      ++ D    G  SGCKD   
Subjt:  VGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIEV

Query:  GLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP--------
         L   +++  +N  ++  +     + AE++IT+++Y    P + S     +     S  +  V  L   Q +    ET  M   N  S +F         
Subjt:  GLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP--------

Query:  -AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATS
         AG EL E LG AF +       Q+E +    G T + ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSLLT  +  +PS     +
Subjt:  -AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVATS

Query:  ACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSID
          +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSID
Subjt:  ACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSID

Query:  SLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVE-----------------------
        SLLERTIKHMLFLQ + KHA+KL+K AN K+ QKE+GM       QGSS AVEVGG L+V SIIVENLNK G +L+E                       
Subjt:  SLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVE-----------------------

Query:  -MLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVE
         MLCEEC HFLEIA  IRSL L IL+G TE  G+KTWICFV E
Subjt:  -MLCEECSHFLEIAEAIRSLGLTILKGITEAHGDKTWICFVVE

AT2G31280.3 conserved peptide upstream open reading frame 79.2e-13342.61Show/hide
Query:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI
        S C N++W YAVFW+L HR +RMVLT ED YYD    H+ ++              + +HDPLGLAVAKMSYHVYSLGEGIVGQVAVSG+HQW+   E  
Subjt:  SLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWITTDEQI

Query:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------
         N +S  E+ + W++Q SAGIKTI+VVAV P GV+QLGSL KV EDVN V  IR++FL L++  A H                                 
Subjt:  PNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHC--------------------------------

Query:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE
                            +S     + PS  L  EK   +++     ++ S CGS  S  T    +++   K +      ++ D    G  SGCKD  
Subjt:  --------------------KSSGYMADIPSRSLATEKNEVEMLSNNVGIELSGCGSIESHKTKPDAINVDSLKSQ----VRLLDDRICEGEPSGCKDIE

Query:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------
          L   +++  +N  ++  +     + AE++IT+++Y    P + S     +     S  +  V  L   Q +    ET  M   N  S +F        
Subjt:  VGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYSRTNPSVIYL---QNDVEASETIDMYTSN-TSLKFP-------

Query:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT
          AG EL E LG AF +       Q+E +    G T + ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSLLT  +  +PS     
Subjt:  --AGYELHEVLGPAFLKDALYLDWQQEYVFG--GRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKNPKPSTNVAT

Query:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI
        +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RAKPGESSRPRPRDRQLIQDRIKELRELVPNG+KCSI
Subjt:  SACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSI

Query:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT
        DSLLERTIKHMLFLQ + KHA+KL+K AN K+ QKE+GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L 
Subjt:  DSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT

Query:  ILKGITEAHGDKTWICFVVE-----------------GENNRNIHRMDILWSLVQILQ
        IL+G TE  G+KTWICFV E                  +N++ + RMDILWSLVQI Q
Subjt:  ILKGITEAHGDKTWICFVVE-----------------GENNRNIHRMDILWSLVQILQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACTGATCATTTGCTATTACCAACAGTTGGACCGCCAATAAAACGGCGAGCAGGGTTAAGGAGAAAGCAGGCTGGTAGAGGCTCCCTTTGTTGCAACTCGGAATG
GAAATATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGAATGGTGCTGACTTGGGAGGATGGCTACTATGACAATTCTGAACAACATTATCCCTCAGATAGCAAGTTTT
ACAGAAAAACTCTTGAGAAGTTTCATGATGGAAATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAGATGTCCTATCATGTATACTCTCTTGGGGAAGGGATTGTT
GGACAAGTAGCCGTTAGTGGAAAACATCAATGGATTACTACAGACGAGCAAATACCAAATTTCTCTTCAACACTTGAGTACTGCGATGGTTGGCAAACACAATTTTCAGC
TGGCATTAAGACCATTGTTGTTGTAGCAGTTGTCCCGCATGGTGTTTTACAGCTTGGATCTTTAGATAAAGTCACTGAGGATGTCAATCTTGTGGGTCGCATCAGAAATG
TCTTTCTAACTCTTCAAGAGTCTTCAGCTGGGCATTGTAAGAGCTCGGGATACATGGCAGACATTCCATCTAGAAGCTTAGCTACAGAGAAAAATGAAGTTGAAATGCTC
AGCAATAATGTTGGGATAGAGTTGTCAGGATGTGGGAGCATTGAATCTCACAAGACAAAGCCAGATGCTATCAATGTGGATAGTTTGAAGTCACAAGTGAGGTTGCTTGA
TGATAGGATATGTGAAGGGGAGCCTAGTGGGTGCAAAGATATCGAAGTTGGTTTGAAACAAACAATCAATGTGCGATCGCAAAACTCTGCAATGGATATGATTAATATAT
GTGGCAATATACTTCCAGCTGAGAAGATCATAACAAATGAAGCATACTTCCCTATGAATCCTCATGTCTCTTCTGTCCATGATGGAGTTAATCACGACGGGATGTATAGC
CGAACCAACCCTTCAGTAATATACTTGCAAAATGATGTGGAAGCATCAGAAACTATTGATATGTATACATCGAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACA
TGAAGTATTAGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGATTGGCAACAAGAGTATGTATTTGGTGGTAGAACTTTCAAGTTATCTGAGGGAATGAGTGGTAGCC
AGTTGACATCCGATTCTCGAGCGGAGCATCTTCTAGAAGCAGTAGTGGCCGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAATCTGGACAG
TCTCTGTTAACGACAGAAAAAAATCCAAAGCCGTCCACAAATGTTGCAACATCTGCTTGTTCAGAAGGTTATTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGA
CATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGAGTGATGTCCCCAAAAGGGTTTTCATCGACATATTCTGGTACCGGAAGTGAGCACCTGGAGCGGTCTTCAGAAC
CTGCAAAGAACAGTAAAAGAAGGGCTAAACCTGGTGAGAGTTCTCGGCCTAGGCCGAGGGATAGACAATTGATCCAGGATCGCATCAAAGAACTTCGGGAGCTAGTACCA
AATGGAGCAAAATGCAGCATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCGCCAAGCATGCTGACAAGCTAAACAAATGCGCTAACAT
GAAGTTGCATCAGAAGGAAAGTGGCATGCTAGGATCCTCAAGCACTGATCAGGGTTCAAGCTGGGCTGTTGAGGTCGGTGGTCAACTTAAAGTTTGTTCTATAATTGTGG
AGAATCTCAACAAGAACGGGCAGATTCTTGTGGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAA
GGCATAACGGAAGCTCATGGAGACAAGACATGGATTTGTTTTGTGGTCGAGGGGGAGAACAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACT
ACAGCGCAGTAATACAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACTGATCATTTGCTATTACCAACAGTTGGACCGCCAATAAAACGGCGAGCAGGGTTAAGGAGAAAGCAGGCTGGTAGAGGCTCCCTTTGTTGCAACTCGGAATG
GAAATATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGAATGGTGCTGACTTGGGAGGATGGCTACTATGACAATTCTGAACAACATTATCCCTCAGATAGCAAGTTTT
ACAGAAAAACTCTTGAGAAGTTTCATGATGGAAATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAGATGTCCTATCATGTATACTCTCTTGGGGAAGGGATTGTT
GGACAAGTAGCCGTTAGTGGAAAACATCAATGGATTACTACAGACGAGCAAATACCAAATTTCTCTTCAACACTTGAGTACTGCGATGGTTGGCAAACACAATTTTCAGC
TGGCATTAAGACCATTGTTGTTGTAGCAGTTGTCCCGCATGGTGTTTTACAGCTTGGATCTTTAGATAAAGTCACTGAGGATGTCAATCTTGTGGGTCGCATCAGAAATG
TCTTTCTAACTCTTCAAGAGTCTTCAGCTGGGCATTGTAAGAGCTCGGGATACATGGCAGACATTCCATCTAGAAGCTTAGCTACAGAGAAAAATGAAGTTGAAATGCTC
AGCAATAATGTTGGGATAGAGTTGTCAGGATGTGGGAGCATTGAATCTCACAAGACAAAGCCAGATGCTATCAATGTGGATAGTTTGAAGTCACAAGTGAGGTTGCTTGA
TGATAGGATATGTGAAGGGGAGCCTAGTGGGTGCAAAGATATCGAAGTTGGTTTGAAACAAACAATCAATGTGCGATCGCAAAACTCTGCAATGGATATGATTAATATAT
GTGGCAATATACTTCCAGCTGAGAAGATCATAACAAATGAAGCATACTTCCCTATGAATCCTCATGTCTCTTCTGTCCATGATGGAGTTAATCACGACGGGATGTATAGC
CGAACCAACCCTTCAGTAATATACTTGCAAAATGATGTGGAAGCATCAGAAACTATTGATATGTATACATCGAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACA
TGAAGTATTAGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGATTGGCAACAAGAGTATGTATTTGGTGGTAGAACTTTCAAGTTATCTGAGGGAATGAGTGGTAGCC
AGTTGACATCCGATTCTCGAGCGGAGCATCTTCTAGAAGCAGTAGTGGCCGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAATCTGGACAG
TCTCTGTTAACGACAGAAAAAAATCCAAAGCCGTCCACAAATGTTGCAACATCTGCTTGTTCAGAAGGTTATTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGA
CATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGAGTGATGTCCCCAAAAGGGTTTTCATCGACATATTCTGGTACCGGAAGTGAGCACCTGGAGCGGTCTTCAGAAC
CTGCAAAGAACAGTAAAAGAAGGGCTAAACCTGGTGAGAGTTCTCGGCCTAGGCCGAGGGATAGACAATTGATCCAGGATCGCATCAAAGAACTTCGGGAGCTAGTACCA
AATGGAGCAAAATGCAGCATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCGCCAAGCATGCTGACAAGCTAAACAAATGCGCTAACAT
GAAGTTGCATCAGAAGGAAAGTGGCATGCTAGGATCCTCAAGCACTGATCAGGGTTCAAGCTGGGCTGTTGAGGTCGGTGGTCAACTTAAAGTTTGTTCTATAATTGTGG
AGAATCTCAACAAGAACGGGCAGATTCTTGTGGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAA
GGCATAACGGAAGCTCATGGAGACAAGACATGGATTTGTTTTGTGGTCGAGGGGGAGAACAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACT
ACAGCGCAGTAATACAATGTGA
Protein sequenceShow/hide protein sequence
MGTDHLLLPTVGPPIKRRAGLRRKQAGRGSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHYPSDSKFYRKTLEKFHDGNYSHDPLGLAVAKMSYHVYSLGEGIV
GQVAVSGKHQWITTDEQIPNFSSTLEYCDGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVGRIRNVFLTLQESSAGHCKSSGYMADIPSRSLATEKNEVEML
SNNVGIELSGCGSIESHKTKPDAINVDSLKSQVRLLDDRICEGEPSGCKDIEVGLKQTINVRSQNSAMDMINICGNILPAEKIITNEAYFPMNPHVSSVHDGVNHDGMYS
RTNPSVIYLQNDVEASETIDMYTSNTSLKFPAGYELHEVLGPAFLKDALYLDWQQEYVFGGRTFKLSEGMSGSQLTSDSRAEHLLEAVVADVCHSGSDVKSDTSLCKSGQ
SLLTTEKNPKPSTNVATSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRAKPGESSRPRPRDRQLIQDRIKELRELVP
NGAKCSIDSLLERTIKHMLFLQGIAKHADKLNKCANMKLHQKESGMLGSSSTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILK
GITEAHGDKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM