; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012817 (gene) of Snake gourd v1 genome

Gene IDTan0012817
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor EMB1444-like isoform X1
Genome locationLG06:10028439..10036727
RNA-Seq ExpressionTan0012817
SyntenyTan0012817
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR025610 - Transcription factor MYC/MYB N-terminal
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR043561 - Transcription factor LHW-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025488.1 Transcription factor EMB-like protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.5Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYY NSEQHD  ESKFY KT+EKFHDG YSHD LGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSST+EYCDGW TQFSAGI                    KVTEDVNLVARIRNVFLTLQESSAGH+KPMHSC+SSGYVADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH
        EKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQVRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNPH
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH

Query:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL
        ASSV  G+ H GM S+ NPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFLKDALYLDW+TEYVFGGK FELSEGM+GS LTSDSPTEHLL
Subjt:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL

Query:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA
        EAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPA
Subjt:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA

Query:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ
        KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Subjt:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ

Query:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
Subjt:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_011659263.1 transcription factor bHLH155 isoform X1 [Cucumis sativus]0.0e+0088.13Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTLE F+DGHYSHD LGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSSTIEYCDGWQTQFSAGI                    KVTEDVNLV RIRNVFLTLQESSAG +KPMHSCKSSGY+ADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
        EK EVA VSKNVG+ELSG+   ESL +KPD INVE+ KSQVRLLDDR+CGGEPSGCKD +VGLK K+NVQSQNS   MV+ICGNLLPAEKI+TN+AY+ M
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE
        NPH SS YDGVNH+GMF RTN +EMYLQND+EASETI+MYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE

Query:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS
         LLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNVTTSA SEGYSMGQSQTSF GEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++SS
Subjt:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS

Query:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV
        EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQK +GMLG+S+TDQGSSWAVEV
Subjt:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV

Query:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_016901084.1 PREDICTED: transcription factor EMB1444 [Cucumis melo]0.0e+0086.98Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDL QILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTLE F+DGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSSTIEYCDGWQTQFSAGI                    KVTEDVNLV RIRN FLTLQESSAG +KP+HSCKSSGYV          
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
         K E A VSKNVG+ELSG+GG ESL++KPDAINVES KSQVRLLDDRICGGEPSGCKD +VGLK K+NVQSQ+S   M++ICGNLLPAEKI+TN AY+PM
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE
        NPH SSVYDGVNH+GMF+RTN +EMYLQND+EAS+TIDMYPSN SLKFPAGYELHEVLGPAFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE

Query:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS
         LLEAVVADVCHS SDVKSDTSLCKSGQSLLTTE+IPEPSTN TTSA SEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++S 
Subjt:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS

Query:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV
        EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQKENGMLG+SNTDQGSSWAVEV
Subjt:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV

Query:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

XP_022155842.1 transcription factor EMB1444-like isoform X1 [Momordica charantia]0.0e+0086.42Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDP ESKF+RKTLEKFHDGH+SHDPLGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSST+EYCDGWQTQFSAGI                    KVTED+NLV RIRN+FLTLQESSAGH+KPMHSC SSGY+ADI ++SL T
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
        EKNEV  VSK+VG+ELSG+GG ESL++KPDA  V+ LKSQVR +DDR+C GEPSGCKDM+VGLKHKV+V+ QNS   MV+ICGNLLPAEKI+TN+A +PM
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDS
        N HASS  DGVNH+GM  RTNP+EM L+NDVEA E     DMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EY FG K FELSEGMSGSQLTSDS
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDS

Query:  PTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE
        P E LLEAVVADVCHSGSDVKS+TSLCKSGQSLLTTE+IPEPSTN+TTSA SEGYSMGQSQ+SFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE
Subjt:  PTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE

Query:  RSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA
        RSSEPAKN+KRRA+PGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSN DQGSSWA
Subjt:  RSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA

Query:  VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSN
        VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNR+IHRMDILWSLVQILQRSN
Subjt:  VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSN

XP_022959660.1 transcription factor EMB1444-like [Cucurbita moschata]0.0e+0087.35Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSST+EYCDGWQTQFSAGI                    KV EDVNLVARIRNVFLTLQESSAGH+KPMHSC+SSGYVADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH
        EKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQVRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNP 
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH

Query:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL
        ASSV  G+ H GM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFLKDALYLDW+TEYVFGGK FELSEG +GS LTSDSPTEHLL
Subjt:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL

Query:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA
        EAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPA
Subjt:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA

Query:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ
        KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Subjt:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ

Query:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFVVEGENNRN+HRMDILWSLVQILQRSNTM
Subjt:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

TrEMBL top hitse value%identityAlignment
A0A0A0K751 BHLH domain-containing protein0.0e+0088.13Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTLE F+DGHYSHD LGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSSTIEYCDGWQTQFSAGI                    KVTEDVNLV RIRNVFLTLQESSAG +KPMHSCKSSGY+ADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
        EK EVA VSKNVG+ELSG+   ESL +KPD INVE+ KSQVRLLDDR+CGGEPSGCKD +VGLK K+NVQSQNS   MV+ICGNLLPAEKI+TN+AY+ M
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE
        NPH SS YDGVNH+GMF RTN +EMYLQND+EASETI+MYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE

Query:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS
         LLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNVTTSA SEGYSMGQSQTSF GEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++SS
Subjt:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS

Query:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV
        EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQK +GMLG+S+TDQGSSWAVEV
Subjt:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV

Query:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

A0A1S4DYM6 transcription factor EMB14440.0e+0086.98Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDL QILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTLE F+DGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSSTIEYCDGWQTQFSAGI                    KVTEDVNLV RIRN FLTLQESSAG +KP+HSCKSSGYV          
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
         K E A VSKNVG+ELSG+GG ESL++KPDAINVES KSQVRLLDDRICGGEPSGCKD +VGLK K+NVQSQ+S   M++ICGNLLPAEKI+TN AY+PM
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE
        NPH SSVYDGVNH+GMF+RTN +EMYLQND+EAS+TIDMYPSN SLKFPAGYELHEVLGPAFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTE

Query:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS
         LLEAVVADVCHS SDVKSDTSLCKSGQSLLTTE+IPEPSTN TTSA SEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL++S 
Subjt:  HLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS

Query:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV
        EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQKENGMLG+SNTDQGSSWAVEV
Subjt:  EPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV

Query:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRS+TM
Subjt:  GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

A0A6J1DQG8 transcription factor EMB1444-like isoform X10.0e+0086.42Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDP ESKF+RKTLEKFHDGH+SHDPLGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSST+EYCDGWQTQFSAGI                    KVTED+NLV RIRN+FLTLQESSAGH+KPMHSC SSGY+ADI ++SL T
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM
        EKNEV  VSK+VG+ELSG+GG ESL++KPDA  V+ LKSQVR +DDR+C GEPSGCKDM+VGLKHKV+V+ QNS   MV+ICGNLLPAEKI+TN+A +PM
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM

Query:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDS
        N HASS  DGVNH+GM  RTNP+EM L+NDVEA E     DMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQ EY FG K FELSEGMSGSQLTSDS
Subjt:  NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDS

Query:  PTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE
        P E LLEAVVADVCHSGSDVKS+TSLCKSGQSLLTTE+IPEPSTN+TTSA SEGYSMGQSQ+SFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE
Subjt:  PTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLE

Query:  RSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA
        RSSEPAKN+KRRA+PGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSN DQGSSWA
Subjt:  RSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA

Query:  VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSN
        VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNR+IHRMDILWSLVQILQRSN
Subjt:  VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSN

A0A6J1H6X5 transcription factor EMB1444-like0.0e+0087.35Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        HQWITADEQIPNFSST+EYCDGWQTQFSAGI                    KV EDVNLVARIRNVFLTLQESSAGH+KPMHSC+SSGYVADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH
        EKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQVRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNP 
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH

Query:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL
        ASSV  G+ H GM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFLKDALYLDW+TEYVFGGK FELSEG +GS LTSDSPTEHLL
Subjt:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL

Query:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA
        EAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPA
Subjt:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA

Query:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ
        KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Subjt:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ

Query:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFVVEGENNRN+HRMDILWSLVQILQRSNTM
Subjt:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

A0A6J1L0P6 transcription factor EMB1444-like0.0e+0086.19Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+EKFHDG YSHD LGLAVAKMSYHVYSLGEGIVGQVAV GK
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT
        +QWITADEQIPNFSST+EYCDGWQTQFSAGI                    KVTEDVNLVA IRNVFLTLQESSAGH++PM SC+SSGYVADIPSRSLAT
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT

Query:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH
        EKNEV  VSK+VG+ELSG+GGI+SL+ KPDAIN++S KSQVR+LDDRICGGEPS CKD++VGLK K+NV SQN  +       PAEKIIT+EAY+PMNPH
Subjt:  EKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH

Query:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL
        ASSV  G+ HDGM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFLKDALYLDW+T+YVFGGK FELSEG +GS LTSDSPTEHLL
Subjt:  ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLL

Query:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA
        EAVVADVCHSGSDVKSDTSLCKSGQSLLTTE+IPEPSTNV TS  SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KGFSSTYSGTGSEHLE+ SEPA
Subjt:  EAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPA

Query:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ
        KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHA KLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Subjt:  KNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ

Query:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
        LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL IL+GITEAHG+KTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
Subjt:  LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM

SwissProt top hitse value%identityAlignment
K4PW38 Protein RICE SALT SENSITIVE 38.9e-1326.6Show/hide
Query:  GSTDLHQILKSLCCNSEWKYAVFWKLKHRAR---------------MVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYS
        G   LH+ L+++C NS+W Y+VFW ++ R R               ++L WEDG              F R  + +  +     DP+  A +KMS  +Y+
Subjt:  GSTDLHQILKSLCCNSEWKYAVFWKLKHRAR---------------MVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYS

Query:  LGEGIVGQVAVNGKHQWI-----TADEQIPNF--SSTIEYCDGWQTQFSAGIK-------------------VTEDVNLVARIRNVFLTLQESSAGHMKP
         GEG++G+VA +  H+W+       +  I N+  SS       W  QF++GI+                   + ED++ V R+R++F +L   S   +  
Subjt:  LGEGIVGQVAVNGKHQWI-----TADEQIPNF--SSTIEYCDGWQTQFSAGIK-------------------VTEDVNLVARIRNVFLTLQESSAGHMKP

Query:  MHS
        + S
Subjt:  MHS

P0C7P8 Transcription factor EMB14449.8e-12942.53Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL
        HQWI + E + +  ST++  +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   +  C   S      IPS+ L
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL

Query:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS
                     A +   +  VS+N                  ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Subjt:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS

Query:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS
        G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D  +SR N    PSE    +++++      E SE+  +   
Subjt:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS

Query:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS
          SL   +G EL E LGPAF K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT ++   EP 
Subjt:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS

Query:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG
         +   +  S   S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Subjt:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG

Query:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR
        +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IR
Subjt:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR

Query:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        SL L IL+G TE  GEKTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

Q58G01 Transcription factor bHLH1558.3e-12842.33Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA
        MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                    H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVA
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA

Query:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP
        V+G+HQW+   E   N +S  E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A H   +  C   +S  +  +P
Subjt:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP

Query:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------
        S  L  E   +    V K + +E S       T   +S+  + P +  V    +QV     ++    CG                               
Subjt:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------

Query:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE
          G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +   +        H+ +F  +          +E  L  + E
Subjt:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE

Query:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL
        +S+   +  S  +    AG EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSL
Subjt:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL

Query:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI
        LT  ++ EPS     +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Subjt:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI

Query:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS
        KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC 
Subjt:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS

Query:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ
        HFLEIA  IRSL L IL+G TE  GEKTWICFV E +N++ + RMDILWSLVQI Q
Subjt:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ

Q7XJU0 Transcription factor bHLH1578.8e-4528.47Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MGS   H ILKSLC +  W YAVFW+      M+L +E+ Y D                                 V  M      LG+GIVG+VA +G 
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTG
        HQW+ +D       +  ++   +Q QF  G K+         IR    T                    +A IP                       G+ 
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTG

Query:  GIESLQSKPDAI-NVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTNP
        G+  L S    + + E L+   R L +       SG            ++ +    +  C  + PAE                  + G + D +F+  NP
Subjt:  GIESLQSKPDAI-NVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTNP

Query:  SEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDV---KS
          + L  ++ +SE      SN  L     Y   ++L  ++  D LY                       QL +D P ++    V+  V     D+    S
Subjt:  SEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDV---KS

Query:  DTSLCKSGQSLLTTEKIPEPSTNVTTSA---YSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS------EPAKN--SKR
         T         L +E I    +N T S+     + YS G +Q+     D  ++ SSS     + P+  + T      +  ERSS      +P +    K+
Subjt:  DTSLCKSGQSLLTTEKIPEPSTNVTTSA---YSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSS------EPAKN--SKR

Query:  RARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCS
        RA+ GES RPRP+DRQ+IQDRIKELR ++PNGAKCSID+LL+ TIKHM+F+Q + K+A++L +    KL +++             +WA+EVG +  VC 
Subjt:  RARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCS

Query:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ
        I+VE LN+ G++ +EM+CEE   FLEI + +R LGL ILKG+ E    + W  F+V+ +    + R+ +L+SLVQ+ Q
Subjt:  IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ

Q9XIN0 Transcription factor LHW7.9e-6231.29Show/hide
Query:  LHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKM--SYHVYSLGEGIVGQVAVNGKHQW
        L + L+S+C N++W YAVFWK+  +   +L WE+ Y +     +P       + L         ++ + L   +M  +  +  +GEG+VG+ A  G HQW
Subjt:  LHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKM--SYHVYSLGEGIVGQVAVNGKHQW

Query:  ITADEQIPNFSSTI---EYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKN---------
        I A+    +F+  +   E  +    QFSAGI+             VF  +          +   ++ G+V D+    L       A +S+N         
Subjt:  ITADEQIPNFSSTI---EYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKN---------

Query:  -VGMELS---GTGGIESLQSKPDAINVESLKSQVRLL---DDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSV
         +G+ +S    + G + LQS   A   E+ K         D ++    P    D     +H+   QS    ++              E   P NP A   
Subjt:  -VGMELS---GTGGIESLQSKPDAINVESLKSQVRLL---DDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSV

Query:  YDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGY-ELHEVLGPAFLKDALYLDW-----QTEYVFGGKTFELSE-----------GMSG
                 +   N S M   ++V+A+E   +   + S K   G  +L ++LG           W     +TE +    T ELS+           G SG
Subjt:  YDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGY-ELHEVLGPAFLKDALYLDW-----QTEYVFGGKTFELSE-----------GMSG

Query:  SQLTSDSPTEHLLEAVVADVCHSGSDVKSDTS-LCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYS
         +L   S T+HLL+AVV+  C S   +  +TS  CK+  + ++   +  PS           +S  Q    F  +  Q  L  S V G          +S
Subjt:  SQLTSDSPTEHLLEAVVADVCHSGSDVKSDTS-LCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYS

Query:  --GTGSEHLERSSEPAK--NSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGML
            GS  +   +E AK  N+++R +PGE+ RPRP+DRQ+IQDR+KELRE++PNGAKCSID+LLERTIKHMLFLQ ++KH+DKL +    K+ +++ G  
Subjt:  --GTGSEHLERSSEPAK--NSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGML

Query:  GSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQIL
               G++WA EVG +  VC I+VE++N      VEMLCE+   FLEIA+ IRSLGLTILKG+ E   +K W  F VE   +R++ RM+I   LV IL
Subjt:  GSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQIL

Query:  QRS
        +++
Subjt:  QRS

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.9e-13042.53Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL
        HQWI + E + +  ST++  +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   +  C   S      IPS+ L
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL

Query:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS
                     A +   +  VS+N                  ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Subjt:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS

Query:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS
        G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D  +SR N    PSE    +++++      E SE+  +   
Subjt:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS

Query:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS
          SL   +G EL E LGPAF K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT ++   EP 
Subjt:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS

Query:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG
         +   +  S   S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Subjt:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG

Query:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR
        +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IR
Subjt:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR

Query:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        SL L IL+G TE  GEKTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

AT1G06150.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.9e-13042.53Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK
        MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES          H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGK

Query:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL
        HQWI + E + +  ST++  +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   +  C   S      IPS+ L
Subjt:  HQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIPSRSL

Query:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS
                     A +   +  VS+N                  ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Subjt:  -------------ATEKNEVARVSKNVG----------------MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS

Query:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS
        G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D  +SR N    PSE    +++++      E SE+  +   
Subjt:  GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTN----PSE----MYLQN----DVEASETIDMYPS

Query:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS
          SL   +G EL E LGPAF K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S  +S QSLLTT ++   EP 
Subjt:  NTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKI--PEPS

Query:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG
         +   +  S   S+        G   QN    S +CG  S  GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Subjt:  TNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG

Query:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR
        +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IR
Subjt:  AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR

Query:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT
        SL L IL+G TE  GEKTWICFVVEG+NN+ +HRMDILWSLVQI Q   T
Subjt:  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNT

AT2G31280.1 conserved peptide upstream open reading frame 75.9e-12942.33Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA
        MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                    H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVA
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA

Query:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP
        V+G+HQW+   E   N +S  E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A H   +  C   +S  +  +P
Subjt:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP

Query:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------
        S  L  E   +    V K + +E S       T   +S+  + P +  V    +QV     ++    CG                               
Subjt:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------

Query:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE
          G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +   +        H+ +F  +          +E  L  + E
Subjt:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE

Query:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL
        +S+   +  S  +    AG EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSL
Subjt:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL

Query:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI
        LT  ++ EPS     +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Subjt:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI

Query:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS
        KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC 
Subjt:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS

Query:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ
        HFLEIA  IRSL L IL+G TE  GEKTWICFV E +N++ + RMDILWSLVQI Q
Subjt:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ

AT2G31280.2 conserved peptide upstream open reading frame 78.3e-10739.53Show/hide
Query:  GSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV
        G     +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                    H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVAV
Subjt:  GSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV

Query:  NGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC----------------------KSSGYVADIPS
        +G+HQW+        F      C+   + F   + V      V ++ ++      S   H +    C                      +S     + PS
Subjt:  NGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC----------------------KSSGYVADIPS

Query:  RSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ----VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKI
          L  EK   A+V     +    T G  S  +    +++   K +      ++ D    G  SGCKD S  L   +++  +N ++   S     + AE++
Subjt:  RSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ----VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKI

Query:  ITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEY
        IT+++Y    P   S +   +        H+ +F  +          +E  L  + E+S+   +  S  +    AG EL E LG AF +       Q E 
Subjt:  ITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEY

Query:  VFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSS
        +    G T   ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSLLT  ++ EPS     +  +   +   +Q      D Q   +SS
Subjt:  VFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSS

Query:  GVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCA
         +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K A
Subjt:  GVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCA

Query:  NMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVE------------------------MLCEECSHFLEIAEAIRSLGLTILKG
        N K+ QKE GM       QGSS AVEVGG L+V SIIVENLNK G +L+E                        MLCEEC HFLEIA  IRSL L IL+G
Subjt:  NMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVE------------------------MLCEECSHFLEIAEAIRSLGLTILKG

Query:  ITEAHGEKTWICFVVE
         TE  GEKTWICFV E
Subjt:  ITEAHGEKTWICFVVE

AT2G31280.3 conserved peptide upstream open reading frame 71.0e-12541.4Show/hide
Query:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA
        MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                    H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVA
Subjt:  MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVA

Query:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP
        V+G+HQW+   E   N +S  E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A H   +  C   +S  +  +P
Subjt:  VNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC--KSSGYVADIP

Query:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------
        S  L  E   +    V K + +E S       T   +S+  + P +  V    +QV     ++    CG                               
Subjt:  SRSLATE--KNEVARVSKNVGMELSG------TGGIESL-QSKPDAINVESLKSQV----RLLDDRICG-------------------------------

Query:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE
          G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +   +        H+ +F  +          +E  L  + E
Subjt:  --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVE

Query:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL
        +S+   +  S  +    AG EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+VC    + + D    +S QSL
Subjt:  ASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSL

Query:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI
        LT  ++ EPS     +  +   +   +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Subjt:  LTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI

Query:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS
        KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM       QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC 
Subjt:  KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECS

Query:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVE-----------------GENNRNIHRMDILWSLVQILQ
        HFLEIA  IRSL L IL+G TE  GEKTWICFV E                  +N++ + RMDILWSLVQI Q
Subjt:  HFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVE-----------------GENNRNIHRMDILWSLVQILQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTACTGACTTGCACCAAATACTCAAAAGCCTTTGTTGCAACTCGGAATGGAAGTATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGCATGGTGTTGACTTG
GGAAGATGGCTACTATGACAATTCTGAACAACATGATCCCTCAGAGAGCAAGTTTTACAGAAAAACGCTCGAGAAGTTTCATGATGGACATTATTCACATGACCCTCTTG
GATTAGCTGTGGCAAAAATGTCATATCATGTATATTCTCTTGGGGAAGGGATTGTTGGACAAGTAGCGGTTAATGGAAAACATCAGTGGATTACTGCAGATGAACAAATA
CCAAATTTCTCTTCAACAATTGAGTACTGCGATGGTTGGCAAACACAATTTTCAGCTGGCATTAAGGTCACTGAGGATGTCAATCTTGTGGCTCGCATCAGAAATGTCTT
TCTAACTCTTCAGGAGTCTTCAGCTGGGCATATGAAGCCAATGCATTCTTGTAAGAGCTCAGGATACGTGGCAGACATTCCATCTAGAAGCTTAGCAACAGAGAAAAATG
AAGTTGCAAGGGTCAGCAAGAATGTGGGAATGGAGTTGTCAGGAACTGGGGGTATTGAATCTCTACAATCTAAACCGGATGCTATCAATGTAGAGAGTTTGAAGTCACAA
GTGAGGTTGCTTGATGACAGGATATGTGGAGGGGAGCCTAGTGGGTGCAAAGATATGTCAGTTGGTTTGAAACACAAAGTCAATGTACAATCGCAAAACTCTATGGTTAG
CATATGTGGTAATTTACTTCCAGCTGAAAAGATCATAACAAATGAAGCATACTACCCTATGAATCCTCACGCCTCTTCTGTCTATGATGGAGTTAATCACGATGGGATGT
TTAGCCGAACCAACCCTTCAGAAATGTACTTGCAAAATGACGTGGAAGCATCGGAAACTATTGATATGTATCCATCAAACACATCATTGAAGTTCCCTGCTGGTTATGAG
CTACATGAAGTACTCGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGACTGGCAAACAGAGTACGTTTTTGGTGGTAAAACTTTCGAGTTATCTGAGGGAATGAGTGG
TAGCCAGTTGACATCCGATTCACCAACGGAGCATCTTCTAGAGGCAGTAGTGGCTGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAGTCTG
GACAGTCTCTGTTAACGACTGAAAAAATTCCTGAGCCTTCCACAAATGTGACAACATCTGCTTATTCAGAAGGTTACTCTATGGGTCAAAGTCAAACATCTTTTATTGGA
GAGGACATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGGGTGATGTCCCCAAAAGGGTTTTCGTCGACGTATTCTGGTACCGGCAGTGAGCACCTGGAGCGGTCTTC
AGAACCTGCAAAGAACAGTAAAAGAAGGGCAAGACCTGGTGAGAGTTGTAGGCCTAGGCCGAGGGACAGACAATTGATCCAGGATCGCATCAAAGAACTGCGGGAGCTAG
TACCAAATGGAGCAAAATGTAGTATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCACCAAGCATGCTGACAAGCTAAACAAATGCGCC
AACATGAAGTTGCATCAGAAGGAAAATGGCATGCTAGGATCCTCAAATACTGATCAGGGTTCAAGCTGGGCAGTGGAGGTCGGTGGTCAACTTAAAGTTTGTTCGATAAT
TGTGGAGAATCTAAACAAGAACGGACAGATTCTTGTAGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATAC
TAAAAGGCATAACAGAAGCTCATGGCGAGAAGACATGGATTTGTTTTGTGGTTGAGGGGGAGAATAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAA
ATACTACAGCGCAGTAATACAATGTGA
mRNA sequenceShow/hide mRNA sequence
GTTGTGTGTGAGTTTCTGTTCTGTTCTGTTATTTTGCCATCACCAATTGGAAAAAAGCGAACAAAATCCAACAGAAATCTCAGAGCTTTCTCTCTCTTTCAAGCATTTCA
ACCTTTTTTGCTCTCTCTCTAGAGTTCTACTGCTTCCCAGTGTTGTGTGAAGAGGAAAGTGAATTAAGAAAAAGAAAGAGACAATGCAAGGAAATGTAGATCAGGGCGTT
GAATTTCTCTGTGATTTTCATCGTTTCTTTCGGTCAAAAAGTTTCAAAACTGTTTCTCTCTGTAAAGGGGCTTGTTGAAATCCATCCCTTTTCTCTTTGACGCTTTTTCG
GTTTCTGGGTTTTGGTTTTGGGTTGAGCCTCGAAAGCACTTCTTCTTAGACCCTTCTGTTGTTTTTGTTGTTGTTCTCTAGTTTTATGACGATTTCATAAGTTCATCCAT
GTCGTCCGACATAAGCTAGGGTTTAATTTTCCCCTTGTTTGAGTGATCCGTGGTGGGTCTGAGAGAAAAAGAGCTATTACTTGAGTGTTTGTTTGAGATTCTTCTCTGCT
GCTGTTGTTGTTGTCCATGGATTGTTATACTTATTCCAACTCTAATGGTGTCCACTGTGTTAGAAGCTATGTTGTTGTTGCTGAAATTTCATTGATATCATCTTATTTGT
ACTGAATTTGTTGAAGAATTCTTGTATTTGAGTTGGGGGTTTTTTTGGGGGGCTGTTTGAGTTTTAAGGGGGATGGGAACTGATCGTCTGCTATTACCAACAGTTGGACC
GCCAATAAAACGGCGTGCAGGATTGAGGAGAAAGCAGGCTGGTAGAGGCTCGTATAGGGGAAGCTAGGAAAATTGGGTGTTTTCTTTTCAAGGTGTTTAGAGTTGGGGGT
TTTCAAGGTGGTTTTAGGAACAAGCCCTGTTTTGTAGGGTATTCTTTTTAGGGGGGCGTTTAGACAAAATTGGTTCAGTGAGAGATGGGTTCTACTGACTTGCACCAAAT
ACTCAAAAGCCTTTGTTGCAACTCGGAATGGAAGTATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGCATGGTGTTGACTTGGGAAGATGGCTACTATGACAATTCTG
AACAACATGATCCCTCAGAGAGCAAGTTTTACAGAAAAACGCTCGAGAAGTTTCATGATGGACATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAAATGTCATAT
CATGTATATTCTCTTGGGGAAGGGATTGTTGGACAAGTAGCGGTTAATGGAAAACATCAGTGGATTACTGCAGATGAACAAATACCAAATTTCTCTTCAACAATTGAGTA
CTGCGATGGTTGGCAAACACAATTTTCAGCTGGCATTAAGGTCACTGAGGATGTCAATCTTGTGGCTCGCATCAGAAATGTCTTTCTAACTCTTCAGGAGTCTTCAGCTG
GGCATATGAAGCCAATGCATTCTTGTAAGAGCTCAGGATACGTGGCAGACATTCCATCTAGAAGCTTAGCAACAGAGAAAAATGAAGTTGCAAGGGTCAGCAAGAATGTG
GGAATGGAGTTGTCAGGAACTGGGGGTATTGAATCTCTACAATCTAAACCGGATGCTATCAATGTAGAGAGTTTGAAGTCACAAGTGAGGTTGCTTGATGACAGGATATG
TGGAGGGGAGCCTAGTGGGTGCAAAGATATGTCAGTTGGTTTGAAACACAAAGTCAATGTACAATCGCAAAACTCTATGGTTAGCATATGTGGTAATTTACTTCCAGCTG
AAAAGATCATAACAAATGAAGCATACTACCCTATGAATCCTCACGCCTCTTCTGTCTATGATGGAGTTAATCACGATGGGATGTTTAGCCGAACCAACCCTTCAGAAATG
TACTTGCAAAATGACGTGGAAGCATCGGAAACTATTGATATGTATCCATCAAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACATGAAGTACTCGGACCTGCTTT
TCTGAAAGATGCTCTTTATCTTGACTGGCAAACAGAGTACGTTTTTGGTGGTAAAACTTTCGAGTTATCTGAGGGAATGAGTGGTAGCCAGTTGACATCCGATTCACCAA
CGGAGCATCTTCTAGAGGCAGTAGTGGCTGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAGTCTGGACAGTCTCTGTTAACGACTGAAAAA
ATTCCTGAGCCTTCCACAAATGTGACAACATCTGCTTATTCAGAAGGTTACTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGACATGCAGAACTCTTTAAGCTC
ATCAGGAGTATGTGGGGTGATGTCCCCAAAAGGGTTTTCGTCGACGTATTCTGGTACCGGCAGTGAGCACCTGGAGCGGTCTTCAGAACCTGCAAAGAACAGTAAAAGAA
GGGCAAGACCTGGTGAGAGTTGTAGGCCTAGGCCGAGGGACAGACAATTGATCCAGGATCGCATCAAAGAACTGCGGGAGCTAGTACCAAATGGAGCAAAATGTAGTATT
GATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCACCAAGCATGCTGACAAGCTAAACAAATGCGCCAACATGAAGTTGCATCAGAAGGAAAA
TGGCATGCTAGGATCCTCAAATACTGATCAGGGTTCAAGCTGGGCAGTGGAGGTCGGTGGTCAACTTAAAGTTTGTTCGATAATTGTGGAGAATCTAAACAAGAACGGAC
AGATTCTTGTAGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAAGGCATAACAGAAGCTCATGGC
GAGAAGACATGGATTTGTTTTGTGGTTGAGGGGGAGAATAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACTACAGCGCAGTAATACAATGTG
ACAGCTTCAACTGCTGAGCTTTTTGCAACTGTTTGTGTAGATTCCTCTGTAAAATATAGTGTGAGTTGCTTGTTGTGACTATTGATTCATCTGTCTCCTGCTTTATGGAG
TGCAGCACTTTTCTCAGAGATTCACCCTTCTCCATACAGTCCTCAAAAGTTGAACAAGGAAAAAGCGCAAAAAAAAAAAAAAAAATCTATAGAAGGGTGCACTAAAAAGA
AAACCAAGTCTCTGTAAAGAGCACTATTGTTTCTAAGTGTGAAAAACTATTTTTGTTTTTTGGTGTGTTTTCTAGCCAGTGTTGTCATTTGTTTGAAAAAGCTTTCTACT
TGATATAGTGCAACTATTCTGTATTTGTAGAACCCAATTTTAATAGTATAAGTTCATTTGAACAATACAAGTGTTTTATTTTTTTCC
Protein sequenceShow/hide protein sequence
MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQI
PNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ
VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYE
LHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIG
EDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCA
NMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQ
ILQRSNTM