CuGenDBv2

Moc05g30360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g30360
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BZIP domain-containing protein
Genome location: chr5:22614319..22617811
RNA-Seq Expression: Moc05g30360
Synteny: Moc05g30360
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like
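
The genome location above is a compact `chromosome:start..end` string. As a rough sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Region` and `parse_region` are hypothetical helpers, not part of any database API, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; hypothetical helper type for illustration."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings such as "chr5:22614319..22617811".
_REGION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_region(text: str) -> Region:
    """Split a 'chrom:start..end' string into a Region."""
    match = _REGION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start..end string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(match["chrom"], start, end)


region = parse_region("chr5:22614319..22617811")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene span works out to 3,493 bp.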


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.9e-169; % identity: 69.42)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT
        M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD RI+DRGV+S QPSEKEC + S  + ETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE
        ++M+K EKE E                 L++AEKEERR+RR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEY SLE TNKELKE
Subjt:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE

Query:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF
        Q+AQA +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV V+P S+  P+N+   VSDSSHV ENFTN  G  TPF
Subjt:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF

Query:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR
        C++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA D  EA + K H QN VGV V  FE D + QVR+
Subjt:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR

Query:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH
        ++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRRRKELTKLKNLH R C MH
Subjt:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH

XP_022144690.1 uncharacterized protein LOC111014317 isoform X1 [Momordica charantia] (e value: 8.2e-249; % identity: 95.04)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT
        RKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT
Subjt:  RKMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT

Query:  NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC
        NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC
Subjt:  NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC

Query:  LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM
        LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM
Subjt:  LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM

Query:  ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

XP_022144691.1 uncharacterized protein LOC111014317 isoform X2 [Momordica charantia] (e value: 3.3e-250; % identity: 95.24)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR

Query:  KMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN
        KMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN
Subjt:  KMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN

Query:  KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL
        KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL
Subjt:  KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL

Query:  LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI
        LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI
Subjt:  LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI

Query:  SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

XP_022144692.1 uncharacterized protein LOC111014317 isoform X3 [Momordica charantia] (e value: 9.3e-253; % identity: 99.78)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN
        RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN
Subjt:  RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN

Query:  RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC
        RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC
Subjt:  RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC

Query:  STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD
        STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD
Subjt:  STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD

Query:  DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

XP_022934488.1 uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata] (e value: 4.2e-168; % identity: 69.01)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT
        M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD RI+DRGV+S  PSEKEC + S  + ETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE
        ++M+K EKE E                 L++AEKEERR+RR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEY SLE TNKELKE
Subjt:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE

Query:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF
        Q+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV V+P S+  P+N+   VSDSSHV ENFTN  G  TPF
Subjt:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF

Query:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR
        C++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA D  EA + K H QN VGV V  FE D + QVR+
Subjt:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR

Query:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH
        ++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRRRKELTKLKNLH R C MH
Subjt:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CSC2 uncharacterized protein LOC111014317 isoform X1 (e value: 4.0e-249; % identity: 95.04)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT
        RKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT
Subjt:  RKMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETT

Query:  NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC
        NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC
Subjt:  NKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFC

Query:  LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM
        LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM
Subjt:  LLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRM

Query:  ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  ISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

A0A6J1CU60 uncharacterized protein LOC111014317 isoform X2 (e value: 1.6e-250; % identity: 95.24)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTR

Query:  KMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN
        KMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN
Subjt:  KMLKTEKEVELSK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTN

Query:  KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL
        KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL
Subjt:  KELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCL

Query:  LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI
        LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI
Subjt:  LPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMI

Query:  SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  SPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

A0A6J1CUE2 uncharacterized protein LOC111014317 isoform X3 (e value: 4.5e-253; % identity: 99.78)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT
        MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE-DRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN
        RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN
Subjt:  RKMLKTEKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNN

Query:  RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC
        RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC
Subjt:  RSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSC

Query:  STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD
        STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD
Subjt:  STGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKD

Query:  DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
        DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
Subjt:  DHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X1 (e value: 5.0e-167; % identity: 68.87)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRI-EDRGVVSQQPSEKECTNQSRSDCET
        M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD RI +DRGV+S  PSEKEC + S  + ET
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRI-EDRGVVSQQPSEKECTNQSRSDCET

Query:  TRKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELK
        T++M+K EKE E                 L++AEKEERR+RR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEY SLE TNKELK
Subjt:  TRKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELK

Query:  EQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTP
        EQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV V+P S+  P+N+   VSDSSHV ENFTN  G  TP
Subjt:  EQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTP

Query:  FCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVR
        FC++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA D  EA + K H QN VGV V  FE D + QVR
Subjt:  FCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVR

Query:  RMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH
        +++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRRRKELTKLKNLH R C MH
Subjt:  RMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X2 (e value: 2.0e-168; % identity: 69.01)
Query:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT
        M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD RI+DRGV+S  PSEKEC + S  + ETT
Subjt:  MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETT

Query:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE
        ++M+K EKE E                 L++AEKEERR+RR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEY SLE TNKELKE
Subjt:  RKMLKTEKEVE-----------------LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKE

Query:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF
        Q+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV V+P S+  P+N+   VSDSSHV ENFTN  G  TPF
Subjt:  QMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVVVIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPF

Query:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR
        C++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA D  EA + K H QN VGV V  FE D + QVR+
Subjt:  CLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRR

Query:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH
        ++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRRRKELTKLKNLH R C MH
Subjt:  MISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMH

SwissProt top hits (e value, % identity, alignment)
P23922 Transcription factor HBP-1a (e value: 8.8e-04; % identity: 36)
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY  L + N  LK ++ ++         P + E    N  SH + P
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP

Arabidopsis top hits (e value, % identity, alignment)
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein (e value: 5.2e-52; % identity: 36.36)
Query:  EIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFL------DSLPSRSDLDRRI--EDRGVVSQQPSEKECTN---QSRSDCET
        E+EAAEALADLA LA+       S   W +  KGKR RK VK+ESP +D L      D+LP+    + R+  E+      +P  KE T    +S  + ET
Subjt:  EIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFL------DSLPSRSDLDRRI--EDRGVVSQQPSEKECTN---QSRSDCET

Query:  TRKMLKT------------EKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQ
         + +L +                 LS+AE+EERR+RRILANRESARQTIRRRQA+CEEL+KKAADL +ENENL+REK+ ALKE+ SLET NK LKEQ+ +
Subjt:  TRKMLKT------------EKEVELSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQ

Query:  AVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSD---SSHVHENFTNSNGPSTPFCLLPCSWL
        +VKP  +E   + + S +++    T  P + Y + P+  + WP     H   +   + S +  PT+   S    ++  HEN  + NG  T F ++PC W 
Subjt:  AVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSD---SSHVHENFTNSNGPSTPFCLLPCSWL

Query:  LPHHDRRN------QQGPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEATDS-TEALNPKNHIQNPVGVAVGGFETDAKAQVR
        LP  D  N      Q   + + S G++ +D      +   T  S +  R +   S  P T    +  +S TE L+             GG          
Subjt:  LPHHDRRN------QQGPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEATDS-TEALNPKNHIQNPVGVAVGGFETDAKAQVR

Query:  RMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHM
               F   +   ++K ++ S+  +G++        + P   H   S+  KK   ++ AAEAR+RRKELT+LKNLHGRQC M
Subjt:  RMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHM
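
In the alignments above, the unlabeled middle row marks identical residues with the residue letter and conservative substitutions with `+`, while a space marks a mismatch or gap. As a minimal sketch of how identity could be tallied from one alignment block, the Python below counts those marks. The function name `midline_stats` is hypothetical, and the figure it returns covers only the rows passed in, so it need not equal the % identity the database reports, which may be computed over a different length.

```python
def midline_stats(query_row: str, midline: str) -> dict:
    """Tally matches from the aligned query row and its middle (match) row.

    Both arguments are the aligned text only, without the 'Query:' label
    or leading padding. The middle row may be shorter than the query row
    if trailing spaces were stripped, so the query row sets the length.
    """
    length = len(query_row)
    if length == 0:
        raise ValueError("empty alignment row")
    if len(midline) > length:
        raise ValueError("middle row is longer than the query row")

    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positive = identical + midline.count("+")         # '+' = conservative substitution
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "percent_identity": round(100.0 * identical / length, 2),
    }


# Toy example (not taken from the record above).
stats = midline_stats("ACDEFG", "AC+E G")
print(stats)
```

Summing `identical` and `length` over every block of a hit before dividing gives a whole-alignment figure rather than a per-block one.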


Sequences
CDS sequence
ATGGCCGCGGATCAGATGGTGAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGAGCTCAACCGTCGGACAAGAAATG
GAGGACCAAACTTAAGGGGAAACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCGTTCGGATCTGGACCGTCGGATTG
AGGATAGAGGGGTGGTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGATGTTAAAGACGGAGAAGGAGGTCGAA
TTATCTAAAGCTGAAAAGGAAGAAAGGAGAGTACGTAGAATTTTAGCAAACAGAGAGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAA
AAAGGCTGCTGATTTAGCATGGGAAAATGAAAATTTGAAGAGGGAGAAGGAGTTGGCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGA
TGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATCTCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGC
CCTCCATTTGCGTCGTATTTCTGGCCCTCTAGTCCTTATCAACATGAACTACCCAACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGA
CTCTTCCCATGTACATGAAAACTTTACGAACAGCAATGGCCCGAGTACACCCTTTTGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGG
GTCCTCAAGTCTCGTGTTCCACGGGAAATAATCAAGAGGATATTTGTTTGAATTCCCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCT
TCTTTGCCTTCAACTGAAGAAAAAAAGGAAGCAACTGATTCAACCGAAGCTTTGAACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGAC
TGACGCAAAAGCTCAAGTTAGGAGAATGATCTCTCCTGTGAGATTTAAATGTATCGAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGT
CAACAAGAGCTTGTGCTGACTTCTGTGTTTTTCCAGAAAAGAAGCATGAAGCAGAGAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGG
AGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGGCCGTCAATGCCACATGCATTCTTGA
mRNA sequence
ATGGCCGCGGATCAGATGGTGAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGAGCTCAACCGTCGGACAAGAAATG
GAGGACCAAACTTAAGGGGAAACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCGTTCGGATCTGGACCGTCGGATTG
AGGATAGAGGGGTGGTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGATGTTAAAGACGGAGAAGGAGGTCGAA
TTATCTAAAGCTGAAAAGGAAGAAAGGAGAGTACGTAGAATTTTAGCAAACAGAGAGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAA
AAAGGCTGCTGATTTAGCATGGGAAAATGAAAATTTGAAGAGGGAGAAGGAGTTGGCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGA
TGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATCTCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGC
CCTCCATTTGCGTCGTATTTCTGGCCCTCTAGTCCTTATCAACATGAACTACCCAACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGA
CTCTTCCCATGTACATGAAAACTTTACGAACAGCAATGGCCCGAGTACACCCTTTTGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGG
GTCCTCAAGTCTCGTGTTCCACGGGAAATAATCAAGAGGATATTTGTTTGAATTCCCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCT
TCTTTGCCTTCAACTGAAGAAAAAAAGGAAGCAACTGATTCAACCGAAGCTTTGAACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGAC
TGACGCAAAAGCTCAAGTTAGGAGAATGATCTCTCCTGTGAGATTTAAATGTATCGAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGT
CAACAAGAGCTTGTGCTGACTTCTGTGTTTTTCCAGAAAAGAAGCATGAAGCAGAGAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGG
AGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGGCCGTCAATGCCACATGCATTCTTGA
Protein sequence
MAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVE
LSKAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGR
PPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHS
SLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
RRKELTKLKNLHGRQCHMHS
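
The protein sequence above corresponds to the CDS read three bases at a time. As an illustrative sketch, the Python below translates a coding sequence with the standard genetic code and checks it against the first and last codons of the CDS listed in this record. The function `translate` and the table construction are written for this example only and assume the standard nuclear code with an in-frame, unambiguous A/C/G/T sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored. A codon containing anything
    other than A, C, G or T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown in this record.
print(translate("ATGGCCGCGGATCAGATGGTGAAGGTTGAG"))
print(translate("CACATGCATTCTTGA"))
```

The two fragments translate to the start (`MAADQMVKVE`) and end (`HMHS`) of the listed protein, which is consistent with the CDS and protein sequences describing the same product.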