; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1299 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1299
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionBZIP domain-containing protein
Genome locationMC05:17154379..17158332
RNA-Seq ExpressionMC05g1299
SyntenyMC05g1299
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580686.1 hypothetical protein SDJN03_20688, partial [Cucurbita argyrosperma subsp. sororia]2.72e-23070.66Show/hide
Query:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR
        S+KCS+ +SCS L S  SS+ SSS ++M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD R
Subjt:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR

Query:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK
        I+QDRGV+S QPSEKEC + S  + ETT++M+K EKE E    SPK   S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LTKK
Subjt:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK

Query:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI
        A+DLAWENENLKREKELALKEY SLE TNKELKEQ+AQA +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFWPS     SPY H+L NV V+
Subjt:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI

Query:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA
        P S+  P+N+ V  SDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS  EK EA
Subjt:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA

Query:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR
         D  EA + K H QN VGV V  FE D + QVR+++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRR
Subjt:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR

Query:  RKELTKLKNLHGRQCHMH
        RKELTKLKNLH R C MH
Subjt:  RKELTKLKNLHGRQCHMH

XP_022144690.1 uncharacterized protein LOC111014317 isoform X1 [Momordica charantia]0.099.81Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

XP_022144691.1 uncharacterized protein LOC111014317 isoform X2 [Momordica charantia]0.099.62Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

XP_022144692.1 uncharacterized protein LOC111014317 isoform X3 [Momordica charantia]0.095.38Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

XP_022934487.1 uncharacterized protein LOC111441650 isoform X1 [Cucurbita moschata]2.22e-22970.46Show/hide
Query:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR
        S+KCS+ +SCS L S S+ S SSS  +M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD R
Subjt:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR

Query:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK
        I+QDRGV+S  PSEKEC + S  + ETT++M+K EKE E    SPK   S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LTKK
Subjt:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK

Query:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI
        A+DLAWENENLKREKELALKEY SLE TNKELKEQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFWPS     SPY H+L NV V+
Subjt:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI

Query:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA
        P S+  P+N+ V  SDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA
Subjt:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA

Query:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR
         D  EA + K H QN VGV V  FE D + QVR+++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRR
Subjt:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR

Query:  RKELTKLKNLHGRQCHMH
        RKELTKLKNLH R C MH
Subjt:  RKELTKLKNLHGRQCHMH

TrEMBL top hitse value%identityAlignment
A0A6J1CSC2 uncharacterized protein LOC111014317 isoform X10.099.81Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

A0A6J1CU60 uncharacterized protein LOC111014317 isoform X20.099.62Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIE DRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

A0A6J1CUE2 uncharacterized protein LOC111014317 isoform X30.095.38Show/hide
Query:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
        MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP
Subjt:  MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLP

Query:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA
        SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQA
Subjt:  SRSDLDRRIEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
        LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV
Subjt:  LCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNV

Query:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
        VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE
Subjt:  VVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKE

Query:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
        ATDSTEALNPKNHIQNPVGVAVGGFETDAK QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR
Subjt:  ATDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARR

Query:  RRKELTKLKNLHGRQCHMHS
        RRKELTKLKNLHGRQCHMHS
Subjt:  RRKELTKLKNLHGRQCHMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X11.08e-22970.46Show/hide
Query:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR
        S+KCS+ +SCS L S S+ S SSS  +M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD R
Subjt:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR

Query:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK
        I+QDRGV+S  PSEKEC + S  + ETT++M+K EKE E    SPK   S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LTKK
Subjt:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK

Query:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI
        A+DLAWENENLKREKELALKEY SLE TNKELKEQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFWPS     SPY H+L NV V+
Subjt:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI

Query:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA
        P S+  P+N+ V  SDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA
Subjt:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA

Query:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR
         D  EA + K H QN VGV V  FE D + QVR+++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRR
Subjt:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR

Query:  RKELTKLKNLHGRQCHMH
        RKELTKLKNLH R C MH
Subjt:  RKELTKLKNLHGRQCHMH

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X29.79e-22870.46Show/hide
Query:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR
        S+KCS+ +SCS L S S+ S SSS  +M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD R
Subjt:  STKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDRR

Query:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK
        I QDRGV+S  PSEKEC + S  + ETT++M+K EKE E    SPK   S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LTKK
Subjt:  IEQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKK

Query:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI
        A+DLAWENENLKREKELALKEY SLE TNKELKEQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFWPS     SPY H+L NV V+
Subjt:  AADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPS-----SPYQHELPNVVVI

Query:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA
        P S+  P+N+ V  SDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQ  Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK EA
Subjt:  PSSIHLPTNSNV--SDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEA

Query:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR
         D  EA + K H QN VGV V  FE D + QVR+++SPVR +CIE TS VKQD  S+DD GLS+R C D C   EKKHE E VS KKTIDAM A EARRR
Subjt:  TDSTEALNPKNHIQNPVGVAVGGFETDAK-QVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR

Query:  RKELTKLKNLHGRQCHMH
        RKELTKLKNLH R C MH
Subjt:  RKELTKLKNLHGRQCHMH

SwissProt top hitse value%identityAlignment
P23922 Transcription factor HBP-1a5.8e-0436Show/hide
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY  L + N  LK ++ ++         P + E    N  SH + P
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein4.2e-5836.17Show/hide
Query:  LRSFSSSSCSSSYTAMAADQMV---KVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEQDRGVVSQ
        L   SSS CSSS ++   +        E+EAAEALADLA LA+       S   W +  KGKR RK VK+ESP +D L   P    L      +  +V +
Subjt:  LRSFSSSSCSSSYTAMAADQMV---KVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEQDRGVVSQ

Query:  QPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLF------GCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADL
        +  E+E           T+++ K   + E++  +PK   +  L       GC +SR+NL+EAE+EERR+RRILANRESARQTIRRRQA+CEEL+KKAADL
Subjt:  QPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLF------GCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADL

Query:  AWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTN
         +ENENL+REK+ ALKE+ SLET NK LKEQ+ ++VKP  +E   + + S +++    T  P + Y + P+  + WP     H   +   + S +  PT+
Subjt:  AWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTN

Query:  SNVSD---SSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRN------QQGPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEA
           S    ++  HEN  + NG  T F ++PC W LP  D  N      Q   + + S G++ +D      +   T  S +  R +   S  P T    + 
Subjt:  SNVSD---SSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRN------QQGPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEA

Query:  TDS-TEALNPKNHIQNPVGVAVGGFETDAKQVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR
         +S TE L+             GG                F   +   ++K ++ S+  +G++        + P   H   S+  KK   ++ AAEAR+R
Subjt:  TDS-TEALNPKNHIQNPVGVAVGGFETDAKQVRRMISPVRFKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRR

Query:  RKELTKLKNLHGRQCHM
        RKELT+LKNLHGRQC M
Subjt:  RKELTKLKNLHGRQCHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTTTCTCCTGCTCCCATGGCTTCCACCAAGTGTTCCGACGGGTCCAGTTGTTCGGCCTTGAGGTCTTTTTCTTCGTCTTCTTGTTCCTCGTCTTACACGGCCAT
GGCCGCGGATCAGATGGTGAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGAGCTCAACCGTCGGACAAGAAATGGA
GGACCAAACTTAAGGGGAAACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCGTTCGGATCTGGACCGTCGGATTGAG
CAGGATAGAGGGGTGGTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGATGTTAAAGACGGAGAAGGAGGTCGA
ATTATCTAAAGTGAGTCCTAAATGTACTACAAGCTACCCATTATTTGGCTGCAGGAAGTCGAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAGTACGTAGAA
TTTTAGCAAACAGAGAGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAAAAAGGCTGCTGATTTAGCATGGGAAAATGAAAATTTGAAG
AGGGAGAAGGAGTTGGCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGG
AAACAATAGATCATCTCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGCCCTCCATTTGCGTCGTATTTCTGGCCCTCTAGTCCTTATC
AACATGAACTACCCAACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGACTCTTCCCATGTACATGAAAACTTTACGAACAGCAATGGC
CCGAGTACACCCTTTTGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGGGTCCTCAAGTCTCGTGTTCCACGGGAAATAATCAAGAGGA
TATTTGTTTGAATTCCCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAAAAAAAGGAAGCAACTGATT
CAACCGAAGCTTTGAACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGACTGACGCAAAACAAGTTAGGAGAATGATCTCTCCTGTGAGA
TTTAAATGTATCGAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGTCAACAAGAGCTTGTGCTGACTTCTGTGTTTTTCCAGAAAAGAA
GCATGAAGCAGAGAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGGAGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGGCCGTC
AATGCCACATGCATTCT
mRNA sequenceShow/hide mRNA sequence
CAGCTGCCACGTTGTGGCTTTAACCTCTTTGTTCAGAGCGGGCTCCTTCTAACCTTTATCCAACAAAAATAAATATAATAATAATAATTCTTATTAAAACGAAAACAAAG
AATGAATGCCACGTGGATATTATATTGACAAGCCACACACCGACGCTTTCACTGAGCGTGGCCTACAATTCTCGTACGGCGGACCCTGCCACGTGTAACCTTTCTACGAG
CTCAGAATCTAACTCGTGGCCTACACTGATACATGGAGGCGAGTAAGCTTACGCCAACAACCCCACGCGCTCTCTCTCACTCACTCTTCTTCTTCTTCTTCTTCTTCTTT
GTCCCTTATCTCTGATTTTCGAGCGATGGTTTTTTCTCCTGCTCCCATGGCTTCCACCAAGTGTTCCGACGGGTCCAGTTGTTCGGCCTTGAGGTCTTTTTCTTCGTCTT
CTTGTTCCTCGTCTTACACGGCCATGGCCGCGGATCAGATGGTGAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGA
GCTCAACCGTCGGACAAGAAATGGAGGACCAAACTTAAGGGGAAACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCG
TTCGGATCTGGACCGTCGGATTGAGCAGGATAGAGGGGTGGTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGA
TGTTAAAGACGGAGAAGGAGGTCGAATTATCTAAAGTGAGTCCTAAATGTACTACAAGCTACCCATTATTTGGCTGCAGGAAGTCGAGGCGTAATCTAACTGAGGCTGAA
AAGGAAGAAAGGAGAGTACGTAGAATTTTAGCAAACAGAGAGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAAAAAGGCTGCTGATTT
AGCATGGGAAAATGAAAATTTGAAGAGGGAGAAGGAGTTGGCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAA
AGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATCTCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGCCCTCCATTTGCGTCG
TATTTCTGGCCCTCTAGTCCTTATCAACATGAACTACCCAACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGACTCTTCCCATGTACA
TGAAAACTTTACGAACAGCAATGGCCCGAGTACACCCTTTTGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGGGTCCTCAAGTCTCGT
GTTCCACGGGAAATAATCAAGAGGATATTTGTTTGAATTCCCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAACT
GAAGAAAAAAAGGAAGCAACTGATTCAACCGAAGCTTTGAACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGACTGACGCAAAACAAGT
TAGGAGAATGATCTCTCCTGTGAGATTTAAATGTATCGAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGTCAACAAGAGCTTGTGCTG
ACTTCTGTGTTTTTCCAGAAAAGAAGCATGAAGCAGAGAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGGAGGAGAAAAGAACTAACA
AAGTTAAAGAATCTTCATGGCCGTCAATGCCACATGCATTCT
Protein sequenceShow/hide protein sequence
MVFSPAPMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIE
QDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPKCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLK
REKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNG
PSTPFCLLPCSWLLPHHDRRNQQGPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKQVRRMISPVR
FKCIESTSAVKQDNRSKDDHGLSTRACADFCVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS