CuGenDBv2

MS019547 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019547
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: BZIP domain-containing protein
Genome location: scaffold729:348904..352481
RNA-Seq Expression: MS019547
Synteny: MS019547
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like
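
The genome location above uses a `scaffold:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end = parse_location("scaffold729:348904..352481")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3578 bp on scaffold729.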


Homology
GenBank top hits (e value, % identity, alignment)
KAG6580686.1 hypothetical protein SDJN03_20688, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.8e-183; % identity: 70.38)
Query:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR
        +S+KCS+ +SCS L   SSS+ SSS ++M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD 
Subjt:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR

Query:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT
        RI+ QDRGV+S QPSEKEC + S  + ETT++M+K EKE E  K+      S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LT
Subjt:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT

Query:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV
        KKA+DLAWENENLKREKELALKEY SLE TNKELKEQ+AQA +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV 
Subjt:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV

Query:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK
        V+P S+  P+N+   VSDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQS Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS  EK 
Subjt:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK

Query:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR
        EA D  EA + K H QN VGV V  FE D + QVR+++SPVRL+CIE TS VKQD  S+DD GLS+R C D+C   EKKHE E VS KKTIDAM A EAR
Subjt:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR

Query:  RRRKELTKLKNLHGRQCHMH
        RRRKELTKLKNLH R C MH
Subjt:  RRRKELTKLKNLHGRQCHMH

KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.9e-184; % identity: 70.58)
Query:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR
        +S+KCS+ +SCS L   SSS+ SSS ++M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD 
Subjt:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR

Query:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT
        RI  QDRGV+S QPSEKEC + S  + ETT++M+K EKE E  K+      S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LT
Subjt:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT

Query:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV
        KKA+DLAWENENLKREKELALKEY SLE TNKELKEQ+AQA +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV 
Subjt:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV

Query:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK
        V+P S+  P+N+   VSDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQS Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK 
Subjt:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK

Query:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR
        EA D  EA + K H QN VGV V  FE D + QVR+++SPVRL+CIE TS VKQD  S+DD GLS+R C D+C   EKKHE E VS KKTIDAM A EAR
Subjt:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR

Query:  RRRKELTKLKNLHGRQCHMH
        RRRKELTKLKNLH R C MH
Subjt:  RRRKELTKLKNLHGRQCHMH

XP_022144690.1 uncharacterized protein LOC111014317 isoform X1 [Momordica charantia] (e value: 1.5e-280; % identity: 99.03)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE QDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSP CTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

XP_022144691.1 uncharacterized protein LOC111014317 isoform X2 [Momordica charantia] (e value: 5.9e-280; % identity: 98.83)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE  DRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSP CTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

XP_022144692.1 uncharacterized protein LOC111014317 isoform X3 [Momordica charantia] (e value: 6.5e-263; % identity: 94.76)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE QDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CSC2 uncharacterized protein LOC111014317 isoform X1 (e value: 7.5e-281; % identity: 99.03)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE QDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSP CTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

A0A6J1CU60 uncharacterized protein LOC111014317 isoform X2 (e value: 2.8e-280; % identity: 98.83)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE  DRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSP CTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

A0A6J1CUE2 uncharacterized protein LOC111014317 isoform X3 (e value: 3.2e-263; % identity: 94.76)
Query:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
        PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD
Subjt:  PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLD

Query:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL
        RRIE QDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSK                       AEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  RRIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
        TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS
Subjt:  TKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPS

Query:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
        SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQ PQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST
Subjt:  SIHLPTNSNVSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDST

Query:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
        EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVR KCIESTSAVKQDNRSKDDHGLSTRACAD CVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL
Subjt:  EALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKEL

Query:  TKLKNLHGRQCHMHS
        TKLKNLHGRQCHMHS
Subjt:  TKLKNLHGRQCHMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X1 (e value: 5.5e-183; % identity: 70.19)
Query:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR
        +S+KCS+ +SCS L S S+ S SSS  +M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD 
Subjt:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR

Query:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT
        RI+ QDRGV+S  PSEKEC + S  + ETT++M+K EKE E  K+      S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LT
Subjt:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT

Query:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV
        KKA+DLAWENENLKREKELALKEY SLE TNKELKEQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV 
Subjt:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV

Query:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK
        V+P S+  P+N+   VSDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQS Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK 
Subjt:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK

Query:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR
        EA D  EA + K H QN VGV V  FE D + QVR+++SPVRL+CIE TS VKQD  S+DD GLS+R C D+C   EKKHE E VS KKTIDAM A EAR
Subjt:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR

Query:  RRRKELTKLKNLHGRQCHMH
        RRRKELTKLKNLH R C MH
Subjt:  RRRKELTKLKNLHGRQCHMH

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X2 (e value: 9.4e-183; % identity: 70.19)
Query:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR
        +S+KCS+ +SCS L S S+ S SSS  +M ADQMVKVEIEAAEALADLA LAVR+SG QPS+ KWR K  KGKRARK+VK+ESPT+ F+DSLPSR+DLD 
Subjt:  ASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTK-LKGKRARKDVKSESPTNDFLDSLPSRSDLDR

Query:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT
        RI  QDRGV+S  PSEKEC + S  + ETT++M+K EKE E  K+      S+PLFGCR+SRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LT
Subjt:  RIEVQDRGVVSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT

Query:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV
        KKA+DLAWENENLKREKELALKEY SLE TNKELKEQ+A A +PK+EEIPGNNRSSH+Q PPLPTNYPLFL+ RPP+ASYFW     PSSPY H+L NV 
Subjt:  KKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFW-----PSSPYQHELPNVV

Query:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK
        V+P S+  P+N+   VSDSSHV ENFTN  G  TPFC++PCSWLLPHHD RNQQS Q SC  GN QE I  NSQNS +TSKV VRAESRHSSLPS EEK 
Subjt:  VIPSSIHLPTNSN--VSDSSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKK

Query:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR
        EA D  EA + K H QN VGV V  FE D + QVR+++SPVRL+CIE TS VKQD  S+DD GLS+R C D+C   EKKHE E VS KKTIDAM A EAR
Subjt:  EATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEAR

Query:  RRRKELTKLKNLHGRQCHMH
        RRRKELTKLKNLH R C MH
Subjt:  RRRKELTKLKNLHGRQCHMH

SwissProt top hits (e value, % identity, alignment)
P23922 Transcription factor HBP-1a (e value: 5.8e-04; % identity: 36)
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY  L + N  LK ++ ++         P + E    N  SH + P
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYHSLETTNKELKEQMAQA-------VKPKVEEIPGNNRSSHLQIP

Arabidopsis top hits (e value, % identity, alignment)
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein (e value: 5.5e-58; % identity: 36.7)
Query:  LRSFSSSSCSSSYTAMAADQMV---KVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEVQDRGVVS
        L   SSS CSSS ++   +        E+EAAEALADLA LA+       S   W +  KGKR RK VK+ESP +D L   P  SD     ++ +  +V 
Subjt:  LRSFSSSSCSSSYTAMAADQMV---KVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEVQDRGVVS

Query:  QQPSEKECTNQSRSDCETTRKMLKTEKEVELSK--VSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWE
        ++  E+E    ++   E T+  +K+E   E  K  ++ T        GC +SR+NL+EAE+EERR+RRILANRESARQTIRRRQA+CEEL+KKAADL +E
Subjt:  QQPSEKECTNQSRSDCETTRKMLKTEKEVELSK--VSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWE

Query:  NENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNV
        NENL+REK+ ALKE+ SLET NK LKEQ+ ++VKP  +E   + + S +++    T  P + Y + P+  + WP     H   +   + S +  PT+   
Subjt:  NENLKREKELALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNV

Query:  SD---SSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRN------QQSPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEATDS
        S    ++  HEN  + NG  T F ++PC W LP  D  N      Q + + + S G++ +D      +   T  S +  R +   S  P T    +  +S
Subjt:  SD---SSHVHENFTNSNGPSTPFCLLPCSWLLPHHDRRN------QQSPQVSCSTGNNQEDICLNSQNSCHT--SKVAVRAESRHSSLPSTEEKKEATDS

Query:  -TEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRK
         TE L+              GF    +A                  ++K ++ S+  +G        + + P   H   S+  KK   ++ AAEAR+RRK
Subjt:  -TEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCIESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRK

Query:  ELTKLKNLHGRQCHM
        ELT+LKNLHGRQC M
Subjt:  ELTKLKNLHGRQCHM
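
The alignments above appear to follow the BLAST-style pairwise layout, in which the middle line repeats a residue where query and subject match, shows `+` for a conservative substitution, and is blank otherwise. Assuming that convention, a per-block identity can be sketched as below. `block_identity` is a hypothetical helper, and the % identity reported for each hit covers the whole alignment rather than a single block.

```python
def block_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter."""
    if not query or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equal in length")
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Final block of the first GenBank hit (KAG6580686.1).
query = "RRRKELTKLKNLHGRQCHMH"
midline = "RRRKELTKLKNLH R C MH"
print(round(block_identity(query, midline), 1))
```

For this 20-column block, 17 columns are identical, so the block identity is 85.0%.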


Sequences
CDS sequence
CCCATGGCTTCCACCAAGTGTTCCGACGGGTCCAGTTGTTCGGCCTTGAGGTCTTTTTCTTCGTCTTCTTGTTCCTCGTCTTACACGGCCATGGCCGCGGATCAGATGGT
GAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGAGCTCAACCGTCGGACAAGAAATGGAGGACCAAACTCAAGGGGA
AACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCGTTCGGATCTGGACCGTCGGATTGAGGTTCAGGATAGAGGGGTG
GTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGATGTTAAAGACGGAGAAGGAGGTCGAATTATCTAAAGTGAG
TCCTACATGTACTACAAGCTACCCATTATTTGGCTGCAGGAAGTCGAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAGTACGAAGAATTTTAGCAAACAGAG
AGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAAAAAGGCTGCTGATTTAGCATGGGAAAATGAAAATTTGAAGAGGGAGAAGGAGTTG
GCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATC
TCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGCCCTCCATTTGCGTCGTATTTCTGGCCCTCTAGTCCTTATCAACATGAACTACCCA
ACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGACTCTTCCCATGTACATGAAAACTTTACGAACAGCAATGGCCCGAGTACACCCTTT
TGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGAGTCCTCAAGTCTCGTGTTCCACGGGAAATAATCAAGAGGATATTTGTTTGAATTC
CCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAAAAAAAGGAAGCAACTGATTCAACCGAAGCTTTGA
ACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGACTGACGCAAAAGCTCAAGTTAGGAGAATGATCTCTCCTGTGAGACTTAAATGTATC
GAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGTCAACAAGAGCTTGTGCTGACATCTGTGTTTTTCCAGAAAAGAAGCATGAAGCAGA
GAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGGAGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGGCCGTCAATGCCACATGC
ATTCT
mRNA sequence
CCCATGGCTTCCACCAAGTGTTCCGACGGGTCCAGTTGTTCGGCCTTGAGGTCTTTTTCTTCGTCTTCTTGTTCCTCGTCTTACACGGCCATGGCCGCGGATCAGATGGT
GAAGGTTGAGATTGAGGCGGCGGAGGCTCTGGCGGATTTGGCGGCTTTGGCAGTGAGAGAGAGTGGAGCTCAACCGTCGGACAAGAAATGGAGGACCAAACTCAAGGGGA
AACGAGCCAGGAAGGATGTTAAGAGCGAGTCGCCGACCAATGACTTCCTGGACTCTCTACCGAGTCGTTCGGATCTGGACCGTCGGATTGAGGTTCAGGATAGAGGGGTG
GTAAGTCAGCAGCCATCAGAAAAGGAATGTACAAATCAATCGCGCTCCGATTGCGAAACAACGAGAAAGATGTTAAAGACGGAGAAGGAGGTCGAATTATCTAAAGTGAG
TCCTACATGTACTACAAGCTACCCATTATTTGGCTGCAGGAAGTCGAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAGTACGAAGAATTTTAGCAAACAGAG
AGTCTGCCCGGCAAACAATTCGACGTAGGCAGGCTCTGTGCGAGGAGTTAACCAAAAAGGCTGCTGATTTAGCATGGGAAAATGAAAATTTGAAGAGGGAGAAGGAGTTG
GCCCTGAAAGAGTACCACTCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATC
TCATCTCCAGATACCTCCTTTACCCACCAACTACCCTCTTTTTTTGTACGGTCGCCCTCCATTTGCGTCGTATTTCTGGCCCTCTAGTCCTTATCAACATGAACTACCCA
ACGTCGTTGTCATTCCGTCAAGTATTCATTTGCCAACCAATAGTAATGTTTCTGACTCTTCCCATGTACATGAAAACTTTACGAACAGCAATGGCCCGAGTACACCCTTT
TGTTTACTACCTTGTTCGTGGTTGTTACCTCATCATGACCGTAGGAATCAACAGAGTCCTCAAGTCTCGTGTTCCACGGGAAATAATCAAGAGGATATTTGTTTGAATTC
CCAAAATAGTTGTCATACTTCAAAGGTGGCTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAAAAAAAGGAAGCAACTGATTCAACCGAAGCTTTGA
ACCCCAAGAATCATATTCAGAACCCAGTTGGGGTAGCTGTGGGGGGATTTGAGACTGACGCAAAAGCTCAAGTTAGGAGAATGATCTCTCCTGTGAGACTTAAATGTATC
GAATCCACATCTGCTGTCAAGCAAGACAATCGGAGCAAAGACGATCACGGTCTGTCAACAAGAGCTTGTGCTGACATCTGTGTTTTTCCAGAAAAGAAGCATGAAGCAGA
GAGTGTCTCCAGTAAGAAAACCATAGATGCAATGGTTGCAGCCGAGGCAAGGAGGAGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGGCCGTCAATGCCACATGC
ATTCT
Protein sequence
PMASTKCSDGSSCSALRSFSSSSCSSSYTAMAADQMVKVEIEAAEALADLAALAVRESGAQPSDKKWRTKLKGKRARKDVKSESPTNDFLDSLPSRSDLDRRIEVQDRGV
VSQQPSEKECTNQSRSDCETTRKMLKTEKEVELSKVSPTCTTSYPLFGCRKSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKEL
ALKEYHSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHLQIPPLPTNYPLFLYGRPPFASYFWPSSPYQHELPNVVVIPSSIHLPTNSNVSDSSHVHENFTNSNGPSTPF
CLLPCSWLLPHHDRRNQQSPQVSCSTGNNQEDICLNSQNSCHTSKVAVRAESRHSSLPSTEEKKEATDSTEALNPKNHIQNPVGVAVGGFETDAKAQVRRMISPVRLKCI
ESTSAVKQDNRSKDDHGLSTRACADICVFPEKKHEAESVSSKKTIDAMVAAEARRRRKELTKLKNLHGRQCHMHS
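
As a quick consistency check, the CDS can be translated codon by codon and compared with the protein sequence listed above. The sketch below uses the standard genetic code. `translate` is a hypothetical helper, and it assumes the CDS is in frame from its first base, which the leading `CCC` codon (proline, matching the initial `P` of the protein) suggests.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 18 bases of the CDS sequence in this record.
print(translate("CCCATGGCTTCCACCAAG"))  # PMASTK
```

The first six residues, `PMASTK`, agree with the start of the protein sequence; running the full CDS through the same function would extend the check to the whole record.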