; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G18660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G18660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBZIP domain-containing protein
Genome locationClcChr10:32450839..32455215
RNA-Seq ExpressionClc10G18660
SyntenyClc10G18660
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652004.1 uncharacterized protein LOC101210630 isoform X1 [Cucumis sativus]5.2e-22181.44Show/hide
Query:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP
        ASSSKCSDGTT SGL      SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVKTESPTS FADSLP
Subjt:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP

Query:  SRADLNLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQ
        +RADL+LRIE +DRGVV+HQP EKECT QS PE ETTGE+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQ
Subjt:  SRADLNLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQ

Query:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELP
        ALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+RSSHVQMPPLPTN PLFLFSRLPYFWPSVVQ TS YHELP
Subjt:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELP

Query:  NVVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAE
        NVVVVPSSIN PAN+N SVSGSS  QENFTN TG R PLCILPP SWLLPHHDFRNQQSPQIWFPAGN+ E +YSKSQNSA TSK V AESR SSLPSAE
Subjt:  NVVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAE

Query:  EENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTI
        EEN+APDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA    N +EDDHG+SSRTCDDL  FAERRH+PE+ PCKKT+
Subjt:  EENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTI

Query:  DAMAATEARRRRKELTKLKNLYARQCQL
        DAMAATEARRRRKELTKLKNLYARQC++
Subjt:  DAMAATEARRRRKELTKLKNLYARQCQL

XP_011652005.1 uncharacterized protein LOC101210630 isoform X2 [Cucumis sativus]2.1e-22281.59Show/hide
Query:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP
        ASSSKCSDGTT SGL      SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVKTESPTS FADSLP
Subjt:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP

Query:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA
        +RADL+LRIE+DRGVV+HQP EKECT QS PE ETTGE+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQA
Subjt:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN
        LCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+RSSHVQMPPLPTN PLFLFSRLPYFWPSVVQ TS YHELPN
Subjt:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN

Query:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE
        VVVVPSSIN PAN+N SVSGSS  QENFTN TG R PLCILPP SWLLPHHDFRNQQSPQIWFPAGN+ E +YSKSQNSA TSK V AESR SSLPSAEE
Subjt:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE

Query:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID
        EN+APDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA    N +EDDHG+SSRTCDDL  FAERRH+PE+ PCKKT+D
Subjt:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID

Query:  AMAATEARRRRKELTKLKNLYARQCQL
        AMAATEARRRRKELTKLKNLYARQC++
Subjt:  AMAATEARRRRKELTKLKNLYARQCQL

XP_011652006.1 uncharacterized protein LOC101210630 isoform X3 [Cucumis sativus]8.9e-22181.59Show/hide
Query:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP
        ASSSKCSDGTT SGL      SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVKTESPTS FADSLP
Subjt:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP

Query:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA
        +RADL+LRI EDRGVV+HQP EKECT QS PE ETTGE+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQA
Subjt:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN
        LCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+RSSHVQMPPLPTN PLFLFSRLPYFWPSVVQ TS YHELPN
Subjt:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN

Query:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE
        VVVVPSSIN PAN+N SVSGSS  QENFTN TG R PLCILPP SWLLPHHDFRNQQSPQIWFPAGN+ E +YSKSQNSA TSK V AESR SSLPSAEE
Subjt:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE

Query:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID
        EN+APDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA    N +EDDHG+SSRTCDDL  FAERRH+PE+ PCKKT+D
Subjt:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID

Query:  AMAATEARRRRKELTKLKNLYARQCQL
        AMAATEARRRRKELTKLKNLYARQC++
Subjt:  AMAATEARRRRKELTKLKNLYARQCQL

XP_038904850.1 uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida]6.8e-22983.94Show/hide
Query:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSRADL
        MASSSKCS+GT+CS L  SSSSSSSM SS  KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVKTE PTS FADSLPS ADL
Subjt:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSRADL

Query:  NLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEE
        +LRIE +DRGVVRHQP EKECT+QSHPEWETTGE++K DKEAESCK                       AEKEERRVRRILANRESARQTIRRRQALCEE
Subjt:  NLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEE

Query:  LTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVVVV
        LT+KAADLAWENENLKREKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNRSSHVQMPPLPTNYPLFL SRLPYFWPSVVQPT+PYH+LPNVVVV
Subjt:  LTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVVVV

Query:  PSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEENDA
        PSSINLPAN+NVSVSGSSHVQENF + TGPRTPLCILPPCSWLLPHHDFRNQQ+PQIWFPAGNN EDIYSKSQ+SANTSKVV AESR+ SLPSAEEEN+A
Subjt:  PSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEENDA

Query:  PDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAA
        PDLNE PNLN+AS PKDHTQN+VGV VDGFDTN R QVR+VLSPVRLECIE S AVKQ N SEDDH L S+TCDDL DFAERRH+PEI  CKKTIDAMAA
Subjt:  PDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAA

Query:  TEARRRRKELTKLKNLYARQCQL
        TEARRRRKELTKLKNLY RQC++
Subjt:  TEARRRRKELTKLKNLYARQCQL

XP_038904851.1 uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida]2.8e-23084.1Show/hide
Query:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSRADL
        MASSSKCS+GT+CS L  SSSSSSSM SS  KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVKTE PTS FADSLPS ADL
Subjt:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSRADL

Query:  NLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL
        +LRIE+DRGVVRHQP EKECT+QSHPEWETTGE++K DKEAESCK                       AEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  NLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVVVVP
        T+KAADLAWENENLKREKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNRSSHVQMPPLPTNYPLFL SRLPYFWPSVVQPT+PYH+LPNVVVVP
Subjt:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVVVVP

Query:  SSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEENDAP
        SSINLPAN+NVSVSGSSHVQENF + TGPRTPLCILPPCSWLLPHHDFRNQQ+PQIWFPAGNN EDIYSKSQ+SANTSKVV AESR+ SLPSAEEEN+AP
Subjt:  SSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEENDAP

Query:  DLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAAT
        DLNE PNLN+AS PKDHTQN+VGV VDGFDTN R QVR+VLSPVRLECIE S AVKQ N SEDDH L S+TCDDL DFAERRH+PEI  CKKTIDAMAAT
Subjt:  DLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAAT

Query:  EARRRRKELTKLKNLYARQCQL
        EARRRRKELTKLKNLY RQC++
Subjt:  EARRRRKELTKLKNLYARQCQL

TrEMBL top hitse value%identityAlignment
A0A0A0LBD4 BZIP domain-containing protein1.0e-22281.59Show/hide
Query:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP
        ASSSKCSDGTT SGL      SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVKTESPTS FADSLP
Subjt:  ASSSKCSDGTTCSGL------SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLP

Query:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA
        +RADL+LRIE+DRGVV+HQP EKECT QS PE ETTGE+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQA
Subjt:  SRADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQA

Query:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN
        LCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+RSSHVQMPPLPTN PLFLFSRLPYFWPSVVQ TS YHELPN
Subjt:  LCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPN

Query:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE
        VVVVPSSIN PAN+N SVSGSS  QENFTN TG R PLCILPP SWLLPHHDFRNQQSPQIWFPAGN+ E +YSKSQNSA TSK V AESR SSLPSAEE
Subjt:  VVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEE

Query:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID
        EN+APDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA    N +EDDHG+SSRTCDDL  FAERRH+PE+ PCKKT+D
Subjt:  ENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTID

Query:  AMAATEARRRRKELTKLKNLYARQCQL
        AMAATEARRRRKELTKLKNLYARQC++
Subjt:  AMAATEARRRRKELTKLKNLYARQCQL

A0A1S3B649 uncharacterized protein LOC103486593 isoform X37.9e-21580.19Show/hide
Query:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR
        ASSSKCSDGTT S  SSSSSSSSS    M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE KTESP S FADSLP+R
Subjt:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR

Query:  ADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALC
        ADL+LRI EDRGVV+HQP EKECT+QS  E ETT E+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQALC
Subjt:  ADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALC

Query:  EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVV
        EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ SSHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV+
Subjt:  EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVV

Query:  VVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEEN
        VVPSSINLPAN+N SVSGSS  QENFTN TG R P C+LPPCSWLLPHHDFRNQQSPQIWFPAGN+ EDIYSKSQ+SA TSKVV AESR     SAEEEN
Subjt:  VVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEEN

Query:  DAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAM
        DAPDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA K  N +EDDHG+SSRTCDDL  FAERRH+PEI PCKK+IDAM
Subjt:  DAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAM

Query:  AATEARRRRKELTKLKNLYARQCQL
        AATEARRRRKELTK+KNLYARQC++
Subjt:  AATEARRRRKELTKLKNLYARQCQL

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X16.0e-21580.04Show/hide
Query:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR
        ASSSKCSDGTT S  SSSSSSSSS    M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE KTESP S FADSLP+R
Subjt:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR

Query:  ADLNLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQAL
        ADL+LRIE +DRGVV+HQP EKECT+QS  E ETT E+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQAL
Subjt:  ADLNLRIE-EDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQAL

Query:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNV
        CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ SSHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV
Subjt:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNV

Query:  VVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEE
        +VVPSSINLPAN+N SVSGSS  QENFTN TG R P C+LPPCSWLLPHHDFRNQQSPQIWFPAGN+ EDIYSKSQ+SA TSKVV AESR     SAEEE
Subjt:  VVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEE

Query:  NDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDA
        NDAPDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA K  N +EDDHG+SSRTCDDL  FAERRH+PEI PCKK+IDA
Subjt:  NDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDA

Query:  MAATEARRRRKELTKLKNLYARQCQL
        MAATEARRRRKELTK+KNLYARQC++
Subjt:  MAATEARRRRKELTKLKNLYARQCQL

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X22.4e-21680.19Show/hide
Query:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR
        ASSSKCSDGTT S  SSSSSSSSS    M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE KTESP S FADSLP+R
Subjt:  ASSSKCSDGTTCSGLSSSSSSSSS----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSR

Query:  ADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALC
        ADL+LRIE+DRGVV+HQP EKECT+QS  E ETT E+ K+DKEAES K                       AEKEERR+RRILANRESARQTIRRRQALC
Subjt:  ADLNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------------AEKEERRVRRILANRESARQTIRRRQALC

Query:  EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVV
        EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ SSHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV+
Subjt:  EELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVV

Query:  VVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEEN
        VVPSSINLPAN+N SVSGSS  QENFTN TG R P C+LPPCSWLLPHHDFRNQQSPQIWFPAGN+ EDIYSKSQ+SA TSKVV AESR     SAEEEN
Subjt:  VVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEEN

Query:  DAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAM
        DAPDLNE P+L+E+SNPKD TQN+VGV+V+GFDTNAR  VR+VLSPVRLECIE SSA K  N +EDDHG+SSRTCDDL  FAERRH+PEI PCKK+IDAM
Subjt:  DAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAM

Query:  AATEARRRRKELTKLKNLYARQCQL
        AATEARRRRKELTK+KNLYARQC++
Subjt:  AATEARRRRKELTKLKNLYARQCQL

A0A6J1J5U4 uncharacterized protein LOC111481617 isoform X13.1e-18772.55Show/hide
Query:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKG-KGKRARKEVKTESPTSAFADSLPSRAD
        MASSSKCS+ T+CSGLSSSS+ SSS SS     ADQMVKVEIEAAEAL  LAVLAVR++G +PSETKW IKG KGKRARKEVKTESPTSAF DSLPSRAD
Subjt:  MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKG-KGKRARKEVKTESPTSAFADSLPSRAD

Query:  LNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------AEKEERRVRRILANRESARQTIRRRQALCEELTRKAA
        L+LRI++DRGV+ HQP EKEC   SHPEWETT EM+K +KE ES K                 AEKEERR+RR+LANRESARQTIRRRQ LCE+LT+KA+
Subjt:  LNLRIEEDRGVVRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCK-----------------AEKEERRVRRILANRESARQTIRRRQALCEELTRKAA

Query:  DLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLP---YFWPSVVQPTSPYHELPNVVVVPSS
        DLAWENENLKREKELALKEYQSLE TNKELKEQ+A+A +PK+EEIPGNNRSSHVQ PPLPTNYPLF FSR P   YFWPSVVQP+SPYH+L NV VVP S
Subjt:  DLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLP---YFWPSVVQPTSPYHELPNVVVVPSS

Query:  INLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVC-AESRQSSLPSAEEENDAPD
        +  P+N+ V VS SSHVQENFTN TG RTP CI+ PCSWLLPHHD RNQQS Q   PAGN  E IYS SQNSA TSKVV  AESR+SSLPSAEE+N+A D
Subjt:  INLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQSPQIWFPAGNNLEDIYSKSQNSANTSKVVC-AESRQSSLPSAEEENDAPD

Query:  LNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAATE
        LNE P+L      KDHTQN+VGV VD F+ + R +VR+VLSPVRLECIE +S VKQ   SEDD GLSSRTCDDL   AE++H+PE+  CKKTIDAMAATE
Subjt:  LNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAATE

Query:  ARRRRKELTKLKNLYARQCQL
        ARRRRKELTKLKNL+ R C++
Subjt:  ARRRRKELTKLKNLYARQCQL

SwissProt top hitse value%identityAlignment
P23922 Transcription factor HBP-1a2.2e-0438Show/hide
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNNRSSHVQMP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY+ L + N  LK +L E+         P + E    N  SH + P
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNNRSSHVQMP

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein7.4e-4836.08Show/hide
Query:  SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTS------AFADSLPSRADLNLR-IEEDRG
        SS  SSSSS S     AA  M   E+EAAEALA LA LA+       S   WG   KGKR RK VKTESP S        +D+LP+      R ++E+  
Subjt:  SSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTS------AFADSLPSRADLNLR-IEEDRG

Query:  VVRHQPLEKECTSQSHPEWETTGEMMKV--------DKEAESC--------KAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKR
            +P+ KE T ++  + E  GE  K            +  C        +AE+EERR+RRILANRESARQTIRRRQA+CEEL++KAADL +ENENL+R
Subjt:  VVRHQPLEKECTSQSHPEWETTGEMMKV--------DKEAESC--------KAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKR

Query:  EKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPY---FWPSVVQPTSPYHELPNVVVVPSSINLPANSNVSV
        EK+ ALKE+QSLET NK LKEQ+ ++VKP  +E   + + S V+M    T  P + +++ PY    WP V Q ++P         + S +  P +   S 
Subjt:  EKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPY---FWPSVVQPTSPYHELPNVVVVPSSINLPANSNVSV

Query:  SG-SSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRN------QQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPS--AEEENDAPDLNE
           ++   EN  +  G +T   ++ PC W LP  D  N      Q + +  F  G++++D      +SA    V   E+ +S LP+   EE++ +P+   
Subjt:  SG-SSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRN------QQSPQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPS--AEEENDAPDLNE

Query:  PPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAATEARR
          +LNE++         +    DGF             PV     + + ++K  + SE  +G++              H   I+  +K   ++AA EAR+
Subjt:  PPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSEDDHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAATEARR

Query:  RRKELTKLKNLYARQCQLGAG
        RRKELT+LKNL+ RQC++  G
Subjt:  RRKELTKLKNLYARQCQLGAG

AT2G35530.1 basic region/leucine zipper transcription factor 168.5e-0440Show/hide
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNN
        ++E +R RR  +NRESAR++  R+QA C+EL ++A  L  EN NL+ E      + + L T N  LK+QL  ++ P +E I  +N
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCCAAGTGCTCCGACGGAACCACTTGTTCTGGTTTGAGTTCTTCGTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGGATCAGAT
GGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAGACGGGACCTCAACCGTCCGAAACCAAATGGGGGATTAAAGGGAAAG
GGAAACGAGCTAGGAAGGAGGTTAAGACCGAGTCGCCGACTTCTGCCTTTGCCGACTCTTTACCTAGTCGCGCGGATCTGAACCTTCGGATTGAGGAAGATAGAGGGGTG
GTAAGACATCAGCCATTAGAAAAAGAATGTACTAGTCAGTCCCACCCTGAGTGGGAAACAACTGGAGAGATGATGAAGGTAGACAAGGAGGCCGAATCATGTAAAGCTGA
AAAGGAAGAAAGGAGGGTACGAAGGATTTTAGCAAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAGGCTGCTGATC
TAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAGACCACTAACAAGGAACTAAAGGAACAGTTGGCTGAAGCAGTA
AAGCCGAAGGTGGAGGAGATCCCAGGAAACAATAGATCATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCTTCCATATTTCTG
GCCATCTGTGGTTCAACCTACAAGTCCCTATCATGAACTACCCAATGTTGTCGTCGTCCCGTCAAGTATTAATTTGCCTGCAAATAGTAATGTTTCTGTGTCTGGCTCTT
CTCATGTACAAGAAAACTTTACAAACGCCACTGGCCCGAGAACACCCTTGTGTATACTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGT
CCTCAAATCTGGTTTCCCGCTGGAAATAATCTAGAGGATATTTATTCGAAATCCCAAAACAGTGCTAATACTTCAAAGGTTGTGTGTGCAGAAAGCAGACAGTCTTCTTT
GCCTTCAGCTGAAGAAGAAAACGATGCTCCTGACTTGAATGAACCTCCTAATTTAAACGAAGCTTCGAATCCAAAGGATCATACTCAGAACTCAGTTGGAGTATCTGTAG
ATGGATTTGATACCAACGCAAGACCTCAAGTTAGAGAAGTACTTTCTCCTGTAAGACTTGAATGTATCGAATCCAGTTCCGCTGTCAAACAAGGTAACCGGAGCGAAGAT
GATCACGGTCTGTCATCAAGAACTTGTGATGACTTATTTGATTTTGCGGAAAGAAGGCACAAACCAGAGATAGCTCCCTGTAAGAAAACCATAGATGCAATGGCTGCAAC
TGAGGCAAGGAGGCGGAGAAAAGAACTCACAAAGTTAAAGAATCTTTACGCCCGTCAGTGCCAGCTCGGAGCGGGTCAATATCTTTCTGCTGGATGTAGCAGTGGTGGGG
AAGAAAATTTAGGCATCGGAGCACTTTTTTTCGTGCCTGTTTCGATGAATGAAATGAGAGAGGTGGACAAAGATTTTGATTACTTGCAAGAGTTCAATAAAGTGAAGCAG
AAGATTGCATCTTCCAATGGGTATGGAGATCTGCTATCTACTTTATTATGA
mRNA sequenceShow/hide mRNA sequence
TTTCTCTTTCACTCTCTTTACTCTCTTCATTCTTCTTCTTCTTCTTCTTCTCCTCTGTTATGGGTTTTCTCTTCCTCTCATGGCTTCTTCTTCCAAGTGCTCCGACGGAA
CCACTTGTTCTGGTTTGAGTTCTTCGTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCT
CTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAGACGGGACCTCAACCGTCCGAAACCAAATGGGGGATTAAAGGGAAAGGGAAACGAGCTAGGAAGGAGGTTAAGACCGA
GTCGCCGACTTCTGCCTTTGCCGACTCTTTACCTAGTCGCGCGGATCTGAACCTTCGGATTGAGGAAGATAGAGGGGTGGTAAGACATCAGCCATTAGAAAAAGAATGTA
CTAGTCAGTCCCACCCTGAGTGGGAAACAACTGGAGAGATGATGAAGGTAGACAAGGAGGCCGAATCATGTAAAGCTGAAAAGGAAGAAAGGAGGGTACGAAGGATTTTA
GCAAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAGGCTGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGA
AAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAGACCACTAACAAGGAACTAAAGGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACA
ATAGATCATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCTTCCATATTTCTGGCCATCTGTGGTTCAACCTACAAGTCCCTAT
CATGAACTACCCAATGTTGTCGTCGTCCCGTCAAGTATTAATTTGCCTGCAAATAGTAATGTTTCTGTGTCTGGCTCTTCTCATGTACAAGAAAACTTTACAAACGCCAC
TGGCCCGAGAACACCCTTGTGTATACTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAATCTGGTTTCCCGCTGGAAATAATC
TAGAGGATATTTATTCGAAATCCCAAAACAGTGCTAATACTTCAAAGGTTGTGTGTGCAGAAAGCAGACAGTCTTCTTTGCCTTCAGCTGAAGAAGAAAACGATGCTCCT
GACTTGAATGAACCTCCTAATTTAAACGAAGCTTCGAATCCAAAGGATCATACTCAGAACTCAGTTGGAGTATCTGTAGATGGATTTGATACCAACGCAAGACCTCAAGT
TAGAGAAGTACTTTCTCCTGTAAGACTTGAATGTATCGAATCCAGTTCCGCTGTCAAACAAGGTAACCGGAGCGAAGATGATCACGGTCTGTCATCAAGAACTTGTGATG
ACTTATTTGATTTTGCGGAAAGAAGGCACAAACCAGAGATAGCTCCCTGTAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCAAGGAGGCGGAGAAAAGAACTCACA
AAGTTAAAGAATCTTTACGCCCGTCAGTGCCAGCTCGGAGCGGGTCAATATCTTTCTGCTGGATGTAGCAGTGGTGGGGAAGAAAATTTAGGCATCGGAGCACTTTTTTT
CGTGCCTGTTTCGATGAATGAAATGAGAGAGGTGGACAAAGATTTTGATTACTTGCAAGAGTTCAATAAAGTGAAGCAGAAGATTGCATCTTCCAATGGGTATGGAGATC
TGCTATCTACTTTATTATGA
Protein sequenceShow/hide protein sequence
MASSSKCSDGTTCSGLSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKTESPTSAFADSLPSRADLNLRIEEDRGV
VRHQPLEKECTSQSHPEWETTGEMMKVDKEAESCKAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAV
KPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRLPYFWPSVVQPTSPYHELPNVVVVPSSINLPANSNVSVSGSSHVQENFTNATGPRTPLCILPPCSWLLPHHDFRNQQS
PQIWFPAGNNLEDIYSKSQNSANTSKVVCAESRQSSLPSAEEENDAPDLNEPPNLNEASNPKDHTQNSVGVSVDGFDTNARPQVREVLSPVRLECIESSSAVKQGNRSED
DHGLSSRTCDDLFDFAERRHKPEIAPCKKTIDAMAATEARRRRKELTKLKNLYARQCQLGAGQYLSAGCSSGGEENLGIGALFFVPVSMNEMREVDKDFDYLQEFNKVKQ
KIASSNGYGDLLSTLL