CuGenDBv2

CsGy3G032180 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G032180
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: BZIP domain-containing protein
Genome location: Gy14Chr3:31589400..31594005
RNA-Seq Expression: CsGy3G032180
Synteny: CsGy3G032180
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like
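The genome location field above follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is shown below; the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a location string such as 'Gy14Chr3:31589400..31594005'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["chrom"], int(match["start"]), int(match["end"]))


loc = parse_location("Gy14Chr3:31589400..31594005")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4606 bp on Gy14Chr3.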


Homology
GenBank top hits (e value, %identity, alignment)
XP_008442819.2 PREDICTED: uncharacterized protein LOC103486593 isoform X1 [Cucumis melo] (e value: 0.0; %identity: 92.59)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKGKRARKE KTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE
        P SGFADSLPARADLDLRIEQ DRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRE
Subjt:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE

Query:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV
        SARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVV
Subjt:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV

Query:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE
        QSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AE
Subjt:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE

Query:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE
        SRHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHE
Subjt:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE

Query:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        PE+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

XP_008442820.2 PREDICTED: uncharacterized protein LOC103486593 isoform X2 [Cucumis melo] (e value: 0.0; %identity: 92.76)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKGKRARKE KTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        P SGFADSLPARADLDLRIEQDRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        E+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

XP_011652004.1 uncharacterized protein LOC101210630 isoform X1 [Cucumis sativus] (e value: 0.0; %identity: 99.81)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE
        PTSGFADSLPARADLDLRIEQ DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE
Subjt:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE

Query:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV
        SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV
Subjt:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV

Query:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE
        QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE
Subjt:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE

Query:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE
        SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE
Subjt:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE

Query:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
Subjt:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

XP_011652005.1 uncharacterized protein LOC101210630 isoform X2 [Cucumis sativus] (e value: 0.0; %identity: 100)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

XP_011652006.1 uncharacterized protein LOC101210630 isoform X3 [Cucumis sativus] (e value: 0.0; %identity: 99.81)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        PTSGFADSLPARADLDLRIE DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LBD4 BZIP domain-containing protein (e value: 0.0; %identity: 100)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

A0A1S3B649 uncharacterized protein LOC103486593 isoform X3 (e value: 0.0; %identity: 92.58)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKGKRARKE KTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        P SGFADSLPARADLDLRIE DRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        E+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X1 (e value: 0.0; %identity: 92.59)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKGKRARKE KTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE
        P SGFADSLPARADLDLRIEQ DRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRE
Subjt:  PTSGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRE

Query:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV
        SARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVV
Subjt:  SARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVV

Query:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE
        QSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AE
Subjt:  QSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAE

Query:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE
        SRHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHE
Subjt:  SRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHE

Query:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        PE+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  PEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X2 (e value: 0.0; %identity: 92.76)
Query:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES
        MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKGKRARKE KTES
Subjt:  MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTES

Query:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES
        P SGFADSLPARADLDLRIEQDRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRES
Subjt:  PTSGFADSLPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQ

Query:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES
        STSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AES
Subjt:  STSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAES

Query:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP
        RHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHEP
Subjt:  RHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEP

Query:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
        E+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  EVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS

A0A5A7TM13 BZIP transcription factor family protein (e value: 5.49e-245; %identity: 92.71)
Query:  MDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELK
        MDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELK
Subjt:  MDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELK

Query:  EQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCIL
        EQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFSR PYFWPSVVQSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+L
Subjt:  EQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCIL

Query:  PPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAP
        PP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+SAITSK V AESRHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAP
Subjt:  PPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAP

Query:  VRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEPEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQC
        VRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDDLCYFAERRHEPE+VPCKK++DAMAATEARRRRKELTK+KNLYARQC
Subjt:  VRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEPEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQC

SwissProt top hits (e value, %identity, alignment)
P23922 Transcription factor HBP-1a (e value: 7.9e-04; %identity: 37)
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNHRSSHVQMP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY+ L + N  LK +L E+         P + E    +  SH + P
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNHRSSHVQMP

Arabidopsis top hits (e value, %identity, alignment)
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein (e value: 4.0e-51; %identity: 38.1)
Query:  SSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADSLPARADLDLRIEQDRGVVKHQP
        SSS  SSSSS S     AA  M   E+EAAEALA LA LA+           WG   KGKR RK VKTESP S   DSL    D D     D    +   
Subjt:  SSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADSLPARADLDLRIEQDRGVVKHQP

Query:  SEKECTIQSQPEPETTGEVTKMDKEAESSKVSP---------ACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAAD
         E+E   + + EP  T E+TK   ++E +  +P          C+ S    GC RSR+ L+EAE+EERRIRRILANRESARQTIRRRQA+CEEL++KAAD
Subjt:  SEKECTIQSQPEPETTGEVTKMDKEAESSKVSP---------ACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAAD

Query:  LAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPY---FWPSVVQSTSSYHELPNVVVVPSSI
        L +ENENL+REK+ ALKE+QSLET NK LKEQ+ ++VKP  +E   + + S V+M    T  P + +++ PY    WP V QS+       N ++ P   
Subjt:  LAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPY---FWPSVVQSTSSYHELPNVVVVPSSI

Query:  NPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKS---QNSAITSKDVRAESRHSSLPS--AEEENE
         P +   ++ + ++Q  EN  +  G +    ++ P  W LP  D  N     + F   + Q G +S      +S+    DV  E+  S LP+   EE++ 
Subjt:  NPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKS---QNSAITSKDVRAESRHSSLPS--AEEENE

Query:  APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEPEVVPCKKTVDAMA
        +P+      L+ES+         +    +GF    +A   K                         H   S T + +       H    +P KK   ++A
Subjt:  APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEPEVVPCKKTVDAMA

Query:  ATEARRRRKELTKLKNLYARQCRMQ
        A EAR+RRKELT+LKNL+ RQCRMQ
Subjt:  ATEARRRRKELTKLKNLYARQCRMQ

AT2G35530.1 basic region/leucine zipper transcription factor 16 (e value: 8.1e-04; %identity: 42.25)
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLA
        ++E +R RR  +NRESAR++  R+QA C+EL ++A  L  EN NL+ E      + + L T N  LK+QL+
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKELKEQLA
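In the alignments above, the middle line between each Query/Subjt pair follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below shows one way a percent-identity figure could be derived from such a middle line. The function name `percent_identity` is hypothetical, and the calculation assumes identity is defined as identical positions divided by total alignment columns; the database may compute its reported values differently.

```python
from typing import Optional


def percent_identity(midline: str, alignment_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters in the middle line mark identical residues; '+' and spaces do not.
    Pass alignment_length (for example, the length of the Query row) when
    trailing spaces of the middle line may have been stripped.
    """
    columns = alignment_length if alignment_length is not None else len(midline)
    if columns <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / columns


# Toy example: five of six columns are identical.
print(round(percent_identity("MGS PL", 6), 2))
```

For a multi-block alignment, the identical counts and column counts of all blocks would be summed before dividing.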


Sequences
CDS sequence
ATGGGTTCTCTTCCTCTTACTCTTATGGCGGCTTCTTCTTCCAAGTGCTCCGATGGGACCACTTCTTCTGGTTTGAGTTCTTCTTCTTCTTCTTCTTCCTCTTCTTCCTC
TTCTTCCTCTATGTCCTCTTCTATGGCTAAGGCGGCGGATCAGATGGTTAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCTGGTTTGGCGGTTTTGGCTGTTAGAGAAA
CTGGAACTCAACCCTTCCAAACTAAATGGGGGATTAAAGGGAAAGGGAAACGAGCTAGGAAGGAGGTTAAGACCGAATCGCCTACTTCTGGCTTTGCCGACTCTTTGCCT
GCTCGTGCGGATCTGGACCTTCGGATTGAGCAGGATAGAGGGGTGGTAAAACATCAACCATCAGAGAAAGAGTGTACTATTCAGTCCCAGCCTGAGCCAGAAACAACCGG
AGAGGTGACAAAGATGGACAAGGAGGCTGAATCGTCTAAAGTGAGTCCTGCATGTACTACAAGCTACCAGTTTTTTGGCTGCAGGAGGTCAAGGCGTACTCTAACTGAGG
CTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTAGCGAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTATGCGAGGAATTAACTAGAAAGGCTGCT
GATCTGGCATGGGAAAATGAAAATCTAAAGAGGGAAAAGGAGGTGGCCCTGAAAGAGTACCAGTCCCTGGAGACTACTAACAAGGAATTAAAGGAACAGTTGGCTGAAGC
AGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACCATAGATCGTCTCATGTTCAGATGCCTCCTTTACCTACCAACTGCCCTCTTTTCTTGTTTAGTCGCCTTCCATATT
TCTGGCCATCTGTGGTTCAATCTACAAGTTCCTATCATGAACTACCCAATGTCGTCGTTGTCCCGTCAAGTATTAATCCTCCTGCTAATAATAATGCTTCTGTGTCTGGC
TCTTCCCAGACACAAGAAAACTTTACGAATGGCACCGGCTCGAGAGCACCATTGTGCATATTACCACCTTATTCTTGGTTGTTACCTCATCATGATTTTAGGAACCAACA
GAGTCCTCAAATCTGGTTTCCTGCTGGAAATGATCAAGAGGGTGTTTATTCGAAATCCCAAAATAGTGCTATTACTTCAAAAGATGTCCGTGCAGAAAGCAGACATTCTT
CTTTGCCTTCAGCCGAAGAAGAAAACGAAGCTCCTGACTTGAATGAAGCTCCCAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAGAACACAGTTGGAGTAGCT
GTGGAGGGATTCGATACCAATGCAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTATTGAACCCAGTTCCGCTGCCACACTAGATAACTGGAACGA
AGATGATCATGGTGTGTCATCAAGAACGTGTGATGATTTGTGTTATTTTGCAGAAAGAAGGCATGAACCAGAGGTAGTCCCTTGTAAGAAAACCGTAGATGCAATGGCCG
CAACTGAGGCCAGGAGGAGGAGGAAAGAACTAACAAAGTTAAAGAACCTTTACGCCCGTCAATGCCGTATGCAATCTTGA
mRNA sequence
GAGACGCGCAAGCTTTCACTCATCATCCACACGCGCTCTCTTTCTCTTTCACTCTCTCCATTCTTCTACTTCTCCTCTGCTATGGGTTCTCTTCCTCTTACTCTTATGGC
GGCTTCTTCTTCCAAGTGCTCCGATGGGACCACTTCTTCTGGTTTGAGTTCTTCTTCTTCTTCTTCTTCCTCTTCTTCCTCTTCTTCCTCTATGTCCTCTTCTATGGCTA
AGGCGGCGGATCAGATGGTTAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCTGGTTTGGCGGTTTTGGCTGTTAGAGAAACTGGAACTCAACCCTTCCAAACTAAATGG
GGGATTAAAGGGAAAGGGAAACGAGCTAGGAAGGAGGTTAAGACCGAATCGCCTACTTCTGGCTTTGCCGACTCTTTGCCTGCTCGTGCGGATCTGGACCTTCGGATTGA
GCAGGATAGAGGGGTGGTAAAACATCAACCATCAGAGAAAGAGTGTACTATTCAGTCCCAGCCTGAGCCAGAAACAACCGGAGAGGTGACAAAGATGGACAAGGAGGCTG
AATCGTCTAAAGTGAGTCCTGCATGTACTACAAGCTACCAGTTTTTTGGCTGCAGGAGGTCAAGGCGTACTCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGG
ATTTTAGCGAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTATGCGAGGAATTAACTAGAAAGGCTGCTGATCTGGCATGGGAAAATGAAAATCTAAA
GAGGGAAAAGGAGGTGGCCCTGAAAGAGTACCAGTCCCTGGAGACTACTAACAAGGAATTAAAGGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAG
GAAACCATAGATCGTCTCATGTTCAGATGCCTCCTTTACCTACCAACTGCCCTCTTTTCTTGTTTAGTCGCCTTCCATATTTCTGGCCATCTGTGGTTCAATCTACAAGT
TCCTATCATGAACTACCCAATGTCGTCGTTGTCCCGTCAAGTATTAATCCTCCTGCTAATAATAATGCTTCTGTGTCTGGCTCTTCCCAGACACAAGAAAACTTTACGAA
TGGCACCGGCTCGAGAGCACCATTGTGCATATTACCACCTTATTCTTGGTTGTTACCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAATCTGGTTTCCTGCTGGAA
ATGATCAAGAGGGTGTTTATTCGAAATCCCAAAATAGTGCTATTACTTCAAAAGATGTCCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAGCCGAAGAAGAAAACGAA
GCTCCTGACTTGAATGAAGCTCCCAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAGAACACAGTTGGAGTAGCTGTGGAGGGATTCGATACCAATGCAAGAGC
TCCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTATTGAACCCAGTTCCGCTGCCACACTAGATAACTGGAACGAAGATGATCATGGTGTGTCATCAAGAACGT
GTGATGATTTGTGTTATTTTGCAGAAAGAAGGCATGAACCAGAGGTAGTCCCTTGTAAGAAAACCGTAGATGCAATGGCCGCAACTGAGGCCAGGAGGAGGAGGAAAGAA
CTAACAAAGTTAAAGAACCTTTACGCCCGTCAATGCCGTATGCAATCTTGATCCACATGGCCAGTTAGTTTGGCAACAGTTTGCTGTCACACAGCAATCTTATGTGTTAA
AGTCCTGTATTCCATTGGCTTTTGTTGCCAGAGGCAGGCCACAGAGCAATGAGACGTAGCCAGAGTTCTGACTTCACTTCAGCTTTCTTGTCCAGGCGTACATTGTGCAT
TCCAGAGCTCATAGCACATCAAGATCTTGGTGCTGGAGATAGGTAGTGGTAAAGATGATTTAGGCACAATTTGCTTACAAATGAAATGAGAGAGGTACTAGGACTAAAAC
GCTGATTAGTTCAAAGAGCTTTGAATTGGATTTGGTAGGGCAACATAGAAGAGGAGGTAAGTGTAAGAATTTTGTATTTAAATATTTCTATGGTTACTTCGAAGACCCTT
GTTATTGGGTTGGCTTACACTTCTTCAATGAAGTGAAGCATAAGAATGGAGTGTATGGAGATCTGTTAGTTACTTTATTAAGAAGCAAGGAACAGTTAGGGGGAAAAAAA
ATGTAAAAAGCCAAATGGAAATCTGACACTTCTCTAAAAAAC
Protein sequence
MGSLPLTLMAASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADSLP
ARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVTKMDKEAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAA
DLAWENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSG
SSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAITSKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVA
VEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSRTCDDLCYFAERRHEPEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS
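The protein sequence above corresponds to a translation of the CDS sequence. As a quick consistency check, a CDS can be translated with the standard genetic code using only the Python standard library. This is a minimal sketch: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear code applies and that the input is in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; '*' marks a stop, 'X' an unrecognised codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First ten codons of the CDS listed above.
print(translate("ATGGGTTCTCTTCCTCTTACTCTTATGGCG"))  # MGSLPLTLMA
```

Translating the full CDS and removing the trailing `*` should give a string that can be compared directly with the listed protein sequence.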