CuGenDBv2

HG10020135 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020135
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: BZIP transcription factor family protein
Genome location: Chr04:29074371..29078210
RNA-Seq Expression: HG10020135
Synteny: HG10020135
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like
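
The genome location string in this record can be split into its parts programmatically. The following is a minimal Python sketch, assuming the `Chr:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the record does not state the convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr04:29074371..29078210' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr04:29074371..29078210")
# Assumption: both ends are included, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr04 29074371 29078210 3840
```

Under that inclusive-coordinate assumption the gene region spans 3840 bp, which is longer than the 1506 nt CDS listed under Sequences.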


Homology
GenBank top hits (e value, % identity, alignment)
XP_011652004.1 uncharacterized protein LOC101210630 isoform X1 [Cucumis sativus]; e value: 9.3e-222; % identity: 81.13
Query:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVK ESPTS FADSLP
Subjt:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ
        +RADLDLRIEQ+D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQ
Subjt:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ

Query:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP
        ALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R+SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELP
Subjt:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP

Query:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE
        NV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ+WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAE
Subjt:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE

Query:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI
        EE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA   +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+
Subjt:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI

Query:  DAMAATEARRRRKELTKLKNLYTRQFRMHS
        DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  DAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_011652005.1 uncharacterized protein LOC101210630 isoform X2 [Cucumis sativus]; e value: 3.0e-220; % identity: 81.13
Query:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVK ESPTS FADSLP
Subjt:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ
        +RADLDLRIEQ D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQ
Subjt:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ

Query:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP
        ALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R+SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELP
Subjt:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP

Query:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE
        NV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ+WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAE
Subjt:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE

Query:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI
        EE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA   +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+
Subjt:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI

Query:  DAMAATEARRRRKELTKLKNLYTRQFRMHS
        DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  DAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_011652006.1 uncharacterized protein LOC101210630 isoform X3 [Cucumis sativus]; e value: 1.5e-219; % identity: 80.94
Query:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVK ESPTS FADSLP
Subjt:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ
        +RADLDLRI  ED GVV+HQPSEKECT QS PE ETTGE+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQ
Subjt:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ

Query:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP
        ALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R+SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELP
Subjt:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP

Query:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE
        NV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ+WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAE
Subjt:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE

Query:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI
        EE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA   +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+
Subjt:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI

Query:  DAMAATEARRRRKELTKLKNLYTRQFRMHS
        DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  DAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_038904850.1 uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida]; e value: 9.9e-232; % identity: 84.35
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD
        MASSSKCS+  +CS LSSSSSSS +SS   KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVK E PTS FADSLPS ADLD
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD

Query:  LRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL
        LRIEQ+D GVVRHQPSEKECTNQSHPEWETTGE+IKADKEAES K                       AEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  LRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVP
        T+KAADLAWENENLKREKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNR+SHVQMPPLPTNYPLFL SR+PYFWPSVVQPT+PYH+LPNV+VVP
Subjt:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVP

Query:  SSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP
        SSINLPANNNVSVSGSSHVQENF +VTGPRTPLCI+PPCSWLLPHHDFRNQQ+PQ+WFPAGNN EDI SKSQ+SAN+SKVVHAESR  SLPSAEEE+EAP
Subjt:  SSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP

Query:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT
        DLNE+PNLN+AS PKDHT+NTVGV VDGFDTNTRA VRKVLSPVRLE IEPS AV+Q+N SEDDH L S+TCDDLCDFAERRHEPEIV CKKTIDAMAAT
Subjt:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT

Query:  EARRRRKELTKLKNLYTRQFRMHS
        EARRRRKELTKLKNLYTRQ RM S
Subjt:  EARRRRKELTKLKNLYTRQFRMHS

XP_038904851.1 uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida]; e value: 3.2e-230; % identity: 84.35
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD
        MASSSKCS+  +CS LSSSSSSS +SS   KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVK E PTS FADSLPS ADLD
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD

Query:  LRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL
        LRIEQ D GVVRHQPSEKECTNQSHPEWETTGE+IKADKEAES K                       AEKEERRVRRILANRESARQTIRRRQALCEEL
Subjt:  LRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQALCEEL

Query:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVP
        T+KAADLAWENENLKREKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNR+SHVQMPPLPTNYPLFL SR+PYFWPSVVQPT+PYH+LPNV+VVP
Subjt:  TRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVP

Query:  SSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP
        SSINLPANNNVSVSGSSHVQENF +VTGPRTPLCI+PPCSWLLPHHDFRNQQ+PQ+WFPAGNN EDI SKSQ+SAN+SKVVHAESR  SLPSAEEE+EAP
Subjt:  SSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP

Query:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT
        DLNE+PNLN+AS PKDHT+NTVGV VDGFDTNTRA VRKVLSPVRLE IEPS AV+Q+N SEDDH L S+TCDDLCDFAERRHEPEIV CKKTIDAMAAT
Subjt:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT

Query:  EARRRRKELTKLKNLYTRQFRMHS
        EARRRRKELTKLKNLYTRQ RM S
Subjt:  EARRRRKELTKLKNLYTRQFRMHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBD4 BZIP domain-containing protein; e value: 1.4e-220; % identity: 81.13
Query:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QP +TKWGIKGKGKRARKEVK ESPTS FADSLP
Subjt:  ASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ
        +RADLDLRIEQ D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQ
Subjt:  SRADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQ

Query:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP
        ALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R+SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELP
Subjt:  ALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELP

Query:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE
        NV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ+WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAE
Subjt:  NVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAE

Query:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI
        EE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA   +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+
Subjt:  EEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTI

Query:  DAMAATEARRRRKELTKLKNLYTRQFRMHS
        DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  DAMAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B649 uncharacterized protein LOC103486593 isoform X3; e value: 4.1e-215; % identity: 79.92
Query:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR
        ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP+R
Subjt:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR

Query:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL
        ADLDLRI  ED GVV+HQPSEKECTNQS  E ETT E+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQAL
Subjt:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL

Query:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV
        CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV
Subjt:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV

Query:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE
        +VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    SAEEE
Subjt:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE

Query:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA
        ++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCKK+IDA
Subjt:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA

Query:  MAATEARRRRKELTKLKNLYTRQFRMHS
        MAATEARRRRKELTK+KNLY RQ RM S
Subjt:  MAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X1; e value: 2.6e-217; % identity: 80.11
Query:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR
        ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP+R
Subjt:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR

Query:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL
        ADLDLRIEQ+D GVV+HQPSEKECTNQS  E ETT E+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQAL
Subjt:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL

Query:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV
        CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV
Subjt:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV

Query:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE
        +VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    SAEEE
Subjt:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE

Query:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA
        ++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCKK+IDA
Subjt:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA

Query:  MAATEARRRRKELTKLKNLYTRQFRMHS
        MAATEARRRRKELTK+KNLY RQ RM S
Subjt:  MAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X2; e value: 8.2e-216; % identity: 80.11
Query:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR
        ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP+R
Subjt:  ASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSR

Query:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL
        ADLDLRIEQ D GVV+HQPSEKECTNQS  E ETT E+ K DKEAES K                       AEKEERR+RRILANRESARQTIRRRQAL
Subjt:  ADLDLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------------AEKEERRVRRILANRESARQTIRRRQAL

Query:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV
        CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YHELPNV
Subjt:  CEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNV

Query:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE
        +VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    SAEEE
Subjt:  LVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEE

Query:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA
        ++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCKK+IDA
Subjt:  HEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDA

Query:  MAATEARRRRKELTKLKNLYTRQFRMHS
        MAATEARRRRKELTK+KNLY RQ RM S
Subjt:  MAATEARRRRKELTKLKNLYTRQFRMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X1; e value: 2.7e-190; % identity: 74.9
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIK-GKGKRARKEVKNESPTSAFADSLPSRADL
        MASSSKCS+  +CSGLSSSS+ S  SSSM   ADQMVKVEIEAAEALA LAVLAVR++G QPSETKW IK  KGKRARKEVK ESPTSAF DSLPSRADL
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIK-GKGKRARKEVKNESPTSAFADSLPSRADL

Query:  DLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------AEKEERRVRRILANRESARQTIRRRQALCEELTRKAA
        DLRI Q+D GV+ H PSEKEC + SHPEWETT EMIKA+KEAES K                 AEKEERR+RR+LANRESARQTIRRRQALCE+LT+KA+
Subjt:  DLRIEQEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFK-----------------AEKEERRVRRILANRESARQTIRRRQALCEELTRKAA

Query:  DLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVP---YFWPSVVQPTSPYHELPNVLVVPSS
        DLAWENENLKREKELALKEYQSLE TNKELKEQ+A A +PK+EEIPGNNR+SHVQ PPLPTNYPLFLFSR P   YFWPSVVQP+SPYH+L NV VVP S
Subjt:  DLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVP---YFWPSVVQPTSPYHELPNVLVVPSS

Query:  INLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSK-VVHAESRHSSLPSAEEEHEAPD
        +  P+NN V VS SSHVQENFTNVTG RTP CIV PCSWLLPHHD RNQQS Q   PAGN  E I S SQNSA +SK VV AESRHSSLPSAEE++EA D
Subjt:  INLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSK-VVHAESRHSSLPSAEEEHEAPD

Query:  LNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATE
        LNE+P+L      K+HT+NTVGV VD F+ +TR  VRKVLSPVRLE IEP+S V+Q+  SEDD GLSSRTCDDLC  AE++HEPEIV CKKTIDAMAATE
Subjt:  LNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATE

Query:  ARRRRKELTKLKNLYTRQFRMH
        ARRRRKELTKLKNL+TR  RMH
Subjt:  ARRRRKELTKLKNLYTRQFRMH

SwissProt top hits (e value, % identity, alignment)
P23922 Transcription factor HBP-1a; e value: 2.5e-04; % identity: 38
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNNRTSHVQMP
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY+ L + N  LK +L E+         P + E    N  SH + P
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA-------VKPKVEEIPGNNRTSHVQMP

Arabidopsis top hits (e value, % identity, alignment)
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein; e value: 4.2e-47; % identity: 34.76
Query:  LSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTS------AFADSLPSRADLDLRIEQEDGG
        +SSS  SSS SSS  +        E+EAAEALA LA LA+       S   WG   KGKR RK VK ESP S        +D+LP+    + R+ +E+  
Subjt:  LSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTS------AFADSLPSRADLDLRIEQEDGG

Query:  VVRHQPSEKECT--------NQSHPEWETTGEMIKADK-------EAESFKAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRE
            +P  KE T        N   P+      +I+  +            +AE+EERR+RRILANRESARQTIRRRQA+CEEL++KAADL +ENENL+RE
Subjt:  VVRHQPSEKECT--------NQSHPEWETTGEMIKADK-------EAESFKAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRE

Query:  KELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPY---FWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVS
        K+ ALKE+QSLET NK LKEQ+ ++VKP  +E   + + S V+M    T  P + +++ PY    WP V Q ++P         + S +  P +   S  
Subjt:  KELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPY---FWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVS

Query:  G-SSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRN------QQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPN
          ++   EN  +  G +T   +V PC W LP  D  N      Q + +  F  G++++D      +SA    V      H      EE+  +P+     +
Subjt:  G-SSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRN------QQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPN

Query:  LNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRK
        LNE++         +    DGF    +A                  +++  + SE  +G++              H   I   +K   ++AA EAR+RRK
Subjt:  LNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRK

Query:  ELTKLKNLYTRQFRM
        ELT+LKNL+ RQ RM
Subjt:  ELTKLKNLYTRQFRM

AT2G35530.1 basic region/leucine zipper transcription factor 16; e value: 7.5e-04; % identity: 40
Query:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNN
        ++E +R RR  +NRESAR++  R+QA C+EL ++A  L  EN NL+ E      + + L T N  LK+QL  ++ P +E I  +N
Subjt:  EKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNN
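
The % identity values in the hit tables summarise whole alignments. As a rough illustration of how such a figure relates to the displayed rows, the Python sketch below counts identities in a single alignment block from its query line and its middle (match) line. It assumes the usual BLAST-style convention that the middle line repeats the residue at identical columns, shows `+` at conservative substitutions, and is blank elsewhere. `midline_stats` is a hypothetical helper, and the example uses only the final short block of the AT1G19490.1 alignment, so its result is not expected to equal the 34.76 reported for the full hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns in one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    columns = len(query)
    # A column is an identity when the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    # Positives are identities plus conservative substitutions marked '+'.
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / columns, 2),
    }


# Final block of the AT1G19490.1 alignment (leading label and padding removed).
stats = midline_stats("ELTKLKNLYTRQFRM", "ELT+LKNL+ RQ RM")
print(stats)
# {'columns': 15, 'identities': 11, 'positives': 13, 'percent_identity': 73.33}
```

To approximate a whole-hit figure, the same counts would need to be summed over every block of that hit before dividing.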


Sequences
CDS sequence
ATGGCTTCTTCTTCCAAGTGCTCCGACGTGGCCACTTGTTCTGGTTTGAGTTCTTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGGATCAGATGGT
CAAGGTTGAGATTGAGGCGGCAGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAAACTGGACCTCAACCGTCCGAAACCAAATGGGGAATTAAAGGGAAAGGGA
AACGAGCTAGGAAGGAGGTTAAGAACGAGTCGCCGACTTCTGCCTTCGCCGACTCTTTACCTAGTCGAGCGGATCTGGACCTTCGGATTGAGCAGGAGGATGGAGGGGTG
GTAAGACATCAGCCATCAGAAAAAGAATGTACTAATCAGTCCCACCCTGAGTGGGAAACAACCGGAGAGATGATAAAGGCGGACAAGGAGGCCGAATCATTTAAAGCTGA
AAAGGAAGAAAGGAGAGTACGACGGATTTTAGCAAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAAGCTGCTGATC
TAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTATCAATCTCTGGAGACTACTAACAAGGAATTAAAAGAACAGTTGGCTGAAGCAGTA
AAGCCGAAGGTGGAGGAGATCCCAGGAAACAATAGAACATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGTGTTCCGTATTTCTG
GCCATCTGTGGTTCAACCTACAAGTCCTTATCATGAACTACCCAATGTCCTCGTCGTCCCGTCAAGTATTAATTTGCCTGCTAATAATAATGTTTCTGTGTCTGGCTCTT
CCCATGTACAAGAAAACTTTACGAACGTCACTGGCCCGAGAACACCCTTGTGTATAGTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGT
CCTCAAGTCTGGTTTCCCGCTGGAAATAATCTAGAGGATATTTGTTCGAAATCCCAAAACAGTGCTAATTCTTCAAAGGTTGTGCATGCAGAAAGCAGACATTCTTCTTT
GCCCTCAGCTGAAGAAGAACACGAAGCTCCTGACTTGAATGAATCTCCTAATTTAAACGAAGCTTCAAATCCAAAGGATCATACTGAGAACACAGTTGGAGTAGCTGTGG
ACGGATTTGATACCAACACAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAACGTATTGAACCCAGTTCCGCTGTCCAACAAAATAACCGGAGCGAAGAT
GATCACGGTCTGTCATCAAGAACTTGTGATGACTTGTGTGATTTTGCAGAAAGAAGGCATGAACCAGAGATAGTCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAAC
TGAGGCCAGGAGGAGGAGAAAAGAACTAACAAAATTAAAGAACCTTTACACCCGTCAGTTCCGTATGCATTCCTGA
mRNA sequence
ATGGCTTCTTCTTCCAAGTGCTCCGACGTGGCCACTTGTTCTGGTTTGAGTTCTTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGGATCAGATGGT
CAAGGTTGAGATTGAGGCGGCAGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAAACTGGACCTCAACCGTCCGAAACCAAATGGGGAATTAAAGGGAAAGGGA
AACGAGCTAGGAAGGAGGTTAAGAACGAGTCGCCGACTTCTGCCTTCGCCGACTCTTTACCTAGTCGAGCGGATCTGGACCTTCGGATTGAGCAGGAGGATGGAGGGGTG
GTAAGACATCAGCCATCAGAAAAAGAATGTACTAATCAGTCCCACCCTGAGTGGGAAACAACCGGAGAGATGATAAAGGCGGACAAGGAGGCCGAATCATTTAAAGCTGA
AAAGGAAGAAAGGAGAGTACGACGGATTTTAGCAAACAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAAGCTGCTGATC
TAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTATCAATCTCTGGAGACTACTAACAAGGAATTAAAAGAACAGTTGGCTGAAGCAGTA
AAGCCGAAGGTGGAGGAGATCCCAGGAAACAATAGAACATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGTGTTCCGTATTTCTG
GCCATCTGTGGTTCAACCTACAAGTCCTTATCATGAACTACCCAATGTCCTCGTCGTCCCGTCAAGTATTAATTTGCCTGCTAATAATAATGTTTCTGTGTCTGGCTCTT
CCCATGTACAAGAAAACTTTACGAACGTCACTGGCCCGAGAACACCCTTGTGTATAGTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGT
CCTCAAGTCTGGTTTCCCGCTGGAAATAATCTAGAGGATATTTGTTCGAAATCCCAAAACAGTGCTAATTCTTCAAAGGTTGTGCATGCAGAAAGCAGACATTCTTCTTT
GCCCTCAGCTGAAGAAGAACACGAAGCTCCTGACTTGAATGAATCTCCTAATTTAAACGAAGCTTCAAATCCAAAGGATCATACTGAGAACACAGTTGGAGTAGCTGTGG
ACGGATTTGATACCAACACAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAACGTATTGAACCCAGTTCCGCTGTCCAACAAAATAACCGGAGCGAAGAT
GATCACGGTCTGTCATCAAGAACTTGTGATGACTTGTGTGATTTTGCAGAAAGAAGGCATGAACCAGAGATAGTCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAAC
TGAGGCCAGGAGGAGGAGAAAAGAACTAACAAAATTAAAGAACCTTTACACCCGTCAGTTCCGTATGCATTCCTGA
Protein sequence
MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEQEDGGV
VRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAV
KPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQS
PQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSED
DHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
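
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal Python example that assumes the standard genetic code (NCBI translation table 1) and reading frame 1. `translate` is a hypothetical helper written for illustration; the two inputs are the first 30 and last 15 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTTCTTCTTCCAAGTGCTCCGACGTG"))  # MASSSKCSDV
print(translate("CGTATGCATTCCTGA"))                 # RMHS
```

Both results agree with the start (`MASSSKCSDV`) and end (`RMHS`) of the protein sequence listed above. Passing the full 1506 nt CDS should give a 501-residue protein if the record is internally consistent.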