; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G001880 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G001880
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionLight-inducible protein CPRF2-like
Genome locationchr06:2048834..2053895
RNA-Seq ExpressionLsi06G001880
SyntenyLsi06G001880
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR020983 - Basic leucine-zipper, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135652.1 light-inducible protein CPRF2 [Cucumis sativus]1.6e-13576.47Show/hide
Query:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNR-NGGVEKERKI
        MMDRV+ SVDGISDQFW S   PEESSKLNRSASEWSFRRFLQEAASVSDSS+SPPPASP      SNAV+I ESGE+LKQ  EKQSNR NGG++KERK 
Subjt:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNR-NGGVEKERKI

Query:  CSSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQA-SASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG
               ++  DSEEY+AFLKSKLNLACAAVA+CRGSF K++DSCASST A + S L SQS SKGI CSPCVQK+ GI VSSANISSSREQTDE+DD EG
Subjt:  CSSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQA-SASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG

Query:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK
        ENDMNEQMDPAS KR                            VAELR ENS LLKRFSDISQKY+EAAVNNRVLKADLETLRAKVQMAEETVKRITGTK
Subjt:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK

Query:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        SMFHAMSEVSSIS+QSFEGSPSEISTDA N+HIADISSANIQKNS EMATV RNKMARTAS+RRVASLEHLQKRIRGSSS CHPSGKGD +
Subjt:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

XP_008450742.1 PREDICTED: light-inducible protein CPRF2-like [Cucumis melo]3.1e-13976.96Show/hide
Query:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC
        MMDRV+ SVDGISDQFW S   PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASP      SNAV++ ESGEKLK+ WEKQS RN G  +E ++ 
Subjt:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC

Query:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD
        SS        DSEEY+AFLKSKLNLACAAVALCRGSF K QDSCASST      QASAS L SQS SKGISCSPCVQKK GI VSSANISSSREQTDE+D
Subjt:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD

Query:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI
        D EGENDMNEQMDPAS KR                            VAELR+ENS LLKRFSDISQKY+EAAVNNRVLKADLETLRAKVQMAEETVKRI
Subjt:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI

Query:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        TGTKSMFHAMSEVSSIS+QSFEGSPSEISTDA N+HIADISSANIQKNSPEMATV RNKMARTAS+RRVASLEHLQKRIRG+SS+CHPSGKGD +
Subjt:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

XP_022926013.1 light-inducible protein CPRF2-like isoform X1 [Cucurbita moschata]1.2e-9360.14Show/hide
Query:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK
        MDRV SVDGI+DQF         T PEESSKLNRSASEWSFRRFLQE ASVSDSS S  PPPASPS  +   N     ESGE   QI +KQSNRN     
Subjt:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK

Query:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL
          +ICSS +A +                   SDS +YQAFLKSKLNLACAAVALCRGS+ MK QDSCASS  AS  Q       KGI+ SPCVQKK GIL
Subjt:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL

Query:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA
          +A ISSS EQT  +EDD  EGE+  +E+ DPA  KR                            VAELRVENSALLKRF+DISQKY+E+AVNNRVLKA
Subjt:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA

Query:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI
        DLETL+AKVQMAEETVKRITG+K MF+ M SEV S+SMQSF+GSPS+ S DA +NH ADISSA+ Q N   + ATVS   M +T S+RRVASLE LQKRI
Subjt:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI

Query:  RGSSSSCHPSGKGD
        RGSS  C PSGKGD
Subjt:  RGSSSSCHPSGKGD

XP_023529858.1 light-inducible protein CPRF2-like isoform X1 [Cucurbita pepo subsp. pepo]6.2e-9560.34Show/hide
Query:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK
        MDRV SVDGI+DQF         T PEESSKLNRSASEWSFRRFLQE ASVSDSS S  PPPASPS  +   N     ESGE   QI +KQSNRN     
Subjt:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK

Query:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL
          +ICSS +A +                   SDS +YQA LKSKLNLACAAVALCRGS+ MK QDSCASS  AS        SSKGI+ SPCVQKK GIL
Subjt:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL

Query:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA
         SSA ISSS EQT  +EDD  EGE+  +E+ DPA  KR                            VAELRVENSALLK F+DISQKY+E+AVNNRVLKA
Subjt:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA

Query:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI
        DLETL+AKVQMAEETVKRITG+K MF+ M SE  S+SMQSF+GSPS+ S DA +NH ADISSA+ Q N   + ATVS  KM +T S+RRVASLE LQKRI
Subjt:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI

Query:  RGSSSSCHPSGKGDHK
        RGSS  C PSGKGDH+
Subjt:  RGSSSSCHPSGKGDHK

XP_038877555.1 light-inducible protein CPRF2-like [Benincasa hispida]2.4e-14779.03Show/hide
Query:  MMDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRA
        MMDRV SVDGI +QFW SPEESS+LNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNA++I E+GE LKQ WEKQSNR GG EKERK      
Subjt:  MMDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRA

Query:  AAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG
          ++  DS+EYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST      QASAS L SQ   KGIS SP V KKAGILVSSANISSSREQTDEDDDAEG
Subjt:  AAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG

Query:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK
        ENDMNEQMDPAS KR                            VAELRVENS LLKRF DISQKY+EAAVNNRVLKADLETLRAKVQMAEE VKRITGTK
Subjt:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK

Query:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        SMFHAMS+VSSISMQSFEGSPSE+STDAPNNHIADISSANIQKNSP+MAT SRNKMARTAS++RVASLEHLQKRIRGSSS+CHPSGKGD +
Subjt:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

TrEMBL top hitse value%identityAlignment
A0A0A0M1N8 BZIP domain-containing protein7.8e-13676.47Show/hide
Query:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNR-NGGVEKERKI
        MMDRV+ SVDGISDQFW S   PEESSKLNRSASEWSFRRFLQEAASVSDSS+SPPPASP      SNAV+I ESGE+LKQ  EKQSNR NGG++KERK 
Subjt:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNR-NGGVEKERKI

Query:  CSSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQA-SASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG
               ++  DSEEY+AFLKSKLNLACAAVA+CRGSF K++DSCASST A + S L SQS SKGI CSPCVQK+ GI VSSANISSSREQTDE+DD EG
Subjt:  CSSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQA-SASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEG

Query:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK
        ENDMNEQMDPAS KR                            VAELR ENS LLKRFSDISQKY+EAAVNNRVLKADLETLRAKVQMAEETVKRITGTK
Subjt:  ENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTK

Query:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        SMFHAMSEVSSIS+QSFEGSPSEISTDA N+HIADISSANIQKNS EMATV RNKMARTAS+RRVASLEHLQKRIRGSSS CHPSGKGD +
Subjt:  SMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

A0A1S3BPV6 light-inducible protein CPRF2-like1.5e-13976.96Show/hide
Query:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC
        MMDRV+ SVDGISDQFW S   PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASP      SNAV++ ESGEKLK+ WEKQS RN G  +E ++ 
Subjt:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC

Query:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD
        SS        DSEEY+AFLKSKLNLACAAVALCRGSF K QDSCASST      QASAS L SQS SKGISCSPCVQKK GI VSSANISSSREQTDE+D
Subjt:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD

Query:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI
        D EGENDMNEQMDPAS KR                            VAELR+ENS LLKRFSDISQKY+EAAVNNRVLKADLETLRAKVQMAEETVKRI
Subjt:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI

Query:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        TGTKSMFHAMSEVSSIS+QSFEGSPSEISTDA N+HIADISSANIQKNSPEMATV RNKMARTAS+RRVASLEHLQKRIRG+SS+CHPSGKGD +
Subjt:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

A0A5D3CEH5 Light-inducible protein CPRF2-like1.5e-13976.96Show/hide
Query:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC
        MMDRV+ SVDGISDQFW S   PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASP      SNAV++ ESGEKLK+ WEKQS RN G  +E ++ 
Subjt:  MMDRVI-SVDGISDQFWTS---PEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKIC

Query:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD
        SS        DSEEY+AFLKSKLNLACAAVALCRGSF K QDSCASST      QASAS L SQS SKGISCSPCVQKK GI VSSANISSSREQTDE+D
Subjt:  SSRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASST------QASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDD

Query:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI
        D EGENDMNEQMDPAS KR                            VAELR+ENS LLKRFSDISQKY+EAAVNNRVLKADLETLRAKVQMAEETVKRI
Subjt:  DAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRI

Query:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK
        TGTKSMFHAMSEVSSIS+QSFEGSPSEISTDA N+HIADISSANIQKNSPEMATV RNKMARTAS+RRVASLEHLQKRIRG+SS+CHPSGKGD +
Subjt:  TGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSSCHPSGKGDHK

A0A6J1EGV1 light-inducible protein CPRF2-like isoform X15.7e-9460.14Show/hide
Query:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK
        MDRV SVDGI+DQF         T PEESSKLNRSASEWSFRRFLQE ASVSDSS S  PPPASPS  +   N     ESGE   QI +KQSNRN     
Subjt:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK

Query:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL
          +ICSS +A +                   SDS +YQAFLKSKLNLACAAVALCRGS+ MK QDSCASS  AS  Q       KGI+ SPCVQKK GIL
Subjt:  ERKICSSRAAAA-----------------AASDSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL

Query:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA
          +A ISSS EQT  +EDD  EGE+  +E+ DPA  KR                            VAELRVENSALLKRF+DISQKY+E+AVNNRVLKA
Subjt:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA

Query:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI
        DLETL+AKVQMAEETVKRITG+K MF+ M SEV S+SMQSF+GSPS+ S DA +NH ADISSA+ Q N   + ATVS   M +T S+RRVASLE LQKRI
Subjt:  DLETLRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRI

Query:  RGSSSSCHPSGKGD
        RGSS  C PSGKGD
Subjt:  RGSSSSCHPSGKGD

A0A6J1KSH6 light-inducible protein CPRF2-like isoform X12.0e-9159.95Show/hide
Query:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK
        MDRV SVDGI+DQF         T PEESSKLNRSASEWSFRRFLQE ASVSDSS S  PPPASPS  +   N     ESGE   QI +KQSNRN     
Subjt:  MDRVISVDGISDQFW--------TSPEESSKLNRSASEWSFRRFLQEAASVSDSSIS--PPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEK

Query:  ERKICSSRAAA-------AAAS----------DSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL
          +ICSS +A        AA S          +S +YQAFLKSKLNLACAAVALCRGS+ MK QDSCASS  AS     S   SKGI+ SPCVQKK GIL
Subjt:  ERKICSSRAAA-------AAAS----------DSEEYQAFLKSKLNLACAAVALCRGSF-MKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGIL

Query:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA
         SSA ISSS +QT  +EDD  EGE+  +E  DPA  KR                            VAELRVENSALLKRF+DISQKY+E+A+NNRVLKA
Subjt:  VSSANISSSREQT--DEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKA

Query:  DLETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRIR
        DLETL+AKVQMAEETV+RI G+K MF+   EVSS+SMQSF+GSPS+ S DA +NH ADISSA+ Q N   + ATVS  KM +T S+ RVASLE LQKRIR
Subjt:  DLETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNS-PEMATVSRNKMARTASMRRVASLEHLQKRIR

Query:  GSSSSCHPSGKG
        GSS  C PSGKG
Subjt:  GSSSSCHPSGKG

SwissProt top hitse value%identityAlignment
B9DGI8 Basic leucine zipper 635.0e-3937.84Show/hide
Query:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA
        M++V S + IS     S    + LNRSASEW+F RF+QE+++ +D   S      S +   +  V                                   
Subjt:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA

Query:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ
             DSEEY+AFLKSKLNLACAAVA+ RG+F+K QD+   S    A++  S+ +S   S       KA  ++SSA I+S  E + ++++A+GE +MN  
Subjt:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ

Query:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS
          P +VKR                            V++LRVENS L+K  +D++Q +++A+V NRVLKA++ETLRAKV+MAEETVKR+TG   MFH M 
Subjt:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS

Query:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR
        + VS++S+      PSE S ++P+   + +++  I  +  +   +   KM RTASMRRV SLEHLQKRIR
Subjt:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR

O22763 Basic leucine zipper 105.0e-2331.2Show/hide
Query:  MDRVISVDGISDQFWTSP-----EESSK------LNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVE
        M+ + S+D  SD FW +P      +SSK      +++S  EW+F  FL+E   +S S++S  P   +     +NA+    S + L  +  +    +    
Subjt:  MDRVISVDGISDQFWTSP-----EESSK------LNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVE

Query:  KERKICSSRAAAAAAS-----DSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCAS-STQASASQLTSQSSSK-GISCS-PCVQKKAGILVSSANISSS
        ++R   +   AA   +     DS++Y+  LK+KL   CA V   R   +K +DS +S  TQ    Q +  +  + G++ S P   KK G+ +      SS
Subjt:  KERKICSSRAAAAAAS-----DSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCAS-STQASASQLTSQSSSK-GISCS-PCVQKKAGILVSSANISSS

Query:  REQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQM
        RE +D D+D + EN+    + P  VK+                            V +L+ E+S+LLK+ S+++ KY EAAV NR+LKAD+ETLRAKV+M
Subjt:  REQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQM

Query:  AEETVKRITGTKSMF------HAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANI
        AEETVKR+TG   M       H  +    I+  +   S S I    P++++  +S+ NI
Subjt:  AEETVKRITGTKSMF------HAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANI

Q7X9A8 bZIP transcription factor RISBZ22.2e-3434.9Show/hide
Query:  MDRVISVDGISDQFWTSPEE-----------------------------SSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESG
        M+RV SV+ ISD FW  P                                + +NR  SEW F++FL+EA  V DS +  P     A  IR          
Subjt:  MDRVISVDGISDQFWTSPEE-----------------------------SSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESG

Query:  EKLKQIWEKQSNRNGGV----EKERKICSSRAAAAAAS---DSEEYQAFLKSKLNLACAAVALCR------------GSFMKTQD----SCASSTQASAS
                      GGV     K+ ++ ++ AAAA  S   D  EY A LK KL    AAVA+ R            GS +   D       +S   +A+
Subjt:  EKLKQIWEKQSNRNGGV----EKERKICSSRAAAAAAS---DSEEYQAFLKSKLNLACAAVALCR------------GSFMKTQD----SCASSTQASAS

Query:  QLTSQ-SSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQMDPASVK----------------------------RVAELRVENSAL
         + +  S   G S S  VQ    +LV     SSSREQ+D DDD EGE +      PA  +                            +V++LRVENS+L
Subjt:  QLTSQ-SSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQMDPASVK----------------------------RVAELRVENSAL

Query:  LKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDA-------PNNHIA-------------
        L+R +D++QKY++AAV+NRVLKAD+ETLRAKV+MAE++VKR+TG  ++F A S++SS+SM  F  SPSE ++DA       PNN+ A             
Subjt:  LKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDA-------PNNHIA-------------

Query:  DISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSS
        DI S+  +        ++  K+ RTAS++RVASLEHLQKR+ G  +S
Subjt:  DISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIRGSSSS

Q99090 Light-inducible protein CPRF27.4e-5941.8Show/hide
Query:  MDRVISVDGISDQFWTSP--EESSKL--NRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICS
        MDRV SV+ ISDQFW+ P  E+SSKL  NRS SEW+F+ FLQ+A+++  S   P    P A +++ N V+I                             
Subjt:  MDRVISVDGISDQFWTSP--EESSKL--NRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICS

Query:  SRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCA----SSTQASASQLTSQSSSKG------------------ISCSPCVQKKAGILVS
            A    DSE+YQA+LKS+L+LACAAVAL R S +K QDS A     S  ++ SQL SQ   KG                      P +QKK+ I V 
Subjt:  SRAAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCA----SSTQASASQLTSQSSSKG------------------ISCSPCVQKKAGILVS

Query:  SANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLET
        S    SSR+ +D+DD+ EGE +     DP+  KR                            V++LRVENS+LLKR +DISQ+Y++AAV+NRVLKAD+ET
Subjt:  SANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLET

Query:  LRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTD-------------APNNHIADISSA------------NIQKNSPEMATVSRNK
        +RAKV+MAEETVKR+TG   MF +M SE+S+I MQSF GSPS+ S D             AP +H+                  N+Q++S     V  NK
Subjt:  LRAKVQMAEETVKRITGTKSMFHAM-SEVSSISMQSFEGSPSEISTD-------------APNNHIADISSA------------NIQKNSPEMATVSRNK

Query:  MARTASMRRVASLEHLQKRIRGSSSSCHPSGKG
        M RT+SM+RVASLEHLQKRIRG  SSC     G
Subjt:  MARTASMRRVASLEHLQKRIRGSSSSCHPSGKG

Q9M1G6 Basic leucine zipper 251.0e-2332.03Show/hide
Query:  MDRVISVDGISDQFW------TSPEESSK----------LNRSASEWSFRRFLQE-AASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSN
        M  V SVD +++ FW       SP  SS           + RS SEW+F R + E + S S  + +    SP   +  S   +  +  E + +I + Q++
Subjt:  MDRVISVDGISDQFW------TSPEESSK----------LNRSASEWSFRRFLQE-AASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSN

Query:  RNGGVE---KERKICSSR----AAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASA-----SQLTSQSSSKGISCSPCVQKKAGI
        R   V+   K R    S     ++A    D  +Y A LKSKL LACAAVA   G+      S ++S Q  A     +Q +  +SS   S +   QKK  +
Subjt:  RNGGVE---KERKICSSR----AAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASA-----SQLTSQSSSKGISCSPCVQKKAGI

Query:  LVSSANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKAD
             +ISS  +  D+D D + +N      DP  VKR                            V +LR E+S L+ R SD++ KY  AAV+NR+L+AD
Subjt:  LVSSANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKAD

Query:  LETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPN-NHIADISSANIQKNSPEMATVSRNKMARTAS
        +ETLR KV+MAEETVKR+TG   +  +   +       F  +PS  S+  PN NHI  +  AN   N+   A +++N+   TA+
Subjt:  LETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPN-NHIADISSANIQKNSPEMATVSRNKMARTAS

Arabidopsis top hitse value%identityAlignment
AT3G54620.1 basic leucine zipper 257.2e-2532.03Show/hide
Query:  MDRVISVDGISDQFW------TSPEESSK----------LNRSASEWSFRRFLQE-AASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSN
        M  V SVD +++ FW       SP  SS           + RS SEW+F R + E + S S  + +    SP   +  S   +  +  E + +I + Q++
Subjt:  MDRVISVDGISDQFW------TSPEESSK----------LNRSASEWSFRRFLQE-AASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSN

Query:  RNGGVE---KERKICSSR----AAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASA-----SQLTSQSSSKGISCSPCVQKKAGI
        R   V+   K R    S     ++A    D  +Y A LKSKL LACAAVA   G+      S ++S Q  A     +Q +  +SS   S +   QKK  +
Subjt:  RNGGVE---KERKICSSR----AAAAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASA-----SQLTSQSSSKGISCSPCVQKKAGI

Query:  LVSSANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKAD
             +ISS  +  D+D D + +N      DP  VKR                            V +LR E+S L+ R SD++ KY  AAV+NR+L+AD
Subjt:  LVSSANISSSREQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKAD

Query:  LETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPN-NHIADISSANIQKNSPEMATVSRNKMARTAS
        +ETLR KV+MAEETVKR+TG   +  +   +       F  +PS  S+  PN NHI  +  AN   N+   A +++N+   TA+
Subjt:  LETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPN-NHIADISSANIQKNSPEMATVSRNKMARTAS

AT4G02640.1 bZIP transcription factor family protein3.6e-2431.2Show/hide
Query:  MDRVISVDGISDQFWTSP-----EESSK------LNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVE
        M+ + S+D  SD FW +P      +SSK      +++S  EW+F  FL+E   +S S++S  P   +     +NA+    S + L  +  +    +    
Subjt:  MDRVISVDGISDQFWTSP-----EESSK------LNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVE

Query:  KERKICSSRAAAAAAS-----DSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCAS-STQASASQLTSQSSSK-GISCS-PCVQKKAGILVSSANISSS
        ++R   +   AA   +     DS++Y+  LK+KL   CA V   R   +K +DS +S  TQ    Q +  +  + G++ S P   KK G+ +      SS
Subjt:  KERKICSSRAAAAAAS-----DSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCAS-STQASASQLTSQSSSK-GISCS-PCVQKKAGILVSSANISSS

Query:  REQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQM
        RE +D D+D + EN+    + P  VK+                            V +L+ E+S+LLK+ S+++ KY EAAV NR+LKAD+ETLRAKV+M
Subjt:  REQTDEDDDAEGENDMNEQMDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQM

Query:  AEETVKRITGTKSMF------HAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANI
        AEETVKR+TG   M       H  +    I+  +   S S I    P++++  +S+ NI
Subjt:  AEETVKRITGTKSMF------HAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANI

AT5G28770.1 bZIP transcription factor family protein2.4e-3636.49Show/hide
Query:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA
        M++V S + IS     S    + LNRSASEW+F RF+QE+++ +D   S      S +   +  V                                   
Subjt:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA

Query:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ
             DSEEY+AFLKSKLNLACAAVA+ R +  ++ +       A+ S+  S +SS           KA  ++SSA I+S  E + ++++A+GE +MN  
Subjt:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ

Query:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS
          P +VKR                            V++LRVENS L+K  +D++Q +++A+V NRVLKA++ETLRAKV+MAEETVKR+TG   MFH M 
Subjt:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS

Query:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR
        + VS++S+      PSE S ++P+   + +++  I  +  +   +   KM RTASMRRV SLEHLQKRIR
Subjt:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR

AT5G28770.2 bZIP transcription factor family protein3.6e-4037.84Show/hide
Query:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA
        M++V S + IS     S    + LNRSASEW+F RF+QE+++ +D   S      S +   +  V                                   
Subjt:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA

Query:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ
             DSEEY+AFLKSKLNLACAAVA+ RG+F+K QD+   S    A++  S+ +S   S       KA  ++SSA I+S  E + ++++A+GE +MN  
Subjt:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ

Query:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS
          P +VKR                            V++LRVENS L+K  +D++Q +++A+V NRVLKA++ETLRAKV+MAEETVKR+TG   MFH M 
Subjt:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMS

Query:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR
        + VS++S+      PSE S ++P+   + +++  I  +  +   +   KM RTASMRRV SLEHLQKRIR
Subjt:  E-VSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRRVASLEHLQKRIR

AT5G28770.3 bZIP transcription factor family protein6.1e-2434.64Show/hide
Query:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA
        M++V S + IS     S    + LNRSASEW+F RF+QE+++ +D   S      S +   +  V                                   
Subjt:  MDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAA

Query:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ
             DSEEY+AFLKSKLNLACAAVA+ RG+F+K QD+   S    A++  S+ +S   S       KA  ++SSA I+S  E + ++++A+GE +MN  
Subjt:  AAAASDSEEYQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQ

Query:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQ
          P +VKR                            V++LRVENS L+K  +D++Q +++A+V NRVLKA++ETLRAKV+
Subjt:  MDPASVKR----------------------------VAELRVENSALLKRFSDISQKYSEAAVNNRVLKADLETLRAKVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGATAGAGTAATCTCCGTCGACGGAATCTCCGATCAATTCTGGACGTCGCCGGAGGAATCGTCGAAGTTGAACCGAAGCGCATCCGAGTGGTCTTTCCGGAGATT
TCTTCAGGAAGCTGCCTCCGTATCGGATTCCTCTATTTCTCCGCCTCCGGCTTCGCCTTCCGCGGCGGAAATTCGATCTAACGCCGTCAAGATTACGGAATCTGGAGAGA
AATTGAAGCAGATTTGGGAGAAACAGAGTAATCGGAATGGCGGAGTTGAGAAGGAAAGGAAGATATGCAGTAGTAGGGCGGCGGCGGCGGCGGCGTCGGATTCCGAGGAG
TATCAGGCGTTTCTGAAAAGCAAGCTGAATCTGGCGTGTGCGGCTGTGGCCTTGTGTCGAGGATCTTTTATGAAGACTCAAGATTCTTGTGCAAGCTCCACTCAAGCCAG
TGCCTCACAATTAACATCTCAATCCTCTTCAAAAGGAATTTCTTGCTCACCTTGTGTGCAAAAAAAGGCTGGAATTCTGGTCAGCTCAGCAAATATATCATCATCAAGAG
AGCAGACTGATGAAGATGATGATGCTGAAGGAGAAAACGATATGAATGAGCAAATGGATCCAGCATCCGTTAAACGTGTTGCTGAATTAAGAGTTGAAAATTCAGCATTG
CTGAAGCGTTTCAGTGATATAAGCCAAAAGTACAGTGAAGCAGCTGTTAATAACAGAGTTCTGAAAGCTGACCTTGAAACTTTGAGAGCAAAGGTACAGATGGCTGAGGA
AACCGTAAAGCGAATCACTGGTACGAAATCTATGTTCCATGCCATGTCGGAAGTATCCTCAATTAGCATGCAATCCTTTGAGGGGAGCCCCTCAGAGATATCAACAGATG
CACCTAACAATCATATTGCAGACATTTCTTCTGCAAATATTCAGAAGAATTCTCCAGAAATGGCAACTGTGTCGAGGAACAAGATGGCAAGAACAGCTTCCATGCGGCGA
GTGGCAAGCTTGGAGCATCTTCAGAAGCGCATACGGGGGAGTTCAAGCTCCTGTCATCCATCAGGAAAGGGAGATCACAAGTAA
mRNA sequenceShow/hide mRNA sequence
AAAATTTTGCCCAAATTTCAAAGTTTATATATATGAGGGGCAGGTAGAGAAACGTAGAGGAAGTCATAAGATTCAGAGATGATGGATAGAGTAATCTCCGTCGACGGAAT
CTCCGATCAATTCTGGACGTCGCCGGAGGAATCGTCGAAGTTGAACCGAAGCGCATCCGAGTGGTCTTTCCGGAGATTTCTTCAGGAAGCTGCCTCCGTATCGGATTCCT
CTATTTCTCCGCCTCCGGCTTCGCCTTCCGCGGCGGAAATTCGATCTAACGCCGTCAAGATTACGGAATCTGGAGAGAAATTGAAGCAGATTTGGGAGAAACAGAGTAAT
CGGAATGGCGGAGTTGAGAAGGAAAGGAAGATATGCAGTAGTAGGGCGGCGGCGGCGGCGGCGTCGGATTCCGAGGAGTATCAGGCGTTTCTGAAAAGCAAGCTGAATCT
GGCGTGTGCGGCTGTGGCCTTGTGTCGAGGATCTTTTATGAAGACTCAAGATTCTTGTGCAAGCTCCACTCAAGCCAGTGCCTCACAATTAACATCTCAATCCTCTTCAA
AAGGAATTTCTTGCTCACCTTGTGTGCAAAAAAAGGCTGGAATTCTGGTCAGCTCAGCAAATATATCATCATCAAGAGAGCAGACTGATGAAGATGATGATGCTGAAGGA
GAAAACGATATGAATGAGCAAATGGATCCAGCATCCGTTAAACGTGTTGCTGAATTAAGAGTTGAAAATTCAGCATTGCTGAAGCGTTTCAGTGATATAAGCCAAAAGTA
CAGTGAAGCAGCTGTTAATAACAGAGTTCTGAAAGCTGACCTTGAAACTTTGAGAGCAAAGGTACAGATGGCTGAGGAAACCGTAAAGCGAATCACTGGTACGAAATCTA
TGTTCCATGCCATGTCGGAAGTATCCTCAATTAGCATGCAATCCTTTGAGGGGAGCCCCTCAGAGATATCAACAGATGCACCTAACAATCATATTGCAGACATTTCTTCT
GCAAATATTCAGAAGAATTCTCCAGAAATGGCAACTGTGTCGAGGAACAAGATGGCAAGAACAGCTTCCATGCGGCGAGTGGCAAGCTTGGAGCATCTTCAGAAGCGCAT
ACGGGGGAGTTCAAGCTCCTGTCATCCATCAGGAAAGGGAGATCACAAGTAACAGTTGTGATATATTGTTCCAAATGTATGAAAGAGATCCTGTAATATTCTTTAAATGA
ACAGGTTTCCCATTTTAAATGAAACTTACTTTAAGTATGAGATGACATTTGGAGAGTGGTTTACTAGTTTTTCTATTTAAGCACATTGGTACAGTATTTCCCGT
Protein sequenceShow/hide protein sequence
MMDRVISVDGISDQFWTSPEESSKLNRSASEWSFRRFLQEAASVSDSSISPPPASPSAAEIRSNAVKITESGEKLKQIWEKQSNRNGGVEKERKICSSRAAAAAASDSEE
YQAFLKSKLNLACAAVALCRGSFMKTQDSCASSTQASASQLTSQSSSKGISCSPCVQKKAGILVSSANISSSREQTDEDDDAEGENDMNEQMDPASVKRVAELRVENSAL
LKRFSDISQKYSEAAVNNRVLKADLETLRAKVQMAEETVKRITGTKSMFHAMSEVSSISMQSFEGSPSEISTDAPNNHIADISSANIQKNSPEMATVSRNKMARTASMRR
VASLEHLQKRIRGSSSSCHPSGKGDHK