; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003383 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003383
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiondehydration-responsive element-binding protein 2C
Genome locationscaffold234:2473557..2474675
RNA-Seq ExpressionMS003383
SyntenyMS003383
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016025.1 Dehydration-responsive element-binding protein 2C, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-14666.51Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M L  QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE ++SCND  KPIRKAPAKGSKKGCMKGKGGPLNS C YRGVRQRTWGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA EAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSSTATSTCS ST TTSNQSEVCVP EF L P    SNIK
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDA------------------------------------------------------------DAKNGDQS
         EDGEGE RT D + H  TPM LEK VKHED                                                             DAK+ DQ 
Subjt:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDA------------------------------------------------------------DAKNGDQS

Query:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC
         N+Q+  SG  IP  D+LQNFQMDE+FDV+ELLGLIN D LYDPSIL+GN +G N+  PSQVG+ G EK S+L YQFQNPDAKLLGSLQH +Q+P+D D 
Subjt:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC

Query:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
         FDFLKQGREED+NAAAD+Y+RYLNYE  D+GF
Subjt:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

XP_008441482.1 PREDICTED: dehydration-responsive element-binding protein 2C isoform X1 [Cucumis melo]6.0e-14969.29Show/hide
Query:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
        QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE  +S NDGGK IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
Subjt:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR

Query:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE
        LWLGTFPTAIEAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSST TSTCSESTTTTSNQSEVCVPEEFT+RPQLV  N+K+EDGE
Subjt:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE

Query:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS
        GE RT D  D  ATPM LE QVKHED                           + + G Q+                         S++Q+ +S  GI S
Subjt:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS

Query:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM
          +LQNF+MDE+FDVEELL LI+SD L+DP +IL+GNA+G  +MVPSQVG++G EKP N  YQ QNPDAKLLGS Q  E+TPAD D GFDFLKQGREED+
Subjt:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM

Query:  NAAADDYVRYLNYETGDLGF
        NAAADD VRYLN E GDLGF
Subjt:  NAAADDYVRYLNYETGDLGF

XP_022134534.1 dehydration-responsive element-binding protein 2C [Momordica charantia]2.2e-19999.15Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
        RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA

Query:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA
        LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA
Subjt:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA

Query:  TPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQ
        TPM LEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNM+PSQVGNVGFEKPSNLLYQFQ
Subjt:  TPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQ

Query:  NPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
        NPDAKLLGSLQHTEQTPADFD GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
Subjt:  NPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

XP_023549809.1 dehydration-responsive element-binding protein 2C-like isoform X1 [Cucurbita pepo subsp. pepo]6.4e-15167.9Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M L  QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE ++SCND  KPIRKAPAKGSKKGCMKGKGGPLNS C YRGVRQRTWGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTAIEAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSSTATSTCS ST TTSNQSEVCVPEEFTLR   + SN+K
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDADAKN----------------------------------GDQS--------------------------
         EDGEGE RT D + HI T M LE  VKHED +AK                                   GDQ+                          
Subjt:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDADAKN----------------------------------GDQS--------------------------

Query:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC
         N+Q+  +G  IPS D+LQNFQMDE+FDV+ELLGLINSD LYDPSIL+GN +G N+  PSQVG+ G EKPS+L YQFQNPDAKLLGSLQH EQ+PAD D 
Subjt:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC

Query:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
         FDFLKQGREEDMNAAAD+Y+RYLNYE  D+GF
Subjt:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

XP_038886635.1 dehydration-responsive element-binding protein 2C-like isoform X1 [Benincasa hispida]4.1e-15068.63Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M + + YS+++SLP +S+RKRKSRSRRDRSTVAETLAKWKAYNE  +SCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA+EAALAYDEAAR MYG SARLNLPNI NRGQL+GILL+DYL LR SDSSTATSTCSESTTT SNQSEVCVPEEFT+RP+LV  NIK
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDAD--------------------------AKNGDQSS-------------------------NEQSFISG
         EDGEGE RT D +D  ATPM LE QVKHED +                           + G+Q++                         N+Q+ +S 
Subjt:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDAD--------------------------AKNGDQSS-------------------------NEQSFISG

Query:  IGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGR
         GI   ++L NFQMDE+FDVEELL LI+SD LYDPSIL+GNA+G  NM PSQV N G EKPS+  YQFQNPD KLLGSLQ TE   AD D GFDFLKQGR
Subjt:  IGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGR

Query:  EEDMNAAADDYVRYLNYETGDLGF
        EED+NAAADD VRYLN E GDLGF
Subjt:  EEDMNAAADDYVRYLNYETGDLGF

TrEMBL top hitse value%identityAlignment
A0A1S3B339 dehydration-responsive element-binding protein 2C isoform X21.4e-14369.29Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
        RKRKSRSRRDRSTVAETLAKWKAYNE  +S NDGGK IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA

Query:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA
        LAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSST TSTCSESTTTTSNQSEVCVPEEFT+RPQLV  N+K+EDGEGE RT D  D  A
Subjt:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA

Query:  TPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPSWDELQNFQMDEVF
        TPM LE QVKHED                           + + G Q+                         S++Q+ +S  GI S  +LQNF+MDE+F
Subjt:  TPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPSWDELQNFQMDEVF

Query:  DVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNY
        DVEELL LI+SD L+DP +IL+GNA+G  +MVPSQVG++G EKP N  YQ QNPDAKLLGS Q  E+TPAD D GFDFLKQGREED+NAAADD VRYLN 
Subjt:  DVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNY

Query:  ETGDLGF
        E GDLGF
Subjt:  ETGDLGF

A0A1S3B4B1 dehydration-responsive element-binding protein 2C isoform X12.9e-14969.29Show/hide
Query:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
        QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE  +S NDGGK IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
Subjt:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR

Query:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE
        LWLGTFPTAIEAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSST TSTCSESTTTTSNQSEVCVPEEFT+RPQLV  N+K+EDGE
Subjt:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE

Query:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS
        GE RT D  D  ATPM LE QVKHED                           + + G Q+                         S++Q+ +S  GI S
Subjt:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS

Query:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM
          +LQNF+MDE+FDVEELL LI+SD L+DP +IL+GNA+G  +MVPSQVG++G EKP N  YQ QNPDAKLLGS Q  E+TPAD D GFDFLKQGREED+
Subjt:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM

Query:  NAAADDYVRYLNYETGDLGF
        NAAADD VRYLN E GDLGF
Subjt:  NAAADDYVRYLNYETGDLGF

A0A5A7UIC1 Dehydration-responsive element-binding protein 2C isoform X12.9e-14969.29Show/hide
Query:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
        QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE  +S NDGGK IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR
Subjt:  QYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSR

Query:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE
        LWLGTFPTAIEAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSST TSTCSESTTTTSNQSEVCVPEEFT+RPQLV  N+K+EDGE
Subjt:  LWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGE

Query:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS
        GE RT D  D  ATPM LE QVKHED                           + + G Q+                         S++Q+ +S  GI S
Subjt:  GELRTSDRADHIATPMSLEKQVKHEDA--------------------------DAKNGDQS-------------------------SNEQSFISGIGIPS

Query:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM
          +LQNF+MDE+FDVEELL LI+SD L+DP +IL+GNA+G  +MVPSQVG++G EKP N  YQ QNPDAKLLGS Q  E+TPAD D GFDFLKQGREED+
Subjt:  WDELQNFQMDEVFDVEELLGLINSDPLYDP-SILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDM

Query:  NAAADDYVRYLNYETGDLGF
        NAAADD VRYLN E GDLGF
Subjt:  NAAADDYVRYLNYETGDLGF

A0A6J1BZ01 dehydration-responsive element-binding protein 2C1.0e-19999.15Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
        RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA

Query:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA
        LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA
Subjt:  LAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIA

Query:  TPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQ
        TPM LEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNM+PSQVGNVGFEKPSNLLYQFQ
Subjt:  TPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQ

Query:  NPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
        NPDAKLLGSLQHTEQTPADFD GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
Subjt:  NPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

A0A6J1FL95 dehydration-responsive element-binding protein 2C-like isoform X12.3e-14666.51Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M L  QYS+A+SLP +SIRKRKSRSRRDRSTVAETLAKWKAYNE ++SCND  KPIRKAPAKGSKKGCMKGKGGPLNS C YRGVRQRTWGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA EAALAYDEAAR MYG +ARLNLPNI NRGQL+GILLEDYL LR SDSSTATSTCS ST TTSNQSEVCVP EF L P    SNIK
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDA------------------------------------------------------------DAKNGDQS
         EDGEGE RT D + H  TPM LEK VKHED                                                             DAK+ DQ 
Subjt:  TEDGEGELRTSDRADHIATPMSLEKQVKHEDA------------------------------------------------------------DAKNGDQS

Query:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC
         N+Q+  SG  IP  D+LQNFQMDE+FDV+ELLGLIN D LYDPSIL+GN +G N+  PSQV + G EK S+L YQFQNPDAKLLGSLQH EQ+P+D D 
Subjt:  SNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDC

Query:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
         FDFLKQGREED+NAAAD+Y+RYLNYE  D+GF
Subjt:  GFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

SwissProt top hitse value%identityAlignment
O82132 Dehydration-responsive element-binding protein 2A7.3e-4940.95Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M++  Q  +     +D+ RKRKSRSR D +TVAE L +WK YNE  E  +      RK PAKGSKKGCMKGKGGP NS C++RGVRQR WGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA EAA AYDEAA+AMYGP ARLN P                     SD+S  TS        TS+QSEVC  E         C ++K
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TED--------------------GEGELRTSDRAD---------HIATPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEE
        TED                    G  E++   +AD         +  + +  EK+ + E    +   Q   +   ++  G P+  +  +    ++FDV+E
Subjt:  TED--------------------GEGELRTSDRAD---------HIATPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEE

Query:  LLGLINSDPLY---DPSILEGN--ANGSNNMVPSQVG
        LL  +N D ++   +     GN  ANGS      Q G
Subjt:  LLGLINSDPLY---DPSILEGN--ANGSNNMVPSQVG

O82133 Dehydration-responsive element-binding protein 2B5.6e-4140.74Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIE
        +KRKSR+R    TVA+ L KWK YNE  E  +  +G KP RK PAKGSKKGCMKGKGGP NSHC++RGVRQR WGKWVAEIREP  G+RLWLGTFPTA +
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIE

Query:  AALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQ-----SEVCVPEEFT---LRPQLVCSNIKTED-----
        AA AYDEAA AMYG  ARLN P                     S  S  TST S+S   T         +VCV  E T     P     +++ E      
Subjt:  AALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQ-----SEVCVPEEFT---LRPQLVCSNIKTED-----

Query:  -----GEGELRTSDRAD--------HIATPMSLEKQVKHEDADAKNGDQSSNEQSF------ISGIGIPSWDELQNFQM----DEVFDVEELLGLIN
             G  ++ +S   D        +    +  +++ K E+ + +   Q   +Q        ++  G P  +++ N Q     +E FD+ ELLG +N
Subjt:  -----GEGELRTSDRAD--------HIATPMSLEKQVKHEDADAKNGDQSSNEQSF------ISGIGIPSWDELQNFQM----DEVFDVEELLGLIN

Q5W6R4 Dehydration-responsive element-binding protein 2B1.2e-4337.95Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA
        +KR  RSR   ++VAET+ +W   N   E    G K  RKAPAKGSKKGCMKGKGGP N+ C++RGVRQRTWGKWVAEIREPN+ SRLWLGTFPTA  AA
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIEAA

Query:  LAYDEAARAMYGPSARLNL-----PNITNRGQLKGI--------LLEDYLRLRTSDSST--------ATSTCSESTTTTSNQSEVC----VPEEFT-LRP
         AYDEAARAMYGP AR N      P  + +  L  +        L     R  T  +S           S C  +TTT +NQS+V      PEE + +  
Subjt:  LAYDEAARAMYGPSARLNL-----PNITNRGQLKGI--------LLEDYLRLRTSDSST--------ATSTCSESTTTTSNQSEVC----VPEEFT-LRP

Query:  QLVCSNIKTEDGEGELRTSDRADHIATPMSLEKQVK-HEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNA
         L       EDG  E    D+A+ +    ++  Q +   +A+A NG     E+ F     I S  E Q    D  FD++E+L ++ +DP  +  + +G+ 
Subjt:  QLVCSNIKTEDGEGELRTSDRADHIATPMSLEKQVK-HEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNA

Query:  NGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREED
        +GS+ ++      +G ++P    + ++  D  +L +L  +++          F+  G E+D
Subjt:  NGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREED

Q8LFR2 Dehydration-responsive element-binding protein 2C2.3e-5541.99Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNE--FSESCNDGGKP--IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA
        RKRKSR  RD   VAE L +W+ YNE   +ESC DGG P  IRK P KGS+KGCMKGKGGP N  C+YRGVRQR WGKWVAEIREP+ G+RLWLGTF ++
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNE--FSESCNDGGKP--IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA

Query:  IEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRA
         EAALAYDEAA+A+YG SARLNLP ITNR                S S+ AT+T S S T  S++SEVC  E+           +K ED   E    D +
Subjt:  IEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRA

Query:  DHIATPMSLEKQVKHEDADAKN---GDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPS
          I   +  +++V+ E   A     G  S  E      +G  +  E   F +DE FD+ ELLG++N          + N +G   M          ++  
Subjt:  DHIATPMSLEKQVKHEDADAKN---GDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPS

Query:  NLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
        N  YQ Q P++ LLGSL   E      D G  +++    E+      D+ R+ + +  DL F
Subjt:  NLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

Q9SIZ0 Putative dehydration-responsive element-binding protein 2H1.1e-4171.77Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGG--KPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA
        RKRKSR  RD   VAE L KW+ YNE +E  SC DGG  KPIRKAP K S+KGCMKGKGGP N  C+Y GVRQRTWGKWVAEIREP RG++LWLGTF ++
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGG--KPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA

Query:  IEAALAYDEAARAMYGPSARLNLP
         EAALAYDEA++A+YG SARLNLP
Subjt:  IEAALAYDEAARAMYGPSARLNLP

Arabidopsis top hitse value%identityAlignment
AT2G40340.1 Integrase-type DNA-binding superfamily protein1.7e-5641.99Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNE--FSESCNDGGKP--IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA
        RKRKSR  RD   VAE L +W+ YNE   +ESC DGG P  IRK P KGS+KGCMKGKGGP N  C+YRGVRQR WGKWVAEIREP+ G+RLWLGTF ++
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNE--FSESCNDGGKP--IRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA

Query:  IEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRA
         EAALAYDEAA+A+YG SARLNLP ITNR                S S+ AT+T S S T  S++SEVC  E+           +K ED   E    D +
Subjt:  IEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRA

Query:  DHIATPMSLEKQVKHEDADAKN---GDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPS
          I   +  +++V+ E   A     G  S  E      +G  +  E   F +DE FD+ ELLG++N          + N +G   M          ++  
Subjt:  DHIATPMSLEKQVKHEDADAKN---GDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPS

Query:  NLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF
        N  YQ Q P++ LLGSL   E      D G  +++    E+      D+ R+ + +  DL F
Subjt:  NLLYQFQNPDAKLLGSLQHTEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF

AT2G40350.1 Integrase-type DNA-binding superfamily protein8.0e-4371.77Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGG--KPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA
        RKRKSR  RD   VAE L KW+ YNE +E  SC DGG  KPIRKAP K S+KGCMKGKGGP N  C+Y GVRQRTWGKWVAEIREP RG++LWLGTF ++
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGG--KPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTA

Query:  IEAALAYDEAARAMYGPSARLNLP
         EAALAYDEA++A+YG SARLNLP
Subjt:  IEAALAYDEAARAMYGPSARLNLP

AT3G11020.1 DRE/CRT-binding protein 2B4.0e-4240.74Show/hide
Query:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIE
        +KRKSR+R    TVA+ L KWK YNE  E  +  +G KP RK PAKGSKKGCMKGKGGP NSHC++RGVRQR WGKWVAEIREP  G+RLWLGTFPTA +
Subjt:  RKRKSRSRRDRSTVAETLAKWKAYNEFSE--SCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGTFPTAIE

Query:  AALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQ-----SEVCVPEEFT---LRPQLVCSNIKTED-----
        AA AYDEAA AMYG  ARLN P                     S  S  TST S+S   T         +VCV  E T     P     +++ E      
Subjt:  AALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQ-----SEVCVPEEFT---LRPQLVCSNIKTED-----

Query:  -----GEGELRTSDRAD--------HIATPMSLEKQVKHEDADAKNGDQSSNEQSF------ISGIGIPSWDELQNFQM----DEVFDVEELLGLIN
             G  ++ +S   D        +    +  +++ K E+ + +   Q   +Q        ++  G P  +++ N Q     +E FD+ ELLG +N
Subjt:  -----GEGELRTSDRAD--------HIATPMSLEKQVKHEDADAKNGDQSSNEQSF------ISGIGIPSWDELQNFQM----DEVFDVEELLGLIN

AT5G05410.1 DRE-binding protein 2A5.2e-5040.95Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M++  Q  +     +D+ RKRKSRSR D +TVAE L +WK YNE  E  +      RK PAKGSKKGCMKGKGGP NS C++RGVRQR WGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA EAA AYDEAA+AMYGP ARLN P                     SD+S  TS        TS+QSEVC  E         C ++K
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TED--------------------GEGELRTSDRAD---------HIATPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEE
        TED                    G  E++   +AD         +  + +  EK+ + E    +   Q   +   ++  G P+  +  +    ++FDV+E
Subjt:  TED--------------------GEGELRTSDRAD---------HIATPMSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEE

Query:  LLGLINSDPLY---DPSILEGN--ANGSNNMVPSQVG
        LL  +N D ++   +     GN  ANGS      Q G
Subjt:  LLGLINSDPLY---DPSILEGN--ANGSNNMVPSQVG

AT5G05410.2 DRE-binding protein 2A4.9e-4854.59Show/hide
Query:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP
        M++  Q  +     +D+ RKRKSRSR D +TVAE L +WK YNE  E  +      RK PAKGSKKGCMKGKGGP NS C++RGVRQR WGKWVAEIREP
Subjt:  MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREP

Query:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK
        NRGSRLWLGTFPTA EAA AYDEAA+AMYGP ARLN P                     SD+S  TS        TS+QSEVC  E         C ++K
Subjt:  NRGSRLWLGTFPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIK

Query:  TEDGEGE
        TED + E
Subjt:  TEDGEGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTTTGTCATCAATATTCTGAGGCCGTTTCTTTACCTCTTGATTCTATCAGGAAGAGGAAGTCACGGAGTAGACGAGACCGATCGACCGTGGCTGAGACTCTTGC
TAAGTGGAAAGCTTATAATGAATTTTCTGAGTCTTGTAACGATGGGGGCAAGCCAATTCGTAAAGCTCCTGCAAAAGGGTCTAAGAAGGGATGTATGAAAGGCAAGGGAG
GGCCCTTAAATTCACATTGCAATTACAGAGGTGTGAGGCAGAGGACATGGGGAAAATGGGTTGCGGAGATTCGTGAACCGAACAGGGGAAGCAGGCTGTGGCTCGGTACC
TTCCCCACGGCCATTGAAGCCGCTCTAGCCTATGACGAGGCTGCCCGGGCGATGTATGGCCCGTCTGCTCGCCTTAACCTTCCCAACATTACTAACAGAGGGCAGCTCAA
AGGGATTCTTTTAGAGGATTATTTGAGGCTGCGAACCTCAGATTCTTCAACTGCTACTTCGACATGTTCTGAATCGACAACGACGACATCGAACCAATCTGAGGTTTGCG
TACCTGAGGAGTTTACATTAAGGCCGCAACTTGTCTGCTCAAACATTAAGACTGAAGATGGAGAAGGCGAGTTGAGAACCAGTGATCGGGCCGATCACATTGCAACGCCA
ATGAGCTTAGAGAAGCAAGTGAAGCACGAAGATGCTGATGCTAAAAATGGTGACCAAAGCAGCAATGAACAATCATTCATTTCAGGAATTGGGATTCCTAGTTGGGATGA
GTTACAGAACTTTCAAATGGATGAAGTGTTTGATGTGGAGGAGCTGCTAGGCCTGATAAACAGCGATCCCTTGTATGATCCAAGCATTTTGGAGGGCAATGCGAATGGTT
CTAATAACATGGTGCCTTCCCAAGTTGGCAATGTTGGGTTTGAGAAGCCATCAAACTTGTTATATCAGTTCCAAAATCCAGACGCAAAGCTGCTCGGAAGTCTCCAACAC
ACGGAGCAAACTCCGGCCGATTTTGACTGTGGATTCGATTTCTTGAAGCAGGGGAGGGAGGAGGATATGAATGCTGCAGCAGATGATTATGTAAGATACTTGAATTATGA
GACGGGCGATTTGGGTTTC
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTTTGTCATCAATATTCTGAGGCCGTTTCTTTACCTCTTGATTCTATCAGGAAGAGGAAGTCACGGAGTAGACGAGACCGATCGACCGTGGCTGAGACTCTTGC
TAAGTGGAAAGCTTATAATGAATTTTCTGAGTCTTGTAACGATGGGGGCAAGCCAATTCGTAAAGCTCCTGCAAAAGGGTCTAAGAAGGGATGTATGAAAGGCAAGGGAG
GGCCCTTAAATTCACATTGCAATTACAGAGGTGTGAGGCAGAGGACATGGGGAAAATGGGTTGCGGAGATTCGTGAACCGAACAGGGGAAGCAGGCTGTGGCTCGGTACC
TTCCCCACGGCCATTGAAGCCGCTCTAGCCTATGACGAGGCTGCCCGGGCGATGTATGGCCCGTCTGCTCGCCTTAACCTTCCCAACATTACTAACAGAGGGCAGCTCAA
AGGGATTCTTTTAGAGGATTATTTGAGGCTGCGAACCTCAGATTCTTCAACTGCTACTTCGACATGTTCTGAATCGACAACGACGACATCGAACCAATCTGAGGTTTGCG
TACCTGAGGAGTTTACATTAAGGCCGCAACTTGTCTGCTCAAACATTAAGACTGAAGATGGAGAAGGCGAGTTGAGAACCAGTGATCGGGCCGATCACATTGCAACGCCA
ATGAGCTTAGAGAAGCAAGTGAAGCACGAAGATGCTGATGCTAAAAATGGTGACCAAAGCAGCAATGAACAATCATTCATTTCAGGAATTGGGATTCCTAGTTGGGATGA
GTTACAGAACTTTCAAATGGATGAAGTGTTTGATGTGGAGGAGCTGCTAGGCCTGATAAACAGCGATCCCTTGTATGATCCAAGCATTTTGGAGGGCAATGCGAATGGTT
CTAATAACATGGTGCCTTCCCAAGTTGGCAATGTTGGGTTTGAGAAGCCATCAAACTTGTTATATCAGTTCCAAAATCCAGACGCAAAGCTGCTCGGAAGTCTCCAACAC
ACGGAGCAAACTCCGGCCGATTTTGACTGTGGATTCGATTTCTTGAAGCAGGGGAGGGAGGAGGATATGAATGCTGCAGCAGATGATTATGTAAGATACTTGAATTATGA
GACGGGCGATTTGGGTTTC
Protein sequenceShow/hide protein sequence
MSLCHQYSEAVSLPLDSIRKRKSRSRRDRSTVAETLAKWKAYNEFSESCNDGGKPIRKAPAKGSKKGCMKGKGGPLNSHCNYRGVRQRTWGKWVAEIREPNRGSRLWLGT
FPTAIEAALAYDEAARAMYGPSARLNLPNITNRGQLKGILLEDYLRLRTSDSSTATSTCSESTTTTSNQSEVCVPEEFTLRPQLVCSNIKTEDGEGELRTSDRADHIATP
MSLEKQVKHEDADAKNGDQSSNEQSFISGIGIPSWDELQNFQMDEVFDVEELLGLINSDPLYDPSILEGNANGSNNMVPSQVGNVGFEKPSNLLYQFQNPDAKLLGSLQH
TEQTPADFDCGFDFLKQGREEDMNAAADDYVRYLNYETGDLGF