CuGenDBv2

CmoCh19G001880 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G001880
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: ethylene-responsive transcription factor-like protein isoform X2
Genome location: Cmo_Chr19:1217835..1221729
RNA-Seq Expression: CmoCh19G001880
Synteny: CmoCh19G001880
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571592.1 Ethylene-responsive transcription factor-like protein, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.4e-93; % identity: 90.5
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDE VADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAI NRKQKRLSPESK+S  L
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL

KAG7011335.1 Ethylene-responsive transcription factor-like protein [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.4e-93; % identity: 91.5
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDE VADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAI NRKQKRLSPES KSEQL
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL

XP_022967326.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita maxima]; e value: 1.9e-88; % identity: 90.37
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLT EDHVYGTDF+SVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQ
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDL EAEKQELRRFNWDEFLAMTRRAITNRK+
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQ

XP_022967327.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucurbita maxima]; e value: 2.1e-95; % identity: 91.5
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLT EDHVYGTDF+SVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDL EAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL

XP_023511398.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 3.2e-91; % identity: 88.5
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLT EDHVYGTDFVSVHP+CSDK NKI +NPVARIEPEPSGVSV+DTSKEKNDEPVADPPVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDL EAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPES KSEQL
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCE7 AP2/ERF domain-containing protein; e value: 3.0e-79; % identity: 78.71
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEK----NDEPVADPPVKRRKRHR
        MVSLRRRKLLGL +              +LT EDHV+ T FV V+P+CSDKVNKI+ENP A IEPE SGVSVLDTSKE+    NDEP+ADPPVKRRKRHR
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEK----NDEPVADPPVKRRKRHR

Query:  RKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKK
        RK F DE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNF+L E EKQELR+FNWDEFLAMTR  ITNRKQKRLSPESKK
Subjt:  RKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKK

Query:  SE
        SE
Subjt:  SE

A0A1S3CCB8 ethylene-responsive transcription factor-like protein At4g13040 isoform X1; e value: 1.8e-76; % identity: 69.87
Query:  MDEFENSTAEFDE------LLLPANQIMVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVL
        M++FEN      E      L+  +  IMVSLRRRKLLGL +              +LT E HV+ T  V V+P+CSD+VNKI+ENP+A IEPE SGVSVL
Subjt:  MDEFENSTAEFDE------LLLPANQIMVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVL

Query:  DTSKEK----NDEPVADPPVKRRKRHRRKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRF
        DTSKE+    N EP+ADPPVKRRKRHRRK F DE FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNF+L E EKQELR+F
Subjt:  DTSKEK----NDEPVADPPVKRRKRHRRKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRF

Query:  NWDEFLAMTRRAITNRKQKRLSPESKKSE
        NWDEFLAMTR  ITNRKQKRL+PESKKSE
Subjt:  NWDEFLAMTRRAITNRKQKRLSPESKKSE

A0A1S3CCT4 ethylene-responsive transcription factor-like protein At4g13040 isoform X2; e value: 5.3e-76; % identity: 73.81
Query:  LLLPANQIMVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEK----NDEPVADPP
        L+  +  IMVSLRRRKLLGL +              +LT E HV+ T  V V+P+CSD+VNKI+ENP+A IEPE SGVSVLDTSKE+    N EP+ADPP
Subjt:  LLLPANQIMVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEK----NDEPVADPP

Query:  VKRRKRHRRKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK
        VKRRKRHRRK F DE FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNF+L E EKQELR+FNWDEFLAMTR  ITNRKQK
Subjt:  VKRRKRHRRKQFLDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK

Query:  RLSPESKKSE
        RL+PESKKSE
Subjt:  RLSPESKKSE

A0A6J1HQI0 ethylene-responsive transcription factor-like protein At4g13040 isoform X2; e value: 1.0e-95; % identity: 91.5
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLT EDHVYGTDF+SVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDL EAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL

A0A6J1HRP0 ethylene-responsive transcription factor-like protein At4g13040 isoform X1; e value: 9.3e-89; % identity: 90.37
Query:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
        MVSLRRRKLLGLCT              DLT EDHVYGTDF+SVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCT--------------DLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQF

Query:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQ
        LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDL EAEKQELRRFNWDEFLAMTRRAITNRK+
Subjt:  LDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQ

SwissProt top hits (e value, % identity, alignment)
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040; e value: 8.8e-36; % identity: 47.6
Query:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK
        MVSLRRR+LLGLC            LT E+ + G               + V+   V    + +      ++     S  S +      +  P   PP K
Subjt:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK

Query:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-
        RRK+HRRK+  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNF+LSE   +EL++ +W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK
        R+  E  K
Subjt:  RLSPESKK

Arabidopsis top hits (e value, % identity, alignment)
AT4G13040.1 Integrase-type DNA-binding superfamily protein; e value: 6.2e-37; % identity: 47.6
Query:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK
        MVSLRRR+LLGLC            LT E+ + G               + V+   V    + +      ++     S  S +      +  P   PP K
Subjt:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK

Query:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-
        RRK+HRRK+  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNF+LSE   +EL++ +W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK
        R+  E  K
Subjt:  RLSPESKK

AT4G13040.2 Integrase-type DNA-binding superfamily protein; e value: 1.8e-36; % identity: 67.83
Query:  VADPPVKRRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAI
        ++D P KRRK+HRRK+  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNF+LSE   +EL++ +W+EFL  TRR I
Subjt:  VADPPVKRRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAI

Query:  TNRKQK-RLSPESKK
        TN+K K R+  E  K
Subjt:  TNRKQK-RLSPESKK

AT4G13040.3 Integrase-type DNA-binding superfamily protein; e value: 6.2e-37; % identity: 47.6
Query:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK
        MVSLRRR+LLGLC            LT E+ + G               + V+   V    + +      ++     S  S +      +  P   PP K
Subjt:  MVSLRRRKLLGLCTD----------LTIEDHVYG--------------TDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVK

Query:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-
        RRK+HRRK+  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNF+LSE   +EL++ +W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKQFLD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK
        R+  E  K
Subjt:  RLSPESKK


Sequences
CDS sequence
ATGGATGAGTTTGAGAATTCCACCGCCGAATTTGATGAGCTGCTATTGCCTGCAAATCAAATTATGGTGAGCTTAAGAAGGCGAAAACTCCTGGGACTTTGCACTGATTT
GACTATTGAAGATCACGTGTACGGTACGGATTTCGTTAGTGTTCATCCCGTCTGCTCAGACAAAGTGAACAAGATAAAAGAGAATCCCGTTGCACGTATAGAGCCTGAAC
CGTCAGGGGTATCTGTTTTGGATACATCAAAAGAGAAAAATGACGAGCCAGTTGCAGACCCGCCCGTCAAGCGTCGAAAGAGACACCGGAGAAAGCAGTTTCTGGACGAA
CCGTTCTTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAAGCTGCTATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTCGGATCACAAGAAGAAGCTGC
TCATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTTGATCTCTCTGAGGCGGAGAAGCAAGAACTGAGACGGTTTAATTGGGACGAGTTTTTAGCCA
TGACCCGTCGTGCGATTACGAATAGAAAACAGAAGAGACTTAGTCCAGAATCAAAGAAGTCTGAACAACTCTAA
mRNA sequence
ATGGATGAGTTTGAGAATTCCACCGCCGAATTTGATGAGCTGCTATTGCCTGCAAATCAAATTATGGTGAGCTTAAGAAGGCGAAAACTCCTGGGACTTTGCACTGATTT
GACTATTGAAGATCACGTGTACGGTACGGATTTCGTTAGTGTTCATCCCGTCTGCTCAGACAAAGTGAACAAGATAAAAGAGAATCCCGTTGCACGTATAGAGCCTGAAC
CGTCAGGGGTATCTGTTTTGGATACATCAAAAGAGAAAAATGACGAGCCAGTTGCAGACCCGCCCGTCAAGCGTCGAAAGAGACACCGGAGAAAGCAGTTTCTGGACGAA
CCGTTCTTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAAGCTGCTATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTCGGATCACAAGAAGAAGCTGC
TCATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTTGATCTCTCTGAGGCGGAGAAGCAAGAACTGAGACGGTTTAATTGGGACGAGTTTTTAGCCA
TGACCCGTCGTGCGATTACGAATAGAAAACAGAAGAGACTTAGTCCAGAATCAAAGAAGTCTGAACAACTCTAA
Protein sequence
MDEFENSTAEFDELLLPANQIMVSLRRRKLLGLCTDLTIEDHVYGTDFVSVHPVCSDKVNKIKENPVARIEPEPSGVSVLDTSKEKNDEPVADPPVKRRKRHRRKQFLDE
PFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFDLSEAEKQELRRFNWDEFLAMTRRAITNRKQKRLSPESKKSEQL