CuGenDBv2

Moc05g00860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g00860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: ethylene-responsive transcription factor ERF098-like
Genome location: chr5:582091..582543
RNA-Seq Expression: Moc05g00860
Synteny: Moc05g00860
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily
IPR044808 - Ethylene-responsive transcription factor


Homology
GenBank top hits | e value | % identity | Alignment
KAG6582380.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia] | 2.3e-51 | 74.83
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--
        ME+SRK K+Q KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTYDTAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPH VFP+  
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--

Query:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
          NRG  T   G  GSSSS   + +  QV V+EY+DG++LDDLL QEE+KK
Subjt:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

KAG7018789.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.7e-52 | 75.5
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--
        ME+SRKGK+Q KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTYDTAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPH VFP+  
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--

Query:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
          NRG  T   G  GSSSS   + +  QV V+EY+DG++LDDLL QEE+KK
Subjt:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

XP_008438535.1 PREDICTED: ethylene-responsive transcription factor ERF098-like [Cucumis melo] | 3.0e-51 | 72.48
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT
        ME++RKGKEQ K G+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHL    P 
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT

Query:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
        N G    GG GGGSS+S +   + ++V V EYVDG++L+DLL QE++KK
Subjt:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

XP_023526049.1 ethylene-responsive transcription factor ERF098-like [Cucurbita pepo subsp. pepo] | 3.6e-52 | 74.67
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV---FP
        ME+SRKGK+Q KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTYDTAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPH      P
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV---FP

Query:  TNRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
         NRG  T   G  GSSSS   + +  QV V+EY+DG++LDDLL QEE+KK
Subjt:  TNRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

XP_038877370.1 ethylene-responsive transcription factor ERF098-like [Benincasa hispida] | 2.9e-54 | 75
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT
        ME+SRKGKEQ KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPHL    P 
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT

Query:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK
        NR  G + GGGG SSS    + + +QV V+EY+DG++LDDLL QEE+KK  K
Subjt:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L4F1 AP2/ERF domain-containing protein | 2.4e-46 | 73.13
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR
        ME+ RKGKEQ K G+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPP+L   T+ 
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR

Query:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGK
          G   GG GG SS++  D +K  V V EYVDG+
Subjt:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGK

A0A1S3AX97 ethylene-responsive transcription factor ERF098-like | 1.5e-51 | 72.48
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT
        ME++RKGKEQ K G+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHL    P 
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT

Query:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
        N G    GG GGGSS+S +   + ++V V EYVDG++L+DLL QE++KK
Subjt:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

A0A5D3D3D4 Ethylene-responsive transcription factor ERF098-like protein | 4.3e-51 | 71.14
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT
        ME++RKGKEQ K G+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHL    P 
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV--FPT

Query:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
        N G  + G GGG S+S    + + ++V V EYVDG++L+DLL QE++KK
Subjt:  NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

A0A6J1E9Y3 ethylene-responsive transcription factor ERF098-like | 2.5e-51 | 74.17
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--
        ME+SRK K+Q KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTYDTAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPH VFP+  
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPT--

Query:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
          NRG  T+     GSSSS   + +  QV V+EY+DG++LDDLL QEE+KK
Subjt:  --NRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

A0A6J1ISH5 ethylene-responsive transcription factor ERF098-like | 2.5e-51 | 73.33
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV---FP
        ME+SRKGK+Q KQG+D IKYRGVR+RPWGKYAAEIRDPSKNGARQWLGTYDTAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPH      P
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLV---FP

Query:  TNRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
         NRG  T   G  GSSSS   + +  QV V+EY+D ++L+DLL QEE+KK
Subjt:  TNRGPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

SwissProt top hits | e value | % identity | Alignment
P93822 Ethylene-responsive transcription factor 14 | 5.0e-25 | 48.53
Query:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS
        G +  KYRGVR+RPWGKYAAEIRD  K+G R WLGT+DTAE+AARAYDR A+ ++G  A LNFP EY                  N G G+       SS
Subjt:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS

Query:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNT
        SS       +QVF  EY+D  +LD+LLE  E    T
Subjt:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNT

Q8L9K1 Ethylene-responsive transcription factor 13 | 4.4e-21 | 56.82
Query:  EDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGP
        +  ++YRGVR+RPWGK+AAEIRDP KNGAR WLGTY+T E+AA AYDR AF L+G  A LNFP      ++GS  + P  + P  R P
Subjt:  EDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGP

Q9LSX0 Ethylene-responsive transcription factor ERF096 | 7.8e-26 | 48.18
Query:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS
        G +  KYRGVR+RPWGKYAAEIRD  K+G R WLGT+DTAEEAARAYD+ A+ ++G  A LNFP EY    MGS             G  +     G SS
Subjt:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS

Query:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK
        +S    ++ +QVF  EY+D  +L++LLE+ E+    K
Subjt:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK

Q9LTC5 Ethylene-responsive transcription factor ERF098 | 1.8e-35 | 50.34
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR
        ME+S +      Q +   ++RGVR+RPWGK+AAEIRDPS+NGAR WLGT++TAEEAARAYDR AF+L+GHLA LNFP+EYY R+      PP+       
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR

Query:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
           +      GS+S+ ++  N+++VF  EY+D K+L++LL+ EERK+
Subjt:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

Q9LTC6 Ethylene-responsive transcription factor ERF095 | 2.7e-26 | 50
Query:  IKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSSSSTI
        +KYRGVRKRPWGKYAAEIRD +++GAR WLGT++TAE+AARAYDR AF ++G  A LNFP EY  ++M   P+  H     +   G +GGGGG       
Subjt:  IKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSSSSTI

Query:  NDANKKQVFVIEYVDGKILDDLLEQEER
             ++V   EY+D  +L++LL+  ER
Subjt:  NDANKKQVFVIEYVDGKILDDLLEQEER

Arabidopsis top hits | e value | % identity | Alignment
AT1G04370.1 Ethylene-responsive element binding factor 14 | 3.6e-26 | 48.53
Query:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS
        G +  KYRGVR+RPWGKYAAEIRD  K+G R WLGT+DTAE+AARAYDR A+ ++G  A LNFP EY                  N G G+       SS
Subjt:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS

Query:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNT
        SS       +QVF  EY+D  +LD+LLE  E    T
Subjt:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNT

AT2G44840.1 ethylene-responsive element binding factor 13 | 3.1e-22 | 56.82
Query:  EDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGP
        +  ++YRGVR+RPWGK+AAEIRDP KNGAR WLGTY+T E+AA AYDR AF L+G  A LNFP      ++GS  + P  + P  R P
Subjt:  EDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGP

AT3G23220.1 Integrase-type DNA-binding superfamily protein | 1.9e-27 | 50
Query:  IKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSSSSTI
        +KYRGVRKRPWGKYAAEIRD +++GAR WLGT++TAE+AARAYDR AF ++G  A LNFP EY  ++M   P+  H     +   G +GGGGG       
Subjt:  IKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSSSSTI

Query:  NDANKKQVFVIEYVDGKILDDLLEQEER
             ++V   EY+D  +L++LL+  ER
Subjt:  NDANKKQVFVIEYVDGKILDDLLEQEER

AT3G23230.1 Integrase-type DNA-binding superfamily protein | 1.3e-36 | 50.34
Query:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR
        ME+S +      Q +   ++RGVR+RPWGK+AAEIRDPS+NGAR WLGT++TAEEAARAYDR AF+L+GHLA LNFP+EYY R+      PP+       
Subjt:  MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNR

Query:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK
           +      GS+S+ ++  N+++VF  EY+D K+L++LL+ EERK+
Subjt:  GPGTQGGGGGGSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKK

AT5G43410.1 Integrase-type DNA-binding superfamily protein | 5.5e-27 | 48.18
Query:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS
        G +  KYRGVR+RPWGKYAAEIRD  K+G R WLGT+DTAEEAARAYD+ A+ ++G  A LNFP EY    MGS             G  +     G SS
Subjt:  GEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGGGSS

Query:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK
        +S    ++ +QVF  EY+D  +L++LLE+ E+    K
Subjt:  SSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK


Sequences
CDS sequence
ATGGAAAATTCTCGCAAGGGTAAGGAACAGTTGAAGCAAGGTGAGGATGCGATCAAGTACCGAGGGGTGCGGAAGCGGCCGTGGGGGAAATATGCGGCGGAGATTCGAGA
CCCGTCGAAGAACGGGGCGAGGCAATGGCTCGGGACGTATGACACGGCCGAGGAAGCGGCGAGGGCTTACGATCGGATGGCATTTGATTTGAAAGGTCATTTGGCTAGTC
TGAATTTCCCTAGTGAATATTATGCTCGTGTCATGGGTTCTCCTCCTCATCCTCCTCACCTTGTTTTTCCCACGAACCGAGGTCCCGGGACTCAAGGCGGTGGCGGTGGC
GGTAGCTCGTCTTCTACTATCAACGATGCCAATAAAAAGCAAGTTTTCGTGATTGAATATGTGGACGGCAAAATTTTGGATGATCTTCTCGAGCAAGAAGAGAGGAAGAA
GAATACCAAATAA
mRNA sequence
ATGGAAAATTCTCGCAAGGGTAAGGAACAGTTGAAGCAAGGTGAGGATGCGATCAAGTACCGAGGGGTGCGGAAGCGGCCGTGGGGGAAATATGCGGCGGAGATTCGAGA
CCCGTCGAAGAACGGGGCGAGGCAATGGCTCGGGACGTATGACACGGCCGAGGAAGCGGCGAGGGCTTACGATCGGATGGCATTTGATTTGAAAGGTCATTTGGCTAGTC
TGAATTTCCCTAGTGAATATTATGCTCGTGTCATGGGTTCTCCTCCTCATCCTCCTCACCTTGTTTTTCCCACGAACCGAGGTCCCGGGACTCAAGGCGGTGGCGGTGGC
GGTAGCTCGTCTTCTACTATCAACGATGCCAATAAAAAGCAAGTTTTCGTGATTGAATATGTGGACGGCAAAATTTTGGATGATCTTCTCGAGCAAGAAGAGAGGAAGAA
GAATACCAAATAA
Protein sequence
MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGG
GSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK
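
The CDS, protein sequence and genome location listed above can be cross-checked against each other. The following is a minimal Python sketch, not part of the database record: it translates the CDS with the standard genetic code and compares the result with the listed protein, and it compares the CDS length with the span of the genome location. The helper names `translate` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption about how the location is reported.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# CDS and protein copied from the record for Moc05g00860.
CDS = (
    "ATGGAAAATTCTCGCAAGGGTAAGGAACAGTTGAAGCAAGGTGAGGATGCGATCAAGTACCGAGGGGTGCGGAAGCGGCCGTGGGGGAAATATGCGGCGGAGATTCGAGA"
    "CCCGTCGAAGAACGGGGCGAGGCAATGGCTCGGGACGTATGACACGGCCGAGGAAGCGGCGAGGGCTTACGATCGGATGGCATTTGATTTGAAAGGTCATTTGGCTAGTC"
    "TGAATTTCCCTAGTGAATATTATGCTCGTGTCATGGGTTCTCCTCCTCATCCTCCTCACCTTGTTTTTCCCACGAACCGAGGTCCCGGGACTCAAGGCGGTGGCGGTGGC"
    "GGTAGCTCGTCTTCTACTATCAACGATGCCAATAAAAAGCAAGTTTTCGTGATTGAATATGTGGACGGCAAAATTTTGGATGATCTTCTCGAGCAAGAAGAGAGGAAGAA"
    "GAATACCAAATAA"
)
PROTEIN = (
    "MENSRKGKEQLKQGEDAIKYRGVRKRPWGKYAAEIRDPSKNGARQWLGTYDTAEEAARAYDRMAFDLKGHLASLNFPSEYYARVMGSPPHPPHLVFPTNRGPGTQGGGGG"
    "GSSSSTINDANKKQVFVIEYVDGKILDDLLEQEERKKNTK"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; stop codons are rendered as '*'."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    chrom, span = location.split(":")
    start, end = (int(x) for x in span.split(".."))
    return chrom, start, end


chrom, start, end = parse_location("chr5:582091..582543")
span_length = end - start + 1  # assumes 1-based, inclusive coordinates

print("CDS length:", len(CDS), "| genomic span:", span_length)
print("Translation matches listed protein:", translate(CDS) == PROTEIN + "*")
```

If the genomic span and the CDS length agree, that is consistent with a single-exon gene model, which would also explain why the CDS and mRNA sequences listed above are identical.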