; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G004372 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G004372
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionethylene-responsive transcription factor ERF098-like
Genome locationCG_Chr05:4062435..4062887
RNA-Seq ExpressionClCG05G004372
SyntenyClCG05G004372
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily
IPR044808 - Ethylene-responsive transcription factor


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049258.1 ethylene-responsive transcription factor ERF098-like protein [Cucumis melo var. makuwa]7.9e-6888.67Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED+RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHLFPS PI
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
        N GFES   GGGSS+SN  PQ+VIVFE VDG+VLEDLLAQEDKKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

XP_004134412.1 ethylene-responsive transcription factor ERF098 [Cucumis sativus]1.4e-6486Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPP+LF S  I
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
        N GF+SG  GGGSS+SN  P +VIVFE VDG+VLEDLLAQEDKKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

XP_008438535.1 PREDICTED: ethylene-responsive transcription factor ERF098-like [Cucumis melo]1.6e-6889.33Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED+RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHLFPS PI
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
        N GFESG  GGGSS+SN  PQ+VIVFE VDG+VLEDLLAQEDKKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

XP_023526049.1 ethylene-responsive transcription factor ERF098-like [Cucurbita pepo subsp. pepo]1.4e-6486Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP
        MEDSRKGK+QQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYDRMAFHLKGHLASLNFP EYYARVMGSPPHPPH FP SAP
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP

Query:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS
        INRGF++  G  GSSSSN  P QVIV E +DG+VL+DLLAQE+KKKKKNS
Subjt:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS

XP_038877370.1 ethylene-responsive transcription factor ERF098-like [Benincasa hispida]3.9e-6789.4Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYDRMAF LKGHLASLNFP EYYARVMGSPPHPPHLFPSAPI
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQED-KKKKKNSK
        NRGFES  GGGGSSSSN  P+QVIV E +DG+VL+DLL QE+ KKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQED-KKKKKNSK

TrEMBL top hitse value%identityAlignment
A0A0A0L4F1 AP2/ERF domain-containing protein9.8e-5684.09Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPP+LF S  I
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGK
        N GF+SG  GGGSS+SN  P +VIVFE VDG+
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGK

A0A1S3AX97 ethylene-responsive transcription factor ERF098-like7.7e-6989.33Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED+RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHLFPS PI
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
        N GFESG  GGGSS+SN  PQ+VIVFE VDG+VLEDLLAQEDKKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

A0A5D3D3D4 Ethylene-responsive transcription factor ERF098-like protein3.8e-6888.67Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        MED+RKGKEQQK GDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAE+AARAYD+ AF LKGHLASLNFPSEYYARVMGSPPHPPHLFPS PI
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
        N GFES   GGGSS+SN  PQ+VIVFE VDG+VLEDLLAQEDKKKKKNSK
Subjt:  NRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

A0A6J1E9Y3 ethylene-responsive transcription factor ERF098-like3.7e-6384.67Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP
        MEDSRK K+QQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYDRMAFHLKGHLASLNFP EYYARVMGSPPHPPH+FP SAP
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP

Query:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS
        INRGF++     GSSSSN  P QVIV E +DG+VL+DLLAQE+KKKKKNS
Subjt:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS

A0A6J1ISH5 ethylene-responsive transcription factor ERF098-like1.5e-6486Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP
        MEDSRKGK+QQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTY+TAE+AARAYDRMAFHLKGHLASLNFP EYYARVMGSPPHPPH FP SAP
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFP-SAP

Query:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS
        INRGF++  G  GSSSSN  P QVIV E +D +VLEDLLAQE+KKKKKNS
Subjt:  INRGFESGVGGGGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNS

SwissProt top hitse value%identityAlignment
P93822 Ethylene-responsive transcription factor 145.2e-2248.09Show/hide
Query:  KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSNA
        KYRGVRRRPWGKYAAEIRD  K+G R WLGT++TAE+AARAYDR A+ ++G  A LNFP EY                    N G  S      SSSS+ 
Subjt:  KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSNA

Query:  SPQQVIVFECVDGKVLEDLLAQEDKKKKKNS
          QQV  FE +D  VL++LL   +   K ++
Subjt:  SPQQVIVFECVDGKVLEDLLAQEDKKKKKNS

Q8L9K1 Ethylene-responsive transcription factor 135.8e-2177.05Show/hide
Query:  GIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFP
        G++YRGVRRRPWGK+AAEIRDP KNGAR WLGTYET E+AA AYDR AF L+G  A LNFP
Subjt:  GIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFP

Q9LSX0 Ethylene-responsive transcription factor ERF0961.9e-2448.98Show/hide
Query:  QGDDGI-----KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESG
        QG  G+     KYRGVRRRPWGKYAAEIRD  K+G R WLGT++TAEEAARAYD+ A+ ++G  A LNFP EY    MGS               G  S 
Subjt:  QGDDGI-----KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESG

Query:  VGGGGSSSSNA----SPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
            GSSS++A    S +QV  FE +D  VLE+LL + +K  K   K
Subjt:  VGGGGSSSSNA----SPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK

Q9LTC5 Ethylene-responsive transcription factor ERF0985.9e-3454.05Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        ME S +    Q Q D   ++RGVRRRPWGK+AAEIRDPS+NGAR WLGT+ETAEEAARAYDR AF+L+GHLA LNFP+EYY R+      PP+       
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQ---QVIVFECVDGKVLEDLLAQEDKKK
             S     GS+S+N S Q   +V  FE +D KVLE+LL  E++K+
Subjt:  NRGFESGVGGGGSSSSNASPQ---QVIVFECVDGKVLEDLLAQEDKKK

Q9LTC6 Ethylene-responsive transcription factor ERF0951.5e-2448.85Show/hide
Query:  IKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSN
        +KYRGVR+RPWGKYAAEIRD +++GAR WLGT+ TAE+AARAYDR AF ++G  A LNFP EY  ++M   P+  H    A  + G+    GGGG     
Subjt:  IKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSN

Query:  ASPQQVIVFECVDGKVLEDLLAQEDKKKKKN
           ++VI FE +D  +LE+LL   ++  + N
Subjt:  ASPQQVIVFECVDGKVLEDLLAQEDKKKKKN

Arabidopsis top hitse value%identityAlignment
AT1G04370.1 Ethylene-responsive element binding factor 143.7e-2348.09Show/hide
Query:  KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSNA
        KYRGVRRRPWGKYAAEIRD  K+G R WLGT++TAE+AARAYDR A+ ++G  A LNFP EY                    N G  S      SSSS+ 
Subjt:  KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSNA

Query:  SPQQVIVFECVDGKVLEDLLAQEDKKKKKNS
          QQV  FE +D  VL++LL   +   K ++
Subjt:  SPQQVIVFECVDGKVLEDLLAQEDKKKKKNS

AT2G44840.1 ethylene-responsive element binding factor 134.1e-2277.05Show/hide
Query:  GIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFP
        G++YRGVRRRPWGK+AAEIRDP KNGAR WLGTYET E+AA AYDR AF L+G  A LNFP
Subjt:  GIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFP

AT3G23220.1 Integrase-type DNA-binding superfamily protein1.0e-2548.85Show/hide
Query:  IKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSN
        +KYRGVR+RPWGKYAAEIRD +++GAR WLGT+ TAE+AARAYDR AF ++G  A LNFP EY  ++M   P+  H    A  + G+    GGGG     
Subjt:  IKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGGGGSSSSN

Query:  ASPQQVIVFECVDGKVLEDLLAQEDKKKKKN
           ++VI FE +D  +LE+LL   ++  + N
Subjt:  ASPQQVIVFECVDGKVLEDLLAQEDKKKKKN

AT3G23230.1 Integrase-type DNA-binding superfamily protein4.2e-3554.05Show/hide
Query:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI
        ME S +    Q Q D   ++RGVRRRPWGK+AAEIRDPS+NGAR WLGT+ETAEEAARAYDR AF+L+GHLA LNFP+EYY R+      PP+       
Subjt:  MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPI

Query:  NRGFESGVGGGGSSSSNASPQ---QVIVFECVDGKVLEDLLAQEDKKK
             S     GS+S+N S Q   +V  FE +D KVLE+LL  E++K+
Subjt:  NRGFESGVGGGGSSSSNASPQ---QVIVFECVDGKVLEDLLAQEDKKK

AT5G43410.1 Integrase-type DNA-binding superfamily protein1.4e-2548.98Show/hide
Query:  QGDDGI-----KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESG
        QG  G+     KYRGVRRRPWGKYAAEIRD  K+G R WLGT++TAEEAARAYD+ A+ ++G  A LNFP EY    MGS               G  S 
Subjt:  QGDDGI-----KYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESG

Query:  VGGGGSSSSNA----SPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK
            GSSS++A    S +QV  FE +D  VLE+LL + +K  K   K
Subjt:  VGGGGSSSSNA----SPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGATTCTCGCAAGGGTAAGGAACAACAAAAGCAAGGTGACGATGGGATCAAGTACCGGGGAGTGCGACGGCGACCGTGGGGGAAGTACGCAGCGGAGATACGTGA
TCCATCGAAGAATGGGGCGAGACAATGGCTTGGGACCTACGAAACGGCGGAGGAAGCGGCTAGGGCTTACGATCGGATGGCATTCCATTTGAAAGGTCATCTTGCTAGTT
TGAATTTCCCTAGTGAATATTATGCTCGTGTCATGGGTTCGCCTCCTCATCCTCCTCACTTGTTTCCTTCTGCTCCGATCAATCGGGGTTTCGAGAGTGGTGTCGGTGGT
GGTGGATCATCGTCTTCCAACGCCAGTCCACAACAAGTTATTGTGTTTGAGTGTGTGGATGGCAAAGTTTTGGAAGACCTTCTAGCTCAGGAGGACAAAAAGAAGAAGAA
GAATAGCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGATTCTCGCAAGGGTAAGGAACAACAAAAGCAAGGTGACGATGGGATCAAGTACCGGGGAGTGCGACGGCGACCGTGGGGGAAGTACGCAGCGGAGATACGTGA
TCCATCGAAGAATGGGGCGAGACAATGGCTTGGGACCTACGAAACGGCGGAGGAAGCGGCTAGGGCTTACGATCGGATGGCATTCCATTTGAAAGGTCATCTTGCTAGTT
TGAATTTCCCTAGTGAATATTATGCTCGTGTCATGGGTTCGCCTCCTCATCCTCCTCACTTGTTTCCTTCTGCTCCGATCAATCGGGGTTTCGAGAGTGGTGTCGGTGGT
GGTGGATCATCGTCTTCCAACGCCAGTCCACAACAAGTTATTGTGTTTGAGTGTGTGGATGGCAAAGTTTTGGAAGACCTTCTAGCTCAGGAGGACAAAAAGAAGAAGAA
GAATAGCAAATAA
Protein sequenceShow/hide protein sequence
MEDSRKGKEQQKQGDDGIKYRGVRRRPWGKYAAEIRDPSKNGARQWLGTYETAEEAARAYDRMAFHLKGHLASLNFPSEYYARVMGSPPHPPHLFPSAPINRGFESGVGG
GGSSSSNASPQQVIVFECVDGKVLEDLLAQEDKKKKKNSK