CuGenDBv2

IVF0021079 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0021079
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: ethylene-responsive transcription factor ERF019
Genome location: chr03:306055..306566
RNA-Seq Expression: IVF0021079
Synteny: IVF0021079
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0003677 - DNA binding (molecular function)
    GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
    IPR001471 - AP2/ERF domain
    IPR016177 - DNA-binding domain superfamily
    IPR036955 - AP2/ERF domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAE8652236.1 hypothetical protein Csa_022603 [Cucumis sativus] | e value: 5.18e-97 | % identity: 94.59
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNS GE KKFKGVRQRKWGKWVSEIR+PGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSL DRGSSGRAGAFQASGD+QY+  NNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

XP_004138829.1 ethylene-responsive transcription factor ERF020 [Cucumis sativus] | e value: 3.44e-97 | % identity: 94.59
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNS GE KKFKGVRQRKWGKWVSEIR+PGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSL DRGSSGRAGAFQASGD+QY+  NNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

XP_008441160.1 PREDICTED: ethylene-responsive transcription factor ERF019 [Cucumis melo] | e value: 1.58e-102 | % identity: 100
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

XP_023517176.1 ethylene-responsive transcription factor ERF020-like [Cucurbita pepo subsp. pepo] | e value: 1.40e-81 | % identity: 81.58
Query:  MASSSN-SLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASD
        MAS+SN S  E KK+KGVR+RKWGKWVSEIRVPG+Q+R+WLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQ+AASD
Subjt:  MASSSN-SLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASD

Query:  AAMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNN---DQDLSIQDYL
        AAMAVDAQYICNS  DRGSSG  GAF  SG + YTA N+   D+D+SI+DYL
Subjt:  AAMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNN---DQDLSIQDYL

XP_038886239.1 ethylene-responsive transcription factor ERF020 [Benincasa hispida] | e value: 7.11e-84 | % identity: 85.81
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNSLGE KK+KGVR+RKWGKWVSEIRVPG+Q+RLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQ+AASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSLADRGS+G AG +       YTA  +DQDLS+QDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LPY2 AP2/ERF domain-containing protein | e value: 1.0e-73 | % identity: 94.59
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNS GE KKFKGVRQRKWGKWVSEIR+PGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSL DRGSSGRAGAFQASGD+QY+  NNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

A0A1S3B2T2 ethylene-responsive transcription factor ERF019 | e value: 9.0e-78 | % identity: 100
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

A0A5A7T282 Ethylene-responsive transcription factor ERF019 | e value: 9.0e-78 | % identity: 100
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

A0A6J1FMW6 ethylene-responsive transcription factor ERF020 | e value: 7.6e-61 | % identity: 83.11
Query:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA
        MAS+S S  + KK+KGVR+RKWGKWVSEIRVPGSQ+RLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPP VLPLTNHLLIRDDMSPGSIQ+AASDA
Subjt:  MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDA

Query:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        AMAVDAQYICN+L DRGSSG AG FQ      YTA  NDQDLS++DYL
Subjt:  AMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

A0A6J1HG65 ethylene-responsive transcription factor ERF020-like | e value: 1.5e-61 | % identity: 80.92
Query:  MASSSNS-LGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASD
        MAS+SN+   E KK+KGVR+RKWGKWVSEIRVPG+Q+R+WLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQ+AASD
Subjt:  MASSSNS-LGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASD

Query:  AAMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNN---DQDLSIQDYL
        AAMAVDAQYICNS  DRGSSG  GAF  SG + YTA N+   D+D+SI+DYL
Subjt:  AAMAVDAQYICNSLADRGSSGRAGAFQASGDDQYTAFNN---DQDLSIQDYL

SwissProt top hits (columns: e value, % identity, alignment)
O80542 Ethylene-responsive transcription factor ERF019 | e value: 1.3e-28 | % identity: 51.05
Query:  SLGEIK-KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVD
        S GE + K+KG+R+RKWGKWVSEIRVPG++DRLWLGS+S+ E AAVAHDVA++CL +P +L+ LNFP ++ P    L+ R   SP SIQ+AAS+A MA+D
Subjt:  SLGEIK-KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVD

Query:  AQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        A  I +S +     G    +  +G DQ    N    +S+ DYL
Subjt:  AQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

Q9C9I8 Ethylene-responsive transcription factor ERF020 | e value: 4.1e-27 | % identity: 49.65
Query:  KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLD--HLNFPPMVLPLTNHLL---IRDDMSPGSIQRAASDAAMAVDAQ
        K+KG+R+RKWGKWVSEIRVPG++ RLWLGS+S+ E AAVAHDVA+YCL RPS+LD    NFP        HLL   +  ++SP SIQ+AASDA MAVDA 
Subjt:  KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLD--HLNFPPMVLPLTNHLL---IRDDMSPGSIQRAASDAAMAVDAQ

Query:  YICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        +        G   R+       +D+ +       +S+ DYL
Subjt:  YICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

Q9LPE8 Ethylene-responsive transcription factor ERF014 | e value: 3.1e-19 | % identity: 50.48
Query:  ASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPM---VLPLTNHLLIRDDMSPGSIQRAAS
        +S+S+S   +KK+KGVR R WG WVSEIR P  + R+WLGSYS+ EAAA A+D A  CL + S+ ++LNFP +   +  + N+    +DMSP SIQR A+
Subjt:  ASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPM---VLPLTNHLLIRDDMSPGSIQRAAS

Query:  DAAMA
         AA A
Subjt:  DAAMA

Q9SFE4 Ethylene-responsive transcription factor ERF012 | e value: 2.0e-18 | % identity: 51.46
Query:  SSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLL--IRDD---MSPGSIQRAAS
        + N   +IKK+KGVR R WG WVSEIR P  + R+WLGSYS+ EAAA A+DVA  CL+ P    +LNFP      ++HLL  + D+   +SP SIQR A+
Subjt:  SSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLL--IRDD---MSPGSIQRAAS

Query:  DAA
         AA
Subjt:  DAA

Q9SNE1 Ethylene-responsive transcription factor ERF011 | e value: 5.9e-18 | % identity: 51.58
Query:  KKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVDA
        + FKG+R RKWGKWV+EIR P  + RLWLGSYS+PEAAA A+D A + LR P+    LNFP + LP T+     +DMS  +I++ A++    VDA
Subjt:  KKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVDA

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G21910.1 Integrase-type DNA-binding superfamily protein | e value: 1.4e-19 | % identity: 51.46
Query:  SSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLL--IRDD---MSPGSIQRAAS
        + N   +IKK+KGVR R WG WVSEIR P  + R+WLGSYS+ EAAA A+DVA  CL+ P    +LNFP      ++HLL  + D+   +SP SIQR A+
Subjt:  SSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLL--IRDD---MSPGSIQRAAS

Query:  DAA
         AA
Subjt:  DAA

AT1G22810.1 Integrase-type DNA-binding superfamily protein | e value: 9.0e-30 | % identity: 51.05
Query:  SLGEIK-KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVD
        S GE + K+KG+R+RKWGKWVSEIRVPG++DRLWLGS+S+ E AAVAHDVA++CL +P +L+ LNFP ++ P    L+ R   SP SIQ+AAS+A MA+D
Subjt:  SLGEIK-KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVD

Query:  AQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        A  I +S +     G    +  +G DQ    N    +S+ DYL
Subjt:  AQYICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

AT1G44830.1 Integrase-type DNA-binding superfamily protein | e value: 2.2e-20 | % identity: 50.48
Query:  ASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPM---VLPLTNHLLIRDDMSPGSIQRAAS
        +S+S+S   +KK+KGVR R WG WVSEIR P  + R+WLGSYS+ EAAA A+D A  CL + S+ ++LNFP +   +  + N+    +DMSP SIQR A+
Subjt:  ASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPM---VLPLTNHLLIRDDMSPGSIQRAAS

Query:  DAAMA
         AA A
Subjt:  DAAMA

AT1G71520.1 Integrase-type DNA-binding superfamily protein | e value: 2.9e-28 | % identity: 49.65
Query:  KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLD--HLNFPPMVLPLTNHLL---IRDDMSPGSIQRAASDAAMAVDAQ
        K+KG+R+RKWGKWVSEIRVPG++ RLWLGS+S+ E AAVAHDVA+YCL RPS+LD    NFP        HLL   +  ++SP SIQ+AASDA MAVDA 
Subjt:  KFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLD--HLNFPPMVLPLTNHLL---IRDDMSPGSIQRAASDAAMAVDAQ

Query:  YICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL
        +        G   R+       +D+ +       +S+ DYL
Subjt:  YICNSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL

AT3G50260.1 cooperatively regulated by ethylene and jasmonate 1 | e value: 4.2e-19 | % identity: 51.58
Query:  KKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVDA
        + FKG+R RKWGKWV+EIR P  + RLWLGSYS+PEAAA A+D A + LR P+    LNFP + LP T+     +DMS  +I++ A++    VDA
Subjt:  KKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVDA


Sequences
CDS sequence
ATGGCCTCCAGTTCCAACAGTCTCGGTGAGATAAAGAAATTCAAGGGTGTACGGCAACGTAAGTGGGGGAAGTGGGTGTCAGAGATTCGAGTTCCGGGTAGTCAAGATCG
GCTATGGCTAGGTTCTTACTCATCGCCCGAGGCCGCCGCCGTGGCTCACGACGTTGCATATTATTGCCTAAGAAGACCCTCAAACCTGGACCACCTGAACTTTCCGCCGA
TGGTGCTGCCCTTGACCAACCACCTCCTTATCCGAGACGACATGTCACCTGGCTCCATTCAGAGGGCAGCCTCCGACGCCGCCATGGCTGTCGACGCACAGTATATATGC
AACAGTTTGGCAGACCGAGGCAGTAGTGGCCGTGCAGGGGCATTTCAGGCATCAGGAGATGACCAGTATACGGCTTTCAATAATGATCAAGATCTTTCCATCCAAGATTA
TCTGTAA
mRNA sequence
CTTCCTCAAACTCCACCTTACTCCAGTAGTATACTTTATCCATTACAGTCAATTCATAAAAGGATATGGCCTCCAGTTCCAACAGTCTCGGTGAGATAAAGAAATTCAAG
GGTGTACGGCAACGTAAGTGGGGGAAGTGGGTGTCAGAGATTCGAGTTCCGGGTAGTCAAGATCGGCTATGGCTAGGTTCTTACTCATCGCCCGAGGCCGCCGCCGTGGC
TCACGACGTTGCATATTATTGCCTAAGAAGACCCTCAAACCTGGACCACCTGAACTTTCCGCCGATGGTGCTGCCCTTGACCAACCACCTCCTTATCCGAGACGACATGT
CACCTGGCTCCATTCAGAGGGCAGCCTCCGACGCCGCCATGGCTGTCGACGCACAGTATATATGCAACAGTTTGGCAGACCGAGGCAGTAGTGGCCGTGCAGGGGCATTT
CAGGCATCAGGAGATGACCAGTATACGGCTTTCAATAATGATCAAGATCTTTCCATCCAAGATTATCTGTAA
Protein sequence
MASSSNSLGEIKKFKGVRQRKWGKWVSEIRVPGSQDRLWLGSYSSPEAAAVAHDVAYYCLRRPSNLDHLNFPPMVLPLTNHLLIRDDMSPGSIQRAASDAAMAVDAQYIC
NSLADRGSSGRAGAFQASGDDQYTAFNNDQDLSIQDYL