; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0017559 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0017559
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionDSBA domain-containing protein
Genome locationchr07:10175028..10178041
RNA-Seq ExpressionIVF0017559
SyntenyIVF0017559
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR001853 - DSBA-like thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048695.1 hypothetical protein E6C27_scaffold4358G00140 [Cucumis melo var. makuwa]3.49e-158100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

KAE8649326.1 hypothetical protein Csa_014666 [Cucumis sativus]4.92e-15298.62Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

TYK14915.1 hypothetical protein E5676_scaffold1623G00040 [Cucumis melo var. makuwa]3.49e-158100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

XP_004137140.1 uncharacterized protein LOC101208448 [Cucumis sativus]4.30e-15798.62Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

XP_008464570.1 PREDICTED: uncharacterized protein YwbO isoform X1 [Cucumis melo]6.36e-159100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

TrEMBL top hitse value%identityAlignment
A0A0A0KZG9 DSBA domain-containing protein1.1e-12198.62Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A1S3CLT4 uncharacterized protein YwbO isoform X14.6e-123100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A1S3CMA4 uncharacterized protein YwbO isoform X29.4e-11696.33Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGD        RKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A5A7U4Z3 DSBA domain-containing protein4.6e-123100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A5D3CTQ6 DSBA domain-containing protein4.6e-123100Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

SwissProt top hitse value%identityAlignment
P39598 Uncharacterized protein YwbO3.0e-1026.96Show/hide
Query:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPS-APKEGVVK--TEYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDSH
        + I + SD VCP+CFVGK   ++AI       D E+ W PF+L PS +P+   V   ++ Y  +  IQ       MAE    LG++ +   ++ +     
Subjt:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPS-APKEGVVK--TEYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDSH

Query:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV
                +   K H+    +   +F + + IGD D L + A + G++GA+          ++V+ +  K++     I+ VP ++I G   + GA   +V
Subjt:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV

Query:  FLRA
        F +A
Subjt:  FLRA

Arabidopsis top hitse value%identityAlignment
AT5G38900.1 Thioredoxin superfamily protein1.1e-8167.13Show/hide
Query:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ES  S    KKLIQID+SSD+VCPWCFVGKKNLDKAI AS+DQY+FE+ W PF L+PSAPKEGV K E+Y  K+G + + M ARM+EVF+GLGL++DT
Subjt:  MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        +GLTGN+LDSH+LI+  G+Q   KQH LVEEL +GYFTQGK+IGDR+FL+E A K G+EGA EFL   +NGV EVKEEL KYS  I+GVP Y INGK KL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVA
        SGAQPPE F  AF+ A
Subjt:  SGAQPPEVFLRAFQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAGTCAGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATCGATATAAGCTCCGACACGGTTTGCCCGTGGTGTTTTGTTGGCAAAAAAAATCTTGACAA
AGCAATTTCTGCCTCTCAGGATCAATATGATTTTGAGTTGAATTGGCATCCTTTCCAGCTCAACCCTTCTGCTCCCAAAGAAGGTGTTGTTAAGACAGAATATTACAGGA
GTAAGTTTGGAATTCAATCTGAACAGATGGAAGCTCGGATGGCAGAGGTGTTTAGGGGTCTTGGACTGGACTATGACACTTCTGGGCTGACGGGAAATACTCTAGACAGC
CATAAGCTTATATATTTGGCTGGCCAACAAGGCTTAGGCAAACAACATGATCTTGTGGAGGAGTTATGCCTTGGATACTTCACTCAGGGAAAATACATTGGTGACAGAGA
TTTTCTTTTGGAATGTGCTAGAAAGGCAGGGGTGGAAGGAGCAGCAGAGTTTCTCGAGACCGCTGATAATGGAGTTAAGGAGGTCAAGGAGGAGCTTGAGAAGTACTCGG
GTAAAATTTCAGGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAATTGAGTGGTGCTCAACCCCCTGAGGTTTTTCTAAGAGCTTTTCAAGTGGCAGGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
AGACGCTAAAAATATCGCCATTCACCGTATTCAACTGCCACTGCGGTTCGTTTCTCCCGGCAGACTCCCAATTCCATCCTCTTAATTATTCCAGGGAGAGTACAAATTCA
ACAGGATCATGACTGAGTCAGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATCGATATAAGCTCCGACACGGTTTGCCCGTGGTGTTTTGTTGGCAAAAAAAAT
CTTGACAAAGCAATTTCTGCCTCTCAGGATCAATATGATTTTGAGTTGAATTGGCATCCTTTCCAGCTCAACCCTTCTGCTCCCAAAGAAGGTGTTGTTAAGACAGAATA
TTACAGGAGTAAGTTTGGAATTCAATCTGAACAGATGGAAGCTCGGATGGCAGAGGTGTTTAGGGGTCTTGGACTGGACTATGACACTTCTGGGCTGACGGGAAATACTC
TAGACAGCCATAAGCTTATATATTTGGCTGGCCAACAAGGCTTAGGCAAACAACATGATCTTGTGGAGGAGTTATGCCTTGGATACTTCACTCAGGGAAAATACATTGGT
GACAGAGATTTTCTTTTGGAATGTGCTAGAAAGGCAGGGGTGGAAGGAGCAGCAGAGTTTCTCGAGACCGCTGATAATGGAGTTAAGGAGGTCAAGGAGGAGCTTGAGAA
GTACTCGGGTAAAATTTCAGGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAATTGAGTGGTGCTCAACCCCCTGAGGTTTTTCTAAGAGCTTTTCAAGTGGCAGGGA
AGTGACTGAAACTTAAATGAAGCTTGAATTTAACCGTCAATAAGAATGTAACTTTCTGTTACTAAAATCAGTATTCTATACTTATTCCTGTATTTACTACCATAATGGAA
TACATTTTCAGATCAAACTGCTAATTATAGTTAGAAATTTGTTTGCCAAGTAATGAAATACTTGGATGTACACTATTAGTATTATGCCATGCGGCTGTCCTTCATGCGAT
AGAGATTACATACCCAACTTCTAATAAAAGCGAGCACCTCTCAGTGCCTAAACGCCA
Protein sequenceShow/hide protein sequence
MTESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKTEYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDS
HKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLETADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK