; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021084 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021084
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDSBA domain-containing protein
Genome locationchr07:17032997..17036041
RNA-Seq ExpressionPI0021084
SyntenyPI0021084
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR001853 - DSBA-like thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048695.1 hypothetical protein E6C27_scaffold4358G00140 [Cucumis melo var. makuwa]2.0e-12098.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

KAE8649326.1 hypothetical protein Csa_014666 [Cucumis sativus]4.4e-12097.71Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

TYK14915.1 hypothetical protein E5676_scaffold1623G00040 [Cucumis melo var. makuwa]2.0e-12098.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

XP_004137140.1 uncharacterized protein LOC101208448 [Cucumis sativus]4.4e-12097.71Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

XP_008464570.1 PREDICTED: uncharacterized protein YwbO isoform X1 [Cucumis melo]2.0e-12098.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

TrEMBL top hitse value%identityAlignment
A0A0A0KZG9 DSBA domain-containing protein2.2e-12097.71Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNP+APKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A1S3CLT4 uncharacterized protein YwbO isoform X19.7e-12198.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A5A7U4Z3 DSBA domain-containing protein9.7e-12198.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A5D3CTQ6 DSBA domain-containing protein9.7e-12198.17Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        M ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ DNGVKEVKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

A0A6J1DQG1 uncharacterized protein LOC1110221674.6e-11593.12Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MAESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFEL WHPFQLNPSAPKEGVVKRE+YR+KFGIQSEQME+RMAEVFRGLGLDYD 
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        SGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECARKAGVEGAAEFL S DNGV +VKEELEKYSGKISGVPFYVINGKHKL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVAGK
        SGAQPPEVFLRAFQVAGK
Subjt:  SGAQPPEVFLRAFQVAGK

SwissProt top hitse value%identityAlignment
P39598 Uncharacterized protein YwbO6.7e-1026.96Show/hide
Query:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPS-APKEGVVK--REYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDSH
        + I + SD VCP+CFVGK   ++AI       D E+ W PF+L PS +P+   V    + Y  +  IQ       MAE    LG++ +   ++ +     
Subjt:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPS-APKEGVVK--REYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDSH

Query:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV
                +   K H+    +   +F + + IGD D L + A + G++GA+          ++V+ +  K++     I+ VP ++I G   + GA   +V
Subjt:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV

Query:  FLRA
        F +A
Subjt:  FLRA

Arabidopsis top hitse value%identityAlignment
AT5G38900.1 Thioredoxin superfamily protein1.8e-8267.59Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT
        MAES  S    KKLIQID+SSD+VCPWCFVGKKNLDKAI AS+DQY+FE+ W PF L+PSAPKEGV K+E+Y  K+G + + M ARM+EVF+GLGL++DT
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDT

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        +GLTGN+LDSH+LI+  G+Q   KQH LVEEL +GYFTQGK+IGDR+FL+E A K G+EGA EFL   +NGV EVKEEL KYS  I+GVP Y INGK KL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVA
        SGAQPPE F  AF+ A
Subjt:  SGAQPPEVFLRAFQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGTCAGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATTGATATAAGCTCCGACACGGTTTGCCCGTGGTGCTTTGTTGGCAAAAAAAATCTTGACAA
AGCAATTTCTGCTTCTCAGGATCAATATGATTTTGAGTTGAATTGGCATCCTTTCCAGCTCAACCCTTCTGCTCCCAAAGAAGGTGTTGTTAAGAGGGAATATTACAGGA
GTAAGTTTGGAATTCAATCTGAACAGATGGAAGCTCGGATGGCAGAGGTGTTTAGGGGTCTTGGACTGGATTATGACACTTCTGGGCTGACGGGAAATACTCTAGACAGC
CATAAGCTTATATATTTGGCAGGTCAACAAGGCTTAGGCAAACAACATGATCTTGTGGAGGAGTTGTGCCTTGGATACTTCACTCAGGGAAAATACATTGGCGACCGGGA
TTTTCTTTTGGAATGTGCTAGAAAGGCAGGGGTGGAAGGAGCAGCAGAGTTTCTCGAGTCCACTGATAATGGAGTTAAGGAGGTCAAGGAGGAGCTTGAGAAGTACTCGG
GAAAAATTTCAGGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAATTGAGTGGTGCTCAACCCCCTGAGGTTTTTCTAAGAGCTTTTCAAGTGGCAGGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAGTCAAGTAATAGGTGTTAGGTAGACATTGAAAATGGAGACGCTAAAAATATCGCCATTCTCCGTATCCAACTGCCACTGCGGTTCGTTTCTCCCAACAGACTCCC
AATTCCATCCTCTCAATTTTTCCAGGGAGAGTGCAAATTTATCAGGATCATGGCTGAGTCAGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATTGATATAAGCT
CCGACACGGTTTGCCCGTGGTGCTTTGTTGGCAAAAAAAATCTTGACAAAGCAATTTCTGCTTCTCAGGATCAATATGATTTTGAGTTGAATTGGCATCCTTTCCAGCTC
AACCCTTCTGCTCCCAAAGAAGGTGTTGTTAAGAGGGAATATTACAGGAGTAAGTTTGGAATTCAATCTGAACAGATGGAAGCTCGGATGGCAGAGGTGTTTAGGGGTCT
TGGACTGGATTATGACACTTCTGGGCTGACGGGAAATACTCTAGACAGCCATAAGCTTATATATTTGGCAGGTCAACAAGGCTTAGGCAAACAACATGATCTTGTGGAGG
AGTTGTGCCTTGGATACTTCACTCAGGGAAAATACATTGGCGACCGGGATTTTCTTTTGGAATGTGCTAGAAAGGCAGGGGTGGAAGGAGCAGCAGAGTTTCTCGAGTCC
ACTGATAATGGAGTTAAGGAGGTCAAGGAGGAGCTTGAGAAGTACTCGGGAAAAATTTCAGGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAATTGAGTGGTGCTCA
ACCCCCTGAGGTTTTTCTAAGAGCTTTTCAAGTGGCAGGGAAGTGACTGAAACTTAAATGAAGCTTTGAATTTAACCATCAATAAGAATGCAACTTTCTTTTGCTAAAAT
CAGTATACATACTCTTATTCCTGTATTTACTACCATAATGGAATACATTTTCAGATCAAACTGCTAATTATAGTTAGAAATTTGTTTGCCAAGTAATGAAATACTTGGTT
GTACACTATTAGTATTAATACATGTTTGAGATTTAAATAGTTTCA
Protein sequenceShow/hide protein sequence
MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELNWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDTSGLTGNTLDS
HKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESTDNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK