CuGenDBv2

Moc03g19690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g19690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr3:13270436..13275750
RNA-Seq Expression: Moc03g19690
Synteny: Moc03g19690
Gene Ontology terms: NA
InterPro domains: IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0031708.1 hypothetical protein E6C27_scaffold139G004940 [Cucumis melo var. makuwa] | 8.8e-36 | 37.45
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR
        V VD+V   +C +P+P+  G  +L+ E+GS +LWP HL+   ++ + +  +    L     + +  P+ LR LL E+    S++ + V   VFG+   C 
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR

Query:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI
        ++++ +Q F +M+PI+  CID ++ +LYK +   G    YKF DAGSIS   +SKE R + L  +L+  +  Q+L  PYNSG+HW LI ID+S+   Y +
Subjt:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI

Query:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +PLRNR++ND  DVV    +  NK K  VW +  C
Subjt:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

KAA0046954.1 uncharacterized protein E6C27_scaffold230G001320 [Cucumis melo var. makuwa] | 8.8e-36 | 37.02
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR
        V VD+V   +C +P+P+  G  +L+ E+GS +LWP HL+   ++ + +  +    L     + +  P+ LR LL E+    S++ + V   VFG+   C 
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR

Query:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI
        ++++ +Q F +M+PI+  CID ++ +LYK +   G    YKF DAGS+S   +SKE R + L  +L+  +  Q+L  PYNSG+HW LI ID+S+   Y +
Subjt:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI

Query:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +PLRNR++ND  DVV    +  NK K  VW +  C
Subjt:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa] | 9.4e-38 | 39.09
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV
        V +DVV   DC +PIPS +G   ++ E+GSHILWP        + +V T    ++   F  D ST         P+ LR LLR ++   S + +    DV
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV

Query:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY
        FG    C + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLL  PYNSG+HW L+VI+ 
Subjt:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY

Query:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +K   + I+PL+NR+D D+ +VV R+ N  NK K   W +  C
Subjt:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa] | 9.4e-38 | 39.09
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV
        V +DVV   DC +PIPS +G   ++ E+GSHILWP        + +V T    ++   F  D ST         P+ LR LLR ++   S + +    DV
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV

Query:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY
        FG    C + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLL  PYNSG+HW L+VI+ 
Subjt:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY

Query:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +K   + I+PL+NR+D D+ +VV R+ N  NK K   W +  C
Subjt:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

TYK16521.1 uncharacterized protein E5676_scaffold21G003330 [Cucumis melo var. makuwa] | 3.6e-37 | 39.41
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITIT--EKGVVATREEVLELYPFEYDRSTPIVLRCLLREMKICKSQVILPVTDDVFG--HIC
        V+V+VV   DC +PIPS +G   ++ E+GSHILWP  L+     +    A  +++    P    ++ PI LR LLR ++   S + +    DVFG    C
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITIT--EKGVVATREEVLELYPFEYDRSTPIVLRCLLREMKICKSQVILPVTDDVFG--HIC

Query:  RVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYS
         + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKF DAGSIS  + SKE R + LT++L+  +  QLL  PYNSG+HW L+VI+ +K   + 
Subjt:  RVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYS

Query:  INPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        I+PL+NR+D D+ +VV R+ N  NK K  VW +  C
Subjt:  INPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SM56 ULP_PROTEASE domain-containing protein | 4.3e-36 | 37.45
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR
        V VD+V   +C +P+P+  G  +L+ E+GS +LWP HL+   ++ + +  +    L     + +  P+ LR LL E+    S++ + V   VFG+   C 
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR

Query:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI
        ++++ +Q F +M+PI+  CID ++ +LYK +   G    YKF DAGSIS   +SKE R + L  +L+  +  Q+L  PYNSG+HW LI ID+S+   Y +
Subjt:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI

Query:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +PLRNR++ND  DVV    +  NK K  VW +  C
Subjt:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

A0A5A7TVG6 ULP_PROTEASE domain-containing protein | 4.3e-36 | 37.02
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR
        V VD+V   +C +P+P+  G  +L+ E+GS +LWP HL+   ++ + +  +    L     + +  P+ LR LL E+    S++ + V   VFG+   C 
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYD-RSTPIVLRCLLREMKICKSQVILPVTDDVFGH--ICR

Query:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI
        ++++ +Q F +M+PI+  CID ++ +LYK +   G    YKF DAGS+S   +SKE R + L  +L+  +  Q+L  PYNSG+HW LI ID+S+   Y +
Subjt:  VWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSI

Query:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +PLRNR++ND  DVV    +  NK K  VW +  C
Subjt:  NPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein | 4.5e-38 | 39.09
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV
        V +DVV   DC +PIPS +G   ++ E+GSHILWP        + +V T    ++   F  D ST         P+ LR LLR ++   S + +    DV
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV

Query:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY
        FG    C + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLL  PYNSG+HW L+VI+ 
Subjt:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY

Query:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +K   + I+PL+NR+D D+ +VV R+ N  NK K   W +  C
Subjt:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

A0A5D3CX60 ULP_PROTEASE domain-containing protein | 1.7e-37 | 39.41
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITIT--EKGVVATREEVLELYPFEYDRSTPIVLRCLLREMKICKSQVILPVTDDVFG--HIC
        V+V+VV   DC +PIPS +G   ++ E+GSHILWP  L+     +    A  +++    P    ++ PI LR LLR ++   S + +    DVFG    C
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITIT--EKGVVATREEVLELYPFEYDRSTPIVLRCLLREMKICKSQVILPVTDDVFG--HIC

Query:  RVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYS
         + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKF DAGSIS  + SKE R + LT++L+  +  QLL  PYNSG+HW L+VI+ +K   + 
Subjt:  RVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYS

Query:  INPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        I+PL+NR+D D+ +VV R+ N  NK K  VW +  C
Subjt:  INPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein | 4.5e-38 | 39.09
Query:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV
        V +DVV   DC +PIPS +G   ++ E+GSHILWP        + +V T    ++   F  D ST         P+ LR LLR ++   S + +    DV
Subjt:  VLVDVVFQNDCPLPIPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRST---------PIVLRCLLREMKICKSQVILPVTDDV

Query:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY
        FG    C + ++ ++ F  M+PI  +C+D YI+YLY  + +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLL  PYNSG+HW L+VI+ 
Subjt:  FG--HICRVWMDIVQTFYEMKPITDSCIDVYIIYLYKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDY

Query:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC
        +K   + I+PL+NR+D D+ +VV R+ N  NK K   W +  C
Subjt:  SKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCC

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGAGCATCAGGTAACTTCGATCACCGAGTTTGTGGTGATCGATCGCAGCTCAGCCTATAATGCCATACTAGGCCAACCCTTCATTCACCATCTGAAGGTCATCCCTTC
GACGTACCACCAGATGATGAAATACCCTACCGCGGCTAGGATAGCAACCATCCGAGGCGAGCAGAAAATGTCAAAGGAATACTATGCTGCAGCCTTGAGAGGCTCTATTA
CCTGCACAAATATAACAGCTGAATCTCCGCCTCTGGATGAGCCGACCTGGGGAACTTCAGTAGAAGAGTTAGAGCTTGTGCTGCTATTGAGCTCTGACAGACAAACTGAA
CTACCCTGCGCAGTCCCAGTCGAGGTACAGAATGAGCCATCCATCGACCAAGCCGAGGTGATGGATATTCAGCCAACTTCCCCAACTTGGATGGATCCAATCAAGGATTA
CCTGAGTGGGAAGGTTCCTGATGACCGTGTGGGTTTACCAACCTTTAGAGTTCAGCACTTCGATCAGCAAGCTAACGCTGAGGCACTTCTGCTCAACCTCGACCTCCTTG
AGGAAAGGCGTGTTCAGTCACAACTTCGCCGAGCCAAGTATCAGAACCGCATGGCACGATATTATAATGTATTAGTTGATGTAGTGTTCCAGAATGATTGCCCTTTGCCC
ATTCCATCTAACAGAGGATCGAATGTACTAGCTGACGAGTTAGGTTCGCACATATTATGGCCAGGACACCTCATTACCATCACTGAAAAAGGGGTGGTTGCAACCAGAGA
GGAGGTGCTTGAGCTGTACCCGTTCGAATATGATCGAAGCACCCCAATTGTTCTGCGATGCCTGCTTCGTGAGATGAAGATATGTAAATCTCAGGTAATATTGCCAGTGA
CTGATGACGTGTTCGGCCACATCTGTAGAGTATGGATGGACATTGTTCAAACCTTTTATGAGATGAAGCCGATAACTGATTCGTGCATAGATGTGTATATCATATACTTA
TACAAGAACTTGGTGACAAGAGGTGCATCTCATATGTACAAATTTCTTGATGCTGGATCAATATCGACATCCAATCTTTCAAAAGAAGCACGAACGGAGAATTTGACAAA
ACAACTTATGCAAATGGAGTCGGGTCAATTGCTTTTTGCTCCCTACAACAGTGGATCACATTGGATATTGATAGTCATTGATTACTCGAAGACCATGGTGTATTCAATCA
ACCCTTTGAGAAATCGTCTGGACAATGATATAATGGATGTCGTCAACCGGACTTTGAATAAGTGCAACAAAGTAAAGCTTAGTGTTTGGGAAATGTCTTGTTGTGTCCAT
AGTGCCCAAGACAACCTGGCTCAACAGAATGTGGGAGCAATCTTCGCACTGGAAATGCTTGTCGAACTGCGTTCGGGTGAGCTGAAGCATGTTGTGGCCGTTTCATTCAA
AGATCCATCATAA
mRNA sequence
ATGGAGCATCAGGTAACTTCGATCACCGAGTTTGTGGTGATCGATCGCAGCTCAGCCTATAATGCCATACTAGGCCAACCCTTCATTCACCATCTGAAGGTCATCCCTTC
GACGTACCACCAGATGATGAAATACCCTACCGCGGCTAGGATAGCAACCATCCGAGGCGAGCAGAAAATGTCAAAGGAATACTATGCTGCAGCCTTGAGAGGCTCTATTA
CCTGCACAAATATAACAGCTGAATCTCCGCCTCTGGATGAGCCGACCTGGGGAACTTCAGTAGAAGAGTTAGAGCTTGTGCTGCTATTGAGCTCTGACAGACAAACTGAA
CTACCCTGCGCAGTCCCAGTCGAGGTACAGAATGAGCCATCCATCGACCAAGCCGAGGTGATGGATATTCAGCCAACTTCCCCAACTTGGATGGATCCAATCAAGGATTA
CCTGAGTGGGAAGGTTCCTGATGACCGTGTGGGTTTACCAACCTTTAGAGTTCAGCACTTCGATCAGCAAGCTAACGCTGAGGCACTTCTGCTCAACCTCGACCTCCTTG
AGGAAAGGCGTGTTCAGTCACAACTTCGCCGAGCCAAGTATCAGAACCGCATGGCACGATATTATAATGTATTAGTTGATGTAGTGTTCCAGAATGATTGCCCTTTGCCC
ATTCCATCTAACAGAGGATCGAATGTACTAGCTGACGAGTTAGGTTCGCACATATTATGGCCAGGACACCTCATTACCATCACTGAAAAAGGGGTGGTTGCAACCAGAGA
GGAGGTGCTTGAGCTGTACCCGTTCGAATATGATCGAAGCACCCCAATTGTTCTGCGATGCCTGCTTCGTGAGATGAAGATATGTAAATCTCAGGTAATATTGCCAGTGA
CTGATGACGTGTTCGGCCACATCTGTAGAGTATGGATGGACATTGTTCAAACCTTTTATGAGATGAAGCCGATAACTGATTCGTGCATAGATGTGTATATCATATACTTA
TACAAGAACTTGGTGACAAGAGGTGCATCTCATATGTACAAATTTCTTGATGCTGGATCAATATCGACATCCAATCTTTCAAAAGAAGCACGAACGGAGAATTTGACAAA
ACAACTTATGCAAATGGAGTCGGGTCAATTGCTTTTTGCTCCCTACAACAGTGGATCACATTGGATATTGATAGTCATTGATTACTCGAAGACCATGGTGTATTCAATCA
ACCCTTTGAGAAATCGTCTGGACAATGATATAATGGATGTCGTCAACCGGACTTTGAATAAGTGCAACAAAGTAAAGCTTAGTGTTTGGGAAATGTCTTGTTGTGTCCAT
AGTGCCCAAGACAACCTGGCTCAACAGAATGTGGGAGCAATCTTCGCACTGGAAATGCTTGTCGAACTGCGTTCGGGTGAGCTGAAGCATGTTGTGGCCGTTTCATTCAA
AGATCCATCATAA
Protein sequence
MEHQVTSITEFVVIDRSSAYNAILGQPFIHHLKVIPSTYHQMMKYPTAARIATIRGEQKMSKEYYAAALRGSITCTNITAESPPLDEPTWGTSVEELELVLLLSSDRQTE
LPCAVPVEVQNEPSIDQAEVMDIQPTSPTWMDPIKDYLSGKVPDDRVGLPTFRVQHFDQQANAEALLLNLDLLEERRVQSQLRRAKYQNRMARYYNVLVDVVFQNDCPLP
IPSNRGSNVLADELGSHILWPGHLITITEKGVVATREEVLELYPFEYDRSTPIVLRCLLREMKICKSQVILPVTDDVFGHICRVWMDIVQTFYEMKPITDSCIDVYIIYL
YKNLVTRGASHMYKFLDAGSISTSNLSKEARTENLTKQLMQMESGQLLFAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRTLNKCNKVKLSVWEMSCCVH
SAQDNLAQQNVGAIFALEMLVELRSGELKHVVAVSFKDPS