CuGenDBv2

Moc08g26320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g26320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr8:18895511..18897377
RNA-Seq Expression: Moc08g26320
Synteny: Moc08g26320
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0060105.1 hypothetical protein E6C27_scaffold39G00240 [Cucumis melo var. makuwa] | e value: 8.6e-42 | % identity: 40.14
Query:  TLE-FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEK--EIKHSRELFSLASGK
        TLE  K G  C+LA  + D+VVA  TI +S  +  NVK+++DVVVD D  +PIP+  G   +SQE+ LHILWP+ LVI+NN K    K ++++ + A   
Subjt:  TLE-FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEK--EIKHSRELFSLASGK

Query:  SPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQ
        +P Q A V+L  L R + +   AIQ+    DVF    K  IM+E+++ F  M P  T C+DAY  +L+  + +    + YKF+DAG+ S  +  KE   Q
Subjt:  SPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQ

Query:  VLTKRLSELELNQLLMFPYHSG-----AVRNIQK-----------------KPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
        +LT RL     +QLL+FPY+SG      V N+ K                      + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  VLTKRLSELELNQLLMFPYHSG-----AVRNIQK-----------------KPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa] | e value: 1.9e-41 | % identity: 34.92
Query:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL
        SK+ P       S  E DENV   E++   L+ E+ +E                            K GT C+LA  + D+VVA GTI +S  +  NVK+
Subjt:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL

Query:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL
        ++DVVVD D  +PIP+  G   +SQE+  HILWP+ LVI+NN K    + ++++ + A   +P Q A V+L  L R + +   AIQ+    DVF    K 
Subjt:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL

Query:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------
         IM+E+++ F  M P  T C+DAY  +L+  + +    + YKFLDAG+ S  +  KE  +Q+LT RL   + +QLL+FPY+SG                 
Subjt:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------

Query:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
                          +   + KK  A + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa] | e value: 1.9e-41 | % identity: 34.92
Query:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL
        SK+ P       S  E DENV   E++   L+ E+ +E                            K GT C+LA  + D+VVA GTI +S  +  NVK+
Subjt:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL

Query:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL
        ++DVVVD D  +PIP+  G   +SQE+  HILWP+ LVI+NN K    + ++++ + A   +P Q A V+L  L R + +   AIQ+    DVF    K 
Subjt:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL

Query:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------
         IM+E+++ F  M P  T C+DAY  +L+  + +    + YKFLDAG+ S  +  KE  +Q+LT RL   + +QLL+FPY+SG                 
Subjt:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------

Query:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
                          +   + KK  A + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

XP_022156814.1 uncharacterized protein LOC111023655 [Momordica charantia] | e value: 3.2e-60 | % identity: 85.42
Query:  HGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTT
        HGGNDILSQEI  HIL PQSLVI +NEKEIKHSRELFSLASGKSPTQPA VS TCLTREINY  RAIQ+IISKDVF HE+KLFIMVE+VQK FHMEPTTT
Subjt:  HGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTT

Query:  PCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQV
        PCIDAY TFLHRSLGNENE SPYKFLD GA SITNL KEN +QV
Subjt:  PCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQV

XP_022156878.1 uncharacterized protein LOC111023711 [Momordica charantia] | e value: 6.2e-133 | % identity: 86.88
Query:  EILDKEETLEFKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSL
        +ILDKEETLEFKEGTHCRLALGSIDNVVAA TIFESGRKDGNVK+S+DVVVDDDSRLPIPTHGGN+ILSQEI  HILWPQ+LVIS+NEKEIKHSRELFS 
Subjt:  EILDKEETLEFKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSL

Query:  ASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKE
        ASGKSPTQPA VSLTCLTREINY  RAIQ+IISKDVF HEHKLFIMVE+VQK FHMEPTTTPCIDAYTTFLHRSLGNE ESSPYKFLDAGATSITNL KE
Subjt:  ASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKE

Query:  NHMQVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP--------------CPKQPNAVECGYYVMRFMHDIVFARN
        N +QVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP              CPKQPNAVECGYYV+RFM DIVFARN
Subjt:  NHMQVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP--------------CPKQPNAVECGYYVMRFMHDIVFARN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7V2K1 ULP_PROTEASE domain-containing protein | e value: 4.2e-42 | % identity: 40.14
Query:  TLE-FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEK--EIKHSRELFSLASGK
        TLE  K G  C+LA  + D+VVA  TI +S  +  NVK+++DVVVD D  +PIP+  G   +SQE+ LHILWP+ LVI+NN K    K ++++ + A   
Subjt:  TLE-FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEK--EIKHSRELFSLASGK

Query:  SPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQ
        +P Q A V+L  L R + +   AIQ+    DVF    K  IM+E+++ F  M P  T C+DAY  +L+  + +    + YKF+DAG+ S  +  KE   Q
Subjt:  SPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQ

Query:  VLTKRLSELELNQLLMFPYHSG-----AVRNIQK-----------------KPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
        +LT RL     +QLL+FPY+SG      V N+ K                      + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  VLTKRLSELELNQLLMFPYHSG-----AVRNIQK-----------------KPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein | e value: 9.3e-42 | % identity: 34.92
Query:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL
        SK+ P       S  E DENV   E++   L+ E+ +E                            K GT C+LA  + D+VVA GTI +S  +  NVK+
Subjt:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL

Query:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL
        ++DVVVD D  +PIP+  G   +SQE+  HILWP+ LVI+NN K    + ++++ + A   +P Q A V+L  L R + +   AIQ+    DVF    K 
Subjt:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL

Query:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------
         IM+E+++ F  M P  T C+DAY  +L+  + +    + YKFLDAG+ S  +  KE  +Q+LT RL   + +QLL+FPY+SG                 
Subjt:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------

Query:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
                          +   + KK  A + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein | e value: 9.3e-42 | % identity: 34.92
Query:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL
        SK+ P       S  E DENV   E++   L+ E+ +E                            K GT C+LA  + D+VVA GTI +S  +  NVK+
Subjt:  SKKSPQSKRGTLSKHERDENVLKKENIDEILDKEETLE---------------------------FKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKL

Query:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL
        ++DVVVD D  +PIP+  G   +SQE+  HILWP+ LVI+NN K    + ++++ + A   +P Q A V+L  L R + +   AIQ+    DVF    K 
Subjt:  SLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEI--KHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKL

Query:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------
         IM+E+++ F  M P  T C+DAY  +L+  + +    + YKFLDAG+ S  +  KE  +Q+LT RL   + +QLL+FPY+SG                 
Subjt:  FIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTKRLSELELNQLLMFPYHSG-----------------

Query:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA
                          +   + KK  A + V CPKQ   VECGYYVMRFM DI+ +
Subjt:  ------------------AVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFA

A0A6J1DRE3 uncharacterized protein LOC111023655 | e value: 1.5e-60 | % identity: 85.42
Query:  HGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTT
        HGGNDILSQEI  HIL PQSLVI +NEKEIKHSRELFSLASGKSPTQPA VS TCLTREINY  RAIQ+IISKDVF HE+KLFIMVE+VQK FHMEPTTT
Subjt:  HGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSLASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTT

Query:  PCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQV
        PCIDAY TFLHRSLGNENE SPYKFLD GA SITNL KEN +QV
Subjt:  PCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQV

A0A6J1DRT3 uncharacterized protein LOC111023711 | e value: 3.0e-133 | % identity: 86.88
Query:  EILDKEETLEFKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSL
        +ILDKEETLEFKEGTHCRLALGSIDNVVAA TIFESGRKDGNVK+S+DVVVDDDSRLPIPTHGGN+ILSQEI  HILWPQ+LVIS+NEKEIKHSRELFS 
Subjt:  EILDKEETLEFKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSL

Query:  ASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKE
        ASGKSPTQPA VSLTCLTREINY  RAIQ+IISKDVF HEHKLFIMVE+VQK FHMEPTTTPCIDAYTTFLHRSLGNE ESSPYKFLDAGATSITNL KE
Subjt:  ASGKSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKE

Query:  NHMQVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP--------------CPKQPNAVECGYYVMRFMHDIVFARN
        N +QVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP              CPKQPNAVECGYYV+RFM DIVFARN
Subjt:  NHMQVLTKRLSELELNQLLMFPYHSGAVRNIQKKPFALKRVP--------------CPKQPNAVECGYYVMRFMHDIVFARN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCAGCTCTTGTGAAAGAACTTCAAGCGAAACTGAAGAAGCATGAAAAAGGTTCCCCAAAATCAAAACATGGCACGCCAGCTAAGAAAACTCCAAAAAAATCT
CCTAAACTGAAGCGTATCACACCATCTAAAAATGCTCCAAAGAAATCTCCTCGGTCGAAGCGTACCACACCATCAAAAAACGATCCAAAGAAATCTCCTCAGTCA
AAGCATACCACACCATCTAAAAACGCTTCAAAAAAATCTCCGCAATCAAAGCGTGGTACTTTATCGAAGCATGAGCGAGATGAAAACGTTTTAAAAAAGGAGAAT
ATAGATGAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAAGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGCTGCGGGCACTATA
TTTGAATCTGGGAGGAAGGATGGAAACGTGAAATTGTCCTTAGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTCCGACGCATGGAGGAAATGATATTCTC
TCACAAGAAATATGTTTACATATATTATGGCCACAAAGTCTAGTCATATCCAATAATGAGAAGGAAATAAAACATTCGAGGGAGCTATTCAGTTTGGCAAGTGGA
AAATCTCCAACACAACCTGCGCTCGTTAGTCTAACATGTTTAACTCGTGAGATAAACTACTTTAGAAGGGCAATTCAAGTGATTATATCGAAGGATGTGTTCAAT
CATGAACATAAATTGTTTATTATGGTGGAGAATGTACAGAAGTTTTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTACACGACGTTCTTACATAGA
TCGTTGGGCAATGAAAATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGCCACTTCCATAACTAATCTGTTTAAAGAAAACCACATGCAAGTATTGACTAAA
AGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCCATATCATTCCGGGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACCGTGC
CCAAAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGCGGTTCATGCATGATATAGTCTTCGCTCGTAACCAACAATCCCAGAATGCGTACGTTTATAGC
CTAGTGTGA
mRNA sequence
ATGGCAGCTCTTGTGAAAGAACTTCAAGCGAAACTGAAGAAGCATGAAAAAGGTTCCCCAAAATCAAAACATGGCACGCCAGCTAAGAAAACTCCAAAAAAATCT
CCTAAACTGAAGCGTATCACACCATCTAAAAATGCTCCAAAGAAATCTCCTCGGTCGAAGCGTACCACACCATCAAAAAACGATCCAAAGAAATCTCCTCAGTCA
AAGCATACCACACCATCTAAAAACGCTTCAAAAAAATCTCCGCAATCAAAGCGTGGTACTTTATCGAAGCATGAGCGAGATGAAAACGTTTTAAAAAAGGAGAAT
ATAGATGAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAAGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGCTGCGGGCACTATA
TTTGAATCTGGGAGGAAGGATGGAAACGTGAAATTGTCCTTAGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTCCGACGCATGGAGGAAATGATATTCTC
TCACAAGAAATATGTTTACATATATTATGGCCACAAAGTCTAGTCATATCCAATAATGAGAAGGAAATAAAACATTCGAGGGAGCTATTCAGTTTGGCAAGTGGA
AAATCTCCAACACAACCTGCGCTCGTTAGTCTAACATGTTTAACTCGTGAGATAAACTACTTTAGAAGGGCAATTCAAGTGATTATATCGAAGGATGTGTTCAAT
CATGAACATAAATTGTTTATTATGGTGGAGAATGTACAGAAGTTTTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTACACGACGTTCTTACATAGA
TCGTTGGGCAATGAAAATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGCCACTTCCATAACTAATCTGTTTAAAGAAAACCACATGCAAGTATTGACTAAA
AGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCCATATCATTCCGGGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACCGTGC
CCAAAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGCGGTTCATGCATGATATAGTCTTCGCTCGTAACCAACAATCCCAGAATGCGTACGTTTATAGC
CTAGTGTGA
Protein sequence
MAALVKELQAKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRITPSKNAPKKSPRSKRTTPSKNDPKKSPQSKHTTPSKNASKKSPQSKRGTLSKHERDENVLKKEN
IDEILDKEETLEFKEGTHCRLALGSIDNVVAAGTIFESGRKDGNVKLSLDVVVDDDSRLPIPTHGGNDILSQEICLHILWPQSLVISNNEKEIKHSRELFSLASG
KSPTQPALVSLTCLTREINYFRRAIQVIISKDVFNHEHKLFIMVENVQKFFHMEPTTTPCIDAYTTFLHRSLGNENESSPYKFLDAGATSITNLFKENHMQVLTK
RLSELELNQLLMFPYHSGAVRNIQKKPFALKRVPCPKQPNAVECGYYVMRFMHDIVFARNQQSQNAYVYSLV
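
Consistency check (illustrative): the protein sequence above should be the translation of the CDS sequence. The Python sketch below shows one way to confirm this. It assumes the standard nuclear genetic code (NCBI translation table 1), which the record itself does not state, and the helper name `translate` is hypothetical rather than part of any CuGenDB tool. The two short inputs are the first six and the last three codons of the CDS listed above.

```python
# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup from the two strings above.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper), stopping at the first stop codon."""
    # Remove line breaks and other whitespace so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGCAGCTCTTGTGAAA"))  # MAALVK
# Last three codons of the CDS listed above (TGA is a stop codon).
print(translate("CTAGTGTGA"))  # LV
```

Passing the full CDS sequence to `translate` and comparing the result with the protein sequence (after removing line breaks from both) extends the same check to the whole record.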