; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g02850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g02850
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionproline-rich receptor-like protein kinase PERK5
Genome locationchr4:1794589..1797663
RNA-Seq ExpressionMoc04g02850
SyntenyMoc04g02850
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144518.1 uncharacterized protein LOC101206223 [Cucumis sativus]6.6e-7978.7Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPT--------DHT
        MAKTNKY+SINFNHIYDKNL SSNSKTGNNP + KNPSS+ SSSFA+ATYSSISSPNKSHGRMLVLTRPTP+PITSP     QP+S P+        DH 
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPT--------DHT

Query:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG
        RPQSDSD+ISLRPLGRTGT     SPIP  EKD+EI PP VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSRE NQRQYGNYG   RYGEDGRPKSGGG
Subjt:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG

Query:  YERMGGAGGA--GPMMNRPRSSGNRPSSSG
        YERM   G A  G M+NRPRSSGNRPSSSG
Subjt:  YERMGGAGGA--GPMMNRPRSSGNRPSSSG

XP_022135737.1 proline-rich receptor-like protein kinase PERK5 [Momordica charantia]1.4e-108100Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR
        MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR

Query:  PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR
        PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR
Subjt:  PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR

Query:  SSGNRPSSSG
        SSGNRPSSSG
Subjt:  SSGNRPSSSG

XP_023554572.1 uncharacterized protein LOC111811777 isoform X1 [Cucurbita pepo subsp. pepo]8.6e-7976.69Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH
        MAKTNKY+SIN+NHIYDK+L S NSKTGNNP +KNP  SS+ SSSFASATYSSISS NKSHGRMLVLTRPTPKPI+SPP    QP+S        P  D 
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH

Query:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG
        TRPQ +SD+ISLRPLGRTGT     SPIP+ EKDKEIPPP VTLHK EKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYG P RYGEDGRPKSGG
Subjt:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG

Query:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSGWKGEK
         YER+ GAG A  G M+NRPRSSGN P+SSGWK +K
Subjt:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSGWKGEK

XP_038888443.1 translation initiation factor IF-2 isoform X1 [Benincasa hispida]1.8e-8180.25Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP------SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP-----TQPRSPPTDH--
        MAKTNKY+SINFNHIYDKNL S NSKTGNNP +KNP      SS+ SSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP      Q R P  DH  
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP------SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP-----TQPRSPPTDH--

Query:  ----TR-PQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP--VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDG
            TR PQSDSD+ISLRPLGRTGT     SPIPS EKDKEI PP  VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYG PSRYGEDG
Subjt:  ----TR-PQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP--VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDG

Query:  RPKSGGGYERMGGAGGA--GPMMNRPRSSGNRPSSSGW
        RPKSGGGYERM GAG A  G M+NRPRSSGNRPSSSGW
Subjt:  RPKSGGGYERMGGAGGA--GPMMNRPRSSGNRPSSSGW

XP_038888444.1 putative protein TPRXL isoform X2 [Benincasa hispida]3.5e-8080.17Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP------SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP-----TQPRSPPTDH--
        MAKTNKY+SINFNHIYDKNL S NSKTGNNP +KNP      SS+ SSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP      Q R P  DH  
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP------SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPP-----TQPRSPPTDH--

Query:  ----TR-PQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP--VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDG
            TR PQSDSD+ISLRPLGRTGT     SPIPS EKDKEI PP  VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYG PSRYGEDG
Subjt:  ----TR-PQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP--VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDG

Query:  RPKSGGGYERMGGAGGA--GPMMNRPRSSGNRPSSSG
        RPKSGGGYERM GAG A  G M+NRPRSSGNRPSSSG
Subjt:  RPKSGGGYERMGGAGGA--GPMMNRPRSSGNRPSSSG

TrEMBL top hitse value%identityAlignment
A0A0A0K3D7 Uncharacterized protein1.7e-8078.79Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPT--------DHT
        MAKTNKY+SINFNHIYDKNL SSNSKTGNNP + KNPSS+ SSSFA+ATYSSISSPNKSHGRMLVLTRPTP+PITSP     QP+S P+        DH 
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPT--------DHT

Query:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG
        RPQSDSD+ISLRPLGRTGT     SPIP  EKD+EI PP VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSRE NQRQYGNYG   RYGEDGRPKSGGG
Subjt:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG

Query:  YERMGGAGGA--GPMMNRPRSSGNRPSSSGW
        YERM   G A  G M+NRPRSSGNRPSSSGW
Subjt:  YERMGGAGGA--GPMMNRPRSSGNRPSSSGW

A0A1S3C160 translation initiation factor IF-2 isoform X17.1e-7978.7Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRSPPT--------DHT
        MAKTNKY+SINFNHIYDKNL SSNSKTGNNP + KNPSS+ SSSFA+ATYSSISSPNKSHGRMLVLTRPTPKPITSP     QP+S P+        D  
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGA-KNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRSPPT--------DHT

Query:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG
        RPQSDSD+ISLRPLGRTGT     SPI   EK++EI PP VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSRE NQRQYGNYG  SRYGEDGRPKSGGG
Subjt:  RPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGG

Query:  YERMGGAGGA--GPMMNRPRSSGNRPSSSG
        YERM G G A  G M+NRPRSSGNRPSSSG
Subjt:  YERMGGAGGA--GPMMNRPRSSGNRPSSSG

A0A6J1C3K9 proline-rich receptor-like protein kinase PERK56.6e-109100Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR
        MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLR

Query:  PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR
        PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR
Subjt:  PLGRTGTSPIPSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPR

Query:  SSGNRPSSSG
        SSGNRPSSSG
Subjt:  SSGNRPSSSG

A0A6J1GJZ3 uncharacterized protein LOC111455051 isoform X13.5e-7877.92Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH
        MAKTNKY+SIN+NHIYDK+L S NSKTGNNP +KNP  SS+ SSSFASATYSSISS NKSHGRMLVLTRPTPKPI+SPP    QP+S        P  D 
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH

Query:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG
        TRPQ +SD+ISLRPLGRTGT     SPIP+ EKDKEIPPP VTLHKPEKFVPPHLRAGFVGKEERPV+VGIRSREANQRQYGNYG P RYGEDGRPKSGG
Subjt:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG

Query:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSG
         YERM GAG A  G M+NRPRSSGNRP+SSG
Subjt:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSG

A0A6J1GK55 uncharacterized protein LOC111455051 isoform X23.5e-7877.92Show/hide
Query:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH
        MAKTNKY+SIN+NHIYDK+L S NSKTGNNP +KNP  SS+ SSSFASATYSSISS NKSHGRMLVLTRPTPKPI+SPP    QP+S        P  D 
Subjt:  MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNP--SSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPT---QPRS--------PPTDH

Query:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG
        TRPQ +SD+ISLRPLGRTGT     SPIP+ EKDKEIPPP VTLHKPEKFVPPHLRAGFVGKEERPV+VGIRSREANQRQYGNYG P RYGEDGRPKSGG
Subjt:  TRPQSDSDAISLRPLGRTGT-----SPIPSAEKDKEIPPP-VTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGG

Query:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSG
         YERM GAG A  G M+NRPRSSGNRP+SSG
Subjt:  GYERMGGAGGA--GPMMNRPRSSGNRPSSSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54680.1 proteophosphoglycan-related8.4e-2441.7Show/hide
Query:  NKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPTDHTRPQ---------S
        NKY+SINFNHI  K+  SS                 SSS +SA+YSS++   +S+GRMLVLT+ +PKP+ SP   PT   + P   T P+          
Subjt:  NKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSP---PTQPRSPPTDHTRPQ---------S

Query:  DSDAISLRPLGRTGTS-----PIPSAEKDK----EIPPPVTLH-KPEKFVPPHLRAGFVGKEERPVNVGIRSREAN---QRQYGNYGPPSR----YGEDG
        D   ISLRPLG TG+S     PI + E +K      P PV+L  KP++FVPPHLR GFV K+E+P     R R+ N    ++  N   P +    YG+ G
Subjt:  DSDAISLRPLGRTGTS-----PIPSAEKDK----EIPPPVTLH-KPEKFVPPHLRAGFVGKEERPVNVGIRSREAN---QRQYGNYGPPSR----YGEDG

Query:  RPKSGGGYERMGGAGGAGPMMNRPRSSGNRPSSSG
        RPKS GGYER         +   PR +GNRP +SG
Subjt:  RPKSGGGYERMGGAGGAGPMMNRPRSSGNRPSSSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAACCAACAAGTATTCCTCCATCAATTTCAATCACATCTACGACAAGAATCTCTCCTCCTCCAATTCCAAAACTGGCAACAATCCCGGGGCTAAGAATCCCTC
TTCCGCGCCTTCCTCCTCCTTCGCCTCTGCAACCTATTCCTCCATTTCTTCCCCTAACAAGTCCCATGGCCGCATGCTCGTCCTGACCCGCCCAACCCCCAAACCCATCA
CCTCGCCGCCGACCCAACCCCGATCCCCGCCCACCGATCACACCCGCCCTCAATCCGATTCCGACGCCATCTCTCTTCGCCCTCTCGGTCGGACCGGTACGTCTCCGATT
CCGAGCGCTGAGAAGGACAAGGAGATTCCTCCGCCTGTGACGTTGCATAAGCCGGAGAAATTTGTTCCGCCCCATCTCAGGGCCGGTTTCGTCGGGAAGGAGGAGAGGCC
TGTGAACGTGGGGATCCGATCTAGGGAGGCGAATCAGAGGCAGTACGGGAACTACGGGCCTCCGAGCCGGTACGGCGAAGATGGGCGGCCCAAGTCCGGGGGTGGGTACG
AGAGGATGGGAGGGGCCGGTGGGGCGGGGCCGATGATGAATCGGCCAAGATCCAGTGGGAATCGCCCGAGTTCTAGCGGATGGAAGGGGGAGAAAAAGCGATATTATTCG
ATTCAAATTGGCATGCAACATGGTTATTTACTTATTCATTCTGAAGTTGGATACTTTTCCAAGGAAGAAGATGGATTACTTTTGAAGCACAGCTTTGCTGACGAGCACCC
ATCTTTTTCTCCTTCTGGAATCAGCATTCGTCAATCAGAAGTTTTATTTTTTAGAATTCTTAGTCATGGAAGATTAAACATACACCAAGTTTGCCTTCACCCTCTTCTGC
ATTTATGGAAGAGACTGAAGGTTGTTTTGGAATGTAGAACTGACCAGTGTTATGCGTCGCCATCGTCGTTAACTGCTTTTAGAGGCTTAGCATCTCGTGTGGAGGATGCT
GCTGTATTACTTCACGACAAGACTACTCCAAATGATGAGGACAAGATTACTCTAGGACAAGATTACTTCTTACCGTTAAGATTATTTCAAGACAAGACTACTCCAAATGA
TGTCAGGACAAGATTACTCCTTGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAACCAACAAGTATTCCTCCATCAATTTCAATCACATCTACGACAAGAATCTCTCCTCCTCCAATTCCAAAACTGGCAACAATCCCGGGGCTAAGAATCCCTC
TTCCGCGCCTTCCTCCTCCTTCGCCTCTGCAACCTATTCCTCCATTTCTTCCCCTAACAAGTCCCATGGCCGCATGCTCGTCCTGACCCGCCCAACCCCCAAACCCATCA
CCTCGCCGCCGACCCAACCCCGATCCCCGCCCACCGATCACACCCGCCCTCAATCCGATTCCGACGCCATCTCTCTTCGCCCTCTCGGTCGGACCGGTACGTCTCCGATT
CCGAGCGCTGAGAAGGACAAGGAGATTCCTCCGCCTGTGACGTTGCATAAGCCGGAGAAATTTGTTCCGCCCCATCTCAGGGCCGGTTTCGTCGGGAAGGAGGAGAGGCC
TGTGAACGTGGGGATCCGATCTAGGGAGGCGAATCAGAGGCAGTACGGGAACTACGGGCCTCCGAGCCGGTACGGCGAAGATGGGCGGCCCAAGTCCGGGGGTGGGTACG
AGAGGATGGGAGGGGCCGGTGGGGCGGGGCCGATGATGAATCGGCCAAGATCCAGTGGGAATCGCCCGAGTTCTAGCGGATGGAAGGGGGAGAAAAAGCGATATTATTCG
ATTCAAATTGGCATGCAACATGGTTATTTACTTATTCATTCTGAAGTTGGATACTTTTCCAAGGAAGAAGATGGATTACTTTTGAAGCACAGCTTTGCTGACGAGCACCC
ATCTTTTTCTCCTTCTGGAATCAGCATTCGTCAATCAGAAGTTTTATTTTTTAGAATTCTTAGTCATGGAAGATTAAACATACACCAAGTTTGCCTTCACCCTCTTCTGC
ATTTATGGAAGAGACTGAAGGTTGTTTTGGAATGTAGAACTGACCAGTGTTATGCGTCGCCATCGTCGTTAACTGCTTTTAGAGGCTTAGCATCTCGTGTGGAGGATGCT
GCTGTATTACTTCACGACAAGACTACTCCAAATGATGAGGACAAGATTACTCTAGGACAAGATTACTTCTTACCGTTAAGATTATTTCAAGACAAGACTACTCCAAATGA
TGTCAGGACAAGATTACTCCTTGAGTAA
Protein sequenceShow/hide protein sequence
MAKTNKYSSINFNHIYDKNLSSSNSKTGNNPGAKNPSSAPSSSFASATYSSISSPNKSHGRMLVLTRPTPKPITSPPTQPRSPPTDHTRPQSDSDAISLRPLGRTGTSPI
PSAEKDKEIPPPVTLHKPEKFVPPHLRAGFVGKEERPVNVGIRSREANQRQYGNYGPPSRYGEDGRPKSGGGYERMGGAGGAGPMMNRPRSSGNRPSSSGWKGEKKRYYS
IQIGMQHGYLLIHSEVGYFSKEEDGLLLKHSFADEHPSFSPSGISIRQSEVLFFRILSHGRLNIHQVCLHPLLHLWKRLKVVLECRTDQCYASPSSLTAFRGLASRVEDA
AVLLHDKTTPNDEDKITLGQDYFLPLRLFQDKTTPNDVRTRLLLE