; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014614 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014614
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description17.6 kDa class I heat shock protein 2
Genome locationtig00000892:249265..249898
RNA-Seq ExpressionSgr014614
SyntenySgr014614
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR002068 - Alpha crystallin/Hsp20 domain
IPR008978 - HSP20-like chaperone
IPR045045 - Small heat shock protein RTM2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605846.1 Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subsp. sororia]1.1e-4161.25Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF +EQIRVQVSSTRKLRISGERPYRN  WQRFHKEF+IP+NC TSKITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P TAAP   PP+QQ             +G+QQA   EKKSS   +GK EAKS+ +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

XP_022153773.1 inactive protein RESTRICTED TEV MOVEMENT 2-like [Momordica charantia]7.2e-4666.46Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSK
        MA N ++TYE  EPPVE S+E+GCSILTI+LPGFSKEQIRVQVSST KLRISGERPYRN  WQRFHKEFQIP NCDTS ITAKYKG ILHVRQPL+E+SK
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSK

Query:  PHTA-APTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQKKLL
           A AP AQPP+QQS   K            QA A EKKS+V  DG  EA+ S++    L
Subjt:  PHTA-APTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQKKLL

XP_022958345.1 inactive protein RESTRICTED TEV MOVEMENT 2-like [Cucurbita moschata]1.2e-4060Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF +EQIRVQVSSTRKLRISGERPY+N  WQRFHKEF+IP+NC TS ITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P TAAP   PP+QQ             +G+QQA   EKKSS   +GK EAKS+ +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

XP_022995333.1 uncharacterized protein LOC111490909 [Cucurbita maxima]7.0e-4161.25Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF KEQIRVQVSSTRKLRISGERPYRN  WQRFHKEF+IP+NC TSKITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P +AAP   PP QQ             +G+QQA   EKKSS   +GK +AKSS +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

XP_023534144.1 uncharacterized protein LOC111795792 [Cucurbita pepo subsp. pepo]2.9e-3960Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF +EQIRVQVSSTRKLRISGERPYRN   QRFHKEF+IP+NC TS ITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P TAAP   PP QQ             +G+QQA   EKKSS   +GK EAKS+ +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

TrEMBL top hitse value%identityAlignment
A0A0A0L1N9 SHSP domain-containing protein1.3e-2964.15Show/hide
Query:  HRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNN---VWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQE----
        H +TYE  EPPVE SEE+GC+IL +++PGF+KEQI+VQVSS RKLRISGER  +NN   + QRF+KEF+IP+NC+T+ ITAKYK  ILHVRQPLQ+    
Subjt:  HRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNN---VWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQE----

Query:  -KSKPH
         K +PH
Subjt:  -KSKPH

A0A5A7T9W8 Circumsporozoite protein-like8.6e-2964.15Show/hide
Query:  HRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNN---VWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQE----
        H +TYE  EPPVE SEE+GC ILT+++PGF KEQI+VQVSS RKLRISGER  ++N     QRF+KEF+IP+NC+T+ ITAKYK  ILHVRQPLQ+    
Subjt:  HRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNN---VWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQE----

Query:  -KSKPH
         K +PH
Subjt:  -KSKPH

A0A6J1DIF9 inactive protein RESTRICTED TEV MOVEMENT 2-like3.5e-4666.46Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSK
        MA N ++TYE  EPPVE S+E+GCSILTI+LPGFSKEQIRVQVSST KLRISGERPYRN  WQRFHKEFQIP NCDTS ITAKYKG ILHVRQPL+E+SK
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSK

Query:  PHTA-APTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQKKLL
           A AP AQPP+QQS   K            QA A EKKS+V  DG  EA+ S++    L
Subjt:  PHTA-APTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQKKLL

A0A6J1H377 inactive protein RESTRICTED TEV MOVEMENT 2-like5.8e-4160Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF +EQIRVQVSSTRKLRISGERPY+N  WQRFHKEF+IP+NC TS ITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P TAAP   PP+QQ             +G+QQA   EKKSS   +GK EAKS+ +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

A0A6J1K7M4 uncharacterized protein LOC1114909093.4e-4161.25Show/hide
Query:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--
        MA  ++ TYE  EP VE SEE+GCSIL++H+PGF KEQIRVQVSSTRKLRISGERPYRN  WQRFHKEF+IP+NC TSKITAKYK  ILHVRQPLQ +  
Subjt:  MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--

Query:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK
           P +AAP   PP QQ             +G+QQA   EKKSS   +GK +AKSS +++
Subjt:  -SKPHTAAPTAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQK

SwissProt top hitse value%identityAlignment
D5K211 Inactive protein RESTRICTED TEV MOVEMENT 27.8e-1131.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

D9UBX4 Inactive protein RESTRICTED TEV MOVEMENT 27.8e-1131.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

D9UBX6 Inactive protein RESTRICTED TEV MOVEMENT 27.8e-1131.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

D9UC01 Inactive protein RESTRICTED TEV MOVEMENT 21.3e-1031.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

Q9M670 Protein RESTRICTED TEV MOVEMENT 27.8e-1131.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

Arabidopsis top hitse value%identityAlignment
AT1G53540.1 HSP20-like chaperones superfamily protein7.0e-0730.93Show/hide
Query:  VEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGER----PYRNNVWQR-------FHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKP
        V+W E     +    LPG  KE+++V+V     L+ISGER      +N+ W R       F + F++P N    +I A  +  +L V  P   + KP
Subjt:  VEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGER----PYRNNVWQR-------FHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKP

AT2G27140.1 HSP20-like chaperones superfamily protein2.9e-2142.74Show/hide
Query:  ANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--SK
        AN  + Y+  EP   W  E+G   LTI+LPGF KEQ++VQV++TRKLR+ G+RP   N W RF KEF IP N D   ++AK++G  L VR P  E    +
Subjt:  ANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEK--SK

Query:  PHTAAPTAQPPKQQSVTGKQPADS
        P       +PP         P+ S
Subjt:  PHTAAPTAQPPKQQSVTGKQPADS

AT3G46230.1 heat shock protein 17.41.9e-0729.9Show/hide
Query:  VEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRN----NVWQR-------FHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKP
        V+W E     +    +PG  KE+++V+V     L+ISGER   N    + W R       F + F++P N    ++ A  +  +L V  P  ++SKP
Subjt:  VEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRN----NVWQR-------FHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKP

AT5G04890.1 HSP20-like chaperones superfamily protein5.6e-1231.25Show/hide
Query:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK
        YE   P  EW ++   +IL I L GF+KEQ++V  V S++ +R++GERP  N  W RF++ F +P NC   KI   +K  +L +  P         L E 
Subjt:  YEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRV-QVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQP---------LQEK

Query:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED
        S+   AA    A+  +++ +   +  + E    KQ +    E+K ++    + EAK+ E+
Subjt:  SKPHTAA--PTAQPPKQQSVTGKQPADSETYSGKQ-QAAAAEKKSSVTGDGKVEAKSSED

AT5G20970.1 HSP20-like chaperones superfamily protein5.2e-1836.57Show/hide
Query:  KTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKPHTAAP
        + Y+  EP   W+ E    +L   LPGF KEQ++V V++TRKLR++GERP   N W RFH+E  +P   D   ++A +K   L++R P  +   P T  P
Subjt:  KTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKPHTAAP

Query:  TAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKS
        T        V  K     E   G+   A  EK S
Subjt:  TAQPPKQQSVTGKQPADSETYSGKQQAAAAEKKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCAATCATCGGAAAACTTATGAACATGTTGAACCGCCGGTGGAGTGGTCAGAAGAGGAAGGCTGCAGCATTCTCACTATACACCTTCCTGGTTTCAGTAAAGA
GCAAATTCGGGTTCAGGTATCCTCCACCAGAAAGTTGAGGATTTCCGGAGAGCGGCCTTACCGGAACAATGTGTGGCAGCGCTTCCACAAGGAGTTCCAGATCCCAGCAA
ACTGCGACACAAGTAAAATCACAGCAAAATACAAGGGCTGCATACTTCATGTCCGCCAGCCATTGCAGGAAAAGTCGAAGCCACACACGGCAGCACCCACAGCACAACCA
CCAAAACAGCAAAGTGTCACAGGAAAGCAGCCAGCTGATTCTGAAACCTACAGTGGCAAACAACAAGCAGCGGCTGCAGAGAAAAAGAGCAGTGTTACTGGCGACGGAAA
AGTGGAAGCAAAAAGCAGCGAAGATCAAAAGAAACTGCTAATGCTTCGAGAGACAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCAATCATCGGAAAACTTATGAACATGTTGAACCGCCGGTGGAGTGGTCAGAAGAGGAAGGCTGCAGCATTCTCACTATACACCTTCCTGGTTTCAGTAAAGA
GCAAATTCGGGTTCAGGTATCCTCCACCAGAAAGTTGAGGATTTCCGGAGAGCGGCCTTACCGGAACAATGTGTGGCAGCGCTTCCACAAGGAGTTCCAGATCCCAGCAA
ACTGCGACACAAGTAAAATCACAGCAAAATACAAGGGCTGCATACTTCATGTCCGCCAGCCATTGCAGGAAAAGTCGAAGCCACACACGGCAGCACCCACAGCACAACCA
CCAAAACAGCAAAGTGTCACAGGAAAGCAGCCAGCTGATTCTGAAACCTACAGTGGCAAACAACAAGCAGCGGCTGCAGAGAAAAAGAGCAGTGTTACTGGCGACGGAAA
AGTGGAAGCAAAAAGCAGCGAAGATCAAAAGAAACTGCTAATGCTTCGAGAGACAATGTAG
Protein sequenceShow/hide protein sequence
MAANHRKTYEHVEPPVEWSEEEGCSILTIHLPGFSKEQIRVQVSSTRKLRISGERPYRNNVWQRFHKEFQIPANCDTSKITAKYKGCILHVRQPLQEKSKPHTAAPTAQP
PKQQSVTGKQPADSETYSGKQQAAAAEKKSSVTGDGKVEAKSSEDQKKLLMLRETM