; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21790 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21790
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr4:15873472..15877048
RNA-Seq ExpressionMoc04g21790
SyntenyMoc04g21790
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]3.1e-4958.1Show/hide
Query:  EAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------GEN--NASTSNAPTHQQKG
        EAFNI+E ISSNN+SW DP+A+Q K SK L ESESY+ LN KIENLTDLVMRS+TQQS  GAS    N             G++  N    N  +   +G
Subjt:  EAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------GEN--NASTSNAPTHQQKG

Query:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP
            G SN      +R+S GSFASLE LMK+YM  NDATV+ Q + LRNLE+QVGQL TDL SRP GALPSDT++PKRD KEQCKALTL SGKALP  H 
Subjt:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP

Query:  NTPGIESEPT
        N P +  E T
Subjt:  NTPGIESEPT

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]2.7e-5353.54Show/hide
Query:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGA--------STSKANGENNASTSNAPTHQQKG
        +L KP+ +A NI+E ISS+N+SWSD RAI+ K SK L ESESY+ LN KIE LTDL  R+ +  +T             S   G +N   SNAPT QQK 
Subjt:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGA--------STSKANGENNASTSNAPTHQQKG

Query:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP
        S PPGF+ Q Q+    +S GS  SLEN+MK+YM  NDATVQSQ  SLRNLE+QVGQL  DLKSRP GALPSDT++PKRD KEQC ALTL+SGKALP  HP
Subjt:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP

Query:  NTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIE-YKPAPPYPKRLQKKE
        N P +  EP +  QGE             S+ D  P E   P PP     Q KE
Subjt:  NTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIE-YKPAPPYPKRLQKKE

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.7e-5245.79Show/hide
Query:  DPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------------------------------------------
        DPRA+Q K SKGL ESESY+ LN  IENLT LVMRSM QQS+VGA T  AN                                                 
Subjt:  DPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------------------------------------------

Query:  -------------GENNASTSNAPTHQQKGSCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALP
                     G +NA TS+AP  Q K S PPGF NQ Q+ A+R+S GS ASLE LMK+YM  NDATVQSQ TSLRNL++QVGQL TDLKS+      
Subjt:  -------------GENNASTSNAPTHQQKGSCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALP

Query:  SDTKLPKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHV
                                                        P R+ ++ K+A +H++ P EY PAPPYPKRLQKKE+NVQFNKFLDVLKQLHV
Subjt:  SDTKLPKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHV

Query:  NIPLVEALEQMPNYVRFHQEV
        NIPLVEALEQMPNYVRF +E+
Subjt:  NIPLVEALEQMPNYVRFHQEV

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia]3.8e-4743.28Show/hide
Query:  AFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGENNASTSNAPTHQQKGSCPPGFSNQSQVTAQR
        +++  EG    N   S+P+++   G+   N +  YS               +    S    S S+  G N+  TSNAP +QQKG+ PP  +NQ Q   Q+
Subjt:  AFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGENNASTSNAPTHQQKGSCPPGFSNQSQVTAQR

Query:  RSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKL---PKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKE
           GSFASLENLMK+YMEKN+ TVQS   SLRNLE+QVGQL TDLKSRPYGALPSDTK+   PK       K +   + KA  F                
Subjt:  RSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKL---PKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKE

Query:  QGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFHQEV-----KFLVYDSMRFPAKSEEC
           +  A++ ++ K+  +H+D P E++P PPYPKRL+KKEQ+VQF KFLDVL QLHVNIPLVEA EQM  YVRF +++     K   Y ++    +S   
Subjt:  QGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFHQEV-----KFLVYDSMRFPAKSEEC

Query:  SVLKI
         + KI
Subjt:  SVLKI

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]4.3e-5157.52Show/hide
Query:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGEN--NASTSNAPTHQQKGSCP---
        +LAKP+ EAFNI+E ISSNN SWSDPRAI  KGSKG NESES++ALNLKIENLTDLVMRSMT QSTVGAS  KAN  +    S S      +  +CP   
Subjt:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGEN--NASTSNAPTHQQKGSCP---

Query:  -------PGFSNQSQVTAQRR--SVGSFASL------ENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALT
                  +N++   + R   + G+  +L      +  M KYME ND TVQSQ  SLRNLEMQVGQL TDLKS+P G LPSD K+PKRD KEQC ALT
Subjt:  -------PGFSNQSQVTAQRR--SVGSFASL------ENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALT

Query:  LQSGKALPFAHPNTPGIESEPTRKEQ
        L+SGK LP AHPN  GI  E  + E+
Subjt:  LQSGKALPFAHPNTPGIESEPTRKEQ

TrEMBL top hitse value%identityAlignment
A0A6J1DTD1 uncharacterized protein LOC1110241361.5e-4958.1Show/hide
Query:  EAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------GEN--NASTSNAPTHQQKG
        EAFNI+E ISSNN+SW DP+A+Q K SK L ESESY+ LN KIENLTDLVMRS+TQQS  GAS    N             G++  N    N  +   +G
Subjt:  EAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------GEN--NASTSNAPTHQQKG

Query:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP
            G SN      +R+S GSFASLE LMK+YM  NDATV+ Q + LRNLE+QVGQL TDL SRP GALPSDT++PKRD KEQCKALTL SGKALP  H 
Subjt:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP

Query:  NTPGIESEPT
        N P +  E T
Subjt:  NTPGIESEPT

A0A6J1DWK1 uncharacterized protein LOC1110250531.3e-5353.54Show/hide
Query:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGA--------STSKANGENNASTSNAPTHQQKG
        +L KP+ +A NI+E ISS+N+SWSD RAI+ K SK L ESESY+ LN KIE LTDL  R+ +  +T             S   G +N   SNAPT QQK 
Subjt:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGA--------STSKANGENNASTSNAPTHQQKG

Query:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP
        S PPGF+ Q Q+    +S GS  SLEN+MK+YM  NDATVQSQ  SLRNLE+QVGQL  DLKSRP GALPSDT++PKRD KEQC ALTL+SGKALP  HP
Subjt:  SCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHP

Query:  NTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIE-YKPAPPYPKRLQKKE
        N P +  EP +  QGE             S+ D  P E   P PP     Q KE
Subjt:  NTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIE-YKPAPPYPKRLQKKE

A0A6J1DWN2 uncharacterized protein LOC1110252031.8e-4743.28Show/hide
Query:  AFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGENNASTSNAPTHQQKGSCPPGFSNQSQVTAQR
        +++  EG    N   S+P+++   G+   N +  YS               +    S    S S+  G N+  TSNAP +QQKG+ PP  +NQ Q   Q+
Subjt:  AFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGENNASTSNAPTHQQKGSCPPGFSNQSQVTAQR

Query:  RSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKL---PKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKE
           GSFASLENLMK+YMEKN+ TVQS   SLRNLE+QVGQL TDLKSRPYGALPSDTK+   PK       K +   + KA  F                
Subjt:  RSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKL---PKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKE

Query:  QGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFHQEV-----KFLVYDSMRFPAKSEEC
           +  A++ ++ K+  +H+D P E++P PPYPKRL+KKEQ+VQF KFLDVL QLHVNIPLVEA EQM  YVRF +++     K   Y ++    +S   
Subjt:  QGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFHQEV-----KFLVYDSMRFPAKSEEC

Query:  SVLKI
         + KI
Subjt:  SVLKI

A0A6J1DXK5 uncharacterized protein LOC1110255002.1e-5157.52Show/hide
Query:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGEN--NASTSNAPTHQQKGSCP---
        +LAKP+ EAFNI+E ISSNN SWSDPRAI  KGSKG NESES++ALNLKIENLTDLVMRSMT QSTVGAS  KAN  +    S S      +  +CP   
Subjt:  MLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKANGEN--NASTSNAPTHQQKGSCP---

Query:  -------PGFSNQSQVTAQRR--SVGSFASL------ENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALT
                  +N++   + R   + G+  +L      +  M KYME ND TVQSQ  SLRNLEMQVGQL TDLKS+P G LPSD K+PKRD KEQC ALT
Subjt:  -------PGFSNQSQVTAQRR--SVGSFASL------ENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALPSDTKLPKRDDKEQCKALT

Query:  LQSGKALPFAHPNTPGIESEPTRKEQ
        L+SGK LP AHPN  GI  E  + E+
Subjt:  LQSGKALPFAHPNTPGIESEPTRKEQ

A0A6J1E1F3 uncharacterized protein LOC1110250658.5e-5345.79Show/hide
Query:  DPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------------------------------------------
        DPRA+Q K SKGL ESESY+ LN  IENLT LVMRSM QQS+VGA T  AN                                                 
Subjt:  DPRAIQEKGSKGLNESESYSALNLKIENLTDLVMRSMTQQSTVGASTSKAN-------------------------------------------------

Query:  -------------GENNASTSNAPTHQQKGSCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALP
                     G +NA TS+AP  Q K S PPGF NQ Q+ A+R+S GS ASLE LMK+YM  NDATVQSQ TSLRNL++QVGQL TDLKS+      
Subjt:  -------------GENNASTSNAPTHQQKGSCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLKSRPYGALP

Query:  SDTKLPKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHV
                                                        P R+ ++ K+A +H++ P EY PAPPYPKRLQKKE+NVQFNKFLDVLKQLHV
Subjt:  SDTKLPKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHV

Query:  NIPLVEALEQMPNYVRFHQEV
        NIPLVEALEQMPNYVRF +E+
Subjt:  NIPLVEALEQMPNYVRFHQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTGCCTAACCCACCTCCGCGCCAGCCTATTCCACCAAATGTGAGGATTGAGGAAATAGTAGACGGGGTTCCTATTGCTGCTAACCCTGTGTTAGCGAGTCTTAG
GAGCGTTTTGAACGACTTCTGCAAAAATGCCCCCACCACAGGATCCCAAGATGCATCTAGATCGAAACGTATTACAATGGTTTGGACGATGCTAGCAAAACCTCATGTTG
AAGCATTCAATATCATAGAAGGGATATCATCGAACAATTATTCATGGTCTGACCCTAGAGCCATTCAAGAAAAAGGAAGCAAGGGACTGAACGAATCTGAGTCATACTCT
GCACTGAATTTAAAAATTGAGAATCTAACTGACTTGGTTATGAGGAGCATGACACAGCAAAGCACGGTGGGAGCATCTACCAGTAAAGCAAATGGAGAAAACAATGCTAG
TACATCTAATGCTCCCACACATCAGCAAAAAGGAAGTTGTCCCCCAGGTTTTTCAAACCAAAGTCAGGTAACAGCACAAAGGCGATCTGTAGGGTCATTCGCATCATTGG
AAAATCTGATGAAGAAGTATATGGAAAAGAATGATGCAACGGTGCAAAGTCAGACAACATCGCTAAGAAATTTGGAGATGCAGGTAGGTCAACTCATAACAGATTTGAAA
AGCAGACCCTATGGAGCATTACCTAGTGACACTAAGTTGCCAAAGCGGGACGACAAGGAGCAATGTAAAGCTCTCACACTGCAAAGTGGGAAAGCATTACCCTTTGCACA
CCCGAATACTCCAGGAATAGAGAGCGAACCTACTCGAAAAGAACAAGGAGAGTCTGACCCAGCAAGGATAAGTGACCGAGAAAAAGAAGCAAGCAAACATGACGATACTC
CAATAGAGTATAAACCAGCACCACCATATCCCAAGCGGTTGCAAAAGAAAGAGCAAAACGTTCAATTTAATAAATTCTTAGATGTGCTGAAGCAGCTGCATGTGAACATA
CCATTGGTGGAAGCTCTGGAGCAAATGCCGAATTATGTAAGGTTCCATCAAGAAGTAAAGTTCTTAGTGTATGACTCGATGAGGTTCCCTGCTAAATCAGAAGAATGCTC
AGTGCTTAAAATTCTAGATGAAGCATTAATAGAAGAATTGGAAGCAGAAGCAATGCTAGAGCAGCTGGAATTTACCGACTCTGGAAGCGATGTTGATGAGTTACTTGATG
TTCGACCTTGCCATTTATTCCCAAGGCCTAGCACCACTACTTCGCAAGCACCACTCTACTGTAGTCCTTCAAGCCTAACTCCTCGAGTCGCATTGGCCCTTGACATTTAT
CCCCAAGGCCTCAATACTCCTCTACAGAAACTTGAAGTTCGGTCTTGCCATTTATTCCCAAGGCCTGAACTGTGCACCGGTCCTTGCCGTTTATCCCCAAGATACCGATG
A
mRNA sequenceShow/hide mRNA sequence
ATGAATCTGCCTAACCCACCTCCGCGCCAGCCTATTCCACCAAATGTGAGGATTGAGGAAATAGTAGACGGGGTTCCTATTGCTGCTAACCCTGTGTTAGCGAGTCTTAG
GAGCGTTTTGAACGACTTCTGCAAAAATGCCCCCACCACAGGATCCCAAGATGCATCTAGATCGAAACGTATTACAATGGTTTGGACGATGCTAGCAAAACCTCATGTTG
AAGCATTCAATATCATAGAAGGGATATCATCGAACAATTATTCATGGTCTGACCCTAGAGCCATTCAAGAAAAAGGAAGCAAGGGACTGAACGAATCTGAGTCATACTCT
GCACTGAATTTAAAAATTGAGAATCTAACTGACTTGGTTATGAGGAGCATGACACAGCAAAGCACGGTGGGAGCATCTACCAGTAAAGCAAATGGAGAAAACAATGCTAG
TACATCTAATGCTCCCACACATCAGCAAAAAGGAAGTTGTCCCCCAGGTTTTTCAAACCAAAGTCAGGTAACAGCACAAAGGCGATCTGTAGGGTCATTCGCATCATTGG
AAAATCTGATGAAGAAGTATATGGAAAAGAATGATGCAACGGTGCAAAGTCAGACAACATCGCTAAGAAATTTGGAGATGCAGGTAGGTCAACTCATAACAGATTTGAAA
AGCAGACCCTATGGAGCATTACCTAGTGACACTAAGTTGCCAAAGCGGGACGACAAGGAGCAATGTAAAGCTCTCACACTGCAAAGTGGGAAAGCATTACCCTTTGCACA
CCCGAATACTCCAGGAATAGAGAGCGAACCTACTCGAAAAGAACAAGGAGAGTCTGACCCAGCAAGGATAAGTGACCGAGAAAAAGAAGCAAGCAAACATGACGATACTC
CAATAGAGTATAAACCAGCACCACCATATCCCAAGCGGTTGCAAAAGAAAGAGCAAAACGTTCAATTTAATAAATTCTTAGATGTGCTGAAGCAGCTGCATGTGAACATA
CCATTGGTGGAAGCTCTGGAGCAAATGCCGAATTATGTAAGGTTCCATCAAGAAGTAAAGTTCTTAGTGTATGACTCGATGAGGTTCCCTGCTAAATCAGAAGAATGCTC
AGTGCTTAAAATTCTAGATGAAGCATTAATAGAAGAATTGGAAGCAGAAGCAATGCTAGAGCAGCTGGAATTTACCGACTCTGGAAGCGATGTTGATGAGTTACTTGATG
TTCGACCTTGCCATTTATTCCCAAGGCCTAGCACCACTACTTCGCAAGCACCACTCTACTGTAGTCCTTCAAGCCTAACTCCTCGAGTCGCATTGGCCCTTGACATTTAT
CCCCAAGGCCTCAATACTCCTCTACAGAAACTTGAAGTTCGGTCTTGCCATTTATTCCCAAGGCCTGAACTGTGCACCGGTCCTTGCCGTTTATCCCCAAGATACCGATG
A
Protein sequenceShow/hide protein sequence
MNLPNPPPRQPIPPNVRIEEIVDGVPIAANPVLASLRSVLNDFCKNAPTTGSQDASRSKRITMVWTMLAKPHVEAFNIIEGISSNNYSWSDPRAIQEKGSKGLNESESYS
ALNLKIENLTDLVMRSMTQQSTVGASTSKANGENNASTSNAPTHQQKGSCPPGFSNQSQVTAQRRSVGSFASLENLMKKYMEKNDATVQSQTTSLRNLEMQVGQLITDLK
SRPYGALPSDTKLPKRDDKEQCKALTLQSGKALPFAHPNTPGIESEPTRKEQGESDPARISDREKEASKHDDTPIEYKPAPPYPKRLQKKEQNVQFNKFLDVLKQLHVNI
PLVEALEQMPNYVRFHQEVKFLVYDSMRFPAKSEECSVLKILDEALIEELEAEAMLEQLEFTDSGSDVDELLDVRPCHLFPRPSTTTSQAPLYCSPSSLTPRVALALDIY
PQGLNTPLQKLEVRSCHLFPRPELCTGPCRLSPRYR