CuGenDBv2

Moc09g30440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g30440
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111022160
Genome location: chr9:22933903..22936470
RNA-Seq Expression: Moc09g30440
Synteny: Moc09g30440
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]; e value: 6.1e-44; identity: 95.88%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDI TSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAP
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia]; e value: 5.9e-47; identity: 69.62%
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQP
        IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIVTSMQKE+VTMNQ +KE+AL  K   + P   
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQP

Query:  VQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEVRVVLIKGRASKTNSPM
        VQ +Y +P   MGTI T T I+T T + G      H EVKEV V   KGR S  +SPM
Subjt:  VQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEVRVVLIKGRASKTNSPM

XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]; e value: 5.3e-40; identity: 34.55%
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATP---
        IEHF+R  D PTKMMLN  ANG FT KT+NEI+ IL+ L  HN LWCS+RSR  PK  D AGV  LD ++SMQ ++ T+ Q +K M      P  T    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATP---

Query:  -IQPV---------------QSDYCTPAP---------DMGTIGTLTHIRTPTTQVGGTTLI-----SHGE--------------------------VKE
         + PV                S+ C   P              G + + +  T    G  ++      H +                          +KE
Subjt:  -IQPV---------------QSDYCTPAP---------DMGTIGTLTHIRTPTTQVGGTTLI-----SHGE--------------------------VKE

Query:  VRV--------------VLIKGRASKTNSPMFHLHS----------NTSRHRNSSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQI
          +                I+   S+ ++ M +L +          N  R    S TE PK EG+E CK +T RSGLAY+ P MP               
Subjt:  VRV--------------VLIKGRASKTNSPMFHLHS----------NTSRHRNSSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQI

Query:  PENPTTPEKENIRKGNEDTPSVPPQMPNYAKVLKYIVSRKKKIGEHELVAMTKCSSEAVGSLLPMKCNDPGSFTIPCSIGGKNLG
         E  + P KE       D P +  +MP YAK LK I++RKKK+GE+E VA+T+CSS    S +  K  DPGSFTIPCSIGGK++G
Subjt:  PENPTTPEKENIRKGNEDTPSVPPQMPNYAKVLKYIVSRKKKIGEHELVAMTKCSSEAVGSLLPMKCNDPGSFTIPCSIGGKNLG

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]; e value: 9.8e-111; identity: 49.82%
Query:  MSTRSFLLPLDPEIERTLQKTKKEQRLRKQLEKQKEREGEISPESEVENTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------
        MSTRSFLLPLDPEIERTL+KT+KEQRLRKQLE QKEREGEISPESEVE+TSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRD         
Subjt:  MSTRSFLLPLDPEIERTLQKTKKEQRLRKQLEKQKEREGEISPESEVENTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP--------------------DMGTIG----------
        CSQRSRAAPKKQDPAGVLALDI TSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT AP                      G+ G          
Subjt:  CSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP--------------------DMGTIG----------

Query:  ------TLTHI---------RTPTTQVGGTTLISHGEVKE----VRVVLIKGRASKTN--SPMFHLHSNTSRHRNSS---HTELPKREGKEQCKAVTLRS
              T  HI         RT T  +          +KE       V+    AS  N  + + HL +        S   HTELP+REGKEQCKAVTLRS
Subjt:  ------TLTHI---------RTPTTQVGGTTLISHGEVKE----VRVVLIKGRASKTN--SPMFHLHSNTSRHRNSS---HTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPTVQIPENPTTPEKENIRKGNEDTPSVPPQ
        GL YDGPTMPTTDVQIPST+PTV+IPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  GLAYDGPTMPTTDVQIPSTEPTVQIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]; e value: 2.1e-68; identity: 52.78%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDI +SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAP   
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---

Query:  ---------------------------------------------------------DMGTIG----------------TLTHI---------RTPTTQV
                                                                   G+ G                T  +I         RT T  V
Subjt:  ---------------------------------------------------------DMGTIG----------------TLTHI---------RTPTTQV

Query:  GGTTLISHGEVKEVRV---VLIKGRASKTNSPMFHLHSNTSRHRN------SSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQIPE
                  +KE       +I+ +A+   +    L    +  +N        HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTV+IPE
Subjt:  GGTTLISHGEVKEVRV---VLIKGRASKTNSPMFHLHSNTSRHRN------SSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQM
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQM

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR45 uncharacterized protein LOC111013464; e value: 2.9e-44; identity: 95.88%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDI TSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAP
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160; e value: 2.8e-47; identity: 69.62%
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQP
        IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIVTSMQKE+VTMNQ +KE+AL  K   + P   
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQP

Query:  VQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEVRVVLIKGRASKTNSPM
        VQ +Y +P   MGTI T T I+T T + G      H EVKEV V   KGR S  +SPM
Subjt:  VQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEVRVVLIKGRASKTNSPM

A0A6J1DW02 uncharacterized protein LOC111024897; e value: 4.8e-111; identity: 49.82%
Query:  MSTRSFLLPLDPEIERTLQKTKKEQRLRKQLEKQKEREGEISPESEVENTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------
        MSTRSFLLPLDPEIERTL+KT+KEQRLRKQLE QKEREGEISPESEVE+TSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRD         
Subjt:  MSTRSFLLPLDPEIERTLQKTKKEQRLRKQLEKQKEREGEISPESEVENTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP--------------------DMGTIG----------
        CSQRSRAAPKKQDPAGVLALDI TSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT AP                      G+ G          
Subjt:  CSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP--------------------DMGTIG----------

Query:  ------TLTHI---------RTPTTQVGGTTLISHGEVKE----VRVVLIKGRASKTN--SPMFHLHSNTSRHRNSS---HTELPKREGKEQCKAVTLRS
              T  HI         RT T  +          +KE       V+    AS  N  + + HL +        S   HTELP+REGKEQCKAVTLRS
Subjt:  ------TLTHI---------RTPTTQVGGTTLISHGEVKE----VRVVLIKGRASKTN--SPMFHLHSNTSRHRNSS---HTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPTVQIPENPTTPEKENIRKGNEDTPSVPPQ
        GL YDGPTMPTTDVQIPST+PTV+IPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  GLAYDGPTMPTTDVQIPSTEPTVQIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DYG0 uncharacterized protein LOC111025764; e value: 1.0e-68; identity: 52.78%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDI +SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAP   
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---

Query:  ---------------------------------------------------------DMGTIG----------------TLTHI---------RTPTTQV
                                                                   G+ G                T  +I         RT T  V
Subjt:  ---------------------------------------------------------DMGTIG----------------TLTHI---------RTPTTQV

Query:  GGTTLISHGEVKEVRV---VLIKGRASKTNSPMFHLHSNTSRHRN------SSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQIPE
                  +KE       +I+ +A+   +    L    +  +N        HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTV+IPE
Subjt:  GGTTLISHGEVKEVRV---VLIKGRASKTNSPMFHLHSNTSRHRN------SSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQM
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQM

A0A6J1DZC3 uncharacterized protein LOC111024449; e value: 2.6e-40; identity: 34.55%
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATP---
        IEHF+R  D PTKMMLN  ANG FT KT+NEI+ IL+ L  HN LWCS+RSR  PK  D AGV  LD ++SMQ ++ T+ Q +K M      P  T    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATP---

Query:  -IQPV---------------QSDYCTPAP---------DMGTIGTLTHIRTPTTQVGGTTLI-----SHGE--------------------------VKE
         + PV                S+ C   P              G + + +  T    G  ++      H +                          +KE
Subjt:  -IQPV---------------QSDYCTPAP---------DMGTIGTLTHIRTPTTQVGGTTLI-----SHGE--------------------------VKE

Query:  VRV--------------VLIKGRASKTNSPMFHLHS----------NTSRHRNSSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQI
          +                I+   S+ ++ M +L +          N  R    S TE PK EG+E CK +T RSGLAY+ P MP               
Subjt:  VRV--------------VLIKGRASKTNSPMFHLHS----------NTSRHRNSSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQI

Query:  PENPTTPEKENIRKGNEDTPSVPPQMPNYAKVLKYIVSRKKKIGEHELVAMTKCSSEAVGSLLPMKCNDPGSFTIPCSIGGKNLG
         E  + P KE       D P +  +MP YAK LK I++RKKK+GE+E VA+T+CSS    S +  K  DPGSFTIPCSIGGK++G
Subjt:  PENPTTPEKENIRKGNEDTPSVPPQMPNYAKVLKYIVSRKKKIGEHELVAMTKCSSEAVGSLLPMKCNDPGSFTIPCSIGGKNLG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCAAAAAACTAAAAAAGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAATACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAATAGAGACATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCT
GCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAA
GAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGTGACCTCCATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATC
CATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGA
GGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGTGGTTTTAATCAAGGGCAGAGCCAGCAAAACAAACAGCCCTATGTTCCACCTACACAGCAATACATC
CCGCCACCGCAACAGCAGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCAA
CAACAGATGTACAGATTCCGTCCACTGAACCAACTGTACAGATACCAGAGAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTT
CCTCCACAGATGCCAAATTATGCCAAGGTTTTGAAATATATAGTTTCTAGGAAGAAAAAGATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGT
AGGCAGCCTGCTACCCATGAAGTGTAACGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAACTAACT
TGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAATGTGACTAA
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCAAAAAACTAAAAAAGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAATACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAATAGAGACATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCT
GCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAA
GAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGTGACCTCCATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATC
CATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGA
GGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGTGGTTTTAATCAAGGGCAGAGCCAGCAAAACAAACAGCCCTATGTTCCACCTACACAGCAATACATC
CCGCCACCGCAACAGCAGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCAA
CAACAGATGTACAGATTCCGTCCACTGAACCAACTGTACAGATACCAGAGAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTT
CCTCCACAGATGCCAAATTATGCCAAGGTTTTGAAATATATAGTTTCTAGGAAGAAAAAGATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGT
AGGCAGCCTGCTACCCATGAAGTGTAACGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAACTAACT
TGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAATGTGACTAA
Protein sequence
MSTRSFLLPLDPEIERTLQKTKKEQRLRKQLEKQKEREGEISPESEVENTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDIEHFFRGLDHPTKMMLNNA
ANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIVTSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQVG
GTTLISHGEVKEVRVVLIKGRASKTNSPMFHLHSNTSRHRNSSHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVQIPENPTTPEKENIRKGNEDTPSV
PPQMPNYAKVLKYIVSRKKKIGEHELVAMTKCSSEAVGSLLPMKCNDPGSFTIPCSIGGKNLGDFEECSAITNLNPVMFDEFYDLLVTEIEEELDKIAEGPECD