; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005002 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005002
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGlutamic acid-rich protein
Genome locationChr08:22068079..22068882
RNA-Seq ExpressionHG10005002
SyntenyHG10005002
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060961.1 glutamic acid-rich protein [Cucumis melo var. makuwa]1.9e-9676.3Show/hide
Query:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG+T  PS+PSNYV+IL LRERWLKENE   KEKEDLKQL+ +E+QQKV RTEP++KDDQKPVV+KPR NASSWSRNG    + YYRAVP SNP+NV
Subjt:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        NETCKK E++ EP+L VP IERD+KKEK    KG KK  R QK +MDKTSPTPEENLKEE+ E ECEV VQNRKKN+RARKIDEKKGTGSQ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVS EIKRG NNGV RG SSQGNRN R HRNLDR GPRKQ+DV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

XP_008444535.1 PREDICTED: glutamic acid-rich protein [Cucumis melo]7.9e-9575.56Show/hide
Query:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG+T  PS+PSNYV+IL LRERWLKENE   KEKEDLKQL+ +E+QQKV RTEP++KDDQKPVV+KPR NASSWSRNG    + YYRAVP SNP+NV
Subjt:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        NETCKK E++ EP+L VP IERD+KKEK    KG KK    QK +MDKTSPTPEENLKEE+ E ECEV VQN KKN+RARKIDEKKGTGSQ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVS EIKRG NNGV RG SSQGNRN R HRNLDR GPRKQ+DV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

XP_011649562.1 uncharacterized protein DDB_G0286299 [Cucumis sativus]8.8e-9475.09Show/hide
Query:  MQGWG--QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG  Q  SQPSNYV+IL LRERWLKENE   KEKEDLKQL+ +E++QKV RTEP +KDDQKPVV+KPR NASSWSRNGS   +RYYRA    NPRNV
Subjt:  MQGWG--QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        N+ CKK  ++ EP+LA+P+IERD+KKEK    KG KKN R++K +MDKTSPTPEENLKE  RE ECEV VQN KKN+RARKIDEKK TGS+ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVSFEIKRGRNNGV RGSSQGNRN RD RNLDR GPRKQRDV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

XP_022131328.1 myb-like protein X [Momordica charantia]8.3e-6850Show/hide
Query:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQL-DQQEQQQKVLRTEPKEK-----DDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASN
        M+  G  PS PSNYV+IL LRERWLKENE NQKEKE+ +Q   +Q Q+QKV R++P+EK     DD + V+QKPR   +SWS   SG F+  YR VPASN
Subjt:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQL-DQQEQQQKVLRTEPKEK-----DDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASN

Query:  PRNVNETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETE-------------------------------
        PRNVNE CKKTE +G   +  P+I RDE KEK +KRKG KKNP+ + +++D+TS TPEEN KEEMRETE                               
Subjt:  PRNVNETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETE-------------------------------

Query:  --------------------------------------------CEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHKMKEMENKLSLISVSFEIKR
                                                    CE+ VQN KKN+RA ++DEKKG G QC PE+    H+MKEME KLS ISV FEIKR
Subjt:  --------------------------------------------CEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHKMKEMENKLSLISVSFEIKR

Query:  GRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
         RNN VYRGSSQG +N R H+NL RWGP KQR+VGMIWVRKDELS+ A
Subjt:  GRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

XP_038884414.1 gelsolin-related protein of 125 kDa-like [Benincasa hispida]2.4e-11584.27Show/hide
Query:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNVNE
        M+ WGQ PSQPSN+VSILHLRERWLKENE NQKEKEDLKQLDQQEQQQKV +TEP+EKDDQKPVV+K R NASSWSRNGS  FQRYYRAVP SNPRNVNE
Subjt:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNVNE

Query:  TCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHK
        TCKKTE++GEP+LAVP++ERDEK EK SKRKG KKNPR QK+E+DKTSPTPE+NLKEE +  ECE++VQN KKN+RARKIDEK  T SQCGPEKTGYTH+
Subjt:  TCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHK

Query:  MKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        MKEMENKLSLISVSFEIKRGRNNGVYRG SQGNRN RD RNLDRWGPRKQRD+GMIWVRKDELSKVA
Subjt:  MKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

TrEMBL top hitse value%identityAlignment
A0A0A0LNR9 Uncharacterized protein4.2e-9475.09Show/hide
Query:  MQGWG--QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG  Q  SQPSNYV+IL LRERWLKENE   KEKEDLKQL+ +E++QKV RTEP +KDDQKPVV+KPR NASSWSRNGS   +RYYRA    NPRNV
Subjt:  MQGWG--QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        N+ CKK  ++ EP+LA+P+IERD+KKEK    KG KKN R++K +MDKTSPTPEENLKE  RE ECEV VQN KKN+RARKIDEKK TGS+ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVSFEIKRGRNNGV RGSSQGNRN RD RNLDR GPRKQRDV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

A0A1S3BBD2 glutamic acid-rich protein3.8e-9575.56Show/hide
Query:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG+T  PS+PSNYV+IL LRERWLKENE   KEKEDLKQL+ +E+QQKV RTEP++KDDQKPVV+KPR NASSWSRNG    + YYRAVP SNP+NV
Subjt:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        NETCKK E++ EP+L VP IERD+KKEK    KG KK    QK +MDKTSPTPEENLKEE+ E ECEV VQN KKN+RARKIDEKKGTGSQ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVS EIKRG NNGV RG SSQGNRN R HRNLDR GPRKQ+DV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

A0A5A7V598 Glutamic acid-rich protein9.1e-9776.3Show/hide
Query:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV
        M+ WG+T  PS+PSNYV+IL LRERWLKENE   KEKEDLKQL+ +E+QQKV RTEP++KDDQKPVV+KPR NASSWSRNG    + YYRAVP SNP+NV
Subjt:  MQGWGQT--PSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNV

Query:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT
        NETCKK E++ EP+L VP IERD+KKEK    KG KK  R QK +MDKTSPTPEENLKEE+ E ECEV VQNRKKN+RARKIDEKKGTGSQ GPEKTGY 
Subjt:  NETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYT

Query:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
        H+MKEMENKLSLISVS EIKRG NNGV RG SSQGNRN R HRNLDR GPRKQ+DV MIWVRKDELSK A
Subjt:  HKMKEMENKLSLISVSFEIKRGRNNGVYRG-SSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

A0A6J1BT17 myb-like protein X4.0e-6850Show/hide
Query:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQL-DQQEQQQKVLRTEPKEK-----DDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASN
        M+  G  PS PSNYV+IL LRERWLKENE NQKEKE+ +Q   +Q Q+QKV R++P+EK     DD + V+QKPR   +SWS   SG F+  YR VPASN
Subjt:  MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQL-DQQEQQQKVLRTEPKEK-----DDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASN

Query:  PRNVNETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETE-------------------------------
        PRNVNE CKKTE +G   +  P+I RDE KEK +KRKG KKNP+ + +++D+TS TPEEN KEEMRETE                               
Subjt:  PRNVNETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETE-------------------------------

Query:  --------------------------------------------CEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHKMKEMENKLSLISVSFEIKR
                                                    CE+ VQN KKN+RA ++DEKKG G QC PE+    H+MKEME KLS ISV FEIKR
Subjt:  --------------------------------------------CEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHKMKEMENKLSLISVSFEIKR

Query:  GRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA
         RNN VYRGSSQG +N R H+NL RWGP KQR+VGMIWVRKDELS+ A
Subjt:  GRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA

A0A6J1HDF8 uncharacterized protein LOC111462917 isoform X21.9e-6563.02Show/hide
Query:  MQGWG-QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNVN
        M+GWG +  SQPSNY S             P QKEKE        EQQQKV RTEP EKDDQKP VQKPR  A+S SR+GS  FQ YYRA PASNPRNV 
Subjt:  MQGWG-QTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNVN

Query:  ETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTH
        ET KKTE+ GEP L     E+DE KEK S+RKG KKN R  K+EM +TSPTPEENLKEEMRE EC+      +KN+  RK+DEK+GT  QC PE  GYTH
Subjt:  ETCKKTENKGEPQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTH

Query:  KMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELS
        +MKEMENKLS ISVS EI+R + NGVYRG SQGN      RNL RW PRKQR+VGMIW RKDELS
Subjt:  KMKEMENKLSLISVSFEIKRGRNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGGTGGGGACAGACTCCATCCCAGCCCTCCAACTATGTCAGTATTCTTCATCTTCGAGAACGGTGGCTCAAAGAAAATGAGCCAAACCAGAAAGAGAAGGAAGA
TTTGAAACAGCTTGACCAACAAGAACAGCAACAGAAGGTTCTGCGGACGGAACCTAAGGAGAAGGATGATCAGAAGCCGGTGGTTCAAAAACCTCGAAGCAATGCTTCTT
CTTGGAGCAGAAATGGGAGTGGGAATTTTCAGAGGTATTATCGAGCTGTACCGGCGAGTAATCCGCGCAATGTGAACGAGACGTGCAAGAAGACTGAGAATAAAGGCGAG
CCTCAGCTGGCTGTTCCTCAAATCGAGCGAGATGAGAAGAAGGAAAAGAACTCTAAAAGGAAGGGTCCCAAGAAGAACCCTAGATTACAGAAGGATGAGATGGATAAAAC
GTCACCAACACCTGAGGAGAACTTGAAGGAGGAAATGCGAGAGACGGAATGCGAGGTTAAGGTGCAGAACCGGAAGAAAAATATGAGGGCAAGAAAAATCGATGAAAAGA
AGGGGACTGGGAGTCAATGTGGACCTGAAAAAACTGGCTACACGCATAAGATGAAGGAGATGGAAAACAAATTAAGTCTTATTTCAGTGAGTTTTGAAATTAAAAGAGGG
AGGAATAATGGTGTTTACAGAGGCAGTAGTCAAGGCAATCGGAATTCCAGAGATCATCGGAATCTTGATCGTTGGGGTCCCCGTAAACAGAGGGATGTTGGAATGATTTG
GGTGAGGAAGGACGAGCTTTCAAAAGTTGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGGTGGGGACAGACTCCATCCCAGCCCTCCAACTATGTCAGTATTCTTCATCTTCGAGAACGGTGGCTCAAAGAAAATGAGCCAAACCAGAAAGAGAAGGAAGA
TTTGAAACAGCTTGACCAACAAGAACAGCAACAGAAGGTTCTGCGGACGGAACCTAAGGAGAAGGATGATCAGAAGCCGGTGGTTCAAAAACCTCGAAGCAATGCTTCTT
CTTGGAGCAGAAATGGGAGTGGGAATTTTCAGAGGTATTATCGAGCTGTACCGGCGAGTAATCCGCGCAATGTGAACGAGACGTGCAAGAAGACTGAGAATAAAGGCGAG
CCTCAGCTGGCTGTTCCTCAAATCGAGCGAGATGAGAAGAAGGAAAAGAACTCTAAAAGGAAGGGTCCCAAGAAGAACCCTAGATTACAGAAGGATGAGATGGATAAAAC
GTCACCAACACCTGAGGAGAACTTGAAGGAGGAAATGCGAGAGACGGAATGCGAGGTTAAGGTGCAGAACCGGAAGAAAAATATGAGGGCAAGAAAAATCGATGAAAAGA
AGGGGACTGGGAGTCAATGTGGACCTGAAAAAACTGGCTACACGCATAAGATGAAGGAGATGGAAAACAAATTAAGTCTTATTTCAGTGAGTTTTGAAATTAAAAGAGGG
AGGAATAATGGTGTTTACAGAGGCAGTAGTCAAGGCAATCGGAATTCCAGAGATCATCGGAATCTTGATCGTTGGGGTCCCCGTAAACAGAGGGATGTTGGAATGATTTG
GGTGAGGAAGGACGAGCTTTCAAAAGTTGCATAG
Protein sequenceShow/hide protein sequence
MQGWGQTPSQPSNYVSILHLRERWLKENEPNQKEKEDLKQLDQQEQQQKVLRTEPKEKDDQKPVVQKPRSNASSWSRNGSGNFQRYYRAVPASNPRNVNETCKKTENKGE
PQLAVPQIERDEKKEKNSKRKGPKKNPRLQKDEMDKTSPTPEENLKEEMRETECEVKVQNRKKNMRARKIDEKKGTGSQCGPEKTGYTHKMKEMENKLSLISVSFEIKRG
RNNGVYRGSSQGNRNSRDHRNLDRWGPRKQRDVGMIWVRKDELSKVA