CuGenDBv2

Moc06g16320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g16320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:12859244..12862427
RNA-Seq Expression: Moc06g16320
Synteny: Moc06g16320
Gene Ontology terms: NA
InterPro domains: NA
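
The genome location above follows a `chromosome:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span of the locus could be computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("chr6:12859244..12862427"))
print(span_length("chr6:12859244..12862427"))
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the result depends on the assumption stated above.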


Homology
GenBank top hits | e value | % identity | Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 1.0e-32 | 50
Query:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG
        MFEY  RLP+HP VQE                               R+ EE +LL  D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV 
Subjt:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG

Query:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        KWFY SGEWLAK++S + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia] | 9.8e-44 | 42.19
Query:  ELSQSTFDI--LKEVQRKSP-KKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRAS
        E S   FD+  L+EVQRKSP  KSK  KRKT SSDDV   VR D           DP+AR+GAT DI MRFK+EPSSA +KE++ + S  C DR  ++AS
Subjt:  ELSQSTFDI--LKEVQRKSP-KKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRAS

Query:  KFVSALGSVIQRLLDYSAEVHAAACRAAVMVKVEVDGHDLLIARER-----LLETATA-QAELKEAQAASQAWKSTSEA----DKAELKS----------
        KFV    S I++++DY+ +VHA +C AA+++K ++D  DL++  ER      LE AT  + ELKEA+  ++  KS  EA    D+ E++           
Subjt:  KFVSALGSVIQRLLDYSAEVHAAACRAAVMVKVEVDGHDLLIARER-----LLETATA-QAELKEAQAASQAWKSTSEA----DKAELKS----------

Query:  ------------------------------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGVEDIASEFDLEPIKLRYTEKWA
                                       E++  V + K +  N VLLEEAF+ HLDFD FV DFSD  F+FLMKG+ ++A + DLEP+K  YT+KWA
Subjt:  ------------------------------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGVEDIASEFDLEPIKLRYTEKWA

Query:  S
        S
Subjt:  S

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 1.0e-32 | 50
Query:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG
        MFEY  RLP+HP VQE                               R+ EE +LL  D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV 
Subjt:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG

Query:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        KWFY SGEWLAK++S + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 1.2e-36 | 42.56
Query:  ELANTLDSELEEEKDNFRFSEDEGDDSDTSTS-------------------------------------------EGFVTLYLKMFEYSFRLPVHPLVQE
        +LA  L+S+L EE +N R S D+G+DSD STS                                           EG+VTLY KMFEY  RLP+HP VQE
Subjt:  ELANTLDSELEEEKDNFRFSEDEGDDSDTSTS-------------------------------------------EGFVTLYLKMFEYSFRLPVHPLVQE

Query:  ------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSD
                                       R+ EE +L   D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV KWFY SGEWLAK++S 
Subjt:  ------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSD

Query:  QPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  QPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 1.6e-46 | 33.06
Query:  LSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILK-------------------------------
        + ARKG GGI+KGP+SIK WVGKWF+ SGEWLAK++S + F+ VP RF NLV+I+ +PEL+Q+TFD LK                               
Subjt:  LSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILK-------------------------------

Query:  -----------------------------------------------------------------------------------------------EVQRK
                                                                                                       EV+ +
Subjt:  -----------------------------------------------------------------------------------------------EVQRK

Query:  SPKKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRASKFVSALGSVIQRLLDYSAE
        SP + ++KK+KT+SS   +EA       +  A    DPEARM  T ++ MRF +EPSS+ VK+++  +S +CLDR  RRASKFVS  GSV+QR +D  AE
Subjt:  SPKKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRASKFVSALGSVIQRLLDYSAE

Query:  VHAAACRAAVMVKVEVDGHDLLIARER-----LLETAT--------AQAEL----KEAQAASQAWKSTSEADKAELKS----------------------
           A+   AVMVK E+DG + L A+ER      LE AT        AQ E+     E  A     K   E  KA L++                      
Subjt:  VHAAACRAAVMVKVEVDGHDLLIARER-----LLETAT--------AQAEL----KEAQAASQAWKSTSEADKAELKS----------------------

Query:  -----------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGV-EDIAS-EFDLEPIKLRYTEKWAS
                   G +   +   K +  N  LLEE+FR+H DFDGF KDFSDAGF+FLMKG+  D+   + DL  +K +Y+EKWAS
Subjt:  -----------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGV-EDIAS-EFDLEPIKLRYTEKWAS

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CR42 uncharacterized protein LOC111013826 | 4.9e-33 | 50
Query:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG
        MFEY  RLP+HP VQE                               R+ EE +LL  D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV 
Subjt:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG

Query:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        KWFY SGEWLAK++S + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

A0A6J1DBX9 uncharacterized protein LOC111018913 | 4.7e-44 | 42.19
Query:  ELSQSTFDI--LKEVQRKSP-KKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRAS
        E S   FD+  L+EVQRKSP  KSK  KRKT SSDDV   VR D           DP+AR+GAT DI MRFK+EPSSA +KE++ + S  C DR  ++AS
Subjt:  ELSQSTFDI--LKEVQRKSP-KKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRAS

Query:  KFVSALGSVIQRLLDYSAEVHAAACRAAVMVKVEVDGHDLLIARER-----LLETATA-QAELKEAQAASQAWKSTSEA----DKAELKS----------
        KFV    S I++++DY+ +VHA +C AA+++K ++D  DL++  ER      LE AT  + ELKEA+  ++  KS  EA    D+ E++           
Subjt:  KFVSALGSVIQRLLDYSAEVHAAACRAAVMVKVEVDGHDLLIARER-----LLETATA-QAELKEAQAASQAWKSTSEA----DKAELKS----------

Query:  ------------------------------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGVEDIASEFDLEPIKLRYTEKWA
                                       E++  V + K +  N VLLEEAF+ HLDFD FV DFSD  F+FLMKG+ ++A + DLEP+K  YT+KWA
Subjt:  ------------------------------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGVEDIASEFDLEPIKLRYTEKWA

Query:  S
        S
Subjt:  S

A0A6J1DWD2 uncharacterized protein LOC111024680 | 4.9e-33 | 50
Query:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG
        MFEY  RLP+HP VQE                               R+ EE +LL  D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV 
Subjt:  MFEYSFRLPVHPLVQE------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVG

Query:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        KWFY SGEWLAK++S + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  KWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

A0A6J1DXS5 uncharacterized protein LOC111025502 | 5.6e-37 | 42.56
Query:  ELANTLDSELEEEKDNFRFSEDEGDDSDTSTS-------------------------------------------EGFVTLYLKMFEYSFRLPVHPLVQE
        +LA  L+S+L EE +N R S D+G+DSD STS                                           EG+VTLY KMFEY  RLP+HP VQE
Subjt:  ELANTLDSELEEEKDNFRFSEDEGDDSDTSTS-------------------------------------------EGFVTLYLKMFEYSFRLPVHPLVQE

Query:  ------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSD
                                       R+ EE +L   D  LACF+ KRI++K GR+Y+ ARKGAGGI+KGP+SIK WV KWFY SGEWLAK++S 
Subjt:  ------------------------------CREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSD

Query:  QPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK
        + F+ VP RF NLV+IRPVPEL+Q++FD LK  + + P+  K
Subjt:  QPFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSK

A0A6J1DZB3 uncharacterized protein LOC111025665 | 7.8e-47 | 33.06
Query:  LSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILK-------------------------------
        + ARKG GGI+KGP+SIK WVGKWF+ SGEWLAK++S + F+ VP RF NLV+I+ +PEL+Q+TFD LK                               
Subjt:  LSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSDQPFYLVPCRFENLVAIRPVPELSQSTFDILK-------------------------------

Query:  -----------------------------------------------------------------------------------------------EVQRK
                                                                                                       EV+ +
Subjt:  -----------------------------------------------------------------------------------------------EVQRK

Query:  SPKKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRASKFVSALGSVIQRLLDYSAE
        SP + ++KK+KT+SS   +EA       +  A    DPEARM  T ++ MRF +EPSS+ VK+++  +S +CLDR  RRASKFVS  GSV+QR +D  AE
Subjt:  SPKKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDRCWRRASKFVSALGSVIQRLLDYSAE

Query:  VHAAACRAAVMVKVEVDGHDLLIARER-----LLETAT--------AQAEL----KEAQAASQAWKSTSEADKAELKS----------------------
           A+   AVMVK E+DG + L A+ER      LE AT        AQ E+     E  A     K   E  KA L++                      
Subjt:  VHAAACRAAVMVKVEVDGHDLLIARER-----LLETAT--------AQAEL----KEAQAASQAWKSTSEADKAELKS----------------------

Query:  -----------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGV-EDIAS-EFDLEPIKLRYTEKWAS
                   G +   +   K +  N  LLEE+FR+H DFDGF KDFSDAGF+FLMKG+  D+   + DL  +K +Y+EKWAS
Subjt:  -----------GEVECRVGVRKVQARNEVLLEEAFRKHLDFDGFVKDFSDAGFRFLMKGV-EDIAS-EFDLEPIKLRYTEKWAS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCACAAGAGGGCAAAATCTCCGACGCTCAAGTCAATAAGCAGCTCGGACTCAATATTAAGTTCGAGGAGAAATTCCATTCCTGATGGTGAAATGGGGCCCTCTATTTA
TAGGTTTATACCTCGACAGGGGTTACAACAGGCGGTTACCCCGACCAAAGTTGAGATCGAACCTGTCGGGCTAAACAGATCGAACCCAGGTCCGTTTTTCACGCGCTCTA
TAATCTTGGAGTCCGACTTGAGCAACTCGGAACATACTCTAGCCGACTCTGTAGAAGAATTAGCCAATACGTTAGACTCCGAATTGGAGGAAGAAAAAGACAACTTTAGG
TTTTCCGAGGATGAAGGGGATGATAGTGACACCTCCACCTCGGAAGGGTTTGTAACTCTGTACTTGAAGATGTTCGAGTACAGCTTTCGCCTGCCCGTTCATCCTCTTGT
GCAAGAGTGTCGCGAGGTAGAGGAACTAGACCTCCTTAGGACCGACCATCAGTTAGCTTGCTTCAAAGTTAAGCGCATTTCCAGGAAGCTTGGGCGATACTACCTGTCCG
CTAGGAAAGGTGCAGGAGGCATCCTCAAGGGTCCGAGCTCCATAAAGAAATGGGTTGGAAAGTGGTTCTATGTCTCTGGTGAATGGTTGGCCAAGAACAAGTCCGACCAA
CCCTTCTATCTCGTCCCTTGTAGGTTTGAGAACTTAGTTGCCATAAGGCCTGTCCCTGAGCTTTCTCAGTCAACTTTTGATATCCTGAAGGAGGTTCAGAGAAAGTCTCC
TAAGAAGTCAAAAAAGAAGAAGAGGAAAACTACCTCTTCAGACGACGTGGCGGAAGCAGTCCGAGCTGACTTGAGGGGGAGCTGTTTTGCTAGGTTCACTAAAGACCCGG
AGGCGAGAATGGGCGCCACCTTGGACATTGAAATGAGGTTCAAGGTTGAACCTTCAAGTGCCAGGGTGAAGGAGGAGATGATAGAAATGTCGGGGTCTTGCCTTGACCGT
TGCTGGAGGAGGGCTTCCAAGTTTGTCAGTGCTCTGGGGTCGGTCATCCAACGACTGCTGGACTATAGTGCCGAGGTTCACGCTGCTGCCTGCCGCGCGGCAGTTATGGT
GAAGGTCGAAGTTGACGGTCACGACCTCCTCATTGCGAGGGAGCGGCTTCTCGAAACGGCTACTGCCCAAGCTGAACTCAAAGAGGCCCAAGCTGCAAGTCAAGCCTGGA
AGTCTACTTCTGAGGCCGATAAGGCAGAGCTTAAGAGTGGAGAAGTTGAATGCCGAGTTGGAGTTCGAAAAGTCCAGGCTCGAAACGAGGTGCTACTGGAGGAGGCATTT
CGCAAGCATCTAGACTTTGACGGGTTTGTCAAAGACTTCAGTGATGCAGGCTTCAGGTTCCTGATGAAGGGGGTCGAGGACATTGCTTCTGAGTTCGACCTCGAACCAAT
CAAACTGCGCTACACTGAGAAGTGGGCATCAGAACGAGGGCATCTCATCCTAGGAAGTCGACGAAGCGAGCCTTCATGCACTGGAATGGCCTCTTCTCAAGAGGCTGGGG
TGCTGGAATCCCAAGAGTCTGATATCCTAGCTTTACAAAGTGAGCTTGGTTCACACCTCGGAAGCAGTTAG
mRNA sequence
ATGCACAAGAGGGCAAAATCTCCGACGCTCAAGTCAATAAGCAGCTCGGACTCAATATTAAGTTCGAGGAGAAATTCCATTCCTGATGGTGAAATGGGGCCCTCTATTTA
TAGGTTTATACCTCGACAGGGGTTACAACAGGCGGTTACCCCGACCAAAGTTGAGATCGAACCTGTCGGGCTAAACAGATCGAACCCAGGTCCGTTTTTCACGCGCTCTA
TAATCTTGGAGTCCGACTTGAGCAACTCGGAACATACTCTAGCCGACTCTGTAGAAGAATTAGCCAATACGTTAGACTCCGAATTGGAGGAAGAAAAAGACAACTTTAGG
TTTTCCGAGGATGAAGGGGATGATAGTGACACCTCCACCTCGGAAGGGTTTGTAACTCTGTACTTGAAGATGTTCGAGTACAGCTTTCGCCTGCCCGTTCATCCTCTTGT
GCAAGAGTGTCGCGAGGTAGAGGAACTAGACCTCCTTAGGACCGACCATCAGTTAGCTTGCTTCAAAGTTAAGCGCATTTCCAGGAAGCTTGGGCGATACTACCTGTCCG
CTAGGAAAGGTGCAGGAGGCATCCTCAAGGGTCCGAGCTCCATAAAGAAATGGGTTGGAAAGTGGTTCTATGTCTCTGGTGAATGGTTGGCCAAGAACAAGTCCGACCAA
CCCTTCTATCTCGTCCCTTGTAGGTTTGAGAACTTAGTTGCCATAAGGCCTGTCCCTGAGCTTTCTCAGTCAACTTTTGATATCCTGAAGGAGGTTCAGAGAAAGTCTCC
TAAGAAGTCAAAAAAGAAGAAGAGGAAAACTACCTCTTCAGACGACGTGGCGGAAGCAGTCCGAGCTGACTTGAGGGGGAGCTGTTTTGCTAGGTTCACTAAAGACCCGG
AGGCGAGAATGGGCGCCACCTTGGACATTGAAATGAGGTTCAAGGTTGAACCTTCAAGTGCCAGGGTGAAGGAGGAGATGATAGAAATGTCGGGGTCTTGCCTTGACCGT
TGCTGGAGGAGGGCTTCCAAGTTTGTCAGTGCTCTGGGGTCGGTCATCCAACGACTGCTGGACTATAGTGCCGAGGTTCACGCTGCTGCCTGCCGCGCGGCAGTTATGGT
GAAGGTCGAAGTTGACGGTCACGACCTCCTCATTGCGAGGGAGCGGCTTCTCGAAACGGCTACTGCCCAAGCTGAACTCAAAGAGGCCCAAGCTGCAAGTCAAGCCTGGA
AGTCTACTTCTGAGGCCGATAAGGCAGAGCTTAAGAGTGGAGAAGTTGAATGCCGAGTTGGAGTTCGAAAAGTCCAGGCTCGAAACGAGGTGCTACTGGAGGAGGCATTT
CGCAAGCATCTAGACTTTGACGGGTTTGTCAAAGACTTCAGTGATGCAGGCTTCAGGTTCCTGATGAAGGGGGTCGAGGACATTGCTTCTGAGTTCGACCTCGAACCAAT
CAAACTGCGCTACACTGAGAAGTGGGCATCAGAACGAGGGCATCTCATCCTAGGAAGTCGACGAAGCGAGCCTTCATGCACTGGAATGGCCTCTTCTCAAGAGGCTGGGG
TGCTGGAATCCCAAGAGTCTGATATCCTAGCTTTACAAAGTGAGCTTGGTTCACACCTCGGAAGCAGTTAG
Protein sequence
MHKRAKSPTLKSISSSDSILSSRRNSIPDGEMGPSIYRFIPRQGLQQAVTPTKVEIEPVGLNRSNPGPFFTRSIILESDLSNSEHTLADSVEELANTLDSELEEEKDNFR
FSEDEGDDSDTSTSEGFVTLYLKMFEYSFRLPVHPLVQECREVEELDLLRTDHQLACFKVKRISRKLGRYYLSARKGAGGILKGPSSIKKWVGKWFYVSGEWLAKNKSDQ
PFYLVPCRFENLVAIRPVPELSQSTFDILKEVQRKSPKKSKKKKRKTTSSDDVAEAVRADLRGSCFARFTKDPEARMGATLDIEMRFKVEPSSARVKEEMIEMSGSCLDR
CWRRASKFVSALGSVIQRLLDYSAEVHAAACRAAVMVKVEVDGHDLLIARERLLETATAQAELKEAQAASQAWKSTSEADKAELKSGEVECRVGVRKVQARNEVLLEEAF
RKHLDFDGFVKDFSDAGFRFLMKGVEDIASEFDLEPIKLRYTEKWASERGHLILGSRRSEPSCTGMASSQEAGVLESQESDILALQSELGSHLGSS
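
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the page does not state which table was used). The helper `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS listed above are used here, to keep the example short.

```python
# Standard genetic code in the compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS sequence listed above.
cds_start = "ATGCACAAGAGGGCAAAATCTCCGACGCTC"
cds_end = "GGAAGCAGTTAG"

print(translate(cds_start))  # MHKRAKSPTL
print(translate(cds_end))    # GSS*
```

The two outputs match the first ten and last three residues of the protein sequence above, with the final `*` corresponding to the terminal TAG stop codon.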