CuGenDBv2

Moc06g16540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g16540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr6:13015440..13021961
RNA-Seq Expression: Moc06g16540
Synteny: Moc06g16540
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
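
The genome location field above can be split into its parts for downstream use. The following is a minimal sketch, not part of the database: the function names `parse_locus` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

# Assumed layout: <sequence name>:<start>..<end>, e.g. "chr6:13015440..13021961".
LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text):
    """Split a location string into (sequence name, start, end)."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_locus("chr6:13015440..13021961")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 6522 bp on chr6.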


Homology
GenBank top hits (e value, %identity, alignment)
XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia] (e value: 2.2e-70; %identity: 60.43)
Query:  MFRKTIFSHLLDVDLVFNGPLLGT---------------------KVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR
        MFRKT F HLLDVDLVFNG L+                       ++SF R +F +ISGLKY R+PVR+ T P R  TLYFN+ TDL+LS+ EKMYT+ R
Subjt:  MFRKTIFSHLLDVDLVFNGPLLGT---------------------KVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR

Query:  FEDDIDAVKVL-VYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGH
        FEDD D VKVL VY V + LLGRER  KFDH LLGIVDDWE CCN++WA LSF+KTI SLQRG    SK+G LRKSYSLYGFPW FQVWAY+ ISSLS  
Subjt:  FEDDIDAVKVL-VYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGH

Query:  IVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSS
        +   V  D VP I +WRY+HSTA+H+L R+IF S+
Subjt:  IVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSS

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia] (e value: 1.5e-58; %identity: 61.46)
Query:  MFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR
        MFRKTIF HLLDVDLVFNGPL+                     G +VSFGRREFD+ISGL Y RSPVRK+T+  +  TLYFN+ T+ +LS+  K+Y +  
Subjt:  MFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR

Query:  FEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYE
        F+DD D +KV ++Y VELVLLGRE + KFD ILLG+VDDWE CCNHD A LSFDKTI SL RG +N +K+ GLRKSYSLYGFPW FQVW YE
Subjt:  FEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYE

XP_022157998.1 uncharacterized protein LOC111024595 [Momordica charantia] (e value: 6.4e-78; %identity: 75.36)
Query:  LTPKQLAMFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELE
        +TPKQLAMFRKT+F HLLDVDLVFNGPL+                      TKVSFGRREFDIISGLKY RSPVRKITYPQR  TLYFNNSTDLLLSELE
Subjt:  LTPKQLAMFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELE

Query:  KMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEI
        KMYTSI FEDD DAVKV +VYFVELVLL          +LLGIVDDWEACCNHDWALLSF+KTIYSLQRGAS KSKEGGLRKSYSL+GFPW FQVWAYE 
Subjt:  KMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEI

Query:  ISSLSGHIVTI
        ISSLSG + TI
Subjt:  ISSLSGHIVTI

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia] (e value: 6.3e-142; %identity: 100)
Query:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN
        MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN
Subjt:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN

Query:  STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW
        STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW
Subjt:  STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW

Query:  AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS
        AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS
Subjt:  AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS

XP_022159253.1 uncharacterized protein LOC111025666 [Momordica charantia] (e value: 3.3e-66; %identity: 60.36)
Query:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLL----------------------GTKVSFGRREFDIISGLK
        M+ KI P++ ASA L  LSH+AKT+  IK+KLTP QL+MFRKT+F HLLD+DLVFN  L+                      G+KV F RREFDIISGLK
Subjt:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLL----------------------GTKVSFGRREFDIISGLK

Query:  YSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ
        Y RSPVRK T P R   LYFN+S D+LLS+ EK+Y   RFEDD DA K+ +VY +ELVLLGRER+ K+D+ LLGIVDD E CCNHDW ++SFDKTIYSL+
Subjt:  YSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ

Query:  RGASNKSKEGGLRKSYSLYGFP
        RG + +SK+GG RK YSLYGFP
Subjt:  RGASNKSKEGGLRKSYSLYGFP

TrEMBL top hits (e value, %identity, alignment)
A0A6J1DP34 uncharacterized protein LOC111021802 (e value: 1.1e-70; %identity: 60.43)
Query:  MFRKTIFSHLLDVDLVFNGPLLGT---------------------KVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR
        MFRKT F HLLDVDLVFNG L+                       ++SF R +F +ISGLKY R+PVR+ T P R  TLYFN+ TDL+LS+ EKMYT+ R
Subjt:  MFRKTIFSHLLDVDLVFNGPLLGT---------------------KVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR

Query:  FEDDIDAVKVL-VYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGH
        FEDD D VKVL VY V + LLGRER  KFDH LLGIVDDWE CCN++WA LSF+KTI SLQRG    SK+G LRKSYSLYGFPW FQVWAY+ ISSLS  
Subjt:  FEDDIDAVKVL-VYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGH

Query:  IVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSS
        +   V  D VP I +WRY+HSTA+H+L R+IF S+
Subjt:  IVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSS

A0A6J1DQC8 uncharacterized protein LOC111023353 (e value: 7.2e-59; %identity: 61.46)
Query:  MFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR
        MFRKTIF HLLDVDLVFNGPL+                     G +VSFGRREFD+ISGL Y RSPVRK+T+  +  TLYFN+ T+ +LS+  K+Y +  
Subjt:  MFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIR

Query:  FEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYE
        F+DD D +KV ++Y VELVLLGRE + KFD ILLG+VDDWE CCNHD A LSFDKTI SL RG +N +K+ GLRKSYSLYGFPW FQVW YE
Subjt:  FEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYE

A0A6J1DUW1 uncharacterized protein LOC111024595 (e value: 3.1e-78; %identity: 75.36)
Query:  LTPKQLAMFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELE
        +TPKQLAMFRKT+F HLLDVDLVFNGPL+                      TKVSFGRREFDIISGLKY RSPVRKITYPQR  TLYFNNSTDLLLSELE
Subjt:  LTPKQLAMFRKTIFSHLLDVDLVFNGPLL---------------------GTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELE

Query:  KMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEI
        KMYTSI FEDD DAVKV +VYFVELVLL          +LLGIVDDWEACCNHDWALLSF+KTIYSLQRGAS KSKEGGLRKSYSL+GFPW FQVWAYE 
Subjt:  KMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEI

Query:  ISSLSGHIVTI
        ISSLSG + TI
Subjt:  ISSLSGHIVTI

A0A6J1DYB1 uncharacterized protein LOC111025666 (e value: 1.6e-66; %identity: 60.36)
Query:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLL----------------------GTKVSFGRREFDIISGLK
        M+ KI P++ ASA L  LSH+AKT+  IK+KLTP QL+MFRKT+F HLLD+DLVFN  L+                      G+KV F RREFDIISGLK
Subjt:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLL----------------------GTKVSFGRREFDIISGLK

Query:  YSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ
        Y RSPVRK T P R   LYFN+S D+LLS+ EK+Y   RFEDD DA K+ +VY +ELVLLGRER+ K+D+ LLGIVDD E CCNHDW ++SFDKTIYSL+
Subjt:  YSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRFEDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ

Query:  RGASNKSKEGGLRKSYSLYGFP
        RG + +SK+GG RK YSLYGFP
Subjt:  RGASNKSKEGGLRKSYSLYGFP

A0A6J1E0A9 uncharacterized protein LOC111025209 (e value: 3.1e-142; %identity: 100)
Query:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN
        MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN
Subjt:  MIPKIDPATYASAKLNCLSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNN

Query:  STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW
        STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW
Subjt:  STDLLLSELEKMYTSIRFEDDIDAVKVLVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPW

Query:  AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS
        AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS
Subjt:  AFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHSTAYHMLAREIFRSSTVS

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGCCCAATGGGCTTAAAGAACTTATGATTATGGCCCAACTTATTGAGGACAAAAATGGGGCACAGATTACAGAGGCCCACTCAAACCAAACAACTTCAGGAGGAAA
AGGAGGCCCGGGCAGTGGAATAGGGGTGGGTAAAACAACTGGGTCATCTTTTGGATCGGCCCGAACTGTTCCTTTTAGTTCCAATCGCCCACAATCGACGCGTAACAACG
AGTTGGCGAATAAAGGGGGGAACACGTCATCAGGGGTTGGGCAATTCCGCCGACTTTCGGATACGGAACTCAAGGCACGACGAGAGAAAGGCCTTTGCTACCGCTGTGAT
GAGAAATTTGCACCGGGACACCGCCGCAAGAAAAAAGAATTGCAAATCCTCGTGGTTCAAGAAATCGAAACAGCCCTGGAAGATTTTCACGATGCAGTAGATTCCATTGG
AGAAGAAATCGAAGCTGTGGTGCGGGAATGCCATCTGCCGGTAAGTGAAACCAAAGGCTATGGAATCGTGTTGGGGACAGGAATGGATGTGACCCATTCGGGTGTGTGTA
AAGCGATAGAGTTGGTGCTTCCGGGCATAATCAATAAACAACAGGTTATGTTGCAGGGGGTACCAAGCTTACAAAAGACGCAAGTTTCGTTGAAGGCTATGCTACGAGCC
ATACAACGGGAGAAACACGAAGTTATTGTGGAATTACAAGCCATAGGGGCTTGGGATTCTCAATTGGAGCTGGCTAAGACAACAATGACTCTGCGGGAGGTACAGCCGGA
GGGAGGCACGGCAGCCGACGCCACATGGGAACCCACTGCCACTATTCAAGCTCAATTTCAGGATTTTCACCTTGAGGACAAGGTGTCTTTTTGGCGGGTGGGTAATGATG
GACCTCCCATCACAAAGGTATATGTGAGGAAGAAGGGGAGGGTGAGAGAGTGTTGTAAAATGATTCCTAAAATCGATCCTGCAACCTACGCCTCTGCCAAGCTAAATTGT
CTATCGCATATAGCAAAAACAAGCAATGATATCAAGACAAAACTGACCCCAAAACAGCTTGCTATGTTTAGGAAAACTATCTTCAGCCACCTACTTGATGTGGACCTCGT
ATTTAACGGGCCCCTTTTAGGGACTAAGGTGTCGTTCGGTCGGAGAGAGTTTGACATTATTAGCGGTCTAAAGTATTCTAGGAGTCCAGTTAGGAAAATCACGTATCCTC
AAAGACATGGAACCCTATACTTCAATAATAGCACGGACCTGTTGTTGAGTGAGTTGGAGAAGATGTATACATCCATTCGGTTCGAGGATGACATTGATGCAGTTAAGGTG
TTAGTGTACTTCGTAGAGTTAGTACTGTTGGGGAGAGAGAGAAGCACGAAGTTTGACCATATTTTGTTGGGAATAGTGGATGATTGGGAAGCCTGCTGCAACCATGATTG
GGCATTGTTGTCGTTCGATAAGACAATCTATAGTCTGCAGCGTGGCGCATCTAACAAGTCGAAGGAGGGAGGATTGAGGAAATCATATAGTCTCTATGGTTTTCCATGGG
CGTTCCAGGTGTGGGCGTACGAGATTATATCTTCTCTTTCGGGGCATATTGTGACTATAGTAAGTCAGGACGTCGTGCCACGCATTCTCCAGTGGAGGTATGAACATTCA
ACTGCATATCACATGCTGGCCAGAGAGATTTTTCGATCTTCTACTGTAAGTTGA
mRNA sequence
ATGAAGCCCAATGGGCTTAAAGAACTTATGATTATGGCCCAACTTATTGAGGACAAAAATGGGGCACAGATTACAGAGGCCCACTCAAACCAAACAACTTCAGGAGGAAA
AGGAGGCCCGGGCAGTGGAATAGGGGTGGGTAAAACAACTGGGTCATCTTTTGGATCGGCCCGAACTGTTCCTTTTAGTTCCAATCGCCCACAATCGACGCGTAACAACG
AGTTGGCGAATAAAGGGGGGAACACGTCATCAGGGGTTGGGCAATTCCGCCGACTTTCGGATACGGAACTCAAGGCACGACGAGAGAAAGGCCTTTGCTACCGCTGTGAT
GAGAAATTTGCACCGGGACACCGCCGCAAGAAAAAAGAATTGCAAATCCTCGTGGTTCAAGAAATCGAAACAGCCCTGGAAGATTTTCACGATGCAGTAGATTCCATTGG
AGAAGAAATCGAAGCTGTGGTGCGGGAATGCCATCTGCCGGTAAGTGAAACCAAAGGCTATGGAATCGTGTTGGGGACAGGAATGGATGTGACCCATTCGGGTGTGTGTA
AAGCGATAGAGTTGGTGCTTCCGGGCATAATCAATAAACAACAGGTTATGTTGCAGGGGGTACCAAGCTTACAAAAGACGCAAGTTTCGTTGAAGGCTATGCTACGAGCC
ATACAACGGGAGAAACACGAAGTTATTGTGGAATTACAAGCCATAGGGGCTTGGGATTCTCAATTGGAGCTGGCTAAGACAACAATGACTCTGCGGGAGGTACAGCCGGA
GGGAGGCACGGCAGCCGACGCCACATGGGAACCCACTGCCACTATTCAAGCTCAATTTCAGGATTTTCACCTTGAGGACAAGGTGTCTTTTTGGCGGGTGGGTAATGATG
GACCTCCCATCACAAAGGTATATGTGAGGAAGAAGGGGAGGGTGAGAGAGTGTTGTAAAATGATTCCTAAAATCGATCCTGCAACCTACGCCTCTGCCAAGCTAAATTGT
CTATCGCATATAGCAAAAACAAGCAATGATATCAAGACAAAACTGACCCCAAAACAGCTTGCTATGTTTAGGAAAACTATCTTCAGCCACCTACTTGATGTGGACCTCGT
ATTTAACGGGCCCCTTTTAGGGACTAAGGTGTCGTTCGGTCGGAGAGAGTTTGACATTATTAGCGGTCTAAAGTATTCTAGGAGTCCAGTTAGGAAAATCACGTATCCTC
AAAGACATGGAACCCTATACTTCAATAATAGCACGGACCTGTTGTTGAGTGAGTTGGAGAAGATGTATACATCCATTCGGTTCGAGGATGACATTGATGCAGTTAAGGTG
TTAGTGTACTTCGTAGAGTTAGTACTGTTGGGGAGAGAGAGAAGCACGAAGTTTGACCATATTTTGTTGGGAATAGTGGATGATTGGGAAGCCTGCTGCAACCATGATTG
GGCATTGTTGTCGTTCGATAAGACAATCTATAGTCTGCAGCGTGGCGCATCTAACAAGTCGAAGGAGGGAGGATTGAGGAAATCATATAGTCTCTATGGTTTTCCATGGG
CGTTCCAGGTGTGGGCGTACGAGATTATATCTTCTCTTTCGGGGCATATTGTGACTATAGTAAGTCAGGACGTCGTGCCACGCATTCTCCAGTGGAGGTATGAACATTCA
ACTGCATATCACATGCTGGCCAGAGAGATTTTTCGATCTTCTACTGTAAGTTGA
Protein sequence
MKPNGLKELMIMAQLIEDKNGAQITEAHSNQTTSGGKGGPGSGIGVGKTTGSSFGSARTVPFSSNRPQSTRNNELANKGGNTSSGVGQFRRLSDTELKARREKGLCYRCD
EKFAPGHRRKKKELQILVVQEIETALEDFHDAVDSIGEEIEAVVRECHLPVSETKGYGIVLGTGMDVTHSGVCKAIELVLPGIINKQQVMLQGVPSLQKTQVSLKAMLRA
IQREKHEVIVELQAIGAWDSQLELAKTTMTLREVQPEGGTAADATWEPTATIQAQFQDFHLEDKVSFWRVGNDGPPITKVYVRKKGRVRECCKMIPKIDPATYASAKLNC
LSHIAKTSNDIKTKLTPKQLAMFRKTIFSHLLDVDLVFNGPLLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRFEDDIDAVKV
LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQRGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHS
TAYHMLAREIFRSSTVS
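
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is illustrative only: it assumes the standard genetic code and a CDS that starts in frame, and the helper name `translate` is hypothetical. The example input is the final line of the CDS sequence, which begins on a codon boundary because the fifteen preceding lines each hold 110 bases.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS sequence in this record.
tail = "ACTGCATATCACATGCTGGCCAGAGAGATTTTTCGATCTTCTACTGTAAGTTGA"
print(translate(tail))
```

The translated tail matches the last line of the protein sequence, and the same function can be applied to the full CDS after joining its lines.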