CuGenDBv2

Moc06g19430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g19430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:15300014..15301900
RNA-Seq Expression: Moc06g19430
Synteny: Moc06g19430
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
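The genome location above uses a `seqid:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

# Pattern for "seqid:start..end", e.g. "chr6:15300014..15301900".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr6:15300014..15301900")
print(seqid, start, end, span_length(start, end))
# prints: chr6 15300014 15301900 1887
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, so the convention should be checked against the database documentation before relying on the number.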


Homology
GenBank top hits (e value, % identity, alignment)
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 3.6e-90; identity: 66.19%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG----
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E SRPNS L     
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG----

Query:  -------KSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEAL-------DVSPLNE
               KS      +E A    ++P       P+SE P PVIEL+S G   REKRPR+++EA+       DV PL E
Subjt:  -------KSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEAL-------DVSPLNE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] (e value: 1.9e-83; identity: 79.7%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGK
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E+SRPNSELG+
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGK

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] (e value: 3.9e-84; identity: 79.9%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGA GIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGKSS
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+HFP GRK+ TLVTDKLLLESGLLDYNP VRP+E+SRPNSELG  S
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGKSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 2.8e-106; identity: 61.6%)
Query:  SDSGEDLALRLESKLEEIENFRFSDDGDDSDTSTSGLGLEYPSKMPEHYLGPFRRGL-------------------------------------------
        S+   DLA RLESKLEEIEN R SDDG+DSD STSG GLEYPS++PEHYLG  RRG                                            
Subjt:  SDSGEDLALRLESKLEEIENFRFSDDGDDSDTSTSGLGLEYPSKMPEHYLGPFRRGL-------------------------------------------

Query:  --EFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKN
          EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAEL  V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK WV KWF+ASGEWLAK+
Subjt:  --EFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKN

Query:  ESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG-----------KSSSSVEGM
        ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E+SRPNSEL            KS      +
Subjt:  ESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG-----------KSSSSVEGM

Query:  EPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALD
        E A    ++P       P+SE P  VIEL+S G   REKRPR+++EA+D
Subjt:  EPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 7.6e-72; identity: 71.09%)
Query:  MCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIK WVGKWFFASGEWLAK+ESGR FFDVP RFGNLVSI+ IPEL QA+FDTLK+YKDHFP  RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNP

Query:  LVRPVEASRPNSEL-----------------GKSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALDVSPLNEVRGE
        LVR +EASRPNSEL                   +  +V G EP TPTV R      + PSS VPTPVIELD  G    EKR R ESEALDVSPLNEVRGE
Subjt:  LVRPVEASRPNSEL-----------------GKSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALDVSPLNEVRGE

Query:  SPLKRRRKKKK
        SPL+RRRKKKK
Subjt:  SPLKRRRKKKK
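In BLAST-style pairwise output such as the blocks above, the unlabeled middle line conventionally shows a letter where query and subject residues are identical, `+` where the substitution scores as conservative, and a blank otherwise. As a rough sketch under that assumption, the snippet below counts identities and positives for a single alignment block. The function name `midline_stats` is hypothetical, and the result covers only the block passed in, so it is not expected to reproduce the whole-alignment % identity reported for each hit.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives in one BLAST-style midline.

    Assumes: letters mark identical residues, '+' marks conservative
    substitutions, and anything else is a mismatch or gap.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "length": aligned_length,
        "percent_identity": round(100 * identities / aligned_length, 2),
    }


# Final block of the last GenBank hit: query SPLKRRRKKKK, midline SPL+RRRKKKK.
stats = midline_stats("SPL+RRRKKKK", aligned_length=len("SPLKRRRKKKK"))
print(stats)
# prints: {'identities': 10, 'positives': 11, 'length': 11, 'percent_identity': 90.91}
```

The aligned length is passed separately because trailing blanks in a midline are easily lost when text is copied, which would otherwise shorten the denominator.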

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 1.8e-90; identity: 66.19%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG----
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E SRPNS L     
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG----

Query:  -------KSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEAL-------DVSPLNE
               KS      +E A    ++P       P+SE P PVIEL+S G   REKRPR+++EA+       DV PL E
Subjt:  -------KSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEAL-------DVSPLNE

A0A6J1DWD2 uncharacterized protein LOC111024680 (e value: 9.4e-84; identity: 79.7%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGK
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E+SRPNSELG+
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGK

A0A6J1DWF1 uncharacterized protein LOC111025108 (e value: 1.9e-84; identity: 79.9%)
Query:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV
        EY  ++P H   PF +  EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAELL V+QLL CFEAKRIAKKPGR+ MCARKGA GIVKG TSIK 
Subjt:  EYPSKMPEHYLGPFRRGLEFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKV

Query:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGKSS
        WV KWF+ASGEWLAK+ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+HFP GRK+ TLVTDKLLLESGLLDYNP VRP+E+SRPNSELG  S
Subjt:  WVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGKSS

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 1.3e-106; identity: 61.6%)
Query:  SDSGEDLALRLESKLEEIENFRFSDDGDDSDTSTSGLGLEYPSKMPEHYLGPFRRGL-------------------------------------------
        S+   DLA RLESKLEEIEN R SDDG+DSD STSG GLEYPS++PEHYLG  RRG                                            
Subjt:  SDSGEDLALRLESKLEEIENFRFSDDGDDSDTSTSGLGLEYPSKMPEHYLGPFRRGL-------------------------------------------

Query:  --EFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKN
          EFL RTGL PAQVAPNGWGVIFALAILF LRARD +EAEL  V+QLL CFEAKRIAKKPGR+ MCARKGAGGIVKG TSIK WV KWF+ASGEWLAK+
Subjt:  --EFLNRTGLTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKN

Query:  ESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG-----------KSSSSVEGM
        ESGR FFDVP RFGNLVSIRP+PELTQASFDTLKYYK+ FP GRK+ TLVTD+LLLESGLLDYNP VRP+E+SRPNSEL            KS      +
Subjt:  ESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELG-----------KSSSSVEGM

Query:  EPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALD
        E A    ++P       P+SE P  VIEL+S G   REKRPR+++EA+D
Subjt:  EPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 3.7e-72; identity: 71.09%)
Query:  MCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIK WVGKWFFASGEWLAK+ESGR FFDVP RFGNLVSI+ IPEL QA+FDTLK+YKDHFP  RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKNESGRPFFDVPVRFGNLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNP

Query:  LVRPVEASRPNSEL-----------------GKSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALDVSPLNEVRGE
        LVR +EASRPNSEL                   +  +V G EP TPTV R      + PSS VPTPVIELD  G    EKR R ESEALDVSPLNEVRGE
Subjt:  LVRPVEASRPNSEL-----------------GKSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELDSVGEHFREKRPRNESEALDVSPLNEVRGE

Query:  SPLKRRRKKKK
        SPL+RRRKKKK
Subjt:  SPLKRRRKKKK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGGCGAACCCAGTCTCCTCGGCAGGTTCGAGGTGGACCTGGGATATTTAATGAGCGTCCTCCACGTGTCCCAGGTATTTTTACCCCCAAACATTGGCCCCCTC
TCTGCTAGGTCCGATCTTGATCTGGCAGAGAAGTTATCCGACTCAATTTTAGACACATGGAGACTTCTAATTCGCAGGGAAAATACGACTGTTGTGGAAGGTGTT
TCGACCTGTCAAGCTGTCGGACTACTTGAGCATTCCATCGATACGAAATTCGAGATGATCCTGGCCGCTCGTTCTTTACACGTGTACATTGTAGAGCTCGAACCC
TCTGTAGGTCAGTCACGTGTTTTAATCTTGCTTTCGAATATGGTAGTTTTCATGTCTTCTCCCTCCAGTAGTGATAGTTTAGGTAGTGTAGGTCGAACTATAAGC
AGTTCGCCCCCCAAACCTAGTGACTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCAAGCTGGAAGAGATAGAAAATTTTAGGTTTTCTGATGATGGGGATGAT
AGCGACACTTCCACCTCGGGCCTGGGTTTGGAATACCCTTCGAAGATGCCTGAACATTACCTCGGACCCTTCCGTAGAGGTTTAGAGTTCTTAAATCGAACTGGA
CTGACACCTGCTCAAGTGGCCCCCAACGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTGGGTTACGAGCTCGAGATGAGGACGAGGCCGAGCTACTCAGT
GTTGAGCAGCTTCTTGGGTGCTTCGAAGCCAAGAGAATAGCTAAGAAGCCAGGTCGGTACAATATGTGTGCAAGGAAAGGCGCGGGTGGTATAGTCAAAGGATCG
ACCTCCATCAAAGTATGGGTAGGGAAATGGTTCTTTGCCTCTGGGGAGTGGCTGGCAAAGAACGAATCAGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGG
AACCTAGTATCGATCAGACCGATTCCCGAGCTCACTCAAGCCTCCTTCGACACCCTCAAGTATTACAAGGATCACTTCCCAGGGGGCCGAAAGATCAGAACCTTG
GTGACCGACAAGCTGCTCCTAGAATCTGGGTTGTTGGACTACAATCCCTTGGTGCGTCCAGTTGAAGCTTCAAGGCCAAACTCTGAGCTTGGTAAGTCGAGCTCG
TCCGTTGAGGGTATGGAGCCTGCAACCCCTACTGTGGCTCGACCTGTGGTTCATGACAAGGCTGAACCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGAC
TCTGTTGGGGAGCACTTCAGAGAAAAGCGCCCAAGGAATGAGTCCGAAGCGCTAGACGTATCTCCCCTAAATGAGGTAAGGGGAGAGTCTCCTTTGAAGAGGAGA
CGGAAGAAGAAGAAGCTCCTCCTCAGAGGTTGGACCTCGTGGGCCCCTGCCCATGAGCCACGTCGACTTGGTGGACGACCCTGA
mRNA sequence
ATGGGCGAACCCAGTCTCCTCGGCAGGTTCGAGGTGGACCTGGGATATTTAATGAGCGTCCTCCACGTGTCCCAGGTATTTTTACCCCCAAACATTGGCCCCCTC
TCTGCTAGGTCCGATCTTGATCTGGCAGAGAAGTTATCCGACTCAATTTTAGACACATGGAGACTTCTAATTCGCAGGGAAAATACGACTGTTGTGGAAGGTGTT
TCGACCTGTCAAGCTGTCGGACTACTTGAGCATTCCATCGATACGAAATTCGAGATGATCCTGGCCGCTCGTTCTTTACACGTGTACATTGTAGAGCTCGAACCC
TCTGTAGGTCAGTCACGTGTTTTAATCTTGCTTTCGAATATGGTAGTTTTCATGTCTTCTCCCTCCAGTAGTGATAGTTTAGGTAGTGTAGGTCGAACTATAAGC
AGTTCGCCCCCCAAACCTAGTGACTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCAAGCTGGAAGAGATAGAAAATTTTAGGTTTTCTGATGATGGGGATGAT
AGCGACACTTCCACCTCGGGCCTGGGTTTGGAATACCCTTCGAAGATGCCTGAACATTACCTCGGACCCTTCCGTAGAGGTTTAGAGTTCTTAAATCGAACTGGA
CTGACACCTGCTCAAGTGGCCCCCAACGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTGGGTTACGAGCTCGAGATGAGGACGAGGCCGAGCTACTCAGT
GTTGAGCAGCTTCTTGGGTGCTTCGAAGCCAAGAGAATAGCTAAGAAGCCAGGTCGGTACAATATGTGTGCAAGGAAAGGCGCGGGTGGTATAGTCAAAGGATCG
ACCTCCATCAAAGTATGGGTAGGGAAATGGTTCTTTGCCTCTGGGGAGTGGCTGGCAAAGAACGAATCAGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGG
AACCTAGTATCGATCAGACCGATTCCCGAGCTCACTCAAGCCTCCTTCGACACCCTCAAGTATTACAAGGATCACTTCCCAGGGGGCCGAAAGATCAGAACCTTG
GTGACCGACAAGCTGCTCCTAGAATCTGGGTTGTTGGACTACAATCCCTTGGTGCGTCCAGTTGAAGCTTCAAGGCCAAACTCTGAGCTTGGTAAGTCGAGCTCG
TCCGTTGAGGGTATGGAGCCTGCAACCCCTACTGTGGCTCGACCTGTGGTTCATGACAAGGCTGAACCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGAC
TCTGTTGGGGAGCACTTCAGAGAAAAGCGCCCAAGGAATGAGTCCGAAGCGCTAGACGTATCTCCCCTAAATGAGGTAAGGGGAGAGTCTCCTTTGAAGAGGAGA
CGGAAGAAGAAGAAGCTCCTCCTCAGAGGTTGGACCTCGTGGGCCCCTGCCCATGAGCCACGTCGACTTGGTGGACGACCCTGA
Protein sequence
MGEPSLLGRFEVDLGYLMSVLHVSQVFLPPNIGPLSARSDLDLAEKLSDSILDTWRLLIRRENTTVVEGVSTCQAVGLLEHSIDTKFEMILAARSLHVYIVELEP
SVGQSRVLILLSNMVVFMSSPSSSDSLGSVGRTISSSPPKPSDSGEDLALRLESKLEEIENFRFSDDGDDSDTSTSGLGLEYPSKMPEHYLGPFRRGLEFLNRTG
LTPAQVAPNGWGVIFALAILFGLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYNMCARKGAGGIVKGSTSIKVWVGKWFFASGEWLAKNESGRPFFDVPVRFG
NLVSIRPIPELTQASFDTLKYYKDHFPGGRKIRTLVTDKLLLESGLLDYNPLVRPVEASRPNSELGKSSSSVEGMEPATPTVARPVVHDKAEPSSEVPTPVIELD
SVGEHFREKRPRNESEALDVSPLNEVRGESPLKRRRKKKKLLLRGWTSWAPAHEPRRLGGRP
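The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper name `translate` is hypothetical, and only the first 45 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGCGAACCCAGTCTCCTCGGCAGGTTCGAGGTGGACCTGGGA"
print(translate(cds_start))
# prints: MGEPSLLGRFEVDLG
```

Passing the full multi-line CDS to `translate` and comparing the result with the listed protein sequence (with line breaks removed) would be one way to confirm that the two records agree.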