; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C031851 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C031851
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr06:8117077..8121538
RNA-Seq ExpressionMELO3C031851
SyntenyMELO3C031851
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005622 - intracellular (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046575.1 uncharacterized protein E6C27_scaffold114G001050 [Cucumis melo var. makuwa]7.0e-10387.78Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-------NHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLI
        I  DLDADEDTP+NKGVEGTPCQLSIGSINNIVAVATIVEDNIGC N        HST KDVVY SNYTDV GIIKLLNRH +NNM+DVDMI IPMN LI
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-------NHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLI

Query:  FGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN
        FGSDKF YLAREDLLHYC MVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN
Subjt:  FGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN

Query:  CVYVLDSLRSKVNEDIHGVIN
         VYVLDSLRSKVNEDIHG+IN
Subjt:  CVYVLDSLRSKVNEDIHGVIN

KAA0055185.1 uncharacterized protein E6C27_scaffold80G00170 [Cucumis melo var. makuwa]1.3e-11776.61Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-----------------------------------------------NHSTRK
        I  DLDAD+DTP+NKGVEGT CQLSIGSINNIVAVATIVEDNIGC N                                                HSTRK
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-----------------------------------------------NHSTRK

Query:  DVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDC
        DVVY SNYTDV GIIKLLNRH V NMKDVDMI IPMN LIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD 
Subjt:  DVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDC

Query:  DLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM
        DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLR+NCVYVLDSLRSKVNEDIHG+IN VEDMASET+STTL INSKMETCKVPSSIGFCR+
Subjt:  DLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM

KAA0059922.1 40S ribosomal protein S13-like [Cucumis melo var. makuwa]7.0e-11176.57Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN------------------------------------------------NHSTR
        I  DLD DEDTPSNKGVE TPCQLSIGSINNIVAVATIVEDNIGC N                                                  STR
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN------------------------------------------------NHSTR

Query:  KDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKD
        KDVVYSSNYTDV GIIKLLNRH VNNMKDVDMI IPMN LIFGSDKFVYLAREDLLHYC M+EIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD
Subjt:  KDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKD

Query:  CDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK
         DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVIN VEDMASETRSTT  INSKMETCK
Subjt:  CDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK

KAA0067182.1 transposase [Cucumis melo var. makuwa]5.5e-11686.61Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIG------CFNNHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIF
        I  DLDADEDTP NKGVEGTPCQLSIGSINN VAVATIVEDNI           HSTRKDVVY SNY DV GIIKLLNRH +NNM+DVDMI IPMN LIF
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIG------CFNNHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIF

Query:  GSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC
        GSDKFVYLAREDLLH  DMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR +NLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC
Subjt:  GSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC

Query:  VYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM
        VYVLDSLRSKVNEDIHG+IN VEDMASETRSTTL INSKM+TCKVPSSIGFCR+
Subjt:  VYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM

TYJ96572.1 uncharacterized protein E5676_scaffold791G00020 [Cucumis melo var. makuwa]5.3e-11178.06Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN----------------------------------------NHSTRKDVVYSSN
        IT DLDADEDTP+NKGVEGTPCQLSI SINNIVAVATIVEDNIG  N                                         HSTRKDVVY SN
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN----------------------------------------NHSTRKDVVYSSN

Query:  YTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNL
        YTDV GIIKLLNRHVVNNMKDVDMI IPMN LIFGSDKFVY+AREDLLHYCDMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR RNL
Subjt:  YTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNL

Query:  ANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK
        AN LEAVNLEQKVLI YNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHG+IN VEDM SETRSTTL IN KMETCK
Subjt:  ANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK

TrEMBL top hitse value%identityAlignment
A0A5A7TYY0 ULP_PROTEASE domain-containing protein3.4e-10387.78Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-------NHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLI
        I  DLDADEDTP+NKGVEGTPCQLSIGSINNIVAVATIVEDNIGC N        HST KDVVY SNYTDV GIIKLLNRH +NNM+DVDMI IPMN LI
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-------NHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLI

Query:  FGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN
        FGSDKF YLAREDLLHYC MVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN
Subjt:  FGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLREN

Query:  CVYVLDSLRSKVNEDIHGVIN
         VYVLDSLRSKVNEDIHG+IN
Subjt:  CVYVLDSLRSKVNEDIHGVIN

A0A5A7UNG9 ULP_PROTEASE domain-containing protein6.4e-11876.61Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-----------------------------------------------NHSTRK
        I  DLDAD+DTP+NKGVEGT CQLSIGSINNIVAVATIVEDNIGC N                                                HSTRK
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN-----------------------------------------------NHSTRK

Query:  DVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDC
        DVVY SNYTDV GIIKLLNRH V NMKDVDMI IPMN LIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD 
Subjt:  DVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDC

Query:  DLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM
        DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLR+NCVYVLDSLRSKVNEDIHG+IN VEDMASET+STTL INSKMETCKVPSSIGFCR+
Subjt:  DLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM

A0A5A7UVK8 40S ribosomal protein S13-like3.4e-11176.57Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN------------------------------------------------NHSTR
        I  DLD DEDTPSNKGVE TPCQLSIGSINNIVAVATIVEDNIGC N                                                  STR
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN------------------------------------------------NHSTR

Query:  KDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKD
        KDVVYSSNYTDV GIIKLLNRH VNNMKDVDMI IPMN LIFGSDKFVYLAREDLLHYC M+EIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD
Subjt:  KDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKD

Query:  CDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK
         DLR RNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVIN VEDMASETRSTT  INSKMETCK
Subjt:  CDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK

A0A5A7VFM4 Transposase2.7e-11686.61Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIG------CFNNHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIF
        I  DLDADEDTP NKGVEGTPCQLSIGSINN VAVATIVEDNI           HSTRKDVVY SNY DV GIIKLLNRH +NNM+DVDMI IPMN LIF
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIG------CFNNHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIF

Query:  GSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC
        GSDKFVYLAREDLLH  DMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR +NLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC
Subjt:  GSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENC

Query:  VYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM
        VYVLDSLRSKVNEDIHG+IN VEDMASETRSTTL INSKM+TCKVPSSIGFCR+
Subjt:  VYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRM

A0A5D3B9N1 Uncharacterized protein2.6e-11178.06Show/hide
Query:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN----------------------------------------NHSTRKDVVYSSN
        IT DLDADEDTP+NKGVEGTPCQLSI SINNIVAVATIVEDNIG  N                                         HSTRKDVVY SN
Subjt:  ITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFN----------------------------------------NHSTRKDVVYSSN

Query:  YTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNL
        YTDV GIIKLLNRHVVNNMKDVDMI IPMN LIFGSDKFVY+AREDLLHYCDMVEIGYMCILAYITCLWDKCDCA NFFVIDQSKISSHIKD DLR RNL
Subjt:  YTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDKFVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNL

Query:  ANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK
        AN LEAVNLEQKVLI YNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHG+IN VEDM SETRSTTL IN KMETCK
Subjt:  ANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNEDIHGVINRVEDMASETRSTTLLINSKMETCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCGAGATGCAAAAGTGACAAGAAGAGAAGTAATCACTCGAGATCTTGATGCCGATGAGGACACACCTAGCAACAAAGGAGTGGAGGGAACACCATGCCAATTGTC
TATAGGATCCATTAATAATATTGTTGCAGTAGCCACGATAGTTGAGGATAACATTGGATGTTTCAATAATCATTCAACTAGGAAGGATGTCGTATACTCTTCAAATTATA
CTGACGTTAAGGGGATTATTAAGCTTTTAAATAGACATGTCGTGAACAACATGAAGGATGTAGACATGATTTGCATACCAATGAACGTGCTAATATTTGGAAGTGACAAA
TTTGTTTATTTAGCGCGTGAAGATTTGTTGCATTACTGCGACATGGTTGAAATCGGCTATATGTGTATACTAGCGTATATTACATGTCTTTGGGATAAATGTGACTGTGC
AGGGAATTTTTTTGTCATTGACCAATCAAAAATATCGTCTCATATCAAAGATTGTGATCTTCGATTCAGAAATTTAGCCAACCAGCTAGAAGCAGTTAACTTGGAACAGA
AAGTGCTAATTCCATATAATACCGGTTTTCATTGGATGTTGCATGTTATCGATCTTCGTGAAAATTGCGTTTATGTTTTGGACTCTCTTCGGAGTAAAGTCAATGAAGAC
ATTCATGGAGTCATAAATAGGGTTGAAGACATGGCAAGCGAAACACGATCTACAACGCTATTGATCAACTCCAAAATGGAGACCTGTAAAGTGCCCTCGTCAATTGGATT
CTGTAGGATGACGAACATTTCCGGACGTAATTCGAGTCAAATAATGTCGAAAGGAGAGAAGGACCTCGAGGAGAGGATAAAGGAGCGTCAGAAACAAAGCCTACGGCATA
AAATCATTCATCAGAGTTCCAGTTCGTATGAGAAGGATAGGTTGCTAGGTAGACTAGCTCTGACCTCCATCTCATCAATCAGTCCCTGTAGCATGAGCAGTAGATGCTGT
CTAACGTCAAGTAGTCTTTGCCTAAACTTTTGGACACTTCCTTCTTCTTTAAGAGCTAATTATCCCAATGTAGTAGGTTTAAGGAAAAGGCTCAAAGAATTTGAGACGCT
GCTCCATCTTCTGCGTAGTGGGCTTGGGGAGAAAGAAAGCCAATCCTCCAAAAGGGAGGCAAGAAACCTACTAAAAGGAATTGAACCTAGCAAACAAGCCCTTGTTCATT
AA
mRNA sequenceShow/hide mRNA sequence
ATGACTCGAGATGCAAAAGTGACAAGAAGAGAAGTAATCACTCGAGATCTTGATGCCGATGAGGACACACCTAGCAACAAAGGAGTGGAGGGAACACCATGCCAATTGTC
TATAGGATCCATTAATAATATTGTTGCAGTAGCCACGATAGTTGAGGATAACATTGGATGTTTCAATAATCATTCAACTAGGAAGGATGTCGTATACTCTTCAAATTATA
CTGACGTTAAGGGGATTATTAAGCTTTTAAATAGACATGTCGTGAACAACATGAAGGATGTAGACATGATTTGCATACCAATGAACGTGCTAATATTTGGAAGTGACAAA
TTTGTTTATTTAGCGCGTGAAGATTTGTTGCATTACTGCGACATGGTTGAAATCGGCTATATGTGTATACTAGCGTATATTACATGTCTTTGGGATAAATGTGACTGTGC
AGGGAATTTTTTTGTCATTGACCAATCAAAAATATCGTCTCATATCAAAGATTGTGATCTTCGATTCAGAAATTTAGCCAACCAGCTAGAAGCAGTTAACTTGGAACAGA
AAGTGCTAATTCCATATAATACCGGTTTTCATTGGATGTTGCATGTTATCGATCTTCGTGAAAATTGCGTTTATGTTTTGGACTCTCTTCGGAGTAAAGTCAATGAAGAC
ATTCATGGAGTCATAAATAGGGTTGAAGACATGGCAAGCGAAACACGATCTACAACGCTATTGATCAACTCCAAAATGGAGACCTGTAAAGTGCCCTCGTCAATTGGATT
CTGTAGGATGACGAACATTTCCGGACGTAATTCGAGTCAAATAATGTCGAAAGGAGAGAAGGACCTCGAGGAGAGGATAAAGGAGCGTCAGAAACAAAGCCTACGGCATA
AAATCATTCATCAGAGTTCCAGTTCGTATGAGAAGGATAGGTTGCTAGGTAGACTAGCTCTGACCTCCATCTCATCAATCAGTCCCTGTAGCATGAGCAGTAGATGCTGT
CTAACGTCAAGTAGTCTTTGCCTAAACTTTTGGACACTTCCTTCTTCTTTAAGAGCTAATTATCCCAATGTAGTAGGTTTAAGGAAAAGGCTCAAAGAATTTGAGACGCT
GCTCCATCTTCTGCGTAGTGGGCTTGGGGAGAAAGAAAGCCAATCCTCCAAAAGGGAGGCAAGAAACCTACTAAAAGGAATTGAACCTAGCAAACAAGCCCTTGTTCATT
AA
Protein sequenceShow/hide protein sequence
MTRDAKVTRREVITRDLDADEDTPSNKGVEGTPCQLSIGSINNIVAVATIVEDNIGCFNNHSTRKDVVYSSNYTDVKGIIKLLNRHVVNNMKDVDMICIPMNVLIFGSDK
FVYLAREDLLHYCDMVEIGYMCILAYITCLWDKCDCAGNFFVIDQSKISSHIKDCDLRFRNLANQLEAVNLEQKVLIPYNTGFHWMLHVIDLRENCVYVLDSLRSKVNED
IHGVINRVEDMASETRSTTLLINSKMETCKVPSSIGFCRMTNISGRNSSQIMSKGEKDLEERIKERQKQSLRHKIIHQSSSSYEKDRLLGRLALTSISSISPCSMSSRCC
LTSSSLCLNFWTLPSSLRANYPNVVGLRKRLKEFETLLHLLRSGLGEKESQSSKREARNLLKGIEPSKQALVH