CuGenDBv2

MC09g1281 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1281
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Gag-Pol polyprotein/retrotransposon
Genome location: MC09:18903586..18911534
RNA-Seq Expression: MC09g1281
Synteny: MC09g1281
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
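
The genome location above uses a `seqid:start..end` notation. The following is a minimal sketch of how such a string could be parsed to get the span of the gene model. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("MC09:18903586..18911534")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 7,949 bp, which is the genomic extent of the gene model including any introns, not the length of the spliced transcript.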


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.34e-105; % identity: 76.13
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN L P+TI R SPPFR   P N+FV++NP IR C  NA I A DPL SE+GFSNHE EGSMEKNE+ Q+H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAP PAKMGYVAAAGRFLKIMATV AGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

XP_022139172.1 uncharacterized protein LOC111010142 [Momordica charantia]; e value: 1.28e-126; % identity: 88.64
Query:  MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD
        MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD
Subjt:  MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD

Query:  KLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI
        KLRR                         FYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI
Subjt:  KLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI

Query:  VGFCFGLALLLFIAVTLLSA
        VGFCFGLALLLFIAVTLLSA
Subjt:  VGFCFGLALLLFIAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]; e value: 2.23e-103; % identity: 75.23
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN L P+TI   SPPFR   P N+FV++NP IR C +NA I A DPL SE+GFSNHE EGSMEKNE+ Q+H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAP PAKMGYVAAAGRFLKIMATV AGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]; e value: 9.99e-107; % identity: 76.13
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN+L P+TI R SPPFR   P N+FV++NP IR C +NA I A DPL SESGFSNHE EGSMEKNE+ ++H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAPAPAKMGYVAAAGRFLKIMAT+ AGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]; e value: 3.32e-105; % identity: 75.68
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN L P+TI R SPPFR   P N+FV++NP IR C +NA I A DPL SE+GFSNHE EGSMEKNE+ ++H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAP PAKMGYVAAAGRFLKIMATV AGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X5; e value: 6.15e-94; % identity: 77
Query:  SNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVY
        S    P    R SPP   +      V++NP +RLC SNA I A DPL SE  FSNHE EGSMEKNE+ Q+H  KSN VLDKLRRYG+SGILSYGLLNT Y
Subjt:  SNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVY

Query:  YLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA
        YLTTF++VWFYIAPAPAKMGYVAAAGRFLKIMATV AGSQVTKLARAAGALA+APFVDRGLSWFTV YNFESQGKAFMAIVGFC GLALLLFI VTLLSA
Subjt:  YLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA

A0A6J1CD92 uncharacterized protein LOC111010142; e value: 6.18e-127; % identity: 88.64
Query:  MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD
        MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD
Subjt:  MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLD

Query:  KLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI
        KLRR                         FYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI
Subjt:  KLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAI

Query:  VGFCFGLALLLFIAVTLLSA
        VGFCFGLALLLFIAVTLLSA
Subjt:  VGFCFGLALLLFIAVTLLSA

A0A6J1GY06 uncharacterized protein LOC111458238; e value: 1.08e-103; % identity: 75.23
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN L P+TI   SPPFR   P N+FV++NP IR C +NA I A DPL SE+GFSNHE EGSMEKNE+ Q+H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAP PAKMGYVAAAGRFLKIMATV AGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X2; e value: 4.84e-107; % identity: 76.13
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN+L P+TI R SPPFR   P N+FV++NP IR C +NA I A DPL SESGFSNHE EGSMEKNE+ ++H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAPAPAKMGYVAAAGRFLKIMAT+ AGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCFGLALLLFIAVTLLSA
        AIVGFC GL+LLLFIAVTLLSA
Subjt:  AIVGFCFGLALLLFIAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X1; e value: 1.45e-93; % identity: 75.13
Query:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV
        M +P+KL NG     V+ P  GSN+L P+TI R SPPFR   P N+FV++NP IR C +NA I A DPL SESGFSNHE EGSMEKNE+ ++H  KS  V
Subjt:  MSSPVKLVNGGNFRWVILP--GSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAV

Query:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGK
        LDKLRRYG+SGILSYGLLNTVYYLTTF++VWFYIAPAPAKMGYVAAAGRFLKIMAT+ AGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  LDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G38695.1 unknown protein; e value: 3.2e-47; % identity: 69.93
Query:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFV
        EG M +KN  S+++   S  +L KL+RYG+SGILSYGLLNTVYY T F++VWFY+APAP KMGY+AAA RFLK+MA V AGSQVTKL R  GA+A+AP V
Subjt:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFV

Query:  DRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA
        DRGLSWFTVK NFESQGKAF A+VG C G+AL+LFI VTLL A
Subjt:  DRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA

AT2G38695.2 unknown protein; e value: 2.0e-25; % identity: 66.3
Query:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAG
        EG M +KN  S+++   S  +L KL+RYG+SGILSYGLLNTVYY T F++VWFY+APAP KMGY+AAA RFLK+MA V AGSQVTKL R  G
Subjt:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAG

AT2G38695.3 unknown protein; e value: 2.2e-40; % identity: 52.36
Query:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAG--------
        EG M +KN  S+++   S  +L KL+RYG+SGILSYGLLNTVYY T F++VWFY+APAP KMGY+AAA RFLK+MA V AGSQVTKL R  G        
Subjt:  EGSM-EKNESSQRHLPKSNAVLDKLRRYGISGILSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAG--------

Query:  ----------------------------------------ALAVAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA
                                                A+A+AP VDRGLSWFTVK NFESQGKAF A+VG C G+AL+LFI VTLL A
Subjt:  ----------------------------------------ALAVAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA
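
In the alignments above, the middle line of each block marks identical residues with the residue letter and conservative substitutions with `+`. The following is a minimal sketch of how identities and positives could be counted from one such block. The function name `alignment_stats` is hypothetical, and the denominator used here (the length of the displayed query line, gaps included) is an assumption, so the result is not claimed to reproduce the % identity values reported on the page.

```python
from typing import Dict


def alignment_stats(query: str, midline: str) -> Dict[str, float]:
    """Count identities and positives in one BLAST-style alignment block."""
    # Trailing spaces in the midline are often lost in scraped text,
    # so pad it back to the length of the query line.
    midline = midline.ljust(len(query))
    length = len(query)
    # Letters in the midline mark identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    pct_identity = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct_identity,
    }


# First seven columns of the first GenBank alignment block.
stats = alignment_stats("MSSPVKL", "M +P+KL")
print(stats)
```

To score a whole hit, the query and midline segments of all its blocks would be concatenated before calling the function.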


Sequences
CDS sequence
ATGTCAAGTCCTGTCAAGCTCGTGAATGGCGGCAATTTCCGCTGGGTCATTCTCCCGGGTTCGAACGATTTGCTGCCGCGTACAATTTCCAGAATTTCCCCGCCATTTCG
TGCCCTAAACCCGAAGAACGTTTTCGTTAATCAGAACCCTTACATCCGACTCTGCCATAGCAATGCCGGAATCGGCGCCACTGATCCACTGAATTCTGAAAGTGGCTTTT
CCAATCATGAAATGGAAGGTTCAATGGAAAAGAATGAAAGTAGTCAAAGACATCTGCCGAAATCGAACGCGGTACTGGATAAATTGAGGAGGTATGGAATTTCCGGAATA
TTGTCTTACGGATTATTAAACACAGTCTACTATCTTACAACATTTGTCATTGTGTGGTTCTACATTGCACCAGCACCAGCGAAAATGGGTTATGTTGCTGCTGCTGGAAG
ATTTCTAAAAATAATGGCTACAGTCTGTGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTGGCTGTAGCGCCATTCGTTGACAGAGGGTTGTCATGGT
TCACGGTCAAATACAACTTTGAGTCTCAGGGGAAGGCATTCATGGCTATTGTTGGATTCTGCTTTGGATTGGCCCTCTTGTTATTCATTGCTGTGACTCTGCTTTCAGCA
TAA
mRNA sequence
GTCACACACACTTGGTCAGCATCTCCATCAAATGTCAAGTCCTGTCAAGCTCGTGAATGGCGGCAATTTCCGCTGGGTCATTCTCCCGGGTTCGAACGATTTGCTGCCGC
GTACAATTTCCAGAATTTCCCCGCCATTTCGTGCCCTAAACCCGAAGAACGTTTTCGTTAATCAGAACCCTTACATCCGACTCTGCCATAGCAATGCCGGAATCGGCGCC
ACTGATCCACTGAATTCTGAAAGTGGCTTTTCCAATCATGAAATGGAAGGTTCAATGGAAAAGAATGAAAGTAGTCAAAGACATCTGCCGAAATCGAACGCGGTACTGGA
TAAATTGAGGAGGTATGGAATTTCCGGAATATTGTCTTACGGATTATTAAACACAGTCTACTATCTTACAACATTTGTCATTGTGTGGTTCTACATTGCACCAGCACCAG
CGAAAATGGGTTATGTTGCTGCTGCTGGAAGATTTCTAAAAATAATGGCTACAGTCTGTGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTGGCTGTA
GCGCCATTCGTTGACAGAGGGTTGTCATGGTTCACGGTCAAATACAACTTTGAGTCTCAGGGGAAGGCATTCATGGCTATTGTTGGATTCTGCTTTGGATTGGCCCTCTT
GTTATTCATTGCTGTGACTCTGCTTTCAGCATAAGACAAGGTTTTTCCCTTGGGAAGGTTTTTCGAAAATTTTGTTGTTTTCGAGAACGGGCATGGCGAAGGAGGAAGGA
GGGGAGAGAGTGATGGTGATCCAAGATGGGAGCAGAAAGTTCAGTTGGAATGCAGCAATGGGAATTGGATGGATGCTGAGGAGCTCTCAACTTAAGAAGGGAGATGAAAT
AAAACTCATAGTTGTTCTTCATCAGGTTAATAATCCTTGTATGTATACTTTGATGGAAGCTCCAATGTGTAAGATTTCATTCACCTTTTTTTTTTATTTTTAATTTCTTT
GGGTCGGTTAGCGAGATTCCTTAACCGCTGAGTTGTGTTCAAATTTGATACAAAAAGGGCTGATTTATAGATATATCACTGATTGACACCGGTTATATTAGTAAATTTAC
TTGTATGTAATTCCTGTGTTTACTAATTCTCTAAACGTTTTCTGACTATATGTGAGCATTGTTGTGGTCAGACGAGACTATTTTGATAAGTTTTGGTTTTTTTGAACCTT
GTTAGAAATTTTTATTTGGACT
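
Because the mRNA sequence contains the CDS plus flanking untranslated regions, the UTR lengths can be derived by locating the CDS inside the mRNA. The following is a minimal sketch. The function name `utr_lengths` is hypothetical, and it assumes the CDS occurs as one contiguous, exact substring of the spliced mRNA.

```python
from typing import Tuple


def utr_lengths(mrna: str, cds: str) -> Tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found inside an mRNA."""
    # Sequences on the page are wrapped over several lines; join them first.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    five_prime = start
    three_prime = len(mrna) - start - len(cds)
    return five_prime, three_prime


# Toy example, not taken from this record: 2 nt leader, 9 nt CDS, 2 nt trailer.
print(utr_lengths("GGATGAAA\nTAACC", "ATGAAATAA"))
```

Applied to this record, the inputs would be the full mRNA and CDS sequences listed on this page, pasted in with their line breaks.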
Protein sequence
MSSPVKLVNGGNFRWVILPGSNDLLPRTISRISPPFRALNPKNVFVNQNPYIRLCHSNAGIGATDPLNSESGFSNHEMEGSMEKNESSQRHLPKSNAVLDKLRRYGISGI
LSYGLLNTVYYLTTFVIVWFYIAPAPAKMGYVAAAGRFLKIMATVCAGSQVTKLARAAGALAVAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCFGLALLLFIAVTLLSA
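
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and translation in frame from the first base. The names `CODON_TABLE` and `translate` are hypothetical, and the example input is only the first 39 nt of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt of the CDS sequence of MC09g1281.
prefix = "ATGTCAAGTCCTGTCAAGCTCGTGAATGGCGGCAATTTC"
print(translate(prefix))  # MSSPVKLVNGGNF
```

The output matches the first 13 residues of the protein sequence. Passing the complete CDS to `translate` and comparing the result with the full protein sequence would extend the check to the whole record.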