; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G012870 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G012870
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationCma_Chr04:6560104..6564768
RNA-Seq ExpressionCmaCh04G012870
SyntenyCmaCh04G012870
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]4.5e-11296.38Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIFRASPPFRVSKP NIFVSRNPSIRQCL+NAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]5.0e-11195.93Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIF ASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

XP_022983929.1 uncharacterized protein LOC111482401 isoform X1 [Cucurbita maxima]1.6e-104100Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]3.9e-116100Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]1.3e-11498.19Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENHKKHPRKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X52.1e-7884.13Show/hide
Query:  PFRVSKPNN---IFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFY
        P R S P +     VSRNPS+R CL+NA+ISANDPLKSE  FSNHETEGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNT YYLTTFLVVWFY
Subjt:  PFRVSKPNN---IFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFY

Query:  IAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
        IAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKA +AIVGFCLGL+LLLFI VTLLSA
Subjt:  IAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA

A0A5A7SW01 Uncharacterized protein5.4e-7976.13Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPP-FRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEV
        M TP KLTNG IPCV FP   S ELQ K+I RASPP     +PN                    ANDPLKSE  FSNHETEGSMEKNEN +KHP+KS EV
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPP-FRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEV

Query:  LDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVV
        LDKLRRYG+SGILSYGLLNT YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKA +
Subjt:  LDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVV

Query:  AIVGFCLGLSLLLFIAVTLLSA
        AIVGFCLGL+LLLFI VTLLSA
Subjt:  AIVGFCLGLSLLLFIAVTLLSA

A0A6J1GY06 uncharacterized protein LOC1114582382.4e-11195.93Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIF ASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X21.9e-116100Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X17.5e-105100Show/hide
Query:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
        MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein2.0e-4965.84Show/hide
Query:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS
        +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGS
Subjt:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS

Query:  QVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
        QVTKL R  GA+A+AP VDRGLSWFTVK NF+SQGKA  A+VG CLG++L+LFI VTLL A
Subjt:  QVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA

AT2G38695.2 unknown protein8.7e-2962.73Show/hide
Query:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS
        +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGS
Subjt:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS

Query:  QVTKLARAAG
        QVTKL R  G
Subjt:  QVTKLARAAG

AT2G38695.3 unknown protein1.4e-4250.72Show/hide
Query:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS
        +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGS
Subjt:  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGS

Query:  QVTKLARAAG------------------------------------------------ALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLL
        QVTKL R  G                                                A+A+AP VDRGLSWFTVK NF+SQGKA  A+VG CLG++L+L
Subjt:  QVTKLARAAG------------------------------------------------ALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLL

Query:  FIAVTLLSA
        FI VTLL A
Subjt:  FIAVTLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGTGGCGGGTTCGAACGAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCG
CCTTTTCGCGTCTCAAAGCCGAATAACATTTTTGTTAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGCCAATGATCCATTGAAATCG
GAAAGTGGCTTTTCCAATCATGAAACTGAAGGTTCGATGGAAAAGAATGAAAATCATAAAAAACATCCGCGAAAATCGATTGAGGTGCTGGATAAATTGAGGAGA
TATGGAGTTTCTGGAATATTGTCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTCTACATTGCACCAGCACCTGCGAAAATG
GGCTATGTTGCAGCTGCTGGAAGATTTCTCAAGATAATGGCTACAATCTGGGCTGGAAGCCAAGTTACTAAGCTGGCAAGAGCAGCAGGAGCTCTTGCTATGGCG
CCGTTCGTCGACAGAGGATTGTCGTGGTTCACAGTCAAATACAACTTCAAGTCTCAGGGGAAGGCAGTTGTGGCGATTGTTGGATTCTGCTTAGGGTTGTCTCTC
TTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
TCTCTCTCTCTCTCTCTCTTTCAAATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGTGGCGGGTTCGAACGAATTGCAGCCG
AAGACAATTTTCAGAGCTTCCCCGCCTTTTCGCGTCTCAAAGCCGAATAACATTTTTGTTAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATT
AGCGCCAATGATCCATTGAAATCGGAAAGTGGCTTTTCCAATCATGAAACTGAAGGTTCGATGGAAAAGAATGAAAATCATAAAAAACATCCGCGAAAATCGATT
GAGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTGTCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTCTAC
ATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCAGCTGCTGGAAGATTTCTCAAGATAATGGCTACAATCTGGGCTGGAAGCCAAGTTACTAAGCTGGCAAGA
GCAGCAGGAGCTCTTGCTATGGCGCCGTTCGTCGACAGAGGATTGTCGTGGTTCACAGTCAAATACAACTTCAAGTCTCAGGGGAAGGCAGTTGTGGCGATTGTT
GGATTCTGCTTAGGGTTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAAGACAGGTTCTTCCCTTGGGAAAGTAAGTACTTTTTTCTCTCACAT
CATTCATTACCCTATTCCCAATGAGTGTTTCCTTGTCCTCAATATTGAAGGCTATGACTTTTTTTCTTCCTTAAAATTAAAATTTCCAATATTTTAAGAGATAGG
GTGCAGGTTTAACCAAAATTTTATCAATGAGATTGTCGTTATTGTCGAGAATATGTTACACG
Protein sequenceShow/hide protein sequence
MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRR
YGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSL
LLFIAVTLLSA