; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029061 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029061
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionuclacyanin-3
Genome locationchr8:34859822..34861649
RNA-Seq ExpressionLag0029061
SyntenyLag0029061
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022133832.1 uncharacterized protein LOC111006293 isoform X1 [Momordica charantia]1.6e-10280.24Show/hide
Query:  IMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFN
        +MKIH  + +FLCFLALLFQLCCSSTTIVVDGVSEWKNPSVH GDSIIFKHKFHY LFIFH+QRAFNLCN+THATLLSKPNSTTF WHPSRPG+FFFSF+
Subjt:  IMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFN

Query:  NGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSN
        NGSK+SCNGSQKLAVKVSAS PP+  HLSPQ PPMAAPAP+SGGVLPSSP YPWPF PRQ  P       PSLP S SSPLT+P L+PEKGG LPFINSN
Subjt:  NGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSN

Query:  PAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFILL
        PAVPLPTGEVDSATIRPLPTS HG+HRV+M    A+KL L+SLLF+ L
Subjt:  PAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFILL

XP_022939349.1 uclacyanin-3 [Cucurbita moschata]8.1e-10280.65Show/hide
Query:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF
        M  KIH LV+VF  F ALL QLCCSS TIVVDGVS+WKNPSVHIGDSI+FKHKFHYELFIFHNQRAF+LCNYTHATLLSKPNST FMWHPSR GVFFF+F
Subjt:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF

Query:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS
        NNGSKSSCNGSQKLAVKV+ SAPP+ SHLSP  PPMAAPAPISGGVLPS+PAYPWPFHPRQ  PS      PSLP S S PLT    +PEKGG+LPFINS
Subjt:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS

Query:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL
        NPAVPLPTGEVD+ATIRPLPTS HGTHR IMGFPL +KL L+S LF+L
Subjt:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL

XP_022992896.1 uclacyanin-3 [Cucurbita maxima]1.9e-10382.04Show/hide
Query:  KIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNG
        KIH LV+VF CF ALL QLCCSS TIVVDGVS+WKNPSVHIGDSIIFKHKFHYELFIFHNQRAF+LCNYTHATLLSKPNST FMWHPSR GVFFF+FNNG
Subjt:  KIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNG

Query:  SKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPA
        SKSSCNGSQKLAVKV+ SAPP+ SHLSPQ PPMAAPAPISGGVLPS+PAYPWPFHPRQ  PS      PSLP S S PLT    +PEKGG+LPFINSNPA
Subjt:  SKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPA

Query:  VPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL
        VPLPTGEVD+ATIRPLPTS HGTHR IMGFPL +KL L+S  F+L
Subjt:  VPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL

XP_023551509.1 uclacyanin-3 [Cucurbita pepo subsp. pepo]8.1e-10281.89Show/hide
Query:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF
        M  KIH LV+VF  F ALL QLCCSS TIVVDGVS+WKNPSVHIGDSIIFKHKFHYELFIFHNQRAF+LCNYTHATLLSKPNST FMWHPSR GVFFF+F
Subjt:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF

Query:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS
        NNGSKSSCNGSQKLAVKV+ SAPP+ SHLSP  PPMAAPAPISGGVLPS+PAYPWPFHPRQ  PS      PSLP S S PLT    +PEKGG+LPFINS
Subjt:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS

Query:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLS
        NPAVPLPTGEVD+ATIRPLPTSAHGTHR IMGFPL +KL L+S
Subjt:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLS

XP_038884501.1 early nodulin-like protein 1 isoform X2 [Benincasa hispida]2.9e-9981.63Show/hide
Query:  MKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNN
        MKIH L+LV LCF   LFQLC SS TIV+DGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLL+KPNST+FMWHPSR G+FFFSFNN
Subjt:  MKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNN

Query:  GSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNP
        GSKSSCNGSQKLAVKVSAS P + SHLSPQ PPMAAPAPISGGVLPS+PAYPWPFHPRQ   + SPSP PSLP S S P T    +PEKGG L FINSNP
Subjt:  GSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNP

Query:  AVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFI
        AVPLPTGEVD+ATIRPL TS HGTHRVIM FPL IKL L+S+LF+
Subjt:  AVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFI

TrEMBL top hitse value%identityAlignment
A0A1S3B358 uncharacterized protein LOC1034856112.8e-9274.6Show/hide
Query:  MKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNN
        MKI  L+LVFLCF   LF +C SS TIVVDGVS+WK+PSVHIGDSIIFKHKFHYELFIF +QRAF+LCNYTHATLL+KPNST+FMWHPSR G+FFFSFNN
Subjt:  MKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNN

Query:  GSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNP
        GSKSSCNGSQK AVKVSAS+PP+ SHLSP  PPMAAPAP+S GVLPS+PAYPWPFHPRQ   SPSPS  P +P S S PLT    +P KGG + FINSNP
Subjt:  GSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNP

Query:  AVPLPTGEVDSATIRPLPTSAHGTHR---VIMGFPLAIKLVLLSLLFI
        AVPLPTGEVD+ATIRPL TS  GTHR   VIM  PL +K+ L+S+LF+
Subjt:  AVPLPTGEVDSATIRPLPTSAHGTHR---VIMGFPLAIKLVLLSLLFI

A0A6J1BXW5 uncharacterized protein LOC111006293 isoform X17.9e-10380.24Show/hide
Query:  IMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFN
        +MKIH  + +FLCFLALLFQLCCSSTTIVVDGVSEWKNPSVH GDSIIFKHKFHY LFIFH+QRAFNLCN+THATLLSKPNSTTF WHPSRPG+FFFSF+
Subjt:  IMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFN

Query:  NGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSN
        NGSK+SCNGSQKLAVKVSAS PP+  HLSPQ PPMAAPAP+SGGVLPSSP YPWPF PRQ  P       PSLP S SSPLT+P L+PEKGG LPFINSN
Subjt:  NGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSN

Query:  PAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFILL
        PAVPLPTGEVDSATIRPLPTS HG+HRV+M    A+KL L+SLLF+ L
Subjt:  PAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFILL

A0A6J1C0C5 uncharacterized protein LOC111006293 isoform X21.3e-9781.3Show/hide
Query:  FQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNGSKSSCNGSQKLAVKVS
        FQLCCSSTTIVVDGVSEWKNPSVH GDSIIFKHKFHY LFIFH+QRAFNLCN+THATLLSKPNSTTF WHPSRPG+FFFSF+NGSK+SCNGSQKLAVKVS
Subjt:  FQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNGSKSSCNGSQKLAVKVS

Query:  ASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPAVPLPTGEVDSATIRPL
        AS PP+  HLSPQ PPMAAPAP+SGGVLPSSP YPWPF PRQ  P       PSLP S SSPLT+P L+PEKGG LPFINSNPAVPLPTGEVDSATIRPL
Subjt:  ASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPAVPLPTGEVDSATIRPL

Query:  PTSAHGTHRVIMGFPLAIKLVLLSLLFILL
        PTS HG+HRV+M    A+KL L+SLLF+ L
Subjt:  PTSAHGTHRVIMGFPLAIKLVLLSLLFILL

A0A6J1FGW9 uclacyanin-33.9e-10280.65Show/hide
Query:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF
        M  KIH LV+VF  F ALL QLCCSS TIVVDGVS+WKNPSVHIGDSI+FKHKFHYELFIFHNQRAF+LCNYTHATLLSKPNST FMWHPSR GVFFF+F
Subjt:  MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSF

Query:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS
        NNGSKSSCNGSQKLAVKV+ SAPP+ SHLSP  PPMAAPAPISGGVLPS+PAYPWPFHPRQ  PS      PSLP S S PLT    +PEKGG+LPFINS
Subjt:  NNGSKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS

Query:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL
        NPAVPLPTGEVD+ATIRPLPTS HGTHR IMGFPL +KL L+S LF+L
Subjt:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL

A0A6J1JUT2 uclacyanin-39.4e-10482.04Show/hide
Query:  KIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNG
        KIH LV+VF CF ALL QLCCSS TIVVDGVS+WKNPSVHIGDSIIFKHKFHYELFIFHNQRAF+LCNYTHATLLSKPNST FMWHPSR GVFFF+FNNG
Subjt:  KIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNG

Query:  SKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPA
        SKSSCNGSQKLAVKV+ SAPP+ SHLSPQ PPMAAPAPISGGVLPS+PAYPWPFHPRQ  PS      PSLP S S PLT    +PEKGG+LPFINSNPA
Subjt:  SKSSCNGSQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPA

Query:  VPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL
        VPLPTGEVD+ATIRPLPTS HGTHR IMGFPL +KL L+S  F+L
Subjt:  VPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21090.1 Cupredoxin superfamily protein1.1e-4849.19Show/hide
Query:  LVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNGSK--
        L   F CFL+ LF     S T +VDGVS WK+P+VH GDS+IF+HK+ Y+L+IF N+ AFN+CN+T ATLL+KPNST+F W+PSR G ++FSF N +   
Subjt:  LVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNGSK--

Query:  SSCNGSQKLAVKV-SASAPPKGSHLSPQIPPMAAPAPIS-GGVLPSSPAYPWPFHPRQ---VVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS
         +C  +QKL V+V  A+A P      P  PP  AP P+S GGV+ S  +YPWP  PR+     P PSPS   S+             +P K G +PFINS
Subjt:  SSCNGSQKLAVKV-SASAPPKGSHLSPQIPPMAAPAPIS-GGVLPSSPAYPWPFHPRQ---VVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINS

Query:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL
        NPAVPLPTG+VDS +I PLPTS +  H+V+M   L +KL L  +   L
Subjt:  NPAVPLPTGEVDSATIRPLPTSAHGTHRVIMGFPLAIKLVLLSLLFIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATGAAGATCCATTCATTAGTGCTTGTGTTCTTGTGTTTTCTAGCATTGTTGTTTCAATTATGTTGCTCTTCCACTACCATTGTTGTTGATGGAGTTTCAGAATG
GAAAAATCCCTCTGTTCATATTGGAGATTCCATCATTTTCAAGCATAAGTTTCATTATGAGCTCTTCATTTTCCACAATCAAAGGGCTTTCAATTTGTGCAATTACACTC
ATGCCACTCTTCTCAGCAAACCCAATTCCACTACATTTATGTGGCATCCATCACGGCCTGGAGTTTTCTTCTTCTCTTTCAACAATGGCTCTAAGAGCTCCTGCAATGGC
TCTCAAAAGCTTGCTGTGAAGGTTTCTGCTTCAGCCCCACCAAAAGGTTCCCATCTTTCTCCACAGATCCCTCCAATGGCGGCTCCGGCGCCGATTTCCGGCGGAGTTCT
GCCGTCTTCTCCGGCATACCCTTGGCCATTTCACCCTCGGCAAGTGGTGCCGTCGCCGTCGCCTAGTCCACAGCCGAGTCTGCCGTCGAGTGGAAGTTCGCCGTTGACGT
TGCCGCCGTTGATGCCGGAGAAAGGAGGAGCTCTGCCGTTTATTAACAGTAATCCGGCGGTTCCTCTGCCCACCGGCGAAGTGGACTCTGCCACTATTCGTCCTTTGCCA
ACTTCAGCCCATGGAACACATCGGGTAATCATGGGCTTTCCACTTGCAATTAAATTGGTTCTACTTTCACTTTTGTTTATTTTGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAATGAAGATCCATTCATTAGTGCTTGTGTTCTTGTGTTTTCTAGCATTGTTGTTTCAATTATGTTGCTCTTCCACTACCATTGTTGTTGATGGAGTTTCAGAATG
GAAAAATCCCTCTGTTCATATTGGAGATTCCATCATTTTCAAGCATAAGTTTCATTATGAGCTCTTCATTTTCCACAATCAAAGGGCTTTCAATTTGTGCAATTACACTC
ATGCCACTCTTCTCAGCAAACCCAATTCCACTACATTTATGTGGCATCCATCACGGCCTGGAGTTTTCTTCTTCTCTTTCAACAATGGCTCTAAGAGCTCCTGCAATGGC
TCTCAAAAGCTTGCTGTGAAGGTTTCTGCTTCAGCCCCACCAAAAGGTTCCCATCTTTCTCCACAGATCCCTCCAATGGCGGCTCCGGCGCCGATTTCCGGCGGAGTTCT
GCCGTCTTCTCCGGCATACCCTTGGCCATTTCACCCTCGGCAAGTGGTGCCGTCGCCGTCGCCTAGTCCACAGCCGAGTCTGCCGTCGAGTGGAAGTTCGCCGTTGACGT
TGCCGCCGTTGATGCCGGAGAAAGGAGGAGCTCTGCCGTTTATTAACAGTAATCCGGCGGTTCCTCTGCCCACCGGCGAAGTGGACTCTGCCACTATTCGTCCTTTGCCA
ACTTCAGCCCATGGAACACATCGGGTAATCATGGGCTTTCCACTTGCAATTAAATTGGTTCTACTTTCACTTTTGTTTATTTTGCTTTAG
Protein sequenceShow/hide protein sequence
MIMKIHSLVLVFLCFLALLFQLCCSSTTIVVDGVSEWKNPSVHIGDSIIFKHKFHYELFIFHNQRAFNLCNYTHATLLSKPNSTTFMWHPSRPGVFFFSFNNGSKSSCNG
SQKLAVKVSASAPPKGSHLSPQIPPMAAPAPISGGVLPSSPAYPWPFHPRQVVPSPSPSPQPSLPSSGSSPLTLPPLMPEKGGALPFINSNPAVPLPTGEVDSATIRPLP
TSAHGTHRVIMGFPLAIKLVLLSLLFILL