; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011652 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011652
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr01:8743569..8745674
RNA-Seq ExpressionHG10011652
SyntenyHG10011652
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015466.1 hypothetical protein SDJN02_23102 [Cucurbita argyrosperma subsp. argyrosperma]4.7e-11588.8Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGG--GGGGGDDGAT
        MTSFIRFS+FN+LHDNF  KPT+FNPLPPPKVASC RNGFRLR+  G+L FL GGW N  V GK G FKGKRGL+VARFNQGFGFNGG  GGGGGDDGAT
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGG--GGGGGDDGAT

Query:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        ARLLGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
         FSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

XP_004149291.1 uncharacterized protein LOC101205846 [Cucumis sativus]3.3e-11688.21Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGG------GGGGG
        MTSFIRFSTFNILH+NFC KPTKFNPLPPPKV  C  +G RLR YG RLLF GGGWV++RVFGK GGFKGKRG L+VARFNQGFGFNGG      GG GG
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGG------GGGGG

Query:  DDGATARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF
        DDGATARL+GNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF
Subjt:  DDGATARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF

Query:  VRDSVKFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        VRDSVKFSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  VRDSVKFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

XP_008452285.1 PREDICTED: uncharacterized protein LOC103493356 [Cucumis melo]7.3e-11689.17Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA
        MTSFIRFSTFNILH+NFC KPTKFNPLPPPKV  C+ +G RLR YG RL F GGGWV +RVFGK  GFKGKRG L+VARFNQGFGFNGGGG GGDDGATA
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA

Query:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
        RLLGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV 
Subjt:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK

Query:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        FS KT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

XP_022985259.1 uncharacterized protein LOC111483302 [Cucurbita maxima]1.1e-11690.04Show/hide
Query:  MTSFIRFSTFN-ILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN-GGGGGGGDDGAT
        MTSFIRFS+FN +LHDNF KKPT+FNPLPPPKVASCDRNGFRLR+ GG+L FL GGW N+ V GK G FKGKRGL+VARFNQGFGFN GGGGGGGDDGAT
Subjt:  MTSFIRFSTFN-ILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN-GGGGGGGDDGAT

Query:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        ARLLGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
         FSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

XP_038905451.1 uncharacterized protein LOC120091477 isoform X2 [Benincasa hispida]5.6e-11691.29Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN--GGGGGGGDDGAT
        MTSFIRFST NILHD+FC KPTKFNPLP PKVAS DRNGFRLRVYG RL FL     N RVFGK GG KGKRGL+VARFNQGFGFN  GGGGGGGDDGAT
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN--GGGGGGGDDGAT

Query:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        KFSNKT+ TFGQAF DFTSPQKGKETSGAVVDIEAEVKDVD
Subjt:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

TrEMBL top hitse value%identityAlignment
A0A0A0L5A4 Uncharacterized protein1.6e-11688.21Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGG------GGGGG
        MTSFIRFSTFNILH+NFC KPTKFNPLPPPKV  C  +G RLR YG RLLF GGGWV++RVFGK GGFKGKRG L+VARFNQGFGFNGG      GG GG
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGG------GGGGG

Query:  DDGATARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF
        DDGATARL+GNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF
Subjt:  DDGATARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKF

Query:  VRDSVKFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        VRDSVKFSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  VRDSVKFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

A0A1S3BUA7 uncharacterized protein LOC1034933563.5e-11689.17Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA
        MTSFIRFSTFNILH+NFC KPTKFNPLPPPKV  C+ +G RLR YG RL F GGGWV +RVFGK  GFKGKRG L+VARFNQGFGFNGGGG GGDDGATA
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA

Query:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
        RLLGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV 
Subjt:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK

Query:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        FS KT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

A0A5A7TVG5 Uncharacterized protein3.5e-11689.17Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA
        MTSFIRFSTFNILH+NFC KPTKFNPLPPPKV  C+ +G RLR YG RL F GGGWV +RVFGK  GFKGKRG L+VARFNQGFGFNGGGG GGDDGATA
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRG-LVVARFNQGFGFNGGGGGGGDDGATA

Query:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
        RLLGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV 
Subjt:  RLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK

Query:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
        FS KT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  FSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

A0A6J1EMN9 uncharacterized protein LOC1114359815.1e-11588.8Show/hide
Query:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGG--GGGGGDDGAT
        MTSFIRFS+FN+LHDNF  KPT+FNPLPPPKVASC RNGFRLR   G+L FL GGW N  V GK G FKGKRGL+VARFNQGFGFNGG  GGGGGDDGAT
Subjt:  MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGG--GGGGGDDGAT

Query:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        ARLLGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
         FSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

A0A6J1JCT5 uncharacterized protein LOC1114833025.4e-11790.04Show/hide
Query:  MTSFIRFSTFN-ILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN-GGGGGGGDDGAT
        MTSFIRFS+FN +LHDNF KKPT+FNPLPPPKVASCDRNGFRLR+ GG+L FL GGW N+ V GK G FKGKRGL+VARFNQGFGFN GGGGGGGDDGAT
Subjt:  MTSFIRFSTFN-ILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFN-GGGGGGGDDGAT

Query:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        ARLLGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD
         FSNKT+ TFGQAFSDFTSP+KGKETSGAVVDIEAEVKDVD
Subjt:  KFSNKTAPTFGQAFSDFTSPQKGKETSGAVVDIEAEVKDVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G27390.1 unknown protein3.1e-5653.19Show/hide
Query:  LHDNFCKKPT-----KFNPLPPPKVASCDRNGFR--LRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGGGGGGGDDGATARLLGNI
        +HD F K+P+     K   L P  + +  + G R  L  +  R   +G G +   +   +     KR +V  R   G  FN     GGD+    R+LGN+
Subjt:  LHDNFCKKPT-----KFNPLPPPKVASCDRNGFR--LRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGGGGGGGDDGATARLLGNI

Query:  ALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTA
        ALA+GLTYLS+TGQLGW+LDAIVS+WL+ V+VPI+G+ AF WWA RDIVQS CPNCGNEFQIFKS +++E+QLCPFC+QPFSVVDDKFV++ VKFSN+T 
Subjt:  ALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTA

Query:  PTFGQAFSDFTS-PQKGKETSGAVVDIEAEVKDVD
          FGQ  + F+S P+KGK +S AVVDIEAEV D D
Subjt:  PTFGQAFSDFTS-PQKGKETSGAVVDIEAEVKDVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAGTTTCATCAGATTTTCTACCTTCAACATACTACACGACAATTTCTGCAAGAAACCCACCAAATTTAATCCTCTTCCGCCTCCGAAGGTTGCCTCCTGCGATCG
AAATGGGTTCCGTTTGAGGGTTTATGGAGGGAGATTGTTGTTTCTGGGTGGGGGTTGGGTTAACGAACGGGTTTTTGGAAAAGATGGAGGGTTTAAGGGAAAGAGAGGGC
TAGTTGTAGCAAGGTTTAATCAGGGTTTTGGGTTTAACGGCGGCGGCGGCGGCGGCGGAGACGATGGGGCAACTGCGAGGCTTCTGGGTAATATTGCTTTGGCTGTTGGG
TTAACTTATCTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCGATTGTTTCCATTTGGCTTGTTGCAGTTCTTGTACCAATTGTTGGAGTGGCTGCTTTTATATG
GTGGGCAGGACGAGATATAGTTCAAAGCACTTGCCCTAACTGTGGAAATGAATTTCAAATATTCAAATCAACACTGAATGAAGAGCTGCAACTATGCCCTTTTTGTAGCC
AACCTTTCTCTGTGGTGGATGACAAGTTTGTGAGGGATTCAGTGAAGTTCTCCAACAAAACTGCTCCTACCTTTGGACAGGCATTCAGTGACTTTACTTCTCCCCAAAAA
GGGAAGGAAACTTCTGGTGCAGTGGTTGACATAGAAGCAGAAGTAAAAGATGTGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAGTTTCATCAGATTTTCTACCTTCAACATACTACACGACAATTTCTGCAAGAAACCCACCAAATTTAATCCTCTTCCGCCTCCGAAGGTTGCCTCCTGCGATCG
AAATGGGTTCCGTTTGAGGGTTTATGGAGGGAGATTGTTGTTTCTGGGTGGGGGTTGGGTTAACGAACGGGTTTTTGGAAAAGATGGAGGGTTTAAGGGAAAGAGAGGGC
TAGTTGTAGCAAGGTTTAATCAGGGTTTTGGGTTTAACGGCGGCGGCGGCGGCGGCGGAGACGATGGGGCAACTGCGAGGCTTCTGGGTAATATTGCTTTGGCTGTTGGG
TTAACTTATCTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCGATTGTTTCCATTTGGCTTGTTGCAGTTCTTGTACCAATTGTTGGAGTGGCTGCTTTTATATG
GTGGGCAGGACGAGATATAGTTCAAAGCACTTGCCCTAACTGTGGAAATGAATTTCAAATATTCAAATCAACACTGAATGAAGAGCTGCAACTATGCCCTTTTTGTAGCC
AACCTTTCTCTGTGGTGGATGACAAGTTTGTGAGGGATTCAGTGAAGTTCTCCAACAAAACTGCTCCTACCTTTGGACAGGCATTCAGTGACTTTACTTCTCCCCAAAAA
GGGAAGGAAACTTCTGGTGCAGTGGTTGACATAGAAGCAGAAGTAAAAGATGTGGACTGA
Protein sequenceShow/hide protein sequence
MTSFIRFSTFNILHDNFCKKPTKFNPLPPPKVASCDRNGFRLRVYGGRLLFLGGGWVNERVFGKDGGFKGKRGLVVARFNQGFGFNGGGGGGGDDGATARLLGNIALAVG
LTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTAPTFGQAFSDFTSPQK
GKETSGAVVDIEAEVKDVD