; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012663 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012663
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGATA zinc finger domain-containing protein 10
Genome locationchr1:43085052..43087753
RNA-Seq ExpressionLag0012663
SyntenyLag0012663
Gene Ontology termsGO:0022904 - respiratory electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR016174 - Di-haem cytochrome, transmembrane
IPR025067 - Protein of unknown function DUF4079


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604398.1 hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia]1.1e-9691.84Show/hide
Query:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK
        EFGDL        VEMRET ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEK
Subjt:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK

Query:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

KAG7034548.1 hypothetical protein SDJN02_04278, partial [Cucurbita argyrosperma subsp. argyrosperma]8.4e-9792.35Show/hide
Query:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK
        EFGDLVE    E  E RET ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEK
Subjt:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK

Query:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_022925916.1 uncharacterized protein LOC111433189 [Cucurbita moschata]1.1e-9691.84Show/hide
Query:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK
        EFGDL        VEMRET ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEK
Subjt:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK

Query:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_023543365.1 uncharacterized protein LOC111803268 [Cucurbita pepo subsp. pepo]1.1e-9691.84Show/hide
Query:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK
        EFGDLVE    E  E  ET+ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEK
Subjt:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK

Query:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_038880983.1 uncharacterized protein LOC120072635 [Benincasa hispida]4.0e-9992.42Show/hide
Query:  DEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEE
        DEFGD         VEM+ETSETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEE
Subjt:  DEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEE

Query:  KANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFE NPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  KANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

TrEMBL top hitse value%identityAlignment
A0A0A0KIF5 Uncharacterized protein1.2e-9680.43Show/hide
Query:  VAKSSQDPSPFLFLFGWNEKSSVQ--SVLCSGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHW
        + ++S   S F F      ++SV+        DEFGD      +EE +  ETSE RLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSW+LPEWLVHW
Subjt:  VAKSSQDPSPFLFLFGWNEKSSVQ--SVLCSGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHW

Query:  GHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPG
        GHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGG+TSLLTSDKPI ESPHAVTGFIGLTLLTVQTLLPSLFE NPG
Subjt:  GHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPG

Query:  LRNVHGILGSGIMTLFLIHAALGLQLGLSY
        LRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  LRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A5D3C7L2 Uncharacterized protein7.2e-9476.67Show/hide
Query:  SIAFHFIPRFVAKSSQDPSPFLFLFGWNEKSSVQSVLC--SGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKS
        S +F F    + K+ ++ S F F      ++S++   C    D FGD VE          ETSE RLYSL+PFPLLF+AALPG GTVRSLFGPFVELVKS
Subjt:  SIAFHFIPRFVAKSSQDPSPFLFLFGWNEKSSVQSVLC--SGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKS

Query:  WSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTL
         +LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGGI SLLTSDKPIFESPHAVTGFIGL LLTVQTL
Subjt:  WSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTL

Query:  LPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        LPSLFE NPGLRNVHGILGSGIMTLFLIHAA GLQLGLSY
Subjt:  LPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1CRM9 uncharacterized protein LOC1110135571.5e-9692.67Show/hide
Query:  ENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDL
        ++D  E VEM+E SET LYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDL
Subjt:  ENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDL

Query:  HPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        HPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLTVQTLLPSLFE NPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  HPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1EDG7 uncharacterized protein LOC1114331895.3e-9791.84Show/hide
Query:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK
        EFGDL        VEMRET ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEK
Subjt:  EFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEK

Query:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  ANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

A0A6J1IMY7 uncharacterized protein LOC1114779001.5e-9678.33Show/hide
Query:  SSIAFHFIPRFVAKSSQDPSPFLFLFGWNEKSSVQ--SVLCSGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVK
        SS +F  +  +++K  +D SPF        +  V+         EFGDLVE         RET ETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVK
Subjt:  SSIAFHFIPRFVAKSSQDPSPFLFLFGWNEKSSVQ--SVLCSGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVK

Query:  SWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQT
        SW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+
Subjt:  SWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQT

Query:  LLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        LLPSLFE NPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  LLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36885.1 unknown protein4.6e-8578.87Show/hide
Query:  DGYEEVEMR-----ETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANA
        DG EE+E R     E  ET + S++P PLL VA+LPGA TVRS+FGP VE+VKS +LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKA A
Subjt:  DGYEEVEMR-----ETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANA

Query:  KDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KDLHPKLL GMFFFFALGATGG+ SLLTSDKPIFESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  KDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

AT2G36885.2 unknown protein6.7e-8478.87Show/hide
Query:  DGYEEVEMR-----ETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANA
        DG EE+E R     E  ET + S++P PLL VA+LPGA TVRS+FGP VE+VKS +LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKA A
Subjt:  DGYEEVEMR-----ETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKANA

Query:  KDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KDLHPKLL GMFFFFALGATGG+ SLLTSDKPIFESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  KDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGTATAGAGACTGAATCTACAAATCAAAATTCATGGGTATTTCAAAAGAAAAAAAATCAAAATTCATGGGCTAAGAAAATATTGGCGGGTCCATCGCGTAGAAC
GGGGAGCAGTATCAGCGTGTCTCTTTCTGGACCTTTCCCTCCATTTCTGCACTTTTTCTTCTTCTATCACTTTCTCTTTCACTCAAAATCCAATCTCTGTTTCTCTGCAA
CTGGAAGCCACACTCCAACCCTCGTTATCTCCCTCCATCTCCATGGCGACGCTTACTGGAGCTTCTTCTTCTTCATACATGTGTTTAACCAAGGTATCACCAGCTCCATT
GCCTTCCACTTCATTCCCCGTTTCGTTGCCAAATCTTCCCAAGATCCCTCGCCATTCTTGTTTCTCTTTGGCTGGAACGAGAAAAGTTCGGTTCAATCCGTCCTATGCTC
GGGCGACGAGTTCGGCGATTTGGTGGAAAATGATGGCTATGAAGAAGTGGAAATGAGGGAGACGAGCGAGACGCGTTTGTACTCTCTTGCGCCTTTTCCTTTACTGTTCG
TCGCTGCTCTTCCGGGAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTTGTTGAGCTTGTTAAATCTTGGAGTCTTCCTGAATGGCTGGTACATTGGGGTCATCCAGGC
AACATGGCAGTTGTGCTCTTCGCCATGGGTGGCTATGGAACGTATCTAGGTTTCCGTATCCGTTACTCTGACGATGTGGAGGAGAAGGCTAATGCCAAGGACTTGCATCC
AAAGCTTCTAGGTGGGATGTTTTTCTTTTTTGCTCTTGGAGCAACAGGTGGAATCACTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGG
GGTTCATTGGCCTCACGCTCTTGACTGTACAAACTCTTCTGCCTTCACTTTTTGAGGGTAATCCTGGACTGAGGAATGTTCATGGTATTTTGGGTAGTGGAATCATGACA
CTGTTCCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGTATAGAGACTGAATCTACAAATCAAAATTCATGGGTATTTCAAAAGAAAAAAAATCAAAATTCATGGGCTAAGAAAATATTGGCGGGTCCATCGCGTAGAAC
GGGGAGCAGTATCAGCGTGTCTCTTTCTGGACCTTTCCCTCCATTTCTGCACTTTTTCTTCTTCTATCACTTTCTCTTTCACTCAAAATCCAATCTCTGTTTCTCTGCAA
CTGGAAGCCACACTCCAACCCTCGTTATCTCCCTCCATCTCCATGGCGACGCTTACTGGAGCTTCTTCTTCTTCATACATGTGTTTAACCAAGGTATCACCAGCTCCATT
GCCTTCCACTTCATTCCCCGTTTCGTTGCCAAATCTTCCCAAGATCCCTCGCCATTCTTGTTTCTCTTTGGCTGGAACGAGAAAAGTTCGGTTCAATCCGTCCTATGCTC
GGGCGACGAGTTCGGCGATTTGGTGGAAAATGATGGCTATGAAGAAGTGGAAATGAGGGAGACGAGCGAGACGCGTTTGTACTCTCTTGCGCCTTTTCCTTTACTGTTCG
TCGCTGCTCTTCCGGGAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTTGTTGAGCTTGTTAAATCTTGGAGTCTTCCTGAATGGCTGGTACATTGGGGTCATCCAGGC
AACATGGCAGTTGTGCTCTTCGCCATGGGTGGCTATGGAACGTATCTAGGTTTCCGTATCCGTTACTCTGACGATGTGGAGGAGAAGGCTAATGCCAAGGACTTGCATCC
AAAGCTTCTAGGTGGGATGTTTTTCTTTTTTGCTCTTGGAGCAACAGGTGGAATCACTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGG
GGTTCATTGGCCTCACGCTCTTGACTGTACAAACTCTTCTGCCTTCACTTTTTGAGGGTAATCCTGGACTGAGGAATGTTCATGGTATTTTGGGTAGTGGAATCATGACA
CTGTTCCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAA
Protein sequenceShow/hide protein sequence
MKSIETESTNQNSWVFQKKKNQNSWAKKILAGPSRRTGSSISVSLSGPFPPFLHFFFFYHFLFHSKSNLCFSATGSHTPTLVISLHLHGDAYWSFFFFIHVFNQGITSSI
AFHFIPRFVAKSSQDPSPFLFLFGWNEKSSVQSVLCSGDEFGDLVENDGYEEVEMRETSETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPG
NMAVVLFAMGGYGTYLGFRIRYSDDVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFEGNPGLRNVHGILGSGIMT
LFLIHAALGLQLGLSY