; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022634 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022634
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDUF3685 domain-containing protein
Genome locationChr05:26461746..26469355
RNA-Seq ExpressionHG10022634
SyntenyHG10022634
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022552 - Uncharacterised protein family Ycf55


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447264.1 PREDICTED: uncharacterized protein LOC103489745 isoform X1 [Cucumis melo]2.3e-24686.87Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS AL NLSFKRE RKSS ESYK++RISTWRRHEL GSCGS+ IVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQ LLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHAL VVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNIS ATIQNMDDLSKIFSEFI KSS+PVCMSWLRNELSMEN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGARSRDYCYYDHSLYVKH ISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTLGSQ IDLPGS QANIDNWWMK+I RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

XP_008447265.1 PREDICTED: uncharacterized protein LOC103489745 isoform X2 [Cucumis melo]1.7e-24486.68Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS AL NLSFKRE RKSS ESYK++RISTWRRHEL GSCGS+ IVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQ LLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHAL VVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNIS ATIQNMDDLSKIFSEFI KSS+PVCMSWLRNELSMEN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGAR RDYCYYDHSLYVKH ISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTLGSQ IDLPGS QANIDNWWMK+I RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

XP_011659179.1 uncharacterized protein LOC101223105 isoform X1 [Cucumis sativus]8.8e-24686.29Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS ALCNL FK+EQRKSS ESYK++RISTWR HEL GSCGS+LIVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHALDVVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNISNATIQ+MDDLSKIFSEFI KS +PVCMSWLRNELS+EN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGAR RDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTL SQ IDLPGS Q NIDNWWMK+ILRR+ETLSSQ+YYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

XP_038898894.1 uncharacterized protein LOC120086352 isoform X2 [Benincasa hispida]5.9e-24279.72Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWRTPFKVKSS LCNL+F+REQRKSSC+S  ++RISTWRR EL G CGS+LIVNP PRK FREHAYLRSLVN+DGTTASEALF+
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIP+PKSNQPGNI S TNSVSDNQT SGSG+KTDGQ+NPKHALDVVKGKILD LDAFERRKSMES                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNEL MEN+DSSKAFLSLMSE FKAEDNILPGIKKSGK+ELYAELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGAR--------------------------------------------SRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDE
        HFLSFGAR                                             RD+C YDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDE
Subjt:  HFLSFGAR--------------------------------------------SRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDE

Query:  VDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKE
        VDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTL SQQIDLPG+RQANIDNWWMKHILRRRETLSSQLYYVVI SFAMPVKRTKE
Subjt:  VDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKE

Query:  LRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        LRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  LRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

XP_038898895.1 uncharacterized protein LOC120086352 isoform X3 [Benincasa hispida]6.1e-24786.49Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWRTPFKVKSS LCNL+F+REQRKSSC+S  ++RISTWRR EL G CGS+LIVNP PRK FREHAYLRSLVN+DGTTASEALF+
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIP+PKSNQPGNI S TNSVSDNQT SGSG+KTDGQ+NPKHALDVVKGKILD LDAFERRKSMES                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNEL MEN+DSSKAFLSLMSE FKAEDNILPGIKKSGK+ELYAELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGAR RD+C YDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTL SQQIDLPG+RQANIDNWWMKHILRRRETLSSQLYYVVI SFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

TrEMBL top hitse value%identityAlignment
A0A0A0K4I5 Uncharacterized protein1.1e-24185.52Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS ALCNL      RKSS ESYK++RISTWR HEL GSCGS+LIVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHALDVVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNISNATIQ+MDDLSKIFSEFI KS +PVCMSWLRNELS+EN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGAR RDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTL SQ IDLPGS Q NIDNWWMK+ILRR+ETLSSQ+YYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

A0A1S3BH10 uncharacterized protein LOC103489745 isoform X28.1e-24586.68Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS AL NLSFKRE RKSS ESYK++RISTWRRHEL GSCGS+ IVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQ LLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHAL VVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNIS ATIQNMDDLSKIFSEFI KSS+PVCMSWLRNELSMEN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGAR RDYCYYDHSLYVKH ISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTLGSQ IDLPGS QANIDNWWMK+I RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

A0A1S3BH17 uncharacterized protein LOC103489745 isoform X11.1e-24686.87Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAEHVAVTPSPCIKLQIWR PFKVKS AL NLSFKRE RKSS ESYK++RISTWRRHEL GSCGS+ IVNP PRKTFREHAYLRSLVNVDGTTASEA+FV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQ LLMTSIFLTYMAGVIPVPKSNQ GNI SQTNSV DNQT SGSGMKTDGQ+NPKHAL VVKGKILDFLDAFERRKSME+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNNIS ATIQNMDDLSKIFSEFI KSS+PVCMSWLRNELSMEN+DSSKAFLSLMSE FKAEDNILPGIKKSGKEEL+AELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFGARSRDYCYYDHSLYVKH ISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTLGSQ IDLPGS QANIDNWWMK+I RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

A0A6J1GNV1 uncharacterized protein LOC111456110 isoform X13.0e-23182.24Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAE+V V  +PCIKLQI RTPF+ KS+A C+ SFKREQR+SSC SYK+ RISTWRR  LSG  GS+LIV+P PRK FREHAYLRSLVNVDGTTASE LFV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIPVPKSNQPGNIIS TNS SDN T SGSGMKTD Q+N K+ALDVVKGKILDFLDAFERRKS+E+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNN+S ATIQNMDDLS IFS+FIQKSS PVCMSWL+NELSM+N+DSSKAFLSLMSE  KAEDNILPGIKKSGKEELYAELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFG RSRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTL SQQI+LPGSRQANIDNWWMKHILRRRETLSS+L YVVI SFAMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

A0A6J1JWY6 uncharacterized protein LOC111488215 isoform X17.8e-23282.63Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        MAE+V V  +PCIKLQI RTPF+ KSSA C+ SFKRE+RKSSC SYK+ RISTWRR  LSG  GS+LIV+P PRK FREHA LRSLVNVDGTTASE LFV
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------
        DQLLLMTSIFLTYMAGVIPVPKSNQPGNIIS TNS SDN T SGSGMKTD Q+N K+ALDVVKGKILDFLDAFE RKS+E+                   
Subjt:  DQLLLMTSIFLTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMES-------------------

Query:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM
                          EVNN+S ATIQNMDDLS IFS+FIQKSSQPVCMSWL+NELSM+N+DSSKAFLSLMSE  KAEDNILPGIKKSGKEELYAELM
Subjt:  ------------------EVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELM

Query:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
        HFLSFG RSRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF
Subjt:  HFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRF

Query:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
        DLCTL SQQI+LPGSRQANIDNWWMKHILRRRETLSS+L YVVI SFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI
Subjt:  DLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLI

Query:  GRSLGLIYTGIRQSLRWK
        GRSLGLIYTGIRQSLRWK
Subjt:  GRSLGLIYTGIRQSLRWK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48830.1 unknown protein5.4e-10043.43Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        M  HV V+PS  ++L++        S    N   K  QR     S K  + +      L  SC      + T + T        SL + DG   S  + +
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  -DQLLLMTSIFLTYMAGVIPVPK----SNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERR-------------------
         DQ+LL  SIFLTYMAGVIPV K    S+    I+ +   V    T   SG +TD + + K   DVVK K+LD LDA +R                    
Subjt:  -DQLLLMTSIFLTYMAGVIPVPK----SNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERR-------------------

Query:  ------------------KSMESEVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEEL
                          + +E E N IS     N D+    F++ ++++ Q  C +WL+ EL +EN+DS  A   L+      +D I   I+KSGKE+L
Subjt:  ------------------KSMESEVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEEL

Query:  YAELMHFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSM
        +AE ++F  FG+  + +C YD S +  HG++ILED +IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+VSM
Subjt:  YAELMHFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSM

Query:  YEDRFDLCTLGSQQI-DLPGSRQANIDNWWMKHIL-RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGIS
        YEDRFDL  L +Q I +L GS      +WW K  L + +   SS L Y +I  F++PVKRTKEL+AL GWRYYFSL +ELSDI MP+IRVV+DK+SS IS
Subjt:  YEDRFDLCTLGSQQI-DLPGSRQANIDNWWMKHIL-RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGIS

Query:  FFLVCLIGRSLGLIYTGIRQSLRWK
        FFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  FFLVCLIGRSLGLIYTGIRQSLRWK

AT5G48830.2 unknown protein4.7e-9642.67Show/hide
Query:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV
        M  HV V+PS  ++L++        S    N   K  QR     S K  + +      L  SC      + T + T        SL + DG   S  + +
Subjt:  MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFV

Query:  -DQLLLMTSIFLTYMAGVIPVPK----SNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERR-------------------
         DQ+LL  SIFLTYMAGVIPV K    S+    I+ +   V    T   SG +TD + + K   DVVK K+LD LDA +R                    
Subjt:  -DQLLLMTSIFLTYMAGVIPVPK----SNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERR-------------------

Query:  ------------------KSMESEVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSS-------KAFLSLMSETFKAEDNILPGIK
                          + +E E N IS     N D+    F++ ++++ Q  C +WL+ EL +EN+DS        +A   L+      +D I   I+
Subjt:  ------------------KSMESEVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNELSMENSDSS-------KAFLSLMSETFKAEDNILPGIK

Query:  KSGKEELYAELMHFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQN
        KSGKE+L+AE ++F  FG+  + +C YD S +  HG++ILED +IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LRNEVA+ QWL+QN
Subjt:  KSGKEELYAELMHFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQN

Query:  IEAIVSMYEDRFDLCTLGSQQI-DLPGSRQANIDNWWMKHIL-RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVID
        +EA+VSMYEDRFDL  L +Q I +L GS      +WW K  L + +   SS L Y +I  F++PVKRTKEL+AL GW YYFSL +ELSDI MP+IRVV+D
Subjt:  IEAIVSMYEDRFDLCTLGSQQI-DLPGSRQANIDNWWMKHIL-RRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVID

Query:  KISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        K+SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  KISSGISFFLVCLIGRSLGLIYTGIRQSLRWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCATGTGGCAGTCACACCATCACCATGTATCAAGCTGCAAATTTGGAGAACTCCATTCAAAGTGAAAAGCTCTGCACTATGCAATCTCAGTTTTAAAAGAGA
ACAGAGAAAATCCTCTTGTGAGAGCTATAAGTACGTAAGGATCTCAACTTGGAGAAGGCATGAGCTTAGTGGTTCTTGTGGCTCAAGCTTAATTGTAAATCCTACTCCCA
GGAAGACCTTCAGAGAGCATGCTTACCTAAGGTCCTTGGTAAACGTCGATGGAACAACAGCTTCCGAGGCACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTT
CTAACATATATGGCTGGAGTAATACCTGTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAACCAATTCAGTCTCAGATAACCAAACCATTTCTGGTAGTGGCAT
GAAGACTGATGGTCAAGTTAATCCGAAGCATGCATTAGATGTAGTTAAAGGAAAAATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAAAAGTATGGAAAGTGAGGTCA
ACAATATTTCCAATGCTACTATTCAGAACATGGATGATTTGTCTAAAATATTTTCTGAATTTATTCAAAAATCCTCTCAACCTGTATGCATGTCTTGGCTGAGAAACGAA
TTGTCAATGGAAAATAGTGATTCTAGTAAGGCATTTCTTTCTTTGATGTCTGAAACATTTAAAGCAGAAGACAACATTTTACCCGGAATTAAGAAGTCTGGCAAGGAAGA
GCTGTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGCAGGGATTATTGCTATTATGACCATAGCCTGTATGTCAAGCATGGGATTTCAATATTAGAAGATT
TGCTGATAACCTTTGCTGACGGGATTGCAAGTATGTATCTAGAATTTATTTCTGTTGACAGCAGTTTCTTTGATGAAGTGGATAATATTGGCTTGGCATTGTGTACCCTA
TCAACACGAGCACTCCAAAGACTGCGTAATGAGGTGGCTATGAACCAATGGTTGTATCAAAACATCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTGTGTAC
ACTTGGTAGTCAACAGATTGACCTACCAGGCAGTAGACAGGCCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAAGAGAAACTTTGTCTTCTCAGTTATATT
ATGTTGTTATACGCTCCTTTGCCATGCCTGTAAAGCGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGACATTACG
ATGCCGTTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCTAGTCTGTCTGATTGGAAGATCTTTAGGGCTCATCTATACAGGAATCAGGCAGTC
ACTAAGGTGGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGCATGTGGCAGTCACACCATCACCATGTATCAAGCTGCAAATTTGGAGAACTCCATTCAAAGTGAAAAGCTCTGCACTATGCAATCTCAGTTTTAAAAGAGA
ACAGAGAAAATCCTCTTGTGAGAGCTATAAGTACGTAAGGATCTCAACTTGGAGAAGGCATGAGCTTAGTGGTTCTTGTGGCTCAAGCTTAATTGTAAATCCTACTCCCA
GGAAGACCTTCAGAGAGCATGCTTACCTAAGGTCCTTGGTAAACGTCGATGGAACAACAGCTTCCGAGGCACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTT
CTAACATATATGGCTGGAGTAATACCTGTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAACCAATTCAGTCTCAGATAACCAAACCATTTCTGGTAGTGGCAT
GAAGACTGATGGTCAAGTTAATCCGAAGCATGCATTAGATGTAGTTAAAGGAAAAATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAAAAGTATGGAAAGTGAGGTCA
ACAATATTTCCAATGCTACTATTCAGAACATGGATGATTTGTCTAAAATATTTTCTGAATTTATTCAAAAATCCTCTCAACCTGTATGCATGTCTTGGCTGAGAAACGAA
TTGTCAATGGAAAATAGTGATTCTAGTAAGGCATTTCTTTCTTTGATGTCTGAAACATTTAAAGCAGAAGACAACATTTTACCCGGAATTAAGAAGTCTGGCAAGGAAGA
GCTGTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGCAGGGATTATTGCTATTATGACCATAGCCTGTATGTCAAGCATGGGATTTCAATATTAGAAGATT
TGCTGATAACCTTTGCTGACGGGATTGCAAGTATGTATCTAGAATTTATTTCTGTTGACAGCAGTTTCTTTGATGAAGTGGATAATATTGGCTTGGCATTGTGTACCCTA
TCAACACGAGCACTCCAAAGACTGCGTAATGAGGTGGCTATGAACCAATGGTTGTATCAAAACATCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTGTGTAC
ACTTGGTAGTCAACAGATTGACCTACCAGGCAGTAGACAGGCCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAAGAGAAACTTTGTCTTCTCAGTTATATT
ATGTTGTTATACGCTCCTTTGCCATGCCTGTAAAGCGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGACATTACG
ATGCCGTTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCTAGTCTGTCTGATTGGAAGATCTTTAGGGCTCATCTATACAGGAATCAGGCAGTC
ACTAAGGTGGAAATGA
Protein sequenceShow/hide protein sequence
MAEHVAVTPSPCIKLQIWRTPFKVKSSALCNLSFKREQRKSSCESYKYVRISTWRRHELSGSCGSSLIVNPTPRKTFREHAYLRSLVNVDGTTASEALFVDQLLLMTSIF
LTYMAGVIPVPKSNQPGNIISQTNSVSDNQTISGSGMKTDGQVNPKHALDVVKGKILDFLDAFERRKSMESEVNNISNATIQNMDDLSKIFSEFIQKSSQPVCMSWLRNE
LSMENSDSSKAFLSLMSETFKAEDNILPGIKKSGKEELYAELMHFLSFGARSRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTL
STRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLGSQQIDLPGSRQANIDNWWMKHILRRRETLSSQLYYVVIRSFAMPVKRTKELRALRGWRYYFSLLIELSDIT
MPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK