CuGenDBv2

Sed0023867 (gene) of Chayote v1 genome

Gene ID: Sed0023867
Organism: Sechium edule (Chayote v1)
Description: DUF3685 domain-containing protein
Genome location: LG07:1766553..1774198
RNA-Seq Expression: Sed0023867
Synteny: Sed0023867
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR022552 - Uncharacterised protein family Ycf55
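
The genome location field uses a sequence:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the field can be split into its parts as follows. The function name parse_location is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG07:1766553..1774198' into (sequence, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return seq_id, start, end, end - start + 1

print(parse_location("LG07:1766553..1774198"))
```

Under that assumption the gene spans 7646 bp on LG07.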


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6575973.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 4.0e-252; identity: 87.18%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAE+V V PC+KLQ+ RTPF+ +S   C+FSFKREQR+ SC  YKFTRISTWRRR LSGFRGSNLIV+PAPRK F EHAY+RSLVNVDGT ASE+LFVDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLMTSIFLTYMAGVIP+ KSNQPGNIIS+TNSASDNPTFSGS MKTDDQIN K+AL +VKGKILDFLDAFERR+S+E EV+EFAE HAKQPLSLNAI E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNN+S  TIQNMDDLS  FSKFIQKSS P+CMSWLK+ELSM+NNDSS+AFLSLMSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDY YY+++L+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEV NIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQANIDNWWMKHILRR ETL+S+L Y VI SF+MPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

XP_022151384.1 uncharacterized protein LOC111019333 isoform X1 [Momordica charantia] (e-value: 4.0e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

XP_022151393.1 uncharacterized protein LOC111019333 isoform X2 [Momordica charantia] (e-value: 4.0e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

XP_022151402.1 uncharacterized protein LOC111019333 isoform X3 [Momordica charantia] (e-value: 4.0e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

XP_022953640.1 uncharacterized protein LOC111456110 isoform X2 [Cucurbita moschata] (e-value: 8.0e-253; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAE+V V PC+KLQ+ RTPF+ +S   C+FSFKREQR+ SC  YKFTRISTWRRR LSGFRGSNLIV+PAPRK F EHAY+RSLVNVDGT ASE+LFVDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLMTSIFLTYMAGVIP+ KSNQPGNIIS+TNSASDNPTFSGSGMKTDDQIN K+AL +VKGKILDFLDAFERR+S+E EV+EFAE HAKQPLSLNAI E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNN+S  TIQNMDDLS  FSKFIQKSS P+CMSWLK+ELSM+NNDSS+AFLSLMSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDY YY+++L+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEV NIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQANIDNWWMKHILRR ETL+S+L Y VI SF+MPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK
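
The unlabeled middle line of each alignment block appears to follow the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the helper below (midline_stats is a hypothetical name) counts identities and positives for one block. The middle line must be passed without its leading indentation but with its internal spaces intact.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST-style middle line."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First 14 columns of one block from the hits above (query: TLGSQQIELPRSRQ).
ident, pos, cols = midline_stats("TL SQQIELP SRQ")
print(f"{ident}/{cols} identical ({100 * ident / cols:.1f}%), {pos}/{cols} positive")
```

The % identity reported for each hit is computed over the whole alignment, so a figure derived from a single block will generally differ from it.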

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DC25 uncharacterized protein LOC111019333 isoform X2 (e-value: 1.9e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

A0A6J1DCX7 uncharacterized protein LOC111019333 isoform X1 (e-value: 1.9e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

A0A6J1DDE2 uncharacterized protein LOC111019333 isoform X3 (e-value: 1.9e-252; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAEHVAVTPC+KLQ+ RTPFK +S T CNFSFK EQRK SC+  KF RIS WRR +LSGF GS LIVNPAPRKTF EHAY+RSLVNVDGT ASE+L VDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLM SIFLTYMAGVIP+ KSNQPG+IISHT++ASDNPTFSGSGMKT+DQIN K+AL +VKGKILDFLDAFERR+SME EV EFAECH KQPLSLNAI+E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSK FSKFIQKSS P+C SWLK ELSME NDSS+AFLSLMSEKLKAEDNIL+GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDY YY+H+LYVKHGISILEDLLITFADGIASMYLEFISVDSSF DEV N+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQA IDNWWM+  LRRTETL+SQLHY VIRSFSMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

A0A6J1GNV1 uncharacterized protein LOC111456110 isoform X1 (e-value: 9.5e-252; identity: 87.21%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAE+V V PC+KLQ+ RTPF+ +S   C+FSFKREQR+ SC  YKFTRISTWRRR LSGFRGSNLIV+PAPRK F EHAY+RSLVNVDGT ASE+LFVDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLMTSIFLTYMAGVIP+ KSNQPGNIIS+TNSASDNPTFSGSGMKTDDQIN K+AL +VKGKILDFLDAFERR+S+E EV+EFAE HAKQPLSLNAI E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNN+S  TIQNMDDLS  FSKFIQKSS P+CMSWLK+ELSM+NNDSS+AFLSLMSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGAR-RDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDL
        LSFG R RDY YY+++L+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEV NIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDL
Subjt:  LSFGAR-RDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDL

Query:  CTLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGR
        CTL SQQIELP SRQANIDNWWMKHILRR ETL+S+L Y VI SF+MPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGR
Subjt:  CTLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGR

Query:  SLGLIYTGIRQSLRWK
        SLGLIYTGIRQSLRWK
Subjt:  SLGLIYTGIRQSLRWK

A0A6J1GQ87 uncharacterized protein LOC111456110 isoform X2 (e-value: 3.9e-253; identity: 87.38%)
Query:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ
        MAE+V V PC+KLQ+ RTPF+ +S   C+FSFKREQR+ SC  YKFTRISTWRRR LSGFRGSNLIV+PAPRK F EHAY+RSLVNVDGT ASE+LFVDQ
Subjt:  MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQ

Query:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE
        LLLMTSIFLTYMAGVIP+ KSNQPGNIIS+TNSASDNPTFSGSGMKTDDQIN K+AL +VKGKILDFLDAFERR+S+E EV+EFAE HAKQPLSLNAI E
Subjt:  LLLMTSIFLTYMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISE

Query:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF
        GPRLRLLWASFQLIEEEVNN+S  TIQNMDDLS  FSKFIQKSS P+CMSWLK+ELSM+NNDSS+AFLSLMSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  GPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHF

Query:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDY YY+++L+VKHGISILEDLLITFADGIASMYLEFISVDSSFFDEV NIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQANIDNWWMKHILRR ETL+S+L Y VI SF+MPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

SwissProt top hits (e-value, % identity, alignment)
P74126 Ycf55-like protein (e-value: 2.0e-04; identity: 28.1%)
Query:  FDLCTLGSQQIELPRSRQA---NIDNWWM--KHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF
        ++   L S+Q+   R+  A       +W+  K I     TL   L  A I +  +   R +EL  LR   ++ ++++E  D  +P +R VI+ + +G+ F
Subjt:  FDLCTLGSQQIELPRSRQA---NIDNWWM--KHILRRTETLTSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF

Query:  FLVCLIGRSLGLIYTGIRQSL
         L  ++GR++GL+  GI Q +
Subjt:  FLVCLIGRSLGLIYTGIRQSL

Arabidopsis top hits (e-value, % identity, alignment)
AT5G48830.1 unknown protein (e-value: 3.1e-109; identity: 46.37%)
Query:  MAEHVAVTP--CVKLQVWRTPFKTRSLTACNFSFKREQRK---LSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASE-
        M  HV V+P   V+L++        S  + N   K  QR    +S   YKF  +ST            NL  + +   +  +     SL + DG   S  
Subjt:  MAEHVAVTP--CVKLQVWRTPFKTRSLTACNFSFKREQRK---LSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASE-

Query:  ILFVDQLLLMTSIFLTYMAGVIPLSK-SNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPL
        +   DQ+LL  SIFLTYMAGVIP+ K S       +      +  T   SG +TD + +LK    +VK K+LD LDA +R  ++  +V++      K PL
Subjt:  ILFVDQLLLMTSIFLTYMAGVIPLSK-SNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPL

Query:  SLNAISEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEEL
        SL AISEGP+L LLW+ FQ +EEE N IS T   N D+   +F++ ++++ Q  C +WLK EL +EN DS  A   L+   L  +D I   I+KSGKE+L
Subjt:  SLNAISEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEEL

Query:  YAELMHFLSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMY
        +AE ++F  FG+    + Y+ + +  HG++ILED +IT ADG+AS+YLE ISVDS F +E+ + GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+VSMY
Subjt:  YAELMHFLSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMY

Query:  EDRFDLCTLGSQQI-ELPRSRQANIDNWWMKHILRRTETL-TSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF
        EDRFDL  L +Q I  L  S      +WW K  L +T+   +S L Y++I  FS+PVKRTKEL+AL GWRYYFSL +ELSDI  P+IRVV+DK+SS ISF
Subjt:  EDRFDLCTLGSQQI-ELPRSRQANIDNWWMKHILRRTETL-TSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF

Query:  FLVCLIGRSLGLIYTGIRQSLRWK
        FLV LIGRS+GLI+TGIRQSLRWK
Subjt:  FLVCLIGRSLGLIYTGIRQSLRWK

AT5G48830.2 unknown protein (e-value: 2.1e-105; identity: 45.57%)
Query:  MAEHVAVTP--CVKLQVWRTPFKTRSLTACNFSFKREQRK---LSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASE-
        M  HV V+P   V+L++        S  + N   K  QR    +S   YKF  +ST            NL  + +   +  +     SL + DG   S  
Subjt:  MAEHVAVTP--CVKLQVWRTPFKTRSLTACNFSFKREQRK---LSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASE-

Query:  ILFVDQLLLMTSIFLTYMAGVIPLSK-SNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPL
        +   DQ+LL  SIFLTYMAGVIP+ K S       +      +  T   SG +TD + +LK    +VK K+LD LDA +R  ++  +V++      K PL
Subjt:  ILFVDQLLLMTSIFLTYMAGVIPLSK-SNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPL

Query:  SLNAISEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSS-------EAFLSLMSEKLKAEDNILKGIK
        SL AISEGP+L LLW+ FQ +EEE N IS T   N D+   +F++ ++++ Q  C +WLK EL +EN DS        +A   L+   L  +D I   I+
Subjt:  SLNAISEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSS-------EAFLSLMSEKLKAEDNILKGIK

Query:  KSGKEELYAELMHFLSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNV
        KSGKE+L+AE ++F  FG+    + Y+ + +  HG++ILED +IT ADG+AS+YLE ISVDS F +E+ + GL++C+LS+RALQ+LRNEVA+ QWL+QN+
Subjt:  KSGKEELYAELMHFLSFGARRDYYYYEHNLYVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNV

Query:  EAIVSMYEDRFDLCTLGSQQI-ELPRSRQANIDNWWMKHILRRTETL-TSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDK
        EA+VSMYEDRFDL  L +Q I  L  S      +WW K  L +T+   +S L Y++I  FS+PVKRTKEL+AL GW YYFSL +ELSDI  P+IRVV+DK
Subjt:  EAIVSMYEDRFDLCTLGSQQI-ELPRSRQANIDNWWMKHILRRTETL-TSQLHYAVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDK

Query:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        +SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK


Sequences
CDS sequence
ATGGCAGAGCATGTTGCTGTCACACCTTGTGTCAAGCTGCAAGTTTGGAGAACTCCCTTCAAAACCAGAAGCCTCACAGCATGCAATTTCAGTTTTAAAAGAGAACAAAG
AAAATTGTCTTGTGACGGCTATAAGTTCACAAGGATCTCAACTTGGAGAAGGCGTGAGCTTAGTGGTTTTCGTGGCTCAAACTTGATTGTAAATCCTGCTCCCAGGAAGA
CCTTCTCAGAGCATGCTTACGTGAGATCTTTGGTAAACGTTGATGGAACAATAGCCTCTGAGATACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACG
TATATGGCTGGAGTAATACCTCTATCAAAGTCTAATCAACCTGGAAATATCATCTCTCATACCAATTCAGCCTCAGATAACCCGACCTTTTCTGGTAGTGGCATGAAGAC
TGATGATCAAATAAATCTGAAGCATGCATTAGTTATAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAGAAGTATGGAAAAAGAGGTAATTGAAT
TTGCAGAGTGTCATGCCAAGCAACCTCTAAGCTTGAATGCAATTTCTGAGGGTCCAAGGTTAAGATTGCTTTGGGCTTCTTTTCAACTAATCGAGGAAGAGGTCAATAAT
ATCTCCAATACTACTATTCAGAACATGGATGATTTGTCTAAAACATTTTCAAAATTTATCCAAAAATCCTCTCAACCTCTTTGCATGTCTTGGCTGAAAAGCGAACTGTC
GATGGAAAATAATGACTCCAGTGAGGCATTTCTATCTTTAATGTCTGAGAAGCTTAAAGCAGAAGATAATATTTTAAAAGGTATTAAAAAGTCTGGCAAGGAAGAGCTGT
ACGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTACTATTATGAGCACAACCTGTATGTCAAGCATGGGATTTCAATATTAGAAGATTTGCTGATA
ACCTTTGCGGACGGGATTGCAAGCATGTATCTGGAATTCATTTCTGTTGACAGCAGTTTCTTTGATGAAGTGGTTAACATCGGCTTGGCATTGTGTACCCTATCAACACG
AGCACTCCAAAGATTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGCAATAGTATCGATGTATGAAGACCGATTTGATTTATGCACACTTGGTA
GTCAACAGATTGAGCTACCAAGGAGTAGACAAGCCAATATTGATAACTGGTGGATGAAACATATCCTCAGAAGAACAGAAACTTTGACCTCTCAGTTACATTATGCTGTG
ATACGCTCCTTCTCCATGCCTGTAAAGCGGACTAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGATATCACGACGCCGAT
GATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCTAGTCTGCTTGATTGGAAGATCTTTAGGGCTCATTTATACAGGAATCAGACAATCTCTAAGGT
GGAAATAA
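
The CDS above can be checked against the protein sequence listed on this page by translating it. The following is a minimal sketch that assumes the standard nuclear genetic code and an in-frame sequence starting at the first base; the name translate is illustrative and not taken from any library. Codons containing ambiguous bases such as N would raise a KeyError in this simple version.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the CDS above.
print(translate("ATGGCAGAGCATGTTGCTGTCACACCTTGTGTC"))  # MAEHVAVTPCV
```

Passing the full CDS, line breaks included, should reproduce the protein sequence given on this page if the assumptions hold.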
mRNA sequence
AATGATGAAAGTATCATGCTACCCTCCCAAATAAAAATCATCAATGGAATTTAGGTCGAAAATTGATAGGTGGTGGTAAAAAAAAAACTCCGTTGATTGTTGGTTTTTGT
GATTGGGATAATAAAGAAAGAAGAGTACGCACATTTCCAATCCTCATCTTCTCCATTAATCTTCTTCTTCGCTTCACAGCCTTTTCTCATGTTCAACTCTACACTCTTCG
ATGTTCAACCCATAGATTCAAATGTTCTTTCTCACAATTCATCCGATTTTGACTCTTACTTCAATCTCTGACTTCTCTCCATAGAAGTTGAGGTCTCGATTGATCTTCAA
GGTTAACGGCTCCGCCTCCGATTTCTATTCACGATGGCAGAGCATGTTGCTGTCACACCTTGTGTCAAGCTGCAAGTTTGGAGAACTCCCTTCAAAACCAGAAGCCTCAC
AGCATGCAATTTCAGTTTTAAAAGAGAACAAAGAAAATTGTCTTGTGACGGCTATAAGTTCACAAGGATCTCAACTTGGAGAAGGCGTGAGCTTAGTGGTTTTCGTGGCT
CAAACTTGATTGTAAATCCTGCTCCCAGGAAGACCTTCTCAGAGCATGCTTACGTGAGATCTTTGGTAAACGTTGATGGAACAATAGCCTCTGAGATACTTTTTGTTGAT
CAATTGCTTCTGATGACCAGTATATTTCTAACGTATATGGCTGGAGTAATACCTCTATCAAAGTCTAATCAACCTGGAAATATCATCTCTCATACCAATTCAGCCTCAGA
TAACCCGACCTTTTCTGGTAGTGGCATGAAGACTGATGATCAAATAAATCTGAAGCATGCATTAGTTATAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAAC
GTAGGAGAAGTATGGAAAAAGAGGTAATTGAATTTGCAGAGTGTCATGCCAAGCAACCTCTAAGCTTGAATGCAATTTCTGAGGGTCCAAGGTTAAGATTGCTTTGGGCT
TCTTTTCAACTAATCGAGGAAGAGGTCAATAATATCTCCAATACTACTATTCAGAACATGGATGATTTGTCTAAAACATTTTCAAAATTTATCCAAAAATCCTCTCAACC
TCTTTGCATGTCTTGGCTGAAAAGCGAACTGTCGATGGAAAATAATGACTCCAGTGAGGCATTTCTATCTTTAATGTCTGAGAAGCTTAAAGCAGAAGATAATATTTTAA
AAGGTATTAAAAAGTCTGGCAAGGAAGAGCTGTACGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTACTATTATGAGCACAACCTGTATGTCAAG
CATGGGATTTCAATATTAGAAGATTTGCTGATAACCTTTGCGGACGGGATTGCAAGCATGTATCTGGAATTCATTTCTGTTGACAGCAGTTTCTTTGATGAAGTGGTTAA
CATCGGCTTGGCATTGTGTACCCTATCAACACGAGCACTCCAAAGATTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGCAATAGTATCGATGT
ATGAAGACCGATTTGATTTATGCACACTTGGTAGTCAACAGATTGAGCTACCAAGGAGTAGACAAGCCAATATTGATAACTGGTGGATGAAACATATCCTCAGAAGAACA
GAAACTTTGACCTCTCAGTTACATTATGCTGTGATACGCTCCTTCTCCATGCCTGTAAAGCGGACTAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCT
GTTGATTGAATTATCCGATATCACGACGCCGATGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCTAGTCTGCTTGATTGGAAGATCTTTAGGGC
TCATTTATACAGGAATCAGACAATCTCTAAGGTGGAAATAATAAAGGTTTGATGCTTCTGATTTTTATTTGGCTGTTTCTCCATTATTTTGGTTTCCTTTTCTTTAGTTG
TATAAACTTGAATTTTGTTAGTTTTTAACAATTTTAGAGATTTTAAGCAATCTTTGATAGGGAAGAATGATTTTATTTTTATTCTTTTTTAGAAAAGAGATATCATAGAA
TGCAGAAGTGCATTTTTGCTTCCTATCATATGAATTTCACTGAAAA
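
The mRNA sequence contains the CDS plus flanking untranslated regions. As an illustrative sketch, assuming the CDS occurs as one contiguous, exact substring of the mRNA, the UTR lengths can be found with a plain string search. The function name utr_lengths and the toy input below are hypothetical.

```python
def utr_lengths(mrna, cds):
    """Return (5' UTR length, 3' UTR length) by locating the CDS inside the mRNA."""
    mrna = "".join(mrna.split()).upper()  # drop line breaks and normalize case
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Toy example: a 9-base CDS flanked by 6 bases on each side.
print(utr_lengths("TTCACG" + "ATGGCATAA" + "TAAAGG", "ATGGCATAA"))  # (6, 6)
```

Applied to the full mRNA and CDS from this page, the same call would report the UTR lengths for this transcript.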
Protein sequence
MAEHVAVTPCVKLQVWRTPFKTRSLTACNFSFKREQRKLSCDGYKFTRISTWRRRELSGFRGSNLIVNPAPRKTFSEHAYVRSLVNVDGTIASEILFVDQLLLMTSIFLT
YMAGVIPLSKSNQPGNIISHTNSASDNPTFSGSGMKTDDQINLKHALVIVKGKILDFLDAFERRRSMEKEVIEFAECHAKQPLSLNAISEGPRLRLLWASFQLIEEEVNN
ISNTTIQNMDDLSKTFSKFIQKSSQPLCMSWLKSELSMENNDSSEAFLSLMSEKLKAEDNILKGIKKSGKEELYAELMHFLSFGARRDYYYYEHNLYVKHGISILEDLLI
TFADGIASMYLEFISVDSSFFDEVVNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPRSRQANIDNWWMKHILRRTETLTSQLHYAV
IRSFSMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK