; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011404 (gene) of Chayote v1 genome

Gene IDSed0011404
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationLG03:9686906..9691421
RNA-Seq ExpressionSed0011404
SyntenySed0011404
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-18181.21Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKW ALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTV ISFPGGFP+NSI+F+FVLASWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL+SKTDKNFFNSGD+VVCS VQLCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_022951427.1 uncharacterized protein LOC111454247 [Cucurbita moschata]3.1e-18180.97Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKW ALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTV ISFPGGFP+NSI+F+FVLASWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLL L+SKTDKNFFNSGD+VVCS VQLCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima]2.3e-18180.51Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKWCALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AY+LFPSFF ELAHKIIFFSTV ISFPGGFP+NSI+F+FVL SWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL+SKT+KNFFNSGD+VVCS VQLCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_023537461.1 uncharacterized protein LOC111798502 [Cucurbita pepo subsp. pepo]6.8e-18180.93Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKW ALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTV ISFPGGFP+NSI+F+FVLASWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL+SKTDKNFFNSGD+VVCS V+LCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVL
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVL
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVL

XP_038884363.1 uncharacterized protein LOC120075225 [Benincasa hispida]2.3e-18181.15Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SGE+ LPLL  HS+ R  SKSL S D+LR+F+ISLKWCALDHSSAA KF SY  F  L F VP V I+AVRDP  DPISFNKLVQVPESGLAAVGF S
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFP---GGFPVNSIVFVFVLASWVYRTGVFLLVCI
        LCRFFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTV ISFP    GFPVNSI+F+FVL SW+YRTGVFLLVC+
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFP---GGFPVNSIVFVFVLASWVYRTGVFLLVCI

Query:  LFRLTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFF
        LFRLTCELQILRLQGV++L ETS G GSD V+IFNEHLRIRKQL VTSHRYRFFIIVCLLVITVSQFVALLLVL+SKTDKNFFNSGDVVVCSAVQLCGFF
Subjt:  LFRLTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFF

Query:  LCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKS-PLPPDVVEDKE-GGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGG
        LC+FGAARITHRAQGIA+VATRWHM+VTSAA  S+ TKS PLPP V + K+     N D   D+DS SSSDVLI V+TQ+PSSFQ RQALVSYLQHNNGG
Subjt:  LCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKS-PLPPDVVEDKE-GGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGG

Query:  ITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        ITLFG+ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  ITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein6.2e-17277.55Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+S E+ LPLL  HS+ RS S    ++D+LR+++ISLKWCALDHSSAA KF SY  F +L   VPAV I+AV+DP  DPISFNKLVQVPESGLAAVGF S
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTV I FP GFPVNSI+F+ VL SWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGV++L ETS G  SD+V+IFNEHLRIRKQL +TSHRYRFFIIVCLLVITVSQFVALLLVL+SKTDKNFFNSGDVVVCS VQLCGFFLC+
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTK-SPLPPDVVEDKEGGGGNDDG--GADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGIT
        FGAARITHRAQGIA+VATRWHM+VTSAA  S+ TK  PL   +V  K+    +  G    D+DS SSSDVLI V  ++ SSFQTRQALVSYLQHNNGGIT
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTK-SPLPPDVVEDKEGGGGNDDG--GADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGIT

Query:  LFGFALDRGLLHTLFAFEFSLVMWILSKVVVL
        LFG+ALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  LFGFALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A5A7VCT9 Uncharacterized protein7.1e-17678.01Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SGE+ LPLL  HS+ RS S    ++D+LR+F+ISLKWCALDHSSA  KF SY AF  L   VPAV ++AV+DP  DPISFNKLVQVPESGLAAV F S
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTV ISFP GFP+NSI+F+FVL SWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGV++L ETS G  SD+V+IFNEHLRIRKQL +TSHRYRFFIIVCLLVITVSQFVALLLVL+SK DKNFFNSGDVVVCS VQLCGFFLC+
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLPPDVVEDKEGGGGNDDG--GADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITL
        FGAARITHRAQGIA++ATRWHM+VTSAA  S+ TKSP     V  K+    +  G    D+DS SSSDVLI V  Q+ SSFQTRQALVSYLQHNNGGITL
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLPPDVVEDKEGGGGNDDG--GADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITL

Query:  FGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        FG+ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  FGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1BR58 uncharacterized protein LOC1110050152.1e-16779.42Show/hide
Query:  SKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQ
        SKSL S DELR+F+ISLKWCALDHSS A KF SY AF + A  VPAV IVAV  P  DPISFNKLVQVPESGLAA+GFF+L RFFRRYGLRQLLFLDGLQ
Subjt:  SKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQ

Query:  EDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETS
        EDS+YVQRGYARELDKSFRY+AYILFPSFFVELAHKI FF+TV ISFPGGFP+NSI+FV VL +WVYRTGVFLLVCILFRLTCELQILRLQGV +L E S
Subjt:  EDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETS

Query:  IGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRW
         G GS+A +IF+EH+RI+KQL VTSHRYRFFII CLLVITVSQFVALLLVL+SKT KNFFNSGDVVVCSAVQLCGFFLC+FGAARITHRAQGI ++A+RW
Subjt:  IGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRW

Query:  HMVVTS--AAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFS
        HM+VTS  +A  SD TK P P PD       G G D    DSDSS    VLI VS  QPSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFS
Subjt:  HMVVTS--AAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFS

Query:  LVMWILSKVVVLS
        LVMWILSKVVVLS
Subjt:  LVMWILSKVVVLS

A0A6J1GHK9 uncharacterized protein LOC1114542471.5e-18180.97Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKW ALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTV ISFPGGFP+NSI+F+FVLASWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLL L+SKTDKNFFNSGD+VVCS VQLCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1KJB4 uncharacterized protein LOC1114962251.1e-18180.51Show/hide
Query:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS
        M+SG+  LPLL  HS+ R  SKSL S DELR+F+ISLKWCALDHSSAA KF SY AF IL   VP V IV+VRDP  DPIS NKLVQVPESGLAA+GFFS
Subjt:  MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFS

Query:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR
        LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AY+LFPSFF ELAHKIIFFSTV ISFPGGFP+NSI+F+FVL SWVYRTGVFLLVC+LFR
Subjt:  LCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV
        LTCELQILRLQGVN+L ETS G GSDA++IFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL+SKT+KNFFNSGD+VVCS VQLCGFFLCV
Subjt:  LTCELQILRLQGVNRLLETSIGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCV

Query:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF
         GAARITHRAQ IA+VATRWHM+VTSAA   + TK P P P  V         D+ G D+DS SSS+VLI +STQ+PSSFQTRQALVSYLQHNNGGITLF
Subjt:  FGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLP-PDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLF

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GFALDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)3.9e-6537.26Show/hide
Query:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL
        DEL  F+  L+W  +DHSS      S+  F +    VPA+   ++A  D D      ++ +VQ+  S +A V F  L RF  +YGLR+ LF D L ++S 
Subjt:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL

Query:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI
         V+R Y  +L+ S   V+Y + P F    A+KI ++++    I F G   + +++  +  L SW+YRT V  LVC+LFRL C LQILRLQ   +L +   
Subjt:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI

Query:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH
         + SD   I +EHLRIR+ LR+ SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + +  A++ITH+AQ +  +A +WH
Subjt:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH

Query:  MVVT----SAAVESDCTKSPLPPDVVEDKEGGGG-----------NDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRG
        +  T       VES       P  V  +                 +D+ G + D   ++D+ IPV      SFQ RQALVSY ++N+ GIT++GF LDRG
Subjt:  MVVT----SAAVESDCTKSPLPPDVVEDKEGGGG-----------NDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRG

Query:  LLHTLFAFEFSLVMWILSKVVVLS
         LHT+F  E SLV+W+L K + +S
Subjt:  LLHTLFAFEFSLVMWILSKVVVLS

AT1G50630.2 Protein of unknown function (DUF3537)2.3e-4934.81Show/hide
Query:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL
        DEL  F+  L+W  +DHSS      S+  F +    VPA+   ++A  D D      ++ +VQ+  S +A V F  L RF  +YGLR+ LF D L ++S 
Subjt:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL

Query:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI
         V+R Y  +L+ S   V+Y + P F    A+KI ++++    I F G   + +++  +  L SW+YRT V  LVC+LFRL C LQILRLQ   +L +   
Subjt:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI

Query:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH
         + SD   I +EHLRIR+ LR+ SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + +  A++ITH+AQ +  +A +WH
Subjt:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH

Query:  MVVT----SAAVESDCTKSPLPPDVVEDKEGGGG-----------NDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQH
        +  T       VES       P  V  +                 +D+ G + D   ++D+ IPV      SFQ RQAL   +Q+
Subjt:  MVVT----SAAVESDCTKSPLPPDVVEDKEGGGG-----------NDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQH

AT2G21080.1 unknown protein5.9e-11455.41Show/hide
Query:  MNSGETTLPL--LPSHSHV-RSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPI------SFNKLVQVPES
        MN GE    L  LP+ S+     SKSL S D+LR+F++ LKWCALDHSS+  K  SY  F +    VP +  + ++ P   P       SFN LVQ PES
Subjt:  MNSGETTLPL--LPSHSHV-RSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPI------SFNKLVQVPES

Query:  GLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFP----GGFPVNSIVFVFVLASWVY
        GLA +GF +L  FFR Y L +LLFLD    DS  V+ GY+RELDK+ RY+AYIL PSF VEL HK IFF +  +SFP        +N ++F  VL SWVY
Subjt:  GLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFP----GGFPVNSIVFVFVLASWVY

Query:  RTGVFLLVCILFRLTCELQILRLQGVNRLLETSIGLGSDAVM-IFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVV
        RTGVFLLVCILFRLTCELQILR +G+++L +     GSD +  +  EH+RI+KQL  TSHRYRFFII   +VI+ SQFVALLLVL+SK++K+F +SGD+V
Subjt:  RTGVFLLVCILFRLTCELQILRLQGVNRLLETSIGLGSDAVM-IFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVV

Query:  VCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLPPDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALV
        VCSAVQL GFFLC+ GAARITHRAQG+  +ATRWHM +T       C    + P+               +D+DSS +  + +  S    S FQ RQALV
Subjt:  VCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCTKSPLPPDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALV

Query:  SYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
         YL+HNN GITL+G+ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  SYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT3G20300.1 Protein of unknown function (DUF3537)3.3e-6437.23Show/hide
Query:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL
        DEL  F+  L+W  +D SS      S++ F +    VPA    ++A  D D      ++ +VQ+  S  AA+ F  L RF  +YGLR+ LF D L ++S 
Subjt:  DELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAV--VIVAVRDPDG-DPISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSL

Query:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI
         V+ GY  +L++S + ++Y + P F    ++KI ++++    I F G   + +++  +  L SW+YRT V  LVC+LFRL C LQILRLQ   ++ +   
Subjt:  YVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VNISFPGGFPV-NSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSI

Query:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH
         + SD   I +EHLRIR+ LR+ SHRYR FI++ L+++T SQF +LL+   +  + N + +G++ +CS   +    + +  A++ITH+AQ +  +A +WH
Subjt:  GLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWH

Query:  MVVTSAAVESDCTKSPLPPDVVEDKEGGG--GNDDGGADSDSSSSSDV--------LIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTL
        +  T   +ES  T     P +V+   G G    DD   +SDS    D         LIP       SFQ RQALV+Y ++N  GIT+FGF LDR  LHT+
Subjt:  MVVTSAAVESDCTKSPLPPDVVEDKEGGG--GNDDGGADSDSSSSSDV--------LIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        F  E SLV+W+L K + +S
Subjt:  FAFEFSLVMWILSKVVVLS

AT4G22270.1 Protein of unknown function (DUF3537)4.7e-4731.58Show/hide
Query:  DHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGD---PISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFR
        D S+      S++ F +L   VP +    +   D D      ++ +VQ+  S  A + F SL  + R++G+R+ LFLD L + S  V+  Y  E+ +S +
Subjt:  DHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGD---PISFNKLVQVPESGLAAVGFFSLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFR

Query:  YVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFV-----LASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSIGLGSDAVMIFNEH
         +   + PS  +E  ++I ++ +     P  + +N I+   V     L+SW+YR  +F++VCIL+++TC LQ LRL    R   + I   +D      EH
Subjt:  YVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFV-----LASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETSIGLGSDAVMIFNEH

Query:  LRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCT
         +IR+ LR+ SHR+R FI++ L+++T +QF+ALL    +    N +  G++ +CS   + G F+C+  A +ITH+AQ + ++A +W++  T  + +    
Subjt:  LRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWHMVVTSAAVESDCT

Query:  KSPLPPDVVEDKEGGGGN-----DDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVV
        ++P    ++E +    GN     DD   + D    +  + P+      S+Q RQALV+YL++N  GIT++GF +DR  L+T+F  E +L++W+L+K +V
Subjt:  KSPLPPDVVEDKEGGGGN-----DDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCCGGCGAAACAACCCTCCCTCTCCTCCCCAGCCACTCCCACGTCCGATCCTCCTCCAAATCCCTAGCCTCCACCGACGAGCTCCGCCACTTCAAAATCAGCCT
AAAATGGTGCGCACTCGACCACTCCTCCGCCGCCGCAAAGTTCGCCTCGTACGCCGCCTTCGCGATTCTCGCCTTCGGGGTCCCGGCCGTGGTGATCGTGGCCGTGAGGG
ATCCGGACGGCGATCCGATCTCGTTCAACAAGCTGGTCCAGGTGCCGGAGTCCGGGCTGGCGGCGGTCGGGTTCTTCAGCCTGTGCCGGTTTTTCCGGCGGTACGGACTC
CGGCAGCTCTTGTTCTTGGACGGATTGCAAGAGGATTCGCTGTACGTCCAGAGAGGCTATGCGCGTGAACTCGACAAGTCGTTCCGTTACGTTGCGTACATTCTTTTTCC
GTCCTTCTTCGTCGAATTGGCTCATAAAATCATTTTCTTCTCTACCGTTAATATCTCCTTTCCGGGAGGGTTTCCAGTAAACTCGATTGTGTTCGTGTTCGTGTTGGCGT
CGTGGGTGTATCGGACCGGAGTGTTCTTGCTCGTGTGCATATTGTTTCGACTCACGTGCGAGTTGCAGATACTAAGGCTACAAGGGGTGAACAGGTTGCTTGAGACGAGC
ATCGGGTTGGGATCGGATGCGGTGATGATCTTCAACGAACACTTGAGGATACGAAAACAGTTGAGGGTTACAAGCCACCGATACCGGTTCTTCATAATTGTGTGCTTGTT
GGTTATCACTGTTAGCCAGTTCGTGGCTCTGCTGCTGGTTTTGTCTTCCAAAACTGATAAGAACTTCTTCAATTCTGGTGATGTTGTGGTTTGCTCTGCAGTTCAATTAT
GCGGATTTTTCTTGTGCGTGTTTGGGGCTGCAAGAATTACGCATAGAGCTCAAGGAATTGCAGCCGTTGCAACCAGATGGCACATGGTCGTCACTTCCGCCGCGGTCGAG
TCCGACTGTACTAAATCTCCGCTGCCACCGGACGTAGTCGAAGACAAGGAAGGCGGAGGCGGAAACGATGACGGTGGCGCAGATTCAGACTCCTCGTCCTCCTCGGACGT
TTTAATTCCAGTGAGCACTCAGCAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCGTATTTGCAGCACAACAACGGCGGAATTACGTTGTTTGGATTTGCATTGG
ATAGAGGACTGCTTCATACGCTCTTCGCATTTGAATTCTCTCTGGTTATGTGGATTCTAAGTAAGGTCGTCGTATTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTCCGGCGAAACAACCCTCCCTCTCCTCCCCAGCCACTCCCACGTCCGATCCTCCTCCAAATCCCTAGCCTCCACCGACGAGCTCCGCCACTTCAAAATCAGCCT
AAAATGGTGCGCACTCGACCACTCCTCCGCCGCCGCAAAGTTCGCCTCGTACGCCGCCTTCGCGATTCTCGCCTTCGGGGTCCCGGCCGTGGTGATCGTGGCCGTGAGGG
ATCCGGACGGCGATCCGATCTCGTTCAACAAGCTGGTCCAGGTGCCGGAGTCCGGGCTGGCGGCGGTCGGGTTCTTCAGCCTGTGCCGGTTTTTCCGGCGGTACGGACTC
CGGCAGCTCTTGTTCTTGGACGGATTGCAAGAGGATTCGCTGTACGTCCAGAGAGGCTATGCGCGTGAACTCGACAAGTCGTTCCGTTACGTTGCGTACATTCTTTTTCC
GTCCTTCTTCGTCGAATTGGCTCATAAAATCATTTTCTTCTCTACCGTTAATATCTCCTTTCCGGGAGGGTTTCCAGTAAACTCGATTGTGTTCGTGTTCGTGTTGGCGT
CGTGGGTGTATCGGACCGGAGTGTTCTTGCTCGTGTGCATATTGTTTCGACTCACGTGCGAGTTGCAGATACTAAGGCTACAAGGGGTGAACAGGTTGCTTGAGACGAGC
ATCGGGTTGGGATCGGATGCGGTGATGATCTTCAACGAACACTTGAGGATACGAAAACAGTTGAGGGTTACAAGCCACCGATACCGGTTCTTCATAATTGTGTGCTTGTT
GGTTATCACTGTTAGCCAGTTCGTGGCTCTGCTGCTGGTTTTGTCTTCCAAAACTGATAAGAACTTCTTCAATTCTGGTGATGTTGTGGTTTGCTCTGCAGTTCAATTAT
GCGGATTTTTCTTGTGCGTGTTTGGGGCTGCAAGAATTACGCATAGAGCTCAAGGAATTGCAGCCGTTGCAACCAGATGGCACATGGTCGTCACTTCCGCCGCGGTCGAG
TCCGACTGTACTAAATCTCCGCTGCCACCGGACGTAGTCGAAGACAAGGAAGGCGGAGGCGGAAACGATGACGGTGGCGCAGATTCAGACTCCTCGTCCTCCTCGGACGT
TTTAATTCCAGTGAGCACTCAGCAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCGTATTTGCAGCACAACAACGGCGGAATTACGTTGTTTGGATTTGCATTGG
ATAGAGGACTGCTTCATACGCTCTTCGCATTTGAATTCTCTCTGGTTATGTGGATTCTAAGTAAGGTCGTCGTATTATCTTAG
Protein sequenceShow/hide protein sequence
MNSGETTLPLLPSHSHVRSSSKSLASTDELRHFKISLKWCALDHSSAAAKFASYAAFAILAFGVPAVVIVAVRDPDGDPISFNKLVQVPESGLAAVGFFSLCRFFRRYGL
RQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVNISFPGGFPVNSIVFVFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVNRLLETS
IGLGSDAVMIFNEHLRIRKQLRVTSHRYRFFIIVCLLVITVSQFVALLLVLSSKTDKNFFNSGDVVVCSAVQLCGFFLCVFGAARITHRAQGIAAVATRWHMVVTSAAVE
SDCTKSPLPPDVVEDKEGGGGNDDGGADSDSSSSSDVLIPVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS