; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010023 (gene) of Snake gourd v1 genome

Gene IDTan0010023
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationLG02:94638255..94647608
RNA-Seq ExpressionTan0010023
SyntenyTan0010023
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]8.5e-20087.38Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTVKISFPGGFP NSI+FMFVLASWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLLVL SKTDKNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_022951427.1 uncharacterized protein LOC111454247 [Cucurbita moschata]2.5e-19987.15Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTVKISFPGGFP NSI+FMFVLASWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLL L SKTDKNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima]1.9e-19986.68Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AY+LFPSFF ELAHKIIFFSTVKISFPGGFP NSI+FMFVL SWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLLVL SKT+KNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_023537461.1 uncharacterized protein LOC111798502 [Cucurbita pepo subsp. pepo]7.2e-19987.12Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTVKISFPGGFP NSI+FMFVLASWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLLVL SKTDKNFFNSGD+VVCS V+LCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVL
        LDRGLLHTLFAFEFSLVMWI+SKVVVL
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVL

XP_038884363.1 uncharacterized protein LOC120075225 [Benincasa hispida]2.1e-19886.34Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSGE+NLPLLLQHSYCRSKSLTSD+LRNFRISLKWCALDHSSAAGKF SYF F FLT VVP   I+AVRDPADDPISFNKLVQVPESGLAA+GF +LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFP---GGFPANSIMFMFVLASWVYRTGVFLLVCILFR
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTVKISFP    GFP NSI+F+FVL SW+YRTGVFLLVC+LFR
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFP---GGFPANSIMFMFVLASWVYRTGVFLLVCILFR

Query:  LTCELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCL
        LTCELQILRLQGV KL ETSSGSGSD VVIFNEHLRI+KQL VTSHRYRFFIIVCL VITVSQFV LLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCL
Subjt:  LTCELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCL

Query:  FGAARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPPPPPIDDDKQD---GFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL
        FGAARITHRAQGIA+VATRWHMLVTSAA GSN TKS P PP  DDK+D       DD++  DSDSSSDVLISV+TQ+PSSFQ RQALVSYLQHNNGGITL
Subjt:  FGAARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPPPPPIDDDKQD---GFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL

Query:  FGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        FG+ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  FGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein2.5e-18983.22Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDS E+NLPLLLQHSYCRSKS+TSD+LRN+RISLKWCALDHSSAAGKF SY  F  LTIVVPA TI+AV+DP DDPISFNKLVQVPESGLAA+GF +LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTVKI FP GFP NSI+F+ VL SWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GS SD+V+IFNEHLRI+KQL +TSHRYRFFIIVCL VITVSQFV LLLVLASKTDKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPP--PPPIDDDKQD--GFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQGIA+VATRWHMLVTSAA GSN TK PP   P ++  K +     G+DED  DSDSSSDVLISV  ++ SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPP--PPPIDDDKQD--GFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG

Query:  FALDRGLLHTLFAFEFSLVMWILSKVVVL
        +ALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  FALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A5A7VCT9 Uncharacterized protein1.2e-19484.38Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSGE+NLPLLLQHSYCRSKS+TSD+LRNFRISLKWCALDHSSA GKF SY AF FLTIVVPA T++AV+DP DDPISFNKLVQVPESGLAA+ F +LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR VAYILFPSFFVELAHKIIFFSTVKISFP GFP NSI+F+FVL SWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GS SD+V+IFNEHLRI+KQL +TSHRYRFFIIVCL VITVSQFV LLLVLASK DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPPPPPIDDDKQ---DGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGF
        ARITHRAQGIA++ATRWHMLVTSAA GSN TKSPP  P  + K+   +   G+DED  DSDSSSDVLISV  Q+ SSFQTRQALVSYLQHNNGGITLFG+
Subjt:  ARITHRAQGIAAVATRWHMLVTSAA-GSNQTKSPPPPPIDDDKQ---DGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGF

Query:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS
        ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1BR58 uncharacterized protein LOC1110050152.9e-18584.45Show/hide
Query:  LLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKWCALDHSS AGKFVSYFAFV   +VVPA  IVAV  P DDPISFNKLVQVPESGLAAIGFFTL RFFRRYGLRQL
Subjt:  LLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQL

Query:  LFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDKSFRY+AYILFPSFFVELAHKI FF+TVKISFPGGFP NSIMF+ VL +WVYRTGVFLLVCILFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  IKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KL E SSGSGS+A VIF+EH+RIKKQL VTSHRYRFFII CL VITVSQFV LLLVLASKT KNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
Subjt:  IKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  AAVATRWHMLVT---SAAGSNQTKSPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLF
         ++A+RWHMLVT   SAAGS+QTK P P P      DG GGD +DTD    SS VLISVS  QPSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLF
Subjt:  AAVATRWHMLVT---SAAGSNQTKSPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLF

Query:  AFEFSLVMWILSKVVVLS
        AFEFSLVMWILSKVVVLS
Subjt:  AFEFSLVMWILSKVVVLS

A0A6J1GHK9 uncharacterized protein LOC1114542471.2e-19987.15Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AYILFPSFFVELAHKIIFFSTVKISFPGGFP NSI+FMFVLASWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLL L SKTDKNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1KJB4 uncharacterized protein LOC1114962259.1e-20086.68Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFV LT+VVP  TIV+VRDPADDPIS NKLVQVPESGLAAIGFF+LCR
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC
        FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDK+FRY+AY+LFPSFF ELAHKIIFFSTVKISFPGGFP NSI+FMFVL SWVYRTGVFLLVC+LFRLTC
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTC

Query:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV KL ETS+GSGSDA+VIFNEHLRI+KQL VTSHRYRFFIIVCLF+ITVSQFV LLLVL SKT+KNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA
        ARITHRAQ IA+VATRWHMLVTS AAG N TK  +P PP +D+D      GDD    DSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFGFA
Subjt:  ARITHRAQGIAAVATRWHMLVTS-AAGSNQTK--SPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)3.1e-6737.01Show/hide
Query:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLD
        R  S   DEL +FR  L+W  +DHSS     +S+  F+  T+VVPA +  ++A  D  +     ++ +VQ+  S +A + F  L RF  +YGLR+ LF D
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLD

Query:  GLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPG-GFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVI
         L ++S  V+R Y  +L+ S   V+Y + P F    A+KI ++++   +I F G    ++++  +  L SW+YRT V  LVC+LFRL C LQILRLQ   
Subjt:  GLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPG-GFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVI

Query:  KLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIA
        KL +  S  GS    I +EHLRI++ LR+ SHRYR FI+  L ++T SQF  LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIA

Query:  AVATRWHMLVT------SAAGSNQTKSPPP---------------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGG
         +A +WH+  T      +    +QT   P                  + +   D +G +++D D++D     +I V      SFQ RQALVSY ++N+ G
Subjt:  AVATRWHMLVT------SAAGSNQTKSPPP---------------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGG

Query:  ITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        IT++GF LDRG LHT+F  E SLV+W+L K + +S
Subjt:  ITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT1G50630.2 Protein of unknown function (DUF3537)1.8e-5134.6Show/hide
Query:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLD
        R  S   DEL +FR  L+W  +DHSS     +S+  F+  T+VVPA +  ++A  D  +     ++ +VQ+  S +A + F  L RF  +YGLR+ LF D
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLD

Query:  GLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPG-GFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVI
         L ++S  V+R Y  +L+ S   V+Y + P F    A+KI ++++   +I F G    ++++  +  L SW+YRT V  LVC+LFRL C LQILRLQ   
Subjt:  GLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPG-GFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVI

Query:  KLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIA
        KL +  S  GS    I +EHLRI++ LR+ SHRYR FI+  L ++T SQF  LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIA

Query:  AVATRWHMLVT------SAAGSNQTKSPPP---------------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQH
         +A +WH+  T      +    +QT   P                  + +   D +G +++D D++D     +I V      SFQ RQAL   +Q+
Subjt:  AVATRWHMLVT------SAAGSNQTKSPPP---------------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQH

AT2G21080.1 unknown protein1.7e-12159.53Show/hide
Query:  NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPI------SFNKLVQVPESGLAAIGFFTLCR
        +LP L       SKSL SD+LRNFR+ LKWCALDHSS+ GK VSY  FV  T++VP  + + ++ P + P       SFN LVQ PESGLA IGF TL  
Subjt:  NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPI------SFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFP----GGFPANSIMFMFVLASWVYRTGVFLLVCILF
        FFR Y L +LLFLD    DS  V+ GY+RELDK+ RY+AYIL PSF VEL HK IFF + ++SFP         N +MF  VL SWVYRTGVFLLVCILF
Subjt:  FFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFP----GGFPANSIMFMFVLASWVYRTGVFLLVCILF

Query:  RLTCELQILRLQGVIKLLETSSGSGSDAVV-IFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFL
        RLTCELQILR +G+ KL +     GSD +  +  EH+RIKKQL  TSHRYRFFII    VI+ SQFV LLLVLASK++K+F +SGD+VVCSAVQL GFFL
Subjt:  RLTCELQILRLQGVIKLLETSSGSGSDAVV-IFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFL

Query:  CLFGAARITHRAQGIAAVATRWHMLVTSAAGSNQTKSPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG
        CL GAARITHRAQG+  +ATRWHM +T A   ++  SP                + DTD SD+   + +S S    S FQ RQALV YL+HNN GITL+G
Subjt:  CLFGAARITHRAQGIAAVATRWHMLVTSAAGSNQTKSPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG

Query:  FALDRGLLHTLFAFEFSLVMWILSKVVVLS
        +ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  FALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT3G20300.1 Protein of unknown function (DUF3537)3.4e-6636.83Show/hide
Query:  QHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQ
        ++ + RS S   DEL +FR  L+W  +D SS     +S+  FV  T+VVPA +  ++A  D  +     ++ +VQ+  S  AA+ F  L RF  +YGLR+
Subjt:  QHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAAT--IVAVRD-PADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQ

Query:  LLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPGG-FPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILR
         LF D L ++S  V+ GY  +L++S + ++Y + P F    ++KI ++++   +I F G    ++++  +  L SW+YRT V  LVC+LFRL C LQILR
Subjt:  LLFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFST--VKISFPGG-FPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILR

Query:  LQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR
        LQ   ++ +  S  GS    I +EHLRI++ LR+ SHRYR FI++ L ++T SQF  LL+   +  + N + +G++ +CS   +    + L  A++ITH+
Subjt:  LQGVIKLLETSSGSGSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR

Query:  AQGIAAVATRWHMLVTSAAGSNQTKSPPP----------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGF
        AQ +  +A +WH+  T  +        P            P DDD  +    D  D +D D  ++ LI        SFQ RQALV+Y ++N  GIT+FGF
Subjt:  AQGIAAVATRWHMLVTSAAGSNQTKSPPP----------PPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGF

Query:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS
         LDR  LHT+F  E SLV+W+L K + +S
Subjt:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT4G22270.1 Protein of unknown function (DUF3537)8.8e-4630.73Show/hide
Query:  DHSSAAGKFVSYFAFVFLTIVVPAAT---IVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFR
        D S+     +S+  F  L ++VP  +   +V           ++ +VQ+  S  A I F +L  + R++G+R+ LFLD L + S  V+  Y  E+ +S +
Subjt:  DHSSAAGKFVSYFAFVFLTIVVPAAT---IVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLDGLQEDSLYVQRGYARELDKSFR

Query:  YVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFV-----LASWVYRTGVFLLVCILFRLTCELQILRLQGVIKLLETSSGSGSDAVVIFNEH
         +   + PS  +E  ++I ++ +     P  +  N I+   V     L+SW+YR  +F++VCIL+++TC LQ LRL    +   +     +D      EH
Subjt:  YVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFV-----LASWVYRTGVFLLVCILFRLTCELQILRLQGVIKLLETSSGSGSDAVVIFNEH

Query:  LRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIAAVATRWHMLVTSAAGSNQTK
         +I++ LR+ SHR+R FI++ L ++T +QF+ LL    +    N +  G++ +CS   + G F+CL  A +ITH+AQ + ++A +W++  T  +  +   
Subjt:  LRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIAAVATRWHMLVTSAAGSNQTK

Query:  SPPPPPIDDDKQDGFG-----GDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVV
          P   I + +    G      DDE+ +  D   +  I        S+Q RQALV+YL++N  GIT++GF +DR  L+T+F  E +L++W+L+K +V
Subjt:  SPPPPPIDDDKQDGFG-----GDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCGGCGAAACAAACCTCCCTCTCCTCCTCCAACACTCCTACTGCCGATCCAAATCCCTAACCTCCGACGAGCTTCGCAACTTCCGAATCAGCCTCAAATGGTG
CGCTCTCGACCACTCCTCCGCCGCCGGAAAGTTCGTCTCGTACTTCGCCTTCGTTTTCCTCACAATCGTCGTCCCCGCCGCGACAATCGTCGCCGTACGAGACCCGGCCG
ACGATCCGATCTCGTTTAACAAACTCGTTCAGGTCCCGGAATCCGGGCTGGCTGCGATCGGATTCTTCACTCTGTGCCGATTCTTCCGGCGGTACGGCCTCCGGCAGCTG
CTGTTCCTCGACGGATTGCAGGAGGATTCTCTGTACGTTCAGCGAGGCTATGCGCGTGAACTCGACAAGTCGTTCCGTTATGTGGCGTACATTCTTTTTCCGTCGTTCTT
CGTTGAGTTGGCCCACAAGATCATATTCTTCTCAACTGTTAAGATTTCGTTTCCGGGTGGGTTCCCAGCAAACTCGATCATGTTCATGTTCGTGTTGGCGTCGTGGGTGT
ATCGGACCGGGGTGTTCTTGCTAGTGTGCATATTGTTTCGGCTGACGTGCGAGTTGCAAATCCTAAGGCTACAAGGGGTGATCAAGTTGCTCGAGACGAGCAGCGGGTCG
GGATCGGACGCGGTGGTGATCTTCAACGAACATTTGAGGATCAAAAAACAGTTGAGGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGTGTGCTTGTTCGTCATCAC
TGTTAGCCAGTTCGTCGAGCTGTTGCTGGTTTTGGCCTCCAAAACTGATAAGAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCGGCGGTTCAATTATGCGGATTCT
TCTTGTGCCTGTTCGGAGCTGCAAGAATTACTCACAGAGCTCAAGGAATTGCGGCGGTTGCAACCAGATGGCACATGCTCGTCACCTCCGCCGCCGGATCCAACCAGACC
AAATCTCCGCCGCCACCGCCGATCGATGACGACAAGCAAGATGGCTTCGGTGGCGACGATGAGGACACCGACGATTCGGACTCGTCGTCAGACGTTTTAATTTCAGTGAG
CACACAACAACCTTCTTCCTTCCAAACCAGACAAGCCTTAGTATCGTATTTGCAGCACAACAATGGCGGAATTACACTGTTCGGATTTGCATTGGATCGGGGATTGCTTC
ATACGCTCTTCGCATTCGAATTCTCTCTAGTCATGTGGATTTTGAGTAAGGTCGTTGTATTATCTTAG
mRNA sequenceShow/hide mRNA sequence
CTTCCCCACAAGGCAGAACTCTGCACCTTCCTTTATCTTCAGTCCTCACTATCTAAAAAGCTTCCCTCCTTTTCCTCACAGAAATCTCGCCGCCGTCCGCCGATTCAACT
TTTAAAAACCCGCCGATTATTAATTCACATCCCATTCTCAAAAATCTCCGCGGTTTCCGCCATGGACTCCGGCGAAACAAACCTCCCTCTCCTCCTCCAACACTCCTACT
GCCGATCCAAATCCCTAACCTCCGACGAGCTTCGCAACTTCCGAATCAGCCTCAAATGGTGCGCTCTCGACCACTCCTCCGCCGCCGGAAAGTTCGTCTCGTACTTCGCC
TTCGTTTTCCTCACAATCGTCGTCCCCGCCGCGACAATCGTCGCCGTACGAGACCCGGCCGACGATCCGATCTCGTTTAACAAACTCGTTCAGGTCCCGGAATCCGGGCT
GGCTGCGATCGGATTCTTCACTCTGTGCCGATTCTTCCGGCGGTACGGCCTCCGGCAGCTGCTGTTCCTCGACGGATTGCAGGAGGATTCTCTGTACGTTCAGCGAGGCT
ATGCGCGTGAACTCGACAAGTCGTTCCGTTATGTGGCGTACATTCTTTTTCCGTCGTTCTTCGTTGAGTTGGCCCACAAGATCATATTCTTCTCAACTGTTAAGATTTCG
TTTCCGGGTGGGTTCCCAGCAAACTCGATCATGTTCATGTTCGTGTTGGCGTCGTGGGTGTATCGGACCGGGGTGTTCTTGCTAGTGTGCATATTGTTTCGGCTGACGTG
CGAGTTGCAAATCCTAAGGCTACAAGGGGTGATCAAGTTGCTCGAGACGAGCAGCGGGTCGGGATCGGACGCGGTGGTGATCTTCAACGAACATTTGAGGATCAAAAAAC
AGTTGAGGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGTGTGCTTGTTCGTCATCACTGTTAGCCAGTTCGTCGAGCTGTTGCTGGTTTTGGCCTCCAAAACTGAT
AAGAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCGGCGGTTCAATTATGCGGATTCTTCTTGTGCCTGTTCGGAGCTGCAAGAATTACTCACAGAGCTCAAGGAAT
TGCGGCGGTTGCAACCAGATGGCACATGCTCGTCACCTCCGCCGCCGGATCCAACCAGACCAAATCTCCGCCGCCACCGCCGATCGATGACGACAAGCAAGATGGCTTCG
GTGGCGACGATGAGGACACCGACGATTCGGACTCGTCGTCAGACGTTTTAATTTCAGTGAGCACACAACAACCTTCTTCCTTCCAAACCAGACAAGCCTTAGTATCGTAT
TTGCAGCACAACAATGGCGGAATTACACTGTTCGGATTTGCATTGGATCGGGGATTGCTTCATACGCTCTTCGCATTCGAATTCTCTCTAGTCATGTGGATTTTGAGTAA
GGTCGTTGTATTATCTTAGACCGGAAAAAAAACCCTACTATTATTATTATTAAAAAAAAACAAAATCTTACATTTTTAATTTAATTAGCTCTCATAATTCTATAAGAAAT
ATTGTGTTTAAGTTTTCATGTTGTAATACAAAGAAGCATATATGTATATACGTACGTGTTAATTAAAAAGAGGAGGAAAAAAGGATTTTCTGCCGG
Protein sequenceShow/hide protein sequence
MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYFAFVFLTIVVPAATIVAVRDPADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQL
LFLDGLQEDSLYVQRGYARELDKSFRYVAYILFPSFFVELAHKIIFFSTVKISFPGGFPANSIMFMFVLASWVYRTGVFLLVCILFRLTCELQILRLQGVIKLLETSSGS
GSDAVVIFNEHLRIKKQLRVTSHRYRFFIIVCLFVITVSQFVELLLVLASKTDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIAAVATRWHMLVTSAAGSNQT
KSPPPPPIDDDKQDGFGGDDEDTDDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGFALDRGLLHTLFAFEFSLVMWILSKVVVLS