; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005127 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005127
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationchr6:11009738..11016317
RNA-Seq ExpressionLag0005127
SyntenyLag0005127
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-19985.25Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA+IL PSFFVELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLLVL SKTDKNFFNSGD+VVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+LS
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

XP_022951427.1 uncharacterized protein LOC111454247 [Cucurbita moschata]1.6e-19885.02Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA+IL PSFFVELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLL L SKTDKNFFNSGD+VVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+LS
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima]7.3e-19984.79Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA++L PSFF ELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLLVL SKT+KNFFNSGD+VVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+LS
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

XP_023537461.1 uncharacterized protein LOC111798502 [Cucurbita pepo subsp. pepo]7.3e-19985.45Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA+IL PSFFVELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLLVL SKTDKNFFNSGD+VVCS VELCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVIL
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+L
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVIL

XP_038884363.1 uncharacterized protein LOC120075225 [Benincasa hispida]1.5e-19684.74Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSGE+NLPLLLQHSYCRSKSLTSD+LRNFRISLKWCALDHSSAAGKF SY  F FLT VVP V I+AVR P      ADDPISFNKLVQVPESGLAA+G
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFP---GGFPVNSMVFVLVLGSWVYRTGVFLL
        F +LCRFFRRYGLRQLLFL+GLQEDSVYVQ+GYARELEK FR +A+IL PSFFVELAHKI+FFSTV ISFP    GFPVNS++F+ VLGSW+YRTGVFLL
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFP---GGFPVNSMVFVLVLGSWVYRTGVFLL

Query:  VCVLFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELC
        VCVLFRLTCELQILRLQGV+KLFETSSGSGSD VVIFNEHLRIRKQL VTSHRYRFFIIVCL VITVSQFVALLLVLASKTDKNFFNSGDVVVCSAV+LC
Subjt:  VCVLFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELC

Query:  GFFLCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDD--NSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQH
        GFFLCLFGAARITHRAQGIASVATRWHMLVTSAA GSN TKS  P PP VDD  ++    + D+D+DTDSDSSSDVLISVNTQEPSSFQ RQALVSYLQH
Subjt:  GFFLCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDD--NSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQH

Query:  NNGGITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        NNGGIT+FGYALDRGLLHTLF FEFSLV+WI+SKVV+LS
Subjt:  NNGGITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein7.4e-18981.84Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDS E+NLPLLLQHSYCRSKS+TSD+LRN+RISLKWCALDHSSAAGKF SYL F  LT+VVPAVTI+AV+ P       DDPISFNKLVQVPESGLAA+G
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        F +LCRFFRRYGLRQLLFL+GLQEDSVYVQ+GYARELEK FR +A+IL PSFFVELAHKI+FFSTV I FP GFPVNS++F+LVLGSWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGV+KLFETS+GS SD+V+IFNEHLRIRKQL +TSHRYRFFIIVCL VITVSQFVALLLVLASKTDKNFFNSGDVVVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVD--DNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNG
        LCLFGAARITHRAQGIASVATRWHMLVTSAA+GSN TK  P   P V+   N+      +++DDTDSDSSSDVLISV  +E SSFQTRQALVSYLQHNNG
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVD--DNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNG

Query:  GITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVIL
        GIT+FGYALDRGLLHTLF FEFSLV+ I+SKVV+L
Subjt:  GITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVIL

A0A5A7VCT9 Uncharacterized protein5.5e-19282.34Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSGE+NLPLLLQHSYCRSKS+TSD+LRNFRISLKWCALDHSSA GKF SYLAF FLT+VVPAVT++AV+ P       DDPISFNKLVQVPESGLAA+ 
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        F +LCRFFRRYGLRQLLFL+GLQEDSVYVQ+GYARELEK FR +A+IL PSFFVELAHKI+FFSTV ISFP GFP+NS++F+ VLGSWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGV+KLFETS+GS SD+V+IFNEHLRIRKQL +TSHRYRFFIIVCL VITVSQFVALLLVLASK DKNFFNSGDVVVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVD--DNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNG
        LCLFGAARITHRAQGIAS+ATRWHMLVTSAA+GSN TKS PP  P V+   N+      +++DDTDSDSSSDVLISV  QE SSFQTRQALVSYLQHNNG
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVD--DNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNG

Query:  GITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        GIT+FGYALDRGLLHTLF FEFSLV+WI+SKVV+LS
Subjt:  GITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

A0A6J1BR58 uncharacterized protein LOC1110050155.5e-18482.39Show/hide
Query:  LLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRR
        LL   YCRSKSLTSDELRNFRISLKWCALDHSS AGKFVSY AFV   VVVPAV IVAV +P       DDPISFNKLVQVPESGLAAIGFFTL RFFRR
Subjt:  LLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRR

Query:  YGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCVLFRLTCELQI
        YGLRQLLFLDGLQEDSVYVQRGYAREL+KSFRYLA+IL PSFFVELAHKI FF+TV ISFPGGFP+NS++FVLVLG+WVYRTGVFLLVC+LFRLTCELQI
Subjt:  YGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCVLFRLTCELQI

Query:  LRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARIT
        LRLQGV KLFE SSGSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL VITVSQFVALLLVLASKT KNFFNSGDVVVCSAV+LCGFFLCLFGAARIT
Subjt:  LRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARIT

Query:  HRAQGIASVATRWHMLVTS--AAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITVFGYALD
        HRAQGI S+A+RWHMLVTS  +AAGS+QTK    PP P  D SGG G     DDTDSDSS  VLISV+  +PSSFQTRQALVSYL+HNNGGITV+G+ALD
Subjt:  HRAQGIASVATRWHMLVTS--AAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITVFGYALD

Query:  RGLLHTLFTFEFSLVLWIMSKVVILS
        RGLLHTLF FEFSLV+WI+SKVV+LS
Subjt:  RGLLHTLFTFEFSLVLWIMSKVVILS

A0A6J1GHK9 uncharacterized protein LOC1114542477.9e-19985.02Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKW ALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA+IL PSFFVELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLL L SKTDKNFFNSGD+VVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+LS
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

A0A6J1KJB4 uncharacterized protein LOC1114962253.5e-19984.79Show/hide
Query:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG
        MDSG+ NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSY AFV LTVVVP VTIV+VR P      ADDPIS NKLVQVPESGLAAIG
Subjt:  MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIG

Query:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV
        FF+LCRFFRRYGLRQLLFLDGLQEDS+YVQRGYAREL+K+FRYLA++L PSFF ELAHKI+FFSTV ISFPGGFP+NS++F+ VL SWVYRTGVFLLVCV
Subjt:  FFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCV

Query:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF
        LFRLTCELQILRLQGVNKLFETS+GSGSDA+VIFNEHLRIRKQLWVTSHRYRFFIIVCLF+ITVSQFVALLLVL SKT+KNFFNSGD+VVCS V+LCGFF
Subjt:  LFRLTCELQILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFF

Query:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI
        LC+ GAARITHRAQ IASVATRWHMLVTSAAAG N TK  P P PP  DN G        DDTDSDSSS+VLIS++TQEPSSFQTRQALVSYLQHNNGGI
Subjt:  LCLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGI

Query:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        T+FG+ALDRGLLHTLF FEFSLV+WIMSKVV+LS
Subjt:  TVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)7.8e-6637.41Show/hide
Query:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLL
        R  S   DEL +FR  L+W  +DHSS     +S+  F+  T+VVPA++   +   + A   +     ++ +VQ+  S +A + F  L RF  +YGLR+ L
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLL

Query:  FLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMV-FVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQ
        F D L ++S  V+R Y  +L  S   +++ ++P F    A+KI ++++    I F G   ++  V  ++ L SW+YRT V  LVCVLFRL C LQILRLQ
Subjt:  FLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMV-FVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQ

Query:  GVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQ
           KLF+  S  GS    I +EHLRIR+ L + SHRYR FI+  L ++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ
Subjt:  GVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQ

Query:  GIASVATRWHMLVT-----SAAAGSNQTKSTPPPPPPVDDNSGG--------GGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGIT
         +  +A +WH+  T           +QT  TP      ++++            D DE  D + D  ++ +I V      SFQ RQALVSY ++N+ GIT
Subjt:  GIASVATRWHMLVT-----SAAAGSNQTKSTPPPPPPVDDNSGG--------GGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGIT

Query:  VFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        V+G+ LDRG LHT+F  E SLVLW++ K + +S
Subjt:  VFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

AT1G50630.2 Protein of unknown function (DUF3537)2.1e-5035.03Show/hide
Query:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLL
        R  S   DEL +FR  L+W  +DHSS     +S+  F+  T+VVPA++   +   + A   +     ++ +VQ+  S +A + F  L RF  +YGLR+ L
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLL

Query:  FLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMV-FVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQ
        F D L ++S  V+R Y  +L  S   +++ ++P F    A+KI ++++    I F G   ++  V  ++ L SW+YRT V  LVCVLFRL C LQILRLQ
Subjt:  FLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMV-FVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQ

Query:  GVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQ
           KLF+  S  GS    I +EHLRIR+ L + SHRYR FI+  L ++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ
Subjt:  GVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQ

Query:  GIASVATRWHMLVT-----SAAAGSNQTKSTPPPPPPVDDNSGG--------GGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQH
         +  +A +WH+  T           +QT  TP      ++++            D DE  D + D  ++ +I V      SFQ RQAL   +Q+
Subjt:  GIASVATRWHMLVT-----SAAAGSNQTKSTPPPPPPVDDNSGG--------GGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQH

AT2G21080.1 unknown protein1.3e-12156.81Show/hide
Query:  NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCR
        +LP L       SKSL SD+LRNFR+ LKWCALDHSS+ GK VSY+ FV  T++VP ++ + ++ P   P+   D  SFN LVQ PESGLA IGF TL  
Subjt:  NLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCR

Query:  FFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFP----GGFPVNSMVFVLVLGSWVYRTGVFLLVCVLF
        FFR Y L +LLFLD    DS  V+ GY+REL+K+ RYLA+IL+PSF VEL HK +FF +  +SFP        +N ++F LVL SWVYRTGVFLLVC+LF
Subjt:  FFRRYGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFP----GGFPVNSMVFVLVLGSWVYRTGVFLLVCVLF

Query:  RLTCELQILRLQGVNKLFETSSGSGSDAVV-IFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFL
        RLTCELQILR +G++KLF+     GSD +  +  EH+RI+KQL  TSHRYRFFII    VI+ SQFVALLLVLASK++K+F +SGD+VVCSAV+L GFFL
Subjt:  RLTCELQILRLQGVNKLFETSSGSGSDAVV-IFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFL

Query:  CLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGIT
        CL GAARITHRAQG+  +ATRWHM +T A+   +                        + DTDS  +  + +S +    S FQ RQALV YL+HNN GIT
Subjt:  CLFGAARITHRAQGIASVATRWHMLVTSAAAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGIT

Query:  VFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        ++GYALDRGLLHTLF FEFSLV+WI+SKVV+LS
Subjt:  VFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

AT3G20300.1 Protein of unknown function (DUF3537)3.0e-6537.5Show/hide
Query:  QHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYG
        ++ + RS S   DEL +FR  L+W  +D SS     +S+  FV  T+VVPA +   +   + +   +     ++ +VQ+  S  AA+ F  L RF  +YG
Subjt:  QHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYG

Query:  LRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMVFVLV-LGSWVYRTGVFLLVCVLFRLTCELQ
        LR+ LF D L ++S  V+ GY  +L +S + L++ + P F    ++KI ++++    I F G   ++  V  L+ L SW+YRT V  LVCVLFRL C LQ
Subjt:  LRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFST--VNISFPGGFPVNSMVFVLV-LGSWVYRTGVFLLVCVLFRLTCELQ

Query:  ILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARI
        ILRLQ   ++F+  S  GS    I +EHLRIR+ L + SHRYR FI++ L ++T SQF +LL+   +  + N + +G++ +CS   +    + L  A++I
Subjt:  ILRLQGVNKLFETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARI

Query:  THRAQGIASVATRWHMLVT--SAAAGSNQT-----KSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITV
        TH+AQ +  +A +WH+  T  S      +T     +++     P DD++G    ED  D+ D D  ++ LI        SFQ RQALV+Y ++N  GITV
Subjt:  THRAQGIASVATRWHMLVT--SAAAGSNQT-----KSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITV

Query:  FGYALDRGLLHTLFTFEFSLVLWIMSKVVILS
        FG+ LDR  LHT+F  E SLVLW++ K + +S
Subjt:  FGYALDRGLLHTLFTFEFSLVLWIMSKVVILS

AT4G03820.2 Protein of unknown function (DUF3537)1.1e-4631.11Show/hide
Query:  DHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEK
        D S+     +S+  F  L V+VP ++   +     A         ++ LVQ+  S  A I F +L  + ++YG+R+ LF D L++ S  V+ GY  ++++
Subjt:  DHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELEK

Query:  SFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNS-----MVFVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVNKLFETSSGSGSDAVVIF
        S + LA  +LPS  ++  ++I ++++     P  + +N      +   L L SW+YRT +F++ C+L++  C LQ+LRL    + F +      D   I 
Subjt:  SFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNS-----MVFVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVNKLFETSSGSGSDAVVIF

Query:  NEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQGIASVATRWHMLVTSAAAGS
         EHL+IR++L + SHR+R FI++ LF +T +QF+ALL  + +    N +  G++ +CS   + G F+CL  A ++TH+AQ + S+AT+W++  +      
Subjt:  NEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQGIASVATRWHMLVTSAAAGS

Query:  NQTKSTPPPPPPVDD--------NSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITVFGYALDRGLLHTLFTFEFSLVLWI
             TP  P             N     D+DE+ + D    +D+ I        S Q RQALV+YL++N  GITV+G+ +D+  L  +F+ E +L+LW+
Subjt:  NQTKSTPPPPPPVDD--------NSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITVFGYALDRGLLHTLFTFEFSLVLWI

Query:  MSKVV
        + K +
Subjt:  MSKVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCGGCGAAACCAACCTCCCTCTCCTTCTCCAACACTCATACTGCCGATCCAAATCCCTAACCTCCGACGAGCTTCGCAACTTCCGAATCAGCCTCAAATGGTG
CGCTCTCGACCACTCCTCCGCCGCCGGAAAGTTCGTCTCGTACTTGGCCTTCGTTTTCCTCACCGTCGTCGTCCCCGCCGTGACGATCGTCGCCGTCCGAATTCCTTCCA
CTGCTCCCGCCGGCGCCGACGATCCGATCTCGTTTAACAAACTCGTTCAGGTTCCGGAGTCCGGCCTGGCGGCGATCGGATTCTTCACTCTGTGTCGATTCTTCCGGCGG
TACGGGCTCCGGCAGCTTCTGTTCCTCGACGGATTGCAAGAGGATTCTGTGTACGTTCAGAGAGGCTATGCGCGTGAACTGGAGAAGTCGTTTCGTTACCTTGCGTTCAT
TCTTTTGCCGTCGTTCTTCGTTGAGTTGGCCCACAAGATCTTGTTCTTCTCCACCGTTAACATCTCCTTTCCGGGCGGGTTTCCAGTAAACTCGATGGTGTTCGTGCTCG
TGTTGGGGTCGTGGGTTTATCGAACCGGGGTGTTCTTGCTGGTGTGCGTGTTGTTTCGGCTGACGTGCGAGCTGCAGATTCTGAGGCTACAAGGGGTGAACAAATTGTTC
GAGACGAGCAGCGGGTCAGGATCGGACGCAGTGGTGATCTTCAACGAACATTTGAGGATCAGAAAACAGCTATGGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGT
CTGCTTGTTCGTCATCACCGTCAGCCAGTTCGTGGCGCTGCTGCTCGTTTTAGCCTCAAAAACTGATAAGAATTTCTTCAACTCCGGCGACGTTGTGGTTTGCTCGGCGG
TTGAATTATGCGGATTCTTCTTGTGCTTGTTTGGAGCTGCAAGAATTACTCACAGAGCTCAAGGAATCGCGTCGGTTGCAACCAGATGGCACATGCTTGTCACTTCCGCC
GCCGCAGGATCCAACCAGACCAAATCCACGCCGCCGCCTCCGCCGCCCGTCGACGACAATTCAGGCGGCGGCGGAGACGAAGACGAAGACGACGACACCGATTCCGACTC
GTCGTCGGACGTTTTAATTTCAGTGAACACGCAGGAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCGTATTTGCAGCACAACAATGGCGGAATTACAGTGTTTG
GATATGCATTGGATCGAGGATTGCTTCATACGCTTTTCACATTCGAATTCTCTCTCGTCCTGTGGATTATGAGTAAGGTTGTCATATTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTCCGGCGAAACCAACCTCCCTCTCCTTCTCCAACACTCATACTGCCGATCCAAATCCCTAACCTCCGACGAGCTTCGCAACTTCCGAATCAGCCTCAAATGGTG
CGCTCTCGACCACTCCTCCGCCGCCGGAAAGTTCGTCTCGTACTTGGCCTTCGTTTTCCTCACCGTCGTCGTCCCCGCCGTGACGATCGTCGCCGTCCGAATTCCTTCCA
CTGCTCCCGCCGGCGCCGACGATCCGATCTCGTTTAACAAACTCGTTCAGGTTCCGGAGTCCGGCCTGGCGGCGATCGGATTCTTCACTCTGTGTCGATTCTTCCGGCGG
TACGGGCTCCGGCAGCTTCTGTTCCTCGACGGATTGCAAGAGGATTCTGTGTACGTTCAGAGAGGCTATGCGCGTGAACTGGAGAAGTCGTTTCGTTACCTTGCGTTCAT
TCTTTTGCCGTCGTTCTTCGTTGAGTTGGCCCACAAGATCTTGTTCTTCTCCACCGTTAACATCTCCTTTCCGGGCGGGTTTCCAGTAAACTCGATGGTGTTCGTGCTCG
TGTTGGGGTCGTGGGTTTATCGAACCGGGGTGTTCTTGCTGGTGTGCGTGTTGTTTCGGCTGACGTGCGAGCTGCAGATTCTGAGGCTACAAGGGGTGAACAAATTGTTC
GAGACGAGCAGCGGGTCAGGATCGGACGCAGTGGTGATCTTCAACGAACATTTGAGGATCAGAAAACAGCTATGGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGT
CTGCTTGTTCGTCATCACCGTCAGCCAGTTCGTGGCGCTGCTGCTCGTTTTAGCCTCAAAAACTGATAAGAATTTCTTCAACTCCGGCGACGTTGTGGTTTGCTCGGCGG
TTGAATTATGCGGATTCTTCTTGTGCTTGTTTGGAGCTGCAAGAATTACTCACAGAGCTCAAGGAATCGCGTCGGTTGCAACCAGATGGCACATGCTTGTCACTTCCGCC
GCCGCAGGATCCAACCAGACCAAATCCACGCCGCCGCCTCCGCCGCCCGTCGACGACAATTCAGGCGGCGGCGGAGACGAAGACGAAGACGACGACACCGATTCCGACTC
GTCGTCGGACGTTTTAATTTCAGTGAACACGCAGGAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCGTATTTGCAGCACAACAATGGCGGAATTACAGTGTTTG
GATATGCATTGGATCGAGGATTGCTTCATACGCTTTTCACATTCGAATTCTCTCTCGTCCTGTGGATTATGAGTAAGGTTGTCATATTATCTTAG
Protein sequenceShow/hide protein sequence
MDSGETNLPLLLQHSYCRSKSLTSDELRNFRISLKWCALDHSSAAGKFVSYLAFVFLTVVVPAVTIVAVRIPSTAPAGADDPISFNKLVQVPESGLAAIGFFTLCRFFRR
YGLRQLLFLDGLQEDSVYVQRGYARELEKSFRYLAFILLPSFFVELAHKILFFSTVNISFPGGFPVNSMVFVLVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVNKLF
ETSSGSGSDAVVIFNEHLRIRKQLWVTSHRYRFFIIVCLFVITVSQFVALLLVLASKTDKNFFNSGDVVVCSAVELCGFFLCLFGAARITHRAQGIASVATRWHMLVTSA
AAGSNQTKSTPPPPPPVDDNSGGGGDEDEDDDTDSDSSSDVLISVNTQEPSSFQTRQALVSYLQHNNGGITVFGYALDRGLLHTLFTFEFSLVLWIMSKVVILS