; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011644 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011644
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationscaffold1:1708969..1713537
RNA-Seq ExpressionSpg011644
SyntenySpg011644
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594467.1 hypothetical protein SDJN03_11020, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19282.27Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL
        SDYWMVLNEIQ M WNSS  SS+LRYL  NAD+F GNF+AEKRFSYFD  ++ +V +PCGFLKKFPV+DSD+ AME CNGVVVVSAIFNDHDKIRQPRGL
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL

Query:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE
        GSKTLD VCFFMFVD+TTVRGLENHKIIP RNS PDIIGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE
Subjt:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE

Query:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR
        +AD+AISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYTT                             VPDSALILRRHGR
Subjt:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR

Query:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ
        GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLK K   G EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQ
Subjt:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ

Query:  MWGDVS
        MWGDVS
Subjt:  MWGDVS

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]4.6e-19684.03Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRN--DSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR
        SDYWMV NEIQAM  NSSSPSS+L YL+AN+D+FAGNFTA KRFS+FD R+  ++TV VPCGFLKKFPV DSDRIAME CNGVVVVSAIFNDHDKIRQPR
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRN--DSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR

Query:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
        GLGSKTLD VCFFMFVDETTV+GLENHKII  +NSSPDI IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
Subjt:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI

Query:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR
        +TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSP+KLPYTT                             VPDSALILRR
Subjt:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR

Query:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY
        HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKK  Y GP+L P ISKPKRTKRAGPDLLYVNG+CCSKC NY
Subjt:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY

Query:  LLQMWGD
        LLQMWG+
Subjt:  LLQMWGD

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]3.0e-19582.8Show/hide
Query:  DYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRG
        DYWMVLN+IQ + WNSS  SS+LRYL+ANADTF GNFTA+ RFS+FD R  NDS+VAVPCGFLKKFPV+DSDRIAMERC+GVVVVSAIFNDHDKIRQPRG
Subjt:  DYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRG

Query:  LGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT
        LGSKTL+ VCFFMFVDETTVRGLE+H +I  RNSSPDIIGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LIVT
Subjt:  LGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT

Query:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG
        ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTT                             VPDSA ILR+H 
Subjt:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG

Query:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLL
        R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EVFEQVALEYRHNLKKKGY GP+LGPHISKPKRTKRAGPDLLYVNGTCCSKCQ YLL
Subjt:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLL

Query:  QMWGDVS
        QMWGD S
Subjt:  QMWGDVS

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]1.1e-19282.27Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL
        SDYWMVLNEIQ M WNSS  SS+LRYL  NAD+F GNF+AEKRFSYFD  ++ +V +PCGFLKKFPV+DSD+ AME CNGVVVVSAIFNDHDKIRQPRGL
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL

Query:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE
        GSKTLD VCFFMFVD+TTVRGLENHK+IP RNS PDIIGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE
Subjt:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE

Query:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR
        +ADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYTT                             VPDSALILRRHGR
Subjt:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR

Query:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ
        GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLK K   G EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQ
Subjt:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ

Query:  MWGDVS
        MWGDVS
Subjt:  MWGDVS

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]3.5e-19683.74Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDST--VAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR
        SDYWMV NEI  MQ +SSS SS+LRYL+AN+DTF GNFTAE+RFS+FD R+  T  V VPCGFLKKFPVSDSDRIAME CNGVVVVSAIFNDHDKIRQPR
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDST--VAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR

Query:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIV
        GLGSKTL  VCFFMFVDETTV+GLENHKII  +NSS DIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI+
Subjt:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIV

Query:  TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRH
        TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGL+PWSP+KLPYTT                             VPDSALILRRH
Subjt:  TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRH

Query:  GRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYL
        GRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEVFEQVALEYRHNLK K Y GPELGPHISKPKRTKRAGPDL YVNG+CCSKCQNYL
Subjt:  GRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYL

Query:  LQMWGD
        LQMWG+
Subjt:  LQMWGD

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein2.6e-19282.06Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR
        SDYWMVLNEIQAM  NSSS SS+L YL+AN+D+FAGNFTA KRFS+FD R  +++TV +PCGFLKKFPVSDSDRIAME CNGVVVVSAIFNDHDKIRQPR
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR

Query:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
        GLGSKTLD VCFFMFVDE TV+GLENHK++  +N+SPDI IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
Subjt:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI

Query:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR
        +T+NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSP+KLPYTT                             VPDSALILRR
Subjt:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR

Query:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY
        HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTPSIKINMFEGEVFEQVALEYRHNLKK  Y GPEL P ISKPKRTKRAGPDLLYVNG+CCSKC +Y
Subjt:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY

Query:  LLQMWGD
        LL MWG+
Subjt:  LLQMWGD

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X12.2e-19684.03Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRN--DSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR
        SDYWMV NEIQAM  NSSSPSS+L YL+AN+D+FAGNFTA KRFS+FD R+  ++TV VPCGFLKKFPV DSDRIAME CNGVVVVSAIFNDHDKIRQPR
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRN--DSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPR

Query:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
        GLGSKTLD VCFFMFVDETTV+GLENHKII  +NSSPDI IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI
Subjt:  GLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDI-IGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLI

Query:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR
        +TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSP+KLPYTT                             VPDSALILRR
Subjt:  VTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRR

Query:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY
        HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKK  Y GP+L P ISKPKRTKRAGPDLLYVNG+CCSKC NY
Subjt:  HGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNY

Query:  LLQMWGD
        LLQMWG+
Subjt:  LLQMWGD

A0A6J1CXF7 uncharacterized protein LOC1110157181.4e-19582.8Show/hide
Query:  DYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRG
        DYWMVLN+IQ + WNSS  SS+LRYL+ANADTF GNFTA+ RFS+FD R  NDS+VAVPCGFLKKFPV+DSDRIAMERC+GVVVVSAIFNDHDKIRQPRG
Subjt:  DYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRR--NDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRG

Query:  LGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT
        LGSKTL+ VCFFMFVDETTVRGLE+H +I  RNSSPDIIGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LIVT
Subjt:  LGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT

Query:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG
        ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTT                             VPDSA ILR+H 
Subjt:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG

Query:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLL
        R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EVFEQVALEYRHNLKKKGY GP+LGPHISKPKRTKRAGPDLLYVNGTCCSKCQ YLL
Subjt:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLL

Query:  QMWGDVS
        QMWGD S
Subjt:  QMWGDVS

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X22.0e-19282.27Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL
        SDYWMVLNEIQ M WNSS  SS+LRYL  NAD+F GNF+AEKRFSYFD  ++ +V +PCGFLKKFPV+DSD+ AME CNGVVVVSAIFNDHDKIRQPRGL
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL

Query:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE
        GSKTLD VCFFMFVD+TTVRGLENHKIIP  NS PDIIGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE
Subjt:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTE

Query:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR
        +ADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYTT                             VPDSALILRRHGR
Subjt:  NADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGR

Query:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ
        GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLK K   G EL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQ
Subjt:  GSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQ

Query:  MWGDVS
        MWGDVS
Subjt:  MWGDVS

A0A6J1KT34 uncharacterized protein LOC1114969875.7e-19276.44Show/hide
Query:  MASTSTPFPPSAPLAPPLSSSQVPIHETIAFFFSFFLSFLSLSASDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVA
        MASTSTPFPPSAP APPLSSSQV                                 M WNSS  SS+LRYL  NAD+F GNF+AEKRFSYFD  ++ +V 
Subjt:  MASTSTPFPPSAPLAPPLSSSQVPIHETIAFFFSFFLSFLSLSASDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVA

Query:  VPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPA
        +PCGFLK FPV+DSD+ AME CNGVVVVSAIFNDHDKIRQPRGLGSKTLD VCFFMFVD+TTVRGLENHKIIP RNS PDIIGAWRIVRVS+KNLY+NPA
Subjt:  VPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPA

Query:  MNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPY
        MNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVT++ADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPY
Subjt:  MNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPY

Query:  TTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNL
        TT                             VPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNL
Subjt:  TTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNL

Query:  KKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS
        K K   GPEL P ISKP RTKRAGPDLLYVNG+CCSKCQ YLLQMWGDVS
Subjt:  KKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQNYLLQMWGDVS

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI707.7e-4534.62Show/hide
Query:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------V
        F G  T + R   FD +   T++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T       
Subjt:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------V

Query:  RGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEA
        RGL+ +K           +G WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA
Subjt:  RGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEA

Query:  MATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPR
         A     K +D  S+  Q++ Y   GL P+S +KLP T+                             VP+  +ILR H   SNLF+CL FNE++ F  R
Subjt:  MATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPR

Query:  DQLAFAFVRDHL
        DQ++F+ VRD +
Subjt:  DQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)5.5e-4634.62Show/hide
Query:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------V
        F G  T + R   FD +   T++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T       
Subjt:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETT------V

Query:  RGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEA
        RGL+ +K           +G WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA
Subjt:  RGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEA

Query:  MATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPR
         A     K +D  S+  Q++ Y   GL P+S +KLP T+                             VP+  +ILR H   SNLF+CL FNE++ F  R
Subjt:  MATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPR

Query:  DQLAFAFVRDHL
        DQ++F+ VRD +
Subjt:  DQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)8.2e-4234.07Show/hide
Query:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENH
        F G  + E R + FD +   T  V CGF+K         F + +     +++ + V+V SAIF  +D I++P  +       + F+MFVDE T   L+N 
Subjt:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENH

Query:  KIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARW
              N     +G WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A    
Subjt:  KIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARW

Query:  KKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFA
        +K +D  S+  Q+E Y + GL P++ +KLP T+                             VP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA
Subjt:  KKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFA

Query:  FVRDHLTPSI--KINMF
          RD +   +   INMF
Subjt:  FVRDHLTPSI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)8.2e-4234.07Show/hide
Query:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENH
        F G  + E R + FD +   T  V CGF+K         F + +     +++ + V+V SAIF  +D I++P  +       + F+MFVDE T   L+N 
Subjt:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLK--------KFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENH

Query:  KIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARW
              N     +G WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A    
Subjt:  KIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARW

Query:  KKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFA
        +K +D  S+  Q+E Y + GL P++ +KLP T+                             VP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA
Subjt:  KKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFA

Query:  FVRDHLTPSI--KINMF
          RD +   +   INMF
Subjt:  FVRDHLTPSI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)4.1e-4131.29Show/hide
Query:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLKK--FPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRR
        F GN +  +R   F  + +  + V CGF+ +    +S  D+  +++C   VV + IF+ +D+  QP  +  +++++ CF M VDE ++  L  +  + + 
Subjt:  FAGNFTAEKRFSYFDRRNDSTVAVPCGFLKK--FPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRR

Query:  NSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDV
              +G WR++ + +   Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H ++ +  EEA A  R +K +  
Subjt:  NSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDV

Query:  DSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
          +   M+ Y   GL+PWS  K                               V  VP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L
Subjt:  DSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Query:  TPSIKINMFE
          + K+ MF+
Subjt:  TPSIKINMFE

AT5G46220.1 Protein of unknown function (DUF616)3.7e-13559.12Show/hide
Query:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL
        SDYW VL EIQ++  + SSP  +LRY+   +++F GNF+ +KRFSYF+  N   V VPCGF + FPVS+SDR+ ME+C G+VV SAIFNDHDKIRQP GL
Subjt:  SDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFPVSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGL

Query:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT
        G KTL+ VCF+MF+D+ T+  L +H +I + N S   +GAWRI+++S S+NLY NPAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS++V 
Subjt:  GSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIVT

Query:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG
           DMAISKHP++++TMEEAMATARWKKW DVD L+ QMETYCE+GLKPWS SKLPY T                             VPD+ALILRRHG
Subjt:  ENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVVYVPDSALILRRHG

Query:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISK-------PKRTKRAGPDLLYVNGTCCS
          SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV +EYRHNLKK   +  E      K        KR K    +   +N    S
Subjt:  RGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISK-------PKRTKRAGPDLLYVNGTCCS

Query:  KCQNYLLQMWG
         C+NYL  MWG
Subjt:  KCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCACAAGTACGCCGTTCCCACCCTCCGCTCCTCTTGCTCCTCCCCTGTCTTCTTCTCAGGTACCAATTCACGAAACCATTGCGTTTTTCTTCTCGTTTTTCCT
CTCATTTCTCTCTCTCTCTGCCTCAGATTATTGGATGGTTTTGAATGAGATCCAAGCAATGCAGTGGAATTCCTCTTCGCCATCGTCTGATTTGAGGTACCTCGTAGCGA
ATGCCGATACTTTCGCCGGTAATTTCACTGCCGAGAAGAGGTTTTCTTACTTCGATCGTCGAAATGATTCTACTGTAGCGGTTCCTTGTGGATTTCTCAAGAAATTTCCT
GTCAGTGATTCTGATCGAATTGCGATGGAGCGTTGCAACGGCGTGGTCGTGGTTTCCGCGATCTTCAACGATCATGATAAAATTCGGCAACCGAGAGGCCTCGGATCGAA
AACTTTGGACGTCGTGTGCTTTTTCATGTTTGTTGATGAAACTACGGTGAGAGGACTCGAGAATCACAAAATAATTCCTAGAAGAAACTCATCTCCGGATATAATTGGCG
CTTGGAGAATTGTGAGAGTTTCGAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTGATACCTAAGTATTTAGTTCACAGACTGTTTCCGAACTCAAAATTCAGT
ATATGGGTGGACGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGCTGATTGTGACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATAT
TCACACAATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGATTGAAACCATGGA
GTCCCAGTAAGCTTCCATATACCACAGGTCAAGAGCCTCCAATAATCATTTCAATAATGGCCATACTACCTTTAGCTTTGGCTTTGTCTTTGGCTTTTGTTATAGTTGTA
TATGTACCAGATAGTGCTTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTATTCAACGAATTGGAAGCTTTCAACCCAAGGGACCAATTGGC
TTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAAGAAAG
GATATGCTGGGCCTGAACTGGGCCCCCACATCTCCAAGCCCAAACGAACCAAAAGGGCCGGCCCAGATTTGTTGTACGTCAATGGCACCTGTTGCAGCAAGTGCCAAAAC
TATCTTCTCCAGATGTGGGGTGACGTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCACAAGTACGCCGTTCCCACCCTCCGCTCCTCTTGCTCCTCCCCTGTCTTCTTCTCAGGTACCAATTCACGAAACCATTGCGTTTTTCTTCTCGTTTTTCCT
CTCATTTCTCTCTCTCTCTGCCTCAGATTATTGGATGGTTTTGAATGAGATCCAAGCAATGCAGTGGAATTCCTCTTCGCCATCGTCTGATTTGAGGTACCTCGTAGCGA
ATGCCGATACTTTCGCCGGTAATTTCACTGCCGAGAAGAGGTTTTCTTACTTCGATCGTCGAAATGATTCTACTGTAGCGGTTCCTTGTGGATTTCTCAAGAAATTTCCT
GTCAGTGATTCTGATCGAATTGCGATGGAGCGTTGCAACGGCGTGGTCGTGGTTTCCGCGATCTTCAACGATCATGATAAAATTCGGCAACCGAGAGGCCTCGGATCGAA
AACTTTGGACGTCGTGTGCTTTTTCATGTTTGTTGATGAAACTACGGTGAGAGGACTCGAGAATCACAAAATAATTCCTAGAAGAAACTCATCTCCGGATATAATTGGCG
CTTGGAGAATTGTGAGAGTTTCGAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTGATACCTAAGTATTTAGTTCACAGACTGTTTCCGAACTCAAAATTCAGT
ATATGGGTGGACGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGCTGATTGTGACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATAT
TCACACAATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGATTGAAACCATGGA
GTCCCAGTAAGCTTCCATATACCACAGGTCAAGAGCCTCCAATAATCATTTCAATAATGGCCATACTACCTTTAGCTTTGGCTTTGTCTTTGGCTTTTGTTATAGTTGTA
TATGTACCAGATAGTGCTTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTATTCAACGAATTGGAAGCTTTCAACCCAAGGGACCAATTGGC
TTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAAGAAAG
GATATGCTGGGCCTGAACTGGGCCCCCACATCTCCAAGCCCAAACGAACCAAAAGGGCCGGCCCAGATTTGTTGTACGTCAATGGCACCTGTTGCAGCAAGTGCCAAAAC
TATCTTCTCCAGATGTGGGGTGACGTTTCTTGA
Protein sequenceShow/hide protein sequence
MASTSTPFPPSAPLAPPLSSSQVPIHETIAFFFSFFLSFLSLSASDYWMVLNEIQAMQWNSSSPSSDLRYLVANADTFAGNFTAEKRFSYFDRRNDSTVAVPCGFLKKFP
VSDSDRIAMERCNGVVVVSAIFNDHDKIRQPRGLGSKTLDVVCFFMFVDETTVRGLENHKIIPRRNSSPDIIGAWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFS
IWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPSKLPYTTGQEPPIIISIMAILPLALALSLAFVIVV
YVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKKKGYAGPELGPHISKPKRTKRAGPDLLYVNGTCCSKCQN
YLLQMWGDVS