CuGenDBv2

Sgr019155 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019155
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF616)
Genome location: tig00153293:169980..181279
RNA-Seq Expression: Sgr019155
Synteny: Sgr019155
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR006852 - Protein of unknown function DUF616
  IPR007493 - Protein of unknown function DUF538
  IPR036758 - At5g01610-like superfamily
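The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch of how such a string could be split and the span length computed, the Python below assumes 1-based, inclusive coordinates (a common convention for gene records, but not stated on this page). The helper names `parse_location` and `span_length` are hypothetical and exist only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("tig00153293:169980..181279")
print(seqid, start, end, span_length(start, end))
# prints: tig00153293 169980 181279 11300
```

Under that assumption the gene model spans 11,300 bp on the contig, considerably more than the 2,112 nt CDS listed further down, which is what one would expect if the gene contains introns.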


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026465.1 hypothetical protein SDJN02_10465 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.2e-180; identity: 79.25%)
Query:  MASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGW------FSPRNSDQFSV
        MASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NADSFGGNF+AEKRFSYFD  +N SVP+PCGFLKKFPV+DSG       F    S  FSV
Subjt:  MASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGW------FSPRNSDQFSV

Query:  R----IFYDFNTFLT-DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL---
        R      Y F+  ++ D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+TTVRGLENHKIIP R+S PDIIGAWRIVRVS+KNL   
Subjt:  R----IFYDFNTFLT-DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL---

Query:  ----------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKL
                              IW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKL
Subjt:  ----------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKL

Query:  PYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPD
        PYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPD
Subjt:  PYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPD

Query:  LLYVNGSCCSKCQKYLLQMWGDVS
        LLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  LLYVNGSCCSKCQKYLLQMWGDVS

XP_022926657.1 uncharacterized protein LOC111433726 isoform X1 [Cucurbita moschata] (e value: 2.4e-199; identity: 78.56%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSDPLPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NA
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
        DSFGGNF+AEKRFSYFD  +N SVP+PCGFLKKFPV+DS                        D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP  +S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVTE+ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

XP_022926659.1 uncharacterized protein LOC111433726 isoform X3 [Cucurbita moschata] (e value: 1.3e-173; identity: 71.34%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSDPLPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRY     
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
                                                                     L D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP  +S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVTE+ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

XP_023003354.1 uncharacterized protein LOC111496987 [Cucurbita maxima] (e value: 1.5e-198; identity: 78.13%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSD LPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NA
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
        DSFGGNF+AEKRFSYFD  +N SVP+PCGFLK FPV+DS                        D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP R+S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVT++ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   GPEL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

XP_023517976.1 uncharacterized protein LOC111781548 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.4e-199; identity: 78.56%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSDPLPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NA
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
        DSFGGNF+AEKRFSYFD  +N SVP+PCGFLKKFPV+DS                        D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHK+IP R+S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVTE+ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AYT7 uncharacterized protein LOC103484369 isoform X3 (e value: 5.4e-173; identity: 76.57%)
Query:  MASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRN--NHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFY
        MA+TSTPFP S PL  PL SSQ M SNSSS SSNL YL+AN+DSF GNFTA KRFS+FD+R+  N +VPVPCGFLKKFPV DS                 
Subjt:  MASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRN--NHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFY

Query:  DFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIP-RSSSPDI-IGAWRIVRVSSKNL-----------
               DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTV+GLENHKII  ++SSPDI IGAWRIVRVSSKNL           
Subjt:  DFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIP-RSSSPDI-IGAWRIVRVSSKNL-----------

Query:  --------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPD
                      IW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGLKPWSP+KLPYT+DVPD
Subjt:  --------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPD

Query:  SALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSC
        SALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEVFEQVALEYRHNLK+  + GP+L P ISKPK+TKRAGPDLLYVNGSC
Subjt:  SALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSC

Query:  CSKCQKYLLQMWGD
        CSKC  YLLQMWG+
Subjt:  CSKCQKYLLQMWGD
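In the alignment blocks, the unlabelled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below counts identities and positives for a single block. The function name `midline_stats` is hypothetical, and the per-block figure it prints is not expected to equal the whole-alignment % identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, aligned_length) for one alignment block.

    Both strings must already have the 'Query:  ' style prefix removed,
    so that column i of the midline lines up with column i of the query.
    """
    # Trailing spaces on the midline are often stripped; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Final block of the A0A1S3AYT7 alignment, prefixes removed.
query = "CSKCQKYLLQMWGD"
midline = "CSKC  YLLQMWG+"

identical, positive, length = midline_stats(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%), "
      f"{positive}/{length} positive")
# prints: 11/14 identical (78.57%), 12/14 positive
```

Summing the counts over every block of a hit, rather than a single block, would be the natural way to compare against the reported whole-alignment percentage.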

A0A6J1CXF7 uncharacterized protein LOC111015718 (e value: 1.1e-168; identity: 69.72%)
Query:  LCFSNRNSSASLFFISSPPSSSLSTLL-SPPPNASSDPLPSTPSN-SLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSF
        LCFS     +S+F       SS   L  S P +    PL S PS+       LP + ST +           L+  QV+  NSS RSSNLRYL+ANAD+F
Subjt:  LCFSNRNSSASLFFISSPPSSSLSTLL-SPPPNASSDPLPSTPSN-SLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSF

Query:  GGNFTAEKRFSYFDHRNNH--SVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNV
        GGNFTA+ RFS+FDHRNN+  SV VPCGFLKKFPV+DS                        DRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NV
Subjt:  GGNFTAEKRFSYFDHRNNH--SVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNV

Query:  CFFMFVDETTVRGLENHKIIPRSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKH
        CFFMFVDETTVRGLE+H +I R+SSPDIIGAWRIVRVS+KNL                         IWIDAKLQLMVDPLLLIHTLIVTENADMAISKH
Subjt:  CFFMFVDETTVRGLENHKIIPRSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAISKH

Query:  PYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKI
        PYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWS  KLPYT+DVPDSA ILR+H R S+LFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKI
Subjt:  PYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKI

Query:  NMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        NMFE EVFEQVALEYRHNLK+KG+GGP+LGPHISKPK+TKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  NMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

A0A6J1EEZ8 uncharacterized protein LOC111433726 isoform X1 (e value: 1.2e-199; identity: 78.56%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSDPLPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NA
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
        DSFGGNF+AEKRFSYFD  +N SVP+PCGFLKKFPV+DS                        D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP  +S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVTE+ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

A0A6J1EFS8 uncharacterized protein LOC111433726 isoform X3 (e value: 6.4e-174; identity: 71.34%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSDPLPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRY     
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
                                                                     L D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP  +S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVTE+ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIPRSSS-PDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSPSKLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   G EL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

A0A6J1KT34 uncharacterized protein LOC111496987 (e value: 7.5e-199; identity: 78.13%)
Query:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA
        +VG RLC+SN++SS SL F  SPP SSLSTLLSPPPNASSD LPS PS+SLSSP  PPMASTSTPFPPSAP APPLSSSQVM  NSS RSSNLRYL  NA
Subjt:  KVGRRLCFSNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANA

Query:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
        DSFGGNF+AEKRFSYFD  +N SVP+PCGFLK FPV+DS                        D+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN
Subjt:  DSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDN

Query:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS
        VCFFMFVD+TTVRGLENHKIIP R+S PDIIGAWRIVRVS+KNL                         IW+DAKLQLMVDPLLLIH+LIVT++ADMAIS
Subjt:  VCFFMFVDETTVRGLENHKIIP-RSSSPDIIGAWRIVRVSSKNL-------------------------IWIDAKLQLMVDPLLLIHTLIVTENADMAIS

Query:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI
        KHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPYT+DVPDSALILRRHGRGS+LFSCLLFNELEAFNPRDQLAFAFVRDHLTP I
Subjt:  KHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI

Query:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
        KINMFEGEVFEQVALEYRHNLK K   GPEL P ISKP +TKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS
Subjt:  KINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVS

SwissProt top hits (e value, % identity, alignment)
Q9FZ97 Probable hexosyltransferase MUCI70 (e value: 1.1e-32; identity: 31.47%)
Query:  PPPNASSDPLP-STPSNSLSSPTLPPMASTSTPFPPSAPLA-PPLSSSQVMQSNSSSRS---SNLRYLVA---------NADSFGGNFTAEKRFSYFDHR
        PP +     LP   P NS + P  PP A      P   P+   P+  +  +  N+ S S    NL Y+               FGG  T + R   FD +
Subjt:  PPPNASSDPLP-STPSNSLSSPTLPPMASTSTPFPPSAPLA-PPLSSSQVMQSNSSSRS---SNLRYLVA---------NADSFGGNFTAEKRFSYFDHR

Query:  NNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHK
           S  V CGF+K       G    RN+          F+    D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  +
Subjt:  NNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHK

Query:  IIPRSSSPDIIGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKW
         +  +     +G WR+V V                        +++  +WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K 
Subjt:  IIPRSSSPDIIGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKW

Query:  WDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHL
        +D  S+  Q++ Y   GL P+S +KLP TSDVP+  +ILR H   S+LF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  WDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hits (e value, % identity, alignment)
AT1G28240.1 Protein of unknown function (DUF616) (e value: 7.5e-34; identity: 31.47%)
Query:  PPPNASSDPLP-STPSNSLSSPTLPPMASTSTPFPPSAPLA-PPLSSSQVMQSNSSSRS---SNLRYLVA---------NADSFGGNFTAEKRFSYFDHR
        PP +     LP   P NS + P  PP A      P   P+   P+  +  +  N+ S S    NL Y+               FGG  T + R   FD +
Subjt:  PPPNASSDPLP-STPSNSLSSPTLPPMASTSTPFPPSAPLA-PPLSSSQVMQSNSSSRS---SNLRYLVA---------NADSFGGNFTAEKRFSYFDHR

Query:  NNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHK
           S  V CGF+K       G    RN+          F+    D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  +
Subjt:  NNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHK

Query:  IIPRSSSPDIIGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKW
         +  +     +G WR+V V                        +++  +WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K 
Subjt:  IIPRSSSPDIIGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKW

Query:  WDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHL
        +D  S+  Q++ Y   GL P+S +KLP TSDVP+  +ILR H   S+LF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  WDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616) (e value: 5.0e-30; identity: 30.77%)
Query:  PPPNASSDPLPS---TPSNSLSSPTLPPMASTSTPFPPSAPLA--PPLSSSQVMQSN--SSSRSSNLRYL----------VANADSFGGNFTAEKRFSYF
        PPP      LPS    P +S S P  PP      P P   P+   PP  +   M      S    NL Y+                FGG  + E R + F
Subjt:  PPPNASSDPLPS---TPSNSLSSPTLPPMASTSTPFPPSAPLA--PPLSSSQVMQSN--SSSRSSNLRYL----------VANADSFGGNFTAEKRFSYF

Query:  DHRNNHSVPVPCGFLK-KFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGL
        D +   S+ V CGF+K   P   +G               +D +  +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L
Subjt:  DHRNNHSVPVPCGFLK-KFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGL

Query:  ENHKIIPRSSSPDIIGAWRIVRVSS------------------------KNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR
        +N       +    +G WRI+ V +                        +  IW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A   
Subjt:  ENHKIIPRSSSPDIIGAWRIVRVSS------------------------KNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
         +K +D  S+  Q+E Y + GL P++ +KLP TSDVP+   I+R H   ++LF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616) (e value: 6.0e-31; identity: 29.06%)
Query:  NLRYLVANADS------FGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHD
        NL Y+  +  S      FGGN +  +R   F  +    + V CGF+            PR   + S            D+  ++ C   VV + IF+ +D
Subjt:  NLRYLVANADS------FGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHD

Query:  KIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIPRSSSPDI-IGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLI
        +  QP  +  ++++  CF M VDE ++  L  +  + +     I +G WR++ +                         ++  IWID K++L+VDPLL++
Subjt:  KIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIPRSSSPDI-IGAWRIVRV------------------------SSKNLIWIDAKLQLMVDPLLLI

Query:  HTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQ
           +       AI++H ++ +  EEA A  R +K +    + + M+ Y   GL+PWS  K    SDVP+ A+I+R H   ++LFSCL FNE+    PRDQ
Subjt:  HTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNPRDQ

Query:  LAFAFVRDHLTPPIKINMFE
        L+F +V D L    K+ MF+
Subjt:  LAFAFVRDHLTPPIKINMFE

AT5G46220.1 Protein of unknown function (DUF616) (e value: 2.2e-110; identity: 52.19%)
Query:  NSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDH
        + SS   NLRY+   ++SFGGNF+ +KRFSYF+H +N  V VPCGF + FPVS+S                        DR+ ME C G+VV SAIFNDH
Subjt:  NSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDHRNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDH

Query:  DKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIPRSSSPDI-IGAWRIVRVS--------------------------SKNLIWIDAKLQLMVDPL
        DKIRQP GLG KTL+ VCF+MF+D+ T+  L +H +I +++  D  +GAWRI+++S                          SK  IW+DAK+QLM+DPL
Subjt:  DKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIPRSSSPDI-IGAWRIVRVS--------------------------SKNLIWIDAKLQLMVDPL

Query:  LLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNP
        LLIH+++V    DMAISKHP++++TMEEAMATARWKKW DVD L++QMETYCE+GLKPWS SKLPY +DVPD+ALILRRHG  S+LFSC +FNELEAFNP
Subjt:  LLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHGRGSDLFSCLLFNELEAFNP

Query:  RDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCC----SKCQKYLLQMWG
        RDQLAFAFVRDH+ P +K+NMFE EVFEQV +EYRHNLK+      E      K +  +       +++        S C+ YL  MWG
Subjt:  RDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCC----SKCQKYLLQMWG

AT5G46230.1 Protein of unknown function, DUF538 (e value: 2.0e-31; identity: 47.46%)
Query:  KVSRVLDQFFLPRGLLPLNDIIEVGYNRTSGFIWLKQQKKKEHRFAAIGRTVLYDTEVTAFIEERRLRRLTGVKSKEFFLSITVSDIYIDEQNSSRITFG
        K   +L    LP+GLLPL+++ E+G+N+++G++W+K + K +HRF AIGR V YD+EVTA +E RR+ +LTG+KSKE  + +T+S+I+++ Q+ ++ITF 
Subjt:  KVSRVLDQFFLPRGLLPLNDIIEVGYNRTSGFIWLKQQKKKEHRFAAIGRTVLYDTEVTAFIEERRLRRLTGVKSKEFFLSITVSDIYIDEQNSSRITFG

Query:  TLTGISKSFPVSAFQLEE
          TG+S++FPV+AF+ +E
Subjt:  TLTGISKSFPVSAFQLEE


Sequences
CDS sequence
ATGTCCATTGGAGCTCTACATGGTGGCAAACTAGCCATGAAAAGATTGCTTCTATACCATAAAATGCGAGCAAACCAGAAAATGCAAGATGGAGCTCTGGAGAAGTTAGA
GAAAATGATCAAAGACGATACTCCTGTTTTCCCAAAGCTGCACTGTGCAAAAGAGCGTTCTGAAAATCTGGAAATGAGAGGACAAGAAGACAGAGCTATTGAATTATTAA
AGAAAGCAGCAAAAGAAGCCAAGGAGAATTCACTTTTGCACTATGAATATGAATATCAGATGCTTCTTGTGGAAACGCTCATTTATAAGGTTGGTCGACGCCTCTGCTTT
TCCAATCGAAACTCCTCTGCTTCTCTCTTCTTTATCTCTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTCTCTCCTCCACCAAATGCCTCTTCCGATCCTCTCCCTTC
GACCCCATCCAATTCCCTCTCTTCTCCTACCCTTCCTCCTATGGCGAGCACAAGTACGCCATTCCCACCCTCCGCTCCACTTGCTCCTCCCCTGTCTTCTTCTCAGGTAA
TGCAGTCCAATTCTTCTTCGCGATCGTCTAATTTGAGGTATCTCGTCGCCAATGCCGATAGTTTCGGAGGGAATTTCACTGCCGAGAAGAGATTTTCTTACTTCGATCAT
CGGAATAATCATAGCGTACCGGTTCCTTGTGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGGTTGGTTTTCACCGCGAAATTCGGATCAATTTTCTGTTCGTATTTT
TTATGATTTCAACACGTTTCTTACTGATCGAATTGCCATGGAAAGCTGCAATGGTGTGGTCGTGGTTTCCGCAATCTTCAACGATCATGATAAAATCCGGCAACCGAGAG
GCCTCGGATCGAAAACTTTGGATAACGTATGCTTCTTCATGTTCGTAGACGAAACTACAGTGAGAGGACTCGAAAACCACAAAATAATTCCCAGAAGCTCATCTCCAGAC
ATAATTGGGGCTTGGAGAATTGTGAGAGTTTCAAGCAAGAATCTCATATGGATTGACGCGAAGCTTCAGTTGATGGTCGATCCATTGTTGTTGATTCATACGTTGATTGT
GACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGA
AGATGCAAATGGAGACTTACTGTGAAAATGGGTTGAAGCCATGGAGTCCCAGCAAGCTTCCCTATACCTCAGATGTACCAGATAGTGCCTTAATCTTGAGGAGACATGGA
AGGGGAAGCGATCTGTTCTCTTGCCTTCTGTTCAATGAGTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGACCATTTGACCCCACCCATCAA
AATCAACATGTTTGAAGGAGAAGTATTCGAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAGGAAAGGATTTGGTGGGCCTGAACTGGGCCCCCACATCTCCAAGC
CCAAACAAACCAAAAGGGCCGGCCCTGATTTGTTGTATGTCAATGGCAGCTGTTGCAGCAAGTGCCAGAAATATCTTCTCCAGATGTGGGGTGACGTTTCAATTACCAAA
TCGAGCACAACACCAGAGAATGACGACGCAACTCGCAAGCCACAGAGCAGACGCCGAGATCTACCATGGCGACGCTCTCTGCAAGCAAAAGTCTCAAGAGTCCTCGATCA
ATTCTTTCTTCCCCGAGGCCTTCTCCCCTTAAATGACATCATCGAGGTCGGCTACAATCGGACGTCGGGCTTCATCTGGCTCAAGCAGCAGAAGAAGAAGGAGCACCGGT
TCGCCGCCATCGGACGCACCGTCTTATACGACACCGAGGTCACAGCCTTCATCGAGGAGCGTCGCCTGCGCCGACTCACCGGAGTTAAGAGCAAGGAGTTTTTTCTTTCG
ATCACCGTCTCCGATATTTACATTGATGAACAGAACTCGAGTAGGATTACGTTCGGTACTCTGACTGGGATTTCGAAGTCCTTCCCGGTCTCTGCTTTTCAGCTTGAAGA
AGAGAATGATCAGAAGAAGTGA
mRNA sequence
ATGTCCATTGGAGCTCTACATGGTGGCAAACTAGCCATGAAAAGATTGCTTCTATACCATAAAATGCGAGCAAACCAGAAAATGCAAGATGGAGCTCTGGAGAAGTTAGA
GAAAATGATCAAAGACGATACTCCTGTTTTCCCAAAGCTGCACTGTGCAAAAGAGCGTTCTGAAAATCTGGAAATGAGAGGACAAGAAGACAGAGCTATTGAATTATTAA
AGAAAGCAGCAAAAGAAGCCAAGGAGAATTCACTTTTGCACTATGAATATGAATATCAGATGCTTCTTGTGGAAACGCTCATTTATAAGGTTGGTCGACGCCTCTGCTTT
TCCAATCGAAACTCCTCTGCTTCTCTCTTCTTTATCTCTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTCTCTCCTCCACCAAATGCCTCTTCCGATCCTCTCCCTTC
GACCCCATCCAATTCCCTCTCTTCTCCTACCCTTCCTCCTATGGCGAGCACAAGTACGCCATTCCCACCCTCCGCTCCACTTGCTCCTCCCCTGTCTTCTTCTCAGGTAA
TGCAGTCCAATTCTTCTTCGCGATCGTCTAATTTGAGGTATCTCGTCGCCAATGCCGATAGTTTCGGAGGGAATTTCACTGCCGAGAAGAGATTTTCTTACTTCGATCAT
CGGAATAATCATAGCGTACCGGTTCCTTGTGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGGTTGGTTTTCACCGCGAAATTCGGATCAATTTTCTGTTCGTATTTT
TTATGATTTCAACACGTTTCTTACTGATCGAATTGCCATGGAAAGCTGCAATGGTGTGGTCGTGGTTTCCGCAATCTTCAACGATCATGATAAAATCCGGCAACCGAGAG
GCCTCGGATCGAAAACTTTGGATAACGTATGCTTCTTCATGTTCGTAGACGAAACTACAGTGAGAGGACTCGAAAACCACAAAATAATTCCCAGAAGCTCATCTCCAGAC
ATAATTGGGGCTTGGAGAATTGTGAGAGTTTCAAGCAAGAATCTCATATGGATTGACGCGAAGCTTCAGTTGATGGTCGATCCATTGTTGTTGATTCATACGTTGATTGT
GACTGAGAATGCAGATATGGCCATTTCCAAACATCCTTATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGA
AGATGCAAATGGAGACTTACTGTGAAAATGGGTTGAAGCCATGGAGTCCCAGCAAGCTTCCCTATACCTCAGATGTACCAGATAGTGCCTTAATCTTGAGGAGACATGGA
AGGGGAAGCGATCTGTTCTCTTGCCTTCTGTTCAATGAGTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGACCATTTGACCCCACCCATCAA
AATCAACATGTTTGAAGGAGAAGTATTCGAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAGGAAAGGATTTGGTGGGCCTGAACTGGGCCCCCACATCTCCAAGC
CCAAACAAACCAAAAGGGCCGGCCCTGATTTGTTGTATGTCAATGGCAGCTGTTGCAGCAAGTGCCAGAAATATCTTCTCCAGATGTGGGGTGACGTTTCAATTACCAAA
TCGAGCACAACACCAGAGAATGACGACGCAACTCGCAAGCCACAGAGCAGACGCCGAGATCTACCATGGCGACGCTCTCTGCAAGCAAAAGTCTCAAGAGTCCTCGATCA
ATTCTTTCTTCCCCGAGGCCTTCTCCCCTTAAATGACATCATCGAGGTCGGCTACAATCGGACGTCGGGCTTCATCTGGCTCAAGCAGCAGAAGAAGAAGGAGCACCGGT
TCGCCGCCATCGGACGCACCGTCTTATACGACACCGAGGTCACAGCCTTCATCGAGGAGCGTCGCCTGCGCCGACTCACCGGAGTTAAGAGCAAGGAGTTTTTTCTTTCG
ATCACCGTCTCCGATATTTACATTGATGAACAGAACTCGAGTAGGATTACGTTCGGTACTCTGACTGGGATTTCGAAGTCCTTCCCGGTCTCTGCTTTTCAGCTTGAAGA
AGAGAATGATCAGAAGAAGTGA
Protein sequence
MSIGALHGGKLAMKRLLLYHKMRANQKMQDGALEKLEKMIKDDTPVFPKLHCAKERSENLEMRGQEDRAIELLKKAAKEAKENSLLHYEYEYQMLLVETLIYKVGRRLCF
SNRNSSASLFFISSPPSSSLSTLLSPPPNASSDPLPSTPSNSLSSPTLPPMASTSTPFPPSAPLAPPLSSSQVMQSNSSSRSSNLRYLVANADSFGGNFTAEKRFSYFDH
RNNHSVPVPCGFLKKFPVSDSGWFSPRNSDQFSVRIFYDFNTFLTDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVRGLENHKIIPRSSSPD
IIGAWRIVRVSSKNLIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSPSKLPYTSDVPDSALILRRHG
RGSDLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLKRKGFGGPELGPHISKPKQTKRAGPDLLYVNGSCCSKCQKYLLQMWGDVSITK
SSTTPENDDATRKPQSRRRDLPWRRSLQAKVSRVLDQFFLPRGLLPLNDIIEVGYNRTSGFIWLKQQKKKEHRFAAIGRTVLYDTEVTAFIEERRLRRLTGVKSKEFFLS
ITVSDIYIDEQNSSRITFGTLTGISKSFPVSAFQLEEENDQKK
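The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1) and translates only the first 30 and last 12 nucleotides of the CDS as copied from this record. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in
# T, C, A, G order at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; raises on partial codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGTCCATTGGAGCTCTACATGGTGGCAAA"))  # prints: MSIGALHGGK
print(translate("CAGAAGAAGTGA"))                    # prints: QKK*
```

Both fragments agree with the start (`MSIGALHGGK`) and end (`QKK`, followed by a stop codon) of the protein sequence shown above. Translating the full 2,112 nt CDS the same way would be expected to give the 703-residue protein plus the terminal stop.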