; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g33200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g33200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF616)
Genome locationchr1:23325422..23330342
RNA-Seq ExpressionMoc01g33200
SyntenyMoc01g33200
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]2.1e-21585.84Show/hide
Query:  LCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPP------LSSSKIQVVHWNSSLRSSNLRYL
        L + ++  C SL +  S  +  L T LS       S P   IQ    S P+       + P   S   +P       +  ++IQVVHWNSSLRSSNLRYL
Subjt:  LCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPP------LSSSKIQVVHWNSSLRSSNLRYL

Query:  LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH
        LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH
Subjt:  LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH

Query:  NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK
        NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK
Subjt:  NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH
        KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH
Subjt:  KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH

Query:  NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
Subjt:  NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

XP_022926657.1 uncharacterized protein LOC111433726 isoform X1 [Cucurbita moschata]4.1e-21682.19Show/hide
Query:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH
        M+ +N N +  LWE RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSDP PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ 
Subjt:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH

Query:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM
        WNSS RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFM
Subjt:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM

Query:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY
        FVD+TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYY
Subjt:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
        IHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF
Subjt:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        E EVFEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

XP_022926659.1 uncharacterized protein LOC111433726 isoform X3 [Cucurbita moschata]4.9e-19376.18Show/hide
Query:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH
        M+ +N N +  LWE RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSDP PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ 
Subjt:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH

Query:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM
        WNSS RSSNLRYL                                            +D+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFM
Subjt:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM

Query:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY
        FVD+TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYY
Subjt:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
        IHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF
Subjt:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        E EVFEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

XP_023003354.1 uncharacterized protein LOC111496987 [Cucurbita maxima]2.7e-21582.33Show/hide
Query:  ENQNPNLQALWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWN
        EN NP +  LW  RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSD  PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ WN
Subjt:  ENQNPNLQALWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWN

Query:  SSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFV
        SS RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLK FPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFV
Subjt:  SSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFV

Query:  DETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIH
        D+TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT++ADMAISKHPYYIH
Subjt:  DETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIH

Query:  TMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEV
        TMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE 
Subjt:  TMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEV

Query:  EVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        EVFEQVALEYRHNLK K   GP+L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

XP_023517976.1 uncharacterized protein LOC111781548 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-21682.4Show/hide
Query:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH
        M+ +N N +  LWE RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSDP PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ 
Subjt:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH

Query:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM
        WNSS RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFM
Subjt:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM

Query:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY
        FVD+TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYY
Subjt:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
        IHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF
Subjt:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        E EVFEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

TrEMBL top hitse value%identityAlignment
A0A1S3AYT7 uncharacterized protein LOC103484369 isoform X37.9e-18984.44Show/hide
Query:  MASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVS
        MA+TSTPFP S PL  PL SS  Q +  NSS  SSNL YLLAN+D+F GNFTA  RFSFFD+R+  +++V VPCGFLKKFPV DSDRIAME C+GVVVVS
Subjt:  MASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVS

Query:  AIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQ
        AIFNDHDKIRQPRGLGSKTL+NVCFFMFVDETTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQ
Subjt:  AIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQ

Query:  LMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNE
        LMVDPLLLIH+LI+TENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNE
Subjt:  LMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNE

Query:  LEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        LEAFNPRDQLAFAFVRDHLTP IKINMFE EVFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  LEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A6J1CXF7 uncharacterized protein LOC1110157189.9e-21685.84Show/hide
Query:  LCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPP------LSSSKIQVVHWNSSLRSSNLRYL
        L + ++  C SL +  S  +  L T LS       S P   IQ    S P+       + P   S   +P       +  ++IQVVHWNSSLRSSNLRYL
Subjt:  LCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPP------LSSSKIQVVHWNSSLRSSNLRYL

Query:  LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH
        LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH
Subjt:  LANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESH

Query:  NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK
        NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK
Subjt:  NVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWK

Query:  KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH
        KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH
Subjt:  KWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRH

Query:  NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
Subjt:  NLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

A0A6J1EEZ8 uncharacterized protein LOC111433726 isoform X12.0e-21682.19Show/hide
Query:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH
        M+ +N N +  LWE RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSDP PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ 
Subjt:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH

Query:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM
        WNSS RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFM
Subjt:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM

Query:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY
        FVD+TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYY
Subjt:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
        IHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF
Subjt:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        E EVFEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

A0A6J1EFS8 uncharacterized protein LOC111433726 isoform X32.4e-19376.18Show/hide
Query:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH
        M+ +N N +  LWE RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSDP PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ 
Subjt:  MENQNPNLQA-LWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVH

Query:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM
        WNSS RSSNLRYL                                            +D+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFM
Subjt:  WNSSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFM

Query:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY
        FVD+TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYY
Subjt:  FVDETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
        IHTMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF
Subjt:  IHTMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        E EVFEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

A0A6J1KT34 uncharacterized protein LOC1114969871.3e-21582.33Show/hide
Query:  ENQNPNLQALWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWN
        EN NP +  LW  RVGL LCYSNQ+  VSLCFT SPP+  LST LSPPPNASSD  PSI SSSLSSP PPPMASTSTPFPPSAP APPLSSS  QV+ WN
Subjt:  ENQNPNLQALWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWN

Query:  SSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFV
        SS RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLK FPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFV
Subjt:  SSLRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFV

Query:  DETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIH
        D+TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT++ADMAISKHPYYIH
Subjt:  DETTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIH

Query:  TMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEV
        TMEEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE 
Subjt:  TMEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEV

Query:  EVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS
        EVFEQVALEYRHNLK K   GP+L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  EVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDAS

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI705.5e-4634.78Show/hide
Query:  PPYFWLSTHL--SPPPNASSDPPPSIQSSSLSSPTPPPM----ASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS
        PP  +L   L    P N+ + PPP       + P P P+       +    P+AP   P+  + +  ++     R +           FGG  T  +R  
Subjt:  PPYFWLSTHL--SPPPNASSDPPPSIQSSSLSSPTPPPM----ASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS

Query:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD
         FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+    +  N    
Subjt:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD

Query:  IIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK
         +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+ 
Subjt:  IIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK

Query:  MQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
         Q++ Y   GL P+S  KLP T+DVP+   ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  MQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)3.9e-4734.78Show/hide
Query:  PPYFWLSTHL--SPPPNASSDPPPSIQSSSLSSPTPPPM----ASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS
        PP  +L   L    P N+ + PPP       + P P P+       +    P+AP   P+  + +  ++     R +           FGG  T  +R  
Subjt:  PPYFWLSTHL--SPPPNASSDPPPSIQSSSLSSPTPPPM----ASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS

Query:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD
         FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+    +  N    
Subjt:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD

Query:  IIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK
         +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+ 
Subjt:  IIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLK

Query:  MQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
         Q++ Y   GL P+S  KLP T+DVP+   ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  MQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.8e-4433.86Show/hide
Query:  PPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTP------FPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS
        PP F  S H  P  + S  PPP      +  P P P      P        P  P   PL  + +  +   S ++             FGG  + ++R +
Subjt:  PPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTP------FPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS

Query:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD
         FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L   N  S      
Subjt:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD

Query:  IIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKM
         +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A    +K +D  S+  
Subjt:  IIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKM

Query:  QMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  QMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.8e-4433.86Show/hide
Query:  PPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTP------FPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS
        PP F  S H  P  + S  PPP      +  P P P      P        P  P   PL  + +  +   S ++             FGG  + ++R +
Subjt:  PPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTP------FPPSAPLAPPLSSSKIQVVHWNSSLRSSNLRYLLANADTFGGNFTADNRFS

Query:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD
         FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L   N  S      
Subjt:  FFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPD

Query:  IIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKM
         +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A    +K +D  S+  
Subjt:  IIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKM

Query:  QMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  QMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)1.6e-4535.21Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS
        FGGN +   R   F  +      + V CGF+ +    ++  D+  +++C   VV + IF+ +D+  QP  +  +++   CF M VDE ++  L  +  + 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS

Query:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWW
        ++    I +G WR++ + T   Y+ P  NG +PK L HRLFP +++SIWID K++L+VDPLL++   +       AI++H ++ +  EEA A  R +K +
Subjt:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWW

Query:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE
            + + M+ Y   GL+PWS +K    +DVP+ A I+R+H+  +NLFSCL FNE+    PRDQL+F +V D L    K+ MF+
Subjt:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE

AT5G46220.1 Protein of unknown function (DUF616)3.8e-12754.79Show/hide
Query:  CVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRS-----SNLRYLLANADTFGG
        C SL +  S  + +L   LS         P   IQ+   S P+       + P   S+  +P   S    V+    S+ S      NLRY+   +++FGG
Subjt:  CVSLCFTSSPPYFWLSTHLSPPPNA-SSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRS-----SNLRYLLANADTFGG

Query:  NFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSP
        NF+   RFS+F+H N     V VPCGF + FPV++SDR+ ME+C G+VV SAIFNDHDKIRQP GLG KTLE VCF+MF+D+ T+  L  HNVI +N+  
Subjt:  NFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSP

Query:  DI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDS
        D  +GAWRI+++S ++NLY NPAMNGVIPKYL+HRLFPNSKFSIW+DAK+QLM+DPLLLIH+++V    DMAISKHP++++TMEEAMATARWKKW DVD 
Subjt:  DI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDS

Query:  LKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGY
        L++QMETYCE+GLKPWS  KLPY TDVPD+A ILR+H   SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFEVEVFEQV +EYRHNLKK   
Subjt:  LKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGY

Query:  GGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG
           +      K        KR K    +   +N    S C+ YL  MWG
Subjt:  GGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATCAAAATCCCAATCTGCAGGCGTTATGGGAAAGGCGGGTTGGTCTACACCTCTGCTATTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCC
TCCATATTTCTGGCTCTCTACACATCTCTCTCCTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGG
CGAGCACAAGTACGCCCTTCCCACCGTCCGCTCCACTTGCTCCTCCCCTGTCTTCTTCCAAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGG
TATCTCCTCGCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGG
ATTTCTCAAGAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAAC
CGAGAGGGCTCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCC
CCTGACATAATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCC
AAACTCTAAATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGAAAATGCAGATATGGCCATTTCCA
AACATCCTTATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAAT
GGGCTGAAGCCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCATTTATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCT
GTTCAACGAGTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCG
AGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGAT
TTGTTGTACGTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGACGCTTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATCAAAATCCCAATCTGCAGGCGTTATGGGAAAGGCGGGTTGGTCTACACCTCTGCTATTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCC
TCCATATTTCTGGCTCTCTACACATCTCTCTCCTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGG
CGAGCACAAGTACGCCCTTCCCACCGTCCGCTCCACTTGCTCCTCCCCTGTCTTCTTCCAAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGG
TATCTCCTCGCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGG
ATTTCTCAAGAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAAC
CGAGAGGGCTCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCC
CCTGACATAATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCC
AAACTCTAAATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGAAAATGCAGATATGGCCATTTCCA
AACATCCTTATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAAT
GGGCTGAAGCCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCATTTATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCT
GTTCAACGAGTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCG
AGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGAT
TTGTTGTACGTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGACGCTTCCTGA
Protein sequenceShow/hide protein sequence
MENQNPNLQALWERRVGLHLCYSNQNCCVSLCFTSSPPYFWLSTHLSPPPNASSDPPPSIQSSSLSSPTPPPMASTSTPFPPSAPLAPPLSSSKIQVVHWNSSLRSSNLR
YLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSS
PDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCEN
GLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPD
LLYVNGTCCSKCQKYLLQMWGDAS