; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1556 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1556
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationMC01:19725066..19730271
RNA-Seq ExpressionMC01g1556
SyntenyMC01g1556
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594467.1 hypothetical protein SDJN03_11020, partial [Cucurbita argyrosperma subsp. sororia]3.77e-29384.63Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM
        TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT +AD+AISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGDVS
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]1.80e-29485.68Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+  +++V VPCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T NADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]0.099.57Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
        LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTME
        TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVT NADMAISKHPYYIHTME
Subjt:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTME

Query:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
        EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
Subjt:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF

Query:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS
        EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD S
Subjt:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]1.87e-29384.85Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM
        TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT +ADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGDVS
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]5.66e-30186.3Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWS+PLLFQSKLLCFSL YLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMV N+I  +  +SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         +SSNLRYLLAN+DTFGGNFTA+ RFSFFD+R+ + ++V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM
        TTV+GLE+H +IS +NSS DIIGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T NADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVIS-RNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        FEQVALEYRHNLK K YGGP+LGPHISKPKRTKRAGPDL YVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein2.85e-29084.16Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTSLSS+KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMVLN+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+ ++++V +PCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT
         TV+GLE+H ++S +N+SPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T NADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GP+L P ISKPKRTKRAGPDLLYVNG+CCSKC  YLL MWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X18.69e-29585.68Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+  +++V VPCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T NADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X22.00e-27982.86Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R                    D DRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T NADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A6J1CXF7 uncharacterized protein LOC1110157180.099.57Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
        LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTME
        TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVT NADMAISKHPYYIHTME
Subjt:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTME

Query:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
        EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
Subjt:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF

Query:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS
        EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD S
Subjt:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X25.23e-29384.63Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM
        TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT +ADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGDVS
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGDVS

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI708.2e-5039.64Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  T  +R   FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATA
            +  N     +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +   NA  AIS+H      + EA A  
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+   ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)5.8e-5139.64Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  T  +R   FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATA
            +  N     +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +   NA  AIS+H      + EA A  
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+   ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.7e-4737.59Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  + ++R + FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATAR
          N  S       +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A   
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.7e-4737.59Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  + ++R + FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATAR
          N  S       +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A   
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)8.7e-4735.56Show/hide
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS
        FGGN +   R   F  +      + V CGF+ +    ++  D+  +++C   VV + IF+ +D+  QP  +  +++   CF M VDE ++  L  +  + 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS

Query:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATARWKKWW
        ++    I +G WR++ + T   Y+ P  NG +PK L HRLFP +++SIWID K++L+VDPLL++   +  G    AI++H ++ +  EEA A  R +K +
Subjt:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATARWKKWW

Query:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE
            + + M+ Y   GL+PWS +K    +DVP+ A I+R+H+  +NLFSCL FNE+    PRDQL+F +V D L    K+ MF+
Subjt:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE

AT5G46220.1 Protein of unknown function (DUF616)3.1e-16963.77Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNL
        S PL  +SKLLCFSL YLFS+IFL LY SLS  +C+FR SPFDPIQ  LFSYPSSYGEHKYALPT RS+CSSP+FF DYW VL +IQ +   SS +  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNL

Query:  RYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGL
        RY+   +++FGGNF+   RFS+F+H N     V VPCGF + FPV++SDR+ ME+C G+VV SAIFNDHDKIRQP GLG KTLE VCF+MF+D+ T+  L
Subjt:  RYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGL

Query:  ESHNVISRNSSPDI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMA
          HNVI +N+  D  +GAWRI+++S ++NLY NPAMNGVIPKYL+HRLFPNSKFSIW+DAK+QLM+DPLLLIH+++V    DMAISKHP++++TMEEAMA
Subjt:  ESHNVISRNSSPDI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVA
        TARWKKW DVD L++QMETYCE+GLKPWS  KLPY TDVPD+A ILR+H   SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFEVEVFEQV 
Subjt:  TARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVA

Query:  LEYRHNLKKKGYGGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG
        +EYRHNLKK      +      K        KR K    +   +N    S C+ YL  MWG
Subjt:  LEYRHNLKKKGYGGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGGCGGGTTGGTCTACACCTCTGCTATTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCCTCCATATTTCTGGCTCTCTACACATCTCTCTC
CTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGGCGAGCACAAGTACGCCCTTCCCACCGTCCGCT
CCACTTGCTCCTCCCCTGTCTTCTTCCAAGATTATTGGATGGTTTTGAATCAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGGTATCTCCTC
GCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAACCGAGAGGGC
TCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCCCCTGACATA
ATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCCAAACTCTAA
ATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGGAAATGCAGATATGGCCATTTCCAAACATCCTT
ATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAATGGGCTGAAG
CCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCATTTATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCTGTTCAACGA
GTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCGAGCAAGTTG
CTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGATTTGTTGTAC
GTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGACGTTTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAGGCGGGTTGGTCTACACCTCTGCTATTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCCTCCATATTTCTGGCTCTCTACACATCTCTCTC
CTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGGCGAGCACAAGTACGCCCTTCCCACCGTCCGCT
CCACTTGCTCCTCCCCTGTCTTCTTCCAAGATTATTGGATGGTTTTGAATCAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGGTATCTCCTC
GCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAACCGAGAGGGC
TCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCCCCTGACATA
ATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCCAAACTCTAA
ATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGGAAATGCAGATATGGCCATTTCCAAACATCCTT
ATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAATGGGCTGAAG
CCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCATTTATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCTGTTCAACGA
GTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCGAGCAAGTTG
CTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGATTTGTTGTAC
GTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGACGTTTCCTGATTTTTACCTTTTTGTCCCTTTCCTCTGCCACGTGTTTCTTCTTC
CCAGACAGACACAAAGGTCAGGTCAGACGGTTTATCTACTCTTTCTGGTTCTTACGACGCCGTCTGGAAAATTTGAACATTCTTTTTAATGTAAATGTTTTTAAATAAAA
TGAAAATAGGTATAGGAGTTTCATAATTTTAAACTGCGTTATTTTTTAAGTCGTTTTTAATTTTTCTTTTCCAATAGTTTATAAAAGGCTTAAAAGGTGTCAATGTTATG
GTATTCTATTTAATAGGAGTGTTTGGCAACAATTAAAG
Protein sequenceShow/hide protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNLRYLL
ANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPDI
IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTGNADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLK
PWSRRKLPYTTDVPDSAFILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLY
VNGTCCSKCQKYLLQMWGDVS