CuGenDBv2

MS011015 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS011015
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF616)
Genome location: scaffold35:3591761..3596624
RNA-Seq Expression: MS011015
Synteny: MS011015
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006852 - Protein of unknown function DUF616
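
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be split into its parts and the genomic span computed. The names `Location`, `parse_location` and `span_length` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int


def parse_location(text: str) -> Location:
    """Split a location string such as 'scaffold35:3591761..3596624'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("scaffold35:3591761..3596624")
print(loc.seqid, span_length(loc))
```

Under that assumption the span covers the whole gene model on the scaffold, so it is expected to be longer than the CDS listed further down whenever the gene contains introns.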


Homology
GenBank top hits (e value, % identity)
KAG6594467.1 hypothetical protein SDJN03_11020, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-231 | % identity: 85
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM
        TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+AD+AISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo] | e value: 9.7e-234 | % identity: 86.12
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+  +++V VPCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia] | e value: 4.8e-273 | % identity: 99.78
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
        LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME
        TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME
Subjt:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME

Query:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
        EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSA ILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
Subjt:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF

Query:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
Subjt:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 6.9e-232 | % identity: 85.22
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM
        TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida] | e value: 1.0e-238 | % identity: 86.74
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWS+PLLFQSKLLCFSL YLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMV N+I  +  +SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         +SSNLRYLLAN+DTFGGNFTA+ RFSFFD+R+ + ++V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM
        TTV+GLE+H +IS +NSS DIIGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVIS-RNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        FEQVALEYRHNLK K YGGP+LGPHISKPKRTKRAGPDL YVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

TrEMBL top hits (e value, % identity)
A0A0A0KNW1 Uncharacterized protein | e value: 3.1e-230 | % identity: 84.38
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTSLSS+KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMVLN+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+ ++++V +PCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT
         TV+GLE+H ++S +N+SPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+T+NADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GP+L P ISKPKRTKRAGPDLLYVNG+CCSKC  YLL MWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X1 | e value: 4.7e-234 | % identity: 86.12
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R+  +++V VPCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X2 | e value: 7.0e-222 | % identity: 83.3
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTSLS++KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CS+PVFF DYWMV N+IQ +  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
          SSNL YLLAN+D+F GNFTA  RFSFFD+R                    D DRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT
        TTV+GLE+H +IS +NSSPDI IGAWRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHT
Subjt:  TTVRGLESHNVIS-RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE
        MEEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVE

Query:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        VFEQVALEYRHNLKK  Y GPQL P ISKPKRTKRAGPDLLYVNG+CCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A6J1CXF7 uncharacterized protein LOC111015718 | e value: 2.3e-273 | % identity: 99.78
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
        LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME
        TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME
Subjt:  TTVRGLESHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTME

Query:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
        EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSA ILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF
Subjt:  EAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVF

Query:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
Subjt:  EQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X2 | e value: 1.3e-231 | % identity: 85
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSS

Query:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLKKFPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  LRSSNLRYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDE

Query:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM
        TTVRGLE+H +I + NS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVTE+ADMAISKHPYYIHTM
Subjt:  TTVRGLESHNVI-SRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEV

Query:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD
        FEQVALEYRHNLK K   G +L P ISKP RTKRAGPDLLYVNG+CCSKCQKYLLQMWGD
Subjt:  FEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLYVNGTCCSKCQKYLLQMWGD

SwissProt top hits (e value, % identity)
Q9FZ97 Probable hexosyltransferase MUCI70 | e value: 3.6e-50 | % identity: 39.64
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  T  +R   FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATA
            +  N     +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hits (e value, % identity)
AT1G28240.1 Protein of unknown function (DUF616) | e value: 2.6e-51 | % identity: 39.64
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  T  +R   FD +     +++V CGF+K         F + ++D + M++C G+VV SA+F+  D ++ P+ +     E VCF+MFVDE T   L+
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATA
            +  N     +G WR+V V   NL Y +   NG +PK LVHR+FPN+++S+WID KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNL-YENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR+H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616) | e value: 1.7e-47 | % identity: 37.59
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  + ++R + FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR
          N  S       +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A   
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616) | e value: 1.7e-47 | % identity: 37.59
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE
        FGG  + ++R + FD +     S+ V CGF+K         F + +     +++   V+V SAIF  +D I++P  +     +N+ F+MFVDE T   L 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLK--------KFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLE

Query:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR
          N  S       +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIW+DAKLQL+VDP  ++   +   N+  AIS+H        EA A   
Subjt:  SHNVISRNSSPDIIGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R+H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616) | e value: 9.5e-46 | % identity: 35.21
Query:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS
        FGGN +   R   F  +      + V CGF+ +    ++  D+  +++C   VV + IF+ +D+  QP  +  +++   CF M VDE ++  L  +  + 
Subjt:  FGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKK--FPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVIS

Query:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWW
        ++    I +G WR++ + T   Y+ P  NG +PK L HRLFP +++SIWID K++L+VDPLL++   +       AI++H ++ +  EEA A  R +K +
Subjt:  RNSSPDI-IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWW

Query:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE
            + + M+ Y   GL+PWS +K    +DVP+ A+I+R+H+  +NLFSCL FNE+    PRDQL+F +V D L    K+ MF+
Subjt:  DVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE

AT5G46220.1 Protein of unknown function (DUF616) | e value: 1.1e-169 | % identity: 63.99
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNL
        S PL  +SKLLCFSL YLFS+IFL LY SLS  +C+FR SPFDPIQ  LFSYPSSYGEHKYALPT RS+CSSP+FF DYW VL +IQ +   SS +  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNL

Query:  RYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGL
        RY+   +++FGGNF+   RFS+F+H N     V VPCGF + FPV++SDR+ ME+C G+VV SAIFNDHDKIRQP GLG KTLE VCF+MF+D+ T+  L
Subjt:  RYLLANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGL

Query:  ESHNVISRNSSPDI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMA
          HNVI +N+  D  +GAWRI+++S ++NLY NPAMNGVIPKYL+HRLFPNSKFSIW+DAK+QLM+DPLLLIH+++V    DMAISKHP++++TMEEAMA
Subjt:  ESHNVISRNSSPDI-IGAWRIVRVS-TKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVA
        TARWKKW DVD L++QMETYCE+GLKPWS  KLPY TDVPD+ALILR+H   SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFEVEVFEQV 
Subjt:  TARWKKWWDVDSLKMQMETYCENGLKPWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVA

Query:  LEYRHNLKKKGYGGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG
        +EYRHNLKK      +      K        KR K    +   +N    S C+ YL  MWG
Subjt:  LEYRHNLKKKGYGGPQLGPHISK-------PKRTKRAGPDLLYVNGTCCSKCQKYLLQMWG
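
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies those marks for one block and derives a percent identity for that block only. It assumes the caller has already stripped the 8-character `Query:  ` prefix (and the matching indentation of the middle line); `midline_stats` is a hypothetical name, and the figures it yields for a single block need not match the whole-alignment % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one aligned block.

    query   -- the residues of the Query line, without its label
    midline -- the residues/marks of the middle line, same offset
    """
    # Copied text often loses trailing spaces, so pad (or trim) the
    # midline to the length of the query row before comparing.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    length = len(query)
    pct = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "pct_identity": pct,
    }


# First ten columns of the first GenBank alignment block above.
print(midline_stats("MGKAGWSTPL", "MGK GWSTPL"))
```

Summing `identities` and `length` over all blocks of one hit before dividing gives an alignment-wide estimate, which is the closer analogue of the reported value.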


Sequences
CDS sequence
ATGGGAAAGGCGGGTTGGTCTACACCTCTGCTTTTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCCTCCATATTTCTGGCTCTCTACACATCTCTCTC
CTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGGCGAGCACAAGTACGCCCTTCCCACCGTTCGCT
CCACTTGCTCCTCCCCTGTCTTCTTCCAAGATTATTGGATGGTTTTGAATCAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGGTATCTCCTC
GCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAACCGAGAGGGC
TCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCCCCTGACATA
ATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCCAAACTCTAA
ATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGAAAATGCAGATATGGCCATTTCCAAACATCCTT
ATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAATGGGCTGAAG
CCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCCTTGATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCTGTTCAACGA
GTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCGAGCAAGTTG
CTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGATTTGTTGTAC
GTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGAC
mRNA sequence
ATGGGAAAGGCGGGTTGGTCTACACCTCTGCTTTTCCAATCAAAACTGCTGTGTTTCTCTCTGTTTTACCTCTTCTCCTCCATATTTCTGGCTCTCTACACATCTCTCTC
CTCCACCAAATGCCTCTTCCGATCCTCCCCCTTCGATCCAATCCAGTTCCCTCTCTTCTCCTACCCCTCCTCCTATGGCGAGCACAAGTACGCCCTTCCCACCGTTCGCT
CCACTTGCTCCTCCCCTGTCTTCTTCCAAGATTATTGGATGGTTTTGAATCAGATCCAGGTTGTGCACTGGAATTCTTCTCTGCGATCCTCCAATTTGCGGTATCTCCTC
GCCAATGCCGATACATTTGGCGGCAATTTCACTGCCGACAACAGGTTTTCCTTCTTCGATCATCGAAACAACAACGATAGCAGCGTTGCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCGCTGATTCTGATCGAATTGCCATGGAAAGATGCGACGGCGTGGTTGTGGTTTCGGCAATTTTCAACGATCACGACAAAATTCGGCAACCGAGAGGGC
TCGGATCGAAAACTCTGGAGAACGTATGTTTCTTCATGTTTGTGGATGAAACTACGGTGCGAGGACTCGAAAGCCACAACGTAATTTCCAGAAACTCATCCCCTGACATA
ATTGGGGCTTGGAGAATTGTGAGAGTTTCGACCAAAAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCCAAGTATTTAGTTCACAGACTCTTTCCAAACTCTAA
ATTCAGTATATGGATAGACGCGAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATACGCTGATTGTGACTGAAAATGCAGATATGGCCATTTCCAAACATCCTT
ATTATATTCACACCATGGAAGAGGCCATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGATGCAAATGGAAACTTACTGTGAAAATGGGCTGAAG
CCATGGAGTCGCCGCAAGCTTCCCTATACCACAGATGTACCAGATAGTGCCTTGATCTTGAGGAAACATAGCAGGGAAAGCAACTTATTCTCTTGCCTTCTGTTCAACGA
GTTGGAAGCTTTCAATCCAAGAGACCAGTTGGCGTTTGCATTTGTGAGGGACCATCTCACCCCACCCATCAAAATCAACATGTTTGAAGTTGAAGTTTTCGAGCAAGTTG
CTTTGGAATATAGGCACAATCTCAAAAAGAAAGGATATGGTGGGCCTCAACTGGGCCCCCACATCTCCAAGCCCAAACGTACCAAAAGGGCCGGCCCTGATTTGTTGTAC
GTCAATGGCACCTGCTGCAGCAAGTGCCAGAAATATCTTCTCCAAATGTGGGGCGAC
Protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSLSSTKCLFRSSPFDPIQFPLFSYPSSYGEHKYALPTVRSTCSSPVFFQDYWMVLNQIQVVHWNSSLRSSNLRYLL
ANADTFGGNFTADNRFSFFDHRNNNDSSVAVPCGFLKKFPVADSDRIAMERCDGVVVVSAIFNDHDKIRQPRGLGSKTLENVCFFMFVDETTVRGLESHNVISRNSSPDI
IGAWRIVRVSTKNLYENPAMNGVIPKYLVHRLFPNSKFSIWIDAKLQLMVDPLLLIHTLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKMQMETYCENGLK
PWSRRKLPYTTDVPDSALILRKHSRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEVEVFEQVALEYRHNLKKKGYGGPQLGPHISKPKRTKRAGPDLLY
VNGTCCSKCQKYLLQMWGD
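
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1; `translate` and `CODON_TABLE` are names introduced here for illustration, and unknown or incomplete codons are reported as `X` rather than raising an error.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1 (standard code assumed)."""
    # Remove line breaks and spaces so wrapped sequences can be pasted as-is.
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 24 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGGAAAGGCGGGTTGGTCTACA"))
print(translate("CAAATGTGGGGCGAC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on a copied record.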