; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G079330 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G079330
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationCicolChr04:33595916..33599881
RNA-Seq ExpressionCcUC04G079330
SyntenyCcUC04G079330
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134914.1 uncharacterized protein LOC101215259 [Cucumis sativus]1.0e-24389.59Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAG DLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]1.4e-24891.76Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAG DLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]4.4e-24288.04Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRL PNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAG DLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]1.0e-23586.96Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHK+I  RNS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRL PN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAG DLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]1.3e-25793.32Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWS+PLLFQSKLLCFSL YLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCSSPVFFSDYWMV NEIH+M S+SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNLRYLLANSD+FGGNFTAERRFSFF Y DYDT++VPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTVKGLENHKIIS +NSS DIIG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLKKQMETYCENGL+PWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGEGDAS
        FEQVALEYRHNLK   +GGP+LGPHISKPKRTKRAG DL YVNGSCCSKCQNYLLQMWGEGD S
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGEGDAS

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein5.1e-24489.59Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAG DLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X16.8e-24991.76Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAG DLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X21.9e-23588.5Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRL PNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAG DLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

A0A6J1CXF7 uncharacterized protein LOC1110157182.1e-24288.04Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRL PNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAG DLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X21.9e-23586.96Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHKII   NS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRL PN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAG DLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGE

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI701.3e-4732.78Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+ PN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)9.3e-4932.78Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+ PN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.1e-4637.46Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRL PN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.1e-4637.46Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRL PN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)2.1e-4534.77Show/hide
Query:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF
        NL Y+  +  S      FGGN + +ER  SF   P+     + V CGF+ +    +S  D+  ++ C   VV + IF+ +D+  QP  +  ++++  CF 
Subjt:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF

Query:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY
        M VDE ++  L  +  + K       +G+WR++ + +   Y+ P  NG +PK L HRL P +++SIW+D K++L+VDPLL++   +       AI++H +
Subjt:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY

Query:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINM
        + +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L    K+ M
Subjt:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINM

Query:  FE
        F+
Subjt:  FE

AT5G46220.1 Protein of unknown function (DUF616)7.9e-17364.86Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL
        S PL  +SKLLCFSL YLFS+IFL LY S S ++C+FR SPFDPIQ  LFSYPSSYG+HKYA+PT RSSCSSP+FFSDYW VL EI  + S  SSP  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL

Query:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL
        RY+   S+SFGGNF+ ++RFS+F++ + D   V VPCGF + FPVS+SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL+ VCF+MF+D+ T+  L
Subjt:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL

Query:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA
         +H +I K N S   +G WRI+++S S+NLY NPAMNGVIPKYL+HRL PNSKFSIWVDAK+QLM+DPLLLIHS+++    DMAISKHP++++TMEEAMA
Subjt:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVA
        TARWKKW DVD L+ QMETYCE+GLKPWS +KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV 
Subjt:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVA

Query:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGADLLYVNGSCCSKCQNYLLQMWG
        +EYRHNLKKI     +      K        KR K    +   +N    S C+NYL  MWG
Subjt:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGADLLYVNGSCCSKCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACTCTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCT
TTCTCTTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTTCTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCCATTCCC
ACTCTCCGCTCCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGAGATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAAT
TTGAGGTACCTCCTCGCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATTTTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCA
GTTCCTTGCGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCAT
GATAAAATTCGGCAACCGAGAGGCCTTGGATCGAAAACTTTGGATAACGTATGTTTTTTCATGTTTGTTGATGAGACAACGGTAAAAGGGTTGGAAAATCACAAG
ATAATTTCTAAAAGAAACTCATCGCCGGACATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATA
CCTAAATATTTAGTTCATAGACTACTTCCAAATTCTAAATTCAGTATATGGGTGGATGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTG
ATTATTACTGAAAATGCAGATATGGCTATTTCTAAACATCCTTACTATATTCACACCATGGAAGAGGCTATGGCAACCGCCAGATGGAAGAAATGGTGGGATGTT
GATTCTTTAAAGAAGCAAATGGAAACTTATTGTGAAAATGGCTTGAAACCATGGAGTCCCAATAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATT
TTGAGGAGACACGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAACGAATTGGAAGCTTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGAT
CATTTGACCCCACCCATCAAAATCAACATGTTTGAAGCAGAAGTTTTCGAACAAGTTGCTTTGGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCT
CAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGCCGGCGCTGATTTGTTGTATGTCAATGGTAGCTGCTGCAGCAAGTGCCAAAATTATCTTCTC
CAGATGTGGGGTGAAGGCGATGCTTCCTGA
mRNA sequenceShow/hide mRNA sequence
CCTAAAAAGTGAAAAACAGAAACGGTTATAGTTTTTGTGGCACTGAAAAGAAGGTGGGAGTGAAGTGAGGCAATAAACGACGGAAGAAGAGGAATTTATATTTGA
GAAGGAAAGGGAAAATGATGGGAAAAATCGAAATCCAGAGGCTATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACT
CTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTCTTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTT
CTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCCATTCCCACTCTCCGCTCCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGA
GATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAATTTGAGGTACCTCCTCGCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATT
TTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCAGTTCCTTGCGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAG
TTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCATGATAAAATTCGGCAACCGAGAGGCCTTGGATCGAAAACTTTGGATAACGTATGTTTTTTCAT
GTTTGTTGATGAGACAACGGTAAAAGGGTTGGAAAATCACAAGATAATTTCTAAAAGAAACTCATCGCCGGACATAATTGGGGTTTGGAGAATCGTGAGAGTTTC
TAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCATAGACTACTTCCAAATTCTAAATTCAGTATATGGGTGGATGCGAA
GCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTGATTATTACTGAAAATGCAGATATGGCTATTTCTAAACATCCTTACTATATTCACACCATGGA
AGAGGCTATGGCAACCGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTAAAGAAGCAAATGGAAACTTATTGTGAAAATGGCTTGAAACCATGGAGTCCCAA
TAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATTTTGAGGAGACACGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAACGAATTGGAAGC
TTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGCAGAAGTTTTCGAACAAGTTGCTTT
GGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCTCAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGCCGGCGCTGATTTGTTGTA
TGTCAATGGTAGCTGCTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGGCGATGCTTCCTGATTTTACATTTTTGTCCTCCTTTTCCATCTGCA
ATGTGTTCGTTCTTCCCAGAGGACAGAAAGGTCAGAAGGAAAAAGCTTGAGGGGTCAGCCGCTCAATACATTCTCTTTTCCTTTCTACCATTACTATTCATACGA
CGCCGTTTGGAAGCTATCAACAAGTCTTTT
Protein sequenceShow/hide protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSN
LRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHK
IISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLLPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDV
DSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVALEYRHNLKKIGFGGP
QLGPHISKPKRTKRAGADLLYVNGSCCSKCQNYLLQMWGEGDAS