; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G024230 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G024230
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationCiama_Chr01:36201271..36205214
RNA-Seq ExpressionCaUC01G024230
SyntenyCaUC01G024230
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134914.1 uncharacterized protein LOC101215259 [Cucumis sativus]2.1e-24489.8Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]2.8e-24991.97Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]1.6e-24488.7Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H RESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-23687.17Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHK+I  RNS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAGPDLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]2.6e-25893.53Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWS+PLLFQSKLLCFSL YLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCSSPVFFSDYWMV NEIH+M S+SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNLRYLLANSD+FGGNFTAERRFSFF Y DYDT++VPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTVKGLENHKIIS +NSS DIIG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLKKQMETYCENGL+PWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEGDAS
        FEQVALEYRHNLK   +GGP+LGPHISKPKRTKRAGPDL YVNGSCCSKCQNYLLQMWGEGD S
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEGDAS

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein1.0e-24489.8Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X11.4e-24991.97Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X23.9e-23688.72Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A6J1CXF7 uncharacterized protein LOC1110157187.8e-24588.7Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H RESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X23.9e-23687.17Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHKII   NS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAGPDLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI702.6e-4833.06Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)1.9e-4933.06Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)2.3e-4737.8Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)2.3e-4737.8Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)3.3e-4635.1Show/hide
Query:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF
        NL Y+  +  S      FGGN + +ER  SF   P+     + V CGF+ +    +S  D+  ++ C   VV + IF+ +D+  QP  +  ++++  CF 
Subjt:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF

Query:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY
        M VDE ++  L  +  + K       +G+WR++ + +   Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H +
Subjt:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY

Query:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINM
        + +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L    K+ M
Subjt:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINM

Query:  FE
        F+
Subjt:  FE

AT5G46220.1 Protein of unknown function (DUF616)9.3e-17465.08Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL
        S PL  +SKLLCFSL YLFS+IFL LY S S ++C+FR SPFDPIQ  LFSYPSSYG+HKYA+PT RSSCSSP+FFSDYW VL EI  + S  SSP  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL

Query:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL
        RY+   S+SFGGNF+ ++RFS+F++ + D   V VPCGF + FPVS+SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL+ VCF+MF+D+ T+  L
Subjt:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL

Query:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA
         +H +I K N S   +G WRI+++S S+NLY NPAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS+++    DMAISKHP++++TMEEAMA
Subjt:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVA
        TARWKKW DVD L+ QMETYCE+GLKPWS +KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV 
Subjt:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVA

Query:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG
        +EYRHNLKKI     +      K        KR K    +   +N    S C+NYL  MWG
Subjt:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACTCTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTC
TTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTTCTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCCATTCCCACTCTCCGCT
CCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGAGATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAATTTGAGGTACCTCCTC
GCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATTTTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCAGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCATGATAAAATTCGGCAACCGAGAGGCC
TTGGATCGAAAACTTTGGATAACGTATGTTTTTTCATGTTTGTTGATGAGACAACGGTAAAAGGATTGGAAAATCACAAGATAATTTCTAAAAGAAACTCATCGCCGGAC
ATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCATAGACTATTTCCAAATTC
TAAATTCAGTATATGGGTGGATGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTGATTATTACTGAAAATGCAGATATGGCTATTTCTAAACATC
CTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTAAAGAAGCAAATGGAAACTTATTGTGAAAATGGCTTG
AAACCATGGAGTCCCAATAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATTTTGAGGAGACATGGAAGGGAAAGCAACCTATTTTCATGCCTTTTGTTCAA
CGAATTGGAAGCTTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGCAGAAGTTTTCGAACAAG
TTGCTTTGGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCTCAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGCCGGCCCTGATTTGTTG
TATGTCAATGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGGTGATGCTTCCTGA
mRNA sequenceShow/hide mRNA sequence
TCGAAATCCAGAGGCTATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACTCTTTTACCTTTTCTCCTCCATCTTCCTCGCTC
TCTACACTTCTTTCTCTTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTTCTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCC
ATTCCCACTCTCCGCTCCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGAGATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAA
TTTGAGGTACCTCCTCGCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATTTTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCAGTTC
CTTGTGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCATGATAAAATT
CGGCAACCGAGAGGCCTTGGATCGAAAACTTTGGATAACGTATGTTTTTTCATGTTTGTTGATGAGACAACGGTAAAAGGATTGGAAAATCACAAGATAATTTCTAAAAG
AAACTCATCGCCGGACATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCATA
GACTATTTCCAAATTCTAAATTCAGTATATGGGTGGATGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTGATTATTACTGAAAATGCAGATATG
GCTATTTCTAAACATCCTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTAAAGAAGCAAATGGAAACTTA
TTGTGAAAATGGCTTGAAACCATGGAGTCCCAATAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATTTTGAGGAGACATGGAAGGGAAAGCAACCTATTTT
CATGCCTTTTGTTCAACGAATTGGAAGCTTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGCA
GAAGTTTTCGAACAAGTTGCTTTGGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCTCAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGC
CGGCCCTGATTTGTTGTATGTCAATGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGGTGATGCTTCCTGATTTTACATTTTTGTCCTCC
TTTTCCATCTGCAATGTGTTCGTTCTTCCCAGAGGACAGAAAGGTCAGAAGGAAAAAGCTTGAGGGGTCAGCCGCTCCATACATTCTCTTTTCCTTTCTACCATTACTAT
ACATACGACGCCGTTTGGAAGCTATCAGCAAGTC
Protein sequenceShow/hide protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNLRYLL
ANSDSFGGNFTAERRFSFFHYPDYDTSSVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPD
IIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGL
KPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEAEVFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLL
YVNGSCCSKCQNYLLQMWGEGDAS