; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC01G024090 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC01G024090
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationCmU531Chr01:35588321..35592397
RNA-Seq ExpressionCmUC01G024090
SyntenyCmUC01G024090
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134914.1 uncharacterized protein LOC101215259 [Cucumis sativus]4.0e-24389.37Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR++LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]4.1e-24891.54Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VP+PCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]1.4e-24388.26Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV +PCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H RESNLFSCLLFNELEAFNPRDQLAFAFVR+HLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]2.3e-23586.96Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHK+I  RNS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAGPDLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]8.2e-25792.89Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWS+PLLFQSKLLCFSL YLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCSSPVFFSDYWMV NEIH+M S+SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNLRYLLANSD+FGGNFTAERRFSFF Y DYDT++VP+PCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTVKGLENHKIIS +NSS DIIG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV
        EEAMATARWKKWWDVDSLKKQMETYCENGL+PWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEGDAS
        FEQVALEYRHNLK   +GGP+LGPHISKPKRTKRAGPDL YVNGSCCSKCQNYLLQMWGEGD S
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEGDAS

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein1.9e-24389.37Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCSSPVFFSDYWMVLNEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFF Y DYD ++VP+PCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLENHK++S +N+SPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR++LTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X12.0e-24891.54Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY  ++VP+PCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X22.5e-23588.5Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LFSYPSSYG+HKYA+PTLRSSCS+PVFFSDYWMV NEI  M SNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFF Y DY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLENHKIIS +NSSPDI IG WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLENHKIISKRNSSPDI-IGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTP IKINMFE E
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETE

Query:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNLKK  + GPQL P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A6J1CXF7 uncharacterized protein LOC1110157186.6e-24488.26Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLFSYPSSYG+HKYA+PT+RS+CSSPVFF DYWMVLN+I  +H NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTA+ RFSFF + + + SSV +PCGFLKKFPV DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+H +IS RNSSPDIIG WRIVRVS+KNLYENPAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H RESNLFSCLLFNELEAFNPRDQLAFAFVR+HLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLKK G+GGPQLGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X24.3e-23586.96Show/hide
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYG+HKYAIPTLRSSCS+P+FFSDYWMVLNEI  M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSS

Query:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE
          SSNLRYL  N+DSFGGNF+AE+RFS+F     D  SVP+PCGFLKKFPV DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDE

Query:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLENHKII   NS PDIIG WRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWSP+KLPYTTDVPDSALILRRHGR SNLFSCLLFNELEAFNPRDQLAFAFVR+HLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEV

Query:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNLK     G +L P ISKP RTKRAGPDLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI701.7e-4732.51Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ + CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VR+ +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)1.2e-4832.51Show/hide
Query:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP
        P  PI F  +S P  +  + + +  P      + P      ++ + E   +  N+ S S    NL Y+               FGG  T + R   F   
Subjt:  PFDPIQFPLFSYPSSYGQHKYAI--PTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPS---SNLRYLLA---------NSDSFGGNFTAERRFSFFHYP

Query:  DYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV
        +    ++ + CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVDE T   L+  + +         +G+
Subjt:  DYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPDIIGV

Query:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET
        WR+V V +   Y +   NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A     K +D  S+  Q++ 
Subjt:  WRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMET

Query:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHL
        Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VR+ +
Subjt:  YCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHL

AT1G53040.1 Protein of unknown function (DUF616)1.5e-4637.11Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ + CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  R+ +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.5e-4637.11Show/hide
Query:  FGGNFTAERRFSFFHYPDYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE
        FGG  + E R + F   +    S+ + CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVDE T   L+
Subjt:  FGGNFTAERRFSFFHYPDYDTSSVPLPCGFLK--------KFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLE

Query:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
        N    +  N     +G+WRI+ V +   Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  R+ +   +   INMF
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)4.8e-4534.11Show/hide
Query:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPLPCGFLKK--FPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF
        NL Y+  +  S      FGGN + +ER  SF   P+     + + CGF+ +    +   D+  ++ C   VV + IF+ +D+  QP  +  ++++  CF 
Subjt:  NLRYLLANSDS------FGGNFT-AERRFSFFHYPDYDTSSVPLPCGFLKK--FPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFF

Query:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY
        M VDE ++  L  +  + K       +G+WR++ + +   Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H +
Subjt:  MFVDETTVKGLENHKIISKRNSSPDIIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPY

Query:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINM
        + +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V + L    K+ M
Subjt:  YIHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINM

Query:  FE
        F+
Subjt:  FE

AT5G46220.1 Protein of unknown function (DUF616)1.8e-17264.43Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL
        S PL  +SKLLCFSL YLFS+IFL LY S S ++C+FR SPFDPIQ  LFSYPSSYG+HKYA+PT RSSCSSP+FFSDYW VL EI  + S  SSP  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNL

Query:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL
        RY+   S+SFGGNF+ ++RFS+F++ + D   V +PCGF + FPV +SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL+ VCF+MF+D+ T+  L
Subjt:  RYLLANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGL

Query:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA
         +H +I K N S   +G WRI+++S S+NLY NPAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS+++    DMAISKHP++++TMEEAMA
Subjt:  ENHKIISKRNSSPDIIGVWRIVRVS-SKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEVFEQVA
        TARWKKW DVD L+ QMETYCE+GLKPWS +KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVR+H+ P +K+NMFE EVFEQV 
Subjt:  TARWKKWWDVDSLKKQMETYCENGLKPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEVFEQVA

Query:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG
        +EYRHNLKKI     +      K        KR K    +   +N    S C+NYL  MWG
Subjt:  LEYRHNLKKIGFGGPQLGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACTCTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTC
TTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTTCTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCCATTCCCACTCTCCGCT
CCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGAGATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAATTTGAGGTACCTCCTC
GCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATTTTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCACTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCGGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCATGATAAAATTCGGCAACCGAGAGGCC
TTGGATCGAAAACTTTGGATAACGTATGTTTTTTCATGTTTGTTGATGAGACAACGGTAAAAGGATTGGAAAATCACAAGATAATTTCTAAAAGAAACTCATCGCCGGAC
ATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCATAGACTATTTCCAAATTC
TAAATTCAGTATATGGGTGGATGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTGATTATTACTGAAAATGCAGATATGGCTATTTCTAAACATC
CTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTAAAGAAGCAAATGGAAACTTATTGTGAAAATGGCTTG
AAACCATGGAGTCCCAATAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATTTTGAGGAGACATGGAAGGGAAAGCAACCTATTTTCATGCCTTTTGTTCAA
CGAATTGGAAGCTTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGAGCATTTGACCCCACCCATCAAAATCAACATGTTTGAAACAGAAGTTTTCGAACAAG
TTGCTTTGGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCTCAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGCCGGCCCTGATTTGTTG
TATGTCAATGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGGTGATGCTTCCTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTTGTCGCACTGAAAAGAAGGTGGGAGTGAAGTGAGGCAATAAACGACGGAAGAAGAGAAATTTATATTTGAGAAGAAAAGGGAAAATGATGGGAAAAATCGAAATC
CAGAGGCTATGGGGAAGGCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCACTCTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACT
TCTTTCTCTTCCTCCAAATGCCTCTTCCGTTCTTCTCCCTTCGATCCCATCCAGTTTCCTCTCTTCTCTTATCCTTCCTCTTATGGCCAACACAAGTACGCCATTCCCAC
TCTCCGCTCCTCTTGCTCCTCCCCTGTTTTCTTCTCAGATTACTGGATGGTTCTCAACGAGATCCATAAAATGCACTCGAATTCCTCTTCGCCATCCTCCAATTTGAGGT
ACCTCCTCGCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGAGAGGAGATTTTCTTTCTTCCATTATCCAGATTATGATACCAGTAGCGTCCCACTTCCTTGTGGA
TTTCTCAAGAAATTTCCCGTCGGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGTGTGGTTGTGGTTTCGGCGATTTTTAACGATCATGATAAAATTCGGCAACC
GAGAGGCCTTGGATCGAAAACTTTGGATAACGTATGTTTTTTCATGTTTGTTGATGAGACAACGGTAAAAGGATTGGAAAATCACAAGATAATTTCTAAAAGAAACTCAT
CGCCGGACATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTGTACGAAAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCATAGACTATTT
CCAAATTCTAAATTCAGTATATGGGTGGATGCGAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCGTTGATTATTACTGAAAATGCAGATATGGCTATTTC
TAAACATCCTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTAAAGAAGCAAATGGAAACTTATTGTGAAA
ATGGCTTGAAACCATGGAGTCCCAATAAGCTTCCCTATACCACAGATGTACCCGATAGTGCATTAATTTTGAGGAGACATGGAAGGGAAAGCAACCTATTTTCATGCCTT
TTGTTCAACGAATTGGAAGCTTTCAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGAGCATTTGACCCCACCCATCAAAATCAACATGTTTGAAACAGAAGTTTT
CGAACAAGTTGCTTTGGAATATAGGCATAACCTCAAAAAGATAGGATTTGGTGGGCCTCAATTGGGCCCCCACATATCCAAGCCCAAACGAACCAAAAGGGCCGGCCCTG
ATTTGTTGTATGTCAATGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGGTGATGCTTCCTGATTTTACATTTTTGTCCTTTTCAATCTG
CAATGTGTTCGTTCTTCCCAGAGGACAGAAAGGTCAGAAGGAAAAAGCTTGAGGGGTCAGCCGCTCCATACATTCTCATTTCCTTTCTACCATTACTATTCATACGACGC
CTTTTGGAAGCTATCAACAAGTCTTTTTTTTTTTAATTATTAAAAATATTTTTTTTCCGTTGTACATATTTAAAATATTTGTCGATTATGAGAGAAATGATCATGGATTG
AGTCATTGATTTTACTTTCTTATCATAAGAGATATACGGATTCTATCATATAGATAATAACCTGTTTATGTAGTTTGTTATGTGGTTGG
Protein sequenceShow/hide protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFSYPSSYGQHKYAIPTLRSSCSSPVFFSDYWMVLNEIHKMHSNSSSPSSNLRYLL
ANSDSFGGNFTAERRFSFFHYPDYDTSSVPLPCGFLKKFPVGDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDETTVKGLENHKIISKRNSSPD
IIGVWRIVRVSSKNLYENPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGL
KPWSPNKLPYTTDVPDSALILRRHGRESNLFSCLLFNELEAFNPRDQLAFAFVREHLTPPIKINMFETEVFEQVALEYRHNLKKIGFGGPQLGPHISKPKRTKRAGPDLL
YVNGSCCSKCQNYLLQMWGEGDAS