; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G001500 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G001500
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationchr01:1540930..1546726
RNA-Seq ExpressionLsi01G001500
SyntenyLsi01G001500
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616
IPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134914.1 uncharacterized protein LOC101215259 [Cucumis sativus]1.2e-23190.02Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCSSPVFFSDYWMV +EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFFDYRDYD + VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL+SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLE+HK++S KN+SPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GPEL P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]1.2e-23491.38Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY  + VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

XP_008439636.1 PREDICTED: uncharacterized protein LOC103484369 isoform X2 [Cucumis melo]2.5e-22187.98Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]8.0e-22887.73Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLF YPSSYGEHKYA+PT+RS+CSSPVFF DYWMV ++I  +  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTAD RFSFFD+R+ + S V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLESH +ISR NSSPDIIG WRIVRVS+KNLYE PAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        FEQVALEYRHNL+K GYGGP+LGPHISKPKRTKRAGPDLL
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]9.4e-24594.53Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGKPGWS+PLLFQSKLLCFSL YLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLF YPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVF+EIH+MQS+SS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNLRYLLANSD+FGGNFTA+RRFSFFDYRDYDT+ VPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTVKGLE+HKIIS KNSS DIIG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLKKQMETYCENGL+PWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDL
        FEQVALEYRHNL+   YGGPELGPHISKPKRTKRAGPDL
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDL

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein5.8e-23290.02Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCSSPVFFSDYWMV +EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFFDYRDYD + VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL+SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLE+HK++S KN+SPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GPEL P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X15.6e-23591.38Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY  + VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X21.2e-22187.98Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLL
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

A0A6J1CXF7 uncharacterized protein LOC1110157183.9e-22887.73Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLF YPSSYGEHKYA+PT+RS+CSSPVFF DYWMV ++I  +  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTAD RFSFFD+R+ + S V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLESH +ISR NSSPDIIG WRIVRVS+KNLYE PAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        FEQVALEYRHNL+K GYGGP+LGPHISKPKRTKRAGPDLL
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X21.3e-22085.91Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGKPGWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLF YPSSYGEHKYAIPTLRSSCS+P+FFSDYWMV +EI  M  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYL  N+DSFGGNF+A++RFS+FD    D   VP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+HKII   NS PDIIG WRIVRVS+KNLY+ PAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
        FEQVALEYRHNL+     G EL P ISKP RTKRAGPDLL
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI703.1e-4937.86Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  T   R   FD ++     + V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +      +VCF+MFVDE T   L+
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
          + +         +G+WR+V V +   Y     NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)2.2e-5037.86Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  T   R   FD ++     + V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +      +VCF+MFVDE T   L+
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
          + +         +G+WR+V V +   Y     NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.5e-4637.07Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  + + R + FD ++  T    V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE      E
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM
        +H  +   +S  D    +G+WRI+ V +   Y     NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA 
Subjt:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        A    +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.5e-4637.07Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  + + R + FD ++  T    V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE      E
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM
        +H  +   +S  D    +G+WRI+ V +   Y     NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA 
Subjt:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        A    +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)7.3e-4634.22Show/hide
Query:  NLRYLLANSDS------FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFM
        NL Y+  +  S      FGGN +   R   F  +      + V CGF+ +    +S  D+  ++ C   VV + IF+ +D+  QP  +  +++N  CF M
Subjt:  NLRYLLANSDS------FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFM

Query:  FVDETTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYY
         VDE ++  L  +  + +       +G+WR++ + +   Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H ++
Subjt:  FVDETTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
         +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L    K+ MF
Subjt:  IHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  E
        +
Subjt:  E

AT5G46220.1 Protein of unknown function (DUF616)2.0e-16869.19Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNL
        S PL  +SKLLCFSL YLFS+IFL LY S S ++C+FR SPFDPIQ  LF YPSSYGEHKYA+PT RSSCSSP+FFSDYW V  EI  + S  SSP  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNL

Query:  RYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGL
        RY+   S+SFGGNF+  +RFS+F++ + D   V VPCGF + FPVS+SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL +VCF+MF+D+ T+  L
Subjt:  RYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGL

Query:  ESHKIISRKNSSPDIIGVWRIVRVS-SKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA
          H +I + N S   +G WRI+++S S+NLY  PAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS+++    DMAISKHP++++TMEEAMA
Subjt:  ESHKIISRKNSSPDIIGVWRIVRVS-SKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVA
        TARWKKW DVD L+ QMETYCE+GLKPWSS KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV 
Subjt:  TARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVA

Query:  LEYRHNLRK
        +EYRHNL+K
Subjt:  LEYRHNLRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGCCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCTCTGTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTC
TTCCTCCAAATGCCTCTTCCGTTCCTCTCCCTTCGATCCCATTCAGTTTCCTCTCTTCTTCTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACTCTCCGTT
CCTCTTGCTCATCCCCTGTTTTCTTCTCAGATTATTGGATGGTTTTCGATGAGATCCATAAAATGCAGTCGAATTCATCTTCGCCATCCTCCAATTTGAGATACCTCCTC
GCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGATAGGAGATTTTCTTTCTTCGATTATCGAGATTATGATACCAGTATCGTACCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGCGTGGTTGTTGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCC
TTGGATCGAAAACTTTGAATAGCGTATGTTTTTTCATGTTTGTTGATGAAACTACAGTGAAAGGATTGGAAAGTCACAAGATAATTTCTAGAAAAAACTCATCGCCGGAT
ATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTATACGAAACTCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTCCAAATTC
TAAATTCAGTATATGGGTGGATGCAAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATCCATTCATTGATAATTACTGAGAATGCAGATATGGCTATTTCTAAACATC
CTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGCTTG
AAACCATGGAGTTCCCATAAGCTTCCCTATACCACAGATGTGCCTGATAGTGCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAA
CGAATTGGAAGCTTTTAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAACAAG
TTGCTTTGGAATATAGGCATAATCTCAGAAAGTTCGGATATGGTGGGCCTGAATTGGGCCCCCACATCTCCAAGCCCAAACGAACTAAAAGGGCCGGCCCTGATTTGTTA
AACGCTCGATCGGACAAAGGAGAATTACCAAGTCAAGACACCAAAAAAATGACGACGCAACTTGCGACTCACAGAGCAGACGCCGAGATCTACAACGGCGACGCTCTCTG
CAAGCAAAAGTCTCAAGAACTTCTCGATCAATTCCTTCTTCCCCGAGGCCTTCTCCCCTTAAACGATATCCTTGAGGTCGGGTACAATAAGACCTCCGGTTTCATTTGGC
TCAAGCAGCAGAAGAAAAAGGAGCACCGGTTCGCCGCCATCGGACGCACTGTCTTGTACGACACCGAGGTCACCGCCTTCATCGAGGAACGCCGCCTCCGCCGTCTCACC
GGAGTTAAGAGCAAGGAGTTCTTTCTTTCGATTACCGTTTCCGATATTTATATTGATGAACAGAACACGAGTAGGATCACGTTCGGTACTCTGACTGGAATTGCCAAGTC
CTTCCCGGTCTCTGCGTTTCTGATTGAAGAAGAAACTGATCAGAAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
GAAATTTATATGAGGGAAAAAGAAAGAAAATTGATGTGAAAAATCGAAATCCAGAGGCTATGGGGAAGCCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTC
TGTTTCTCTCTGTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTCTTCCTCCAAATGCCTCTTCCGTTCCTCTCCCTTCGATCCCATTCAGTTTCC
TCTCTTCTTCTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACTCTCCGTTCCTCTTGCTCATCCCCTGTTTTCTTCTCAGATTATTGGATGGTTTTCGATG
AGATCCATAAAATGCAGTCGAATTCATCTTCGCCATCCTCCAATTTGAGATACCTCCTCGCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGATAGGAGATTTTCT
TTCTTCGATTATCGAGATTATGATACCAGTATCGTACCGGTTCCTTGTGGATTTCTCAAGAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGG
CGTGGTTGTTGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCCTTGGATCGAAAACTTTGAATAGCGTATGTTTTTTCATGTTTGTTGATGAAA
CTACAGTGAAAGGATTGGAAAGTCACAAGATAATTTCTAGAAAAAACTCATCGCCGGATATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTATACGAA
ACTCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTCCAAATTCTAAATTCAGTATATGGGTGGATGCAAAGCTTCAGTTAATGGTGGATCCATT
GTTGTTGATCCATTCATTGATAATTACTGAGAATGCAGATATGGCTATTTCTAAACATCCTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGA
AATGGTGGGATGTTGATTCTTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGCTTGAAACCATGGAGTTCCCATAAGCTTCCCTATACCACAGATGTGCCTGATAGT
GCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAACGAATTGGAAGCTTTTAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAG
AGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAACAAGTTGCTTTGGAATATAGGCATAATCTCAGAAAGTTCGGATATGGTGGGCCTG
AATTGGGCCCCCACATCTCCAAGCCCAAACGAACTAAAAGGGCCGGCCCTGATTTGTTAAACGCTCGATCGGACAAAGGAGAATTACCAAGTCAAGACACCAAAAAAATG
ACGACGCAACTTGCGACTCACAGAGCAGACGCCGAGATCTACAACGGCGACGCTCTCTGCAAGCAAAAGTCTCAAGAACTTCTCGATCAATTCCTTCTTCCCCGAGGCCT
TCTCCCCTTAAACGATATCCTTGAGGTCGGGTACAATAAGACCTCCGGTTTCATTTGGCTCAAGCAGCAGAAGAAAAAGGAGCACCGGTTCGCCGCCATCGGACGCACTG
TCTTGTACGACACCGAGGTCACCGCCTTCATCGAGGAACGCCGCCTCCGCCGTCTCACCGGAGTTAAGAGCAAGGAGTTCTTTCTTTCGATTACCGTTTCCGATATTTAT
ATTGATGAACAGAACACGAGTAGGATCACGTTCGGTACTCTGACTGGAATTGCCAAGTCCTTCCCGGTCTCTGCGTTTCTGATTGAAGAAGAAACTGATCAGAAGAAGTG
A
Protein sequenceShow/hide protein sequence
MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNLRYLL
ANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLESHKIISRKNSSPD
IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGL
KPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
NARSDKGELPSQDTKKMTTQLATHRADAEIYNGDALCKQKSQELLDQFLLPRGLLPLNDILEVGYNKTSGFIWLKQQKKKEHRFAAIGRTVLYDTEVTAFIEERRLRRLT
GVKSKEFFLSITVSDIYIDEQNTSRITFGTLTGIAKSFPVSAFLIEEETDQKK