; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015922 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015922
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationChr03:1475201..1478986
RNA-Seq ExpressionHG10015922
SyntenyHG10015922
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134914.1 uncharacterized protein LOC101215259 [Cucumis sativus]1.6e-24489.42Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCSSPVFFSDYWMV +EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFFDYRDYD + VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL+SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLE+HK++S KN+SPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEAD
        VFEQVALEYRHNL+K  Y GPEL P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE +
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEAD

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]1.4e-24891.54Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY  + VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_008439636.1 PREDICTED: uncharacterized protein LOC103484369 isoform X2 [Cucumis melo]3.0e-23588.29Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia]1.3e-24187.64Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLF YPSSYGEHKYA+PT+RS+CSSPVFF DYWMV ++I  +  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTAD RFSFFD+R+ + S V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLESH +ISR NSSPDIIG WRIVRVS+KNLYE PAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEA
        FEQVALEYRHNL+K GYGGP+LGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+A
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEA

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]5.5e-26194.4Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGKPGWS+PLLFQSKLLCFSL YLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLF YPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVF+EIH+MQS+SS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNLRYLLANSD+FGGNFTA+RRFSFFDYRDYDT+ VPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTVKGLE+HKIIS KNSS DIIG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLKKQMETYCENGL+PWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEADVS
        FEQVALEYRHNL+   YGGPELGPHISKPKRTKRAGPDL YVNGSCCSKCQNYLLQMWGE DVS
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEADVS

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein7.8e-24589.42Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSK  CFSLFYL SSIFLALYTS SSSKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCSSPVFFSDYWMV +EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        S SSNL YLLANSDSF GNFTA +RFSFFDYRDYD + VP+PCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL+SVCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
         TVKGLE+HK++S KN+SPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIIT+NADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLK+QMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEAD
        VFEQVALEYRHNL+K  Y GPEL P ISKPKRTKRAGPDLLYVNGSCCSKC +YLL MWGE +
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEAD

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X16.8e-24991.54Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY  + VPVPCGFLKKFPV DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X21.5e-23588.29Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYL SSIFLALYTS S+SKCLFRSSPFDPIQF LF YPSSYGEHKYA+PTLRSSCS+PVFFSDYWMVF+EI  M SNSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
        SPSSNL YLLANSDSF GNFTA +RFSFFDYRDY                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
        TTVKGLE+HKIIS KNSSPDI IG WRIVRVSSKNLYE PAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT
Subjt:  TTVKGLESHKIISRKNSSPDI-IGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE
        MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWS +KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGE

Query:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        VFEQVALEYRHNL+K  Y GP+L P ISKPKRTKRAGPDLLYVNGSCCSKC NYLLQMWGE
Subjt:  VFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

A0A6J1CXF7 uncharacterized protein LOC1110157186.2e-24287.64Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGK GWSTPLLFQSKLLCFSLFYLFSSIFLALYTS SS+KCLFRSSPFDPIQFPLF YPSSYGEHKYA+PT+RS+CSSPVFF DYWMV ++I  +  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYLLAN+D+FGGNFTAD RFSFFD+R+ + S V VPCGFLKKFPV+DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVDE
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLESH +ISR NSSPDIIG WRIVRVS+KNLYE PAMNGVIPKYLVHRLFPNSKFSIW+DAKLQLMVDPLLLIH+LI+TENADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEA
        FEQVALEYRHNL+K GYGGP+LGPHISKPKRTKRAGPDLLYVNG+CCSKCQ YLLQMWG+A
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGEA

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X22.8e-23486.09Show/hide
Query:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS
        MGKPGWSTPLLFQSKL CFSLFYL SSIFLALYTSFSSSKCLFRSSPFDPIQFPLF YPSSYGEHKYAIPTLRSSCS+P+FFSDYWMV +EI  M  NSS
Subjt:  MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSS

Query:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE
          SSNLRYL  N+DSFGGNF+A++RFS+FD    D   VP+PCGFLKKFPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVD+
Subjt:  SPSSNLRYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDE

Query:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM
        TTV+GLE+HKII   NS PDIIG WRIVRVS+KNLY+ PAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+TE+ADMAISKHPYYIHTM
Subjt:  TTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEV
Subjt:  EEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEV

Query:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE
        FEQVALEYRHNL+     G EL P ISKP RTKRAGPDLLYVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLLYVNGSCCSKCQNYLLQMWGE

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI702.4e-4937.86Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  T   R   FD ++     + V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +      +VCF+MFVDE T   L+
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
          + +         +G+WR+V V +   Y     NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)1.7e-5037.86Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  T   R   FD ++     + V CGF+K         F + ++D + M+ C G+VV SA+F+  D ++ P+ +      +VCF+MFVDE T   L+
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA
          + +         +G+WR+V V +   Y     NG +PK LVHR+FPN+++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A  
Subjt:  SHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
           K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  RWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.1e-4637.07Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  + + R + FD ++  T    V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE      E
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM
        +H  +   +S  D    +G+WRI+ V +   Y     NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA 
Subjt:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        A    +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.1e-4637.07Show/hide
Query:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE
        FGG  + + R + FD ++  T    V CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE      E
Subjt:  FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLK--------KFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLE

Query:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM
        +H  +   +S  D    +G+WRI+ V +   Y     NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA 
Subjt:  SHKIISRKNSSPD---IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF
        A    +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  ATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)5.7e-4634.22Show/hide
Query:  NLRYLLANSDS------FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFM
        NL Y+  +  S      FGGN +   R   F  +      + V CGF+ +    +S  D+  ++ C   VV + IF+ +D+  QP  +  +++N  CF M
Subjt:  NLRYLLANSDS------FGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKK--FPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFM

Query:  FVDETTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYY
         VDE ++  L  +  + +       +G+WR++ + +   Y+ P  NG +PK L HRLFP +++SIW+D K++L+VDPLL++   +       AI++H ++
Subjt:  FVDETTVKGLESHKIISRKNSSPDIIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF
         +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L    K+ MF
Subjt:  IHTMEEAMATARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMF

Query:  E
        +
Subjt:  E

AT5G46220.1 Protein of unknown function (DUF616)4.3e-17164.43Show/hide
Query:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNL
        S PL  +SKLLCFSL YLFS+IFL LY S S ++C+FR SPFDPIQ  LF YPSSYGEHKYA+PT RSSCSSP+FFSDYW V  EI  + S  SSP  NL
Subjt:  STPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNL

Query:  RYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGL
        RY+   S+SFGGNF+  +RFS+F++ + D   V VPCGF + FPVS+SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL +VCF+MF+D+ T+  L
Subjt:  RYLLANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGL

Query:  ESHKIISRKNSSPDIIGVWRIVRVS-SKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA
          H +I + N S   +G WRI+++S S+NLY  PAMNGVIPKYL+HRLFPNSKFSIWVDAK+QLM+DPLLLIHS+++    DMAISKHP++++TMEEAMA
Subjt:  ESHKIISRKNSSPDIIGVWRIVRVS-SKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVA
        TARWKKW DVD L+ QMETYCE+GLKPWSS KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV 
Subjt:  TARWKKWWDVDSLKKQMETYCENGLKPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVA

Query:  LEYRHNLRKFGYGGPELGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG
        +EYRHNL+K      E      K        KR K    +   +N    S C+NYL  MWG
Subjt:  LEYRHNLRKFGYGGPELGPHISK-------PKRTKRAGPDLLYVNGSCCSKCQNYLLQMWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGCCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCTCTGTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTC
TTCCTCCAAATGCCTCTTCCGTTCCTCTCCCTTCGATCCCATTCAGTTTCCTCTCTTCTTCTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACTCTCCGTT
CCTCTTGCTCATCCCCTGTTTTCTTCTCAGATTATTGGATGGTTTTCGATGAGATCCATAAAATGCAGTCGAATTCATCTTCGCCATCCTCCAATTTGAGATACCTCCTC
GCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGATAGGAGATTTTCTTTCTTCGATTATCGAGATTATGATACCAGTATCGTACCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGCGTGGTTGTTGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCC
TTGGATCGAAAACTTTGAATAGCGTATGTTTTTTCATGTTTGTTGATGAAACTACAGTGAAAGGATTGGAAAGTCACAAGATAATTTCTAGAAAAAACTCATCGCCGGAT
ATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTATACGAAACTCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTCCAAATTC
TAAATTCAGTATATGGGTGGATGCAAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATCCATTCATTGATAATTACTGAGAATGCAGATATGGCTATTTCTAAACATC
CTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGCTTG
AAACCATGGAGTTCCCATAAGCTTCCCTATACCACAGATGTGCCTGATAGTGCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAA
CGAATTGGAAGCTTTTAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAACAAG
TTGCTTTGGAATATAGGCATAATCTCAGAAAGTTCGGATATGGTGGGCCTGAATTGGGCCCCCACATCTCCAAGCCCAAACGAACTAAAAGGGCCGGCCCTGATTTGTTG
TATGTCAACGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGCTGATGTTTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGCCAGGTTGGTCTACTCCTCTGCTTTTCCAATCAAAACTCCTCTGTTTCTCTCTGTTTTACCTTTTCTCCTCCATCTTCCTCGCTCTCTACACTTCTTTCTC
TTCCTCCAAATGCCTCTTCCGTTCCTCTCCCTTCGATCCCATTCAGTTTCCTCTCTTCTTCTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACTCTCCGTT
CCTCTTGCTCATCCCCTGTTTTCTTCTCAGATTATTGGATGGTTTTCGATGAGATCCATAAAATGCAGTCGAATTCATCTTCGCCATCCTCCAATTTGAGATACCTCCTC
GCTAATTCCGATTCTTTCGGCGGCAATTTCACTGCCGATAGGAGATTTTCTTTCTTCGATTATCGAGATTATGATACCAGTATCGTACCGGTTCCTTGTGGATTTCTCAA
GAAATTTCCCGTCAGTGATTCTGATCGGATTGCTATGGAGAGTTGTAACGGCGTGGTTGTTGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCC
TTGGATCGAAAACTTTGAATAGCGTATGTTTTTTCATGTTTGTTGATGAAACTACAGTGAAAGGATTGGAAAGTCACAAGATAATTTCTAGAAAAAACTCATCGCCGGAT
ATAATTGGGGTTTGGAGAATCGTGAGAGTTTCTAGCAAGAATCTATACGAAACTCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTCCAAATTC
TAAATTCAGTATATGGGTGGATGCAAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATCCATTCATTGATAATTACTGAGAATGCAGATATGGCTATTTCTAAACATC
CTTACTATATTCACACCATGGAAGAGGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCTTTGAAGAAGCAAATGGAAACTTACTGTGAAAATGGCTTG
AAACCATGGAGTTCCCATAAGCTTCCCTATACCACAGATGTGCCTGATAGTGCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTTTCATGCCTTTTGTTCAA
CGAATTGGAAGCTTTTAACCCAAGAGACCAGTTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCACCCATCAAAATCAACATGTTTGAAGGAGAAGTTTTCGAACAAG
TTGCTTTGGAATATAGGCATAATCTCAGAAAGTTCGGATATGGTGGGCCTGAATTGGGCCCCCACATCTCCAAGCCCAAACGAACTAAAAGGGCCGGCCCTGATTTGTTG
TATGTCAACGGTAGCTGTTGCAGCAAGTGCCAAAATTATCTTCTCCAGATGTGGGGTGAAGCTGATGTTTCCTGA
Protein sequenceShow/hide protein sequence
MGKPGWSTPLLFQSKLLCFSLFYLFSSIFLALYTSFSSSKCLFRSSPFDPIQFPLFFYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVFDEIHKMQSNSSSPSSNLRYLL
ANSDSFGGNFTADRRFSFFDYRDYDTSIVPVPCGFLKKFPVSDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLNSVCFFMFVDETTVKGLESHKIISRKNSSPD
IIGVWRIVRVSSKNLYETPAMNGVIPKYLVHRLFPNSKFSIWVDAKLQLMVDPLLLIHSLIITENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKKQMETYCENGL
KPWSSHKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPPIKINMFEGEVFEQVALEYRHNLRKFGYGGPELGPHISKPKRTKRAGPDLL
YVNGSCCSKCQNYLLQMWGEADVS