; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh07G001070 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh07G001070
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of unknown function (DUF616)
Genome locationCma_Chr07:587046..590311
RNA-Seq ExpressionCmaCh07G001070
SyntenyCmaCh07G001070
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594467.1 hypothetical protein SDJN03_11020, partial [Cucurbita argyrosperma subsp. sororia]1.2e-24298.79Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
        WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK FPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR

Query:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM
        GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVT+DAD+AISKHPYYIHTMEEAM
Subjt:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
        ATARWKKWWDVDSLKNQMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
Subjt:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV

Query:  ALEYRHNLKIKA
        ALEYRHNLKIKA
Subjt:  ALEYRHNLKIKA

XP_008439635.1 PREDICTED: uncharacterized protein LOC103484369 isoform X1 [Cucumis melo]4.4e-21387.92Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS S+SKCLFR SPFDPIQF LFSYPSSYGEHKYA+PTLRSSCSTP+FFSDYWMV NEIQ ML NSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
          SSNL YL  N+DSF GNF+A KRFS+FD     N +VP+PCGFLK FPV DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT
        TTV+GLENHKII  +NS PDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+T++ADMAISKHPYYIHT
Subjt:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE
        MEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE

Query:  VFEQVALEYRHNLK
        VFEQVALEYRHNLK
Subjt:  VFEQVALEYRHNLK

XP_022926658.1 uncharacterized protein LOC111433726 isoform X2 [Cucurbita moschata]2.7e-24298.79Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
        WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK FPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR

Query:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM
        GLENHKIIPT NSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVT+DADMAISKHPYYIHTMEEAM
Subjt:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
        ATARWKKWWDVDSLKNQMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
Subjt:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV

Query:  ALEYRHNLKIKA
        ALEYRHNLKIKA
Subjt:  ALEYRHNLKIKA

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo]7.0e-24398.79Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
        WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK FPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR

Query:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM
        GLENHK+IPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVT+DADMAISKHPYYIHTMEEAM
Subjt:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
        ATARWKKWWDVDSLKNQMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
Subjt:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV

Query:  ALEYRHNLKIKA
        ALEYRHNLKIKA
Subjt:  ALEYRHNLKIKA

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida]8.9e-21487.23Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWS+PLLFQSKL CFSL YL SSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCS+P+FFSDYWMV NEI  M  +SS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
         +SSNLRYL  N+D+FGGNF+AE+RFS+FD    D  +VP+PCGFLK FPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL +VCFFMFVD+
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM
        TTV+GLENHKII  +NS  DIIGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+T++ADMAISKHPYYIHTM
Subjt:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFEGEV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEV

Query:  FEQVALEYRHNLKIK
        FEQVALEYRHNLK K
Subjt:  FEQVALEYRHNLKIK

TrEMBL top hitse value%identityAlignment
A0A0A0KNW1 Uncharacterized protein2.4e-21287.2Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGK GWSTPLLFQSK FCFSLFYL SSIFLALYTS SSSKCLFR SPFDPIQF LFSYPSSYGEHKYA+PTLRSSCS+P+FFSDYWMVLNEIQ ML NSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
          SSNL YL  N+DSF GNF+A KRFS+FD    DN +VPIPCGFLK FPV+DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVD+
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT
         TV+GLENHK++  +N+ PDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+T++ADMAISKHPYYIHT
Subjt:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE
        MEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTPSIKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE

Query:  VFEQVALEYRHNLK
        VFEQVALEYRHNLK
Subjt:  VFEQVALEYRHNLK

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X12.1e-21387.92Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS S+SKCLFR SPFDPIQF LFSYPSSYGEHKYA+PTLRSSCSTP+FFSDYWMV NEIQ ML NSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
          SSNL YL  N+DSF GNF+A KRFS+FD     N +VP+PCGFLK FPV DSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVD+
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT
        TTV+GLENHKII  +NS PDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLI+T++ADMAISKHPYYIHT
Subjt:  TTVRGLENHKIIPTRNSYPDI-IGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE
        MEEAMATARWKKWWDVDSLK QMETYCENGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGE

Query:  VFEQVALEYRHNLK
        VFEQVALEYRHNLK
Subjt:  VFEQVALEYRHNLK

A0A6J1CXF7 uncharacterized protein LOC1110157181.0e-20784.82Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGK GWSTPLLFQSKL CFSLFYL SSIFLALYTS SS+KCLFR SPFDPIQFPLFSYPSSYGEHKYA+PT+RS+CS+P+FF DYWMVLN+IQV+ WNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
         RSSNLRYL  NAD+FGGNF+A+ RFS+FD    ++ SV +PCGFLK FPV DSD+ AME C+GVVVVSAIFNDHDKIRQPRGLGSKTL+NVCFFMFVD+
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFD----DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM
        TTVRGLE+H +I +RNS PDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN KFSIW+DAKLQLMVDPLLLIH+LIVT++ADMAISKHPYYIHTM
Subjt:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEV
        EEAMATARWKKWWDVDSLK QMETYCENGL+PWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMFE EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEV

Query:  FEQVALEYRHNLKIK
        FEQVALEYRHNLK K
Subjt:  FEQVALEYRHNLKIK

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X21.3e-24298.79Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
        WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK FPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR

Query:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM
        GLENHKIIPT NSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVT+DADMAISKHPYYIHTMEEAM
Subjt:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
        ATARWKKWWDVDSLKNQMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
Subjt:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV

Query:  ALEYRHNLKIKA
        ALEYRHNLKIKA
Subjt:  ALEYRHNLKIKA

A0A6J1ELS0 uncharacterized protein LOC111433726 isoform X43.1e-21289.32Show/hide
Query:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
        MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFR SPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS
Subjt:  MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSS

Query:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
        WRSSNLRYLA                                        DQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR
Subjt:  WRSSNLRYLAGNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVR

Query:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM
        GLENHKIIPT NSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVT+DADMAISKHPYYIHTMEEAM
Subjt:  GLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAM

Query:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
        ATARWKKWWDVDSLKNQMETYCENGLQPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV
Subjt:  ATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQV

Query:  ALEYRHNLKIKA
        ALEYRHNLKIKA
Subjt:  ALEYRHNLKIKA

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI705.5e-4937.85Show/hide
Query:  GNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTT----
        G +D FGG  + + R   FD   ++ + CGF+K         F + ++D   M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVD+ T    
Subjt:  GNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTT----

Query:  --VRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNL-YQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT
           RGL+ +K           +G WR+V V   NL Y +   NG +PK LVHR+FPN ++S+W+D KL+L+VDP  ++   +  ++A  AIS+H      
Subjt:  --VRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNL-YQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        + EA A     K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)3.9e-5037.85Show/hide
Query:  GNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTT----
        G +D FGG  + + R   FD   ++ + CGF+K         F + ++D   M+ C G+VV SA+F+  D ++ P+ +     + VCF+MFVD+ T    
Subjt:  GNADSFGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTT----

Query:  --VRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNL-YQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT
           RGL+ +K           +G WR+V V   NL Y +   NG +PK LVHR+FPN ++S+W+D KL+L+VDP  ++   +  ++A  AIS+H      
Subjt:  --VRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNL-YQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL
        + EA A     K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHL

AT1G53040.1 Protein of unknown function (DUF616)1.1e-4937.93Show/hide
Query:  FGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENHKI
        FGG  S E R + FD   S+ + CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVD+      E H  
Subjt:  FGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENHKI

Query:  IPTRNSYPD---IIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR
        +   +SY D    +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   ++  AIS+H        EA A   
Subjt:  IPTRNSYPD---IIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT1G53040.2 Protein of unknown function (DUF616)1.1e-4937.93Show/hide
Query:  FGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENHKI
        FGG  S E R + FD   S+ + CGF+K         F + +     ++  + V+V SAIF  +D I++P  +      N+ F+MFVD+      E H  
Subjt:  FGGNFSAEKRFSYFDDNISVPIPCGFLK--------NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENHKI

Query:  IPTRNSYPD---IIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR
        +   +SY D    +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   ++  AIS+H        EA A   
Subjt:  IPTRNSYPD---IIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
         +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT4G38500.1 Protein of unknown function (DUF616)3.8e-4534.01Show/hide
Query:  NLRYLAGNADS------FGGNFSAEKRFSYFDDNISVPIPCGFLK--NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD
        NL Y+  +  S      FGGN S  +R   F     + + CGF+      ++  D+  ++ C   VV + IF+ +D+  QP  +  ++++  CF M VD+
Subjt:  NLRYLAGNADS------FGGNFSAEKRFSYFDDNISVPIPCGFLK--NFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDD

Query:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM
         ++  L  +  +         +G WR++ + T   Y  P  NG +PK L HRLFP  ++SIW+D K++L+VDPLL++   +       AI++H ++ +  
Subjt:  TTVRGLENHKIIPTRNSYPDIIGAWRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE
        EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L  + K+ MF+
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFE

AT5G46220.1 Protein of unknown function (DUF616)6.5e-17070.12Show/hide
Query:  STPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSSWRSSNL
        S PL  +SKL CFSL YL S+IFL LY S S ++C+FRYSPFDPIQ  LFSYPSSYGEHKYA+PT RSSCS+PIFFSDYW VL EIQ +L  SS    NL
Subjt:  STPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSSWRSSNL

Query:  RYLAGNADSFGGNFSAEKRFSYFD-DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENH
        RY+ G ++SFGGNFS +KRFSYF+  NI V +PCGF ++FPV++SD+  ME C G+VV SAIFNDHDKIRQP GLG KTL+ VCF+MF+DD T+  L +H
Subjt:  RYLAGNADSFGGNFSAEKRFSYFD-DNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENH

Query:  KIIPTRNSYPDIIGAWRIVRVS-TKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR
         +I   N     +GAWRI+++S ++NLY NPAMNGVIPKYL+HRLFPN KFSIWVDAK+QLM+DPLLLIHS++V  + DMAISKHP++++TMEEAMATAR
Subjt:  KIIPTRNSYPDIIGAWRIVRVS-TKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATAR

Query:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEY
        WKKW DVD L+ QMETYCE+GL+PWS  KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMFE EVFEQV +EY
Subjt:  WKKWWDVDSLKNQMETYCENGLQPWSPGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEY

Query:  RHNLK
        RHNLK
Subjt:  RHNLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGCCGGGTTGGTCTACGCCTCTGCTATTCCAATCAAAGCTCTTCTGTTTCTCTCTGTTTTACCTTATCTCCTCCATTTTCCTCGCTCTCTACACTTCTTTCTC
CTCCTCCAAATGCCTCTTCCGATACTCTCCCTTCGATCCCATCCAGTTCCCTCTCTTCTCCTATCCCTCCTCCTATGGCGAGCACAAGTACGCCATTCCCACCCTCCGCT
CCTCCTGCTCCACCCCTATCTTCTTCTCAGATTATTGGATGGTTTTGAATGAGATCCAGGTAATGCTGTGGAATTCCTCTTGGCGGTCCTCCAATTTGAGGTATCTCGCT
GGTAATGCAGATAGTTTCGGCGGCAATTTCTCTGCCGAAAAGCGATTTTCATATTTCGATGATAATATTTCTGTCCCGATTCCTTGTGGATTTCTCAAGAATTTTCCTGT
GACTGATTCTGATCAAACTGCCATGGAGAGTTGCAACGGCGTGGTTGTGGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCCTCGGATCGAAGA
CTTTGGATAACGTGTGTTTTTTCATGTTTGTTGATGATACCACGGTGAGAGGACTCGAAAACCACAAAATAATTCCTACAAGAAACTCGTATCCGGATATAATCGGAGCT
TGGAGAATCGTGAGAGTTTCAACCAAGAATCTGTACCAAAATCCTGCCATGAATGGCGTAATACCTAAATATTTGGTTCACAGACTATTTCCAAACTGTAAATTCAGTAT
ATGGGTGGACGCCAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCTTTAATTGTGACTCAGGATGCAGATATGGCCATTTCCAAACATCCGTACTATATTC
ACACAATGGAAGAGGCAATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAACCAAATGGAAACTTACTGCGAAAATGGGTTGCAACCATGGAGT
CCCGGTAAGCTTCCGTATACCACAGATGTACCAGATAGTGCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTGTTCAACGAATTGGAAGC
TTTCAACCCAAGGGACCAATTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTCGAAGGAGAAGTTTTCGAGCAAGTTGCTTTGGAAT
ATAGGCACAATCTCAAAATCAAAGCAACTCTTGTTGCAGCAAGTGTCAGAAATATCTTCTCCAGATGTGGGGTGACGTTTCCTGATTTTACTTCTTTGTCCTTTTCTTAT
GCCACGTGTTTGTTCTTCCCCGGGGACAATAGCGTCAAAAGTAAAAGCTTGACAGGTCAGCCACACTACCCCTTTTTTCCTTTCTTCTATTATGGACGATACGGCGTCGT
CTGGATGCCTGTGTAA
mRNA sequenceShow/hide mRNA sequence
GGAATCCAGAGGTGTTATTATGGGGAAGCCGGGTTGGTCTACGCCTCTGCTATTCCAATCAAAGCTCTTCTGTTTCTCTCTGTTTTACCTTATCTCCTCCATTTTCCTCG
CTCTCTACACTTCTTTCTCCTCCTCCAAATGCCTCTTCCGATACTCTCCCTTCGATCCCATCCAGTTCCCTCTCTTCTCCTATCCCTCCTCCTATGGCGAGCACAAGTAC
GCCATTCCCACCCTCCGCTCCTCCTGCTCCACCCCTATCTTCTTCTCAGATTATTGGATGGTTTTGAATGAGATCCAGGTAATGCTGTGGAATTCCTCTTGGCGGTCCTC
CAATTTGAGGTATCTCGCTGGTAATGCAGATAGTTTCGGCGGCAATTTCTCTGCCGAAAAGCGATTTTCATATTTCGATGATAATATTTCTGTCCCGATTCCTTGTGGAT
TTCTCAAGAATTTTCCTGTGACTGATTCTGATCAAACTGCCATGGAGAGTTGCAACGGCGTGGTTGTGGTTTCGGCGATTTTCAACGATCACGATAAAATTCGGCAACCG
AGAGGCCTCGGATCGAAGACTTTGGATAACGTGTGTTTTTTCATGTTTGTTGATGATACCACGGTGAGAGGACTCGAAAACCACAAAATAATTCCTACAAGAAACTCGTA
TCCGGATATAATCGGAGCTTGGAGAATCGTGAGAGTTTCAACCAAGAATCTGTACCAAAATCCTGCCATGAATGGCGTAATACCTAAATATTTGGTTCACAGACTATTTC
CAAACTGTAAATTCAGTATATGGGTGGACGCCAAGCTTCAGTTAATGGTGGATCCATTGTTGTTGATTCATTCTTTAATTGTGACTCAGGATGCAGATATGGCCATTTCC
AAACATCCGTACTATATTCACACAATGGAAGAGGCAATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAACCAAATGGAAACTTACTGCGAAAA
TGGGTTGCAACCATGGAGTCCCGGTAAGCTTCCGTATACCACAGATGTACCAGATAGTGCCTTAATCTTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTT
TGTTCAACGAATTGGAAGCTTTCAACCCAAGGGACCAATTGGCTTTTGCATTTGTGAGAGATCATTTGACCCCATCCATCAAAATCAACATGTTCGAAGGAGAAGTTTTC
GAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAATCAAAGCAACTCTTGTTGCAGCAAGTGTCAGAAATATCTTCTCCAGATGTGGGGTGACGTTTCCTGATTTTAC
TTCTTTGTCCTTTTCTTATGCCACGTGTTTGTTCTTCCCCGGGGACAATAGCGTCAAAAGTAAAAGCTTGACAGGTCAGCCACACTACCCCTTTTTTCCTTTCTTCTATT
ATGGACGATACGGCGTCGTCTGGATGCCTGTGTAA
Protein sequenceShow/hide protein sequence
MGKPGWSTPLLFQSKLFCFSLFYLISSIFLALYTSFSSSKCLFRYSPFDPIQFPLFSYPSSYGEHKYAIPTLRSSCSTPIFFSDYWMVLNEIQVMLWNSSWRSSNLRYLA
GNADSFGGNFSAEKRFSYFDDNISVPIPCGFLKNFPVTDSDQTAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDNVCFFMFVDDTTVRGLENHKIIPTRNSYPDIIGA
WRIVRVSTKNLYQNPAMNGVIPKYLVHRLFPNCKFSIWVDAKLQLMVDPLLLIHSLIVTQDADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKNQMETYCENGLQPWS
PGKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFEGEVFEQVALEYRHNLKIKATLVAASVRNIFSRCGVTFPDFTSLSFSY
ATCLFFPGDNSVKSKSLTGQPHYPFFPFFYYGRYGVVWMPV