CuGenDBv2

Tan0015769 (gene) of Snake gourd v1 genome

Gene ID: Tan0015769
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF616)
Genome location: LG03:79882351..79886577
RNA-Seq Expression: Tan0015769
Synteny: Tan0015769
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006852 - Protein of unknown function DUF616


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594467.1 hypothetical protein SDJN03_11020, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.0e-246 | % identity: 89.39
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKL CFSLFYL S+IFLALYTSFSSSKCLFRSSPFDPIQF LFSYPSSYGEHKYAIPTLRSSCS+P+FFSDYWMVLN+IQVM WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
        W+SSNLRYLA +ADSFGGNFSAE RFSYFD    D  SVP+PCGFL KFP+TDSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVD+
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLENH +IP RNS PDIIGAWRIVRVSTKNLY NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+AD+AISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLKNQMETYC+NGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K   G E+ PQISKP  TKRAGPDLLYVNGSCCSKCQKYLLQMWGD+S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

XP_022146515.1 uncharacterized protein LOC111015718 [Momordica charantia] | e value: 7.4e-242 | % identity: 87.45
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFS+IFLALYTS SS+KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMVLNQIQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
         +SSNLRYL A+AD+FGGNF+A++RFS+FD+RN + +SV VPCGFL KFP+ DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLE+HNVI  RNSSPDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN+KFSIW+DAKLQLMVDPLLLIH+LIVTENADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLK QMETYC+NGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K YGGP++GP ISKPK TKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

XP_022926658.1 uncharacterized protein LOC111433726 isoform X2 [Cucurbita moschata] | e value: 1.1e-245 | % identity: 89.39
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKL CFSLFYL S+IFLALYTSFSSSKCLFRSSPFDPIQF LFSYPSSYGEHKYAIPTLRSSCS+P+FFSDYWMVLN+IQVM WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
        W+SSNLRYLA +ADSFGGNFSAE RFSYFD    D  SVP+PCGFL KFP+TDSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVD+
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLENH +IP  NS PDIIGAWRIVRVSTKNLY NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLKNQMETYC+NGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K   G E+ PQISKP  TKRAGPDLLYVNGSCCSKCQKYLLQMWGD+S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

XP_023517977.1 uncharacterized protein LOC111781548 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 2.9e-246 | % identity: 89.61
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKL CFSLFYL S+IFLALYTSFSSSKCLFRSSPFDPIQF LFSYPSSYGEHKYAIPTLRSSCS+P+FFSDYWMVLN+IQVM WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
        W+SSNLRYLA +ADSFGGNFSAE RFSYFD    D  SVP+PCGFL KFP+TDSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVD+
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLENH +IP RNS PDIIGAWRIVRVSTKNLY NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLKNQMETYC+NGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K   G E+ PQISKP  TKRAGPDLLYVNGSCCSKCQKYLLQMWGD+S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

XP_038881256.1 uncharacterized protein LOC120072816 [Benincasa hispida] | e value: 1.0e-246 | % identity: 89.35
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWS+PLLFQSKLLCFSL YLFS+IFLALYTSFSSSKCLFRSSPFDPIQF LFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMV N+I  MQ +SS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
         QSSNLRYL A++D+FGGNF+AE RFS+FDYR+YDTT+VPVPCGFL KFP++DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTL SVCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTV+GLENH +I  +NSS DIIGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN+KFSIWVDAKLQLMVDPLLLIHSLI+TENADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLK QMETYC+NGL+PWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD
        FEQVALEYRHNLKNKRYGGPE+GP ISKPK TKRAGPDL YVNGSCCSKCQ YLLQMWG+
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNW1 Uncharacterized protein | e value: 6.4e-239 | % identity: 86.77
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSK  CFSLFYL S+IFLALYTS SSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYA+PTLRSSCSSPVFFSDYWMVLN+IQ M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
          SSNL YL A++DSF GNF+A  RFS+FDYR+YD  +VP+PCGFL KFP++DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT
         TV+GLENH ++  +N+SPDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN+KFSIWVDAKLQLMVDPLLLIHSLI+T+NADMAISKHPYYIHT
Subjt:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE
        MEEAMATARWKKWWDVDSLK QMETYC+NGLKPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRD+LTPSIKINMF+ E
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE

Query:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD
        VFEQVALEYRHNLK  RY GPE+ PQISKPK TKRAGPDLLYVNGSCCSKC  YLL MWG+
Subjt:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD

A0A1S3AZV0 uncharacterized protein LOC103484369 isoform X1 | e value: 1.2e-240 | % identity: 87.42
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKLLCFSLFYL S+IFLALYTS S+SKCLFRSSPFDPIQFSLFSYPSSYGEHKYA+PTLRSSCS+PVFFSDYWMV N+IQ M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
          SSNL YL A++DSF GNF+A  RFS+FDYR+Y   +VPVPCGFL KFP+ DSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT
        TTV+GLENH +I  +NSSPDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN+KFSIWVDAKLQLMVDPLLLIHSLI+TENADMAISKHPYYIHT
Subjt:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE
        MEEAMATARWKKWWDVDSLK QMETYC+NGLKPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ E
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE

Query:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD
        VFEQVALEYRHNLK  RY GP++ PQISKPK TKRAGPDLLYVNGSCCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD

A0A1S3AZW9 uncharacterized protein LOC103484369 isoform X2 | e value: 2.3e-228 | % identity: 84.6
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKLLCFSLFYL S+IFLALYTS S+SKCLFRSSPFDPIQFSLFSYPSSYGEHKYA+PTLRSSCS+PVFFSDYWMV N+IQ M  NSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
          SSNL YL A++DSF GNF+A  RFS+FDYR+Y                    DRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT
        TTV+GLENH +I  +NSSPDI IGAWRIVRVS+KNLY+NPAMNGVIPKYLVHRLFPN+KFSIWVDAKLQLMVDPLLLIHSLI+TENADMAISKHPYYIHT
Subjt:  TTVRGLENHNVIPRRNSSPDI-IGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHT

Query:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE
        MEEAMATARWKKWWDVDSLK QMETYC+NGLKPWSP KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ E
Subjt:  MEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEE

Query:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD
        VFEQVALEYRHNLK  RY GP++ PQISKPK TKRAGPDLLYVNGSCCSKC  YLLQMWG+
Subjt:  VFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGD

A0A6J1CXF7 uncharacterized protein LOC111015718 | e value: 3.6e-242 | % identity: 87.45
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGKAGWSTPLLFQSKLLCFSLFYLFS+IFLALYTS SS+KCLFRSSPFDPIQF LFSYPSSYGEHKYA+PT+RS+CSSPVFF DYWMVLNQIQV+ WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
         +SSNLRYL A+AD+FGGNF+A++RFS+FD+RN + +SV VPCGFL KFP+ DSDRIAME C+GVVVVSAIFNDHDKIRQPRGLGSKTL++VCFFMFVDE
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLE+HNVI  RNSSPDIIGAWRIVRVSTKNLY+NPAMNGVIPKYLVHRLFPN+KFSIW+DAKLQLMVDPLLLIH+LIVTENADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLK QMETYC+NGLKPWS  KLPYTTDVPDSA ILR+H R SNLFSCLLFNELEAFNPRDQLAFAFVRDHLTP IKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K YGGP++GP ISKPK TKRAGPDLLYVNG+CCSKCQKYLLQMWGD S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

A0A6J1EIT1 uncharacterized protein LOC111433726 isoform X2 | e value: 5.4e-246 | % identity: 89.39
Query:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS
        MGK GWSTPLLFQSKL CFSLFYL S+IFLALYTSFSSSKCLFRSSPFDPIQF LFSYPSSYGEHKYAIPTLRSSCS+P+FFSDYWMVLN+IQVM WNSS
Subjt:  MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSS

Query:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE
        W+SSNLRYLA +ADSFGGNFSAE RFSYFD    D  SVP+PCGFL KFP+TDSD+ AMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLD+VCFFMFVD+
Subjt:  WQSSNLRYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDE

Query:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM
        TTVRGLENH +IP  NS PDIIGAWRIVRVSTKNLY NPAMNGVIPKYLVHRLFPN KFSIWVDAKLQLMVDPLLLIHSLIVTE+ADMAISKHPYYIHTM
Subjt:  TTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTM

Query:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV
        EEAMATARWKKWWDVDSLKNQMETYC+NGL+PWSP+KLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF+ EV
Subjt:  EEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEV

Query:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS
        FEQVALEYRHNLK K   G E+ PQISKP  TKRAGPDLLYVNGSCCSKCQKYLLQMWGD+S
Subjt:  FEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCCSKCQKYLLQMWGDIS

SwissProt top hits (e value, % identity, alignment)
Q9FZ97 Probable hexosyltransferase MUCI70 | e value: 2.2e-50 | % identity: 34.71
Query:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE
        FGG  + + R   FD +     ++ V CGF+        T F + ++D + M+ C G+VV SA+F+  D ++ P+ +     ++VCF+MFVDE T   L+
Subjt:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE

Query:  NHNVIPRRNSSPDIIGAWRIVRVSTKNL-YDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMAT
            +         +G WR+V V   NL Y +   NG +PK LVHR+FPNA++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A 
Subjt:  NHNVIPRRNSSPDIIGAWRIVRVSTKNL-YDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMAT

Query:  ARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFDEEVFEQV
            K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +       ++MF +      
Subjt:  ARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFDEEVFEQV

Query:  ALEYRHNLKNKRYGG-----PEMGPQISKPKPTKRAGPDL
         ++  H  + +R+       P   P    P P      DL
Subjt:  ALEYRHNLKNKRYGG-----PEMGPQISKPKPTKRAGPDL

Arabidopsis top hits (e value, % identity, alignment)
AT1G28240.1 Protein of unknown function (DUF616) | e value: 1.5e-51 | % identity: 34.71
Query:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE
        FGG  + + R   FD +     ++ V CGF+        T F + ++D + M+ C G+VV SA+F+  D ++ P+ +     ++VCF+MFVDE T   L+
Subjt:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE

Query:  NHNVIPRRNSSPDIIGAWRIVRVSTKNL-YDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMAT
            +         +G WR+V V   NL Y +   NG +PK LVHR+FPNA++S+W+D KL+L+VDP  ++   +  +NA  AIS+H      + EA A 
Subjt:  NHNVIPRRNSSPDIIGAWRIVRVSTKNL-YDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMAT

Query:  ARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFDEEVFEQV
            K +D  S+  Q++ Y   GL P+S  KLP T+DVP+  +ILR H   SNLF+CL FNE++ F  RDQ++F+ VRD +       ++MF +      
Subjt:  ARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIK--INMFDEEVFEQV

Query:  ALEYRHNLKNKRYGG-----PEMGPQISKPKPTKRAGPDL
         ++  H  + +R+       P   P    P P      DL
Subjt:  ALEYRHNLKNKRYGG-----PEMGPQISKPKPTKRAGPDL

AT1G53040.1 Protein of unknown function (DUF616) | e value: 3.5e-48 | % identity: 38.14
Query:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE
        FGG  S E R + FD +     S+ V CGF+        T F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE T   L+
Subjt:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE

Query:  NHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATA
        N +     N     +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT1G53040.2 Protein of unknown function (DUF616) | e value: 3.5e-48 | % identity: 38.14
Query:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE
        FGG  S E R + FD +     S+ V CGF+        T F + +     ++  + V+V SAIF  +D I++P  +      ++ F+MFVDE T   L+
Subjt:  FGGNFSAESRFSYFDYRNYDTTSVPVPCGFL--------TKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLE

Query:  NHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATA
        N +     N     +G WRI+ V     Y +   NG +PK L+HRLFPN ++SIWVDAKLQL+VDP  ++   +   N+  AIS+H        EA A  
Subjt:  NHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATA

Query:  RWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF
          +K +D  S+  Q+E Y + GL P++  KLP T+DVP+   I+R H   +NLF+C+ FNE++ F  RDQL+FA  RD +   +   INMF
Subjt:  RWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSI--KINMF

AT4G38500.1 Protein of unknown function (DUF616) | e value: 8.7e-47 | % identity: 35
Query:  NLRYLAAHADS------FGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTK--FPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFM
        NL Y+     S      FGGN S   R   F  +      + V CGF+ +    ++  D+  ++ C   VV + IF+ +D+  QP  +  ++++  CF M
Subjt:  NLRYLAAHADS------FGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTK--FPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFM

Query:  FVDETTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYY
         VDE ++  L  +  + +       +G WR++ + T   YD P  NG +PK L HRLFP A++SIW+D K++L+VDPLL++   +       AI++H ++
Subjt:  FVDETTVRGLENHNVIPRRNSSPDIIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYY

Query:  IHTMEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF
         +  EEA A  R +K +    +   M+ Y   GL+PWS  K    +DVP+ A+I+R H   +NLFSCL FNE+    PRDQL+F +V D L  + K+ MF
Subjt:  IHTMEEAMATARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMF

AT5G46220.1 Protein of unknown function (DUF616) | e value: 1.3e-170 | % identity: 63.97
Query:  STPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSSWQSSNL
        S PL  +SKLLCFSL YLFSTIFL LY S S ++C+FR SPFDPIQ  LFSYPSSYGEHKYA+PT RSSCSSP+FFSDYW VL +IQ +   SS    NL
Subjt:  STPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSSWQSSNL

Query:  RYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGL
        RY+   ++SFGGNFS + RFSYF++ N D   V VPCGF   FP+++SDR+ ME C G+VV SAIFNDHDKIRQP GLG KTL++VCF+MF+D+ T+  L
Subjt:  RYLAAHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGL

Query:  ENHNVIPRRNSSPDIIGAWRIVRVS-TKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMA
         +HNVI + N S   +GAWRI+++S ++NLY NPAMNGVIPKYL+HRLFPN+KFSIWVDAK+QLM+DPLLLIHS++V    DMAISKHP++++TMEEAMA
Subjt:  ENHNVIPRRNSSPDIIGAWRIVRVS-TKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMA

Query:  TARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEVFEQVA
        TARWKKW DVD L+ QMETYC++GLKPWS +KLPY TDVPD+ALILRRHG  SNLFSC +FNELEAFNPRDQLAFAFVRDH+ P +K+NMF+ EVFEQV 
Subjt:  TARWKKWWDVDSLKNQMETYCQNGLKPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEVFEQVA

Query:  LEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCC----SKCQKYLLQMWG
        +EYRHNLK       E   +  K +  +       +++        S C+ YL  MWG
Subjt:  LEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLLYVNGSCC----SKCQKYLLQMWG


Sequences
CDS sequence
ATGGGAAAGGCAGGTTGGTCTACGCCTCTGCTATTCCAATCAAAACTGCTCTGTTTCTCTCTGTTTTATCTTTTCTCCACCATCTTCCTCGCTCTCTACACTTCTTTCTC
CTCCTCCAAATGCCTCTTCCGATCCTCTCCCTTCGATCCCATCCAGTTCTCTCTCTTCTCTTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACCCTCCGCT
CCTCCTGCTCCTCCCCTGTCTTCTTCTCAGATTATTGGATGGTTCTGAACCAGATCCAAGTAATGCAGTGGAATTCCTCTTGGCAATCCTCCAATTTGAGGTATCTCGCT
GCTCATGCCGATAGTTTCGGCGGCAATTTCTCTGCCGAGAGCAGATTTTCTTATTTCGATTATCGAAATTATGATACTACTTCTGTTCCGGTTCCTTGTGGATTTCTCAC
AAAATTTCCTCTCACTGATTCTGATCGAATTGCTATGGAGAGTTGCAACGGCGTGGTTGTGGTTTCCGCGATTTTCAACGATCACGATAAAATTCGGCAACCGAGAGGCC
TCGGATCGAAGACTTTGGATAGCGTGTGCTTTTTCATGTTTGTCGATGAAACTACGGTAAGAGGACTCGAAAATCACAACGTAATTCCTAGAAGAAATTCATCCCCGGAT
ATAATTGGGGCTTGGAGAATTGTGAGAGTTTCAACCAAAAATCTGTACGATAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTCCAAACGC
TAAATTCAGTATATGGGTGGACGCCAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATTCGTTGATTGTGACTGAGAATGCAGATATGGCCATTTCTAAACATC
CTTATTATATTCATACAATGGAAGAAGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAACCAAATGGAAACTTACTGTCAAAATGGGTTG
AAACCATGGAGTCCCACTAAGCTTCCATATACCACAGATGTACCAGATAGTGCCTTAATATTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTTTGTTCAA
CGAATTGGAAGCTTTCAACCCAAGGGATCAATTGGCTTTTGCATTTGTGAGAGACCATTTGACCCCATCAATTAAAATCAACATGTTTGACGAAGAAGTTTTCGAGCAAG
TTGCTTTGGAATATAGGCACAATCTCAAAAACAAAAGATATGGTGGGCCTGAAATGGGCCCCCAAATCTCCAAGCCCAAACCAACTAAAAGGGCCGGCCCTGATCTATTG
TATGTCAATGGCAGCTGTTGCAGCAAGTGCCAAAAATATCTTCTCCAGATGTGGGGCGACATTTCCTGA
mRNA sequence
GGTTAATATGGGAAAGGCAGGTTGGTCTACGCCTCTGCTATTCCAATCAAAACTGCTCTGTTTCTCTCTGTTTTATCTTTTCTCCACCATCTTCCTCGCTCTCTACACTT
CTTTCTCCTCCTCCAAATGCCTCTTCCGATCCTCTCCCTTCGATCCCATCCAGTTCTCTCTCTTCTCTTATCCCTCCTCCTATGGCGAACACAAGTACGCCATTCCCACC
CTCCGCTCCTCCTGCTCCTCCCCTGTCTTCTTCTCAGATTATTGGATGGTTCTGAACCAGATCCAAGTAATGCAGTGGAATTCCTCTTGGCAATCCTCCAATTTGAGGTA
TCTCGCTGCTCATGCCGATAGTTTCGGCGGCAATTTCTCTGCCGAGAGCAGATTTTCTTATTTCGATTATCGAAATTATGATACTACTTCTGTTCCGGTTCCTTGTGGAT
TTCTCACAAAATTTCCTCTCACTGATTCTGATCGAATTGCTATGGAGAGTTGCAACGGCGTGGTTGTGGTTTCCGCGATTTTCAACGATCACGATAAAATTCGGCAACCG
AGAGGCCTCGGATCGAAGACTTTGGATAGCGTGTGCTTTTTCATGTTTGTCGATGAAACTACGGTAAGAGGACTCGAAAATCACAACGTAATTCCTAGAAGAAATTCATC
CCCGGATATAATTGGGGCTTGGAGAATTGTGAGAGTTTCAACCAAAAATCTGTACGATAATCCGGCCATGAATGGCGTAATACCTAAATATTTAGTTCACAGACTATTTC
CAAACGCTAAATTCAGTATATGGGTGGACGCCAAGCTTCAGTTAATGGTGGATCCGTTGTTGTTGATTCATTCGTTGATTGTGACTGAGAATGCAGATATGGCCATTTCT
AAACATCCTTATTATATTCATACAATGGAAGAAGCTATGGCAACTGCCAGATGGAAGAAATGGTGGGATGTTGATTCCTTGAAGAACCAAATGGAAACTTACTGTCAAAA
TGGGTTGAAACCATGGAGTCCCACTAAGCTTCCATATACCACAGATGTACCAGATAGTGCCTTAATATTGAGGAGACATGGAAGGGGAAGCAACCTATTCTCTTGCCTTT
TGTTCAACGAATTGGAAGCTTTCAACCCAAGGGATCAATTGGCTTTTGCATTTGTGAGAGACCATTTGACCCCATCAATTAAAATCAACATGTTTGACGAAGAAGTTTTC
GAGCAAGTTGCTTTGGAATATAGGCACAATCTCAAAAACAAAAGATATGGTGGGCCTGAAATGGGCCCCCAAATCTCCAAGCCCAAACCAACTAAAAGGGCCGGCCCTGA
TCTATTGTATGTCAATGGCAGCTGTTGCAGCAAGTGCCAAAAATATCTTCTCCAGATGTGGGGCGACATTTCCTGATTTTTACCTTTTTTGTCCTCTTCTTATGCCACGC
GTTGGTTCTTACCAGAAGGACAAAAAGGTCAAAGTAAATGCTTGACGGGTCAGCCACACCACCCCTTCTCTTTTCCTTTCTTCTACTTCTATTATGATCGATACGGCGCC
GTTTGG
Protein sequence
MGKAGWSTPLLFQSKLLCFSLFYLFSTIFLALYTSFSSSKCLFRSSPFDPIQFSLFSYPSSYGEHKYAIPTLRSSCSSPVFFSDYWMVLNQIQVMQWNSSWQSSNLRYLA
AHADSFGGNFSAESRFSYFDYRNYDTTSVPVPCGFLTKFPLTDSDRIAMESCNGVVVVSAIFNDHDKIRQPRGLGSKTLDSVCFFMFVDETTVRGLENHNVIPRRNSSPD
IIGAWRIVRVSTKNLYDNPAMNGVIPKYLVHRLFPNAKFSIWVDAKLQLMVDPLLLIHSLIVTENADMAISKHPYYIHTMEEAMATARWKKWWDVDSLKNQMETYCQNGL
KPWSPTKLPYTTDVPDSALILRRHGRGSNLFSCLLFNELEAFNPRDQLAFAFVRDHLTPSIKINMFDEEVFEQVALEYRHNLKNKRYGGPEMGPQISKPKPTKRAGPDLL
YVNGSCCSKCQKYLLQMWGDIS