CuGenDBv2

MS015027 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS015027
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Nucleotide-diphospho-sugar transferase family protein
Genome location: scaffold2:670610..673957
RNA-Seq Expression: MS015027
Synteny: MS015027
Gene Ontology terms: GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains: IPR005069 - Nucleotide-diphospho-sugar transferase
                  IPR044575 - Beta-arabinofuranosyltransferase RAY1-like
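
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold2:670610..673957")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene region spans 3348 positions on scaffold2.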


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573710.1 Beta-arabinofuranosyltransferase RAY1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | identity: 85.06%
Query:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
        AIEAGL SIWLSGLVLIALSLYATQSLPS KDRFV+PKLR    G R NP++SIFSAPR F G+IGVRQSLAIRSWLALSPQITV+LFSQD SV + ARS
Subjt:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS

Query:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
        +SSRVY+DS+IDFTFLGTPYFHSMMARSQSF SDI  FV PETILLPDFISTLNYA KLDRDWLLVASSRNISYIPFYFDES+RHFP ED+K TR+QKEL
Subjt:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL

Query:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
        LNEHWRWSHC GKELLAWNN +IPLHSGVLPPFLYGRGIHNNWVINEA+ASEFRFVFDASWTISSFYLQDPEQ SD           TRSWEY GNY LG
Subjt:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG

Query:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL
        SLYGSSFH +A  SSLVKLL CN QYIL NTTE+ TY PKN+R+LSLWNT+LLH G+K  KKP AC+HGFRS  RLHDCSLE R+S S TLE P+SLE L
Subjt:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL

Query:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
        LPLIADKNKTIVL VAGYSYKDMLMSW CRLR LQIPN++VCALD DTY FSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
Subjt:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY

Query:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL
        NVLLSDVD+YWF+NPLPFLY+FGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD+STIAAMEKVVKHAATSGQSEQPSFYDTLCG+ G NR GS KCL
Subjt:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL

Query:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        EPETNLTVHFLDRNLFPNGAY ELWKKKNIK ACRKKGC+VLHNNWISGRLKKLERQMFSGLWEYD ST+MCK +L
Subjt:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
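
In the alignment blocks on this page, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. This is the usual convention for BLAST-style pairwise output. The sketch below counts those marks for a midline fragment. The function names are hypothetical, and the midline is assumed to have had its leading padding removed so that it lines up with the sequence columns.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for a BLAST-style midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives are counted as identities plus '+' columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(midline: str) -> float:
    """Identities as a percentage of the aligned columns in this fragment."""
    if not midline:
        raise ValueError("empty midline")
    identities, _ = midline_stats(midline)
    return 100.0 * identities / len(midline)


# Last 13 columns of the midline in the block above.
fragment = "EYD ST+MCK +L"
print(midline_stats(fragment), round(percent_identity(fragment), 2))
```

The % identity reported in each hit header covers the whole alignment, so a value computed from a single fragment like this one will generally differ from it.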

XP_022151768.1 uncharacterized protein LOC111019673 [Momordica charantia] | e value: 0.0e+00 | identity: 99.55%
Query:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
        AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRS VVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
Subjt:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS

Query:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
        LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
Subjt:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL

Query:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
        LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
Subjt:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG

Query:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP
        SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRR+LSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP
Subjt:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP

Query:  LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV
        LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV
Subjt:  LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV

Query:  LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEP
        LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSN+CLEP
Subjt:  LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEP

Query:  ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
Subjt:  ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

XP_022945744.1 uncharacterized protein LOC111449891 [Cucurbita moschata] | e value: 0.0e+00 | identity: 85.21%
Query:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
        AIEAGL SIWLSGLVLIALSLYATQSLPS KDRFV+PKLR T  G R +P++SIFSAPR F G+IGVRQSLAIRSWLAL+PQITV+LFSQD SVV+ ARS
Subjt:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS

Query:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
        LSSRVYVDS+IDFTFLGTPYFHSMMARSQSF SDI  FV PETILLPDFISTLNYA KL+RDWLLVASSRNISYIPFYFDES+RHFP ED+K TR+QKEL
Subjt:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL

Query:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
        LNEHWRWSHC GKELLAWNN +IPLHSGVLPPFLYGRGIHNNWVINEA+ASEFRFVFDASWTISSFYLQDPEQ SD           TRSWEY GNY LG
Subjt:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG

Query:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL
        SLYGSSFH +A  SSLVKLL CN QYIL NTTE+ TY PKN+R+LSLWNT+LLH G+K  KKP AC+HGFRS  RLHDCSLE R+S S TLE P+SLE L
Subjt:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL

Query:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
        LPLIADKNKTIVL VAGYSYKDMLMSW CRLR LQIPN++VCALD DTY FSVLQGLPVYR PLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
Subjt:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY

Query:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL
        NVLLSDVD+YWFKNPLPFLY+FGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD STIAAMEKVVKHAATSGQSEQPSFYDTLCG+ G NR GS KCL
Subjt:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL

Query:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        EPETNLTVHFLDRNLFPNGAY ELWKKKNIK ACRKKGC+VLHNNWISGRLKKLERQMFSGLWEYD ST+MCK +L
Subjt:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

XP_022966691.1 uncharacterized protein LOC111466319 [Cucurbita maxima] | e value: 0.0e+00 | identity: 82.73%
Query:  RIEGKNEVFSVDIKVLLRTENDRPHMAGACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAP
        R EG NE  SV IKVLL +END PHMAG       +      AIEAGL SIWLSGLVLIALSLYATQSLPSFKDRFV+PKLR  V G R NP++SIFSAP
Subjt:  RIEGKNEVFSVDIKVLLRTENDRPHMAGACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAP

Query:  RPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANK
        R F+G+IGVRQSLAIRSWLALSPQITV+LFSQD SV + ARS+SSRVY+DS+IDFTFLGTPYFHSMMARSQSF SDI  FV PETILLPDFISTLNYA K
Subjt:  RPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANK

Query:  LDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFD
        LDRDWLLVASSRNISYIPFYFDES+RHFP E++K TR+QKELL+EHWRWSHC GKELLAWNN +IPLHSGVLPPFLYGRGIHNNWVINEA+ASEFRFVFD
Subjt:  LDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFD

Query:  ASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMG--
        ASWTISSFYLQDPEQ SD            R WEY GNY LGSLYGSSFH +A  SSLVKLL CN QYIL NTTE+ TY PKN+RMLSLWNT+LLH G  
Subjt:  ASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMG--

Query:  RKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLP
        +KKKP AC+HGFRS  RLHDCSLE  +S S TLELP+SLE LLPLIADKNKTIVL VAGYSYKDMLMSW CRLR LQIPN++VCALD DTY FS LQGLP
Subjt:  RKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLP

Query:  VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDS
        VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVD+YWFKNPLPFLY+FGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD+S
Subjt:  VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDS

Query:  TIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQM
        TIAAMEKVVKHAATSGQSEQPSFYDTLCG+ G NR GS KCLEPETNLTVHFLDRNLFPNGAY ELWKKKNIK ACRKKGC+VLHNNWISGRLKKLERQM
Subjt:  TIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQM

Query:  FSGLWEYDMSTKMCKRSL
        FSGLWEYD ST+MCK +L
Subjt:  FSGLWEYDMSTKMCKRSL

XP_038891724.1 uncharacterized protein LOC120081123 [Benincasa hispida] | e value: 0.0e+00 | identity: 80.41%
Query:  MVLPDFDIIVRNRIEGKNEVFSVDIKVLLRTENDRPHMAG--ACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVV
        MVL +FDI  +    GKNE FSVDIK+LLR++ND  HMAG  AC+V+D    +G  AIEAGLCSIWLSGL+LIALSLYATQ LPSFKDRFV+P LRS   
Subjt:  MVLPDFDIIVRNRIEGKNEVFSVDIKVLLRTENDRPHMAG--ACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVV

Query:  GVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETI
        G   NP++SIFSAPR F G+IGVRQSLAIRSWLALSPQI VILFSQD S+VSSA S SSRVY+DS+IDFTFLGTPYFHSMM RSQSF SDI  F+DPETI
Subjt:  GVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETI

Query:  LLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWV
        LLPDFISTLNYA KLDRDWLLVASSRNISYIPFYF+ES  +F MEDQ+ TR+ KELLNEHW+WSHC GKELLAWNNW+ PLHSGVLPPFLYGRGIHN+WV
Subjt:  LLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWV

Query:  INEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRM
        INEAMASEFRFVFDASWTISSF+LQDP+QSS+GR  H NS   TRSWEYFGNY LGSLYGSSFH +A  S+L+KL+ CNGQYILINTTE+T  QP     
Subjt:  INEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRM

Query:  LSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDH
                LH GRKKKP+ C+HGF+S  + HDCS+ N IS S TLELP+SLE+LLPLIADKNKTIVLA+AGYSYKDMLMSW CRLR LQIPN++VCALD 
Subjt:  LSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDH

Query:  DTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRL
        DTY+FSVLQGLPVYRDPL PTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWF NPLPFLY+FGSGVL AQSDEYKKTGPINLPRRL
Subjt:  DTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRL

Query:  NSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNW
        NSGFYFARSD+STIAAMEKVVKHAATSGQSEQPSFYDTLCG+ GINR+GSN+CLEPET+LTVHFLDRNLFPNGAY ELWKKKNIK  CRKKGC+VLHNNW
Subjt:  NSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNW

Query:  ISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        I+GRLKKLERQMFSGLWEYDMST+MCK +L
Subjt:  ISGRLKKLERQMFSGLWEYDMSTKMCKRSL

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DW58 uncharacterized protein LOC103489550 | e value: 0.0e+00 | identity: 80.92%
Query:  PHMAG--ACSVSD-GVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLA
        PHMAG  ACSV+D  V  SG FAIEAGLCSIWLSGL+LIALSLYATQ LPSFKD FV+P+L S  +G  LNP+ISIFSAPRPF G+IGVRQSLAIRSWLA
Subjt:  PHMAG--ACSVSD-GVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLA

Query:  LSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFY
        LSPQITVILFSQD S+VSSA S SSRVY+DS+IDFTFLGTPYFHSMM RSQSF SDI  FVDPETILLPDFISTLNYA KLDRDWLLVASSRNISYIPFY
Subjt:  LSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFY

Query:  FDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGR
        F+ES+ +F MED++FTRIQK LLNEHW+WS+C GKEL+AWN+ + PLH GVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISS YLQD EQ S+GR
Subjt:  FDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGR

Query:  YGHPNSDI-GTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDC
          H NS + GTRSWEYFGN+ LGSLYGSSFH +A  S+LVKLL CNG YILINTTE+T  Q                 GRKKKP  C HGFRS  +L +C
Subjt:  YGHPNSDI-GTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDC

Query:  SLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE
        S+ N IS S TLELP+SLE+LLPL+ADKNKTIVLA+AGYSYKDMLMSW CRLRRL+I N++VCALD DTYQFSVLQGLPVYRDPL PTNISFNDCHFGTE
Subjt:  SLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE

Query:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQP
        CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWF NPLPFLY FGSGVL AQSDEYKKTGPINLPRRLNSGFYFARSD+ TIAAMEKVVKHAATS QSEQP
Subjt:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQP

Query:  SFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        SFYDTLCG+ GINR+GSNKCLEPETNLT+HFLDRNLFPNGAY  LW KKNIK+ACRKKGC+VLHNNWISGRLKKLERQMFSGLW+YDMST+MC  +L
Subjt:  SFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

A0A5A7SYJ9 UDP-galactose:fucoside alpha-3-galactosyltransferase | e value: 0.0e+00 | identity: 80.92%
Query:  PHMAG--ACSVSD-GVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLA
        PHMAG  ACSV+D  V  SG FAIEAGLCSIWLSGL+LIALSLYATQ LPSFKD FV+P+L S  +G  LNP+ISIFSAPRPF G+IGVRQSLAIRSWLA
Subjt:  PHMAG--ACSVSD-GVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLA

Query:  LSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFY
        LSPQITVILFSQD S+VSSA S SSRVY+DS+IDFTFLGTPYFHSMM RSQSF SDI  FVDPETILLPDFISTLNYA KLDRDWLLVASSRNISYIPFY
Subjt:  LSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFY

Query:  FDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGR
        F+ES+ +F MED++FTRIQK LLNEHW+WS+C GKEL+AWN+ + PLH GVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISS YLQD EQ S+GR
Subjt:  FDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGR

Query:  YGHPNSDI-GTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDC
          H NS + GTRSWEYFGN+ LGSLYGSSFH +A  S+LVKLL CNG YILINTTE+T  Q                 GRKKKP  C HGFRS  +L +C
Subjt:  YGHPNSDI-GTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDC

Query:  SLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE
        S+ N IS S TLELP+SLE+LLPL+ADKNKTIVLA+AGYSYKDMLMSW CRLRRL+I N++VCALD DTYQFSVLQGLPVYRDPL PTNISFNDCHFGTE
Subjt:  SLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE

Query:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQP
        CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWF NPLPFLY FGSGVL AQSDEYKKTGPINLPRRLNSGFYFARSD+ TIAAMEKVVKHAATS QSEQP
Subjt:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQP

Query:  SFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        SFYDTLCG+ GINR+GSNKCLEPETNLT+HFLDRNLFPNGAY  LW KKNIK+ACRKKGC+VLHNNWISGRLKKLERQMFSGLW+YDMST+MC  +L
Subjt:  SFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

A0A6J1DD34 uncharacterized protein LOC111019673 | e value: 0.0e+00 | identity: 99.55%
Query:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
        AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRS VVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
Subjt:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS

Query:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
        LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
Subjt:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL

Query:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
        LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
Subjt:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG

Query:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP
        SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRR+LSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP
Subjt:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLP

Query:  LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV
        LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV
Subjt:  LIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNV

Query:  LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEP
        LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSN+CLEP
Subjt:  LLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEP

Query:  ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
Subjt:  ETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

A0A6J1G1U1 uncharacterized protein LOC111449891 | e value: 0.0e+00 | identity: 85.21%
Query:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS
        AIEAGL SIWLSGLVLIALSLYATQSLPS KDRFV+PKLR T  G R +P++SIFSAPR F G+IGVRQSLAIRSWLAL+PQITV+LFSQD SVV+ ARS
Subjt:  AIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAPRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARS

Query:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL
        LSSRVYVDS+IDFTFLGTPYFHSMMARSQSF SDI  FV PETILLPDFISTLNYA KL+RDWLLVASSRNISYIPFYFDES+RHFP ED+K TR+QKEL
Subjt:  LSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKEL

Query:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG
        LNEHWRWSHC GKELLAWNN +IPLHSGVLPPFLYGRGIHNNWVINEA+ASEFRFVFDASWTISSFYLQDPEQ SD           TRSWEY GNY LG
Subjt:  LNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLG

Query:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL
        SLYGSSFH +A  SSLVKLL CN QYIL NTTE+ TY PKN+R+LSLWNT+LLH G+K  KKP AC+HGFRS  RLHDCSLE R+S S TLE P+SLE L
Subjt:  SLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRK--KKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEIL

Query:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
        LPLIADKNKTIVL VAGYSYKDMLMSW CRLR LQIPN++VCALD DTY FSVLQGLPVYR PLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY
Subjt:  LPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGY

Query:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL
        NVLLSDVD+YWFKNPLPFLY+FGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD STIAAMEKVVKHAATSGQSEQPSFYDTLCG+ G NR GS KCL
Subjt:  NVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCL

Query:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL
        EPETNLTVHFLDRNLFPNGAY ELWKKKNIK ACRKKGC+VLHNNWISGRLKKLERQMFSGLWEYD ST+MCK +L
Subjt:  EPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL

A0A6J1HQ10 uncharacterized protein LOC111466319 | e value: 0.0e+00 | identity: 82.73%
Query:  RIEGKNEVFSVDIKVLLRTENDRPHMAGACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAP
        R EG NE  SV IKVLL +END PHMAG       +      AIEAGL SIWLSGLVLIALSLYATQSLPSFKDRFV+PKLR  V G R NP++SIFSAP
Subjt:  RIEGKNEVFSVDIKVLLRTENDRPHMAGACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFSAP

Query:  RPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANK
        R F+G+IGVRQSLAIRSWLALSPQITV+LFSQD SV + ARS+SSRVY+DS+IDFTFLGTPYFHSMMARSQSF SDI  FV PETILLPDFISTLNYA K
Subjt:  RPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANK

Query:  LDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFD
        LDRDWLLVASSRNISYIPFYFDES+RHFP E++K TR+QKELL+EHWRWSHC GKELLAWNN +IPLHSGVLPPFLYGRGIHNNWVINEA+ASEFRFVFD
Subjt:  LDRDWLLVASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFD

Query:  ASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMG--
        ASWTISSFYLQDPEQ SD            R WEY GNY LGSLYGSSFH +A  SSLVKLL CN QYIL NTTE+ TY PKN+RMLSLWNT+LLH G  
Subjt:  ASWTISSFYLQDPEQSSDGRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMG--

Query:  RKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLP
        +KKKP AC+HGFRS  RLHDCSLE  +S S TLELP+SLE LLPLIADKNKTIVL VAGYSYKDMLMSW CRLR LQIPN++VCALD DTY FS LQGLP
Subjt:  RKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLP

Query:  VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDS
        VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVD+YWFKNPLPFLY+FGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD+S
Subjt:  VYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDS

Query:  TIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQM
        TIAAMEKVVKHAATSGQSEQPSFYDTLCG+ G NR GS KCLEPETNLTVHFLDRNLFPNGAY ELWKKKNIK ACRKKGC+VLHNNWISGRLKKLERQM
Subjt:  TIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQM

Query:  FSGLWEYDMSTKMCKRSL
        FSGLWEYD ST+MCK +L
Subjt:  FSGLWEYDMSTKMCKRSL

SwissProt top hits (e value, % identity, alignment)
F4I6V0 Beta-arabinofuranosyltransferase RAY1 | e value: 3.2e-189 | identity: 60.22%
Query:  MMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQ----KFTRIQKELLNEHWRWSHCRGKELLAWN
        MM+R +++ SDI+V +DPET+LLPDFIS L+YA++L RDWLLV+SS  I   PF++DE+ RHF  +D     +F  +QK +     + +    K ++AWN
Subjt:  MMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQ----KFTRIQKELLNEHWRWSHCRGKELLAWN

Query:  NWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPN--SDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLV
        N ++PLH GVLPPFLY RG HN W+INEAM+ + RFVFDA+ TISSF+L + E      Y   +  S+  TR+WEY GN HLG LYGS +   +   +L 
Subjt:  NWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPN--SDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLV

Query:  KLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYS
        KLL CN +YI ++ +E +T        LS+   + L    ++K  AC    +S+    D   ++   P   L+ P+ LE LLPL+ADKN+T+VL+VAGYS
Subjt:  KLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYS

Query:  YKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL
        YKDMLMSW CRLRRL++PN +VCALD +TYQFS+LQGLPV+ DP AP NISFNDCHFG++CFQRVTKVKSR VL+ILKLGYNVLLSDVDVYWF+NPLP L
Subjt:  YKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL

Query:  YAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNG
         +FG  VLAAQSDEY  T PIN PRRLNSGFYFARSD  TIAAMEKVVKHAATSG SEQPSFYDTLCG+ G  R+G ++C+EPETNLTV FLDR LFPNG
Subjt:  YAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNG

Query:  AYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMC
        AY +LW K++++A C KK C+VLHNNWISGRLKKLERQM  GLWEYD S +MC
Subjt:  AYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMC

Q54RP0 UDP-galactose:fucoside alpha-3-galactosyltransferase | e value: 3.3e-13 | identity: 25.68%
Query:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQI--PNHIVCALDHDTYQ-FSVLQGLPVY---RDPLAPTNIS----FNDCH--------------FGTECFQ
        +N  IVL +  Y ++DM ++      +L I    +I+  +D   YQ F+  +G+      RD +  ++ S    F+D +              +G   F+
Subjt:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQI--PNHIVCALDHDTYQ-FSVLQGLPVY---RDPLAPTNIS----FNDCH--------------FGTECFQ

Query:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINL-----PRRLNSGFYFARSDDSTIAAMEKVVK--HAATSGQ
         +   K  +VL +LK GYNVL +D D+ W ++P    Y         Q +++     I+L        + +GFYF RS+  TI  ++  +   +     Q
Subjt:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINL-----PRRLNSGFYFARSDDSTIAAMEKVVK--HAATSGQ

Query:  SEQPSFYDTLCGDRGINRMGSNKCLEPETN-----LTVHFLDRNLFPNGA---YLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLW
             F  +    +GIN    N  L    N     +    LD+ LFPNG     L++ ++ NI         +++HNN I G   K +R +  GLW
Subjt:  SEQPSFYDTLCGDRGINRMGSNKCLEPETN-----LTVHFLDRNLFPNGA---YLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLW

Q8VXZ5 Arabinosyltransferase XEG113 | e value: 2.1e-12 | identity: 26.72%
Query:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSD
        K+  I++    Y++ D +++W   L  L + N +V A+D    +    +G+PV+      +++S  D  +G+  F ++ + K  ++  +L  GY +L+ D
Subjt:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSD

Query:  VDVYWFKNPLPFLYAFGSGVLAAQSDEYKKT
         D+ W KNP+P+L  F    +   SD+   T
Subjt:  VDVYWFKNPLPFLYAFGSGVLAAQSDEYKKT

Q9M146 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase MGP4 | e value: 6.0e-15 | identity: 24.57%
Query:  NRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQ
        N++SPS +    YSL   +  +A KN T+++    Y Y   L +W   + R +  + ++   +     + V +  P +   + P   S     FG++ F 
Subjt:  NRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQ

Query:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL------YAFGSGVLAAQSDEYKKTGPINLPRR--LNSGFYFARSDDSTIAAMEKVVKHAATSG
          T  + + +L IL+LGYNV+ +DVD+ W ++P  +L      Y           D      P     R  + S   F R  +     M+K ++   T  
Subjt:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL------YAFGSGVLAAQSDEYKKTGPINLPRR--LNSGFYFARSDDSTIAAMEKVVKHAATSG

Query:  QSEQPSFYDTLCGDRGINRMGSNKCLEPETN-LTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD
         S      D         + G N  L    N + ++ L +  FP G    L+ K        K    ++HNN+I G  KK++R     LW  D
Subjt:  QSEQPSFYDTLCGDRGINRMGSNKCLEPETN-LTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD

Q9ZSJ2 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 1 | e value: 9.6e-13 | identity: 24.5%
Query:  LENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE
        L + +SPS   +   Y+L      +A KN T+++      +   L +W   + R +  + ++   +     + V +  P +   + P   S     FG++
Subjt:  LENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE

Query:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVVK
         F   T  + + +L+IL+LGYNV+ +DVD+ W ++  PFLY  GS   A  +D+  +  P+N    LP    +G         + R  +     M+K  +
Subjt:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVVK

Query:  HAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD
           +   SE   F      D+    +  NK       + ++ L +  FP G    L+ K        K    ++HNN+I G  +K++R    GLW  D
Subjt:  HAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD

Arabidopsis top hits (e value, % identity, alignment)
AT1G70630.1 Nucleotide-diphospho-sugar transferase family protein | e value: 2.2e-190 | identity: 60.22%
Query:  MMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQ----KFTRIQKELLNEHWRWSHCRGKELLAWN
        MM+R +++ SDI+V +DPET+LLPDFIS L+YA++L RDWLLV+SS  I   PF++DE+ RHF  +D     +F  +QK +     + +    K ++AWN
Subjt:  MMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLVASSRNISYIPFYFDESRRHFPMEDQ----KFTRIQKELLNEHWRWSHCRGKELLAWN

Query:  NWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPN--SDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLV
        N ++PLH GVLPPFLY RG HN W+INEAM+ + RFVFDA+ TISSF+L + E      Y   +  S+  TR+WEY GN HLG LYGS +   +   +L 
Subjt:  NWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSDGRYGHPN--SDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLV

Query:  KLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYS
        KLL CN +YI ++ +E +T        LS+   + L    ++K  AC    +S+    D   ++   P   L+ P+ LE LLPL+ADKN+T+VL+VAGYS
Subjt:  KLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYS

Query:  YKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL
        YKDMLMSW CRLRRL++PN +VCALD +TYQFS+LQGLPV+ DP AP NISFNDCHFG++CFQRVTKVKSR VL+ILKLGYNVLLSDVDVYWF+NPLP L
Subjt:  YKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL

Query:  YAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNG
         +FG  VLAAQSDEY  T PIN PRRLNSGFYFARSD  TIAAMEKVVKHAATSG SEQPSFYDTLCG+ G  R+G ++C+EPETNLTV FLDR LFPNG
Subjt:  YAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNG

Query:  AYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMC
        AY +LW K++++A C KK C+VLHNNWISGRLKKLERQM  GLWEYD S +MC
Subjt:  AYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMC

AT2G35610.1 xyloglucanase 113 | e value: 1.5e-13 | identity: 26.72%
Query:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSD
        K+  I++    Y++ D +++W   L  L + N +V A+D    +    +G+PV+      +++S  D  +G+  F ++ + K  ++  +L  GY +L+ D
Subjt:  KNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSD

Query:  VDVYWFKNPLPFLYAFGSGVLAAQSDEYKKT
         D+ W KNP+P+L  F    +   SD+   T
Subjt:  VDVYWFKNPLPFLYAFGSGVLAAQSDEYKKT

AT4G01220.1 Nucleotide-diphospho-sugar transferase family protein (e value: 4.3e-16, %identity: 24.57)
Query:  NRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQ
        N++SPS +    YSL   +  +A KN T+++    Y Y   L +W   + R +  + ++   +     + V +  P +   + P   S     FG++ F 
Subjt:  NRISPSATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQ

Query:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL------YAFGSGVLAAQSDEYKKTGPINLPRR--LNSGFYFARSDDSTIAAMEKVVKHAATSG
          T  + + +L IL+LGYNV+ +DVD+ W ++P  +L      Y           D      P     R  + S   F R  +     M+K ++   T  
Subjt:  RVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFL------YAFGSGVLAAQSDEYKKTGPINLPRR--LNSGFYFARSDDSTIAAMEKVVKHAATSG

Query:  QSEQPSFYDTLCGDRGINRMGSNKCLEPETN-LTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD
         S      D         + G N  L    N + ++ L +  FP G    L+ K        K    ++HNN+I G  KK++R     LW  D
Subjt:  QSEQPSFYDTLCGDRGINRMGSNKCLEPETN-LTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD

AT4G01750.1 rhamnogalacturonan xylosyltransferase 2 (e value: 2.9e-12, %identity: 25.08)
Query:  SLENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGT
        S  + +SP A  E   Y+L      +A     IV AV+   +   L +W   + R +    ++   +     + V +  P +   + P   S     FG+
Subjt:  SLENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGT

Query:  ECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVV
        + F   T  + + +L+IL+LGYNV+ +DVD+ W ++  PF Y  GS   A  +D+  +  P+N    LP    +G         + R  +     M+K  
Subjt:  ECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVV

Query:  KHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYL---ELWKKKNIKAACRKKGCYVL-HNNWISGRLKKLERQMFSGLW
        +   +   SE   F      D+    +  NK       + ++ L +  FP G        W K+        KG +V+ HNN+I G  +K+ R    GLW
Subjt:  KHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYL---ELWKKKNIKAACRKKGCYVL-HNNWISGRLKKLERQMFSGLW

Query:  EYD
          D
Subjt:  EYD

AT4G01770.1 rhamnogalacturonan xylosyltransferase 1 (e value: 6.9e-14, %identity: 24.5)
Query:  LENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE
        L + +SPS   +   Y+L      +A KN T+++      +   L +W   + R +  + ++   +     + V +  P +   + P   S     FG++
Subjt:  LENRISPSATLE-LPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTE

Query:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVVK
         F   T  + + +L+IL+LGYNV+ +DVD+ W ++  PFLY  GS   A  +D+  +  P+N    LP    +G         + R  +     M+K  +
Subjt:  CFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPIN----LPRRLNSG-------FYFARSDDSTIAAMEKVVK

Query:  HAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD
           +   SE   F      D+    +  NK       + ++ L +  FP G    L+ K        K    ++HNN+I G  +K++R    GLW  D
Subjt:  HAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTVHFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYD


Sequences
CDS sequence
ATGGTGCTACCGGATTTCGATATCATAGTTAGAAATAGAATTGAAGGAAAAAACGAAGTATTCTCAGTGGACATTAAAGTATTATTGCGTACAGAAAACGACAGACCTCA
CATGGCCGGTGCGTGCTCCGTTTCTGATGGCGTTAATGGAAGTGGTGGGTTTGCGATCGAAGCGGGACTTTGTTCAATCTGGCTCTCCGGATTGGTTTTGATTGCTCTCT
CGCTATATGCTACTCAAAGTTTGCCTTCCTTCAAGGATCGTTTCGTGAGGCCCAAGCTTCGCTCCACAGTTGTCGGCGTTCGACTCAATCCTACAATTTCCATATTCTCT
GCGCCTCGCCCTTTCGCTGGTAGTATTGGAGTTCGGCAGAGCTTAGCTATTCGGTCGTGGCTTGCTCTGTCCCCACAAATTACAGTCATTCTGTTTAGCCAAGACTCTTC
CGTTGTATCATCTGCACGTTCTCTCAGTTCGCGAGTTTATGTTGATAGCAACATTGATTTTACGTTCCTGGGAACCCCATATTTCCATTCAATGATGGCAAGATCTCAGT
CGTTCACATCAGACATCTCTGTTTTCGTTGATCCTGAAACTATTCTTCTGCCTGATTTTATTTCTACTCTGAATTATGCTAACAAACTTGACCGTGATTGGCTCCTGGTT
GCTTCATCGAGAAATATTTCATACATACCATTCTACTTTGACGAGTCCAGGAGGCATTTCCCAATGGAGGATCAAAAATTTACAAGAATCCAGAAGGAGTTGCTCAATGA
GCATTGGCGATGGAGTCACTGCAGAGGGAAAGAACTATTGGCGTGGAACAACTGGGAAATTCCGTTGCACAGTGGAGTTCTTCCTCCCTTCTTATATGGAAGAGGGATTC
ATAACAATTGGGTCATAAATGAAGCTATGGCATCTGAATTCAGGTTTGTGTTTGATGCCAGTTGGACCATCAGTAGTTTCTATCTTCAAGATCCTGAGCAGTCATCCGAT
GGAAGATATGGACATCCAAATTCTGATATTGGAACAAGAAGCTGGGAATATTTTGGCAACTACCATCTTGGTTCACTATATGGGTCTTCATTTCATGACGAAGCCAATCT
TTCAAGTCTGGTGAAACTTCTCAACTGTAATGGGCAATATATTCTGATCAACACCACAGAAGACACAACATATCAACCGAAGAACCGGAGAATGCTAAGTTTATGGAACA
CACGATTGTTGCATATGGGGAGGAAGAAGAAACCTGTGGCTTGTAACCATGGTTTTCGATCACAGGGGAGACTACATGATTGCTCATTGGAAAATAGGATATCGCCTTCA
GCAACTTTAGAGCTTCCATATTCCTTAGAGATCCTTCTTCCCCTAATTGCAGATAAGAATAAGACAATTGTGCTAGCAGTTGCAGGATATAGTTACAAAGACATGTTAAT
GAGCTGGGCATGCAGATTGCGCCGCCTCCAGATCCCGAACCATATAGTTTGCGCTCTTGATCACGATACATATCAGTTCTCTGTCCTGCAGGGCCTGCCGGTCTACAGGG
ATCCATTGGCTCCGACCAATATTAGCTTCAATGACTGTCACTTTGGAACAGAGTGCTTCCAGAGGGTGACAAAAGTGAAGTCCAGAATGGTTTTGAGGATATTAAAGCTG
GGTTACAACGTACTTCTTAGTGACGTTGATGTATATTGGTTTAAAAATCCTCTTCCTTTTCTTTACGCTTTTGGTTCTGGTGTTCTTGCAGCACAATCTGATGAATACAA
GAAGACAGGACCAATAAATTTACCCAGACGCTTGAACTCTGGGTTCTATTTTGCTCGTTCTGATGACTCAACAATAGCTGCCATGGAGAAAGTGGTGAAGCATGCAGCAA
CTTCGGGACAGTCGGAGCAGCCAAGCTTCTATGATACCCTTTGCGGGGACAGAGGTATTAATCGCATGGGTAGTAATAAATGCTTGGAACCTGAAACAAATTTAACTGTT
CATTTCTTGGATAGAAACCTCTTTCCTAACGGTGCATACTTAGAACTTTGGAAAAAGAAAAATATAAAAGCAGCCTGTAGGAAGAAGGGCTGTTATGTTCTCCACAACAA
CTGGATTAGTGGAAGACTAAAGAAACTCGAACGTCAGATGTTTTCAGGCCTTTGGGAATATGACATGAGCACAAAGATGTGCAAGCGCAGCTTG
mRNA sequence
ATGGTGCTACCGGATTTCGATATCATAGTTAGAAATAGAATTGAAGGAAAAAACGAAGTATTCTCAGTGGACATTAAAGTATTATTGCGTACAGAAAACGACAGACCTCA
CATGGCCGGTGCGTGCTCCGTTTCTGATGGCGTTAATGGAAGTGGTGGGTTTGCGATCGAAGCGGGACTTTGTTCAATCTGGCTCTCCGGATTGGTTTTGATTGCTCTCT
CGCTATATGCTACTCAAAGTTTGCCTTCCTTCAAGGATCGTTTCGTGAGGCCCAAGCTTCGCTCCACAGTTGTCGGCGTTCGACTCAATCCTACAATTTCCATATTCTCT
GCGCCTCGCCCTTTCGCTGGTAGTATTGGAGTTCGGCAGAGCTTAGCTATTCGGTCGTGGCTTGCTCTGTCCCCACAAATTACAGTCATTCTGTTTAGCCAAGACTCTTC
CGTTGTATCATCTGCACGTTCTCTCAGTTCGCGAGTTTATGTTGATAGCAACATTGATTTTACGTTCCTGGGAACCCCATATTTCCATTCAATGATGGCAAGATCTCAGT
CGTTCACATCAGACATCTCTGTTTTCGTTGATCCTGAAACTATTCTTCTGCCTGATTTTATTTCTACTCTGAATTATGCTAACAAACTTGACCGTGATTGGCTCCTGGTT
GCTTCATCGAGAAATATTTCATACATACCATTCTACTTTGACGAGTCCAGGAGGCATTTCCCAATGGAGGATCAAAAATTTACAAGAATCCAGAAGGAGTTGCTCAATGA
GCATTGGCGATGGAGTCACTGCAGAGGGAAAGAACTATTGGCGTGGAACAACTGGGAAATTCCGTTGCACAGTGGAGTTCTTCCTCCCTTCTTATATGGAAGAGGGATTC
ATAACAATTGGGTCATAAATGAAGCTATGGCATCTGAATTCAGGTTTGTGTTTGATGCCAGTTGGACCATCAGTAGTTTCTATCTTCAAGATCCTGAGCAGTCATCCGAT
GGAAGATATGGACATCCAAATTCTGATATTGGAACAAGAAGCTGGGAATATTTTGGCAACTACCATCTTGGTTCACTATATGGGTCTTCATTTCATGACGAAGCCAATCT
TTCAAGTCTGGTGAAACTTCTCAACTGTAATGGGCAATATATTCTGATCAACACCACAGAAGACACAACATATCAACCGAAGAACCGGAGAATGCTAAGTTTATGGAACA
CACGATTGTTGCATATGGGGAGGAAGAAGAAACCTGTGGCTTGTAACCATGGTTTTCGATCACAGGGGAGACTACATGATTGCTCATTGGAAAATAGGATATCGCCTTCA
GCAACTTTAGAGCTTCCATATTCCTTAGAGATCCTTCTTCCCCTAATTGCAGATAAGAATAAGACAATTGTGCTAGCAGTTGCAGGATATAGTTACAAAGACATGTTAAT
GAGCTGGGCATGCAGATTGCGCCGCCTCCAGATCCCGAACCATATAGTTTGCGCTCTTGATCACGATACATATCAGTTCTCTGTCCTGCAGGGCCTGCCGGTCTACAGGG
ATCCATTGGCTCCGACCAATATTAGCTTCAATGACTGTCACTTTGGAACAGAGTGCTTCCAGAGGGTGACAAAAGTGAAGTCCAGAATGGTTTTGAGGATATTAAAGCTG
GGTTACAACGTACTTCTTAGTGACGTTGATGTATATTGGTTTAAAAATCCTCTTCCTTTTCTTTACGCTTTTGGTTCTGGTGTTCTTGCAGCACAATCTGATGAATACAA
GAAGACAGGACCAATAAATTTACCCAGACGCTTGAACTCTGGGTTCTATTTTGCTCGTTCTGATGACTCAACAATAGCTGCCATGGAGAAAGTGGTGAAGCATGCAGCAA
CTTCGGGACAGTCGGAGCAGCCAAGCTTCTATGATACCCTTTGCGGGGACAGAGGTATTAATCGCATGGGTAGTAATAAATGCTTGGAACCTGAAACAAATTTAACTGTT
CATTTCTTGGATAGAAACCTCTTTCCTAACGGTGCATACTTAGAACTTTGGAAAAAGAAAAATATAAAAGCAGCCTGTAGGAAGAAGGGCTGTTATGTTCTCCACAACAA
CTGGATTAGTGGAAGACTAAAGAAACTCGAACGTCAGATGTTTTCAGGCCTTTGGGAATATGACATGAGCACAAAGATGTGCAAGCGCAGCTTG
Protein sequence
MVLPDFDIIVRNRIEGKNEVFSVDIKVLLRTENDRPHMAGACSVSDGVNGSGGFAIEAGLCSIWLSGLVLIALSLYATQSLPSFKDRFVRPKLRSTVVGVRLNPTISIFS
APRPFAGSIGVRQSLAIRSWLALSPQITVILFSQDSSVVSSARSLSSRVYVDSNIDFTFLGTPYFHSMMARSQSFTSDISVFVDPETILLPDFISTLNYANKLDRDWLLV
ASSRNISYIPFYFDESRRHFPMEDQKFTRIQKELLNEHWRWSHCRGKELLAWNNWEIPLHSGVLPPFLYGRGIHNNWVINEAMASEFRFVFDASWTISSFYLQDPEQSSD
GRYGHPNSDIGTRSWEYFGNYHLGSLYGSSFHDEANLSSLVKLLNCNGQYILINTTEDTTYQPKNRRMLSLWNTRLLHMGRKKKPVACNHGFRSQGRLHDCSLENRISPS
ATLELPYSLEILLPLIADKNKTIVLAVAGYSYKDMLMSWACRLRRLQIPNHIVCALDHDTYQFSVLQGLPVYRDPLAPTNISFNDCHFGTECFQRVTKVKSRMVLRILKL
GYNVLLSDVDVYWFKNPLPFLYAFGSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDDSTIAAMEKVVKHAATSGQSEQPSFYDTLCGDRGINRMGSNKCLEPETNLTV
HFLDRNLFPNGAYLELWKKKNIKAACRKKGCYVLHNNWISGRLKKLERQMFSGLWEYDMSTKMCKRSL