; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G007010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G007010
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGlycosyltransferases
Genome locationCmo_Chr07:3162479..3168633
RNA-Seq ExpressionCmoCh07G007010
SyntenyCmoCh07G007010
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009834 - plant-type secondary cell wall biogenesis (biological process)
GO:0010417 - glucuronoxylan biosynthetic process (biological process)
GO:0071555 - cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0042285 - xylosyltransferase activity (molecular function)
GO:0015018 - galactosylgalactosylxylosylprotein 3-beta-glucuronosyltransferase activity (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR039417 - Papain-like cysteine endopeptidase
IPR038765 - Papain-like cysteine peptidase superfamily
IPR029044 - Nucleotide-diphospho-sugar transferases
IPR025661 - Cysteine peptidase, asparagine active site
IPR025660 - Cysteine peptidase, histidine active site
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR005027 - Glycosyl transferase, family 43
IPR000668 - Peptidase C1A, papain C-terminal
IPR000169 - Cysteine peptidase, cysteine active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8531663.1 hypothetical protein F0562_006620 [Nyssa sinensis]0.0e+0064.57Show/hide
Query:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFST-SNPTTLYLT-PFRSATNLNIHSTLPNPIVN
        MKLS LQQ+YA RR+NSFRG + LDSS D  +KS A + W++LHG CCLISLVLGFRFSR+VFFL FST S  T LY T PF +A ++     L   I +
Subjt:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFST-SNPTTLYLT-PFRSATNLNIHSTLPNPIVN

Query:  KTT------PAT-ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWI
         T+      PA   + S+SRVVVGRHGI IRPWPHP+P+EVMKAH+IIE VQREQ  Q+GVKNPR +I ITPTYVRT Q LH+TGVMHSLM VPY L WI
Subjt:  KTT------PAT-ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWI

Query:  VVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQD
        VVEAGG TNETA +LA+ G+ TIH+GF  +M NSWEGRHR+EA MR  A+R+V +  LDG ++F DDSNM+SME FDEIQNV+W GA+SVGI+  S   D
Subjt:  VVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQD

Query:  ELS-----EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLF
        E S     E+ E + +P QGPACNSS+KLVGWHTFN+ PY  KSA +IGD+  VLPRKLEW+GFVLNSRLLWK+A+DKPEWV D DT+    E  E+PL 
Subjt:  ELS-----EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLF

Query:  LLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        LLKD S+VEPLG CGR+VL+WWLRVEAR DSKFP GW+I+PPLE+TVPAKRTPWPD PPELP  EK VI+ Q  T K   KT S RS+RSSR+       
Subjt:  LLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

Query:  NSESSIFPVLQATFAMALATTFLAFLSFFVLSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLT
        N  S                                  R++ EV  +Y+ WLA+HGKAYNG+ E+EKRF IFKDNL F+DEHNS+N TY VGLN FADLT
Subjt:  NSESSIFPVLQATFAMALATTFLAFLSFFVLSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLT

Query:  NEEYRATFLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGC
        NEEYR+ +LGTR+   RR +K+K AS+RYA    D+LPESVDWR +GAVAPIK+QG+CGSCWAFST+AAVEGIN+I T ELISLSEQELV CD  YN+GC
Subjt:  NEEYRATFLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGC

Query:  NGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVV
         GGLMDYAF+FII NGG+DTE+DYPY G+D +C  +R+NAKVVSIDGYEDVP NDE+ALKKAVA+QPVSVAIEA+G ALQLYQSG+F+  CG+ALDHGV 
Subjt:  NGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVV

Query:  AVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN
         VGYGTENG+DYW+V+NSWGT WGE+GY ++ERNV  T  GKCGIAM+ASYP+KN  N
Subjt:  AVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN

KAG6595019.1 putative beta-1,4-xylosyltransferase IRX14H, partial [Cucurbita argyrosperma subsp. sororia]5.8e-27298.15Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT
        MKLSALQQTYAARRANS+RGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRS TNLNIHST LPNPIVNKT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        TPA+ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA
        ETASILAQPG+ETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQ+VKWFGALSVGIIVLSDKQDELSEEVEKA
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA

Query:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC
        SIPAQGPACNSSNKLVGWHTFNALPYAGKSAK IGDKISVLPRKLEWSGFVLNS+LLWKDAEDKPEWVNDFDTLDVGDEA ESPLFLLKDASMVEPLGSC
Subjt:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC

Query:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
Subjt:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

KAG7027047.1 putative beta-1,4-xylosyltransferase IRX14H [Cucurbita argyrosperma subsp. argyrosperma]1.5e-27298.15Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT
        MKLSALQQTYAARRANS+RGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRS TNLNIHST LPNPIVNKT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        TPA+ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA
        ETASILAQPG+ETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQ+VKWFGALSVGIIVLSDKQDELSEEVEKA
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA

Query:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC
        SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISV+PRKLEWSGFVLNS+LLWKDAEDKPEWVNDFDTLDVGDEA ESPLFLLKDASMVEPLGSC
Subjt:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC

Query:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
Subjt:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

XP_022963267.1 probable beta-1,4-xylosyltransferase IRX14 [Cucurbita moschata]1.6e-277100Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT
        MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT

Query:  PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE
        PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE
Subjt:  PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE

Query:  TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS
        TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS
Subjt:  TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS

Query:  IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG
        IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG
Subjt:  IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG

Query:  RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
Subjt:  RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

XP_023518141.1 probable beta-1,4-xylosyltransferase IRX14 [Cucurbita pepo subsp. pepo]2.0e-27298.35Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT
        MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRS TNLNIHST LPNPIVNKT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA
        ETASILAQPG+ETIHVGFNQRMPNSWE RHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEK 
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA

Query:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC
        SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNS+LLWKDAEDKPEWVNDFDTLDVGDEA ESPLFLLKDASMVEPLGSC
Subjt:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC

Query:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        GRQVLVWWLRVEARFDSKFPHGWLIEPPLEI VPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
Subjt:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

TrEMBL top hitse value%identityAlignment
A0A0A0KJK9 Glycosyltransferases3.6e-24387.6Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPI----
        MKLSALQQTYAARRANSFRGS LDSSADSPIKS AGI WLILHG CCLISLVLGFRFSRLVFFLFFSTS  T LYLTPFRSAT+LN+HST L NP     
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPI----

Query:  --VNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE
          VNKTT  TI+ SSSRVVVGRHGIRIRPWPHPNP EVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRT QALHMTGVMHSLMLVPY LVWIVVE
Subjt:  --VNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE

Query:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS
        AGGITNETAS+LA+ G+ETIHVGFNQRMP SWEGRHR+EA MRLHALRIVSKMMLDG V FVDDSNM+SMEFFDEIQNVKWFGALSVGIIV SDKQDE S
Subjt:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS

Query:  EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMV
        +E+E   IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDK SVLPRKLEW GFVLNS+LLWKDAEDKPEWVN+FDTL+VGD+A ESPLFLLKDASMV
Subjt:  EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMV

Query:  EPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        EPLGSCGRQVL+WWLRVEARFDSKFPHGWLI+PPLEITVPAKRTPWPDVPPELPT+EKA     EET+K PAK+HSSRSRRSSRSKRKR EP
Subjt:  EPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

A0A5D3CNX8 Glycosyltransferases3.6e-24387.6Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPI----
        MKLSALQQTYAARRANSFRGS LDSSADSPIKS AGI WLILHG CCLISLVLGFRFSRLVFFLFFSTS  T LYLTPFRSAT+LN+HST L NP     
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPI----

Query:  --VNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE
          VNKTT  TI+ SSSRVVVGRHGIRIRPWPHPNP EVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRT QALHMTGVMH+LMLVPY LVWIVVE
Subjt:  --VNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE

Query:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS
        AGGITNETAS+LA+ G+ETIHVGFNQRMP SWEGRHR+EA MRLHALRIVSKMMLDG VIFVDDSNM+SMEFFDEIQNVKWFGALSVGIIV SDKQDE S
Subjt:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS

Query:  EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMV
        EE+E   IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDK SVLPRKLEW GFVLNS+LLWKDAEDKPEWVN+FDTL+VGD+A ESPLFLLKD SMV
Subjt:  EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMV

Query:  EPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        EPLGSCGRQVL+WWLRVEARFDSKFPHGWLI+PPLEITVPAKRTPWPDVPPELPT+EKA     EET+K PAK+HSSRSRRSSRSKRKR EP
Subjt:  EPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

A0A5J5AKZ2 Glycosyltransferases0.0e+0064.57Show/hide
Query:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFST-SNPTTLYLT-PFRSATNLNIHSTLPNPIVN
        MKLS LQQ+YA RR+NSFRG + LDSS D  +KS A + W++LHG CCLISLVLGFRFSR+VFFL FST S  T LY T PF +A ++     L   I +
Subjt:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFST-SNPTTLYLT-PFRSATNLNIHSTLPNPIVN

Query:  KTT------PAT-ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWI
         T+      PA   + S+SRVVVGRHGI IRPWPHP+P+EVMKAH+IIE VQREQ  Q+GVKNPR +I ITPTYVRT Q LH+TGVMHSLM VPY L WI
Subjt:  KTT------PAT-ISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWI

Query:  VVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQD
        VVEAGG TNETA +LA+ G+ TIH+GF  +M NSWEGRHR+EA MR  A+R+V +  LDG ++F DDSNM+SME FDEIQNV+W GA+SVGI+  S   D
Subjt:  VVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQD

Query:  ELS-----EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLF
        E S     E+ E + +P QGPACNSS+KLVGWHTFN+ PY  KSA +IGD+  VLPRKLEW+GFVLNSRLLWK+A+DKPEWV D DT+    E  E+PL 
Subjt:  ELS-----EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLF

Query:  LLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        LLKD S+VEPLG CGR+VL+WWLRVEAR DSKFP GW+I+PPLE+TVPAKRTPWPD PPELP  EK VI+ Q  T K   KT S RS+RSSR+       
Subjt:  LLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

Query:  NSESSIFPVLQATFAMALATTFLAFLSFFVLSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLT
        N  S                                  R++ EV  +Y+ WLA+HGKAYNG+ E+EKRF IFKDNL F+DEHNS+N TY VGLN FADLT
Subjt:  NSESSIFPVLQATFAMALATTFLAFLSFFVLSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLT

Query:  NEEYRATFLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGC
        NEEYR+ +LGTR+   RR +K+K AS+RYA    D+LPESVDWR +GAVAPIK+QG+CGSCWAFST+AAVEGIN+I T ELISLSEQELV CD  YN+GC
Subjt:  NEEYRATFLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGC

Query:  NGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVV
         GGLMDYAF+FII NGG+DTE+DYPY G+D +C  +R+NAKVVSIDGYEDVP NDE+ALKKAVA+QPVSVAIEA+G ALQLYQSG+F+  CG+ALDHGV 
Subjt:  NGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVV

Query:  AVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN
         VGYGTENG+DYW+V+NSWGT WGE+GY ++ERNV  T  GKCGIAM+ASYP+KN  N
Subjt:  AVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN

A0A6J1HET8 Glycosyltransferases7.6e-278100Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT
        MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTT

Query:  PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE
        PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE
Subjt:  PATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNE

Query:  TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS
        TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS
Subjt:  TASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKAS

Query:  IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG
        IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG
Subjt:  IPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCG

Query:  RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
Subjt:  RQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

A0A6J1KQJ7 Glycosyltransferases2.6e-27097.12Show/hide
Query:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT
        MKLSALQQTYAARRAN FRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPT+LYLTPFRSATNLNIHST LPNPIVNKT
Subjt:  MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHST-LPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA
        ETASILAQPG+ETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSN+YSMEFFDEIQ+VKWFGALSVGIIV SDKQDELSEEVEKA
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKA

Query:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC
        SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNS+LLWKDAEDKPEWVNDFDTLDVGDEA ESPLFLLKDA MVEPLGSC
Subjt:  SIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSC

Query:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP
        GRQVLVWWLRVEARFDSKFPHGWLI PPLEITVPAKRTPWPD+PPELPTDEKAVI+KQEETSKHPAKTHSSRSRRSSRSKRKR+EP
Subjt:  GRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEP

SwissProt top hitse value%identityAlignment
P25776 Oryzain alpha chain5.0e-12563.25Show/hide
Query:  AMALATTFLAFLSFFV---LSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQN----RTYTVGLNMFADLTNEEYRAT
        +MALA   L  L       +SI +  +R++ E R +Y  W A+HGK+YN + E E+R+  F+DNL ++DEHN+       ++ +GLN FADLTNEEYR T
Subjt:  AMALATTFLAFLSFFV---LSISALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFKDNLNFLDEHNSQN----RTYTVGLNMFADLTNEEYRAT

Query:  FLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDY
        +LG R+ P R     +  S RY   D++ LPESVDWRTKGAVA IK+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV CDT YN GCNGGLMDY
Subjt:  FLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDY

Query:  AFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTE
        AF FII+NGG+DTE+DYPY+G D +CD  R+NAKVV+ID YEDV  N E +L+KAVA+QPVSVAIEA G A QLY SG+FTGKCG+ALDHGV AVGYGTE
Subjt:  AFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTE

Query:  NGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN
        NG DYW+VRNSWG  WGE GY ++ERN+K  + GKCGIA+E SYP+K G N
Subjt:  NGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNN

Q653F4 Probable beta-1,4-xylosyltransferase IRX141.0e-13849.71Show/hide
Query:  SALQQTYAARRANSFRGSSLDSSADSPIKSSAG---------ILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNP
        S L+++ AA  A    G    S       S  G           W +LH  CCL+SL LGFRFSRL+FFL FST   T LY +   S+++  + +T    
Subjt:  SALQQTYAARRANSFRGSSLDSSADSPIKSSAG---------ILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNP

Query:  IVNKTTPATIST------------------------------SSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTY
            TT  T +T                              + S VVVGRHGIRIRPWPHP+P+EVM+AH+I+E VQ EQRR +GVK PR ++ +TPTY
Subjt:  IVNKTTPATIST------------------------------SSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTY

Query:  VRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSME
         R  QALH+TG++HSL  VPYPL WIVVEAGG TN TAS+LA+  +  +H+ F  RMP+ W  RH  E  MRLHALR++ +  +DG ++F DDSN++S+E
Subjt:  VRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSME

Query:  FFDEIQNVKWFGALSVGIIVLSDKQDE--LSEE-VEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAE
         FDE+Q V+W GA+SVGI+  +   D+  LSEE  +   +P QGPACNSS  L GWHTFN+LP+AGK+A  +G+   VLPR LEW+GFVLNSR+LWK+AE
Subjt:  FFDEIQNVKWFGALSVGIIVLSDKQDE--LSEE-VEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAE

Query:  DKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETS
         KP+WV D D +    E  E+PL LL D S VEPLG+CG+++L+WWLRVEAR DSKFP GW+IEPPL+I VPAKRTPWP+   EL  +   V  KQ++  
Subjt:  DKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETS

Query:  KHPAKT-HSSRSRRSSRSK
        +  ++T  SSRSR +++ K
Subjt:  KHPAKT-HSSRSRRSSRSK

Q8L707 Beta-1,4-xylosyltransferase IRX142.7e-17161.69Show/hide
Query:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFR---SATNLNIHS---TLPN
        MKLSAL Q+Y  RR+NSFR  +SLDSS D   KS   + WLILH  CCLISLVLGFRFSRLVFF  FSTS+ T LY  PFR      +L++H+   TL +
Subjt:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFR---SATNLNIHS---TLPN

Query:  PIVNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE
        P  N TT    +T SSRVVVGRHGIRIRPWPHPNP+EVMKAHQII  VQ+EQ+  FG+K+ + +IA+TPTYVRT QALH+TGVMHSLMLVPY LVWIVVE
Subjt:  PIVNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE

Query:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDEL-
        AGG TNET  I+A+ G+ TIHVG +QRMPN+WE R ++E  MRL ALR+V +  LDG V+F DDSNM+SME FDEIQNVKWFG +SVGI+  S   +E+ 
Subjt:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDEL-

Query:  -----------SEEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFE
                    EE E +S+P QGPACNS+++L+GWH FN LPYAGKSA +I D  +VLP+KLEWSGFVLNSRLLW++AE+KPEWV DF +L+  +E  E
Subjt:  -----------SEEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFE

Query:  SPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEK-----------AVIIKQEETSKHPAKTHSS
        SPL LLKD SMVEPLGSCGRQVL+WWLRVEAR DSKFP GW+I+PPLEITV AKRTPWPDVPPE PT +K            VI KQ++   HP K    
Subjt:  SPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEK-----------AVIIKQEETSKHPAKTHSS

Query:  RSRRSSRSKRKRQEPNSESSIF
        + R+S +SK + +  ++ + ++
Subjt:  RSRRSSRSKRKRQEPNSESSIF

Q9FH90 Probable beta-1,4-xylosyltransferase IRX14H3.9e-16260.36Show/hide
Query:  MKLSALQQTYAARRANSFRGS-SLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKT
        MKLS  + +Y  RR +SFR S SLD S D   KS + + W ++HG CCLISL+LGFRFS LV F  FSTS  T LY TPF  A N  +   L    +   
Subjt:  MKLSALQQTYAARRANSFRGS-SLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        T +T+   +SRVVVGRHGIRIRPWPHPNPIEV++AHQ++  VQ+EQ+  +GV++PR +I +TPTYVRT QALH+TGVMHSLMLVPY LVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS------
        ETAS +A+ G++TIH+GF+Q+MPN+WE RH++E  MRLHALR+V +  LDG V+F DDSNM+SME FDEIQ VKWFGALSVGI+  S   DELS      
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS------

Query:  --EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDA-EDKPEWVNDFDTLDVGDEAFESPLFLLKDA
          +  EK S+P QGP+CNSS KLVGWH FN  PYA K+A +I +K  V+P K+EWSGFVLNSRLLWK++ +DKP WV D   LD G    ESPL L+KD 
Subjt:  --EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDA-EDKPEWVNDFDTLDVGDEAFESPLFLLKDA

Query:  SMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEPNS
        SMVEPLGSCGR+VL+WWLRVEAR DSKFP GW+I+ PLEITVP+KRTPWPD   ELP    A  IK+       AK++S      S+S +++QEP +
Subjt:  SMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEPNS

Q9FMH8 Probable cysteine protease RD21B1.9e-12466.06Show/hide
Query:  ISALNQRTDGEVREIYDIWLAKHGKA---YNGI-EEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRR
        I+    R+D EV  IY+ W+ +HGK     NG+  E+++RF IFKDNL F+DEHN++N +Y +GL  FADLTNEEYR+ +LG +  P +RV+K    S R
Subjt:  ISALNQRTDGEVREIYDIWLAKHGKA---YNGI-EEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRR

Query:  YAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG
        Y     D LP+SVDWR +GAVA +K+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV CDT YN GCNGGLMDYAF+FII NGG+DTE DYPY+ 
Subjt:  YAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG

Query:  LDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGY
         DG+CD  R+NAKVV+ID YEDVP N E +LKKA+AHQP+SVAIEA G A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY
Subjt:  LDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGY

Query:  FKLERNVKHTTDGKCGIAMEASYPVKNGNN
         K+ RN++  T GKCGIAMEASYP+K G N
Subjt:  FKLERNVKHTTDGKCGIAMEASYPVKNGNN

Arabidopsis top hitse value%identityAlignment
AT1G47128.1 Granulin repeat cysteine protease family protein5.1e-12564.02Show/hide
Query:  ISALNQRTDGEVREIYDIWLAKHGKA--YNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYA
        +S    R++ EV  IY+ WL KHGKA   N + E+++RF IFKDNL F+DEHN +N +Y +GL  FADLTN+EYR+ +LG +          +  S RY 
Subjt:  ISALNQRTDGEVREIYDIWLAKHGKA--YNGIEEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYA

Query:  VNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLD
            D LPES+DWR KGAVA +K+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV CDT YN GCNGGLMDYAF+FII NGG+DT++DYPY+G+D
Subjt:  VNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLD

Query:  GQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFK
        G CD  R+NAKVV+ID YEDVP   EE+LKKAVAHQP+S+AIEA G A QLY SG+F G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY +
Subjt:  GQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFK

Query:  LERNVKHTTDGKCGIAMEASYPVKNGNN
        + RN+  ++ GKCGIA+E SYP+KNG N
Subjt:  LERNVKHTTDGKCGIAMEASYPVKNGNN

AT4G36880.1 cysteine proteinase11.3e-12365.85Show/hide
Query:  RTDGEVREIYDIWLAKHGKAYNG----IEEREKRFLIFKDNLNFLDEHNSQNR--TYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRY--A
        RTD EVR IY  W A+HGK  N     I +++KRF IFKDNL F+D HN  N+  TY +GL  F DLTN+EYR  +LG R+ PARR+ KAK+ +++Y  A
Subjt:  RTDGEVREIYDIWLAKHGKAYNG----IEEREKRFLIFKDNLNFLDEHNSQNR--TYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRY--A

Query:  VNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLD
        VN  + +PE+VDWR KGAV PIK+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV CD  YN GCNGGLMDYAFQFI+ NGGL+TE+DYPY G  
Subjt:  VNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLD

Query:  GQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFK
        G+C+   +N++VVSIDGYEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG CG+ LDH VVAVGYG+ENGVDYW+VRNSWG  WGE+GY +
Subjt:  GQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFK

Query:  LERNVKHTTDGKCGIAMEASYPVKNGNN
        +ERN+  +  GKCGIA+EASYPVK   N
Subjt:  LERNVKHTTDGKCGIAMEASYPVKNGNN

AT4G36890.1 Nucleotide-diphospho-sugar transferases superfamily protein1.9e-17261.69Show/hide
Query:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFR---SATNLNIHS---TLPN
        MKLSAL Q+Y  RR+NSFR  +SLDSS D   KS   + WLILH  CCLISLVLGFRFSRLVFF  FSTS+ T LY  PFR      +L++H+   TL +
Subjt:  MKLSALQQTYAARRANSFRG-SSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFR---SATNLNIHS---TLPN

Query:  PIVNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE
        P  N TT    +T SSRVVVGRHGIRIRPWPHPNP+EVMKAHQII  VQ+EQ+  FG+K+ + +IA+TPTYVRT QALH+TGVMHSLMLVPY LVWIVVE
Subjt:  PIVNKTTPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVE

Query:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDEL-
        AGG TNET  I+A+ G+ TIHVG +QRMPN+WE R ++E  MRL ALR+V +  LDG V+F DDSNM+SME FDEIQNVKWFG +SVGI+  S   +E+ 
Subjt:  AGGITNETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDEL-

Query:  -----------SEEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFE
                    EE E +S+P QGPACNS+++L+GWH FN LPYAGKSA +I D  +VLP+KLEWSGFVLNSRLLW++AE+KPEWV DF +L+  +E  E
Subjt:  -----------SEEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFE

Query:  SPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEK-----------AVIIKQEETSKHPAKTHSS
        SPL LLKD SMVEPLGSCGRQVL+WWLRVEAR DSKFP GW+I+PPLEITV AKRTPWPDVPPE PT +K            VI KQ++   HP K    
Subjt:  SPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEK-----------AVIIKQEETSKHPAKTHSS

Query:  RSRRSSRSKRKRQEPNSESSIF
        + R+S +SK + +  ++ + ++
Subjt:  RSRRSSRSKRKRQEPNSESSIF

AT5G43060.1 Granulin repeat cysteine protease family protein1.3e-12566.06Show/hide
Query:  ISALNQRTDGEVREIYDIWLAKHGKA---YNGI-EEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRR
        I+    R+D EV  IY+ W+ +HGK     NG+  E+++RF IFKDNL F+DEHN++N +Y +GL  FADLTNEEYR+ +LG +  P +RV+K    S R
Subjt:  ISALNQRTDGEVREIYDIWLAKHGKA---YNGI-EEREKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRR

Query:  YAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG
        Y     D LP+SVDWR +GAVA +K+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV CDT YN GCNGGLMDYAF+FII NGG+DTE DYPY+ 
Subjt:  YAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG

Query:  LDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGY
         DG+CD  R+NAKVV+ID YEDVP N E +LKKA+AHQP+SVAIEA G A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY
Subjt:  LDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGY

Query:  FKLERNVKHTTDGKCGIAMEASYPVKNGNN
         K+ RN++  T GKCGIAMEASYP+K G N
Subjt:  FKLERNVKHTTDGKCGIAMEASYPVKNGNN

AT5G67230.1 Nucleotide-diphospho-sugar transferases superfamily protein2.8e-16360.36Show/hide
Query:  MKLSALQQTYAARRANSFRGS-SLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKT
        MKLS  + +Y  RR +SFR S SLD S D   KS + + W ++HG CCLISL+LGFRFS LV F  FSTS  T LY TPF  A N  +   L    +   
Subjt:  MKLSALQQTYAARRANSFRGS-SLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKT

Query:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN
        T +T+   +SRVVVGRHGIRIRPWPHPNPIEV++AHQ++  VQ+EQ+  +GV++PR +I +TPTYVRT QALH+TGVMHSLMLVPY LVWIVVEAGGITN
Subjt:  TPATISTSSSRVVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITN

Query:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS------
        ETAS +A+ G++TIH+GF+Q+MPN+WE RH++E  MRLHALR+V +  LDG V+F DDSNM+SME FDEIQ VKWFGALSVGI+  S   DELS      
Subjt:  ETASILAQPGVETIHVGFNQRMPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELS------

Query:  --EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDA-EDKPEWVNDFDTLDVGDEAFESPLFLLKDA
          +  EK S+P QGP+CNSS KLVGWH FN  PYA K+A +I +K  V+P K+EWSGFVLNSRLLWK++ +DKP WV D   LD G    ESPL L+KD 
Subjt:  --EEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSAKFIGDKISVLPRKLEWSGFVLNSRLLWKDA-EDKPEWVNDFDTLDVGDEAFESPLFLLKDA

Query:  SMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEPNS
        SMVEPLGSCGR+VL+WWLRVEAR DSKFP GW+I+ PLEITVP+KRTPWPD   ELP    A  IK+       AK++S      S+S +++QEP +
Subjt:  SMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWPDVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTTTCTGCATTGCAGCAGACATATGCTGCCCGTCGGGCTAATAGCTTCAGAGGGTCGTCCTTGGATTCGTCGGCGGATAGTCCAATCAAGTCGTCCGCCGGGAT
TTTGTGGCTGATTCTTCATGGGTTTTGTTGTCTCATAAGTCTTGTACTTGGATTTCGCTTCTCTCGTCTTGTATTTTTCCTTTTCTTCTCTACTTCCAACCCCACTACCC
TCTACTTGACTCCTTTTCGATCTGCTACTAACCTCAACATTCACTCTACGCTTCCAAATCCGATTGTAAATAAGACGACGCCTGCGACCATTTCCACTTCTAGTAGCCGC
GTGGTTGTTGGGAGGCATGGAATCCGAATCCGGCCATGGCCACATCCAAACCCAATTGAAGTTATGAAGGCTCACCAAATCATTGAGACGGTACAGAGGGAACAGAGGCG
TCAATTCGGGGTCAAGAATCCACGGAAGATAATTGCTATAACTCCAACTTATGTGCGGACTTTACAGGCGCTCCACATGACTGGCGTTATGCACTCGCTGATGTTAGTCC
CTTACCCGCTGGTTTGGATCGTGGTGGAGGCCGGTGGAATCACCAATGAGACTGCTTCAATTCTTGCTCAGCCTGGGGTGGAGACTATCCACGTTGGGTTCAATCAGCGG
ATGCCGAATTCTTGGGAGGGTAGGCATCGAATTGAGGCTCATATGAGGCTTCATGCCTTGAGAATTGTGAGCAAAATGATGCTGGATGGAACTGTGATATTTGTGGATGA
CAGTAATATGTATAGTATGGAGTTTTTTGATGAAATCCAGAACGTGAAGTGGTTTGGTGCTCTGTCTGTTGGGATTATTGTTCTGTCTGATAAACAAGACGAATTATCAG
AGGAGGTGGAGAAGGCATCAATCCCTGCTCAGGGTCCTGCTTGCAATTCTTCCAATAAGTTGGTTGGTTGGCATACCTTCAATGCGCTTCCATATGCTGGAAAGAGTGCT
AAATTCATTGGTGACAAGATATCAGTTCTTCCGAGGAAGCTGGAGTGGTCTGGGTTTGTGTTGAATTCCAGGTTGCTATGGAAAGATGCAGAAGATAAGCCAGAATGGGT
TAATGATTTTGATACATTGGATGTCGGTGACGAGGCTTTCGAGAGTCCACTATTTCTTTTGAAGGATGCATCGATGGTTGAGCCTCTTGGAAGTTGTGGCCGCCAAGTTT
TGGTCTGGTGGCTCAGAGTTGAAGCTCGTTTTGATAGCAAATTTCCTCATGGGTGGTTAATTGAGCCTCCTTTAGAGATTACTGTACCTGCAAAACGAACACCATGGCCG
GATGTTCCTCCTGAACTCCCAACTGACGAAAAAGCTGTGATAATCAAGCAAGAAGAAACATCTAAGCATCCTGCAAAGACTCATTCATCCAGATCCAGAAGAAGTTCTCG
AAGCAAGAGAAAGCGGCAGGAACCAAACTCCGAATCTTCAATCTTCCCTGTTCTTCAAGCAACCTTCGCCATGGCCCTCGCCACTACTTTCCTCGCCTTCCTCTCCTTCT
TCGTCCTTTCCATTTCCGCCCTGAACCAGCGGACCGACGGCGAGGTTCGAGAAATCTACGACATATGGCTCGCGAAGCACGGCAAGGCCTACAACGGAATCGAAGAACGG
GAGAAGAGGTTTCTGATCTTCAAGGACAATCTCAACTTCCTCGATGAGCACAATTCCCAGAATCGGACGTATACGGTTGGATTGAACATGTTTGCTGATTTGACCAACGA
GGAGTATCGGGCTACGTTTTTGGGGACTAGGTCTCATCCTGCTCGCAGAGTCATGAAGGCCAAGAGCGCCAGCCGCCGATACGCCGTCAACGACGATGATCGGTTGCCGG
AATCCGTCGATTGGAGGACTAAAGGTGCCGTTGCTCCGATTAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCATTCTCGACCATAGCAGCTGTGGAAGGAATAAACCAG
ATCGTCACAGGAGAACTCATCTCTCTCTCGGAACAAGAGCTTGTTAGTTGTGACACAAAGTACAATTCAGGCTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCAT
CATTGACAATGGCGGTTTGGACACCGAGGAAGATTATCCTTATGAGGGCTTGGATGGACAATGCGATCCGACAAGGGAAAATGCCAAGGTTGTTAGCATTGATGGCTACG
AGGATGTCCCTGCCAATGACGAGGAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTGAGTGTCGCCATTGAAGCTAGTGGCTTGGCCTTACAACTTTACCAGTCGGGT
GTATTTACTGGTAAATGTGGCTCGGCTCTTGACCATGGCGTGGTGGCTGTTGGATATGGAACAGAGAATGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGG
ATGGGGTGAGGATGGCTACTTCAAGCTAGAGCGCAATGTGAAGCATACAACCGATGGGAAGTGTGGGATTGCAATGGAGGCTTCTTACCCTGTTAAGAATGGCAACAACA
ACAACCCAACAGGATCATATTTAGGTTTGGAACTTGCTGGAGACAAGAACAAGATCAGCAGTGCTTGA
mRNA sequenceShow/hide mRNA sequence
CGTCAAACAAGAATCAGAAAAAGACATTACTCAAGCTTTTCATTCTTAATCTCAAACAAATTCTCCTCTTTTCTGCTCTAATCATCAACGCCTTTACCTATTCTCTTGCT
ATCTTTACAAGCCAAATTGAATCTCAACTTTAAAGGAACATTAGAAACAACTCCTTTTCCGCGATTCGTCCAATTCTTGGGACCCAACCCCTCATCGTTTTCCTTTTGCT
GGAGTCAAGGGAGGGTTAGTGGGGCAAACCCAGAATATGAAGCTTTCTGCATTGCAGCAGACATATGCTGCCCGTCGGGCTAATAGCTTCAGAGGGTCGTCCTTGGATTC
GTCGGCGGATAGTCCAATCAAGTCGTCCGCCGGGATTTTGTGGCTGATTCTTCATGGGTTTTGTTGTCTCATAAGTCTTGTACTTGGATTTCGCTTCTCTCGTCTTGTAT
TTTTCCTTTTCTTCTCTACTTCCAACCCCACTACCCTCTACTTGACTCCTTTTCGATCTGCTACTAACCTCAACATTCACTCTACGCTTCCAAATCCGATTGTAAATAAG
ACGACGCCTGCGACCATTTCCACTTCTAGTAGCCGCGTGGTTGTTGGGAGGCATGGAATCCGAATCCGGCCATGGCCACATCCAAACCCAATTGAAGTTATGAAGGCTCA
CCAAATCATTGAGACGGTACAGAGGGAACAGAGGCGTCAATTCGGGGTCAAGAATCCACGGAAGATAATTGCTATAACTCCAACTTATGTGCGGACTTTACAGGCGCTCC
ACATGACTGGCGTTATGCACTCGCTGATGTTAGTCCCTTACCCGCTGGTTTGGATCGTGGTGGAGGCCGGTGGAATCACCAATGAGACTGCTTCAATTCTTGCTCAGCCT
GGGGTGGAGACTATCCACGTTGGGTTCAATCAGCGGATGCCGAATTCTTGGGAGGGTAGGCATCGAATTGAGGCTCATATGAGGCTTCATGCCTTGAGAATTGTGAGCAA
AATGATGCTGGATGGAACTGTGATATTTGTGGATGACAGTAATATGTATAGTATGGAGTTTTTTGATGAAATCCAGAACGTGAAGTGGTTTGGTGCTCTGTCTGTTGGGA
TTATTGTTCTGTCTGATAAACAAGACGAATTATCAGAGGAGGTGGAGAAGGCATCAATCCCTGCTCAGGGTCCTGCTTGCAATTCTTCCAATAAGTTGGTTGGTTGGCAT
ACCTTCAATGCGCTTCCATATGCTGGAAAGAGTGCTAAATTCATTGGTGACAAGATATCAGTTCTTCCGAGGAAGCTGGAGTGGTCTGGGTTTGTGTTGAATTCCAGGTT
GCTATGGAAAGATGCAGAAGATAAGCCAGAATGGGTTAATGATTTTGATACATTGGATGTCGGTGACGAGGCTTTCGAGAGTCCACTATTTCTTTTGAAGGATGCATCGA
TGGTTGAGCCTCTTGGAAGTTGTGGCCGCCAAGTTTTGGTCTGGTGGCTCAGAGTTGAAGCTCGTTTTGATAGCAAATTTCCTCATGGGTGGTTAATTGAGCCTCCTTTA
GAGATTACTGTACCTGCAAAACGAACACCATGGCCGGATGTTCCTCCTGAACTCCCAACTGACGAAAAAGCTGTGATAATCAAGCAAGAAGAAACATCTAAGCATCCTGC
AAAGACTCATTCATCCAGATCCAGAAGAAGTTCTCGAAGCAAGAGAAAGCGGCAGGAACCAAACTCCGAATCTTCAATCTTCCCTGTTCTTCAAGCAACCTTCGCCATGG
CCCTCGCCACTACTTTCCTCGCCTTCCTCTCCTTCTTCGTCCTTTCCATTTCCGCCCTGAACCAGCGGACCGACGGCGAGGTTCGAGAAATCTACGACATATGGCTCGCG
AAGCACGGCAAGGCCTACAACGGAATCGAAGAACGGGAGAAGAGGTTTCTGATCTTCAAGGACAATCTCAACTTCCTCGATGAGCACAATTCCCAGAATCGGACGTATAC
GGTTGGATTGAACATGTTTGCTGATTTGACCAACGAGGAGTATCGGGCTACGTTTTTGGGGACTAGGTCTCATCCTGCTCGCAGAGTCATGAAGGCCAAGAGCGCCAGCC
GCCGATACGCCGTCAACGACGATGATCGGTTGCCGGAATCCGTCGATTGGAGGACTAAAGGTGCCGTTGCTCCGATTAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCA
TTCTCGACCATAGCAGCTGTGGAAGGAATAAACCAGATCGTCACAGGAGAACTCATCTCTCTCTCGGAACAAGAGCTTGTTAGTTGTGACACAAAGTACAATTCAGGCTG
CAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGTTTGGACACCGAGGAAGATTATCCTTATGAGGGCTTGGATGGACAATGCGATCCGACAA
GGGAAAATGCCAAGGTTGTTAGCATTGATGGCTACGAGGATGTCCCTGCCAATGACGAGGAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTGAGTGTCGCCATTGAA
GCTAGTGGCTTGGCCTTACAACTTTACCAGTCGGGTGTATTTACTGGTAAATGTGGCTCGGCTCTTGACCATGGCGTGGTGGCTGTTGGATATGGAACAGAGAATGGAGT
TGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAGGATGGCTACTTCAAGCTAGAGCGCAATGTGAAGCATACAACCGATGGGAAGTGTGGGATTGCAA
TGGAGGCTTCTTACCCTGTTAAGAATGGCAACAACAACAACCCAACAGGATCATATTTAGGTTTGGAACTTGCTGGAGACAAGAACAAGATCAGCAGTGCTTGAATTGAA
CATCAAGGCAGTTGAAAGATGGTTACTTCTAGTTTTATGTTTTAATTCCTGTTGGTCTGAAGTTAGGCCTATTGCTATACCTACAAATCTAAAAGTGTTCGTAAGTGATT
GGACATCAATGTTCCTTCACAAGTTAATGATATGTGTGATATCAAAAGATGGAGTTTGTTAATATCGCACACCCCACCCATCAAGATGAAGGACAGAAGAGAAAATGCAA
AGAAAAATATAGATTGAGATACCAAAGTGGAAACTATAGTTACAATAAGATAATTCATTCTGAAAATTAAATAAGATAATTCATAG
Protein sequenceShow/hide protein sequence
MKLSALQQTYAARRANSFRGSSLDSSADSPIKSSAGILWLILHGFCCLISLVLGFRFSRLVFFLFFSTSNPTTLYLTPFRSATNLNIHSTLPNPIVNKTTPATISTSSSR
VVVGRHGIRIRPWPHPNPIEVMKAHQIIETVQREQRRQFGVKNPRKIIAITPTYVRTLQALHMTGVMHSLMLVPYPLVWIVVEAGGITNETASILAQPGVETIHVGFNQR
MPNSWEGRHRIEAHMRLHALRIVSKMMLDGTVIFVDDSNMYSMEFFDEIQNVKWFGALSVGIIVLSDKQDELSEEVEKASIPAQGPACNSSNKLVGWHTFNALPYAGKSA
KFIGDKISVLPRKLEWSGFVLNSRLLWKDAEDKPEWVNDFDTLDVGDEAFESPLFLLKDASMVEPLGSCGRQVLVWWLRVEARFDSKFPHGWLIEPPLEITVPAKRTPWP
DVPPELPTDEKAVIIKQEETSKHPAKTHSSRSRRSSRSKRKRQEPNSESSIFPVLQATFAMALATTFLAFLSFFVLSISALNQRTDGEVREIYDIWLAKHGKAYNGIEER
EKRFLIFKDNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYAVNDDDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQ
IVTGELISLSEQELVSCDTKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSG
VFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNNNNPTGSYLGLELAGDKNKISSA