; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022711 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022711
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationtig00000589:1102465..1105437
RNA-Seq ExpressionSgr022711
SyntenySgr022711
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580525.1 hypothetical protein SDJN03_20527, partial [Cucurbita argyrosperma subsp. sororia]1.2e-21178.94Show/hide
Query:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW
        NC+ CHR       RKI    IAVE  Q+   TN+SHLLFGIAGSTKTWKKRQ YCELWW PN+TRGFVWVDEKPN TWPATSPPYRVSADTS+F+YTCW
Subjt:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW

Query:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR
        YGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLVTVL KYDH QMYYIGSNSESVEQ++IH Y TAYGGGGYAISY LA ELVRILDGCL+R
Subjt:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR

Query:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV
        YASLYGGDQKVQACV+EIGVPLTK  GFHQVDIRGSQYG LAAHPVAPLVSLHHVDYLP IFPTM QIDALK LKT Y+LDPGRTLQQSFCY  ARNWS+
Subjt:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV

Query:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT
        +VSWGY+VQLYP L T K+MEK+FQTF+TWKSWSDGPFTFNTRPV SDPC+MP++F LD    PNRTVTSYK+ +DVW KECS DEFQ AQ VE FRVVT
Subjt:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT

Query:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
            S + W KAPRRQCCQVV+G +     +DS+VNV +RGCN FETVTPP
Subjt:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

XP_002309080.2 uncharacterized protein LOC7487270 [Populus trichocarpa]1.2e-17664.67Show/hide
Query:  CSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYG
        C ECH+++  S HRKIT  +++ +  + TNISH+LFGI GS KTW KR+ Y ELWW PNITRG+VW+D+KP  N TWP TSP Y+VSADTS+F YTC YG
Subjt:  CSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYG

Query:  SRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYA
        SRSA+R+ARIVKESFELG ENVRWFVMGDDDTVFF+ENLV VLAKYDH QMYYIG NSESVEQD+IHSY  AYGGGG+AISYPLA ELVR+LDGC++RYA
Subjt:  SRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYA

Query:  SLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTV
        S YG DQK+Q C+SEIGVPLTK +GFHQVDIRG  YG LAAHP+APLVSLHH+DY+ +IFP +N+ID++K L + Y +DPGR LQ SFCYDL RNWSV+ 
Subjt:  SLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTV

Query:  SWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGG-PNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTS
        SWGY++Q++PSL+TAK++E  F+TFQTW+SWS+GPFTFNTRP+   PC  PVV+ LD V    + T+T+YK+ +  + + C   E+     V+   V TS
Subjt:  SWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGG-PNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTS

Query:  AQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
          L P +W+ APRRQCC+V+     G  GV+S+V ++IRGCN+FE+VTPP
Subjt:  AQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

XP_018842960.1 uncharacterized protein LOC109007653 [Juglans regia]9.5e-17761.55Show/hide
Query:  YGKTAITCMIISSSPLVSKAGSVFWRQESSNNDDSKFNCSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITR
        Y   ++ C I S S +   A S +              C ECHR +    H+K++   I+   ++ TNISH+LFGI GS KTW+KR+ Y ELWWKPN++R
Subjt:  YGKTAITCMIISSSPLVSKAGSVFWRQESSNNDDSKFNCSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITR

Query:  GFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVE
        GFVW+DE+P  NVTWP T PP++VS DTS+FNYTCWYG RSAIR+ARIVKESFELGL+NVRWFVMGDDDTVFF ENL+ VLAKYDH QMYYIG NSESVE
Subjt:  GFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVE

Query:  QDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPT
        QD+IHSY  AYGGGGYAISYPLAAELVR+LDGC++RYA  +G DQKVQ C+SEIGVP+TK +GFHQVDIRG+ YG LAAHP+APLVSLHH+DY+ +IFP 
Subjt:  QDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPT

Query:  MNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--
          QID+LKTL   Y+ DP R LQ SFCYDL RNWSV+VSWGYSVQLYPSL TAK +E   +TF+TW+SWSD PFTFNTRPV +D C+ P+++ LD V   
Subjt:  MNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--

Query:  GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
        G  +T+T+YKK+      EC   E+  A  V  F  V+++   P +W KAPRRQCC++++    G  GVD+++ V IRGCNRF++VTPP
Subjt:  GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

XP_022935184.1 uncharacterized protein LOC111442138 [Cucurbita moschata]4.5e-21178.49Show/hide
Query:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW
        NC+ CHR       RKI    IAVE  Q+   TN+SHLLFGIAGSTKTWKKRQ YCELWW PN+TRGFVWVDEKPN TWPATSPPYRVSADTS+F+YTCW
Subjt:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW

Query:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR
        YGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLVTVL KYDH QMYYIGSNSESVEQ++IH Y TAYGGGGYAISY LA ELVRILDGCL+R
Subjt:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR

Query:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV
        YASLYGGDQKVQACV+EIGVPLTK  GFHQVDIRGSQYG LAAHPVAPLVSLHHVDYLP IFPTM QIDALK LKT Y+LDPGRTLQQSFCYD ARNWS+
Subjt:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV

Query:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT
        +VSWGY+VQLYP L T K++EK+FQTF+TWKSWSDGPFTFNTRPV SDPC+MP++F LD    PNRTVTSYK+ +DVW KEC  DEFQ AQ VE FRVVT
Subjt:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT

Query:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
            S + W KAPRRQCCQVV+G +     +D++VNV  RGCN FETVTPP
Subjt:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

XP_022983299.1 uncharacterized protein LOC111481920 [Cucurbita maxima]1.4e-21279.16Show/hide
Query:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW
        NC++CHR       RKI    IAVE  Q+   TNISHLLFGIAGSTKTWKKRQ YCELWW PN+TRGFVWVDEKPN TWPATSPPYRVSADTS+F+YTCW
Subjt:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW

Query:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR
        YGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLVTVL KYDH QMYYIGSNSESVEQD+IH Y TAYGGGGYAISY LA ELVRILDGCL+R
Subjt:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR

Query:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV
        YASLYGGDQKVQACV+EIGVPLTK  GFHQVDIRGSQYG LAAHPVAPLVSLHHVDYLP IFPTM +IDALK LKT Y+LDPGRTLQQSFCYD ARNWS+
Subjt:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV

Query:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT
        +VSWGY+VQLYP L T K+MEK+FQTF+TWKSWSDGPFTFNTRPV SDPCEMP++F LD V  PNRTVT+YK+ +DVW KECS DEFQ  Q VE FRVVT
Subjt:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT

Query:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
        S   S + W KAPRRQCCQVV+  +     +D++VNV +RGCN FETVTPP
Subjt:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

TrEMBL top hitse value%identityAlignment
A0A2I4GGD7 uncharacterized protein LOC1090076534.6e-17761.55Show/hide
Query:  YGKTAITCMIISSSPLVSKAGSVFWRQESSNNDDSKFNCSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITR
        Y   ++ C I S S +   A S +              C ECHR +    H+K++   I+   ++ TNISH+LFGI GS KTW+KR+ Y ELWWKPN++R
Subjt:  YGKTAITCMIISSSPLVSKAGSVFWRQESSNNDDSKFNCSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITR

Query:  GFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVE
        GFVW+DE+P  NVTWP T PP++VS DTS+FNYTCWYG RSAIR+ARIVKESFELGL+NVRWFVMGDDDTVFF ENL+ VLAKYDH QMYYIG NSESVE
Subjt:  GFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVE

Query:  QDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPT
        QD+IHSY  AYGGGGYAISYPLAAELVR+LDGC++RYA  +G DQKVQ C+SEIGVP+TK +GFHQVDIRG+ YG LAAHP+APLVSLHH+DY+ +IFP 
Subjt:  QDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPT

Query:  MNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--
          QID+LKTL   Y+ DP R LQ SFCYDL RNWSV+VSWGYSVQLYPSL TAK +E   +TF+TW+SWSD PFTFNTRPV +D C+ P+++ LD V   
Subjt:  MNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--

Query:  GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
        G  +T+T+YKK+      EC   E+  A  V  F  V+++   P +W KAPRRQCC++++    G  GVD+++ V IRGCNRF++VTPP
Subjt:  GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

A0A2P6R9S4 Uncharacterized protein7.9e-17766.29Show/hide
Query:  HRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEK--PNVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVK
        HRKIT     V     TNISH+LFGI GS K+W KR+ Y +LWWKPN+TRGF+W+D+K  PN+TWP TSP Y+VSADTS+F Y+CWYGSRSAIR+ARIVK
Subjt:  HRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEK--PNVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVK

Query:  ESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQAC
        ESFELG ENVRWFVMGDDDTVFF +NLVTVLAKYDH QMYYIG NSESVEQD+IHSY  AYGGGG+AISYPLAAELVR+LDGC++RY + YG DQKV+ C
Subjt:  ESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQAC

Query:  VSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSL
        +SEIGVPLTK +GFHQVDIRGS YG LAAHPVAPLVSLHH+DY+  +FP +  ID++K L  VY +DPGRTLQ SFCYDL RNWSV+VSWGY+VQLYPSL
Subjt:  VSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSL

Query:  VTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKA
        VTAK +E   +TFQTW+SW  GP+TFNTR V  DPC+ P+V+ LD V   G + T+T YK++ +   K+C   ++ AA  V+ F  V+++   P +W KA
Subjt:  VTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVG--GPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKA

Query:  PRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
        PRRQCC+++D    GG GV ++V + +RGCNRFE+VTPP
Subjt:  PRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

A0A6J1F4P4 uncharacterized protein LOC1114421382.2e-21178.49Show/hide
Query:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW
        NC+ CHR       RKI    IAVE  Q+   TN+SHLLFGIAGSTKTWKKRQ YCELWW PN+TRGFVWVDEKPN TWPATSPPYRVSADTS+F+YTCW
Subjt:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW

Query:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR
        YGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLVTVL KYDH QMYYIGSNSESVEQ++IH Y TAYGGGGYAISY LA ELVRILDGCL+R
Subjt:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR

Query:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV
        YASLYGGDQKVQACV+EIGVPLTK  GFHQVDIRGSQYG LAAHPVAPLVSLHHVDYLP IFPTM QIDALK LKT Y+LDPGRTLQQSFCYD ARNWS+
Subjt:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV

Query:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT
        +VSWGY+VQLYP L T K++EK+FQTF+TWKSWSDGPFTFNTRPV SDPC+MP++F LD    PNRTVTSYK+ +DVW KEC  DEFQ AQ VE FRVVT
Subjt:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT

Query:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
            S + W KAPRRQCCQVV+G +     +D++VNV  RGCN FETVTPP
Subjt:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

A0A6J1J6Y4 uncharacterized protein LOC1114819206.8e-21379.16Show/hide
Query:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW
        NC++CHR       RKI    IAVE  Q+   TNISHLLFGIAGSTKTWKKRQ YCELWW PN+TRGFVWVDEKPN TWPATSPPYRVSADTS+F+YTCW
Subjt:  NCSECHRTVQNSVHRKITVDHIAVEMQQT---TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCW

Query:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR
        YGSRSA+RLARIVKESFELG+ENVRWFVMGDDDTVFFVENLVTVL KYDH QMYYIGSNSESVEQD+IH Y TAYGGGGYAISY LA ELVRILDGCL+R
Subjt:  YGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNR

Query:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV
        YASLYGGDQKVQACV+EIGVPLTK  GFHQVDIRGSQYG LAAHPVAPLVSLHHVDYLP IFPTM +IDALK LKT Y+LDPGRTLQQSFCYD ARNWS+
Subjt:  YASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSV

Query:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT
        +VSWGY+VQLYP L T K+MEK+FQTF+TWKSWSDGPFTFNTRPV SDPCEMP++F LD V  PNRTVT+YK+ +DVW KECS DEFQ  Q VE FRVVT
Subjt:  TVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVT

Query:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
        S   S + W KAPRRQCCQVV+  +     +D++VNV +RGCN FETVTPP
Subjt:  SAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

B9HBV0 Uncharacterized protein6.0e-17764.67Show/hide
Query:  CSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYG
        C ECH+++  S HRKIT  +++ +  + TNISH+LFGI GS KTW KR+ Y ELWW PNITRG+VW+D+KP  N TWP TSP Y+VSADTS+F YTC YG
Subjt:  CSECHRTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYRVSADTSKFNYTCWYG

Query:  SRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYA
        SRSA+R+ARIVKESFELG ENVRWFVMGDDDTVFF+ENLV VLAKYDH QMYYIG NSESVEQD+IHSY  AYGGGG+AISYPLA ELVR+LDGC++RYA
Subjt:  SRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYA

Query:  SLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTV
        S YG DQK+Q C+SEIGVPLTK +GFHQVDIRG  YG LAAHP+APLVSLHH+DY+ +IFP +N+ID++K L + Y +DPGR LQ SFCYDL RNWSV+ 
Subjt:  SLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTV

Query:  SWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGG-PNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTS
        SWGY++Q++PSL+TAK++E  F+TFQTW+SWS+GPFTFNTRP+   PC  PVV+ LD V    + T+T+YK+ +  + + C   E+     V+   V TS
Subjt:  SWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGG-PNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTS

Query:  AQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP
          L P +W+ APRRQCC+V+     G  GV+S+V ++IRGCN+FE+VTPP
Subjt:  AQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37730.1 Protein of unknown function (DUF604)3.1e-16558.15Show/hide
Query:  FNCSECH-------------RTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYR
        F CS+CH                 NS     T+     +  + T+ISH+ FGI GS +TW+ R  Y ELWW+PN+TRGF+W+DE+P  N+TW +TSPPY+
Subjt:  FNCSECH-------------RTVQNSVHRKITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP--NVTWPATSPPYR

Query:  VSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLA
        VSADTS+F+YTCWYGSRSAIR+ARI+KE+FELGL +VRWF+MGDDDTVFFV+NL+TVL KYDH QMYYIG NSESVEQD++HSYA AYGGGG AISYPLA
Subjt:  VSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLA

Query:  AELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQ
         ELV++LDGC++RYASLYG DQK++AC+SEIGVPLTK +GFHQVDIRG+ YG LAAHPVAPLV+LHH+DY+  IFP   QIDAL+ L + Y  DP R +Q
Subjt:  AELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQ

Query:  QSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPV--GGPNRTVTSYKKHMDVW-AKECSG
         SFC+D  RNW V+VSWGY++Q+YP+LVTAKE+E  F TF++W++ S  PF+F+TRP+  DPCE P+V+ LD V   G  +T+T+Y+KH++V  + +C+ 
Subjt:  QSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPV--GGPNRTVTSYKKHMDVW-AKECSG

Query:  DEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTP
         ++  A  VE F  V++  L+P +W  APRRQCC++V+         +S++NV IR  N  E+VTP
Subjt:  DEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTP

AT3G11420.1 Protein of unknown function (DUF604)1.4e-12050Show/hide
Query:  TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADT-------SKFNYTCWYGSRSAIRLARIVKESFELGLENVR
        TNISH+ F IAG+ +TW  R  Y  LWW+ N TRGFVW+DE   +  P      R S  T       ++F ++    SR+A+R+ARI+ +S+ L L NVR
Subjt:  TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADT-------SKFNYTCWYGSRSAIRLARIVKESFELGLENVR

Query:  WFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKV
        WFVMGDDDTVFF ENLV VL+KYDH QM+YIG NSESVEQD++H+Y  A+GGGG+A+S PLAA L   +D CL RY   YG DQ++ +C+SEIGVP T+ 
Subjt:  WFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKV

Query:  IGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQ
         GFHQ+DIRG  YGFLAAHP+APLVSLHH+ YL  +FP  N I++L+TL   Y LDP R LQQ  C+D  R WS+++SWGY++Q+Y   +TA E+    Q
Subjt:  IGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQ

Query:  TFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAK---ECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVD
        TF+TW+S SDGPF FNTRP+  DPCE PV + +D  G  +   +  K    +  K    C   E      V+   +VTS +  P  W+KAPRRQCC+V++
Subjt:  TFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPNRTVTSYKKHMDVWAK---ECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVD

Query:  GIAAGGGGVDSLVNVHIRGCNRFETV
        G    G   +  + + IR C   E +
Subjt:  GIAAGGGGVDSLVNVHIRGCNRFETV

AT4G11350.1 Protein of unknown function (DUF604)3.2e-10646.15Show/hide
Query:  ITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPA----TSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKE
        +TV  +  E Q+ T+++H++FGIA S+K WK+R++Y ++W+KP   RG+VW+DE+  +        + P  R+S DTS F YT   G RSAIR++RIV E
Subjt:  ITVDHIAVEMQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPA----TSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKE

Query:  SF----ELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKV
        +         +NVRWFVMGDDDTVF  +NL+ VL KYDH QMYYIGS SES  Q++I SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++
Subjt:  SF----ELGLENVRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKV

Query:  QACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLY
        QAC++E+GVPLTK IGFHQ D+ G+ +G LAAHP+ P VS+HH+D +  IFP M ++ A+K L T   +D    LQQS CYD  ++W+++VSWG++VQ++
Subjt:  QACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLY

Query:  PSLVTAKEMEKNFQTFQTWKSWSD-GPFTFNTRPVGSDPCEMPVVFLL-----DPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLS
            + +EME   +TF  W   +D   + FNTRPV  + C+ P VF +     DP    N TV+ Y +H  V    C  D    A   E   +V   +  
Subjt:  PSLVTAKEMEKNFQTFQTWKSWSD-GPFTFNTRPVGSDPCEMPVVFLL-----DPVGGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLS

Query:  PAVWSKAPRRQCCQVV
        P +W+++PRR CC+V+
Subjt:  PAVWSKAPRRQCCQVV

AT4G23490.1 Protein of unknown function (DUF604)8.5e-10745.52Show/hide
Query:  MQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATS-----PPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLEN
        + + T+++H++FGIA S+K WK+R++Y ++W+KP   RG+VW+D++   +          PP ++S  T+ F YT   G RSA+R++RIV E+  LG +N
Subjt:  MQQTTNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATS-----PPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLEN

Query:  VRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLT
        VRWFVMGDDDTVF ++NL+ VL KYDH QMYYIGS SES  Q++  SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++QAC++E+GVPLT
Subjt:  VRWFVMGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLT

Query:  KVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKN
        K +GFHQ D+ G+ +G LAAHPV P VS+HH+D +  IFP M ++ ALK +     LD    LQQS CYD  ++W+++VSWGY+VQ++  + + +EME  
Subjt:  KVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKN

Query:  FQTFQTWKSWSD-GPFTFNTRPVGSDPCEMPVVFLLDPVGGP---NRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQ
         +TF  W   +D   + FNTRPV  +PC+ P VF +         N TV+ Y  H  V    C    ++     E   +V   +  P +W ++PRR CC+
Subjt:  FQTFQTWKSWSD-GPFTFNTRPVGSDPCEMPVVFLLDPVGGP---NRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQ

Query:  VV
        V+
Subjt:  VV

AT5G41460.1 Protein of unknown function (DUF604)5.3e-10948.74Show/hide
Query:  TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP----NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFV
        T   H++FGIA S + WK+R++Y ++W+KPN  R +VW+ EKP    +     + PP ++S DTSKF Y    G RSAIR++RIV E+ +LGL++VRWFV
Subjt:  TNISHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKP----NVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFV

Query:  MGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGF
        MGDDDTVF  ENL+ VL KYDH QMYYIGS SES  Q++  SY  AYGGGG+AISYPLA  L ++ D C+ RY +LYG D ++QAC++E+GVPLTK +GF
Subjt:  MGDDDTVFFVENLVTVLAKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGF

Query:  HQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQ
        HQ D+ G+ +G LAAHPVAPLV+LHH+D +  IFP M ++DALK L+    LD    +QQS CYD  R W+V+VSWG++VQ++  + +A+E+E   +TF 
Subjt:  HQVDIRGSQYGFLAAHPVAPLVSLHHVDYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQ

Query:  TWKSWSD-GPFTFNTRPVGSDPCEMPVVFLLDPV---GGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQV
         W   +D   + FNTRPV   PC+ P VF +         N TV+ Y+ H  V   EC    ++ A   +   V+   +  P +W ++PRR CC+V
Subjt:  TWKSWSD-GPFTFNTRPVGSDPCEMPVVFLLDPV---GGPNRTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGTCAAAGCCAATGGGCCACGCCCTGAAGTTTTGACTGAAGCCCGATATGAAATTCATGGCCCAATAATTAGCTTAACGAACTGTATGATATTGGATCAGGACAT
GACTGTCAGAACTTATGGAAAGACCGCTATAACCTGCATGATCATATCTTCGTCCCCTCTGGTGAGCAAAGCTGGTTCTGTTTTCTGGAGACAGGAGAGCTCCAACAATG
ACGATTCAAAATTCAACTGTTCTGAATGCCATCGCACGGTCCAAAACTCCGTCCATCGGAAAATCACTGTGGATCACATCGCCGTCGAGATGCAACAAACGACGAATATA
TCCCACCTTCTATTCGGCATCGCCGGGTCCACCAAGACGTGGAAAAAGCGTCAAGACTACTGCGAGCTCTGGTGGAAGCCAAATATCACCCGCGGGTTTGTCTGGGTTGA
CGAGAAGCCCAACGTCACGTGGCCGGCGACCTCCCCACCGTACAGAGTCTCGGCCGACACGTCGAAATTCAACTACACTTGCTGGTACGGGTCTCGGTCGGCTATAAGGT
TAGCAAGGATCGTAAAGGAGAGCTTCGAATTGGGATTGGAAAACGTCCGGTGGTTTGTAATGGGCGACGACGACACAGTATTCTTCGTGGAGAATCTGGTGACTGTTTTG
GCGAAATACGATCACACTCAGATGTATTACATCGGCTCCAACTCGGAGAGCGTCGAGCAAGACATGATCCATTCATATGCTACGGCGTACGGCGGCGGCGGATATGCGAT
AAGTTATCCACTGGCAGCAGAGCTCGTAAGGATTTTGGATGGTTGCCTGAACCGCTACGCCAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCCTGCGTTAGCGAAATCG
GTGTCCCCTTGACCAAAGTGATTGGCTTTCACCAAGTTGATATTCGAGGAAGCCAATATGGGTTCTTGGCGGCGCATCCAGTGGCTCCGTTGGTGTCGCTCCACCACGTC
GACTACCTTCCGGCCATATTTCCGACGATGAACCAAATCGACGCCCTGAAAACTTTAAAAACTGTGTACGACCTCGATCCCGGGCGGACCCTGCAGCAAAGTTTCTGTTA
CGACTTGGCTCGTAACTGGTCTGTTACAGTGTCGTGGGGCTACAGCGTCCAGCTCTACCCGTCGCTGGTGACGGCGAAGGAAATGGAGAAGAACTTTCAGACGTTTCAGA
CATGGAAGAGTTGGAGCGACGGTCCGTTCACGTTCAACACCCGACCAGTCGGGTCGGACCCGTGCGAGATGCCGGTAGTATTTTTGTTGGACCCGGTCGGAGGTCCGAAC
CGGACAGTGACCAGTTACAAGAAGCACATGGACGTGTGGGCAAAAGAGTGTAGCGGAGACGAGTTCCAGGCAGCGCAGGATGTGGAGCATTTTCGCGTGGTGACTTCCGC
CCAGTTGAGCCCTGCCGTGTGGTCTAAGGCCCCACGTAGACAATGTTGCCAAGTGGTGGATGGTATAGCTGCCGGTGGCGGTGGAGTTGATAGTTTGGTAAATGTCCATA
TCAGAGGCTGCAACCGCTTTGAGACGGTGACTCCTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGTCAAAGCCAATGGGCCACGCCCTGAAGTTTTGACTGAAGCCCGATATGAAATTCATGGCCCAATAATTAGCTTAACGAACTGTATGATATTGGATCAGGACAT
GACTGTCAGAACTTATGGAAAGACCGCTATAACCTGCATGATCATATCTTCGTCCCCTCTGGTGAGCAAAGCTGGTTCTGTTTTCTGGAGACAGGAGAGCTCCAACAATG
ACGATTCAAAATTCAACTGTTCTGAATGCCATCGCACGGTCCAAAACTCCGTCCATCGGAAAATCACTGTGGATCACATCGCCGTCGAGATGCAACAAACGACGAATATA
TCCCACCTTCTATTCGGCATCGCCGGGTCCACCAAGACGTGGAAAAAGCGTCAAGACTACTGCGAGCTCTGGTGGAAGCCAAATATCACCCGCGGGTTTGTCTGGGTTGA
CGAGAAGCCCAACGTCACGTGGCCGGCGACCTCCCCACCGTACAGAGTCTCGGCCGACACGTCGAAATTCAACTACACTTGCTGGTACGGGTCTCGGTCGGCTATAAGGT
TAGCAAGGATCGTAAAGGAGAGCTTCGAATTGGGATTGGAAAACGTCCGGTGGTTTGTAATGGGCGACGACGACACAGTATTCTTCGTGGAGAATCTGGTGACTGTTTTG
GCGAAATACGATCACACTCAGATGTATTACATCGGCTCCAACTCGGAGAGCGTCGAGCAAGACATGATCCATTCATATGCTACGGCGTACGGCGGCGGCGGATATGCGAT
AAGTTATCCACTGGCAGCAGAGCTCGTAAGGATTTTGGATGGTTGCCTGAACCGCTACGCCAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCCTGCGTTAGCGAAATCG
GTGTCCCCTTGACCAAAGTGATTGGCTTTCACCAAGTTGATATTCGAGGAAGCCAATATGGGTTCTTGGCGGCGCATCCAGTGGCTCCGTTGGTGTCGCTCCACCACGTC
GACTACCTTCCGGCCATATTTCCGACGATGAACCAAATCGACGCCCTGAAAACTTTAAAAACTGTGTACGACCTCGATCCCGGGCGGACCCTGCAGCAAAGTTTCTGTTA
CGACTTGGCTCGTAACTGGTCTGTTACAGTGTCGTGGGGCTACAGCGTCCAGCTCTACCCGTCGCTGGTGACGGCGAAGGAAATGGAGAAGAACTTTCAGACGTTTCAGA
CATGGAAGAGTTGGAGCGACGGTCCGTTCACGTTCAACACCCGACCAGTCGGGTCGGACCCGTGCGAGATGCCGGTAGTATTTTTGTTGGACCCGGTCGGAGGTCCGAAC
CGGACAGTGACCAGTTACAAGAAGCACATGGACGTGTGGGCAAAAGAGTGTAGCGGAGACGAGTTCCAGGCAGCGCAGGATGTGGAGCATTTTCGCGTGGTGACTTCCGC
CCAGTTGAGCCCTGCCGTGTGGTCTAAGGCCCCACGTAGACAATGTTGCCAAGTGGTGGATGGTATAGCTGCCGGTGGCGGTGGAGTTGATAGTTTGGTAAATGTCCATA
TCAGAGGCTGCAACCGCTTTGAGACGGTGACTCCTCCTTAG
Protein sequenceShow/hide protein sequence
MPVKANGPRPEVLTEARYEIHGPIISLTNCMILDQDMTVRTYGKTAITCMIISSSPLVSKAGSVFWRQESSNNDDSKFNCSECHRTVQNSVHRKITVDHIAVEMQQTTNI
SHLLFGIAGSTKTWKKRQDYCELWWKPNITRGFVWVDEKPNVTWPATSPPYRVSADTSKFNYTCWYGSRSAIRLARIVKESFELGLENVRWFVMGDDDTVFFVENLVTVL
AKYDHTQMYYIGSNSESVEQDMIHSYATAYGGGGYAISYPLAAELVRILDGCLNRYASLYGGDQKVQACVSEIGVPLTKVIGFHQVDIRGSQYGFLAAHPVAPLVSLHHV
DYLPAIFPTMNQIDALKTLKTVYDLDPGRTLQQSFCYDLARNWSVTVSWGYSVQLYPSLVTAKEMEKNFQTFQTWKSWSDGPFTFNTRPVGSDPCEMPVVFLLDPVGGPN
RTVTSYKKHMDVWAKECSGDEFQAAQDVEHFRVVTSAQLSPAVWSKAPRRQCCQVVDGIAAGGGGVDSLVNVHIRGCNRFETVTPP