CuGenDBv2

Sgr026757 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026757
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Zinc finger family protein, putative isoform 1
Genome location: tig00153033:3352417..3356693
RNA-Seq Expression: Sgr026757
Synteny: Sgr026757
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: NA
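
The genome location above uses a `contig:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span can be parsed and its length computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153033:3352417..3356693")
print(loc.contig, loc.start, loc.end, loc.length)  # length is 4277 under the inclusive assumption
```

Under that assumption the gene spans 4277 bp on the contig, which is longer than the CDS listed further down and would be consistent with the presence of introns or untranslated regions.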


Homology
GenBank top hits | e value | % identity
KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus] | 1.1e-224 | 86.54
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCCCGCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHYADQK L LN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF LDPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DGNGPVRSPSPAPTPQPHN HHPPTHHHHHHH PLTPAISPAPATEKGA EYGSPAPER+  SPKRSY AKPPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus] | 1.1e-224 | 86.54
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCCCGCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHYADQK L LN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF LDPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DGNGPVRSPSPAPTPQPHN HHPPTHHHHHHH PLTPAISPAPATEKGA EYGSPAPER+  SPKRSY AKPPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo] | 1.4e-219 | 84.68
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCC GCV IRRLIGF+CIFILLLS ALF+SAV WLPPF+HYADQK LGLN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF +DPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNN EF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG +GNGPVRSPSPAPTPQPHN+HHPPTHHHHHHH PL  AISPAPATEKGA EYGSPAPERS  SP+RSY A+PPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS

XP_031740216.1 uncharacterized protein LOC101216010 isoform X2 [Cucumis sativus] | 2.5e-224 | 86.51
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCCCGCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHYADQK L LN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF LDPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DGNGPVRSPSPAPTPQPHN HHPPTHHHHHHH PLTPAISPAPATEKGA EYGSPAPER+  SPKRSY AKPPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSP
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSP
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSP

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida] | 7.6e-221 | 86.48
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSGQVA+GRCCCGCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHYADQK LGLN SYR HDIVATF+VERPVSLLEDNI
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEF IPS++V ILSLESLPGSN TKVVF LDPD D+SEI ST LSLIRST  +LVTNQ FLRITKS+FGEAFSFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVT PTIVQSSVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG +GNGP RSPSPAP PQPHN  +PPT HHHHHH  LTPAISPAPATEKGA EYGSPAPERS  SPKRSY AKPPGCQY  KR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPS--PQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPS
        KS RKEGKQSH TPLASPN+SP HSAASPS  PQH+VNPPAAP+ P  A TPLPNVIYAHVQPPSKS+SN P+KSTTNPS APSPSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPS--PQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPS

TrEMBL top hits | e value | % identity
A0A0A0KYS3 Uncharacterized protein | 5.5e-225 | 86.54
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCCCGCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHYADQK L LN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF LDPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DGNGPVRSPSPAPTPQPHN HHPPTHHHHHHH PLTPAISPAPATEKGA EYGSPAPER+  SPKRSY AKPPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS

A0A1S3C173 uncharacterized protein LOC103495852 | 7.0e-220 | 84.68
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND EQPLPS + SRPSG VA+GRCC GCV IRRLIGF+CIFILLLS ALF+SAV WLPPF+HYADQK LGLN SYR HDIVATF+VER VSLLEDN 
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         QL  DIFEEFPIPS++V ILSLE L GSN TKVVF +DPD DDSEI ST LSLIRS   +LVTNQ FL ITKS FGEA+SFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTI+GSNSSNLGLNN EF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG +GNGPVRSPSPAPTPQPHN+HHPPTHHHHHHH PL  AISPAPATEKGA EYGSPAPERS  SP+RSY A+PPGCQYRYKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS
        KS RKEGKQSH TPLASPNISP HSAASPSPQHQ+NPPAAPVSP  A TPLPNVIYAHVQPPSKSDSN      NPS APSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPS

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X3 | 7.0e-212 | 82.75
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND E P PS VGS PS    +GRCC GCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHY+DQK LGLN SYR HDIVATF VERPVSLL+DNI
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         +L  DIFEEFPIPS++V ILSL SL GSN TKVVFG+DPD DD EIPST LSLIRST A++VTNQSFLRITKS+FGEAFSFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVT PTIVQSSVLLEVGNTPSM+RLKQLAQTI+ SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPTPQPHN+HHPP+HHHHHHHAPLTP ISPAPA E GA EYG  AP +S  SPKRSYEAKPPGCQ  YKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPSP
        KS RKEGKQ H +PLASP+ISPVHSAASPS QH        VSPT+ASTPLP+VIYAHVQPPSKSDSN P+KSTT+PS  PSPSPSP
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPSP

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X4 | 2.7e-211 | 82.72
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND E P PS VGS PS    +GRCC GCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHY+DQK LGLN SYR HDIVATF VERPVSLL+DNI
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         +L  DIFEEFPIPS++V ILSL SL GSN TKVVFG+DPD DD EIPST LSLIRST A++VTNQSFLRITKS+FGEAFSFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVT PTIVQSSVLLEVGNTPSM+RLKQLAQTI+ SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPTPQPHN+HHPP+HHHHHHHAPLTP ISPAPA E GA EYG  AP +S  SPKRSYEAKPPGCQ  YKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPS
        KS RKEGKQ H +PLASP+ISPVHSAASPS QH        VSPT+ASTPLP+VIYAHVQPPSKSDSN P+KSTT+PS  PSPSPS
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPS

A0A6J1EH92 uncharacterized protein LOC111432513 isoform X1 | 7.0e-212 | 82.75
Query:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI
        MGKND E P PS VGS PS    +GRCC GCV IRRLIGF+CIFILLLS ALF+SAVFWLPPFLHY+DQK LGLN SYR HDIVATF VERPVSLL+DNI
Subjt:  MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNI

Query:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF
         +L  DIFEEFPIPS++V ILSL SL GSN TKVVFG+DPD DD EIPST LSLIRST A++VTNQSFLRITKS+FGEAFSFEVLKF GGITIIPPQSAF
Subjt:  LQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVT PTIVQSSVLLEVGNTPSM+RLKQLAQTI+ SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPTPQPHN+HHPP+HHHHHHHAPLTP ISPAPA E GA EYG  AP +S  SPKRSYEAKPPGCQ  YKR
Subjt:  GKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKR

Query:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPSP
        KS RKEGKQ H +PLASP+ISPVHSAASPS QH        VSPT+ASTPLP+VIYAHVQPPSKSDSN P+KSTT+PS  PSPSPSP
Subjt:  KSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSN-PKKSTTNPSFAPSPSPSP

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 4.3e-36 | 35.22
Query:  AEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLG---LNSSYRDHDIVATFDVERPVSLLEDNILQLENDIFEEFPIP-SVEV
        + GR C       RL+G +C+ +L+LS A+ +SA+FWL P    ++ K  G   LN+S     + A+F +++PVS +  +  ++E+DI     +  + +V
Subjt:  AEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLG---LNSSYRDHDIVATFDVERPVSLLEDNILQLENDIFEEFPIP-SVEV

Query:  VILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAFLLQKVQILFNFTLNFSIH
         +LSL     SN T V F + P   D EI   SLSL+RS+F  L   +S L++T S FG+  SF+VLKF GGIT+ P + A +     +LF+ T+  SI 
Subjt:  VILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAFLLQKVQILFNFTLNFSIH

Query:  QIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG
         +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+   FG+VK +  S+ L      
Subjt:  QIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG

Query:  GDGNGPVRSPSPAPTPQP
         DG  P      AP P P
Subjt:  GDGNGPVRSPSPAPTPQP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 7.1e-100 | 48.41
Query:  MGKNDEEQPLPSVVGSRPSGQ--VAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLED
        MGK +++  L  V G   +G   V   RC C C WI   +GFKC+F+LLLS ALF+SA+F L PF    D++   L+  +R H IVA+F + R  S L +
Subjt:  MGKNDEEQPLPSVVGSRPSGQ--VAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLED

Query:  NILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQS
        N LQL+NDIF+E    S++V IL++E     NITKVVFG+DPD    EI   SLS I+  F +++ NQS L++TKSLFGE F FEVLKF GGIT+IPPQS
Subjt:  NILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQS

Query:  AFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNT
        AF LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LYV L N+EGSTV+ PT V SSVLL VG + S  RLKQL  TITGS S NLGLNNT
Subjt:  AFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNT

Query:  EFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP----------HNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYE
         FGKVKQVRLSS L +S      +   +SPSP+P+P            H++HH   +HHHHHH  L+P ++P        S   SPAP R   S KR+  
Subjt:  EFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP----------HNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYE

Query:  AKP---PGCQYRYKRKSARKEGKQSHSTPLASPNI-SPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAH-VQPPSKSDSNPKKSTTNPSFAPSPS
        A P   PG +  +K K       Q  STP  +P+  +P H   SP+P         P+     S PLP+V++AH  QPP    + P++   N    P P 
Subjt:  AKP---PGCQYRYKRKSARKEGKQSHSTPLASPNI-SPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAH-VQPPSKSDSNPKKSTTNPSFAPSPS

Query:  PS
         S
Subjt:  PS

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 1.6e-104 | 49.29
Query:  MGKND-EEQPLP---SVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLL
        MGKN  EEQ LP       +R +G      CCC C WI      +C+ IL  S A+F+SA+FWLPPFL +AD   L L+  ++DH IVA+FDV +P+S +
Subjt:  MGKND-EEQPLP---SVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLL

Query:  EDNILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPP
        EDN++QLENDI +E   P  +VV+L+LE L   N T V+F +DP+ ++S+IP+   SLI++ F TLV  Q   R+T+SLFGE F FEVLKF GGIT+IPP
Subjt:  EDNILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LY+ L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTIT S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP--HNY-HHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPG
        +T FGKVKQVRLSSIL HS        P  S +P+P+PQP  H Y HH P HHHHHH     P++SP     KG +   +P  + SP  P+       P 
Subjt:  NTEFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP--HNY-HHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPG

Query:  CQYRYKRKSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPSPSP
        C Y  +R           + P  +P+ S  H  A P+P     PP     P   S+PLP+V++AH+ PPSK  S+P+   T    +PSP+P+P
Subjt:  CQYRYKRKSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPSPSP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 1.6e-104 | 49.29
Query:  MGKND-EEQPLP---SVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLL
        MGKN  EEQ LP       +R +G      CCC C WI      +C+ IL  S A+F+SA+FWLPPFL +AD   L L+  ++DH IVA+FDV +P+S +
Subjt:  MGKND-EEQPLP---SVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLL

Query:  EDNILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPP
        EDN++QLENDI +E   P  +VV+L+LE L   N T V+F +DP+ ++S+IP+   SLI++ F TLV  Q   R+T+SLFGE F FEVLKF GGIT+IPP
Subjt:  EDNILQLENDIFEEFPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LY+ L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTIT S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP--HNY-HHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPG
        +T FGKVKQVRLSSIL HS        P  S +P+P+PQP  H Y HH P HHHHHH     P++SP     KG +   +P  + SP  P+       P 
Subjt:  NTEFGKVKQVRLSSILKHSLNGGDGNGPVRSPSPAPTPQP--HNY-HHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPG

Query:  CQYRYKRKSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPSPSP
        C Y  +R           + P  +P+ S  H  A P+P     PP     P   S+PLP+V++AH+ PPSK  S+P+   T    +PSP+P+P
Subjt:  CQYRYKRKSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAAPVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPSPSP


Sequences
CDS sequence
ATGGGAAAGAACGACGAAGAACAGCCGCTGCCGTCCGTCGTCGGCTCGCGGCCGTCCGGCCAGGTTGCCGAGGGTCGATGCTGTTGTGGGTGTGTTTGGATTCGGAGGCT
CATTGGGTTCAAATGCATCTTCATTCTGCTATTGTCCTTTGCCTTGTTCATTTCTGCTGTCTTTTGGCTGCCCCCTTTTCTCCATTACGCAGATCAAAAGGGTCTGGGTC
TTAATTCCTCGTATCGAGATCATGATATAGTAGCAACGTTCGATGTTGAGAGACCAGTTTCTTTGCTGGAAGACAATATCTTGCAGCTCGAGAACGACATTTTTGAAGAG
TTCCCAATACCTTCTGTCGAAGTGGTAATACTATCTCTAGAATCCTTGCCTGGATCGAACATAACAAAAGTCGTGTTTGGCCTCGATCCAGATGCAGATGATTCAGAAAT
CCCGTCAACTTCCCTCAGTTTAATCAGGTCGACCTTTGCAACTCTAGTCACAAATCAGTCGTTCCTCCGCATAACTAAATCCTTGTTCGGGGAGGCCTTTTCGTTTGAAG
TACTGAAATTCTCCGGAGGAATAACGATAATCCCACCGCAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACCTTGAACTTCTCTATTCATCAAATT
CAAGTACATTTCAGTGAACTGACAAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATGTTAAACTGTGGAATGCGGAAGGTTCGACCGTGACTGG
CCCTACAATTGTTCAGTCATCTGTTCTTCTGGAAGTTGGAAATACTCCATCGATGCGGAGGCTGAAGCAGCTAGCTCAGACGATCACAGGTTCTAATTCGAGCAACCTCG
GCCTGAATAATACTGAGTTTGGAAAAGTGAAGCAAGTTCGCCTTTCATCGATTCTTAAACACTCCCTCAATGGCGGTGACGGGAACGGTCCCGTCAGGTCCCCTTCTCCT
GCTCCTACACCCCAGCCCCATAACTACCATCACCCCCCAACTCACCACCATCACCATCATCACGCGCCTTTAACACCTGCAATTTCACCTGCTCCTGCAACTGAGAAGGG
TGCCTCGGAATATGGTTCCCCTGCCCCCGAAAGAAGCCCGACATCGCCTAAGAGAAGTTATGAGGCAAAGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGCTA
GGAAGGAGGGAAAGCAATCTCATTCAACCCCCCTCGCCTCACCTAACATATCTCCCGTTCATTCTGCTGCATCGCCGTCACCACAACATCAAGTAAACCCACCAGCAGCA
CCTGTCTCTCCAACTCGGGCATCAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCGTCGAAAAGCGACTCAAACCCCAAAAAATCCACAACGAATCCATC
ATTCGCTCCATCTCCATCTCCATCTCCAT
mRNA sequence
ATGGGAAAGAACGACGAAGAACAGCCGCTGCCGTCCGTCGTCGGCTCGCGGCCGTCCGGCCAGGTTGCCGAGGGTCGATGCTGTTGTGGGTGTGTTTGGATTCGGAGGCT
CATTGGGTTCAAATGCATCTTCATTCTGCTATTGTCCTTTGCCTTGTTCATTTCTGCTGTCTTTTGGCTGCCCCCTTTTCTCCATTACGCAGATCAAAAGGGTCTGGGTC
TTAATTCCTCGTATCGAGATCATGATATAGTAGCAACGTTCGATGTTGAGAGACCAGTTTCTTTGCTGGAAGACAATATCTTGCAGCTCGAGAACGACATTTTTGAAGAG
TTCCCAATACCTTCTGTCGAAGTGGTAATACTATCTCTAGAATCCTTGCCTGGATCGAACATAACAAAAGTCGTGTTTGGCCTCGATCCAGATGCAGATGATTCAGAAAT
CCCGTCAACTTCCCTCAGTTTAATCAGGTCGACCTTTGCAACTCTAGTCACAAATCAGTCGTTCCTCCGCATAACTAAATCCTTGTTCGGGGAGGCCTTTTCGTTTGAAG
TACTGAAATTCTCCGGAGGAATAACGATAATCCCACCGCAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACCTTGAACTTCTCTATTCATCAAATT
CAAGTACATTTCAGTGAACTGACAAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATGTTAAACTGTGGAATGCGGAAGGTTCGACCGTGACTGG
CCCTACAATTGTTCAGTCATCTGTTCTTCTGGAAGTTGGAAATACTCCATCGATGCGGAGGCTGAAGCAGCTAGCTCAGACGATCACAGGTTCTAATTCGAGCAACCTCG
GCCTGAATAATACTGAGTTTGGAAAAGTGAAGCAAGTTCGCCTTTCATCGATTCTTAAACACTCCCTCAATGGCGGTGACGGGAACGGTCCCGTCAGGTCCCCTTCTCCT
GCTCCTACACCCCAGCCCCATAACTACCATCACCCCCCAACTCACCACCATCACCATCATCACGCGCCTTTAACACCTGCAATTTCACCTGCTCCTGCAACTGAGAAGGG
TGCCTCGGAATATGGTTCCCCTGCCCCCGAAAGAAGCCCGACATCGCCTAAGAGAAGTTATGAGGCAAAGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGCTA
GGAAGGAGGGAAAGCAATCTCATTCAACCCCCCTCGCCTCACCTAACATATCTCCCGTTCATTCTGCTGCATCGCCGTCACCACAACATCAAGTAAACCCACCAGCAGCA
CCTGTCTCTCCAACTCGGGCATCAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCGTCGAAAAGCGACTCAAACCCCAAAAAATCCACAACGAATCCATC
ATTCGCTCCATCTCCATCTCCATCTCCAT
Protein sequence
MGKNDEEQPLPSVVGSRPSGQVAEGRCCCGCVWIRRLIGFKCIFILLLSFALFISAVFWLPPFLHYADQKGLGLNSSYRDHDIVATFDVERPVSLLEDNILQLENDIFEE
FPIPSVEVVILSLESLPGSNITKVVFGLDPDADDSEIPSTSLSLIRSTFATLVTNQSFLRITKSLFGEAFSFEVLKFSGGITIIPPQSAFLLQKVQILFNFTLNFSIHQI
QVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTGPTIVQSSVLLEVGNTPSMRRLKQLAQTITGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGGDGNGPVRSPSP
APTPQPHNYHHPPTHHHHHHHAPLTPAISPAPATEKGASEYGSPAPERSPTSPKRSYEAKPPGCQYRYKRKSARKEGKQSHSTPLASPNISPVHSAASPSPQHQVNPPAA
PVSPTRASTPLPNVIYAHVQPPSKSDSNPKKSTTNPSFAPSPSPSPX
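
The protein sequence above can be related back to the CDS by codon-wise translation. The sketch below assumes the standard genetic code and writes an incomplete final codon as `X`, which is one plausible reading of the trailing `X` in the protein; the record does not state how it was produced. The function `translate` is an illustrative helper, not the database's own pipeline.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1 (standard code assumed).

    Unknown codons and an incomplete trailing codon are written as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGGGAAAGAACGACGAAGAACAGCCG"))  # MGKNDEEQP
```

The first nine residues produced this way match the start of the listed protein (MGKNDEEQP). To check the whole record, the CDS lines would be joined into a single string before calling `translate`.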