; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016247 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016247
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00007935:280666..283664
RNA-Seq ExpressionSgr016247
SyntenySgr016247
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036463.1 hypothetical protein SDJN02_00080 [Cucurbita argyrosperma subsp. argyrosperma]6.8e-21474.91Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M  RGK G+K LRDVSN KYGRTSSKSV TAK+KE   +S+VEE+DDALDRLLLVQSD SA  +QIDE  VKAFELKEM KQGRK+IESFTH+LS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPR QKVLS PS G D DI Q L+SE+N +VNDTEN+VIDSPD AEV   LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YSKS L
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITH
        +G+TS I++GAQPCFIAC D NEN LEG  +EP  GKPSG DL+ +GENLLEGNGI PS  KPSGS+LTK+GENLLEGNGI  SG EPSG D+ QV ITH
Subjt:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITH

Query:  QRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPD
        QRGFASP MLSKKNCS+LVMTPCLKMSPPKSCVLLEPISE SH+D+K +YKATPFPVGV D SSG DASDGLALKYPELLGIQQAHK   R KEVEASPD
Subjt:  QRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPD

Query:  WFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLKK
        WFMSP KTCVLLEPSD HSV++AAC                            CH+ KK    + PVGVSLPHIDSTPM KE ESV  VGKRAGEETLKK
Subjt:  WFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLKK

Query:  ELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        ELW+KFEAASANP+R +QALQKTSKKGFLD+LDE
Subjt:  ELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

XP_022153546.1 uncharacterized protein LOC111021026 isoform X1 [Momordica charantia]7.0e-25184.36Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M   GK GRK LRDVSNHK GR SSKSVTTA +KESDNRS+VEE+DDALDRLLLVQSD SAL HQIDE VVKAFELKEM KQGRKEIESFTHVLS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPRFQK  SHP+IGS+ DIGQSLA E+NALVNDTE+NVIDSPDHAEV QALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YS+SV 
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI
        NG+TSGI K  GAQPCFI+CGD NENLLEGNGIEP G KPSGSDL+KVGENLL+GNGIEPSG KPSGSD TKVGENLLEGNGI P GGE SG +LTQVGI
Subjt:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI

Query:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA
        T Q GF SP MLSK NCSVLVMTPC KMSPPKSCVLLEPISE SH+DQKRLYKATPFPVGVHD SSSGSDASDGLALKYPELLGIQQ HKSGIRKKEVEA
Subjt:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA

Query:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET
        SPDWFMSP KTCVLLEPSDSHSVENAACDID P+ S VLNSQLK  V  G +DVD CH+TK   SHQDPVGVSL H+DSTPMWK  ESV+  GKRAGEET
Subjt:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET

Query:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        LK+ELW+KFEAASANPFR NQ L+ TSKKGFLDLLDE
Subjt:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

XP_022153547.1 uncharacterized protein LOC111021026 isoform X2 [Momordica charantia]5.5e-24883.99Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M   GK GRK LRDVSNHK GR SSKSVTTA +KESDNRS+VEE+DDALDRLLLVQSD SAL HQIDE VVKAFELKEM KQGRKEIESFTHVLS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPRFQK  SHP+IGS+ DIGQSLA E+NALVNDTE+NVIDSPDHAEV QALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YS+SV 
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI
        NG+TSGI K  GAQPCFI+CGD NENLLEGNGIEP G KPSGSDL+KVGENLL+GNGIEPSG KPSGSD TKVGENLLEGNGI P GGE SG +LTQVGI
Subjt:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI

Query:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA
        T Q GF SP MLSK NCSVLVMTPC KMSPPKSCVLLEPISE SH+DQKRLYKATPFPVGVHD SSSGSDASDGLALKYPELLGIQQ HKSGIRKKEVEA
Subjt:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA

Query:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET
        SPDWFMSP KTCVLLEPSDSHSVENAACDID P+ S VLNSQLK  V  G +DVD CH+TK   SHQ  VGVSL H+DSTPMWK  ESV+  GKRAGEET
Subjt:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET

Query:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        LK+ELW+KFEAASANPFR NQ L+ TSKKGFLDLLDE
Subjt:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

XP_038882431.1 uncharacterized protein LOC120073701 isoform X1 [Benincasa hispida]9.8e-22176.16Show/hide
Query:  LPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVL
        LPAA  M  RGK G+  L DVSNHKY RTSSKSVT A +KE+  +S+VEE++++LDRLLLVQSD S L HQIDE VVKAFELKEM KQG++EIESFTHVL
Subjt:  LPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVL

Query:  SEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVE
        S+MLSSLKPWVPR QKV S PS  SDD I QSLASE+N LVND ENNVIDSPDHAE  Q LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK+V 
Subjt:  SEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVE

Query:  YSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPS-GCDL
        +SKSVLNG+TSGILK AQPCFIACGD NE+ LEG+GIEP  GKPSGSDL+ + +NLLE NGIE    KPSGSDLTK+G+NL+EGNG+EPSG E S G DL
Subjt:  YSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPS-GCDL

Query:  TQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKK
        TQ GITHQRGFASP +LSKKNCS+LVMTPC KMSPPKSCVLLEPISE SH+D+KR YKATPFPVGVHD SSGSDASDGLALKYPELLGIQQAHKSGIRKK
Subjt:  TQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKK

Query:  EVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRA
         VEASPDW+MSP KTCVLLEPSDSHSVE A C                          D CH+  K  SHQDPVGVSLPHID+TPM KE ESV  VGKRA
Subjt:  EVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRA

Query:  GEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        GEETLKKELW+KFEAASAN FR  QA+QKTSKKGFLDLLDE
Subjt:  GEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

XP_038882435.1 uncharacterized protein LOC120073701 isoform X2 [Benincasa hispida]1.0e-21775.79Show/hide
Query:  LPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVL
        LPAA  M  RGK G+  L DVSNHKY RTSSKSVT A +KE+  +S+VEE++++LDRLLLVQSD S L HQIDE VVKAFELKEM KQG++EIESFTHVL
Subjt:  LPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVL

Query:  SEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVE
        S+MLSSLKPWVPR QKV S PS  SDD I QSLASE+N LVND ENNVIDSPDHAE  Q LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK+V 
Subjt:  SEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVE

Query:  YSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPS-GCDL
        +SKSVLNG+TSGILK AQPCFIACGD NE+ LEG+GIEP  GKPSGSDL+ + +NLLE NGIE    KPSGSDLTK+G+NL+EGNG+EPSG E S G DL
Subjt:  YSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPS-GCDL

Query:  TQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKK
        TQ GITHQRGFASP +LSKKNCS+LVMTPC KMSPPKSCVLLEPISE SH+D+KR YKATPFPVGVHD SSGSDASDGLALKYPELLGIQQAHKSGIRKK
Subjt:  TQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKK

Query:  EVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRA
         VEASPDW+MSP KTCVLLEPSDSHSVE A C                          D CH+  K  SHQ  VGVSLPHID+TPM KE ESV  VGKRA
Subjt:  EVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRA

Query:  GEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        GEETLKKELW+KFEAASAN FR  QA+QKTSKKGFLDLLDE
Subjt:  GEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

TrEMBL top hitse value%identityAlignment
A0A6J1DJ80 uncharacterized protein LOC111021026 isoform X22.7e-24883.99Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M   GK GRK LRDVSNHK GR SSKSVTTA +KESDNRS+VEE+DDALDRLLLVQSD SAL HQIDE VVKAFELKEM KQGRKEIESFTHVLS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPRFQK  SHP+IGS+ DIGQSLA E+NALVNDTE+NVIDSPDHAEV QALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YS+SV 
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI
        NG+TSGI K  GAQPCFI+CGD NENLLEGNGIEP G KPSGSDL+KVGENLL+GNGIEPSG KPSGSD TKVGENLLEGNGI P GGE SG +LTQVGI
Subjt:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI

Query:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA
        T Q GF SP MLSK NCSVLVMTPC KMSPPKSCVLLEPISE SH+DQKRLYKATPFPVGVHD SSSGSDASDGLALKYPELLGIQQ HKSGIRKKEVEA
Subjt:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA

Query:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET
        SPDWFMSP KTCVLLEPSDSHSVENAACDID P+ S VLNSQLK  V  G +DVD CH+TK   SHQ  VGVSL H+DSTPMWK  ESV+  GKRAGEET
Subjt:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET

Query:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        LK+ELW+KFEAASANPFR NQ L+ TSKKGFLDLLDE
Subjt:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

A0A6J1DKY8 uncharacterized protein LOC111021026 isoform X13.4e-25184.36Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M   GK GRK LRDVSNHK GR SSKSVTTA +KESDNRS+VEE+DDALDRLLLVQSD SAL HQIDE VVKAFELKEM KQGRKEIESFTHVLS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPRFQK  SHP+IGS+ DIGQSLA E+NALVNDTE+NVIDSPDHAEV QALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YS+SV 
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI
        NG+TSGI K  GAQPCFI+CGD NENLLEGNGIEP G KPSGSDL+KVGENLL+GNGIEPSG KPSGSD TKVGENLLEGNGI P GGE SG +LTQVGI
Subjt:  NGVTSGILK--GAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGI

Query:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA
        T Q GF SP MLSK NCSVLVMTPC KMSPPKSCVLLEPISE SH+DQKRLYKATPFPVGVHD SSSGSDASDGLALKYPELLGIQQ HKSGIRKKEVEA
Subjt:  THQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHD-SSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEA

Query:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET
        SPDWFMSP KTCVLLEPSDSHSVENAACDID P+ S VLNSQLK  V  G +DVD CH+TK   SHQDPVGVSL H+DSTPMWK  ESV+  GKRAGEET
Subjt:  SPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEET

Query:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        LK+ELW+KFEAASANPFR NQ L+ TSKKGFLDLLDE
Subjt:  LKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

A0A6J1G9T2 uncharacterized protein LOC1114522431.2e-21174.39Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M  RGK G+K LRDVSN KYGRTSSKSV TAK+KE D +S+VEE+DDALDRLLLVQSD SA  +QIDE  VKAFELKEM KQGRK+IESFTH+LS++LSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPR QKVLS PS G D DI Q L+SE+N +VNDTEN+VIDSPD AEV   LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+YSKS L
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCD-LTQVGIT
        +G+TS I++GAQPCFIAC D NEN LEG    P  GKPSG DL+ +GENLLEGNGI PS  KPSGS+LTK+GENLLEGNGI  SG EPSG D + QV IT
Subjt:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCD-LTQVGIT

Query:  HQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASP
        HQRGFASP MLSKKNCS+LVMTPCLKMSPPKSCVLLEPISE SH+D+K +YKATPFPVGV D SSG DASDGLALKYPELLGIQQAHK   R KEVEASP
Subjt:  HQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASP

Query:  DWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLK
        DWFMSP KTCVLLEPSD HSV++AAC                            CH+ KK    + PVGVSLPHID+TPM KE ESV  VGKRAGEETLK
Subjt:  DWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLK

Query:  KELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        KELW+KFEAASANP+R +QALQKTSKKGFLD+LDE
Subjt:  KELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

A0A6J1GW51 uncharacterized protein LOC1114579652.5e-20973.11Show/hide
Query:  LGRLPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFT
        +G  PAA  M+ R K GRK LRD++NH YGRTSSKSV+TAK+KE DNRS+VEE+DDALDRLLLVQSD SAL  QIDE VVKAFELK+M +QGRKEIESFT
Subjt:  LGRLPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFT

Query:  HVLSEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK
        HVLS+MLSSLKPWVPRFQ   S PS  SDD I Q LASE+NALVN TE+NVIDSPD+A + Q LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK
Subjt:  HVLSEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK

Query:  HVEYSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGC
        H  YSKSVL G+TSG LKGAQPCF ACGD N                         ENLLEGNG+EPS  KP GSDLTK+G+NLLEGNG +PSG EPSG 
Subjt:  HVEYSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGC

Query:  DLTQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIR
        DLTQVG  HQRGFASP +LSKKNCS+LVMTPCLKMSPPKSCVLLEPISE S +D+KR+YKATPFPVGVHDSSSGSD SDGLALKYPELLGIQQAHKSGI+
Subjt:  DLTQVGITHQRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIR

Query:  KKEVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGK
        KK VEASPDWFMSP KTCVLLEPSDSHSVE+A CD  +                         ++ KK  +HQDPVGVSLP ID+TPM KE ESV  VGK
Subjt:  KKEVEASPDWFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGK

Query:  RAGEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        RAGEETLKKELW+KFEAASANPFR +Q+LQKTS KGFLDLLDE
Subjt:  RAGEETLKKELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

A0A6J1KCQ1 uncharacterized protein LOC1114926861.1e-21274.53Show/hide
Query:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS
        M  RGK G+K LRDVSN KYGRTSSKSV TAK+KE D  S+VEE+DD+LDRLLLVQSD SA  +QIDE VVKAFELKEM KQGRK+IESFTH+LS+MLSS
Subjt:  MTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSS

Query:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL
        LKPWVPR QKVLS PS G D DI Q L+SE+N +VNDTEN+VIDSP  AEV   LISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHV+Y KS L
Subjt:  LKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVL

Query:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITH
        +G+TS IL+GAQPCF+AC D NEN LEG  +EP  GKPSG DL+ +GENLLEGNGI PS  KPSGS+LTK+GENLLEG+GI  SG EPSG D+ QV ITH
Subjt:  NGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITH

Query:  QRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPD
        QRGFASP MLSKKNCS+LVMTPCLKMSPPKSCVLLEPISE SH+D+K +YKATPFPVGV D SSG DASDGLALKYPELLGIQQAHK  IR KEVEASPD
Subjt:  QRGFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPD

Query:  WFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLKK
        WFMSP KTCVLLEPSD HSV++AAC                            C + KK    + PVGVSLPHIDSTPM KE ESV  VGKRAGEETLKK
Subjt:  WFMSPLKTCVLLEPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLKK

Query:  ELWVKFEAASANPFRINQALQKTSKKGFLDLLDE
        ELW+KFEAASANP+R +QALQKTSKKGFLD+LDE
Subjt:  ELWVKFEAASANPFRINQALQKTSKKGFLDLLDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12540.1 unknown protein1.0e-5834.24Show/hide
Query:  EVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSSLK-----------------PWVPRFQKVLSHPSIGSDDDI
        E E  D  LD+L LV SD  +++ QIDE VV+A + K +SK G  E+ESF  VLS+MLSSLK                 PW PR Q+ +S   +  +D  
Subjt:  EVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQGRKEIESFTHVLSEMLSSLK-----------------PWVPRFQKVLSHPSIGSDDDI

Query:  GQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVLNGVTSGILKGAQPCFIACGDSNE
         QSL S N     + +   ++SP+  +  + L+SPSPLV WR   N ++GRQLFLLTPLP+ KS   KH   SK     +                D+  
Subjt:  GQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSVLNGVTSGILKGAQPCFIACGDSNE

Query:  NLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITHQRGFASPRMLSKKNCSVLVMTPC
        N       EP       SD      ++L G  ++ +G   S        ENL+E                        +  +SP +L +K  S L+MTPC
Subjt:  NLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITHQRGFASPRMLSKKNCSVLVMTPC

Query:  LKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPDWFMSPLKTCVLLEPSDSHSVENA
        LK+SPPKSC + +P+ E S   ++   K+T   +G    SSG + +D L  KYPELLGIQ  H    RK ++E+SP W+ SP KTCVL+EP         
Subjt:  LKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPDWFMSPLKTCVLLEPSDSHSVENA

Query:  ACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGK-RAGEETLKKELWVKFEAASANPFRINQALQK
                    +N +  +  + G  DV       K+++     G     ++STP++KE ES++   + +AGE TLKKELW +FE A+ +  R N     
Subjt:  ACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGK-RAGEETLKKELWVKFEAASANPFRINQALQK

Query:  T-----SKKGFLDLLDE
        T     +KK F+++L+E
Subjt:  T-----SKKGFLDLLDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAGCCTCCATTCTCGATTCTAGAGCATTGTGGAAGCACGCAAATGCCCGCCTTGGGGAGGCTACCGGCGGCGAACGGGATGACGACAAGGGGGAAACCAGGGAG
AAAGGCACTGAGGGACGTATCGAACCACAAATATGGTCGAACTTCCTCCAAATCTGTCACTACAGCCAAGAAAAAGGAAAGTGATAACAGGTCTGAGGTTGAAGAGCGAG
ATGATGCTCTCGATCGCCTCCTCCTCGTTCAGTCCGATTTCTCCGCGCTCATTCACCAGATTGATGAATTCGTTGTGAAAGCATTTGAACTGAAGGAAATGAGCAAACAA
GGGAGGAAGGAGATCGAATCTTTCACTCATGTCTTATCTGAAATGCTATCTTCTTTGAAGCCCTGGGTTCCCAGGTTTCAGAAGGTGCTCTCTCATCCATCAATAGGTTC
TGATGATGATATAGGACAATCGTTGGCTAGTGAAAACAATGCTTTGGTTAATGATACGGAAAACAACGTTATTGACAGTCCAGACCATGCTGAAGTTCATCAAGCTCTGA
TCTCTCCTTCACCCCTTGTATCATGGCGTGCTGGATGCAATATTGAGAGAGGAAGACAATTGTTTTTACTCACACCTCTTCCTATTTCTAAATCACTCTCATCGAAACAT
GTGGAATATTCTAAATCTGTACTTAATGGAGTGACTTCTGGCATACTCAAGGGTGCACAGCCATGTTTCATCGCATGTGGAGATTCAAACGAGAATCTGCTTGAAGGTAA
TGGAATTGAGCCTAGAGGTGGCAAGCCCTCTGGGTCTGACTTATCAAAAGTGGGGGAGAATCTGCTTGAAGGTAATGGAATTGAGCCTAGTGGTTGTAAGCCCTCTGGGT
CTGATTTAACAAAAGTGGGGGAGAATCTGCTTGAAGGTAATGGGATTGAGCCTAGTGGTGGTGAGCCTTCTGGGTGTGATTTAACACAAGTGGGGATAACTCATCAGCGT
GGATTTGCTTCCCCACGAATGTTATCAAAGAAAAATTGCTCCGTGTTAGTTATGACTCCATGCTTAAAAATGTCGCCTCCAAAATCTTGTGTACTGCTGGAACCCATTTC
AGAGTTATCACATCAAGACCAAAAAAGGCTTTACAAGGCCACACCTTTTCCCGTTGGAGTTCATGATAGCTCTTCTGGAAGTGACGCTTCTGATGGACTGGCTTTAAAGT
ACCCAGAACTCTTAGGAATTCAACAGGCTCATAAATCAGGAATTAGAAAGAAGGAGGTTGAAGCCTCACCGGACTGGTTTATGTCACCTCTAAAAACATGTGTTTTACTG
GAGCCGTCTGATTCTCATTCAGTGGAAAATGCTGCTTGTGATATCGACTTTCCTATGAATTCTTGGGTCCTGAATTCGCAGTTGAAATTGTCTGTATCAAAAGGAATTAA
TGATGTTGATAGATGTCACCAGACCAAGAAATATTCCAGCCATCAAGATCCAGTTGGGGTCAGTTTGCCGCATATAGATAGCACTCCCATGTGGAAGGAACGCGAAAGCG
TAATTGGGGTTGGCAAACGTGCTGGTGAGGAGACTCTTAAAAAAGAACTGTGGGTGAAATTTGAAGCAGCATCAGCCAATCCATTTCGCATTAATCAAGCTCTTCAAAAG
ACATCAAAGAAAGGTTTTCTGGACTTGCTGGATGAGGGCATTGCAGCAGATGTTGTGCTTATGCATTCCTCGGCCAGGAGAAAAAATTCTAATGCAAAAGCCTGTAATGA
TGCTGGAAGCAAATTTGCTCTAGAAAATGCCTGTGTAGTAGGCTACATTTTGTTCATAAGCATGAGGAAAAGCCGGATTACAATGGTTATACAAGTCATTCCGGTTCTTC
TCTTCACAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAGCCTCCATTCTCGATTCTAGAGCATTGTGGAAGCACGCAAATGCCCGCCTTGGGGAGGCTACCGGCGGCGAACGGGATGACGACAAGGGGGAAACCAGGGAG
AAAGGCACTGAGGGACGTATCGAACCACAAATATGGTCGAACTTCCTCCAAATCTGTCACTACAGCCAAGAAAAAGGAAAGTGATAACAGGTCTGAGGTTGAAGAGCGAG
ATGATGCTCTCGATCGCCTCCTCCTCGTTCAGTCCGATTTCTCCGCGCTCATTCACCAGATTGATGAATTCGTTGTGAAAGCATTTGAACTGAAGGAAATGAGCAAACAA
GGGAGGAAGGAGATCGAATCTTTCACTCATGTCTTATCTGAAATGCTATCTTCTTTGAAGCCCTGGGTTCCCAGGTTTCAGAAGGTGCTCTCTCATCCATCAATAGGTTC
TGATGATGATATAGGACAATCGTTGGCTAGTGAAAACAATGCTTTGGTTAATGATACGGAAAACAACGTTATTGACAGTCCAGACCATGCTGAAGTTCATCAAGCTCTGA
TCTCTCCTTCACCCCTTGTATCATGGCGTGCTGGATGCAATATTGAGAGAGGAAGACAATTGTTTTTACTCACACCTCTTCCTATTTCTAAATCACTCTCATCGAAACAT
GTGGAATATTCTAAATCTGTACTTAATGGAGTGACTTCTGGCATACTCAAGGGTGCACAGCCATGTTTCATCGCATGTGGAGATTCAAACGAGAATCTGCTTGAAGGTAA
TGGAATTGAGCCTAGAGGTGGCAAGCCCTCTGGGTCTGACTTATCAAAAGTGGGGGAGAATCTGCTTGAAGGTAATGGAATTGAGCCTAGTGGTTGTAAGCCCTCTGGGT
CTGATTTAACAAAAGTGGGGGAGAATCTGCTTGAAGGTAATGGGATTGAGCCTAGTGGTGGTGAGCCTTCTGGGTGTGATTTAACACAAGTGGGGATAACTCATCAGCGT
GGATTTGCTTCCCCACGAATGTTATCAAAGAAAAATTGCTCCGTGTTAGTTATGACTCCATGCTTAAAAATGTCGCCTCCAAAATCTTGTGTACTGCTGGAACCCATTTC
AGAGTTATCACATCAAGACCAAAAAAGGCTTTACAAGGCCACACCTTTTCCCGTTGGAGTTCATGATAGCTCTTCTGGAAGTGACGCTTCTGATGGACTGGCTTTAAAGT
ACCCAGAACTCTTAGGAATTCAACAGGCTCATAAATCAGGAATTAGAAAGAAGGAGGTTGAAGCCTCACCGGACTGGTTTATGTCACCTCTAAAAACATGTGTTTTACTG
GAGCCGTCTGATTCTCATTCAGTGGAAAATGCTGCTTGTGATATCGACTTTCCTATGAATTCTTGGGTCCTGAATTCGCAGTTGAAATTGTCTGTATCAAAAGGAATTAA
TGATGTTGATAGATGTCACCAGACCAAGAAATATTCCAGCCATCAAGATCCAGTTGGGGTCAGTTTGCCGCATATAGATAGCACTCCCATGTGGAAGGAACGCGAAAGCG
TAATTGGGGTTGGCAAACGTGCTGGTGAGGAGACTCTTAAAAAAGAACTGTGGGTGAAATTTGAAGCAGCATCAGCCAATCCATTTCGCATTAATCAAGCTCTTCAAAAG
ACATCAAAGAAAGGTTTTCTGGACTTGCTGGATGAGGGCATTGCAGCAGATGTTGTGCTTATGCATTCCTCGGCCAGGAGAAAAAATTCTAATGCAAAAGCCTGTAATGA
TGCTGGAAGCAAATTTGCTCTAGAAAATGCCTGTGTAGTAGGCTACATTTTGTTCATAAGCATGAGGAAAAGCCGGATTACAATGGTTATACAAGTCATTCCGGTTCTTC
TCTTCACAAATTAG
Protein sequenceShow/hide protein sequence
MEQPPFSILEHCGSTQMPALGRLPAANGMTTRGKPGRKALRDVSNHKYGRTSSKSVTTAKKKESDNRSEVEERDDALDRLLLVQSDFSALIHQIDEFVVKAFELKEMSKQ
GRKEIESFTHVLSEMLSSLKPWVPRFQKVLSHPSIGSDDDIGQSLASENNALVNDTENNVIDSPDHAEVHQALISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKH
VEYSKSVLNGVTSGILKGAQPCFIACGDSNENLLEGNGIEPRGGKPSGSDLSKVGENLLEGNGIEPSGCKPSGSDLTKVGENLLEGNGIEPSGGEPSGCDLTQVGITHQR
GFASPRMLSKKNCSVLVMTPCLKMSPPKSCVLLEPISELSHQDQKRLYKATPFPVGVHDSSSGSDASDGLALKYPELLGIQQAHKSGIRKKEVEASPDWFMSPLKTCVLL
EPSDSHSVENAACDIDFPMNSWVLNSQLKLSVSKGINDVDRCHQTKKYSSHQDPVGVSLPHIDSTPMWKERESVIGVGKRAGEETLKKELWVKFEAASANPFRINQALQK
TSKKGFLDLLDEGIAADVVLMHSSARRKNSNAKACNDAGSKFALENACVVGYILFISMRKSRITMVIQVIPVLLFTN