; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G014150 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G014150
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUPF0503 protein At3g09070, chloroplastic
Genome locationchr10:18249670..18251394
RNA-Seq ExpressionLsi10G014150
SyntenyLsi10G014150
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]2.8e-28388.17Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N H +SELRRSKSFSAAK EAGIG+ EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSG++AEFKT+KEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
         KNL N+++VGA K E IKPR LE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG
        EKD P DEAEGSPMNVGDKIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKATRDG
Subjt:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG

Query:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAA +DRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGEAN  VSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        PSKLAGFN G NDSKL   R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

XP_004143144.1 protein OCTOPUS [Cucumis sativus]2.9e-30493.06Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQ ESPV+NNH S+ELRRSKS SAAKCEA GIGQSEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSG+EAEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        KRKNLSNNS VGAVKAEDIKPR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD
        FEKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K T D
Subjt:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD

Query:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA V+DRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        SKLAGFNG NDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKHV+
Subjt:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

XP_008464059.1 PREDICTED: UPF0503 protein At3g09070, chloroplastic [Cucumis melo]1.0e-30492.87Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNH SS ELRRSKS+SAAKCEAGIGQSE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+G++AEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        KRKNL NNSNVGAVK EDIKPR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD
        FEKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KAT D
Subjt:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD

Query:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA V+DRK+FKKVHRWRKVLSVLGMIQKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        SKLAGFNG NDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKHV+
Subjt:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]4.8e-28388Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N H +SELRRSKSFSAAK EAGIG+ EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSG++AEFKT+KEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
         KNL N+++VGA K E IKPR LE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKATRDG
Subjt:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG

Query:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAA +DRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGEAN  VSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        PSKLAGFN G NDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

XP_038901013.1 protein OCTOPUS [Benincasa hispida]0.0e+0095.99Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQLKT+SHRLSTCHRHPSKPVTGFCASCLRERLAGID DTQQESPV NNH SSELRRSKS+SAAK EAGI QSEVQHRKSCDVRSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT REVEIESENLG ELREVVANER FRASEGIIGPALGTIDDF+G+EAEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        RKNLSNN NVG VKAEDIKPR LEIRETRSEVG+YGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
Subjt:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG
        EKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS SKATR+G
Subjt:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG

Query:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPS
        DLSGTDVTSKDSVPDAAV+DRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPS
Subjt:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPS

Query:  KLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        KLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPR+SPFNVKHVM
Subjt:  KLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

TrEMBL top hitse value%identityAlignment
A0A0A0KBP2 Uncharacterized protein1.4e-30493.06Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQ ESPV+NNH S+ELRRSKS SAAKCEA GIGQSEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSG+EAEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        KRKNLSNNS VGAVKAEDIKPR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD
        FEKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K T D
Subjt:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD

Query:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA V+DRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        SKLAGFNG NDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKHV+
Subjt:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic4.9e-30592.87Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNH SS ELRRSKS+SAAKCEAGIGQSE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+G++AEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        KRKNL NNSNVGAVK EDIKPR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD
        FEKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KAT D
Subjt:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD

Query:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA V+DRK+FKKVHRWRKVLSVLGMIQKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        SKLAGFNG NDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKHV+
Subjt:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

A0A5A7V3J1 UPF0503 protein4.9e-30592.87Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNH SS ELRRSKS+SAAKCEAGIGQSE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSS-ELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+G++AEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        KRKNL NNSNVGAVK EDIKPR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD
        FEKD PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KAT D
Subjt:  FEKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRD

Query:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA V+DRK+FKKVHRWRKVLSVLGMIQKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        SKLAGFNG NDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKHV+
Subjt:  SKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

A0A6J1FY88 UPF0503 protein At3g09070, chloroplastic-like4.4e-28287.63Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ KTVSHRLS+CHRHP+KPVTGFCASCLRERLAGID DTQQESP++ +  S ELRRSKSFSAAK +A IG+ +VQHR+SCDVRSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRC N++VEIESENLGFEL EVVANERQFRAS G IGPALGTIDDF+G+EAEFKTVKEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        RKNL NN NV AV  E IK R LEI ETRSEVG+YGLGRRSCDTDPRFS D GRMSLDDSRYSFDE RASWDGYLIGKTYP+ITPMVSVLEEAKF GT F
Subjt:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG
        EKD PSDEAEGSP NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGAS DFDDLKLISNAKVSPATTELFYGAKVL+TEKDLN SRSKATRDG
Subjt:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG

Query:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPS
        +LSGTDVTSKDSV DAA +DRKTFKKVHRWRKVLSVLGM+ KRSGESKSDDEES       DRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRD S
Subjt:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPS

Query:  KLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM
        K+AGFNG ND KLN SRWRDDFTL+RNRSVRYSPNNFDNGLLRFYLTPLRS+ SRGKPGKSRPRSS FNVKHV+
Subjt:  KLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHVM

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like2.3e-28388Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N H +SELRRSKSFSAAK EAGIG+ EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSG++AEFKT+KEFID+EFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
         KNL N+++VGA K E IKPR LE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKATRDG
Subjt:  EKDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDG

Query:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAA +DRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGEAN  VSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEANSCVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        PSKLAGFN G NDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

SwissProt top hitse value%identityAlignment
Q9LFB9 Protein OCTOPUS-like1.8e-5433.72Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP------------------VRNNH--------FSSELRRSKSFSAAKCEAGIGQSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                    N++        F  ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP------------------VRNNH--------FSSELRRSKSFSAAKCEAGIGQSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   +E E K +K+++D+  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK K   N    G       +P+            E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +   +PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEES-SVGG
        D+K +SN+  +             I    +  + +K  ++GD                       KK  RW K  S+LG I ++  + + +D  S S   
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEES-SVGG

Query:  NVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRGK
         +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G 
Subjt:  NVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRGK

Query:  PG
         G
Subjt:  PG

Q9SS80 Protein OCTOPUS1.2e-6133.83Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQES--------------------PVRNN---------------HFSSELRRSKSFSAAKCEAGIG
        HRLST C+RHP +  TGFC SCL ERL+ +D      S                    P  NN                F  ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQES--------------------PVRNN---------------HFSSELRRSKSFSAAKCEAGIG

Query:  QSEVQHRKSCDVRSGNSLSDLFCRED---------------KPRCT---------NREVEIESENLGFELRE----VVANERQFRASEG-----------
              R+SCDVR  +SL +LF +++               +PR +         N E E ES++   E  E    V A + +     G           
Subjt:  QSEVQHRKSCDVRSGNSLSDLFCRED---------------KPRCT---------NREVEIESENLGFELRE----VVANERQFRASEG-----------

Query:  --------IIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN--VGAVKAEDIKPRELE
                 + P  G        E E K +K++ID++ + KK +      +  S W AASVFSKKL KWR+ QK K   N  +   G+ +    KP   +
Subjt:  --------IIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN--VGAVKAEDIKPRELE

Query:  IRETRSEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----------
        +R+T+SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A          
Subjt:  IRETRSEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----------

Query:  KFSGTGFEKDAPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEK
               E+ AP          V D   IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +
Subjt:  KFSGTGFEKDAPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEK

Query:  DLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEAN
        D NN   +   +G      +     + D  V      KK  RW K  S+LG+I ++S     ++EE        + G +V+R ++ESW +LR    G   
Subjt:  DLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEAN

Query:  SCVSQKLIRSYS-VSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
             +++RS S VS R         G +  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  SCVSQKLIRSYS-VSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

Arabidopsis top hitse value%identityAlignment
AT2G38070.1 Protein of unknown function (DUF740)1.6e-6635.27Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNN----------------------HFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDV
        HR ST C RHP +  TGFC SCL +RL+ +D   +  + V ++                       F  ELRR+KSFSA+K EA  +G  E Q R+SCDV
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNN----------------------HFSSELRRSKSFSAAKCEA-GIGQSEVQHRKSCDV

Query:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGDEAEFKTVK-EFIDIEFRRK--
        R  N+L  LF  + +     +E       EI+ E +   ++  V  E     SE                  ID+   +E E +T K E   +EF  +  
Subjt:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGDEAEFKTVK-EFIDIEFRRK--

Query:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK-RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------D
         K   RD +EIAGS W AASVFSKKL KWR+KQK +K+ + N   G+      K    ++R+T+SE+ EYG GRRSCDTDPRFS+DAGR SL       D
Subjt:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK-RKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------D

Query:  DSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDA--PSDEA-EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRK-
        D RYSF+EPRASWDGYLIG+     R+  M+SV+E++         D   P +++ + S   + + +PGGSAQT++YY+DS SS RRRKS DRSSS RK 
Subjt:  DSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDA--PSDEA-EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRK-

Query:  --GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLS---GTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGES
              + D+LKL  + +           AK L+       S S + RD   S     ++  +++V       ++T K    W    ++ G++ +++G +
Subjt:  --GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLS---GTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGES

Query:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRS-YSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL
        K ++EE   G   VDR  + SW       N E  +    K+IRS  SVS R      G                   LQRN      S +   +  +NG+
Subjt:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRS-YSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL

Query:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHVM
        L+FYLTP +     S +S     +  P S PF  ++VM
Subjt:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHVM

AT3G09070.1 Protein of unknown function (DUF740)8.2e-6333.83Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQES--------------------PVRNN---------------HFSSELRRSKSFSAAKCEAGIG
        HRLST C+RHP +  TGFC SCL ERL+ +D      S                    P  NN                F  ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQES--------------------PVRNN---------------HFSSELRRSKSFSAAKCEAGIG

Query:  QSEVQHRKSCDVRSGNSLSDLFCRED---------------KPRCT---------NREVEIESENLGFELRE----VVANERQFRASEG-----------
              R+SCDVR  +SL +LF +++               +PR +         N E E ES++   E  E    V A + +     G           
Subjt:  QSEVQHRKSCDVRSGNSLSDLFCRED---------------KPRCT---------NREVEIESENLGFELRE----VVANERQFRASEG-----------

Query:  --------IIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN--VGAVKAEDIKPRELE
                 + P  G        E E K +K++ID++ + KK +      +  S W AASVFSKKL KWR+ QK K   N  +   G+ +    KP   +
Subjt:  --------IIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN--VGAVKAEDIKPRELE

Query:  IRETRSEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----------
        +R+T+SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A          
Subjt:  IRETRSEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----------

Query:  KFSGTGFEKDAPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEK
               E+ AP          V D   IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +
Subjt:  KFSGTGFEKDAPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEK

Query:  DLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEAN
        D NN   +   +G      +     + D  V      KK  RW K  S+LG+I ++S     ++EE        + G +V+R ++ESW +LR    G   
Subjt:  DLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEAN

Query:  SCVSQKLIRSYS-VSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
             +++RS S VS R         G +  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  SCVSQKLIRSYS-VSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

AT3G46990.1 Protein of unknown function (DUF740)1.6e-6635.52Show/hide
Query:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEI
        S+CHRHPS KP +GFCASCLRERL  I+  +   + V+    + ELRR +S+S     A +  S+   R+SCDVR S +SL DLF  +D+ R  +   + 
Subjt:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEI

Query:  ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNS
           +L  E  E    E  +   E I G   G          E KT+KEFID+++R   KKN G+DL+EI       ASV S++L  +   ++    S++ 
Subjt:  ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNS

Query:  NVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDAPSDE
          G V                        GR S D DPR S D GR+       SF++PR+SWDG LI K+Y ++T + +V E+AK +  G E++   ++
Subjt:  NVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDAPSDE

Query:  AEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVT
                 +K PGG+ QTK+YY DS    RRR+SFDRS S ++    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L +S   + ++      ++ 
Subjt:  AEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVT

Query:  SKDSVPDAAVVDRKTFKKVH------RWRKVLSVLGMIQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCR--
        SK  +  AA  + K    V       +W K  ++ G+IQ R  E+K++   ++   + GN V+  +AES  KLRRV  GE N  VS+KL++SYSVS R  
Subjt:  SKDSVPDAAVVDRKTFKKVH------RWRKVLSVLGMIQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCR--

Query:  ------DPSKLAGFNG---------------------SNDSKLN-VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
                + ++GF G                     S D  +N +   ++   LQRN +V   S  N +  + RFYL+P++S+ +  K GKSR
Subjt:  ------DPSKLAGFNG---------------------SNDSKLN-VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR

AT5G01170.1 Protein of unknown function (DUF740)1.3e-5533.72Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP------------------VRNNH--------FSSELRRSKSFSAAKCEAGIGQSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                    N++        F  ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP------------------VRNNH--------FSSELRRSKSFSAAKCEAGIGQSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   +E E K +K+++D+  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK K   N    G       +P+            E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSNVGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +   +PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDAPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEES-SVGG
        D+K +SN+  +             I    +  + +K  ++GD                       KK  RW K  S+LG I ++  + + +D  S S   
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEES-SVGG

Query:  NVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRGK
         +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G 
Subjt:  NVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRGK

Query:  PG
         G
Subjt:  PG

AT5G58930.1 Protein of unknown function (DUF740)9.1e-7035.58Show/hide
Query:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIE
        + CHRHP SKP TGFCA+CLRERL+ I+  +   S       S+ELRR +S+S     A +   +   R+SCDVRS +   D             + E+ 
Subjt:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIE

Query:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN
          ++ F +   +  + +    EG        + +   ++ E KT+KE ID+E R +  KN G+D            SVFS+ L K+  K  RK + ++ N
Subjt:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSN

Query:  VGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDAPSDEA
                                   LGRRSCD DPR S+DAGR+       SFDEPRASWDG LIGKTYP++ P+ SV E+ K S      +   ++ 
Subjt:  VGAVKAEDIKPRELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDAPSDEA

Query:  EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTS
        + +        PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L +S   + ++      ++ S
Subjt:  EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTS

Query:  K--DSVPDAAVVDRKTF---KKVHRWRKVLSVLGMIQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSC-----
        K    V    V  +  F   K    W K  +  G+IQ+++  +K++   ++   +GGN ++  +AES  KLRRVA GE N  VS+KLIRSYSVS      
Subjt:  K--DSVPDAAVVDRKTF---KKVHRWRKVLSVLGMIQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSC-----

Query:  ---RDPSKLAGFNGSNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
           R  S + GF G   S                           V   R+      ++   YSP+N  NG++RFYLTPL S+ +  K GKSR
Subjt:  ---RDPSKLAGFNGSNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTGCAGCTCAAAACTGTATCCCATCGGCTTTCAACTTGTCACCGTCATCCTAGCAAGCCGGTGACTGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCTGG
GATTGATCCCGACACGCAGCAGGAATCGCCTGTCCGGAACAACCATTTTTCATCGGAGCTTCGTCGGAGTAAATCCTTTTCCGCAGCGAAGTGTGAGGCTGGTATTGGAC
AATCGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGGTCTGGGAACTCCTTGTCGGACCTTTTCTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATC
GAATCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGCACGATCGATGATTTTTC
TGGAGATGAGGCTGAGTTCAAGACGGTGAAAGAGTTCATAGATATTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAG
CGGCTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAGCAAAAAAGGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGTGAAAGCAGAGGATATCAAGCCT
AGAGAGCTTGAAATCAGGGAAACTCGTTCGGAGGTTGGAGAATATGGATTGGGAAGAAGGTCTTGTGATACAGATCCAAGATTCTCTGTCGATGCAGGTAGAATGTCGTT
GGATGATTCACGGTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCGATGGTATCAGTTTTGGAGGAGGCCA
AATTTTCTGGTACTGGATTTGAGAAAGATGCTCCTTCCGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATCCCTGGTGGATCGGCTCAGACTAAAGATTAC
TACATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCAAGCTCACACAGAAAAGGGGCATCAGGAGACTTTGATGACTTGAAATTAATATCAAACGC
AAAGGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTTCTAATCACAGAGAAAGATTTGAACAACTCTCGCTCAAAAGCAACCAGAGATGGCGATTTGAGTG
GCACTGATGTTACTTCCAAAGATTCTGTTCCTGATGCAGCTGTGGTTGATCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATC
CAAAAGCGAAGTGGTGAAAGCAAATCTGATGATGAAGAAAGTAGTGTTGGAGGTAATGTCGTTGATCGGCCTATAGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGCAAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGTGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGATTTAATGGCAGTAATGATTCGAAACTGA
ACGTTTCGAGATGGAGAGACGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCGCCAAATAACTTTGATAATGGCTTATTAAGGTTTTATTTGACACCATTGAGG
AGCTACAGTAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGTCATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCTGCAGCTCAAAACTGTATCCCATCGGCTTTCAACTTGTCACCGTCATCCTAGCAAGCCGGTGACTGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCTGG
GATTGATCCCGACACGCAGCAGGAATCGCCTGTCCGGAACAACCATTTTTCATCGGAGCTTCGTCGGAGTAAATCCTTTTCCGCAGCGAAGTGTGAGGCTGGTATTGGAC
AATCGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGGTCTGGGAACTCCTTGTCGGACCTTTTCTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATC
GAATCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGCACGATCGATGATTTTTC
TGGAGATGAGGCTGAGTTCAAGACGGTGAAAGAGTTCATAGATATTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAG
CGGCTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAGCAAAAAAGGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGTGAAAGCAGAGGATATCAAGCCT
AGAGAGCTTGAAATCAGGGAAACTCGTTCGGAGGTTGGAGAATATGGATTGGGAAGAAGGTCTTGTGATACAGATCCAAGATTCTCTGTCGATGCAGGTAGAATGTCGTT
GGATGATTCACGGTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCGATGGTATCAGTTTTGGAGGAGGCCA
AATTTTCTGGTACTGGATTTGAGAAAGATGCTCCTTCCGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATCCCTGGTGGATCGGCTCAGACTAAAGATTAC
TACATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCAAGCTCACACAGAAAAGGGGCATCAGGAGACTTTGATGACTTGAAATTAATATCAAACGC
AAAGGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTTCTAATCACAGAGAAAGATTTGAACAACTCTCGCTCAAAAGCAACCAGAGATGGCGATTTGAGTG
GCACTGATGTTACTTCCAAAGATTCTGTTCCTGATGCAGCTGTGGTTGATCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATC
CAAAAGCGAAGTGGTGAAAGCAAATCTGATGATGAAGAAAGTAGTGTTGGAGGTAATGTCGTTGATCGGCCTATAGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGCAAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGTGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGATTTAATGGCAGTAATGATTCGAAACTGA
ACGTTTCGAGATGGAGAGACGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCGCCAAATAACTTTGATAATGGCTTATTAAGGTTTTATTTGACACCATTGAGG
AGCTACAGTAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGTCATGTAA
Protein sequenceShow/hide protein sequence
MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHFSSELRRSKSFSAAKCEAGIGQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEI
ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGDEAEFKTVKEFIDIEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLSNNSNVGAVKAEDIKP
RELEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDAPSDEAEGSPMNVGDKIPGGSAQTKDY
YMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATRDGDLSGTDVTSKDSVPDAAVVDRKTFKKVHRWRKVLSVLGMI
QKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
SYSSRGKPGKSRPRSSPFNVKHVM