; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23838 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23838
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUPF0503 protein At3g09070, chloroplastic-like
Genome locationCarg_Chr02:4268124..4270086
RNA-Seq ExpressionCarg23838
SyntenyCarg23838
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]2.5e-30399.81Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDP DEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

KAG7035377.1 UPF0503 protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVRKEQAKKLSFQCQTSHVKRCTTSSSSSSSIKACTVNIL
        PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVRKEQAKKLSFQCQTSHVKRCTTSSSSSSSIKACTVNIL
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVRKEQAKKLSFQCQTSHVKRCTTSSSSSSSIKACTVNIL

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]4.2e-30399.63Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRS R
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

XP_023007551.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima]1.8e-29898.5Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSG+SLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEE KLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDA GIDRKTFKKVHRWRKVLSVLGM QKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAG NGGGNDSKLYGLRRRDDFTLQRNRSVR
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

XP_023532200.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo]1.2e-29798.13Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKRE GIGRPEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLRE+AGSVW AASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEA+KPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEE KLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDS SKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKD VPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPF AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAGFN GGGNDSKLYGLRRRDD TLQRNRSVR
Subjt:  PSKLAGFN-GGGNDSKLYGLRRRDDFTLQRNRSVR

TrEMBL top hitse value%identityAlignment
A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic2.5e-25385.42Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTS-ELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCRED
        MNLQ K+VSHRLS+C RHPS+PVTGFCASCLRERLAGID D + ESP+ N HS+S ELRRSKS+SAAK EAGIG+ E+QHRKSCD RSGNSLSDLFCRED
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTS-ELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV   RQFRASEG+IGP L  IDDF+GEDAEFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIG
        K KNLGN+++VGA K E IKPR LE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G G
Subjt:  KMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLS++RRRKSFDRS SH+KGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT D
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD

Query:  GDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR
        GDLSGT++TSKDSVPDA  IDRK+FKKVHRWRKVLSVLGM+QKR+GE SKSDDEES V GNVVDRP   ESWEKLRRVANGEAN  VSQKLIRSYSVSCR
Subjt:  GDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR

Query:  DPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        DPSKLAGFN GGNDSKL   R RDDFTLQRNRSVR
Subjt:  DPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

A0A5A7V3J1 UPF0503 protein2.5e-25385.42Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTS-ELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCRED
        MNLQ K+VSHRLS+C RHPS+PVTGFCASCLRERLAGID D + ESP+ N HS+S ELRRSKS+SAAK EAGIG+ E+QHRKSCD RSGNSLSDLFCRED
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTS-ELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV   RQFRASEG+IGP L  IDDF+GEDAEFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIG
        K KNLGN+++VGA K E IKPR LE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G G
Subjt:  KMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLS++RRRKSFDRS SH+KGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT D
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD

Query:  GDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR
        GDLSGT++TSKDSVPDA  IDRK+FKKVHRWRKVLSVLGM+QKR+GE SKSDDEES V GNVVDRP   ESWEKLRRVANGEAN  VSQKLIRSYSVSCR
Subjt:  GDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR

Query:  DPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        DPSKLAGFN GGNDSKL   R RDDFTLQRNRSVR
Subjt:  DPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

A0A6J1D4T1 UPF0503 protein At3g09070, chloroplastic1.5e-25384.64Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNL  K V HRLS+C+RHPS+PVTGFCA CLRERLAGID DTRQE+P  NQHS+SELRRSKSFSAAKR+AGIG+PEVQHRKSCD RSGNSLSDLFCRED+
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLGFELREV A ERQFRASEG IGP LD IDDF+G +AEFKTMKEFIDLE RRKKN GRDLREIAGSVWEAASV SKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGN ++ G  KTE  KPR+LE RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR  PMVSVLEE K PG GF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        E  DP DEAEG  MNVGDKIPGGSAQTKDYYM+SLS++RRRKSFDRSSSH+KGASADFDDLK ISNAKVSPATTELFYGAKVLITEKDL DSHSK+TRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSG+ +TSKDSVPDAAG DRKTFKK +RW+KVL VLGM+QKRS   SKSDDEE CVG N VDRP  AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        P+KLAGFN GGND KL GLRRRDD TLQRNRSVR
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like2.0e-30399.63Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRS R
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

A0A6J1L0V5 UPF0503 protein At3g09070, chloroplastic-like8.8e-29998.5Show/hide
Query:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSG+SLSDLFCREDK
Subjt:  MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF
        MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEE KLPGIGF
Subjt:  MKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGTNITSKDSVPDA GIDRKTFKKVHRWRKVLSVLGM QKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR
        PSKLAG NGGGNDSKLYGLRRRDDFTLQRNRSVR
Subjt:  PSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVR

SwissProt top hitse value%identityAlignment
Q9LFB9 Protein OCTOPUS-like2.1e-4733.51Show/hide
Query:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS-------------------------ELRRSKSFSAAKREAGIGRPE
        HRLS SC  HP E  +GFC SCL +RL+ +D       S + ++ P ++  S                           ELRR+KSFSA   E   G  E
Subjt:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS-------------------------ELRRSKSFSAAKREAGIGRPE

Query:  VQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELREVVAKERQFRASE--GVIGPALDIIDDFSG----EDAEFKTMKEFIDLEFRRKKNA
         Q R+SCD R  +   +L   E      ++  E   E+   E+   V +E +    E  G   P  +I+++ S     E+ E K MK+++DL  + KK +
Subjt:  VQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELREVVAKERQFRASE--GVIGPALDIIDDFSG----EDAEFKTMKEFIDLEFRRKKNA

Query:  GRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYS
           +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + ++                E G+GRRS DTDPRFS+DA       GR+S+DDSRYS
Subjt:  GRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYS

Query:  FDEPRASWDGYLIGRTH----PRPTPMVSVLEETKLPGIGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADF
         DEPRASWDG+LIGRT     P P  M+SV+E   L     +    PS +      +    IPGGS QT+DYY    S+ RRRKS DRS+S +K    + 
Subjt:  FDEPRASWDGYLIGRTH----PRPTPMVSVLEETKLPGIGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADF

Query:  DDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVG
        +D+K +SN+  +             I    ++ + +K  ++GD                       KK  RW K  S+LG + ++  +  + D       
Subjt:  DDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVG

Query:  GNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLAGFNGGGN
          +V+R   +ESW ++R   NGE  G    K+ RS S VS R        +GGG+
Subjt:  GNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLAGFNGGGN

Q9SS80 Protein OCTOPUS6.3e-6032.76Show/hide
Query:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS----------------------------ELRRSKSFSAAKREAGIG
        HRLS SC RHP E  TGFC SCL ERL+ +D       S + ++ P ++  +                              ELRR+KSFSA+K   G  
Subjt:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS----------------------------ELRRSKSFSAAKREAGIG

Query:  RPEVQHRKSCDARSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGE--
              R+SCD R  +SL +LF ++++       T  E+++                  E+E+   EL E   ++        ++  + +++ + S E  
Subjt:  RPEVQHRKSCDARSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGE--

Query:  -----------------DAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GNDTDVGAAKTEAIKPRVLEVRETRS
                         + E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Subjt:  -----------------DAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GNDTDVGAAKTEAIKPRVLEVRETRS

Query:  EVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRT-------HPRPTPMVSVLEETKLP----------GIG
        E+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIGRT        P P  M+SV+E+   P             
Subjt:  EVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRT-------HPRPTPMVSVLEETKLP----------GIG

Query:  FEKDDPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGAS----ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH
         E+  P          V D   IPGGS QT+DYY DS S  RRRKS DRSSS  +  +    AD D+ KL  ++ +S    + + G+        L+D++
Subjt:  FEKDDPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGAS----ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH

Query:  SKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEES-----CVGGNVVDRPFAAESWEKLRRVANGEANGSVSQ
        + A    D +G+       + D         KK  RW K  S+LG++ ++S    + ++EE       + G +V+R   +ESW +LR   NG   G   +
Subjt:  SKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEES-----CVGGNVVDRPFAAESWEKLRRVANGEANGSVSQ

Query:  KLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRD
         +  + +VS R        +GGG+  K+ GL RR+
Subjt:  KLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRD

Arabidopsis top hitse value%identityAlignment
AT2G38070.1 Protein of unknown function (DUF740)9.7e-6437.39Show/hide
Query:  HRLS-SCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHST----------------------SELRRSKSFSAAKREA-GIGRPEVQHRKSCDA
        HR S SC RHP E  TGFC SCL +RL+ +D   +  + V +                           ELRR+KSFSA+K EA  +G  E Q R+SCD 
Subjt:  HRLS-SCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHST----------------------SELRRSKSFSAAKREA-GIGRPEVQHRKSCDA

Query:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVAKERQFRASE---------GVIGPALDIIDDFSGEDAEFKTMK-EFIDLEFRRK--
        R  N+L  LF  + +     +E       EI+ E +   ++  V +E     SE                 +ID+   E+ E +T K E   +EF  +  
Subjt:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVAKERQFRASE---------GVIGPALDIIDDFSGEDAEFKTMK-EFIDLEFRRK--

Query:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMK-----NLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL----
         K   RD +EIAGS W AASVFSKKL KWR+KQK+K     NLG     G++     K    ++R+T+SE+ EYG GRRSCDTDPRFS+DAGR SL    
Subjt:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMK-----NLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL----

Query:  ---DDSRYSFDEPRASWDGYLIGRTHP--RPTPMVSVLEETKLPGIGFEKDDPSDEAEGSPM----NVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSS
           DD RYSF+EPRASWDGYLIGR     R   M+SV+E++ +      + D     E SP      + + +PGGSAQT++YY+DS S+ RRRKS DRSS
Subjt:  ---DDSRYSFDEPRASWDGYLIGRTHP--RPTPMVSVLEETKLPGIGFEKDDPSDEAEGSPM----NVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSS

Query:  SHKK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSG
        S +K      A+ D+LKL  + +                  KDL  SHS + RD D        +  V +  G      K+  + R   ++ G+L +++G
Subjt:  SHKK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSG

Query:  ESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRS-YSVSCRDPSKLAGFNGGG
          +K ++EE   G   VDR F+  SW       N E       K+IRS  SVS R     +G  GGG
Subjt:  ESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRS-YSVSCRDPSKLAGFNGGG

AT3G09070.1 Protein of unknown function (DUF740)4.5e-6132.76Show/hide
Query:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS----------------------------ELRRSKSFSAAKREAGIG
        HRLS SC RHP E  TGFC SCL ERL+ +D       S + ++ P ++  +                              ELRR+KSFSA+K   G  
Subjt:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS----------------------------ELRRSKSFSAAKREAGIG

Query:  RPEVQHRKSCDARSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGE--
              R+SCD R  +SL +LF ++++       T  E+++                  E+E+   EL E   ++        ++  + +++ + S E  
Subjt:  RPEVQHRKSCDARSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGE--

Query:  -----------------DAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GNDTDVGAAKTEAIKPRVLEVRETRS
                         + E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Subjt:  -----------------DAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GNDTDVGAAKTEAIKPRVLEVRETRS

Query:  EVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRT-------HPRPTPMVSVLEETKLP----------GIG
        E+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIGRT        P P  M+SV+E+   P             
Subjt:  EVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRT-------HPRPTPMVSVLEETKLP----------GIG

Query:  FEKDDPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGAS----ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH
         E+  P          V D   IPGGS QT+DYY DS S  RRRKS DRSSS  +  +    AD D+ KL  ++ +S    + + G+        L+D++
Subjt:  FEKDDPSDEAEGSPMNVGDK--IPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGAS----ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH

Query:  SKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEES-----CVGGNVVDRPFAAESWEKLRRVANGEANGSVSQ
        + A    D +G+       + D         KK  RW K  S+LG++ ++S    + ++EE       + G +V+R   +ESW +LR   NG   G   +
Subjt:  SKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEES-----CVGGNVVDRPFAAESWEKLRRVANGEANGSVSQ

Query:  KLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRD
         +  + +VS R        +GGG+  K+ GL RR+
Subjt:  KLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRD

AT3G46990.1 Protein of unknown function (DUF740)1.0e-6035.01Show/hide
Query:  SSCRRHPS-EPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDAR-SGNSLSDLFCREDKPRCTNREVEI
        SSC RHPS +P +GFCASCLRERL  I++    +S  L    T ELRR +S+S   R A +   +   R+SCD R S +SL DLF  +D+ R  +   + 
Subjt:  SSCRRHPS-EPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDAR-SGNSLSDLFCREDKPRCTNREVEI

Query:  ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGED--AEFKTMKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGN
           +L  E  E   +E  +   E + G      D+        E KTMKEFIDL++R   KKN G+DL+EI       ASV S++L     K    N  N
Subjt:  ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGED--AEFKTMKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGN

Query:  DTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGFEKDDPS
        D                   ++ S       GR S D DPR S D GR+       SF++PR+SWDG LI +++ + T + +V E+ K    G E+++  
Subjt:  DTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGFEKDDPS

Query:  DEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTN
        ++         +K PGG+ QTK+YY DS    RRR+SFDRS S K+    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L+DS+  + ++       
Subjt:  DEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTN

Query:  ITSKDSVPDAAGIDRKTFKKVH------RWRKVLSVLGMLQKRSGESSKSDDEESC-VGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR
        + SK  +  AAG + K    V       +W K  ++ G++Q+++   ++   E+   + GN V+    AES  KLRRV  GE N  VS+KL++SYSVS R
Subjt:  ITSKDSVPDAAGIDRKTFKKVH------RWRKVLSVLGMLQKRSGESSKSDDEESC-VGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCR

Query:  --------DPSKLAGFNGGGN--------------------DSKLYGLR-RRDDFTLQRNRSV---RKEQAKKLSFQCQTSHVKRCTTSSSSSSSIK
                  + ++GF GG +                    D  + G+  +++   LQRN +V    +E  +K  F+   S VK   TS S  S +K
Subjt:  --------DPSKLAGFNGGGN--------------------DSKLYGLR-RRDDFTLQRNRSV---RKEQAKKLSFQCQTSHVKRCTTSSSSSSSIK

AT5G01170.1 Protein of unknown function (DUF740)1.5e-4833.51Show/hide
Query:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS-------------------------ELRRSKSFSAAKREAGIGRPE
        HRLS SC  HP E  +GFC SCL +RL+ +D       S + ++ P ++  S                           ELRR+KSFSA   E   G  E
Subjt:  HRLS-SCRRHPSEPVTGFCASCLRERLAGID-------SDTRQESPVLNQHSTS-------------------------ELRRSKSFSAAKREAGIGRPE

Query:  VQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELREVVAKERQFRASE--GVIGPALDIIDDFSG----EDAEFKTMKEFIDLEFRRKKNA
         Q R+SCD R  +   +L   E      ++  E   E+   E+   V +E +    E  G   P  +I+++ S     E+ E K MK+++DL  + KK +
Subjt:  VQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELREVVAKERQFRASE--GVIGPALDIIDDFSG----EDAEFKTMKEFIDLEFRRKKNA

Query:  GRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYS
           +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + ++                E G+GRRS DTDPRFS+DA       GR+S+DDSRYS
Subjt:  GRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYS

Query:  FDEPRASWDGYLIGRTH----PRPTPMVSVLEETKLPGIGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADF
         DEPRASWDG+LIGRT     P P  M+SV+E   L     +    PS +      +    IPGGS QT+DYY    S+ RRRKS DRS+S +K    + 
Subjt:  FDEPRASWDGYLIGRTH----PRPTPMVSVLEETKLPGIGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADF

Query:  DDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVG
        +D+K +SN+  +             I    ++ + +K  ++GD                       KK  RW K  S+LG + ++  +  + D       
Subjt:  DDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVG

Query:  GNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLAGFNGGGN
          +V+R   +ESW ++R   NGE  G    K+ RS S VS R        +GGG+
Subjt:  GNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLAGFNGGGN

AT5G58930.1 Protein of unknown function (DUF740)8.2e-6336.85Show/hide
Query:  CRRHP-SEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESE
        C RHP S+P TGFCA+CLRERL+ I++ +   S      +++ELRR +S+S   R+A     +   R+SCD RS +   D             + E+   
Subjt:  CRRHP-SEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESE

Query:  NLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKK--QKMKNLGNDTD
        ++ F +   + ++ +    EG       ++++   ED E KTMKE IDLE R +  KN G+D            SVFS+ L K+  K  +K+ + GN   
Subjt:  NLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKK--QKMKNLGNDTD

Query:  VGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEET-----KLPGIGFEKDD
                                   LGRRSCD DPR S+DAGR+       SFDEPRASWDG LIG+T+P+  P+ SV E+      K+ G   E+D+
Subjt:  VGAAKTEAIKPRVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEET-----KLPGIGFEKDD

Query:  PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSG
         ++             PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L+DS+  + ++     
Subjt:  PSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSG

Query:  TNITSKDSVPDAAGIDRK-----TFKKVHRWRKVLSVLGMLQKRS--GESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVS
          + SK     AAG  +K       K    W K  +  G++Q+++   ++    ++   +GGN ++    AES  KLRRVA GE NG VS+KLIRSYSVS
Subjt:  TNITSKDSVPDAAGIDRK-----TFKKVHRWRKVLSVLGMLQKRS--GESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVS

Query:  C--------RDPSKLAGFNGG
                 R  S + GF GG
Subjt:  C--------RDPSKLAGFNGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTGCAGAACAAAGCTGTTTCTCATCGGCTTTCTAGTTGTCGCCGGCATCCTAGTGAGCCGGTGACTGGATTCTGTGCCTCCTGCCTCCGAGAACGCCTTGCTGG
GATTGATTCCGATACGCGGCAGGAATCGCCTGTTTTGAACCAGCATTCTACGTCTGAGCTCCGGCGGAGTAAATCTTTTTCTGCGGCGAAGCGTGAGGCCGGCATCGGAC
GACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCTGGTAATTCGTTGTCGGACCTTTTCTGTCGGGAAGATAAGCCGAGATGTACGAATCGGGAGGTGGAGATC
GAGTCTGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCGAAGGAGAGGCAATTTAGGGCTTCTGAGGGGGTAATTGGACCGGCTCTGGATATAATCGACGATTTTTC
TGGAGAGGATGCTGAGTTCAAGACGATGAAGGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCTGGTCGCGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAG
CGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTAATGATACCGATGTAGGTGCGGCGAAGACAGAGGCTATCAAGCCG
AGAGTGCTTGAAGTTAGGGAGACTCGTTCCGAGGTTGGAGAATACGGATTGGGAAGAAGGTCTTGTGATACAGATCCAAGATTCTCTGTCGATGCAGGTAGAATGTCGCT
GGATGACTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAGAACTCATCCAAGACCTACGCCGATGGTTTCAGTTTTGGAGGAGACGA
AATTACCTGGGATTGGATTTGAGAAAGACGATCCTTCTGATGAAGCAGAAGGATCTCCGATGAATGTAGGCGATAAGATCCCTGGTGGATCGGCTCAGACTAAAGATTAC
TATATGGATTCATTGTCTACTGTAAGGCGGAGGAAGAGTTTCGATCGTTCAAGTTCACACAAAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGC
AAAGGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACCAGAGATGGCGATTTGAGTG
GCACCAATATTACTTCAAAAGATTCTGTTCCTGATGCGGCTGGGATTGATCGAAAGACGTTCAAAAAGGTGCATAGATGGCGTAAAGTGTTGAGTGTTTTGGGTATGTTG
CAGAAGCGAAGTGGTGAGAGCAGCAAGTCTGATGATGAAGAAAGCTGTGTTGGTGGAAATGTGGTTGATCGGCCTTTTGCCGCCGAGTCATGGGAGAAGCTAAGGCGTGT
TGCTAATGGGGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGTAGCTACAGCGTTAGCTGTAGAGATCCCAGCAAGCTAGCTGGGTTTAATGGCGGTGGCAACGATT
CGAAACTGTACGGTTTGAGGCGAAGAGACGATTTTACATTGCAGAGGAATCGGAGTGTCAGGAAAGAGCAGGCCAAGAAGCTCTCCTTTCAATGTCAAACATCCCACGTA
AAAAGGTGCACCACTTCTTCTTCTTCTTCTTCTTCCATTAAAGCTTGTACTGTAAACATTCTATAA
mRNA sequenceShow/hide mRNA sequence
AGAGAGGGTTTAGTGAGGGTTTCTTCGGAGCTTACTTTTGTTTTTCCGTTTCTTCACTCTCTCCTTTTTGTTTCTAACCACTTCTTCTTCGGCTTCGGCTTCTCTTCGTT
TTTTCGATCTTGATGGCTATGCGCCATGAGAGTATTTTGTATCTTCAGCTATGAATCTGCAGAACAAAGCTGTTTCTCATCGGCTTTCTAGTTGTCGCCGGCATCCTAGT
GAGCCGGTGACTGGATTCTGTGCCTCCTGCCTCCGAGAACGCCTTGCTGGGATTGATTCCGATACGCGGCAGGAATCGCCTGTTTTGAACCAGCATTCTACGTCTGAGCT
CCGGCGGAGTAAATCTTTTTCTGCGGCGAAGCGTGAGGCCGGCATCGGACGACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCTGGTAATTCGTTGTCGGACC
TTTTCTGTCGGGAAGATAAGCCGAGATGTACGAATCGGGAGGTGGAGATCGAGTCTGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCGAAGGAGAGGCAATTTAGG
GCTTCTGAGGGGGTAATTGGACCGGCTCTGGATATAATCGACGATTTTTCTGGAGAGGATGCTGAGTTCAAGACGATGAAGGAGTTTATAGATCTTGAATTTCGGAGGAA
GAAGAATGCTGGTCGCGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATC
TCGGTAATGATACCGATGTAGGTGCGGCGAAGACAGAGGCTATCAAGCCGAGAGTGCTTGAAGTTAGGGAGACTCGTTCCGAGGTTGGAGAATACGGATTGGGAAGAAGG
TCTTGTGATACAGATCCAAGATTCTCTGTCGATGCAGGTAGAATGTCGCTGGATGACTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGG
AAGAACTCATCCAAGACCTACGCCGATGGTTTCAGTTTTGGAGGAGACGAAATTACCTGGGATTGGATTTGAGAAAGACGATCCTTCTGATGAAGCAGAAGGATCTCCGA
TGAATGTAGGCGATAAGATCCCTGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTACTGTAAGGCGGAGGAAGAGTTTCGATCGTTCAAGTTCACAC
AAAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGCAAAGGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAA
AGATTTGAAGGACTCACACTCAAAAGCAACCAGAGATGGCGATTTGAGTGGCACCAATATTACTTCAAAAGATTCTGTTCCTGATGCGGCTGGGATTGATCGAAAGACGT
TCAAAAAGGTGCATAGATGGCGTAAAGTGTTGAGTGTTTTGGGTATGTTGCAGAAGCGAAGTGGTGAGAGCAGCAAGTCTGATGATGAAGAAAGCTGTGTTGGTGGAAAT
GTGGTTGATCGGCCTTTTGCCGCCGAGTCATGGGAGAAGCTAAGGCGTGTTGCTAATGGGGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGTAGCTACAGCGTTAG
CTGTAGAGATCCCAGCAAGCTAGCTGGGTTTAATGGCGGTGGCAACGATTCGAAACTGTACGGTTTGAGGCGAAGAGACGATTTTACATTGCAGAGGAATCGGAGTGTCA
GGAAAGAGCAGGCCAAGAAGCTCTCCTTTCAATGTCAAACATCCCACGTAAAAAGGTGCACCACTTCTTCTTCTTCTTCTTCTTCCATTAAAGCTTGTACTGTAAACATT
CTATAAATTAGTT
Protein sequenceShow/hide protein sequence
MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRSKSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEI
ESENLGFELREVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKP
RVLEVRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGFEKDDPSDEAEGSPMNVGDKIPGGSAQTKDY
YMDSLSTVRRRKSFDRSSSHKKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGML
QKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVRKEQAKKLSFQCQTSHV
KRCTTSSSSSSSIKACTVNIL