; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G045150 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G045150
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionUPF0503 protein At3g09070, chloroplastic
Genome locationCiama_Chr02:32998897..33000621
RNA-Seq ExpressionCaUC02G045150
SyntenyCaUC02G045150
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]6.6e-28088.19Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+ ESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDP  EAEGSPMNVGDKIPGGSAQTKDYY DSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDLS T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

XP_004143144.1 protein OCTOPUS [Cucumis sativus]1.1e-29892.37Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPV+NNHSS ELRRSKS SAAKCEA GIG SEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSGEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNLSNNS VGA KAEDI PR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K TT 
Subjt:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDLS TDVTSKDSVPDA VIDRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

XP_008464059.1 PREDICTED: UPF0503 protein At3g09070, chloroplastic [Cucumis melo]1.7e-29991.67Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD QHESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDLS TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]1.1e-27988.02Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+ ESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPS EAEGSPMNVG+KIPGGSAQTKDYY DSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDLS T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

XP_038901013.1 protein OCTOPUS [Benincasa hispida]1.2e-30594.26Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNH-SSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQLKT+SHRLSTCHRHPSKPVTGFCASCLRERLAGID DTQ ESPV NNH SSELRRSKS+SAAK EAGI  SEVQHRKSCDVRSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNH-SSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT REVEIESENLG ELREVVANER FRASEGIIGPALGTIDDF+GEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
         KNLSNN NVG  KAEDI PRVLEIRETRSEVG+YGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSSLRRRKSFDRS SHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS SKA TR+
Subjt:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDP
        GDLS TDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGM+QKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGE NSCVSQKLIRSYSVSCRDP
Subjt:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        SKLAGFN  NDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPR+SPFNVKH M
Subjt:  SKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

TrEMBL top hitse value%identityAlignment
A0A0A0KBP2 Uncharacterized protein5.2e-29992.37Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPV+NNHSS ELRRSKS SAAKCEA GIG SEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSGEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNLSNNS VGA KAEDI PR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K TT 
Subjt:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDLS TDVTSKDSVPDA VIDRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic8.0e-30091.67Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD QHESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDLS TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A5A7V3J1 UPF0503 protein8.0e-30091.67Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD QHESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDLS TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like5.4e-28088.02Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+ ESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPS EAEGSPMNVG+KIPGGSAQTKDYY DSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDLS T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

A0A6J1L0V5 UPF0503 protein At3g09070, chloroplastic-like9.6e-27787.67Show/hide
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+ ESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSG+SLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEEAK  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPS EAEGSPMNVGDKIPGGSAQTKDYY DSLSS+RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDLS T++TSKDSVPDA  IDRKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAG N  GNDSKL   R RDDFTLQRNRSVRYSP NFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

SwissProt top hitse value%identityAlignment
Q9LFB9 Protein OCTOPUS-like8.9e-5434Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQHESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                           VR     ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQHESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   EE E K +K+++DL  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + +                 E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +    PS +      +    IPGGS QT+DYYT   SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG
        D+K +SN+  +             I    +  + +K   ++GD                       KK  RW K  S+LG + ++  + + +D  S S  
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG

Query:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG
          +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G
Subjt:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG

Query:  KPG
          G
Subjt:  KPG

Q9SS80 Protein OCTOPUS4.4e-6133.88Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQHESPVRNNHSSELRRSKSFSAAKCEAGI-
        HRLST C+RHP +  TGFC SCL ERL+ +D                                     +T     V+     ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQHESPVRNNHSSELRRSKSFSAAKCEAGI-

Query:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-
        G+ E Q R+SCDVR  +SL +LF ++++       T  E+++                  E+E+   EL E    +        I+  +   + + S E 
Subjt:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-

Query:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR
                          E E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK   N  +   G+A+     P   ++R+T+
Subjt:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR

Query:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD
        SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A        T  +   
Subjt:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD

Query:  PSGEAEGSPMNVGDK--------IPGGSAQTKDYYTDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS
        P  E    P  V           IPGGS QT+DYYTD  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +D NN+
Subjt:  PSGEAEGSPMNVGDK--------IPGGSAQTKDYYTDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS

Query:  RSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS
         +  T  +G      +     + D  V      KK  RW K  S+LG++ ++S     ++EE        + G +V+R ++ESW +LR    G       
Subjt:  RSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS

Query:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
         +++RS S VS R          G+  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

Arabidopsis top hitse value%identityAlignment
AT2G38070.1 Protein of unknown function (DUF740)6.7e-6534.8Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-----------------------ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDV
        HR ST C RHP +  TGFC SCL +RL+ +D   ++ + V ++                          ELRR+KSFSA+K EA  +G  E Q R+SCDV
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSS-----------------------ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDV

Query:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGEEGEFKTVK-EFIDLEFRRK--
        R  N+L  LF  + +     +E       EI+ E +   ++  V  E     SE                  ID+   EE E +T K E   +EF  +  
Subjt:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGEEGEFKTVK-EFIDLEFRRK--

Query:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVL--EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------
         K   RD +EIAGS W AASVFSKKL KWR+KQK+K      N+GA  +     + +  ++R+T+SE+ EYG GRRSCDTDPRFS+DAGR SL       
Subjt:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVL--EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------

Query:  DDSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDDPSGEAEGSPM----NVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHR
        DD RYSF+EPRASWDGYLIG+     R+  M+SV+E++        + D     E SP      + + +PGGSAQT++YY DS SS RRRKS DRSSS R
Subjt:  DDSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDDPSGEAEGSPM----NVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHR

Query:  K---GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGES
        K       + D+LKL  + +           AK L++    N+ R    + +   ++ ++  +++V       ++T K    W    ++ G++ +++G +
Subjt:  K---GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGES

Query:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRS-YSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL
        K ++EE   G   VDR  + SW       N E  +    K+IRS  SVS R      G                   LQRN      S +   +  +NG+
Subjt:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRS-YSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL

Query:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHAM
        L+FYLTP +     S +S     +  P S PF  ++ M
Subjt:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHAM

AT3G09070.1 Protein of unknown function (DUF740)3.1e-6233.88Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQHESPVRNNHSSELRRSKSFSAAKCEAGI-
        HRLST C+RHP +  TGFC SCL ERL+ +D                                     +T     V+     ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQHESPVRNNHSSELRRSKSFSAAKCEAGI-

Query:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-
        G+ E Q R+SCDVR  +SL +LF ++++       T  E+++                  E+E+   EL E    +        I+  +   + + S E 
Subjt:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-

Query:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR
                          E E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK   N  +   G+A+     P   ++R+T+
Subjt:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR

Query:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD
        SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A        T  +   
Subjt:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD

Query:  PSGEAEGSPMNVGDK--------IPGGSAQTKDYYTDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS
        P  E    P  V           IPGGS QT+DYYTD  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +D NN+
Subjt:  PSGEAEGSPMNVGDK--------IPGGSAQTKDYYTDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS

Query:  RSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS
         +  T  +G      +     + D  V      KK  RW K  S+LG++ ++S     ++EE        + G +V+R ++ESW +LR    G       
Subjt:  RSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS

Query:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
         +++RS S VS R          G+  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

AT3G46990.1 Protein of unknown function (DUF740)1.0e-6535.35Show/hide
Query:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQHESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEIE
        S+CHRHPS KP +GFCASCLRERL  I+  +   + V+   + ELRR +S+S     A + +S+   R+SCDVR S +SL DLF  +D+ R  +   +  
Subjt:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQHESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEIE

Query:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN
          +L  E  E    E  +   E I G   G       E    KT+KEFIDL++R   KKN G+DL+EI       ASV S++L  +   ++    S++  
Subjt:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN

Query:  VGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSGEA
         G                          GR S D DPR S D GR+       SF++PR+SWDG LI K+Y ++T + +V E+AK +  G E+++     
Subjt:  VGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSGEA

Query:  EGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVT
            +   +K PGG+ QTK+YY+DS    RRR+SFDRS S ++    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L +S +  + ++    S ++ 
Subjt:  EGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVT

Query:  SKDSVPDAAVIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCR--
        SK  +  AA  + K    V       +W K  ++ G++Q R  E+K++   ++   + GN V+  +AES  KLRRV  GE N  VS+KL++SYSVS R  
Subjt:  SKDSVPDAAVIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCR--

Query:  ------DPSKLAGFNAGNDS-------KLN---------------VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
                + ++GF  G  S        +N               +   ++   LQRN +V   S  N +  + RFYL+P++S+ +  K GKSR
Subjt:  ------DPSKLAGFNAGNDS-------KLN---------------VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR

AT5G01170.1 Protein of unknown function (DUF740)6.3e-5534Show/hide
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQHESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                           VR     ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQHESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   EE E K +K+++DL  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + +                 E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +    PS +      +    IPGGS QT+DYYT   SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG
        D+K +SN+  +             I    +  + +K   ++GD                       KK  RW K  S+LG + ++  + + +D  S S  
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG

Query:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG
          +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G
Subjt:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG

Query:  KPG
          G
Subjt:  KPG

AT5G58930.1 Protein of unknown function (DUF740)1.0e-6836.12Show/hide
Query:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIES
        + CHRHP SKP TGFCA+CLRERL+ I+  +   S      S+ELRR +S+S    +A   + +   R+SCDVRS +   D             + E+  
Subjt:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIES

Query:  ENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNV
         ++ F +   +  + +    EG        + +   E+GE KT+KE IDLE R +  KN G+D            SVFS+ L K+  K   K + ++ N 
Subjt:  ENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNV

Query:  GAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFS-----GTGFEKDDP
                                  LGRRSCD DPR S+DAGR+       SFDEPRASWDG LIGKTYP++ P+ SV E+ K S     G   E+D+ 
Subjt:  GAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFS-----GTGFEKDDP

Query:  SGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSS
        +              PGG+AQT+DYY DS    RRR+SFDRSS H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L +S +  + ++    S
Subjt:  SGEAEGSPMNVGDKIPGGSAQTKDYYTDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSS

Query:  TDVTSK--DSVPDAAVIDRKTF---KKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSC
         ++ SK    V    V  +  F   K    W K  +  G++Q+++  +K++   ++   +GGN ++  +AES  KLRRVA GE N  VS+KLIRSYSVS 
Subjt:  TDVTSK--DSVPDAAVIDRKTF---KKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSC

Query:  --------RDPSKLAGFNAGNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
                R  S + GF  G  S                           V   R+      ++   YSP+N  NG++RFYLTPL S+ +  K GKSR
Subjt:  --------RDPSKLAGFNAGNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTGCAGCTCAAAACTGTATCACATCGGCTTTCCACTTGTCACCGTCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCCGG
GATTGATCCCGACACGCAGCACGAATCGCCTGTTCGGAACAACCATTCTTCAGAGCTTCGTCGGAGTAAATCCTTTTCTGCAGCCAAGTGTGAGGCTGGTATTGGACTAT
CGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGCTCCGGGAACTCCTTGTCGGACCTTTTCTGTCGTGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAA
TCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGTACGATCGATGATTTTTCTGG
AGAGGAGGGTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAGCGG
CTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAACAAAAAATGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGCGAAAGCAGAGGATATCAACCCTAGA
GTGCTTGAAATCAGGGAGACTCGTTCGGAGGTCGGAGAATATGGACTAGGAAGAAGATCTTGTGATACAGATCCAAGATTCTCTGTTGATGCAGGTAGAATGTCGTTGGA
TGATTCACGTTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCCATGGTATCAGTTTTGGAGGAGGCCAAAT
TTTCTGGTACTGGATTTGAGAAAGATGATCCTTCCGGTGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATTCCTGGTGGATCGGCTCAGACTAAAGATTACTAC
ACGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCGAGTTCACACAGAAAAGGGGCTTCCGGGGACTTTGATGACTTGAAATTAATATCAAATGCAAA
GGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAAGTTTTAATCACAGAGAAAGATTTGAACAACTCCCGCTCAAAAGCAACAACCAGAGATGGCGATTTGAGTA
GCACGGATGTTACCTCCAAAGATTCTGTTCCTGATGCAGCTGTGATTGACCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAGAAGCGAAGTGGTGAAAGTAAGTCTGATGATGAAGAAAGCAGTGTTGGAGGGAATGTGGTTGATCGGCCTATTGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGTGAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGCTTTAATGCTGGTAATGATTCGAAACTGA
ACGTTTCGAGATGGAGAGATGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCACCAAATAACTTTGATAATGGCTTACTAAGGTTCTATTTGACACCATTGAGG
AGCTACAGCAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGCCATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCTGCAGCTCAAAACTGTATCACATCGGCTTTCCACTTGTCACCGTCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCCGG
GATTGATCCCGACACGCAGCACGAATCGCCTGTTCGGAACAACCATTCTTCAGAGCTTCGTCGGAGTAAATCCTTTTCTGCAGCCAAGTGTGAGGCTGGTATTGGACTAT
CGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGCTCCGGGAACTCCTTGTCGGACCTTTTCTGTCGTGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAA
TCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGTACGATCGATGATTTTTCTGG
AGAGGAGGGTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAGCGG
CTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAACAAAAAATGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGCGAAAGCAGAGGATATCAACCCTAGA
GTGCTTGAAATCAGGGAGACTCGTTCGGAGGTCGGAGAATATGGACTAGGAAGAAGATCTTGTGATACAGATCCAAGATTCTCTGTTGATGCAGGTAGAATGTCGTTGGA
TGATTCACGTTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCCATGGTATCAGTTTTGGAGGAGGCCAAAT
TTTCTGGTACTGGATTTGAGAAAGATGATCCTTCCGGTGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATTCCTGGTGGATCGGCTCAGACTAAAGATTACTAC
ACGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCGAGTTCACACAGAAAAGGGGCTTCCGGGGACTTTGATGACTTGAAATTAATATCAAATGCAAA
GGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAAGTTTTAATCACAGAGAAAGATTTGAACAACTCCCGCTCAAAAGCAACAACCAGAGATGGCGATTTGAGTA
GCACGGATGTTACCTCCAAAGATTCTGTTCCTGATGCAGCTGTGATTGACCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAGAAGCGAAGTGGTGAAAGTAAGTCTGATGATGAAGAAAGCAGTGTTGGAGGGAATGTGGTTGATCGGCCTATTGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGTGAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGCTTTAATGCTGGTAATGATTCGAAACTGA
ACGTTTCGAGATGGAGAGATGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCACCAAATAACTTTGATAATGGCTTACTAAGGTTCTATTTGACACCATTGAGG
AGCTACAGCAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGCCATGTAA
Protein sequenceShow/hide protein sequence
MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIE
SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPR
VLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSGEAEGSPMNVGDKIPGGSAQTKDYY
TDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLSSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMM
QKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
SYSSRGKPGKSRPRSSPFNVKHAM