CuGenDBv2

CmUC02G045330 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC02G045330
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: UPF0503 protein At3g09070, chloroplastic
Genome location: CmU531Chr02:33234156..33235880
RNA-Seq Expression: CmUC02G045330
Synteny: CmUC02G045330
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR008004 - Protein OCTOPUS-like
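
The genome location field above packs a chromosome name and a coordinate range into one string. A minimal Python sketch for splitting it and deriving the span length follows. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (as in GFF3), which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("CmU531Chr02:33234156..33235880")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1725 bp on CmU531Chr02.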


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.2e-281; % identity: 88.54)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDP DEAEGSPMNVGDKIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDL  T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
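
The identity level of a hit can be checked roughly against the alignment text itself. The sketch below is only an illustration. It assumes the BLAST-style convention that the middle row repeats a residue where query and subject match, shows `+` for a conservative substitution, and is blank otherwise. `midline_stats` is a hypothetical helper, and the sample input is just the first ten columns of the KAG6605428.1 alignment with the row prefixes stripped.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one aligned block (hypothetical helper).

    Assumes `query` and `midline` start at the same column; trailing blanks
    that were trimmed from the midline are restored by padding.
    """
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    # A column is an identity when the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # '+' marks a conservative substitution under the assumed convention.
    similar = sum(1 for m in midline if m == "+")
    columns = len(query)
    return {
        "columns": columns,
        "identities": identities,
        "positives": identities + similar,
        "pct_identity": round(100.0 * identities / columns, 2),
    }


# First ten alignment columns of the KAG6605428.1 hit, prefixes stripped.
stats = midline_stats("MNLQLKTVSH", "MNLQ K VSH")
print(stats)
```

Summing `identities` and `columns` over every block of a hit before dividing gives a whole-alignment figure. It may not match the reported % identity exactly if the database counts gap columns differently.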

XP_004143144.1 protein OCTOPUS [Cucumis sativus] (e value: 6.3e-299; % identity: 92.37)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQ ESPV+NNHSS ELRRSKS SAAKCEA GIG SEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSGEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNLSNNS VGA KAEDI PR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K TT 
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDL  TDVTSKDSVPDA VIDRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

XP_008464059.1 PREDICTED: UPF0503 protein At3g09070, chloroplastic [Cucumis melo] (e value: 9.7e-300; % identity: 91.67)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDL  TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata] (e value: 2.0e-281; % identity: 88.37)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDL  T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

XP_038901013.1 protein OCTOPUS [Benincasa hispida] (e value: 2.8e-307; % identity: 94.61)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNH-SSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQLKT+SHRLSTCHRHPSKPVTGFCASCLRERLAGID DTQQESPV NNH SSELRRSKS+SAAK EAGI  SEVQHRKSCDVRSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNH-SSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT REVEIESENLG ELREVVANER FRASEGIIGPALGTIDDF+GEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
         KNLSNN NVG  KAEDI PRVLEIRETRSEVG+YGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS SKA TR+
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDP
        GDL  TDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGM+QKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGE NSCVSQKLIRSYSVSCRDP
Subjt:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDP

Query:  SKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        SKLAGFN  NDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPR+SPFNVKH M
Subjt:  SKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KBP2 Uncharacterized protein (e value: 3.1e-299; % identity: 92.37)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQ ESPV+NNHSS ELRRSKS SAAKCEA GIG SEVQHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTNREVEIESENLGFELREVV N RQFRASEGIIGP LGTID FSGEE EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAAS FSKKLGKWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNLSNNS VGA KAEDI PR  EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR K TT 
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDL  TDVTSKDSVPDA VIDRKTFKKVHRWRKVLSVLGM+QKR+GESKSDDEESSVGGNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDN GLLRFYLTPLRSY +RGK GK+RPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDN-GLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic (e value: 4.7e-300; % identity: 91.67)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDL  TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A5A7V3J1 UPF0503 protein (e value: 4.7e-300; % identity: 91.67)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED
        MNLQLK+VSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPD Q ESP+ NNHSS  ELRRSKS+SAAKCEAGIG SE+QHRKSCDVRSGNSLSDLFCRED
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS--ELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLGFELREVV N RQFRASEGIIGP LGTIDDF+GE+ EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG
        K KNL NNSNVGA K EDI PR LEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSG G
Subjt:  KMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG

Query:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR
        FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGASGDFD+LKLISNAKVSPATTELFYGAKVLITEKDLN+SR KA T 
Subjt:  FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTR

Query:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD
        DGDL  TDVTSKDSVPDA VIDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP+ ESWEKLRRVANGE NSCVSQKLIRSYSVSCRD
Subjt:  DGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRD

Query:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM
        PSKLAGFN GNDSKLNV+RWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY SRGK GKSRPR+SPFNVKH +
Subjt:  PSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKHAM

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like (e value: 9.9e-282; % identity: 88.37)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEE K  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS++RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDL  T++TSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAGFN  GNDSKL   R RDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

A0A6J1L0V5 UPF0503 protein At3g09070, chloroplastic-like (e value: 2.3e-278; % identity: 88.02)
Query:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK
        MNLQ K VSHRLS+C RHPS+PVTGFCASCLRERLAGID DT+QESPV N HS SELRRSKSFSAAK EAGIG  EVQHRKSCD RSG+SLSDLFCREDK
Subjt:  MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHS-SELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLGFELREVVA ERQFRASEG+IGPAL  IDDFSGE+ EFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF
        MKNL N+++VGAAK E I PRVLE+RETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIG+T+PR TPMVSVLEEAK  G GF
Subjt:  MKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF

Query:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD
        EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRSSSH+KGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +S SKA TRD
Subjt:  EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRD

Query:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR
        GDL  T++TSKDSVPDA  IDRKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP  AESWEKLRRVANGE N  VSQKLIRSYSVSCR
Subjt:  GDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPI-AESWEKLRRVANGEVNSCVSQKLIRSYSVSCR

Query:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH
        DPSKLAG N  GNDSKL   R RDDFTLQRNRSVRYSP NFDNGLLRFYLTPLRSY +RGKPGKSRPRSSPFNVKH
Subjt:  DPSKLAGFN-AGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSRPRSSPFNVKH

SwissProt top hits (e value, % identity, alignment)
Q9LFB9 Protein OCTOPUS-like (e value: 2.0e-53; % identity: 33.83)
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                           VR     ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   EE E K +K+++DL  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + +                 E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +    PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG
        D+K +SN+  +             I    +  + +K   ++GD                       KK  RW K  S+LG + ++  + + +D  S S  
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG

Query:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG
          +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G
Subjt:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG

Query:  KPG
          G
Subjt:  KPG

Q9SS80 Protein OCTOPUS (e value: 8.9e-62; % identity: 33.88)
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQQESPVRNNHSSELRRSKSFSAAKCEAGI-
        HRLST C+RHP +  TGFC SCL ERL+ +D                                     +T     V+     ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQQESPVRNNHSSELRRSKSFSAAKCEAGI-

Query:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-
        G+ E Q R+SCDVR  +SL +LF ++++       T  E+++                  E+E+   EL E    +        I+  +   + + S E 
Subjt:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-

Query:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR
                          E E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK   N  +   G+A+     P   ++R+T+
Subjt:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR

Query:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD
        SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A        T  +   
Subjt:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD

Query:  PSDEAEGSPMNVGDK--------IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS
        P +E    P  V           IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +D NN+
Subjt:  PSDEAEGSPMNVGDK--------IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS

Query:  RSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS
         +  T  +G  R   +     + D  V      KK  RW K  S+LG++ ++S     ++EE        + G +V+R ++ESW +LR    G       
Subjt:  RSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS

Query:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
         +++RS S VS R          G+  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

Arabidopsis top hits (e value, % identity, alignment)
AT2G38070.1 Protein of unknown function (DUF740) (e value: 1.8e-65; % identity: 34.8)
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-----------------------ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDV
        HR ST C RHP +  TGFC SCL +RL+ +D   +  + V ++                          ELRR+KSFSA+K EA  +G  E Q R+SCDV
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSS-----------------------ELRRSKSFSAAKCEA-GIGLSEVQHRKSCDV

Query:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGEEGEFKTVK-EFIDLEFRRK--
        R  N+L  LF  + +     +E       EI+ E +   ++  V  E     SE                  ID+   EE E +T K E   +EF  +  
Subjt:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLGFELREVVANERQFRASE---------GIIGPALGTIDDFSGEEGEFKTVK-EFIDLEFRRK--

Query:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVL--EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------
         K   RD +EIAGS W AASVFSKKL KWR+KQK+K      N+GA  +     + +  ++R+T+SE+ EYG GRRSCDTDPRFS+DAGR SL       
Subjt:  -KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVL--EIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSL-------

Query:  DDSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDDPSDEAEGSPM----NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHR
        DD RYSF+EPRASWDGYLIG+     R+  M+SV+E++        + D     E SP      + + +PGGSAQT++YY+DS SS RRRKS DRSSS R
Subjt:  DDSRYSFDEPRASWDGYLIGKTYP--RITPMVSVLEEAKFSGTGFEKDDPSDEAEGSPM----NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHR

Query:  K---GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGES
        K       + D+LKL  + +           AK L++    N+ R    + + +    ++  +++V       ++T K    W    ++ G++ +++G +
Subjt:  K---GASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGES

Query:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRS-YSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL
        K ++EE   G   VDR  + SW       N E  +    K+IRS  SVS R      G                   LQRN      S +   +  +NG+
Subjt:  KSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRS-YSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNR-----SVRYSPNNFDNGL

Query:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHAM
        L+FYLTP +     S +S     +  P S PF  ++ M
Subjt:  LRFYLTPLR-----SYSSRGKPGKSRPRSSPFNVKHAM

AT3G09070.1 Protein of unknown function (DUF740) (e value: 6.3e-63; % identity: 33.88)
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQQESPVRNNHSSELRRSKSFSAAKCEAGI-
        HRLST C+RHP +  TGFC SCL ERL+ +D                                     +T     V+     ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGIDP------------------------------------DTQQESPVRNNHSSELRRSKSFSAAKCEAGI-

Query:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-
        G+ E Q R+SCDVR  +SL +LF ++++       T  E+++                  E+E+   EL E    +        I+  +   + + S E 
Subjt:  GLSEVQHRKSCDVRSGNSLSDLFCREDK----PRCTNREVEI------------------ESENLGFELREVVANERQFRASEGIIGPALGTIDDFSGE-

Query:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR
                          E E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK   N  +   G+A+     P   ++R+T+
Subjt:  ------------------EGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN--VGAAKAEDINPRVLEIRETR

Query:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD
        SE+ +YG GRRSCDTDPRFS+DA              GR+SLDD RYSFDEPRASWDG LIG+T        P    M+SV+E+A        T  +   
Subjt:  SEVGEYGLGRRSCDTDPRFSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGKTY-------PRITPMVSVLEEA----KFSGTGFEKDD

Query:  PSDEAEGSPMNVGDK--------IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS
        P +E    P  V           IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+    D D+ KL     VS A +   Y   +    +D NN+
Subjt:  PSDEAEGSPMNVGDK--------IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---GDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNS

Query:  RSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS
         +  T  +G  R   +     + D  V      KK  RW K  S+LG++ ++S     ++EE        + G +V+R ++ESW +LR    G       
Subjt:  RSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPIAESWEKLRRVANGEVNSCVS

Query:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS
         +++RS S VS R          G+  K+N          +RN+S RYSP N +NG+L+FYL  +++
Subjt:  QKLIRSYS-VSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRS

AT3G46990.1 Protein of unknown function (DUF740) (e value: 7.9e-66; % identity: 35.52)
Query:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQQESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEIE
        S+CHRHPS KP +GFCASCLRERL  I+    Q S +    + ELRR +S+S     A + +S+   R+SCDVR S +SL DLF  +D+ R  +   +  
Subjt:  STCHRHPS-KPVTGFCASCLRERLAGIDPDTQQESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVR-SGNSLSDLFCREDKPRCTNREVEIE

Query:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN
          +L  E  E    E  +   E I G   G       E    KT+KEFIDL++R   KKN G+DL+EI       ASV S++L  +   ++    S++  
Subjt:  SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSN

Query:  VGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSDEA
         G                          GR S D DPR S D GR+       SF++PR+SWDG LI K+Y ++T + +V E+AK +  G E+++  ++ 
Subjt:  VGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSDEA

Query:  EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVT
                +K PGG+ QTK+YY DS    RRR+SFDRS S ++    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L +S +  + ++    S ++ 
Subjt:  EGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVT

Query:  SKDSVPDAAVIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCR--
        SK  +  AA  + K    V       +W K  ++ G++Q R  E+K++   ++   + GN V+  +AES  KLRRV  GE N  VS+KL++SYSVS R  
Subjt:  SKDSVPDAAVIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCR--

Query:  ------DPSKLAGFNAGNDS-------KLN---------------VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
                + ++GF  G  S        +N               +   ++   LQRN +V   S  N +  + RFYL+P++S+ +  K GKSR
Subjt:  ------DPSKLAGFNAGNDS-------KLN---------------VSRWRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR

AT5G01170.1 Protein of unknown function (DUF740) (e value: 1.4e-54; % identity: 33.83)
Query:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                           VR     ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCASCLRERLAGID------PDTQQESP---------------------------VRNNHSSELRRSKSFSAAKCEAGIGLSE

Query:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG
         Q R+SCDVR  +   +L   E    DK     RE  +    L   E  E+  +E       G I     +  +   EE E K +K+++DL  + KK + 
Subjt:  VQHRKSCDVRSGNSLSDLFCRE----DKPRCTNREVEIESENLGF-ELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K   N   VG  + +                 E G+GRRS DTDPRFS+DA       GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDA-------GRMSLDDSRYSF

Query:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD
        DEPRASWDG+LIG+T     P    M+SV+E A  + +  +    PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK  + + +
Subjt:  DEPRASWDGYLIGKTYPRITP----MVSVLEEAKFSGTGFE-KDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG
        D+K +SN+  +             I    +  + +K   ++GD                       KK  RW K  S+LG + ++  + + +D  S S  
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVG

Query:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG
          +V+R ++ESW ++R   NGE       K+ RS S                    NVS         RN+S RYS  + +NG+LRFYLTP+ RS+ + G
Subjt:  GNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPL-RSYSSRG

Query:  KPG
          G
Subjt:  KPG

AT5G58930.1 Protein of unknown function (DUF740) | e value: 2.6e-69 | %identity: 35.92
Query:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIES
        + CHRHP SKP TGFCA+CLRERL+ I+  +   S      S+ELRR +S+S    +A   + +   R+SCDVRS +   D             + E+  
Subjt:  STCHRHP-SKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIES

Query:  ENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNV
         ++ F +   +  + +    EG        + +   E+GE KT+KE IDLE R +  KN G+D            SVFS+ L K+  K   K + ++ N 
Subjt:  ENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNV

Query:  GAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSDEAE
                                  LGRRSCD DPR S+DAGR+       SFDEPRASWDG LIGKTYP++ P+ SV E+ K S      +   ++ +
Subjt:  GAAKAEDINPRVLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSDEAE

Query:  GSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTS
         +        PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L +S +  + ++    S ++ S
Subjt:  GSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTS

Query:  K--DSVPDAAVIDRKTF---KKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSC-----
        K    V    V  +  F   K    W K  +  G++Q+++  +K++   ++   +GGN ++  +AES  KLRRVA GE N  VS+KLIRSYSVS      
Subjt:  K--DSVPDAAVIDRKTF---KKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSC-----

Query:  ---RDPSKLAGFNAGNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR
           R  S + GF  G  S                           V   R+      ++   YSP+N  NG++RFYLTPL S+ +  K GKSR
Subjt:  ---RDPSKLAGFNAGNDS------------------------KLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYSSRGKPGKSR


Sequences
CDS sequence
ATGAATCTGCAGCTCAAAACTGTATCACATCGGCTTTCCACTTGTCACCGTCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCCGG
GATTGATCCCGACACGCAGCAGGAATCGCCTGTTCGGAACAACCATTCTTCAGAGCTTCGTCGGAGTAAATCCTTTTCTGCAGCCAAGTGTGAGGCTGGTATTGGACTAT
CGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGCTCCGGGAACTCCTTGTCGGACCTTTTCTGTCGTGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAA
TCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGTACGATCGATGATTTTTCTGG
AGAGGAGGGTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAGCGG
CTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAACAAAAAATGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGCGAAAGCAGAGGACATCAACCCTAGA
GTGCTTGAAATCAGGGAGACTCGTTCGGAGGTCGGAGAATATGGACTAGGAAGAAGATCTTGTGATACAGATCCAAGATTCTCTGTTGATGCAGGTAGAATGTCGTTGGA
TGATTCACGTTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCCATGGTATCAGTTTTGGAGGAGGCCAAAT
TTTCTGGTACTGGATTTGAGAAAGATGATCCTTCCGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATTCCTGGTGGATCGGCTCAGACTAAAGATTACTAC
ATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCGAGTTCACACAGAAAAGGGGCTTCCGGGGACTTTGATGACTTGAAATTAATATCAAATGCAAA
GGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAAGTTTTAATCACAGAGAAAGATTTGAACAACTCCCGCTCAAAAGCAACAACCAGAGATGGCGATTTGCGTA
GCACGGATGTTACCTCCAAAGATTCTGTTCCTGATGCAGCTGTGATTGACCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAGAAGCGAAGTGGTGAAAGTAAGTCTGATGATGAAGAAAGCAGTGTTGGAGGGAATGTGGTTGATCGGCCTATTGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGTGAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGCTTTAATGCTGGTAATGATTCGAAACTTA
ACGTTTCGAGATGGAGAGATGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCACCAAATAACTTTGATAATGGCTTACTAAGGTTCTATTTGACACCATTGAGG
AGCTACAGCAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGCCATGTAA
mRNA sequence
ATGAATCTGCAGCTCAAAACTGTATCACATCGGCTTTCCACTTGTCACCGTCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTCTTGTCTCCGGGAACGCCTTGCCGG
GATTGATCCCGACACGCAGCAGGAATCGCCTGTTCGGAACAACCATTCTTCAGAGCTTCGTCGGAGTAAATCCTTTTCTGCAGCCAAGTGTGAGGCTGGTATTGGACTAT
CGGAGGTTCAGCATCGGAAGTCTTGCGATGTTCGCTCCGGGAACTCCTTGTCGGACCTTTTCTGTCGTGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAA
TCCGAGAATTTAGGTTTTGAATTGCGTGAGGTTGTGGCTAATGAGAGACAATTTAGGGCTTCCGAGGGGATAATTGGACCGGCTTTGGGTACGATCGATGATTTTTCTGG
AGAGGAGGGTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTACGAGAAATTGCAGGGAGTGTTTGGGAAGCGG
CTTCAGTCTTCAGCAAGAAACTCGGCAAATGGAGGAAAAAACAAAAAATGAAGAATCTCAGTAACAATAGCAATGTAGGTGCGGCGAAAGCAGAGGACATCAACCCTAGA
GTGCTTGAAATCAGGGAGACTCGTTCGGAGGTCGGAGAATATGGACTAGGAAGAAGATCTTGTGATACAGATCCAAGATTCTCTGTTGATGCAGGTAGAATGTCGTTGGA
TGATTCACGTTATTCATTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAAAACTTATCCAAGGATTACGCCCATGGTATCAGTTTTGGAGGAGGCCAAAT
TTTCTGGTACTGGATTTGAGAAAGATGATCCTTCCGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGATAAGATTCCTGGTGGATCGGCTCAGACTAAAGATTACTAC
ATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCGAGTTCACACAGAAAAGGGGCTTCCGGGGACTTTGATGACTTGAAATTAATATCAAATGCAAA
GGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAAGTTTTAATCACAGAGAAAGATTTGAACAACTCCCGCTCAAAAGCAACAACCAGAGATGGCGATTTGCGTA
GCACGGATGTTACCTCCAAAGATTCTGTTCCTGATGCAGCTGTGATTGACCGAAAGACATTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAGAAGCGAAGTGGTGAAAGTAAGTCTGATGATGAAGAAAGCAGTGTTGGAGGGAATGTGGTTGATCGGCCTATTGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGTGAACTCTTGTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGCTGGCTTTAATGCTGGTAATGATTCGAAACTTA
ACGTTTCGAGATGGAGAGATGATTTTACATTGCAGAGGAATCGGAGTGTCAGGTATTCACCAAATAACTTTGATAATGGCTTACTAAGGTTCTATTTGACACCATTGAGG
AGCTACAGCAGCAGAGGCAAACCAGGAAAGAGCAGACCAAGAAGTTCTCCTTTCAATGTCAAGCATGCCATGTAA
Protein sequence
MNLQLKTVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQQESPVRNNHSSELRRSKSFSAAKCEAGIGLSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIE
SENLGFELREVVANERQFRASEGIIGPALGTIDDFSGEEGEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLSNNSNVGAAKAEDINPR
VLEIRETRSEVGEYGLGRRSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGFEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYY
MDSLSSLRRRKSFDRSSSHRKGASGDFDDLKLISNAKVSPATTELFYGAKVLITEKDLNNSRSKATTRDGDLRSTDVTSKDSVPDAAVIDRKTFKKVHRWRKVLSVLGMM
QKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEVNSCVSQKLIRSYSVSCRDPSKLAGFNAGNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
SYSSRGKPGKSRPRSSPFNVKHAM
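
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical and is not part of CuGenDB. The example input is the first 30 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter protein code, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted as-is. Codons containing unexpected characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGAATCTGCAGCTCAAAACTGTATCACAT"))  # prints MNLQLKTVSH
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence is a quick consistency check for a downloaded record.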