CuGenDBv2

Sgr023453 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023453
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: UPF0503 protein At3g09070, chloroplastic
Genome location: tig00000892:3422428..3424146
RNA-Seq Expression: Sgr023453
Synteny: Sgr023453
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR008004 - Protein OCTOPUS-like
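
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The helper names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parts of a 'contig:start..end' location string (hypothetical helper)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string; raise ValueError if it does not match."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00000892:3422428..3424146")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1719 bp; with half-open coordinates it would be one base shorter.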


Homology
GenBank top hits    e value    % identity    Alignment
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]    9.1e-282    86.41
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ K VSHRLS+C RHP +PVTGFCASCLRERLAGID DTR E+P  NQHS SELRRSKSFS AKREA I +P+VQHRKSCD R GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLGFEL E  A ER+FRASEG +GP L+ IDD +G +AEFKTMKEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++++VG  KTEAIKPRVLE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE K PG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLS++RRRKSFDRS+SH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDL D HSKATRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD
        DLSGT +TSKDSVPDAAGIDRKTFKK +RWRKVLSVLGM+QKRS   SKSDDEESCVGGN  DRPF AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH
        PSKLAGFN GGNDSKL GLRRRDDF LQRNRSVR SPNNFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH

XP_022148337.1 UPF0503 protein At3g09070, chloroplastic [Momordica charantia]    4.2e-295    89.49
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNL AKTV HRLSTC RHP KPVTGFCA CLRERLAGIDPDTR ETP  NQHS+SELRRSKSFS AKR+A I QP+VQHRKSCDVR GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        PKCPNQEVEIESENLGFEL E AANER+FRASEGA+GPPL+TIDD AGGEAEFKTMKEFIDLE RRKKN GRDLREIAGS WEAASV SKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++SN G VKTE  KPR+LE RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRL PMVSVLEEVKFPG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        EN DPPDE EG  MNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRS+SHRKGASADFDDLK ISNAKVSPATTELFYGAKVLITEKDLND HSK+TRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPSK
        DLSG+EVTSKDSVPDAAG DRKTFKKAYRW+KVL VLGM+QKRSESKSDDEE CVG N+ DRP AESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP+K
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPSK

Query:  LAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV
        LAGFNGGND KLNGLRRRDD  LQRNRSVR SPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV
Subjt:  LAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]    4.5e-281    86.06
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ K VSHRLS+C RHP +PVTGFCASCLRERLAGID DTR E+P  NQHS SELRRSKSFS AKREA I +P+VQHRKSCD R GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLGFEL E  A ER+FRASEG +GP L+ IDD +G +AEFKTMKEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++++VG  KTEAIKPRVLE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE K PG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVG+KIPGGSAQTKDYYM+SLS++RRRKSFDRS+SH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDL D HSKATRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD
        DLSGT +TSKDSVPDAAGIDRKTFKK +RWRKVLSVLGM+QKRS   SKSDDEESCVGGN  DRPF AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH
        PSKLAGFN GGNDSKL GLRRRDDF LQRNRS R SPNNFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH

XP_023532200.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo]    6.1e-278    85.04
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ K VSHRLS+C RHP +PVTGFCASCLRERLAGID DTR E+P  NQHS SELRRSKSFS AKRE  I +P+VQHRKSCD R GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLGFEL E  A ER+FRASEG +GP L+ IDD +G +AEFKTMKEFIDLEFRRKKNAGRDLRE+AGS W AASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++++VG  KTEA+KPRVLE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE K PG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLS++RRRKSFDRS+SH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDL D  SKATRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD
        DLSGT +TSKD VPDAAGIDRKTFKK +RWRKVLSVLGM+QKRS   SKSDDEESCVGGN  DRPF AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD

Query:  PSKLAGFN--GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH
        PSKLAGFN  GGNDSKL GLRRRDD  LQRNRSVR SPNNFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN--GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH

XP_038901013.1 protein OCTOPUS [Benincasa hispida]    4.2e-279    86.06
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ KT+SHRLSTCHRHP KPVTGFCASCLRERLAGID DT+ E+P  N HS+SELRRSKS+S AKREA I Q +VQHRKSCDVR GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C  +EVEIESENLG EL E  ANER FRASEG +GP L TIDD AG EAEFKT+KEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
         KNL +N NVG VK E IKPRVLE+RETRSEVG+YGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE KF G GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLSSLRRRKSFDRS SHRKGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDLN+ HSKATR+G
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPS
        DLSGT+VTSKDSVPDAA IDRKTFKK +RWRKVLSVLGMIQKRS ESKSDDEES VGGN  DRP AESWEKLRRVANGEAN  VSQKLIRSYSVSCRDPS
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPS

Query:  KLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSY-SRGKPGKSRPRSSPFNVKHVI
        KLAGFNG NDSKLN  R RDDF LQRNRSVR SPNNFDNGLLRFYLTPLRSY SRGKPGKSRPR+SPFNVKHV+
Subjt:  KLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSY-SRGKPGKSRPRSSPFNVKHVI

TrEMBL top hits    e value    % identity    Alignment
A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic    7.5e-274    84.49
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSAS-ELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCRED
        MNLQ K+VSHRLSTCHRHP KPVTGFCASCLRERLAGIDPD + E+P  N HS+S ELRRSKS+S AK EA I Q ++QHRKSCDVR GNSLSDLFCRED
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSAS-ELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCRED

Query:  GPKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQ
         P+C N EVEIESENLGFEL E   N R+FRASEG +GP L TIDD AG +AEFKT+KEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKL KWRKKQ
Subjt:  GPKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQ

Query:  KMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGG
        K KNLG+NSNVG VK E IKPR LE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE KF G G
Subjt:  KMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGG

Query:  FENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRD
        FE DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLSSLRRRKSFDRS SHRKGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDLN    KAT D
Subjt:  FENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRD

Query:  GDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP
        GDLSGT+VTSKDSVPDA  IDRK+FKK +RWRKVLSVLGMIQKR+ ESKSDDEES V GN  DRP  ESWEKLRRVANGEAN  VSQKLIRSYSVSCRDP
Subjt:  GDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP

Query:  SKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHVI
        SKLAGFNGGNDSKLN  R RDDF LQRNRSVR SPNNFDNGLLRFYLTPLRSYSRGK GKSRPR+SPFNVKHVI
Subjt:  SKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHVI

A0A5A7V3J1 UPF0503 protein    7.5e-274    84.49
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSAS-ELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCRED
        MNLQ K+VSHRLSTCHRHP KPVTGFCASCLRERLAGIDPD + E+P  N HS+S ELRRSKS+S AK EA I Q ++QHRKSCDVR GNSLSDLFCRED
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSAS-ELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCRED

Query:  GPKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQ
         P+C N EVEIESENLGFEL E   N R+FRASEG +GP L TIDD AG +AEFKT+KEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKL KWRKKQ
Subjt:  GPKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQ

Query:  KMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGG
        K KNLG+NSNVG VK E IKPR LE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE KF G G
Subjt:  KMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGG

Query:  FENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRD
        FE DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLSSLRRRKSFDRS SHRKGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDLN    KAT D
Subjt:  FENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRD

Query:  GDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP
        GDLSGT+VTSKDSVPDA  IDRK+FKK +RWRKVLSVLGMIQKR+ ESKSDDEES V GN  DRP  ESWEKLRRVANGEAN  VSQKLIRSYSVSCRDP
Subjt:  GDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS-ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP

Query:  SKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHVI
        SKLAGFNGGNDSKLN  R RDDF LQRNRSVR SPNNFDNGLLRFYLTPLRSYSRGK GKSRPR+SPFNVKHVI
Subjt:  SKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHVI

A0A6J1D4T1 UPF0503 protein At3g09070, chloroplastic    2.0e-295    89.49
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNL AKTV HRLSTC RHP KPVTGFCA CLRERLAGIDPDTR ETP  NQHS+SELRRSKSFS AKR+A I QP+VQHRKSCDVR GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        PKCPNQEVEIESENLGFEL E AANER+FRASEGA+GPPL+TIDD AGGEAEFKTMKEFIDLE RRKKN GRDLREIAGS WEAASV SKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++SN G VKTE  KPR+LE RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRL PMVSVLEEVKFPG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        EN DPPDE EG  MNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRS+SHRKGASADFDDLK ISNAKVSPATTELFYGAKVLITEKDLND HSK+TRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPSK
        DLSG+EVTSKDSVPDAAG DRKTFKKAYRW+KVL VLGM+QKRSESKSDDEE CVG N+ DRP AESWEKLRRVANGEANCSVSQKLIRSYSVSCRDP+K
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPSK

Query:  LAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV
        LAGFNGGND KLNGLRRRDD  LQRNRSVR SPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV
Subjt:  LAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like    2.2e-281    86.06
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ K VSHRLS+C RHP +PVTGFCASCLRERLAGID DTR E+P  NQHS SELRRSKSFS AKREA I +P+VQHRKSCD R GNSLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLGFEL E  A ER+FRASEG +GP L+ IDD +G +AEFKTMKEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++++VG  KTEAIKPRVLE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE K PG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVG+KIPGGSAQTKDYYM+SLS++RRRKSFDRS+SH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDL D HSKATRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD
        DLSGT +TSKDSVPDAAGIDRKTFKK +RWRKVLSVLGM+QKRS   SKSDDEESCVGGN  DRPF AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH
        PSKLAGFN GGNDSKL GLRRRDDF LQRNRS R SPNNFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH

A0A6J1L0V5 UPF0503 protein At3g09070, chloroplastic-like    5.0e-278    85.71
Query:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG
        MNLQ K VSHRLS+C RHP +PVTGFCASCLRERLAGID DTR E+P  NQHS SELRRSKSFS AKREA I +P+VQHRKSCD R G+SLSDLFCRED 
Subjt:  MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDG

Query:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK
        P+C  +EVEIESENLGFEL E  A ER+FRASEG +GP L+ IDD +G +AEFKTMKEFIDLEFRRKKNAGRDLREIAGS WEAASVFSKKLGKWRKKQK
Subjt:  PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQK

Query:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF
        MKNLG++++VG  KTEAIKPRVLE+RETRSEVGEYGLGRRSCDTDPR SVDAGRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE K PG GF
Subjt:  MKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGF

Query:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG
        E DDP DE EGSPMNVGDKIPGGSAQTKDYYM+SLSS+RRRKSFDRS+SH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDL D HSKATRDG
Subjt:  ENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDG

Query:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD
        DLSGT +TSKDSVPDA GIDRKTFKK +RWRKVLSVLGM QKRS   SKSDDEESCVGGN  DRPF AESWEKLRRVANGEAN SVSQKLIRSYSVSCRD
Subjt:  DLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS--ESKSDDEESCVGGNAADRPF-AESWEKLRRVANGEANCSVSQKLIRSYSVSCRD

Query:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH
        PSKLAG N GGNDSKL GLRRRDDF LQRNRSVR SP NFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKH
Subjt:  PSKLAGFN-GGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSRGKPGKSRPRSSPFNVKH

SwissProt top hits    e value    % identity    Alignment
Q9LFB9 Protein OCTOPUS-like    4.9e-52    33
Query:  HRLST-CHRHPCKPVTGFCASCLRERLAGID------PDTRLETP----------------AGNQHSAS----------ELRRSKSFSGAKREACIRQPD
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                +G  +S            ELRR+KSFS    E      +
Subjt:  HRLST-CHRHPCKPVTGFCASCLRERLAGID------PDTRLETP----------------AGNQHSAS----------ELRRSKSFSGAKREACIRQPD

Query:  VQHRKSCDVRWGNSLSDLFCREDG--PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPL-NTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRD
         Q R+SCDVR  +   +L   E     K   +  E     +  E+ E A  E +    E   G  +     +I   E E K MK+++DL  + KK +   
Subjt:  VQHRKSCDVRWGNSLSDLFCREDG--PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPL-NTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRD

Query:  LREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDA-------GRMSLDDSRYSFDE
        +++ AGSF+ AASVFSKKL KW++KQK+K     + VG  + ++                E G+GRRS DTDPR S+DA       GR+S+DDSRYS DE
Subjt:  LREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDA-------GRMSLDDSRYSFDE

Query:  PRASWDGYLIGRTYPRLTP----MVSVLEEVKFPGGGFENDDPPD----EVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADF
        PRASWDG+LIGRT     P    M+SV+E         +    P       +  P+ +   IPGGS QT+DYY    SS RRRKS DRSNS RK    + 
Subjt:  PRASWDGYLIGRTYPRLTP----MVSVLEEVKFPGGGFENDDPPD----EVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADF

Query:  DDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGN
        +D+K +SN+  +             I    +    +K  ++GD                       KK+ RW K  S+LG I ++ +   +++      +
Subjt:  DDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGN

Query:  A--ADRPFAESWEKLRRVANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLR
        A   +R  +ESW ++R   NGE       K+ RS S VS R        +GG  +              RN+S R S  + +NG+LRFYLTP+R
Subjt:  A--ADRPFAESWEKLRRVANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLR

Q9SS80 Protein OCTOPUS    2.2e-65    34.56
Query:  HRLST-CHRHPCKPVTGFCASCLRERLAGID-------------PDT-------RLETPAGNQHSAS---------------ELRRSKSFSGAKREACIR
        HRLST C+RHP +  TGFC SCL ERL+ +D             P T        L  P+GN                    ELRR+KSFS +K      
Subjt:  HRLST-CHRHPCKPVTGFCASCLRERLAGID-------------PDT-------RLETPAGNQHSAS---------------ELRRSKSFSGAKREACIR

Query:  QPDVQHRKSCDVRWGNSLSDLFCREDGPKCP------------------------NQEVEIESENLGFELCE----AAANEREFRASEGALGPPLNTIDD
              R+SCDVR  +SL +LF +++    P                        N E E ES++   E  E      A + E     G L       D+
Subjt:  QPDVQHRKSCDVRWGNSLSDLFCREDGPKCP------------------------NQEVEIESENLGFELCE----AAANEREFRASEGALGPPLNTIDD

Query:  IAG---------------GEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNL--GDNSNVGVVKTEAIKPRVLEMRET
        I                  E E K +K++IDL+ + KK +      +  SFW AASVFSKKL KWR+ QKMK    G +   G  +    KP   ++R+T
Subjt:  IAG---------------GEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNL--GDNSNVGVVKTEAIKPRVLEMRET

Query:  RSEVGEYGLGRRSCDTDPRLSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLE-------------EVK
        +SE+ +YG GRRSCDTDPR S+DA              GR+SLDD RYSFDEPRASWDG LIGRT        P    M+SV+E             +++
Subjt:  RSEVGEYGLGRRSCDTDPRLSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLE-------------EVK

Query:  FPGGGFENDDPPDEVEGSPMNVGDK--IPGGSAQTKDYYMESLSSLRRRKSFDRSNSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKD
        FP    E   PP  V      V D   IPGGS QT+DYY +  SS RRRKS DRS+S  RK A+   AD D+ KL  ++ +S                  
Subjt:  FPGGGFENDDPPDEVEGSPMNVGDK--IPGGSAQTKDYYMESLSSLRRRKSFDRSNSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKD

Query:  LNDPHSKATRDGDLSGTEVTSKDSVPDAAGI--DRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDDEES-------CVGGNAADRPFAESWEKLRRV
          D +S + RD +    E     S  + A +  DRK       KK+ RW K  S+LG+I ++S +K ++EE         + G   +R  +ESW +LR  
Subjt:  LNDPHSKATRDGDLSGTEVTSKDSVPDAAGI--DRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDDEES-------CVGGNAADRPFAESWEKLRRV

Query:  ANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSR
          G        +++RS S VS R         GG+  K+NGL R       RN+S R SP N +NG+L+FYL  +++  R
Subjt:  ANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSR

Arabidopsis top hits    e value    % identity    Alignment
AT2G38070.1 Protein of unknown function (DUF740)    4.1e-70    36.6
Query:  HRLST-CHRHPCKPVTGFCASCLRERLAGID----------------PDTRLETPAGNQHSAS------ELRRSKSFSGAKREACIRQPDVQHRKSCDVR
        HR ST C RHP +  TGFC SCL +RL+ +D                P +     A  + S+S      ELRR+KSFS +K EA         R+SCDVR
Subjt:  HRLST-CHRHPCKPVTGFCASCLRERLAGID----------------PDTRLETPAGNQHSAS------ELRRSKSFSGAKREACIRQPDVQHRKSCDVR

Query:  WGNSLSDLFCRE--------DGPKCPNQEVEIESENL---------GFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMK-EFIDLEFRRK-
          N+L  LF  +        +G      E+++E  N            E+     NE++ +        P + ID+I   E E +T K E   +EF  + 
Subjt:  WGNSLSDLFCRE--------DGPKCPNQEVEIESENL---------GFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMK-EFIDLEFRRK-

Query:  --KNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMK-----NLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSL---
          K   RD +EIAGSFW AASVFSKKL KWR+KQK+K     NLG  S+   V+    K    ++R+T+SE+ EYG GRRSCDTDPR S+DAGR SL   
Subjt:  --KNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMK-----NLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSL---

Query:  ----DDSRYSFDEPRASWDGYLIGRTYP--RLTPMVSVLEEVKFPGGGFENDDPPDEVEGSPM----NVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRS
            DD RYSF+EPRASWDGYLIGR     R+  M+SV+E+           D    VE SP      + + +PGGSAQT++YY++S SS RRRKS DRS
Subjt:  ----DDSRYSFDEPRASWDGYLIGRTYP--RLTPMVSVLEEVKFPGGGFENDDPPDEVEGSPM----NVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRS

Query:  NSHRK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS
        +S RK      A+ D+LKL  + +           AK L++       HS + RD D    E   +  V +  G      K+  + R   ++ G++ +++
Subjt:  NSHRK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRS

Query:  ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRS-YSVSCRDPSKLAGFNGGNDSKLNGLRRRD-DFVLQRNRSVRCSPNNFDNGLLR
         +K ++EE   G    DR F+ SW       N E       K+IRS  SVS R     +G  GG      GL+R   D  +   + V    +  +NG+L+
Subjt:  ESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRS-YSVSCRDPSKLAGFNGGNDSKLNGLRRRD-DFVLQRNRSVRCSPNNFDNGLLR

Query:  FYLTPLRSYSRGKPGKSRPRSSP
        FYLTP +   RG    + P S P
Subjt:  FYLTPLRSYSRGKPGKSRPRSSP

AT3G09070.1 Protein of unknown function (DUF740)    1.6e-66    34.56
Query:  HRLST-CHRHPCKPVTGFCASCLRERLAGID-------------PDT-------RLETPAGNQHSAS---------------ELRRSKSFSGAKREACIR
        HRLST C+RHP +  TGFC SCL ERL+ +D             P T        L  P+GN                    ELRR+KSFS +K      
Subjt:  HRLST-CHRHPCKPVTGFCASCLRERLAGID-------------PDT-------RLETPAGNQHSAS---------------ELRRSKSFSGAKREACIR

Query:  QPDVQHRKSCDVRWGNSLSDLFCREDGPKCP------------------------NQEVEIESENLGFELCE----AAANEREFRASEGALGPPLNTIDD
              R+SCDVR  +SL +LF +++    P                        N E E ES++   E  E      A + E     G L       D+
Subjt:  QPDVQHRKSCDVRWGNSLSDLFCREDGPKCP------------------------NQEVEIESENLGFELCE----AAANEREFRASEGALGPPLNTIDD

Query:  IAG---------------GEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNL--GDNSNVGVVKTEAIKPRVLEMRET
        I                  E E K +K++IDL+ + KK +      +  SFW AASVFSKKL KWR+ QKMK    G +   G  +    KP   ++R+T
Subjt:  IAG---------------GEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNL--GDNSNVGVVKTEAIKPRVLEMRET

Query:  RSEVGEYGLGRRSCDTDPRLSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLE-------------EVK
        +SE+ +YG GRRSCDTDPR S+DA              GR+SLDD RYSFDEPRASWDG LIGRT        P    M+SV+E             +++
Subjt:  RSEVGEYGLGRRSCDTDPRLSVDA--------------GRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLE-------------EVK

Query:  FPGGGFENDDPPDEVEGSPMNVGDK--IPGGSAQTKDYYMESLSSLRRRKSFDRSNSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKD
        FP    E   PP  V      V D   IPGGS QT+DYY +  SS RRRKS DRS+S  RK A+   AD D+ KL  ++ +S                  
Subjt:  FPGGGFENDDPPDEVEGSPMNVGDK--IPGGSAQTKDYYMESLSSLRRRKSFDRSNSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKD

Query:  LNDPHSKATRDGDLSGTEVTSKDSVPDAAGI--DRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDDEES-------CVGGNAADRPFAESWEKLRRV
          D +S + RD +    E     S  + A +  DRK       KK+ RW K  S+LG+I ++S +K ++EE         + G   +R  +ESW +LR  
Subjt:  LNDPHSKATRDGDLSGTEVTSKDSVPDAAGI--DRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDDEES-------CVGGNAADRPFAESWEKLRRV

Query:  ANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSR
          G        +++RS S VS R         GG+  K+NGL R       RN+S R SP N +NG+L+FYL  +++  R
Subjt:  ANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRSYSR

AT3G46990.1 Protein of unknown function (DUF740)    2.1e-74    35.47
Query:  STCHRHP-CKPVTGFCASCLRERLAGIDPDT----RLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVR-WGNSLSDLFCRED-------
        S+CHRHP  KP +GFCASCLRERL  I+  +     ++TP        ELRR +S+S   R A +   D   R+SCDVR   +SL DLF  +D       
Subjt:  STCHRHP-CKPVTGFCASCLRERLAGIDPDT----RLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVR-WGNSLSDLFCRED-------

Query:  --GPKCPNQEVEIESENLGFELCEAAANEREFRASEGAL----GPPLNTIDDIAGGEAEFKTMKEFIDLEFRR--KKNAGRDLREIAGSFWEAASVFSKK
           P  P+ + E E E            E ++   E       G P   ++       E KTMKEFIDL++R   KKN G+DL+EI       ASV S++
Subjt:  --GPKCPNQEVEIESENLGFELCEAAANEREFRASEGAL----GPPLNTIDDIAGGEAEFKTMKEFIDLEFRR--KKNAGRDLREIAGSFWEAASVFSKK

Query:  LGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLE
        L  +   ++     D+   G+V                        GR S D DPRLS D GR+       SF++PR+SWDG LI ++Y +LT + +V E
Subjt:  LGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLE

Query:  EVKFPGGGFENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLND
        + K   G          VE   +   +K PGG+ QTK+YY +S    RRR+SFDRS S ++    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L D
Subjt:  EVKFPGGGFENDDPPDEVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLND

Query:  PHSKATRDGDLSGTEVTSKDSVPDAAGIDRK------TFKKAYRWRKVLSVLGMIQKRSESKSD---DEESCVGGNAADRPFAESWEKLRRVANGEANCS
         +  + ++      E+ SK  +  AAG + K        K   +W K  ++ G+IQ+++E+K++   ++   + GNA +   AES  KLRRV  GE N  
Subjt:  PHSKATRDGDLSGTEVTSKDSVPDAAGIDRK------TFKKAYRWRKVLSVLGMIQKRSESKSD---DEESCVGGNAADRPFAESWEKLRRVANGEANCS

Query:  VSQKLIRSYSVSCR--------DPSKLAGFNGGN---------------------DSKLNGLR-RRDDFVLQRNRSV-RCSPNNFDNGLLRFYLTPLRSY
        VS+KL++SYSVS R          + ++GF GG                      D  +NG+  +++  +LQRN +V  CS  N +  + RFYL+P++S+
Subjt:  VSQKLIRSYSVSCR--------DPSKLAGFNGGN---------------------DSKLNGLR-RRDDFVLQRNRSV-RCSPNNFDNGLLRFYLTPLRSY

Query:  SRGKPGKSR
           K GKSR
Subjt:  SRGKPGKSR

AT5G01170.1 Protein of unknown function (DUF740)    3.5e-53    33
Query:  HRLST-CHRHPCKPVTGFCASCLRERLAGID------PDTRLETP----------------AGNQHSAS----------ELRRSKSFSGAKREACIRQPD
        HRLST C  HP +  +GFC SCL +RL+ +D      P +    P                +G  +S            ELRR+KSFS    E      +
Subjt:  HRLST-CHRHPCKPVTGFCASCLRERLAGID------PDTRLETP----------------AGNQHSAS----------ELRRSKSFSGAKREACIRQPD

Query:  VQHRKSCDVRWGNSLSDLFCREDG--PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPL-NTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRD
         Q R+SCDVR  +   +L   E     K   +  E     +  E+ E A  E +    E   G  +     +I   E E K MK+++DL  + KK +   
Subjt:  VQHRKSCDVRWGNSLSDLFCREDG--PKCPNQEVEIESENLGFELCEAAANEREFRASEGALGPPL-NTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRD

Query:  LREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDA-------GRMSLDDSRYSFDE
        +++ AGSF+ AASVFSKKL KW++KQK+K     + VG  + ++                E G+GRRS DTDPR S+DA       GR+S+DDSRYS DE
Subjt:  LREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSNVGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDA-------GRMSLDDSRYSFDE

Query:  PRASWDGYLIGRTYPRLTP----MVSVLEEVKFPGGGFENDDPPD----EVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADF
        PRASWDG+LIGRT     P    M+SV+E         +    P       +  P+ +   IPGGS QT+DYY    SS RRRKS DRSNS RK    + 
Subjt:  PRASWDGYLIGRTYPRLTP----MVSVLEEVKFPGGGFENDDPPD----EVEGSPMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADF

Query:  DDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGN
        +D+K +SN+  +             I    +    +K  ++GD                       KK+ RW K  S+LG I ++ +   +++      +
Subjt:  DDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMIQKRSESKSDDEESCVGGN

Query:  A--ADRPFAESWEKLRRVANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLR
        A   +R  +ESW ++R   NGE       K+ RS S VS R        +GG  +              RN+S R S  + +NG+LRFYLTP+R
Subjt:  A--ADRPFAESWEKLRRVANGEANCSVSQKLIRSYS-VSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLR

AT5G58930.1 Protein of unknown function (DUF740) (e value: 7.1e-75, % identity: 37.27)
Query:  STCHRHP-CKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDGPKCPNQEVEIE
        + CHRHP  KP TGFCA+CLRERL+ I      E  + +  +++ELRR +S+S   R+A     D   R+SCDVR  +   D             + E+ 
Subjt:  STCHRHP-CKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDGPKCPNQEVEIE

Query:  SENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSN
          ++ F +      + E    EG         ++I  GE   KTMKE IDLE R +  KN G+D            SVFS+ L K+  K   K + D+ N
Subjt:  SENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSN

Query:  VGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGFENDDPPDEV
                                   LGRRSCD DPRLS+DAGR+       SFDEPRASWDG LIG+TYP+L P+ SV E+VK           P+++
Subjt:  VGVVKTEAIKPRVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGFENDDPPDEV

Query:  EGSPMNVGDK-IPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVT
         G  +   +K  PGG+AQT+DYY++S    RRR+SFDRS+ H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L D +  + ++      E+ 
Subjt:  EGSPMNVGDK-IPGGSAQTKDYYMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVT

Query:  SKDSVPDAAGIDRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDD---EESC-VGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSC----
        SK     AAG  +K       K    W K  +  G+IQ++++   ++   E+S  +GGN  +   AES  KLRRVA GE N  VS+KLIRSYSVS     
Subjt:  SKDSVPDAAGIDRK-----TFKKAYRWRKVLSVLGMIQKRSESKSDD---EESC-VGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSC----

Query:  ----RDPSKLAGFNGG-----------------------NDSKLNGLRRRDDFVLQRNRSV-RCSPNNFDNGLLRFYLTPLRSYSRGKPGKSR
            R  S + GF GG                        D   +G+  + + +LQ +  +   SP+N  NG++RFYLTPL S+   K GKSR
Subjt:  ----RDPSKLAGFNGG-----------------------NDSKLNGLRRRDDFVLQRNRSV-RCSPNNFDNGLLRFYLTPLRSYSRGKPGKSR


Sequences
CDS sequence
ATGAATCTTCAGGCCAAAACAGTATCGCATCGGCTTTCAACATGTCACCGTCATCCGTGCAAACCGGTGACCGGGTTCTGTGCCTCCTGCCTCCGGGAACGCCTTGCCGG
GATTGATCCCGATACGCGGCTGGAAACTCCTGCAGGGAACCAACATTCTGCGTCGGAGCTCCGTCGCAGCAAATCTTTCTCTGGAGCGAAGCGTGAGGCCTGCATCAGGC
AACCCGACGTGCAGCATCGTAAGTCATGCGATGTTCGCTGGGGGAACTCCTTGTCGGACCTTTTCTGTCGCGAAGATGGCCCGAAATGTCCGAATCAGGAGGTGGAGATC
GAATCCGAGAACTTAGGTTTTGAATTGTGTGAGGCTGCAGCAAATGAGAGAGAATTTAGGGCTTCTGAGGGGGCGCTTGGACCGCCTCTGAATACAATCGACGATATTGC
TGGAGGGGAGGCTGAGTTCAAGACAATGAAAGAATTTATAGATCTCGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTTTCTGGGAAG
CAGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACAATAGCAATGTAGGTGTGGTGAAAACAGAAGCTATCAAGCCG
AGAGTGCTTGAAATGAGGGAGACTCGCTCCGAGGTTGGAGAATACGGATTGGGGAGAAGGTCTTGTGATACAGATCCAAGATTATCTGTCGATGCGGGTAGGATGTCGCT
GGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAGAACTTATCCAAGGCTGACGCCGATGGTTTCAGTTTTAGAGGAGGTCA
AATTCCCTGGTGGTGGATTTGAGAATGATGATCCTCCTGATGAAGTAGAAGGGTCCCCGATGAATGTAGGAGATAAGATCCCTGGTGGATCGGCTCAGACCAAAGACTAC
TATATGGAATCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCAAATTCACACAGAAAAGGGGCGTCAGCAGACTTTGATGACTTGAAATTAATATCAAACGC
AAAAGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAACGACCCTCACTCGAAAGCAACCAGAGATGGCGATTTGAGTG
GCACTGAAGTTACTTCCAAAGATTCTGTTCCTGATGCAGCTGGGATCGATAGAAAGACGTTCAAGAAGGCGTATAGGTGGCGTAAAGTATTAAGTGTTTTGGGTATGATT
CAAAAGCGAAGTGAAAGCAAGTCCGATGATGAAGAAAGCTGTGTTGGAGGAAATGCGGCCGATCGGCCTTTCGCTGAGTCGTGGGAAAAGCTGAGGCGAGTAGCTAATGG
AGAAGCAAACTGTTCTGTTAGCCAGAAGCTCATACGCAGCTACAGCGTAAGCTGTAGAGATCCCAGCAAACTGGCTGGGTTCAATGGTGGTAATGATTCGAAACTGAACG
GTTTGAGACGGAGAGACGATTTTGTGTTGCAGAGGAATCGGAGTGTCAGGTGTTCACCAAATAACTTTGACAATGGCTTATTAAGGTTCTACTTGACACCATTGAGGAGC
TACAGCAGAGGCAAACCGGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATATAA
mRNA sequence
ATGAATCTTCAGGCCAAAACAGTATCGCATCGGCTTTCAACATGTCACCGTCATCCGTGCAAACCGGTGACCGGGTTCTGTGCCTCCTGCCTCCGGGAACGCCTTGCCGG
GATTGATCCCGATACGCGGCTGGAAACTCCTGCAGGGAACCAACATTCTGCGTCGGAGCTCCGTCGCAGCAAATCTTTCTCTGGAGCGAAGCGTGAGGCCTGCATCAGGC
AACCCGACGTGCAGCATCGTAAGTCATGCGATGTTCGCTGGGGGAACTCCTTGTCGGACCTTTTCTGTCGCGAAGATGGCCCGAAATGTCCGAATCAGGAGGTGGAGATC
GAATCCGAGAACTTAGGTTTTGAATTGTGTGAGGCTGCAGCAAATGAGAGAGAATTTAGGGCTTCTGAGGGGGCGCTTGGACCGCCTCTGAATACAATCGACGATATTGC
TGGAGGGGAGGCTGAGTTCAAGACAATGAAAGAATTTATAGATCTCGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTTTCTGGGAAG
CAGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACAATAGCAATGTAGGTGTGGTGAAAACAGAAGCTATCAAGCCG
AGAGTGCTTGAAATGAGGGAGACTCGCTCCGAGGTTGGAGAATACGGATTGGGGAGAAGGTCTTGTGATACAGATCCAAGATTATCTGTCGATGCGGGTAGGATGTCGCT
GGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGAAGAACTTATCCAAGGCTGACGCCGATGGTTTCAGTTTTAGAGGAGGTCA
AATTCCCTGGTGGTGGATTTGAGAATGATGATCCTCCTGATGAAGTAGAAGGGTCCCCGATGAATGTAGGAGATAAGATCCCTGGTGGATCGGCTCAGACCAAAGACTAC
TATATGGAATCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGTTCAAATTCACACAGAAAAGGGGCGTCAGCAGACTTTGATGACTTGAAATTAATATCAAACGC
AAAAGTATCTCCTGCAACTACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAACGACCCTCACTCGAAAGCAACCAGAGATGGCGATTTGAGTG
GCACTGAAGTTACTTCCAAAGATTCTGTTCCTGATGCAGCTGGGATCGATAGAAAGACGTTCAAGAAGGCGTATAGGTGGCGTAAAGTATTAAGTGTTTTGGGTATGATT
CAAAAGCGAAGTGAAAGCAAGTCCGATGATGAAGAAAGCTGTGTTGGAGGAAATGCGGCCGATCGGCCTTTCGCTGAGTCGTGGGAAAAGCTGAGGCGAGTAGCTAATGG
AGAAGCAAACTGTTCTGTTAGCCAGAAGCTCATACGCAGCTACAGCGTAAGCTGTAGAGATCCCAGCAAACTGGCTGGGTTCAATGGTGGTAATGATTCGAAACTGAACG
GTTTGAGACGGAGAGACGATTTTGTGTTGCAGAGGAATCGGAGTGTCAGGTGTTCACCAAATAACTTTGACAATGGCTTATTAAGGTTCTACTTGACACCATTGAGGAGC
TACAGCAGAGGCAAACCGGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATATAA
Protein sequence
MNLQAKTVSHRLSTCHRHPCKPVTGFCASCLRERLAGIDPDTRLETPAGNQHSASELRRSKSFSGAKREACIRQPDVQHRKSCDVRWGNSLSDLFCREDGPKCPNQEVEI
ESENLGFELCEAAANEREFRASEGALGPPLNTIDDIAGGEAEFKTMKEFIDLEFRRKKNAGRDLREIAGSFWEAASVFSKKLGKWRKKQKMKNLGDNSNVGVVKTEAIKP
RVLEMRETRSEVGEYGLGRRSCDTDPRLSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKFPGGGFENDDPPDEVEGSPMNVGDKIPGGSAQTKDY
YMESLSSLRRRKSFDRSNSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLNDPHSKATRDGDLSGTEVTSKDSVPDAAGIDRKTFKKAYRWRKVLSVLGMI
QKRSESKSDDEESCVGGNAADRPFAESWEKLRRVANGEANCSVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNGLRRRDDFVLQRNRSVRCSPNNFDNGLLRFYLTPLRS
YSRGKPGKSRPRSSPFNVKHVI