CuGenDBv2

Tan0006674 (gene) of Snake gourd v1 genome

Gene ID: Tan0006674
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UPF0503 protein At3g09070, chloroplastic-like
Genome location: LG05:3110998..3112719
RNA-Seq Expression: Tan0006674
Synteny: Tan0006674
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR008004 - Protein OCTOPUS-like


Homology
GenBank top hits (e value; % identity; alignment)
KAG6605428.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.9e-302; % identity: 92.86)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD P DEAEGSPMNVG+KIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL GFN GGNDSKL GLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

XP_022948149.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata] (e value: 2.1e-302; % identity: 93.03)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL GFN GGNDSKL GLRRRDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

XP_023007551.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima] (e value: 7.0e-298; % identity: 92.16)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKREAGIG+PEVQHRKSCDARSG+SLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDA GIDRKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL G N GGNDSKL GLRRRDDFTLQRNRSVRYSP NFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

XP_023532200.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 1.8e-298; % identity: 91.65)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKRE GIG+PEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLRE+AGSVW AASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEA+KPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDS SKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKD VPDAAGIDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN--GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL GFN  GGNDSKL GLRRRDD TLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN--GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

XP_038901013.1 protein OCTOPUS [Benincasa hispida] (e value: 1.1e-290; % identity: 89.55)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ+K +SHRLSTCHRHPSKPVTGFCA CLRERLAGID DT+QESP  N HSSSELRRSKS+SAAKREAGI Q EVQHRKSCD RSGNSLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT REVEIESENLG ELREVVANER FRAS G IGPAL TIDDFAGE+AEFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
         KNL ++ NVG  K E IKPRVLEIRETRSEVG+YGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G GF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFDDLKLISNAKVSPATTELFYGAKVLITEKDL +SHSKATR+G
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPS
        DLSGTDVTSKDSVPDAA IDRKTFKKVHRWRKVLSVLGM+QKRSGESKSDDEESSVGGNVVDRP AESWEKLRRVANGEAN  VSQKLIRSYSVSCRDPS
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPS

Query:  KLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY-NRGKPGKSRPRSSPFNVKHVM
        KL GFNG NDSKLN  R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY +RGKPGKSRPR+SPFNVKHVM
Subjt:  KLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY-NRGKPGKSRPRSSPFNVKHVM

TrEMBL top hits (e value; % identity; alignment)
A0A1S3CKL0 UPF0503 protein At3g09070, chloroplastic (e value: 5.0e-286; % identity: 87.98)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCRED
        MNLQ+K+VSHRLSTCHRHPSKPVTGFCA CLRERLAGIDPD + ESP  N HSSS ELRRSKS+SAAK EAGIGQ E+QHRKSCD RSGNSLSDLFCRED
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLG+ELREVV N RQFRAS G IGP L TIDDFAGEDAEFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG
        K KNLG++SNVGA K E IKPR LEIRETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G G
Subjt:  KMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG

Query:  FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD
        FEKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT D
Subjt:  FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD

Query:  GDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA  IDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP  ESWEKLRRVANGEAN  VSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDP

Query:  SKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM
        SKL GFNGGNDSKLN  R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY+RGK GKSRPR+SPFNVKHV+
Subjt:  SKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM

A0A5A7V3J1 UPF0503 protein (e value: 5.0e-286; % identity: 87.98)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCRED
        MNLQ+K+VSHRLSTCHRHPSKPVTGFCA CLRERLAGIDPD + ESP  N HSSS ELRRSKS+SAAK EAGIGQ E+QHRKSCD RSGNSLSDLFCRED
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCRED

Query:  KPRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ
        KPRCTN EVEIESENLG+ELREVV N RQFRAS G IGP L TIDDFAGEDAEFKT+KEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKL KWRKKQ
Subjt:  KPRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQ

Query:  KMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG
        K KNLG++SNVGA K E IKPR LEIRETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G G
Subjt:  KMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG

Query:  FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD
        FEKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFD+LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT D
Subjt:  FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRD

Query:  GDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDP
        GDLSGTDVTSKDSVPDA  IDRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP  ESWEKLRRVANGEAN  VSQKLIRSYSVSCRDP
Subjt:  GDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDP

Query:  SKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM
        SKL GFNGGNDSKLN  R RDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSY+RGK GKSRPR+SPFNVKHV+
Subjt:  SKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM

A0A6J1D4T1 UPF0503 protein At3g09070, chloroplastic (e value: 1.6e-287; % identity: 88.11)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNL  K V HRLSTC RHPSKPVTGFCAYCLRERLAGIDPDTRQE+P RN HSSSELRRSKSFSAAKR+AGIGQPEVQHRKSCD RSGNSLSDLFCRED+
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        P+C N+EVEIESENLG+ELREV ANERQFRAS GAIGP LDTIDDFAG +AEFKTMKEFIDLE RRKKN GRDLREIAGSVWEAASV SKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+ SN G  KTE  KPR+LE RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRTYPRL PMVSVLEEVK PG GF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        E   P DEAEG  MNVG+KIPGGSAQTKDYYM+SLSS+RRRKSFDRSSSHRKGASADFDDLK ISNAKVSPATTELFYGAKVLITEKDL DSHSK+TRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPS
        DLSG++VTSKDSVPDAAG DRKTFKK +RW+KVL VLGMMQKRS ESKSDDEE  VG N VDRP AESWEKLRRVANGEAN SVSQKLIRSYSVSCRDP+
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPS

Query:  KLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHV
        KL GFNGGND KLNGLRRRDD TLQRNRSVRYSPNNFDNGLLRFYLTPLRSY+RGKPGKSRPRSSPFNVKHV
Subjt:  KLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHV

A0A6J1G914 UPF0503 protein At3g09070, chloroplastic-like (e value: 1.0e-302; % identity: 93.03)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCTNREVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL GFN GGNDSKL GLRRRDDFTLQRNRS RYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

A0A6J1L0V5 UPF0503 protein At3g09070, chloroplastic-like (e value: 3.4e-298; % identity: 92.16)
Query:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK
        MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRSKSFSAAKREAGIG+PEVQHRKSCDARSG+SLSDLFCREDK
Subjt:  MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDK

Query:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
        PRCT +EVEIESENLG+ELREVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
Subjt:  PRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
        MKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRRSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH+KGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
        DLSGT++TSKDSVPDA GIDRKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANGEANGSVSQKLIRSYSVSCRD

Query:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
        PSKL G N GGNDSKL GLRRRDDFTLQRNRSVRYSP NFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Subjt:  PSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH

SwissProt top hits (e value; % identity; alignment)
Q9LFB9 Protein OCTOPUS-like (e value: 2.0e-53; % identity: 34.23)
Query:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHSSS-----------------ELRRSKSFSAAKREAGIGQPE
        HRLST C  HP +  +GFC  CL +RL+ +D      P +    P         A    SSS                 ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHSSS-----------------ELRRSKSFSAAKREAGIGQPE

Query:  VQHRKSCDARSGNSLSDLFCRE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAG
         Q R+SCD R  +   +L   E    DK     RE  +    L   E  E+  +E       G I    +   +   E+ E K MK+++DL  + KK + 
Subjt:  VQHRKSCDARSGNSLSDLFCRE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K     + VG  + ++                E G+GRRS D+DP       RFSVD+GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF

Query:  DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD
        DEPRASWDG+LIGRT     P    M+SV+E   L     +    PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK    + +
Subjt:  DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVGG
        D+K +SN+  +             I    ++ + +K  ++GD                       KK  RW K  S+LG + ++  + + +D  S S   
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVGG

Query:  NVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
         +V+R  +ESW ++R   NGE  G    K+ RS S VS R        +GG  +              RN+S RYS  + +NG+LRFYLTP+R
Subjt:  NVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR

Q9SS80 Protein OCTOPUS (e value: 1.3e-65; % identity: 33.73)
Query:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS----------------------ELRRSKSFSAAKREAGIG
        HRLST C+RHP +  TGFC  CL ERL+ +D             P T   +  + L   S                      ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS----------------------ELRRSKSFSAAKREAGIG

Query:  QPEVQHRKSCDARSGNSLSDLFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANERQF----------RASAGAIG
              R+SCD R  +SL +LF +++               +PR ++            E E + E L  E  E       F          R  +  I 
Subjt:  QPEVQHRKSCDARSGNSLSDLFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANERQF----------RASAGAIG

Query:  PALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS
           + I++         + E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Subjt:  PALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS

Query:  EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLEEVKLP----------GIG
        E+ +YG GRRSCD+DP              RFSVD+GR+SLDD RYSFDEPRASWDG LIGRT        P    M+SV+E+   P             
Subjt:  EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLEEVKLP----------GIG

Query:  FEKDGPSDEAEGSPMNVGEK--IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH
         E+  P          V +   IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL  ++ +S    + + G+        L+D++
Subjt:  FEKDGPSDEAEGSPMNVGEK--IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH

Query:  SKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK
        + A    D +G+       + D         KK  RW K  S+LG++ ++S     ++EE        + G +V+R  +ESW +LR   NG   G   + 
Subjt:  SKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK

Query:  LIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNR
        +  + +VS R         GG+  K+NGL R       RN+S RYSP N +NG+L+FYL  +++  R
Subjt:  LIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNR

Arabidopsis top hits (e value; % identity; alignment)
AT2G38070.1 Protein of unknown function (DUF740) (e value: 6.5e-68; % identity: 35.97)
Query:  HRLST-CHRHPSKPVTGFCAYCLRERLAGIDPDTRQE----SPARNLHSSS------------------ELRRSKSFSAAKREA-GIGQPEVQHRKSCDA
        HR ST C RHP +  TGFC  CL +RL+ +D   +      S ++   SSS                  ELRR+KSFSA+K EA  +G  E Q R+SCD 
Subjt:  HRLST-CHRHPSKPVTGFCAYCLRERLAGIDPDTRQE----SPARNLHSSS------------------ELRRSKSFSAAKREA-GIGQPEVQHRKSCDA

Query:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLG-----------YELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMK-EFIDLEFRRK
        R  N+L  LF  + +     +E       EI+ E +             E+     NE+  +            ID+   E+ E +T K E   +EF  +
Subjt:  RSGNSLSDLFCREDKPRCTNRE------VEIESENLG-----------YELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMK-EFIDLEFRRK

Query:  ---KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVL--EIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSL-----
           K   RD +EIAGS W AASVFSKKL KWR+KQK+K      N+GA  +     + +  ++R+T+SE+ EYG GRRSCD+DPRFS+D GR SL     
Subjt:  ---KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVL--EIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSL-----

Query:  --DDSRYSFDEPRASWDGYLIGRTYP--RLTPMVSVLEEVKLPGIGFEKDG--PSDEA-EGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH
          DD RYSF+EPRASWDGYLIGR     R+  M+SV+E+  +       D   P +++ + S   + E +PGGSAQT++YY+DS SS RRRKS DRSSS 
Subjt:  --DDSRYSFDEPRASWDGYLIGRTYP--RLTPMVSVLEEVKLPGIGFEKDG--PSDEA-EGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH

Query:  RK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGES
        RK      A+ D+LKL  + +                  KDL  SHS + RD D    +   +  V +  G      K+  + R   ++ G++ +++G +
Subjt:  RK---GASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGES

Query:  KSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRS-YSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYL
        K ++EE   G   VDR ++ SW       N E       K+IRS  SVS R     GG          GL+R    ++    S +   +  +NG+L+FYL
Subjt:  KSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRS-YSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYL

Query:  TPLRSYNRGKPGKSRPRSSP
        TP +   RG    + P S P
Subjt:  TPLRSYNRGKPGKSRPRSSP

AT3G09070.1 Protein of unknown function (DUF740) (e value: 9.4e-67; % identity: 33.73)
Query:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS----------------------ELRRSKSFSAAKREAGIG
        HRLST C+RHP +  TGFC  CL ERL+ +D             P T   +  + L   S                      ELRR+KSFSA+K   G  
Subjt:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS----------------------ELRRSKSFSAAKREAGIG

Query:  QPEVQHRKSCDARSGNSLSDLFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANERQF----------RASAGAIG
              R+SCD R  +SL +LF +++               +PR ++            E E + E L  E  E       F          R  +  I 
Subjt:  QPEVQHRKSCDARSGNSLSDLFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANERQF----------RASAGAIG

Query:  PALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS
           + I++         + E K +K++IDL+ + KK +      +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Subjt:  PALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS

Query:  EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLEEVKLP----------GIG
        E+ +YG GRRSCD+DP              RFSVD+GR+SLDD RYSFDEPRASWDG LIGRT        P    M+SV+E+   P             
Subjt:  EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTY-------PRLTPMVSVLEEVKLP----------GIG

Query:  FEKDGPSDEAEGSPMNVGEK--IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH
         E+  P          V +   IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL  ++ +S    + + G+        L+D++
Subjt:  FEKDGPSDEAEGSPMNVGEK--IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSH

Query:  SKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK
        + A    D +G+       + D         KK  RW K  S+LG++ ++S     ++EE        + G +V+R  +ESW +LR   NG   G   + 
Subjt:  SKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK

Query:  LIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNR
        +  + +VS R         GG+  K+NGL R       RN+S RYSP N +NG+L+FYL  +++  R
Subjt:  LIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNR

AT3G46990.1 Protein of unknown function (DUF740) (e value: 3.8e-68; % identity: 35.11)
Query:  STCHRHPS-KPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDAR-SGNSLSDLFCREDKPRCTNREVEI
        S+CHRHPS KP +GFCA CLRERL  I+     +S +     + ELRR +S+S   R A +   +   R+SCD R S +SL DLF  +D+ R  +   + 
Subjt:  STCHRHPS-KPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDAR-SGNSLSDLFCREDKPRCTNREVEI

Query:  ESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGED--------AEFKTMKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK
           +L  E  E    E  +              +D  G D         E KTMKEFIDL++R   KKN G+DL+EI       ASV S++L  +   ++
Subjt:  ESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGED--------AEFKTMKEFIDLEFRR--KKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK

Query:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF
             D    G                          GR S D DPR S D GR+       SF++PR+SWDG LI ++Y +LT + +V E+ K    G 
Subjt:  MKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF

Query:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG
        E++   ++         EK PGG+ QTK+YY DS    RRR+SFDRS S ++    + D+L+ ISNAKVSP T  LF+GAK+L+TEK+L+DS+  + ++ 
Subjt:  EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDG

Query:  DLSGTDVTSKDSVPDAAGIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRS
             ++ SK  +  AAG + K    V       +W K  ++ G++Q R  E+K++   ++   + GN V+   AES  KLRRV  GE N  VS+KL++S
Subjt:  DLSGTDVTSKDSVPDAAGIDRKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRS

Query:  YSVSCR--------DPSKLGGFNGGN---------------------DSKLNGLR-RRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKS
        YSVS R          + + GF GG                      D  +NG+  +++   LQRN +V   S  N +  + RFYL+P++S+   K GKS
Subjt:  YSVSCR--------DPSKLGGFNGGN---------------------DSKLNGLR-RRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKS

Query:  R
        R
Subjt:  R

AT5G01170.1 Protein of unknown function (DUF740) (e value: 1.4e-54; % identity: 34.23)
Query:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHSSS-----------------ELRRSKSFSAAKREAGIGQPE
        HRLST C  HP +  +GFC  CL +RL+ +D      P +    P         A    SSS                 ELRR+KSFSA   E   G  E
Subjt:  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHSSS-----------------ELRRSKSFSAAKREAGIGQPE

Query:  VQHRKSCDARSGNSLSDLFCRE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAG
         Q R+SCD R  +   +L   E    DK     RE  +    L   E  E+  +E       G I    +   +   E+ E K MK+++DL  + KK + 
Subjt:  VQHRKSCDARSGNSLSDLFCRE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAG

Query:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF
          +++ AGS + AASVFSKKL KW++KQK+K     + VG  + ++                E G+GRRS D+DP       RFSVD+GR+S+DDSRYS 
Subjt:  RDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF

Query:  DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD
        DEPRASWDG+LIGRT     P    M+SV+E   L     +    PS +      +    IPGGS QT+DYY    SS RRRKS DRS+S RK    + +
Subjt:  DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD

Query:  DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVGG
        D+K +SN+  +             I    ++ + +K  ++GD                       KK  RW K  S+LG + ++  + + +D  S S   
Subjt:  DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEES-SVGG

Query:  NVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
         +V+R  +ESW ++R   NGE  G    K+ RS S VS R        +GG  +              RN+S RYS  + +NG+LRFYLTP+R
Subjt:  NVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR

AT5G58930.1 Protein of unknown function (DUF740) | e value: 4.2e-75 | %identity: 36.93
Query:  STCHRHP-SKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIE
        + CHRHP SKP TGFCA CLRERL+ I      E+ + ++ +S+ELRR +S+S   R+A     +   R+SCD RS +   D             + E+ 
Subjt:  STCHRHP-SKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIE

Query:  SENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSN
          ++ + +   +  + +     G        + +   ED E KTMKE IDLE R +  KN G+D            SVFS+ L K+  K   K + D  N
Subjt:  SENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSN

Query:  VGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGFEKDGPSDEA
                                   LGRRSCD DPR S+D GR+       SFDEPRASWDG LIG+TYP+L P+ SV E+VK            ++ 
Subjt:  VGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGFEKDGPSDEA

Query:  EGSPMNVGEK-IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVT
         G  +   EK  PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAKVSP T  LF+GAK+L+TE++L+DS+  + ++      ++ 
Subjt:  EGSPMNVGEK-IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVT

Query:  SKDSVPDAAGIDRK-----TFKKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSC----
        SK     AAG  +K       K    W K  +  G++Q+++  +K++   ++   +GGN ++   AES  KLRRVA GE NG VS+KLIRSYSVS     
Subjt:  SKDSVPDAAGIDRK-----TFKKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSC----

Query:  ----RDPSKLGGFNGG-----------------------NDSKLNGLRRRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKSR
            R  S + GF GG                        D   +G+  + +  LQ +  +  YSP+N  NG++RFYLTPL S+   K GKSR
Subjt:  ----RDPSKLGGFNGG-----------------------NDSKLNGLRRRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKSR


Sequences
CDS sequence
ATGAATCTGCAGGTCAAAGCTGTATCTCATCGGCTTTCCACTTGTCACCGCCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTACTGCCTCCGTGAACGCCTCGCCGG
GATTGATCCCGATACGCGTCAGGAATCGCCTGCTCGGAACCTGCATTCTTCATCGGAGCTCCGTCGGAGTAAATCTTTCTCTGCGGCGAAGCGTGAGGCCGGCATCGGAC
AACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCCGGGAACTCGTTGTCGGACCTTTTTTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATC
GAATCCGAGAACTTAGGTTATGAATTGCGTGAGGTTGTGGCAAATGAGAGGCAATTTAGGGCTTCTGCGGGGGCAATTGGACCGGCTTTGGATACGATCGACGATTTTGC
TGGAGAGGATGCTGAGTTCAAGACGATGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAG
CGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACGATAGCAATGTAGGTGCGGCGAAAACAGAGGCTATCAAGCCG
AGAGTGCTTGAAATCAGGGAGACGCGTTCCGAGGTCGGAGAATACGGATTGGGAAGAAGGTCTTGTGATTCAGATCCAAGATTCTCTGTCGATGTAGGTAGAATGTCGTT
GGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGTAGAACTTATCCAAGGCTTACGCCGATGGTTTCAGTTTTGGAGGAGGTCA
AATTACCTGGTATTGGATTTGAGAAAGACGGTCCTTCTGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGAAAAGATTCCCGGTGGATCGGCTCAGACTAAAGATTAC
TATATGGATTCATTGTCTTCTGTAAGGCGGAGGAAGAGTTTTGATCGTTCAAGTTCACACAGAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGC
AAAGGTATCTCCTGCAACCACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACAAGAGATGGCGATTTGAGTG
GCACTGATGTTACTTCAAAAGATTCTGTTCCCGATGCAGCTGGGATTGATCGAAAGACGTTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAAAAGCGAAGTGGAGAGAGTAAGTCCGATGATGAAGAAAGCAGTGTTGGAGGAAATGTGGTTGATCGGCCTTATGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGGTGGATTTAATGGCGGTAATGATTCGAAACTGA
ACGGTTTGAGGCGGAGAGACGACTTTACGTTGCAGAGGAATCGGAGCGTCAGGTATTCACCAAATAACTTCGATAATGGCTTATTAAGGTTCTACTTGACACCATTGAGA
AGCTACAACAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATGTAA
mRNA sequence
ATGAATCTGCAGGTCAAAGCTGTATCTCATCGGCTTTCCACTTGTCACCGCCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTACTGCCTCCGTGAACGCCTCGCCGG
GATTGATCCCGATACGCGTCAGGAATCGCCTGCTCGGAACCTGCATTCTTCATCGGAGCTCCGTCGGAGTAAATCTTTCTCTGCGGCGAAGCGTGAGGCCGGCATCGGAC
AACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCCGGGAACTCGTTGTCGGACCTTTTTTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATC
GAATCCGAGAACTTAGGTTATGAATTGCGTGAGGTTGTGGCAAATGAGAGGCAATTTAGGGCTTCTGCGGGGGCAATTGGACCGGCTTTGGATACGATCGACGATTTTGC
TGGAGAGGATGCTGAGTTCAAGACGATGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAG
CGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACGATAGCAATGTAGGTGCGGCGAAAACAGAGGCTATCAAGCCG
AGAGTGCTTGAAATCAGGGAGACGCGTTCCGAGGTCGGAGAATACGGATTGGGAAGAAGGTCTTGTGATTCAGATCCAAGATTCTCTGTCGATGTAGGTAGAATGTCGTT
GGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGTAGAACTTATCCAAGGCTTACGCCGATGGTTTCAGTTTTGGAGGAGGTCA
AATTACCTGGTATTGGATTTGAGAAAGACGGTCCTTCTGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGAAAAGATTCCCGGTGGATCGGCTCAGACTAAAGATTAC
TATATGGATTCATTGTCTTCTGTAAGGCGGAGGAAGAGTTTTGATCGTTCAAGTTCACACAGAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGC
AAAGGTATCTCCTGCAACCACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACAAGAGATGGCGATTTGAGTG
GCACTGATGTTACTTCAAAAGATTCTGTTCCCGATGCAGCTGGGATTGATCGAAAGACGTTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATG
CAAAAGCGAAGTGGAGAGAGTAAGTCCGATGATGAAGAAAGCAGTGTTGGAGGAAATGTGGTTGATCGGCCTTATGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAA
TGGAGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGGTGGATTTAATGGCGGTAATGATTCGAAACTGA
ACGGTTTGAGGCGGAGAGACGACTTTACGTTGCAGAGGAATCGGAGCGTCAGGTATTCACCAAATAACTTCGATAATGGCTTATTAAGGTTCTACTTGACACCATTGAGA
AGCTACAACAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATGTAA
Protein sequence
MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEI
ESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKP
RVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGFEKDGPSDEAEGSPMNVGEKIPGGSAQTKDY
YMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMM
QKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR
SYNRGKPGKSRPRSSPFNVKHVM
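
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the table construction are illustrative assumptions, and it assumes the standard nuclear codon table applies to this gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks (as in the wrapped listing above) are ignored.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 15 nt of the CDS listed above.
print(translate("ATGAATCTGCAGGTCAAAGCT"))
print(translate("AAACATGTCATGTAA"))
```

Passing the full CDS (with or without its line breaks) to `translate` and comparing the result with the protein sequence gives a quick consistency check between the two listings.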