; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G27920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G27920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr1:22684076..22687894
RNA-Seq ExpressionCSPI01G27920
SyntenyCSPI01G27920
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637469.1 hypothetical protein CSA_004502 [Cucumis sativus]0.0e+0094.81Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF--------------------------ETLVRV
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF                          ETLVRV
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF--------------------------ETLVRV

Query:  LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM
        LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM
Subjt:  LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM

Query:  PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT
        PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRR+KLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT
Subjt:  PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT

Query:  ENSSVKKLSRKAKKKNKKLKLLSQLKSAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPN
        ENSSVKKLSRKAKK+NKKLKLLSQLKSAVEGDLLFCINKQGENENGN EDTTMNEPVNEALVSAAPT+STTENSSVKSLKRKAKRKNKKNKLVKYDLVPN
Subjt:  ENSSVKKLSRKAKKKNKKLKLLSQLKSAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPN

Query:  TDATQLKSAVENNDTH
        TDATQLKSAVENNDTH
Subjt:  TDATQLKSAVENNDTH

TYK10112.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.1e-26985.11Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKKN FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-LSQL
        QGVRVFDNTISN RAK+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK+NKK     SQL
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-LSQL

Query:  KSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHI
        KSAVE +          + NG EDTTM+E VN ALVS APT+STTENSS K   +KAK+KNK+ K+V+  LVPN DATQLKSAVENNDT +
Subjt:  KSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHI

XP_004135698.1 uncharacterized protein LOC101207150 [Cucumis sativus]0.0e+0098.98Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLK
        QGVRVFDNTISNRR+KLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKK+NKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLK

Query:  SAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH
        SAVEGDLLFCINKQGENENGN EDTTMNEPVNEALVSAAPT+STTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH
Subjt:  SAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH

XP_008450830.1 PREDICTED: uncharacterized protein LOC103492302 [Cucumis melo]1.6e-24189.31Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKKNPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLL
        QGVRVFDNTISN RAK+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLL

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida]1.8e-22473.05Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQ
        MAL LVESM+S+NPLKKNPFLGENYEFTLAQSIQNV+AEIRKGN  FSQFT+ FY+LIQARADPPLESIWFYSAL FRS   N KGDFLERVAAMKVLFQ
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQ

Query:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKN-DKSLDFSLITPFMDLISIWTQPNEGLDQ
        LV SCSAPCGSSKTI LLSPVVSEVYKL++DM GKDL S REKKAMREVKSLVEAILGF+NLSS +DSDKN D+SLDF+LITPF+DLIS+WT PNEGLDQ
Subjt:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKN-DKSLDFSLITPFMDLISIWTQPNEGLDQ

Query:  FLPLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVL
        FLPLV SEVR EFSSG CDVRRLAGVVIAETFLMKLCLDFN G SRQDLEKDL  W VGSIT+IRNFY FETLVR LLEATLPVTSLLST++EALLRKVL
Subjt:  FLPLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVL

Query:  YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKA
        YD+LILV+YSFLKPE AI+LPAEHVA LAVKRLILT+EAIEFYREHGDQ+RAISYLNAFSSS VSSQIIRW+KSQMPSNEN+  PNG SPK+ LEWLL+A
Subjt:  YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKA

Query:  EDQGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQ
        EDQGVRVFD TISNR AKLVLDTSKSVS EGDKVDDDLLFYID                                                         
Subjt:  EDQGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQ

Query:  LKSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT
                      KQGE+ENG+EDTTM+E VN ALVS A T+STTEN S K  +R  KRKN+K K VKYDLVP++D TQ +S  +NNDT
Subjt:  LKSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT

TrEMBL top hitse value%identityAlignment
A0A0A0M1W0 Uncharacterized protein0.0e+0098.98Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLK
        QGVRVFDNTISNRR+KLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKK+NKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLK

Query:  SAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH
        SAVEGDLLFCINKQGENENGN EDTTMNEPVNEALVSAAPT+STTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH
Subjt:  SAVEGDLLFCINKQGENENGN-EDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTH

A0A1S3BQ57 uncharacterized protein LOC1034923027.9e-24289.31Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKKNPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLL
        QGVRVFDNTISN RAK+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLL

A0A5D3CFU4 Pentatricopeptide repeat-containing protein1.5e-26985.11Show/hide
Query:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKKN FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-LSQL
        QGVRVFDNTISN RAK+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK+NKK     SQL
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-LSQL

Query:  KSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHI
        KSAVE +          + NG EDTTM+E VN ALVS APT+STTENSS K   +KAK+KNK+ K+V+  LVPN DATQLKSAVENNDT +
Subjt:  KSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHI

A0A6J1H8Q6 uncharacterized protein LOC1114615262.6e-20873.98Show/hide
Query:  MESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  K++PFLGENYEFTL QSIQNVLAEIR+GN+ FSQF + FY+LIQAR DPPLESIWFYSAL FRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LLSPVV EVYKL+ DM GKDL+S REKKAMREVKSLVE +LGF+NLSS +DSD+N +SLDF+L+TPF+DLISIW   NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAETFLMKLCLD N GRSRQDLE DL  WAVGSIT+I+NFY FETLVR LLEATLPV SLLST++EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLPA+HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRW+KSQ+PSNEN N P G SPK+FLEWLLKAED GVRVFD
Subjt:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRAKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGS-EEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-------
        +TISNRRAKLVLDTSKSVS     EG+ VDD+LLFYIDKQG N NGS EED  MDESVNAAL SAA TMSTT+N S KK  ++  KK KK+K        
Subjt:  NTISNRRAKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGS-EEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKL-------

Query:  ---LSQLKSAVEGDLLFCINKQGE--NENGNEDTTMNE
           +++L+SAVE +     + +GE  N + +ED+   E
Subjt:  ---LSQLKSAVEGDLLFCINKQGE--NENGNEDTTMNE

A0A6J1JEZ4 uncharacterized protein LOC1114851552.3e-20968.77Show/hide
Query:  MESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  K++PFLGENYEFTL QSIQNVLAEIR+GN+VFSQF + FY+LIQARADPPLESIWFYSAL FRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LL+PVV EVYKL+ DM GKDL S REKKAMREVKSLVE ILGF+NLSS +DSD+N +SLDF+L+TPF+DLISIWT  NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAETFL+KLCLD N GRSRQDLE DL  WAVGSIT+I+NFY FETLVR LLEATLPV SLLST++EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLPA+HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRW+KSQ+PS+EN+N P G SPK+FLEWL KAED GVRVFD
Subjt:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRAKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGS-EEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLKSA
        +TISNRRAKLVLDTSKSVS     EG+ VDD+LLFYIDKQG N NGS EED  MDE+VNAAL SAA TMSTT+N   KK  R+  KK KK+         
Subjt:  NTISNRRAKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGS-EEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLKSA

Query:  VEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT
                                                                     K  KYDLVPN+DAT+L+SAV++NDT
Subjt:  VEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11780.1 unknown protein1.0e-4428.23Show/hide
Query:  LAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARAD-PPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL
        L  SI+ +L + R G   FS F   F +++    + PPLE +WFYSA++F SS     D  + V      FQL+ S S      K ++LLSPVV ++ +L
Subjt:  LAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARAD-PPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL

Query:  VIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR
        VI  R             R+  SL+E I+ ++++   ++    D  +       F DL  +W         +  + L+ F+P     +R+E  S  C V 
Subjt:  VIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR

Query:  RLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLST--DNEALLRKVLYDALI-LVDYSFLKPEIAI
         LAG+V ++ FL+ LC  F+    R +L+KDL    +  I+   + + F+ ++++LLE  L +TSL+    ++EA L +++ +A+I  V+  FL P    
Subjt:  RLAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLST--DNEALLRKVLYDALI-LVDYSFLKPEIAI

Query:  NLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRAK
        +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I W+ SQ     + +    L+P  F+EWL+  E+QG RVF+   S   AK
Subjt:  NLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRAK

Query:  LVLDTSK---SVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLKSAVEGDLLF
         V+  S+   S+     K +++               ++DT M +  N +  S       + N+  +K  R  K+   K+KL     S ++ +  F
Subjt:  LVLDTSK---SVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLKSAVEGDLLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGGTCTTGTTGAATCGATGGAATCTATCAACCCTTTAAAGAAAAATCCTTTCCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAATGTCTT
AGCTGAAATTCGCAAAGGAAATGTTGTTTTTTCTCAATTTACGAAAAGATTCTACAAATTGATTCAAGCTAGAGCTGACCCACCATTAGAATCGATTTGGTTTTACTCCG
CATTAAAGTTTCGTAGTAGTTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCGGCTCCTTGTGGTTCT
TCGAAGACCATTACGTTGCTTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGACAAGGGAAAAGAAAGCAATGAGAGA
GGTTAAGTCTTTAGTTGAAGCAATTCTTGGTTTTATGAATCTGAGTTCACGTGAGGATTCGGACAAGAATGATAAATCTCTCGACTTCAGTTTGATTACTCCTTTTATGG
ATTTAATTAGTATCTGGACGCAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAGTTTAGTTCGGGAGAGTGTGATGTTCGTCGC
TTGGCTGGAGTTGTAATCGCTGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACTATGGACGTTCGAGGCAAGATTTGGAGAAAGATCTAATAACATGGGCTGTTGG
ATCAATAACTCAGATTAGAAATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTACCTGTGACGTCTCTTTTGAGTACTGACAATGAAGCTTTGT
TAAGGAAGGTTCTATATGATGCTCTTATATTGGTTGATTATTCGTTTTTGAAACCTGAGATAGCCATTAACTTACCTGCCGAGCATGTGGCGTTTCTTGCTGTTAAGAGA
TTGATTCTTACTTATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAATAGAGCCATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTAT
TAGATGGATCAAAAGCCAAATGCCTAGCAATGAGAATCTAAATTGCCCCAATGGGTTGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAA
GAGTATTTGACAATACCATTTCCAATCGTCGAGCCAAATTAGTTCTTGATACTTCCAAATCAGTCTCATTTGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATC
GATAAGCAAGGGGGAAATGTAAATGGAAGTGAGGAGGACACGACAATGGATGAATCGGTAAATGCAGCTCTCGCTTCTGCTGCTCCTACAATGTCAACGACTGAAAATAG
TTCGGTCAAGAAGCTAAGTAGAAAAGCAAAAAAAAAGAATAAAAAGTTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGGCGATCTTTTGTTTTGCATCAATAAGC
AAGGGGAAAATGAAAATGGAAATGAGGACACGACAATGAATGAACCAGTAAATGAAGCTCTTGTTTCTGCGGCTCCTACAATCTCAACGACTGAAAATAGTTCGGTAAAG
AGTCTAAAGAGAAAGGCAAAAAGAAAGAATAAAAAAAATAAATTGGTTAAGTACGATCTGGTTCCGAACACTGATGCTACCCAGTTGAAGTCAGCTGTTGAGAATAACGA
TACACACATTTAA
mRNA sequenceShow/hide mRNA sequence
CAAAACCCTAAAGTAGCTTTCGCCTTCTTCCCCTTCCAATTCTTTCTCCATTGTAATCGATTTTCAAATTTCATAAGTTTCAACCAGTTTCATGGCTTTGGGTCTTGTTG
AATCGATGGAATCTATCAACCCTTTAAAGAAAAATCCTTTCCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAATGTCTTAGCTGAAATTCGCAAAGGA
AATGTTGTTTTTTCTCAATTTACGAAAAGATTCTACAAATTGATTCAAGCTAGAGCTGACCCACCATTAGAATCGATTTGGTTTTACTCCGCATTAAAGTTTCGTAGTAG
TTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCGGCTCCTTGTGGTTCTTCGAAGACCATTACGTTGC
TTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGACAAGGGAAAAGAAAGCAATGAGAGAGGTTAAGTCTTTAGTTGAA
GCAATTCTTGGTTTTATGAATCTGAGTTCACGTGAGGATTCGGACAAGAATGATAAATCTCTCGACTTCAGTTTGATTACTCCTTTTATGGATTTAATTAGTATCTGGAC
GCAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAGTTTAGTTCGGGAGAGTGTGATGTTCGTCGCTTGGCTGGAGTTGTAATCG
CTGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACTATGGACGTTCGAGGCAAGATTTGGAGAAAGATCTAATAACATGGGCTGTTGGATCAATAACTCAGATTAGA
AATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTACCTGTGACGTCTCTTTTGAGTACTGACAATGAAGCTTTGTTAAGGAAGGTTCTATATGA
TGCTCTTATATTGGTTGATTATTCGTTTTTGAAACCTGAGATAGCCATTAACTTACCTGCCGAGCATGTGGCGTTTCTTGCTGTTAAGAGATTGATTCTTACTTATGAGG
CCATAGAGTTTTACAGGGAGCATGGAGATCAGAATAGAGCCATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTATTAGATGGATCAAAAGCCAA
ATGCCTAGCAATGAGAATCTAAATTGCCCCAATGGGTTGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAAGAGTATTTGACAATACCAT
TTCCAATCGTCGAGCCAAATTAGTTCTTGATACTTCCAAATCAGTCTCATTTGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATCGATAAGCAAGGGGGAAATG
TAAATGGAAGTGAGGAGGACACGACAATGGATGAATCGGTAAATGCAGCTCTCGCTTCTGCTGCTCCTACAATGTCAACGACTGAAAATAGTTCGGTCAAGAAGCTAAGT
AGAAAAGCAAAAAAAAAGAATAAAAAGTTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGGCGATCTTTTGTTTTGCATCAATAAGCAAGGGGAAAATGAAAATGG
AAATGAGGACACGACAATGAATGAACCAGTAAATGAAGCTCTTGTTTCTGCGGCTCCTACAATCTCAACGACTGAAAATAGTTCGGTAAAGAGTCTAAAGAGAAAGGCAA
AAAGAAAGAATAAAAAAAATAAATTGGTTAAGTACGATCTGGTTCCGAACACTGATGCTACCCAGTTGAAGTCAGCTGTTGAGAATAACGATACACACATTTAATCAAGG
CAAGTCGATAATCCTCTGAGAACGGTTGATTTTGAACTCGGGACTTTGCAACAGTTCAACCAAGGATTGTAAATGTAAAATGATTCACTATTTAATCAAGGAAAGGAAGA
AGATGGAAAGACGTTTTGGGTCATTATGATGCTGCTGCTTGGCTTCATGAAGATGAATCAGATCTTTAACTTCTATTTGATATAAATGTACTTTACTACCTTTGAGTAAT
TCTCTTTTAAGAGATTTTTCAAATTTAAACATCACAAAAATACTCTCTCCAAATACAACTCTTACTCTTAGACCTTTCATCTAACAAAGTTTTGCATGACATTGTATT
Protein sequenceShow/hide protein sequence
MALGLVESMESINPLKKNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGS
SKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEVREEFSSGECDVRR
LAGVVIAETFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKR
LILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLDTSKSVSFEGDKVDDDLLFYI
DKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKKNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEDTTMNEPVNEALVSAAPTISTTENSSVK
SLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHI