; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G001180 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G001180
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionUnknown protein
Genome locationGy14Chr3:865274..870418
RNA-Seq ExpressionCsGy3G001180
SyntenyCsGy3G001180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146104.1 uncharacterized protein LOC101206874 [Cucumis sativus]0.0100Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL

Query:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
        DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
Subjt:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE

Query:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
        TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
Subjt:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL

Query:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
        HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
Subjt:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS

Query:  WLLSTDDYTVPWNA
        WLLSTDDYTVPWNA
Subjt:  WLLSTDDYTVPWNA

XP_008448630.1 PREDICTED: uncharacterized protein LOC103490747 isoform X1 [Cucumis melo]0.092.29Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLF+ ESTFNDEQDVSS KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG------------------NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG                  NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG------------------NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK

Query:  HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEA
        HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVK+F+DLMLKDD KDVWEVINEFL HESFSSLCQHLLVTLE+A
Subjt:  HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEA

Query:  DFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDG
        DFCNFLK+LCKLLRPRIETKDFGNSSFMFEVIL KYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAI+HKIS+ISSN HCLFPLLKECDG
Subjt:  DFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDG

Query:  RKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDEL
        RKKTIEMIKWLGLQSWVLHYR SEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFE  NRARA+SKKRKKG KGRKRRK NFDSQ+SCDDEL
Subjt:  RKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDEL

Query:  LDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        LD DI+NDRMDLKLNTGSW LSTDDYTVPWNA
Subjt:  LDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

XP_008448632.1 PREDICTED: uncharacterized protein LOC103490747 isoform X2 [Cucumis melo]0.095.53Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLF+ ESTFNDEQDVSS KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL

Query:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
        DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVK+F+DLMLKDD KDVWEVINEFL HESFSSLCQHLLVTLE+ADFCNFLK+LCKLLRPRIE
Subjt:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE

Query:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
        TKDFGNSSFMFEVIL KYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAI+HKIS+ISSN HCLFPLLKECDGRKKTIEMIKWLGLQSWVL
Subjt:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL

Query:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
        HYR SEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFE  NRARA+SKKRKKG KGRKRRK NFDSQ+SCDDELLD DI+NDRMDLKLNTGS
Subjt:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS

Query:  WLLSTDDYTVPWNA
        W LSTDDYTVPWNA
Subjt:  WLLSTDDYTVPWNA

XP_022948246.1 uncharacterized protein LOC111451855 [Cucurbita moschata]2.97e-30383.81Show/hide
Query:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI
        MIDLFL E  FN+E DV S KLRISLLS LESVLWKLL  GGRSEVRLWLSNTIAS+TSISPQHQR+LFMT LR KPLKW FAS LLQM FEKR REAG+
Subjt:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI

Query:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN
        LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFI+NVPEFW SN
Subjt:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN

Query:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD
        EF+ESLKDGEILFLDTKFFVKY  D MLKDD +DVW+ INEFLT E FSSLCQHLL+TLEEADFC FLKMLCKLLRP  ETKDFGNSSF+FEV+L+KYGD
Subjt:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD

Query:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD
        +ES+DQILLLNAVINQGRQLLR ++DED EE+LDEIK I+++IS+ISSN H L PLLKEC  RKKTIE+IKWLGLQSWVLHYRMS+ECQT ELWESLFVD
Subjt:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD

Query:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        NGI FRKSNEY LLDHSC SEDDGFE  N A  +SKKRK+G KGRKRRK +FD +DSCDDELLDFDIK D+ DLKLNTGSWLLS D+YTVPWNA
Subjt:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

XP_038891380.1 uncharacterized protein LOC120080808 [Benincasa hispida]0.092.11Show/hide
Query:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI
        MIDLFL E  FNDEQDVSS KLRISLLS+LESVLWKLLT GGRSEVRLWL+N+IASVTSISPQHQRDLFMTLLRRKP KWAFASQLLQMLFEKRSREAGI
Subjt:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI

Query:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN
        LIAKRSYIMEKFFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFI+NVPEFWSSN
Subjt:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN

Query:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD
        EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFL HESFSSL QHLLVTLEEADFC+FLKMLCKLLRPRIETKDFGN SF FEVIL+KYGD
Subjt:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD

Query:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD
        SESIDQILLLNAV+NQGRQ+LRLLRDED EEQLDEIKAIVHKIS+ISSN   LFPLL ECDGRK+TIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD
Subjt:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD

Query:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        NGIGF+KSNEY LLDHS  SEDDGFE  NRA A+SK+RKKGGKGRKRRK +FD +DSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
Subjt:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

TrEMBL top hitse value%identityAlignment
A0A0A0L6D1 Uncharacterized protein0.0100Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL

Query:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
        DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
Subjt:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE

Query:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
        TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
Subjt:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL

Query:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
        HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
Subjt:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS

Query:  WLLSTDDYTVPWNA
        WLLSTDDYTVPWNA
Subjt:  WLLSTDDYTVPWNA

A0A1S3BK58 uncharacterized protein LOC103490747 isoform X10.092.29Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLF+ ESTFNDEQDVSS KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG------------------NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG                  NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEG------------------NPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK

Query:  HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEA
        HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVK+F+DLMLKDD KDVWEVINEFL HESFSSLCQHLLVTLE+A
Subjt:  HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEA

Query:  DFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDG
        DFCNFLK+LCKLLRPRIETKDFGNSSFMFEVIL KYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAI+HKIS+ISSN HCLFPLLKECDG
Subjt:  DFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDG

Query:  RKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDEL
        RKKTIEMIKWLGLQSWVLHYR SEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFE  NRARA+SKKRKKG KGRKRRK NFDSQ+SCDDEL
Subjt:  RKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDEL

Query:  LDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        LD DI+NDRMDLKLNTGSW LSTDDYTVPWNA
Subjt:  LDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

A0A1S3BKS3 uncharacterized protein LOC103490747 isoform X20.095.53Show/hide
Query:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
        MSFGSQSRKKAFNRKLYRYRMIDLF+ ESTFNDEQDVSS KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW
Subjt:  MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKW

Query:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
        AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Subjt:  AFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL

Query:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE
        DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVK+F+DLMLKDD KDVWEVINEFL HESFSSLCQHLLVTLE+ADFCNFLK+LCKLLRPRIE
Subjt:  DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIE

Query:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL
        TKDFGNSSFMFEVIL KYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAI+HKIS+ISSN HCLFPLLKECDGRKKTIEMIKWLGLQSWVL
Subjt:  TKDFGNSSFMFEVILTKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVL

Query:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS
        HYR SEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFE  NRARA+SKKRKKG KGRKRRK NFDSQ+SCDDELLD DI+NDRMDLKLNTGS
Subjt:  HYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGS

Query:  WLLSTDDYTVPWNA
        W LSTDDYTVPWNA
Subjt:  WLLSTDDYTVPWNA

A0A6J1CWP0 uncharacterized protein LOC111014910 isoform X11.48e-29284.01Show/hide
Query:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI
        MIDLFL E  FNDE+DV S KLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQLLQM FEKR R AGI
Subjt:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI

Query:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN
        LIAKRSYIMEKFFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV+NFI++VPEFWSSN
Subjt:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN

Query:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD
        EFAESLKDGEIL LDT+FFVKYFVDLMLKDD KDVWE INE+L  ESFSSLC+HLL+TLEEADFC FLKMLCKLL PRIETKD G+SSF+ E+IL++YGD
Subjt:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD

Query:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD
         ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAIV +IS+ISSN H L PLLKEC+ R+KTIE+IKWLGLQSWVL YRMSEECQTPELWESLF D
Subjt:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD

Query:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        NGIGFRKSNEY LLDHSC SEDDGFEL + A A+  KR+KG KGRKRRK NFD     D+ELL FD KNDR+DLKLNTGSWLLS DDYTVPWNA
Subjt:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

A0A6J1G8R1 uncharacterized protein LOC1114518551.44e-30383.81Show/hide
Query:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI
        MIDLFL E  FN+E DV S KLRISLLS LESVLWKLL  GGRSEVRLWLSNTIAS+TSISPQHQR+LFMT LR KPLKW FAS LLQM FEKR REAG+
Subjt:  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGI

Query:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN
        LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFI+NVPEFW SN
Subjt:  LIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN

Query:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD
        EF+ESLKDGEILFLDTKFFVKY  D MLKDD +DVW+ INEFLT E FSSLCQHLL+TLEEADFC FLKMLCKLLRP  ETKDFGNSSF+FEV+L+KYGD
Subjt:  EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD

Query:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD
        +ES+DQILLLNAVINQGRQLLR ++DED EE+LDEIK I+++IS+ISSN H L PLLKEC  RKKTIE+IKWLGLQSWVLHYRMS+ECQT ELWESLFVD
Subjt:  SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVD

Query:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        NGI FRKSNEY LLDHSC SEDDGFE  N A  +SKKRK+G KGRKRRK +FD +DSCDDELLDFDIK D+ DLKLNTGSWLLS D+YTVPWNA
Subjt:  NGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48340.1 unknown protein8.4e-14553.52Show/hide
Query:  MIDLFLQESTFNDEQDVSSE-KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAG
        M++LFL E  +ND+   SS   + + LL++L S +  L+T G RSE RLWL + ++++ SISP  Q ++FM LLR KP K  F SQ+L M+FEKR R+ G
Subjt:  MIDLFLQESTFNDEQDVSSE-KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAG

Query:  ILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSS
         L+AKRSYI+EKFFEGN +RI +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV +T++NF+ NVPEFWSS
Subjt:  ILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSS

Query:  NEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYG
        NEFAESLKDG+ILFLDTKFF+  F+  M ++D  DVW+ + EFL  ESFSSL QHLL+TLEE D C FL++L     P IE+ D G+SS    V+L++Y 
Subjt:  NEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYG

Query:  DSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFV
        D+ESID++LLL+++INQGRQLLRL+RDE+G ++ + +K  + +I     N      +L+E   + K I++IK LGL SW +H+R+SEECQTP+ WE LF 
Subjt:  DSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFV

Query:  DNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNF--DSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA
        +NGI FR+S+++ LL ++  SE+   +  +R+R   K+ K+  K RK++K     D  D  DDELL         DL   + SWLLSTD ++  W +
Subjt:  DNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNF--DSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNA

AT5G48340.2 unknown protein4.9e-14553.29Show/hide
Query:  MIDLFLQESTFNDEQDVSSE-KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAG
        M++LFL E  +ND+   SS   + + LL++L S +  L+T G RSE RLWL + ++++ SISP  Q ++FM LLR KP K  F SQ+L M+FEKR R+ G
Subjt:  MIDLFLQESTFNDEQDVSSE-KLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAG

Query:  ILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSS
         L+AKRSYI+EKFFEGN +RI +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV +T++NF+ NVPEFWSS
Subjt:  ILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSS

Query:  NEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYG
        NEFAESLKDG+ILFLDTKFF+  F+  M ++D  DVW+ + EFL  ESFSSL QHLL+TLEE D C FL++L     P IE+ D G+SS    V+L++Y 
Subjt:  NEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYG

Query:  DSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFV
        D+ESID++LLL+++INQGRQLLRL+RDE+G ++ + +K  + +I     N      +L+E   + K I++IK LGL SW +H+R+SEECQTP+ WE LF 
Subjt:  DSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFV

Query:  DNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNF--DSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNAFLI
        +NGI FR+S+++ LL ++  SE+   +  +R+R   K+ K+  K RK++K     D  D  DDELL         DL   + SWLLSTD ++  W +   
Subjt:  DNGIGFRKSNEYLLLDHSCSSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNF--DSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNAFLI

Query:  G
        G
Subjt:  G


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTTGGTTCTCAATCGAGAAAGAAAGCGTTCAATAGGAAGTTGTATAGGTATAGAATGATCGATCTGTTTCTACAAGAGTCCACCTTCAATGACGAACAGGATGT
TAGCTCTGAGAAGTTGAGAATTTCTCTGTTAAGTGAATTAGAATCTGTTTTATGGAAGTTGCTGACTTGTGGAGGACGGTCGGAGGTTCGATTATGGCTTTCTAATACCA
TAGCCAGTGTGACATCGATCAGTCCCCAGCATCAGCGGGACCTGTTTATGACATTACTGAGACGGAAGCCATTGAAGTGGGCCTTTGCTTCTCAATTACTGCAAATGTTA
TTTGAAAAGAGATCAAGGGAGGCAGGGATTCTCATTGCTAAGAGAAGCTACATAATGGAAAAGTTTTTCGAAGGAAACCCAAGACGAATATCTCAGTGGTTTTCCAATTT
TGCCACAAATGGTGCATCAGATCATGGAAAAGGTGCCAAGGCCTTGGCACAGTTTGCTTTTGTAAATAGAGACATTTGTTGGGAGGAGCTTGAGTGGAAGGGGAAACATG
GGCAGTCACCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTCAGAATGTACCAGAGTTTTGGTCTTCCAAT
GAATTTGCTGAGTCACTAAAAGATGGTGAAATTTTGTTTCTTGACACGAAATTCTTTGTGAAATATTTTGTTGATCTGATGCTTAAAGATGATCCAAAAGATGTTTGGGA
AGTCATTAATGAGTTCCTGACGCATGAGTCATTTTCTTCACTGTGTCAACATCTCCTTGTTACTCTTGAAGAGGCTGATTTCTGCAACTTCCTAAAAATGTTATGCAAAC
TTCTTAGGCCTAGAATAGAAACCAAGGATTTTGGTAATTCGTCTTTCATGTTTGAGGTCATACTTACCAAGTATGGTGACTCTGAATCTATTGATCAGATTTTACTATTA
AATGCTGTCATTAATCAAGGACGCCAACTTCTACGCCTTTTACGTGATGAAGATGGGGAAGAACAATTGGATGAAATCAAGGCTATTGTCCACAAAATTTCATCGATCTC
AAGCAACTGTCATTGTTTGTTCCCATTGTTAAAGGAGTGTGACGGGAGGAAAAAGACAATAGAGATGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAA
TGTCAGAGGAATGCCAGACTCCTGAGTTATGGGAGTCCTTGTTTGTTGATAACGGCATAGGATTCCGAAAATCTAATGAATATTTGTTGTTGGATCACAGTTGCTCGTCA
GAAGATGATGGGTTTGAACTGTATAATAGAGCACGAGCTCAATCTAAGAAGCGAAAAAAGGGGGGGAAAGGTAGAAAAAGAAGAAAAGGGAACTTTGACAGCCAGGATAG
CTGTGATGACGAGCTGTTGGACTTTGATATTAAAAATGATAGGATGGACTTGAAGTTAAACACTGGAAGTTGGTTGCTTTCCACTGATGACTACACTGTACCATGGAATG
CTTTCTTAATAGGAAAAAAAAAATCAACTAAAGAGAATTCCTACGGGGGGGTTATCTTGAAGCAATGCCTTGACAGGAACATGAAGATATTTAGAAGGATCTACCCGAAC
ACTTATCAAAGTATTGTATGGCTTCTTGGATGA
mRNA sequenceShow/hide mRNA sequence
CTCTAAATTGTTGGAAGGATATACAAAAGAATAGAGCTGGGAAACAGAAGGAAGTTGGAAAGTGGAATAGTATTGCTGAAGCTGGTCCCGGCGATTTTGAAATTTCAGGA
AGCAGCGATTGGAAAATCAAGAAATTTAGCTACGATTTCACGCTAGAATCGGTATGAATTTGAGATTCTGTCGTTAAATCCTTACTATACCAATGGATTGCCACACTTTG
ATCAACCAGTTATCGTTCCTGTCTTTGATTCCTGCTTGAATTCATTTTCGTATGAGTTTTGGTTCTCAATCGAGAAAGAAAGCGTTCAATAGGAAGTTGTATAGGTATAG
AATGATCGATCTGTTTCTACAAGAGTCCACCTTCAATGACGAACAGGATGTTAGCTCTGAGAAGTTGAGAATTTCTCTGTTAAGTGAATTAGAATCTGTTTTATGGAAGT
TGCTGACTTGTGGAGGACGGTCGGAGGTTCGATTATGGCTTTCTAATACCATAGCCAGTGTGACATCGATCAGTCCCCAGCATCAGCGGGACCTGTTTATGACATTACTG
AGACGGAAGCCATTGAAGTGGGCCTTTGCTTCTCAATTACTGCAAATGTTATTTGAAAAGAGATCAAGGGAGGCAGGGATTCTCATTGCTAAGAGAAGCTACATAATGGA
AAAGTTTTTCGAAGGAAACCCAAGACGAATATCTCAGTGGTTTTCCAATTTTGCCACAAATGGTGCATCAGATCATGGAAAAGGTGCCAAGGCCTTGGCACAGTTTGCTT
TTGTAAATAGAGACATTTGTTGGGAGGAGCTTGAGTGGAAGGGGAAACATGGGCAGTCACCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCAT
CAAACTGTGAAGAATTTCATTCAGAATGTACCAGAGTTTTGGTCTTCCAATGAATTTGCTGAGTCACTAAAAGATGGTGAAATTTTGTTTCTTGACACGAAATTCTTTGT
GAAATATTTTGTTGATCTGATGCTTAAAGATGATCCAAAAGATGTTTGGGAAGTCATTAATGAGTTCCTGACGCATGAGTCATTTTCTTCACTGTGTCAACATCTCCTTG
TTACTCTTGAAGAGGCTGATTTCTGCAACTTCCTAAAAATGTTATGCAAACTTCTTAGGCCTAGAATAGAAACCAAGGATTTTGGTAATTCGTCTTTCATGTTTGAGGTC
ATACTTACCAAGTATGGTGACTCTGAATCTATTGATCAGATTTTACTATTAAATGCTGTCATTAATCAAGGACGCCAACTTCTACGCCTTTTACGTGATGAAGATGGGGA
AGAACAATTGGATGAAATCAAGGCTATTGTCCACAAAATTTCATCGATCTCAAGCAACTGTCATTGTTTGTTCCCATTGTTAAAGGAGTGTGACGGGAGGAAAAAGACAA
TAGAGATGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGAGGAATGCCAGACTCCTGAGTTATGGGAGTCCTTGTTTGTTGATAACGGCATA
GGATTCCGAAAATCTAATGAATATTTGTTGTTGGATCACAGTTGCTCGTCAGAAGATGATGGGTTTGAACTGTATAATAGAGCACGAGCTCAATCTAAGAAGCGAAAAAA
GGGGGGGAAAGGTAGAAAAAGAAGAAAAGGGAACTTTGACAGCCAGGATAGCTGTGATGACGAGCTGTTGGACTTTGATATTAAAAATGATAGGATGGACTTGAAGTTAA
ACACTGGAAGTTGGTTGCTTTCCACTGATGACTACACTGTACCATGGAATGCTTTCTTAATAGGAAAAAAAAAATCAACTAAAGAGAATTCCTACGGGGGGGTTATCTTG
AAGCAATGCCTTGACAGGAACATGAAGATATTTAGAAGGATCTACCCGAACACTTATCAAAGTATTGTATGGCTTCTTGGATGAAATGGCTGTTTGCTAAGAGGGAATGA
AGAACGAAGCTTTGTTAGTATGAGATATATTCAGCAGTGAAAGTAGACCGGATCTACCTTCTGAAACCGTTGTAGCTACCGACAAGCTTAATTTTACCAACAGATACAAG
CACAAGATCGAACGTGTATTGGTAAGGCAAGAAATGATGCAATATGCCACAAGTTCGTTGTACATTACATGCTATCAAGTTAACTTATAGCCTTTCGTTAAACATATTTC
TTTCTCATAGAATACAATTACTCATATGCTAACTAACGAATTACCATTGAATTGTATTATTGTTCTTAGATTGTAATATGTTGCTGATCAGGAATATATGAGGATTAGAA
CTTAGAAGCTCAGGTCAGTTACCACTTTTGCTTCCTTTTTCGGTAGAACAAATATGTTTAGTTTTTGTATTGTTTGTAAATAGACTCAATTCAATTCTGAATCTGGTTCG
CAAAAATTATTGGTTCCGGTTTCTCTTGTGCTAGGTGATTAAAATTTCTTACTCGTACATCGCAATTAGTAGATATAGCATTTATGGAATAATATGATCCACAGGATTAA
AAATTAGATTCTAAGGGGGCGTTTGAGCCTCTAACTTGCCTTGGGAGGAGTGGACTACTCATTCAATGTTTGGGTCTCCTATTACAATAGTTGGAGTTCTTGGTTATTAT
AACAATTAGACTATTTGAAAATCCTCGTTGTCTATGGTGTTTACCATTTTGTAACTATTCTCCCTCCTTATTTGATCCAAACTAAAATAGTTTGTACCTCGACGACAACA
CTTATAACTCACATACTATAGTAAACCAACTCAATACTCCAAACAGCCCTCTACTAAACCGACCATTCTCTCTTCCTTCCAACCCTTCCATCACATAAGGCTTGATCCCA
TTGATGTTTTTCTTCAGCTGGTTAAATCCTTTTGTCAAAAAGGAAGGTAGTTACAAAGAAAGAAGTAAATTGTGAAAGATGAAGCTGGGAAGTAGAATGGAAAGTGTAAA
GGTAGAAAATGGTTATTTTAGCACTCAAGTGTACTTTTTTTCGAGTTCAAGTTGGGGTGAGAAATTTGAATCTGGATCATCTGGGTTCTCAGTACATACAGGTGTCAATT
AAGCTTGTTCGATATATAAAGAGGATTTTGATTGTTCCTACTAGCTTATTCTCTTCCGTTTGGGTCTCCTCAATTTGTTCTTTGTGTTCACTGCTGGTCCATGATTGTGA
CTTTTAAATGCCATACCAAATCATTCATCAGATCTCTGAATTTGGGAGGTGATTAAAAGATTGGTG
Protein sequenceShow/hide protein sequence
MSFGSQSRKKAFNRKLYRYRMIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSISPQHQRDLFMTLLRRKPLKWAFASQLLQML
FEKRSREAGILIAKRSYIMEKFFEGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSN
EFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVINEFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGDSESIDQILLL
NAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCLFPLLKECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSCSS
EDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNTGSWLLSTDDYTVPWNAFLIGKKKSTKENSYGGVILKQCLDRNMKIFRRIYPN
TYQSIVWLLG