; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029668 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029668
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153449:1619746..1624240
RNA-Seq ExpressionSgr029668
SyntenySgr029668
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146104.1 uncharacterized protein LOC101206874 [Cucumis sativus]9.4e-23083.92Show/hide
Query:  TFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSYIM
        TF+DE+DV S KL+ISLLS+LES+L KLLT GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPL WA ASQLLQMLFEKRSREAGILIAKRSYIM
Subjt:  TFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSYIM

Query:  ENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLKDG
        E FF+GNPRRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFWSSNEFAESLKDG
Subjt:  ENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLKDG

Query:  EILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRIFL
        EILFLDTKFFVKYF DLMLKDD KDVWEV+NEFL  ESFSSLCQ LL+TLEEADFC FLKMLCK L PRIETKDFGNSSF+FEVIL+KYGD ESID+I L
Subjt:  EILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRIFL

Query:  LNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKAN
        LNAVINQGRQLLR LRDED +E+ DEIKAIV +IS+ISS    L  LLKEC  R++ IE+IKWLGLQSWVLHY M+EECQTPELWESLFVDNGIGFRK+N
Subjt:  LNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKAN

Query:  EYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        EY LL+HS  SEDDG EL N A A+  KRKK GK RKRRK NFD +DS DDELLDFDIKNDRMDLKLNTGSWLLS DDYTVPWNA
Subjt:  EYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

XP_008448632.1 PREDICTED: uncharacterized protein LOC103490747 isoform X2 [Cucumis melo]2.1e-22982.58Show/hide
Query:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS
        ++ TF+DE+DV SAKL+ISLLS+LES+L KLLT GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPL WA ASQLLQMLFEKRSREAGILIAKRS
Subjt:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS

Query:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL
        YIME FF+GNPRRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFWSSNEFAESL
Subjt:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL

Query:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR
        KDGEILFLDTKFFVK+F DLMLKDDSKDVWEV+NEFLM ESFSSLCQ LL+TLE+ADFC FLK+LCK L PRIETKDFGNSSF+FEVIL+KYGD ESID+
Subjt:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR

Query:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR
        I LLNAVINQGRQLLR LRDED +E+ DEIKAI+ +ISAISS +  L  LLKEC  R++ IE+IKWLGLQSWVLHY  +EECQTPELWESLFVDNGIGFR
Subjt:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR

Query:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKKG-KRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        K+NEY LL+HS  SEDDG E CN A AK  KRKKG K RKRRK+NFD ++S DDELLD DI+NDRMDLKLNTGSW LS DDYTVPWNA
Subjt:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKKG-KRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

XP_022145467.1 uncharacterized protein LOC111014910 isoform X1 [Momordica charantia]4.6e-23785.36Show/hide
Query:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY
        +P F+DEKDVDSAKL+ISLLS+LES+L+KLL SGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLR KPL WALASQLLQM FEKR R AGILIAKRSY
Subjt:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY

Query:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK
        IME FF+GN RRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV+NFIE+VPEFWSSNEFAESLK
Subjt:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK

Query:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI
        DGEIL LDT+FFVKYF DLMLKDDSKDVWE +NE+LMQESFSSLC+ LLITLEEADFCYFLKMLCK L+PRIETKD G+SSF+ E+ILS+YGD ESID+I
Subjt:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI

Query:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKA
         LLNAVINQGRQLLR LRDEDA+EEWDEIKAIVSEISAISS T SLS LLKEC RR+ IEVIKWLGLQSWVL Y M+EECQTPELWESLF DNGIGFRK+
Subjt:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKA

Query:  NEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        NEYALL+HS  SEDDG ELC+TASAK+MKR+KGK RKRRK+NFD     D+ELL FD KNDR+DLKLNTGSWLLSIDDYTVPWNA
Subjt:  NEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

XP_023540456.1 uncharacterized protein LOC111800821 [Cucurbita pepo subsp. pepo]1.0e-22882.96Show/hide
Query:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY
        +P F++E+DV SAKL+ISLLS+LES+L KLL SGGRSEVRLWL NTIASMTSISPQHQR+LF+TFLR KPLNW  AS LLQMLFEKR REAG+LIAKRSY
Subjt:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY

Query:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK
        IME FF+GNPRRISQWFSNFA NGASDHG+GAKALAQFSFVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFW SNEFAESLK
Subjt:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK

Query:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI
        DGEILFLDTKFFVKY  D MLKDDS+DVW+ +NEFL QESFSSLCQ LLITLEEADFC FLKMLCK L PR+ETKDFGNSS LFEVILSKYGD ES+D+I
Subjt:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI

Query:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK
         LLNAVINQGRQLLRF++DEDA+EE DEIK I+ EISAISS T SLS LLKECYRR + IEVIKWLGLQSWVLHY M++ECQT ELWESLFVDNGI FRK
Subjt:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK

Query:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        +NEYALL+HS LSEDDG E CNTAS K  KRK+ K+ RKRRK+N DDEDS DDELLDFDIK D+ DLKLNTGSWLLSID+YTVPWNA
Subjt:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

XP_038891380.1 uncharacterized protein LOC120080808 [Benincasa hispida]1.4e-23083.61Show/hide
Query:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS
        ++P F+DE+DV SAKL+ISLLSKLES+L KLLTSGGRSEVRLWL+N+IAS+TSISPQHQRDLF+T LR KP  WA ASQLLQMLFEKRSREAGILIAKRS
Subjt:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS

Query:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL
        YIME FF+GN RRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTVKNFIENVPEFWSSNEFAESL
Subjt:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL

Query:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR
        KDGEILFLDTKFFVKYF DLMLKDD KDVWEV+NEFLM ESFSSL Q LL+TLEEADFC FLKMLCK L PRIETKDFGN SF FEVILSKYGD ESID+
Subjt:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR

Query:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR
        I LLNAV+NQGRQ+LR LRDED +E+ DEIKAIV +ISAISS T SL  LL EC  R+R IE+IKWLGLQSWVLHY M+EECQTPELWESLFVDNGIGF+
Subjt:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR

Query:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        K+NEY+LL+HS LSEDDG E CN A AK  +RKK GK RKRRK++FD EDS DDELLDFDIKNDRMDLKLNTGSWLLS DDYTVPWNA
Subjt:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

TrEMBL top hitse value%identityAlignment
A0A0A0L6D1 Uncharacterized protein4.5e-23083.92Show/hide
Query:  TFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSYIM
        TF+DE+DV S KL+ISLLS+LES+L KLLT GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPL WA ASQLLQMLFEKRSREAGILIAKRSYIM
Subjt:  TFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSYIM

Query:  ENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLKDG
        E FF+GNPRRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFWSSNEFAESLKDG
Subjt:  ENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLKDG

Query:  EILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRIFL
        EILFLDTKFFVKYF DLMLKDD KDVWEV+NEFL  ESFSSLCQ LL+TLEEADFC FLKMLCK L PRIETKDFGNSSF+FEVIL+KYGD ESID+I L
Subjt:  EILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRIFL

Query:  LNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKAN
        LNAVINQGRQLLR LRDED +E+ DEIKAIV +IS+ISS    L  LLKEC  R++ IE+IKWLGLQSWVLHY M+EECQTPELWESLFVDNGIGFRK+N
Subjt:  LNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKAN

Query:  EYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        EY LL+HS  SEDDG EL N A A+  KRKK GK RKRRK NFD +DS DDELLDFDIKNDRMDLKLNTGSWLLS DDYTVPWNA
Subjt:  EYALLEHSSLSEDDGLELCNTASAKVMKRKK-GKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

A0A1S3BKS3 uncharacterized protein LOC103490747 isoform X21.0e-22982.58Show/hide
Query:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS
        ++ TF+DE+DV SAKL+ISLLS+LES+L KLLT GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPL WA ASQLLQMLFEKRSREAGILIAKRS
Subjt:  VKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS

Query:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL
        YIME FF+GNPRRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFWSSNEFAESL
Subjt:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL

Query:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR
        KDGEILFLDTKFFVK+F DLMLKDDSKDVWEV+NEFLM ESFSSLCQ LL+TLE+ADFC FLK+LCK L PRIETKDFGNSSF+FEVIL+KYGD ESID+
Subjt:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR

Query:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR
        I LLNAVINQGRQLLR LRDED +E+ DEIKAI+ +ISAISS +  L  LLKEC  R++ IE+IKWLGLQSWVLHY  +EECQTPELWESLFVDNGIGFR
Subjt:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKEC-YRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFR

Query:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKKG-KRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        K+NEY LL+HS  SEDDG E CN A AK  KRKKG K RKRRK+NFD ++S DDELLD DI+NDRMDLKLNTGSW LS DDYTVPWNA
Subjt:  KANEYALLEHSSLSEDDGLELCNTASAKVMKRKKG-KRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

A0A6J1CWP0 uncharacterized protein LOC111014910 isoform X12.3e-23785.36Show/hide
Query:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY
        +P F+DEKDVDSAKL+ISLLS+LES+L+KLL SGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLR KPL WALASQLLQM FEKR R AGILIAKRSY
Subjt:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY

Query:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK
        IME FF+GN RRISQWFSNFA NGASDHG+GAKALAQF+FVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV+NFIE+VPEFWSSNEFAESLK
Subjt:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK

Query:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI
        DGEIL LDT+FFVKYF DLMLKDDSKDVWE +NE+LMQESFSSLC+ LLITLEEADFCYFLKMLCK L+PRIETKD G+SSF+ E+ILS+YGD ESID+I
Subjt:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI

Query:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKA
         LLNAVINQGRQLLR LRDEDA+EEWDEIKAIVSEISAISS T SLS LLKEC RR+ IEVIKWLGLQSWVL Y M+EECQTPELWESLF DNGIGFRK+
Subjt:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKA

Query:  NEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        NEYALL+HS  SEDDG ELC+TASAK+MKR+KGK RKRRK+NFD     D+ELL FD KNDR+DLKLNTGSWLLSIDDYTVPWNA
Subjt:  NEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

A0A6J1G8R1 uncharacterized protein LOC1114518551.9e-22882.34Show/hide
Query:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY
        +P F++E DV SAKL+ISLLS+LES+L KLL SGGRSEVRLWLSNTIASMTSISPQHQR+LF+TFLR KPL W  AS LLQM FEKR REAG+LIAKRSY
Subjt:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY

Query:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK
        IME FF+GNPRRISQWFSNFA NGASDHG+GAKALAQFSFVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFW SNEF+ESLK
Subjt:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK

Query:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI
        DGEILFLDTKFFVKY  D MLKDDS+DVW+ +NEFL QE FSSLCQ LLITLEEADFC FLKMLCK L P  ETKDFGNSSFLFEV+LSKYGD ES+D+I
Subjt:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI

Query:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK
         LLNAVINQGRQLLRF++DEDA+EE DEIK I+ EISAISS T SLS LLKECYRR + IEVIKWLGLQSWVLHY M++ECQT ELWESLFVDNGI FRK
Subjt:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK

Query:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        +NEYALL+HS LSEDDG E CNTAS K  KRK+GK+ RKRRK++FDDEDS DDELLDFDIK D+ DLKLNTGSWLLSID+YTVPWNA
Subjt:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

A0A6J1KWG9 uncharacterized protein LOC1114988256.1e-22782.14Show/hide
Query:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY
        +P F++E+DV SAKL+ISLLS+LE++L KLL SGGRSEVRLWLSNTIASMTSISPQHQR+LF+TFLR KPL W  AS LLQM FEKR REAG+LIAKRSY
Subjt:  KPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRSY

Query:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK
        IME FF+GNPRRISQWFSNFA NGASDHG+GAKALAQFSFVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDV QTVKNFI+NVPEFW SNEFAESLK
Subjt:  IMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESLK

Query:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI
        DGEILFLDTKFFVKY  D MLKDDS+DVW+ +NEFL QESFSSLCQ LLITLEEADFC FLKMLCK L P +ETKDFGNSSFLFEVILSKYGD ES+D+I
Subjt:  DGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRI

Query:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK
         LLNAVIN+GRQLLRF++DEDA+EE DEIK I+ EISAISS T SLS LLKECYRR + IEVIKWLGLQSWVLHY M++ECQT ELWE LFVDNGI FRK
Subjt:  FLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRR-RIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK

Query:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        +NEYALL+HS LSEDDG E CNTAS K  KRK+GK+ RKRRK+N DDEDS D ELLDFDIK D+ DLKLNTGSWLLSID+YTVPWNA
Subjt:  ANEYALLEHSSLSEDDGLELCNTASAKVMKRKKGKR-RKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48340.1 unknown protein4.3e-14052.56Show/hide
Query:  KPTFSDEKDVDS-AKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS
        +P ++D+    S   + + LL+KL S +Q L+T G RSE RLWL + ++++ SISP  Q ++F+  LR KP      SQ+L M+FEKR R+ G L+AKRS
Subjt:  KPTFSDEKDVDS-AKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS

Query:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL
        YI+E FF+GN +RI +WFS FA +G SDH RGAKALAQF+F NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV++T++NF++NVPEFWSSNEFAESL
Subjt:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL

Query:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR
        KDG+ILFLDTKFF+  F   M ++D  DVW+ V EFL +ESFSSL Q LLITLEE D C FL++L  +  P IE+ D G+SS    V+LS+Y D ESID 
Subjt:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR

Query:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK
        + LL+++INQGRQLLR +RDE+  +E + +K  ++EI        S S +L+E  + + I+VIK LGL SW +H+ ++EECQTP+ WE LF +NGI FR+
Subjt:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK

Query:  ANEYALLEHSSLSE--DDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSH-DDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        +++++LL ++  SE  +   +  +  S K  KR+K KR+K++K+ FDD+D   DDELL         DL   + SWLLS D ++  W +
Subjt:  ANEYALLEHSSLSE--DDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSH-DDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA

AT5G48340.2 unknown protein4.3e-14052.56Show/hide
Query:  KPTFSDEKDVDS-AKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS
        +P ++D+    S   + + LL+KL S +Q L+T G RSE RLWL + ++++ SISP  Q ++F+  LR KP      SQ+L M+FEKR R+ G L+AKRS
Subjt:  KPTFSDEKDVDS-AKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKRSREAGILIAKRS

Query:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL
        YI+E FF+GN +RI +WFS FA +G SDH RGAKALAQF+F NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV++T++NF++NVPEFWSSNEFAESL
Subjt:  YIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAESL

Query:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR
        KDG+ILFLDTKFF+  F   M ++D  DVW+ V EFL +ESFSSL Q LLITLEE D C FL++L  +  P IE+ D G+SS    V+LS+Y D ESID 
Subjt:  KDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDR

Query:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK
        + LL+++INQGRQLLR +RDE+  +E + +K  ++EI        S S +L+E  + + I+VIK LGL SW +H+ ++EECQTP+ WE LF +NGI FR+
Subjt:  IFLLNAVINQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRK

Query:  ANEYALLEHSSLSE--DDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSH-DDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA
        +++++LL ++  SE  +   +  +  S K  KR+K KR+K++K+ FDD+D   DDELL         DL   + SWLLS D ++  W +
Subjt:  ANEYALLEHSSLSE--DDGLELCNTASAKVMKRKKGKRRKRRKKNFDDEDSH-DDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAGACTCTTGGGCCGAAGCCCAAATCTCTCTTCTTTCGAGCCCAGTCTTTTTCACAACCTGTCAAGCCCACCTTCAGTGACGAAAAGGATGTTGACTCTGCCAA
GTTGAAAATTTCTCTATTAAGTAAATTAGAATCTATTTTACAGAAATTGCTGACTTCTGGAGGACGGTCCGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGA
CATCTATCAGTCCCCAGCACCAGCGGGACCTGTTTGTGACCTTCCTGAGACTGAAGCCACTGAATTGGGCCTTAGCATCTCAACTACTGCAAATGTTGTTTGAAAAGAGA
TCGCGAGAGGCAGGGATTCTCATTGCCAAGAGAAGCTACATAATGGAAAATTTTTTCGATGGAAATCCAAGACGAATATCTCAGTGGTTTTCCAATTTTGCTATGAATGG
TGCATCAGATCATGGAAGAGGTGCCAAGGCCCTGGCACAGTTTTCTTTTGTAAATCGTGACATTTGCTGGGAGGAGCTTGAGTGGAAGGGGAAACACGGGCAGTCACCTG
CAGTGGTTGCGACGAAGCCCCATTATTTTCTTGATCTGGATGTGCAACAAACTGTGAAGAATTTCATTGAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCTGAG
TCGCTCAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTTGTGAAATATTTCACCGATCTGATGCTTAAAGATGATTCAAAAGATGTTTGGGAAGTCGTTAATGA
GTTCCTAATGCAGGAGTCATTTTCTTCATTGTGTCAACGTCTTCTTATTACACTCGAAGAGGCTGATTTCTGCTACTTTCTGAAAATGCTGTGTAAATTTCTCAACCCTA
GAATAGAAACCAAGGATTTTGGTAATTCATCTTTTCTGTTTGAGGTCATACTTTCTAAATATGGTGACCGTGAATCTATTGATCGGATTTTCCTATTAAATGCTGTCATT
AATCAAGGACGCCAACTTCTACGGTTTTTACGTGATGAAGATGCTAAGGAAGAATGGGATGAAATCAAGGCTATTGTCTCAGAGATTTCGGCAATCTCAAGCAAAACTGA
TAGCTTATCCTCACTATTGAAAGAGTGTTACAGAAGAAGGATCATTGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGTATGGCAGAGGAATGTC
AGACACCTGAGTTATGGGAATCCTTGTTTGTTGATAATGGCATAGGCTTCCGAAAAGCTAATGAATATGCATTGTTAGAACACAGTTCCTTATCGGAAGATGATGGTTTA
GAACTGTGTAATACAGCATCGGCTAAAGTTATGAAGCGAAAAAAGGGAAAACGTAGAAAGAGAAGAAAAAAGAATTTTGACGATGAGGACAGCCATGATGATGAGCTGTT
GGACTTTGATATTAAAAATGATAGGATGGATTTGAAATTAAACACTGGAAGTTGGTTGCTTTCCATTGATGACTATACTGTACCATGGAATGCT
mRNA sequenceShow/hide mRNA sequence
ATGACTGAGACTCTTGGGCCGAAGCCCAAATCTCTCTTCTTTCGAGCCCAGTCTTTTTCACAACCTGTCAAGCCCACCTTCAGTGACGAAAAGGATGTTGACTCTGCCAA
GTTGAAAATTTCTCTATTAAGTAAATTAGAATCTATTTTACAGAAATTGCTGACTTCTGGAGGACGGTCCGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGA
CATCTATCAGTCCCCAGCACCAGCGGGACCTGTTTGTGACCTTCCTGAGACTGAAGCCACTGAATTGGGCCTTAGCATCTCAACTACTGCAAATGTTGTTTGAAAAGAGA
TCGCGAGAGGCAGGGATTCTCATTGCCAAGAGAAGCTACATAATGGAAAATTTTTTCGATGGAAATCCAAGACGAATATCTCAGTGGTTTTCCAATTTTGCTATGAATGG
TGCATCAGATCATGGAAGAGGTGCCAAGGCCCTGGCACAGTTTTCTTTTGTAAATCGTGACATTTGCTGGGAGGAGCTTGAGTGGAAGGGGAAACACGGGCAGTCACCTG
CAGTGGTTGCGACGAAGCCCCATTATTTTCTTGATCTGGATGTGCAACAAACTGTGAAGAATTTCATTGAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCTGAG
TCGCTCAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTTGTGAAATATTTCACCGATCTGATGCTTAAAGATGATTCAAAAGATGTTTGGGAAGTCGTTAATGA
GTTCCTAATGCAGGAGTCATTTTCTTCATTGTGTCAACGTCTTCTTATTACACTCGAAGAGGCTGATTTCTGCTACTTTCTGAAAATGCTGTGTAAATTTCTCAACCCTA
GAATAGAAACCAAGGATTTTGGTAATTCATCTTTTCTGTTTGAGGTCATACTTTCTAAATATGGTGACCGTGAATCTATTGATCGGATTTTCCTATTAAATGCTGTCATT
AATCAAGGACGCCAACTTCTACGGTTTTTACGTGATGAAGATGCTAAGGAAGAATGGGATGAAATCAAGGCTATTGTCTCAGAGATTTCGGCAATCTCAAGCAAAACTGA
TAGCTTATCCTCACTATTGAAAGAGTGTTACAGAAGAAGGATCATTGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGTATGGCAGAGGAATGTC
AGACACCTGAGTTATGGGAATCCTTGTTTGTTGATAATGGCATAGGCTTCCGAAAAGCTAATGAATATGCATTGTTAGAACACAGTTCCTTATCGGAAGATGATGGTTTA
GAACTGTGTAATACAGCATCGGCTAAAGTTATGAAGCGAAAAAAGGGAAAACGTAGAAAGAGAAGAAAAAAGAATTTTGACGATGAGGACAGCCATGATGATGAGCTGTT
GGACTTTGATATTAAAAATGATAGGATGGATTTGAAATTAAACACTGGAAGTTGGTTGCTTTCCATTGATGACTATACTGTACCATGGAATGCT
Protein sequenceShow/hide protein sequence
MTETLGPKPKSLFFRAQSFSQPVKPTFSDEKDVDSAKLKISLLSKLESILQKLLTSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRLKPLNWALASQLLQMLFEKR
SREAGILIAKRSYIMENFFDGNPRRISQWFSNFAMNGASDHGRGAKALAQFSFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVKNFIENVPEFWSSNEFAE
SLKDGEILFLDTKFFVKYFTDLMLKDDSKDVWEVVNEFLMQESFSSLCQRLLITLEEADFCYFLKMLCKFLNPRIETKDFGNSSFLFEVILSKYGDRESIDRIFLLNAVI
NQGRQLLRFLRDEDAKEEWDEIKAIVSEISAISSKTDSLSSLLKECYRRRIIEVIKWLGLQSWVLHYSMAEECQTPELWESLFVDNGIGFRKANEYALLEHSSLSEDDGL
ELCNTASAKVMKRKKGKRRKRRKKNFDDEDSHDDELLDFDIKNDRMDLKLNTGSWLLSIDDYTVPWNA