; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002223 (gene) of Chayote v1 genome

Gene IDSed0002223
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG10:7270272..7275108
RNA-Seq ExpressionSed0002223
SyntenySed0002223
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594816.1 hypothetical protein SDJN03_11369, partial [Cucurbita argyrosperma subsp. sororia]1.4e-18775.16Show/hide
Query:  GFGDYCIFTGVAAPFEGRIMPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFS
        G G + I   V    +G IMPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAPTT LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF 
Subjt:  GFGDYCIFTGVAAPFEGRIMPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFS

Query:  PSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTG----NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV---
        PSPYIVNHKRRGPRLLKSFS+D V   QK ND D G    N + NGSD   VKL+EGASVTVD+PIPNKDGHRNGLD  +N+NV QNG VDGDHGA    
Subjt:  PSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTG----NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV---

Query:  -GSNLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELRE
         GSN +N+ S+  ++N VA +KDSLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELRE
Subjt:  -GSNLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELRE

Query:  MKLSLLMELEKRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIA
        M+LSLLMELEKRK +EEAL  L+GQWQRL E L LVGLTLPSDPTVA  G  L SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIA
Subjt:  MKLSLLMELEKRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIA

Query:  RLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        RLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSSM    AT+ DD TD
Subjt:  RLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

KAG7026781.1 hypothetical protein SDJN02_10788 [Cucurbita argyrosperma subsp. argyrosperma]7.9e-18676.51Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAPTT LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTG----NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVA
        S+D V   QK ND D G    N + NGSD   VKL+EGASVTVD+PIPNK+GHRNGLD  +++NV QNG VDGDHGA     GSN +N+ S+ M++N VA
Subjt:  SQDGVFFSQKTNDKDTG----NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVA

Query:  LDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL
         +KDSLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELREM+LSLLMELEKRK +EEAL
Subjt:  LDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL

Query:  KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQR
          L+GQWQRL E L LVGLTLPSDPTVA  G  L SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQR
Subjt:  KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQR

Query:  NQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        NQEAVDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSS+    AT+ DD TD
Subjt:  NQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

XP_022962776.1 uncharacterized protein LOC111463164 [Cucurbita moschata]1.3e-18877.61Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAPTT LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD
        S+D V   QK ND D GN + NGSD   VKL+EGASVTVD+PIPNKDGHRNGLD  +++NV QNG VDGDHGA     GSN +N+ S+ M++N VA +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELREM+LSLLMELEKRK +EEAL  L+
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
        GQWQRL E L LVGLTLPSDPTVA  G  L SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        VDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSSM    AT+ DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

XP_023003998.1 uncharacterized protein LOC111497447 [Cucurbita maxima]1.0e-18576.52Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAP+T LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD
        S+D V   QK ND D GN + NGSD   VKL+EGASVTVD+PIPNKDG RNG D  +++NV QNG VDGDHGA     GSN +N+ SS M++N VA +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELREM+LSLLMELEKRK +EEAL  L+
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
        GQWQRL E L LVGLTLPSDPTV+  G  + SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        VDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSS+    AT+ DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

XP_038882592.1 uncharacterized protein LOC120073808 [Benincasa hispida]1.2e-18677.78Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD
        S+D V   +K ND D GNGS  GSD   VK TEG+SVTVD+PIP KDG RNG D  S++NV QNG VDGDHGA      +N SNHES  +++NGVA +K+
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLKVVV+N +S+GDT+DF+DP D LS  SNTDGEDNGFERSAK  TPMG FYDAWEELSSEG   PS  DIEAELREMKL+LLMELEKRK +EEAL KLQ
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
        GQW RL EQLLLVGLTLPSDP VA EG QL SDP EELCQQV LARF+S SIGRGI RAEVETEMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD
        VDLARRER++RKRRQRW+WGSVA  ITLGTAVLAWS+LP GKD   S++   + DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD

TrEMBL top hitse value%identityAlignment
A0A1S3B1E0 uncharacterized protein LOC1034850651.2e-17975.38Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD
        S+D V   +K NDKD GNGS   SDG  VKLTEGASVTV  PIP+K G RNGLD  S++N+ +NG VDGDHGA      S+ +NHESS + ++G+A +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLK VV+N +S GD +DF+DP D LS  SNTDGEDNGFERSAK  TPMG FYDAWEELSSEG   PS  DIE + REM+  LLME+EK+K +EEAL KLQ
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
         QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAEVE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD
        VDLARRER++RKRRQRW+WGSVA  ITLGTAVLAWS+LP GKD   S++   + DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD

A0A5A7T005 Uncharacterized protein1.2e-17975.38Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD
        S+D V   +K NDKD GNGS   SDG  VKLTEGASVTV  PIP+K G RNGLD  S++N+ +NG VDGDHGA      S+ +NHESS + ++G+A +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLK VV+N +S GD +DF+DP D LS  SNTDGEDNGFERSAK  TPMG FYDAWEELSSEG   PS  DIE + REM+  LLME+EK+K +EEAL KLQ
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
         QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAEVE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD
        VDLARRER++RKRRQRW+WGSVA  ITLGTAVLAWS+LP GKD   S++   + DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD

A0A5D3CMF0 Uncharacterized protein4.1e-18075.6Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD
        S+D V   +K NDKD GNGS   SDG  VKLTEGASVTV  PIP+K G RNGLD  S++N+ +NG VDGDHGA      S+ +NHESS + ++G+A +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----SNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLK VV+N +S GD +DF+DP D LS  SNTDGEDNGFERSAK  TPMG FYDAWEELSSEG   PS  DIE + REM+  LLME+EKRK +EEAL KLQ
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
         QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAEVE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD
        VDLARRER++RKRRQRW+WGSVA  ITLGTAVLAWS+LP GKD   S++   + DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD

A0A6J1HDH8 uncharacterized protein LOC1114631646.3e-18977.61Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAPTT LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD
        S+D V   QK ND D GN + NGSD   VKL+EGASVTVD+PIPNKDGHRNGLD  +++NV QNG VDGDHGA     GSN +N+ S+ M++N VA +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELREM+LSLLMELEKRK +EEAL  L+
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
        GQWQRL E L LVGLTLPSDPTVA  G  L SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        VDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSSM    AT+ DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

A0A6J1KTC8 uncharacterized protein LOC1114974475.0e-18676.52Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF
        MPTFTTIALDRLLEPGTSKSVDKPLPK  PALT NRAP+T LERRNSAS A+RKVQRPQIKP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSF
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSF

Query:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD
        S+D V   QK ND D GN + NGSD   VKL+EGASVTVD+PIPNKDG RNG D  +++NV QNG VDGDHGA     GSN +N+ SS M++N VA +KD
Subjt:  SQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GSNLSNHESSTMLNNGVALDKD

Query:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ
        SLKVVV  L S+GD +DF+DPQD LS  SNTDGEDNG+ERSAK  TPMG FYDAWEE+SS+G  HPS   IEAELREM+LSLLMELEKRK +EEAL  L+
Subjt:  SLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ

Query:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA
        GQWQRL E L LVGLTLPSDPTV+  G  + SDP EELCQQVN+ARF+SGSIGRGI RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEA
Subjt:  GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEA

Query:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD
        VDLARRER++RKRR RWMWGSVA  ITLGTAVLAWS+LP GKDSSS+    AT+ DD TD
Subjt:  VDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G50910.1 unknown protein8.1e-9648.67Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRG-PRLLKS
        MPTF+ IALDR+LEPG S SV+  +P     L  ++ P +KLE+       +R V RP + P LY TP+A PLP+SP SF PSPYI+NHK RG PRLLKS
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRG-PRLLKS

Query:  FSQDGVFFS--QKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHG--AVGSNLSNHESSTMLN--NGVAL
         S+  V  S  QKT +++T       +D KV       S +  I    +D + NG+   +  N   +GIVDG  G  +     S +  S + N  NG+  
Subjt:  FSQDGVFFS--QKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHG--AVGSNLSNHESSTMLN--NGVAL

Query:  DKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGE-DNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL
             + V        +++DFYDP +  S  SNTD E D G E S +  TP+G FYDAW+ELS++     S  +IE+EL E++LSLLME+EKRK +EEAL
Subjt:  DKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGE-DNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL

Query:  KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQR
        +++Q  WQRL EQ+  VGL +P DPT +     L+    EEL  Q+ +ARF+S S+GRG+ +AEVE EME+ LE KNFEI RL DRL YYEAVN EMSQR
Subjt:  KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQR

Query:  NQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA
        NQEA+++ARRER KRK+RQRW+WGS+AA ITLG+A LAWS++P  K SS ++
Subjt:  NQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA

AT5G66480.1 unknown protein2.8e-7240.56Show/hide
Query:  MPTFTTIALDRLLEPGTSKSVDKPLP-KVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKS
        MPTF+  AL R L  GTS S   P   + KP++ ++ +   K          ++   RPQ+ P LY T +  P P+SP S+ PSPYI+NHK RGP L   
Subjt:  MPTFTTIALDRLLEPGTSKSVDKPLP-KVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKS

Query:  FSQ-DGVFFS-QKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDG-HRNGLDH--------VSNTNVAQNGIVDGDHGAVGSN--LSNHESSTM
         S+ DG         +K +GN     +       +    +T  I + + +G H  G+             T + +    D  +G +GSN   SN E  + 
Subjt:  FSQ-DGVFFS-QKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDG-HRNGLDH--------VSNTNVAQNGIVDGDHGAVGSN--LSNHESSTM

Query:  LNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTY---TPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELE
        L   V +  D             + ++FY+P + +S  SNT+ ED  FER+  ++   T +G FYDA +ELS++     S  +IE+E+REM+L LLME+E
Subjt:  LNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTY---TPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELE

Query:  KRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYE
        +R+ +E  L+++Q  W+RL +QL  VG+ LP DPT +     LA    +EL  Q+ + RF+S ++G  + + EVE EMEA+LE KNFEI RL DRL YYE
Subjt:  KRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYE

Query:  AVNHEMSQRNQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA
         VN EMSQRNQEA+++ARR+  KRKRRQRW+WGS+AA ITLG+ VLAWS+LPPG  SS  A
Subjt:  AVNHEMSQRNQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTGTTTTAAATTTGTGCAATTTGATGGAGGTTTTGGGGATTATTGTATATTCACTGGGGTTGCTGCACCTTTTGAAGGCAGAATCATGCCAACGTTTACTACGAT
TGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAAGTGAAGCCTGCTTTGACCTCTAATCGTGCGCCCACCACGAAGTTGGAGA
GGAGAAATAGCGCATCACTTGCTGATAGAAAGGTTCAGCGGCCTCAAATTAAGCCAGAACTATATACCACTCCAGAGGCAACTCCTCTCCCCGACTCACCGCCTTCGTTT
TCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGCCCTCGTCTGTTGAAGAGTTTCTCTCAGGATGGTGTCTTCTTTAGTCAAAAGACAAATGATAAGGATACAGG
AAATGGAAGCGCGAATGGTTCAGATGGCAAAGTTGTAAAATTGACTGAGGGTGCTTCTGTTACTGTTGACATACCTATCCCAAACAAAGATGGACACAGAAATGGTCTAG
ACCATGTTAGTAATACTAATGTTGCTCAAAATGGGATTGTTGATGGTGATCATGGTGCTGTTGGGAGTAATCTTAGTAATCATGAAAGTAGTACAATGTTGAACAATGGT
GTTGCTCTGGATAAGGATTCATTGAAGGTTGTTGTGACAAATTTACAAAGTGTTGGAGATACTGATGACTTCTATGACCCACAGGATTTTTTGAGTGCCAAGAGTAACAC
AGATGGAGAAGATAACGGATTTGAACGTTCAGCCAAGACTTATACTCCTATGGGGGTATTTTATGATGCTTGGGAAGAGCTTTCGTCTGAGGGGTTTTCACATCCTTCTT
TTCCTGATATTGAAGCTGAGTTGCGTGAAATGAAACTATCCCTATTGATGGAACTTGAGAAACGAAAGCTTTCTGAGGAAGCACTGAAAAAATTGCAGGGTCAGTGGCAG
AGGCTTAGTGAACAGCTACTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCATGGAAGGGAAACAGTTAGCTTCTGACCCTGTTGAAGAATTGTGTCAACA
AGTTAATCTTGCTAGGTTTATATCGGGTTCAATTGGAAGGGGTATAGTGAGGGCCGAAGTGGAGACTGAGATGGAGGCACAGCTTGAAGTGAAGAATTTTGAGATTGCTC
GATTGTTGGACCGGCTTCGTTACTATGAGGCTGTGAATCATGAAATGTCTCAGAGGAATCAAGAAGCTGTAGATTTGGCGCGACGCGAGAGGGTGAAAAGAAAAAGGAGG
CAAAGGTGGATGTGGGGTTCGGTTGCCGCTGTGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTTCCTTCCACCGGGAAAGGATTCGTCATCCATGGCCACTGATCG
TGATGATGGAACAGATGGATGA
mRNA sequenceShow/hide mRNA sequence
ATTGTAATTTGTATGAAGAACAAAAGAAGAGGAAAAAGGGGTCCAAAGATTTGCGGAGAAGGTGAGTGCGTGTGCGACAAGGCGAAGTCGAGGCGGTCGTTGCGATCCCT
AAGAACAAGAAGAGGAAAAAAAACTTGGAGAGGAGCTTTTCAATCTCTCTCTCTTGAATCCCGTTTGTGTTTTCAAATCCTCTTTTTTTATGAAGGACGGGGATGAGTCT
CCACAATTCAAATCCATTCTGTAAAATGGAACTTTGTGTTCTCTCTACAAACTTTCCGACTCCTCTCCGATCCTAGGGCTTTTCCCCCGCCTTCCTCTCCTTTTCCAGGT
ACTTTCCCTTCTGGGATTTTTCTTTTGTTCGATTTTGAATGGGGGATATGGAGAATTCCTGTTCTGGGGTTGTTTGTTTTTGGTTTTTAGAGGTTGGGTGTGCTCGTAAT
TTGAGTATCTTTGATGGGTTGTGTTGGGATTTCTTTATTTTCGTTTAATTTTGAGTTTTTTGTATGATTTTGTTGCTGGGTTATCTTTGTTTTGCAACATTTCTCATGAT
TAGTTGGAGGTTGAGCTTTGTGTGTGTGTGTTTTTCTTCTTTCGTACGATTTATCTGTTTTGAATTTTGTTTGTTGCTTCGTTTTTGGTGTAATTTGTTTGTGGAAATTT
GGAATAGCTAGTTTTTGTGATGTTTCTTATGACTGACCATCCATGGTTTTTTATTTGGATTCTGTTGTGCTTTATATTACCATGGTTTTGGTTTATGCTTCATTTTTTGA
GATTTGGCTGGATGGAGCTTTTAGCTTTCAGCAATGTTTTGTTTTAAATTTGTGCAATTTGATGGAGGTTTTGGGGATTATTGTATATTCACTGGGGTTGCTGCACCTTT
TGAAGGCAGAATCATGCCAACGTTTACTACGATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAAGTGAAGCCTGCTTTGA
CCTCTAATCGTGCGCCCACCACGAAGTTGGAGAGGAGAAATAGCGCATCACTTGCTGATAGAAAGGTTCAGCGGCCTCAAATTAAGCCAGAACTATATACCACTCCAGAG
GCAACTCCTCTCCCCGACTCACCGCCTTCGTTTTCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGCCCTCGTCTGTTGAAGAGTTTCTCTCAGGATGGTGTCTT
CTTTAGTCAAAAGACAAATGATAAGGATACAGGAAATGGAAGCGCGAATGGTTCAGATGGCAAAGTTGTAAAATTGACTGAGGGTGCTTCTGTTACTGTTGACATACCTA
TCCCAAACAAAGATGGACACAGAAATGGTCTAGACCATGTTAGTAATACTAATGTTGCTCAAAATGGGATTGTTGATGGTGATCATGGTGCTGTTGGGAGTAATCTTAGT
AATCATGAAAGTAGTACAATGTTGAACAATGGTGTTGCTCTGGATAAGGATTCATTGAAGGTTGTTGTGACAAATTTACAAAGTGTTGGAGATACTGATGACTTCTATGA
CCCACAGGATTTTTTGAGTGCCAAGAGTAACACAGATGGAGAAGATAACGGATTTGAACGTTCAGCCAAGACTTATACTCCTATGGGGGTATTTTATGATGCTTGGGAAG
AGCTTTCGTCTGAGGGGTTTTCACATCCTTCTTTTCCTGATATTGAAGCTGAGTTGCGTGAAATGAAACTATCCCTATTGATGGAACTTGAGAAACGAAAGCTTTCTGAG
GAAGCACTGAAAAAATTGCAGGGTCAGTGGCAGAGGCTTAGTGAACAGCTACTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCATGGAAGGGAAACAGTT
AGCTTCTGACCCTGTTGAAGAATTGTGTCAACAAGTTAATCTTGCTAGGTTTATATCGGGTTCAATTGGAAGGGGTATAGTGAGGGCCGAAGTGGAGACTGAGATGGAGG
CACAGCTTGAAGTGAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTTCGTTACTATGAGGCTGTGAATCATGAAATGTCTCAGAGGAATCAAGAAGCTGTAGATTTG
GCGCGACGCGAGAGGGTGAAAAGAAAAAGGAGGCAAAGGTGGATGTGGGGTTCGGTTGCCGCTGTGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTTCCTTCCACC
GGGAAAGGATTCGTCATCCATGGCCACTGATCGTGATGATGGAACAGATGGATGACCACTGACAAAAGAAGTAGCTTATGTACATGTGGTATAATGTTGATATCAAATGT
TTGTTGCTTTTGTCAGAAGTATTTGCATGCCCGAAGGTATAGTTCTTAAAAATCTTCGGCCCTTATAATTAAAGGTGAAAGAAAAGGTTGGTTCTTGATATTTCTTTTTC
CCTCAGATCTAACAGGGTTATAACACCTGAAATTTTGGTAATTCATGACAAACCATACACTGTCTATTTACTGTGCTAACTGTAGTTGAAATATTGTGCCTGACTTAGAG
ACGGGCGTGATTGCCCTTGTTTCGAAAAAAAAAAGAACTGTAGTTGAAATATTGCGTTGAGCATTCAAATAATTATTGGCAGAGTGGAGTTGAACTCTCTATGGTAATGC
CTGTTTCATCCTTCAACTATTGTATATAGCAC
Protein sequenceShow/hide protein sequence
MFCFKFVQFDGGFGDYCIFTGVAAPFEGRIMPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSF
SPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVGSNLSNHESSTMLNNG
VALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQGQWQ
RLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRR
QRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMATDRDDGTDG