; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020884 (gene) of Chayote v1 genome

Gene IDSed0020884
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationLG08:34270818..34273927
RNA-Seq ExpressionSed0020884
SyntenySed0020884
Gene Ontology termsNA
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030497.1 hypothetical protein SDJN02_08844 [Cucurbita argyrosperma subsp. argyrosperma]9.2e-18653.5Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET
        M PL  SF+  LS+F+  VSVF AII + F   V +             R   +S      H   G +   + SV+ASNSSKFQ E T QI GFVEESET
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET

Query:  ATSFVRELYCDSSPSSGNNQFP-DDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGA
           FV ELYCD+SPSS  NQ P DDFD + GK L EF   T E            G +  +LE+F                                   
Subjt:  ATSFVRELYCDSSPSSGNNQFP-DDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGA

Query:  LPQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLS
                       S  E+L   +E +D + N                                     DG I       F  LP V T  GDCSV  S
Subjt:  LPQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLS

Query:  DSESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE-----------------DFFEEHQDLINQLK
        DS+SES SFDEEF  IE+ELEP+L VS NA++ PVN+       D  GE  ET+ D K M FEE++EEE                 +F +EHQDLI QLK
Subjt:  DSESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE-----------------DFFEEHQDLINQLK

Query:  FELRNSRTGGLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLP
         ELRNSRTGGLPTV  EEE  PE + PTSVEALKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI  +KLKDP SS++ KKS +KS++ 
Subjt:  FELRNSRTGGLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLP

Query:  HKLRSGRAEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSL
        HKLR+GRA KG   L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+LQQN +R TS+Y+RVV+EFQ FCIL+QRFIEDE F GPRI NYVKNRLLVRSL
Subjt:  HKLRSGRAEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSL

Query:  LQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGN
        LQVPAIREDC +DKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDVG++ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR GN
Subjt:  LQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGN

Query:  CIVKKLQRIS--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP
        CIVKKL+R+S  EEGR+K ELLIAEVELKL+SRV+SMSRLTE QL+WCHKKLHQ+NFV+RKV++EPSF  FP
Subjt:  CIVKKLQRIS--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP

XP_011648587.1 uncharacterized protein LOC101214479 isoform X1 [Cucumis sativus]1.5e-16751.32Show/hide
Query:  NSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLA-------------SNSSKFQSEPTKQICGFVEESETATSF
        N   +  S +   +S+FGAII  F R                 I   YGT    VDSVL               NSSK+Q EPT QI GF+++SET   F
Subjt:  NSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLA-------------SNSSKFQSEPTKQICGFVEESETATSF

Query:  VRELYCDSSPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQV
        V+E    +SPSS  NQ PD DF+ +  K L E   G     +E F  E+  GLE  D                           + +GLE          
Subjt:  VRELYCDSSPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQV

Query:  LTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSES
                  L + E L             + + D  +   V   G DCS                                EV TL  D S   SDS+ 
Subjt:  LTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSES

Query:  ESLSFDEEFIELELE--PQLQV-SNAKILPVN-------EDGFGESKETDWDSKEMGF-------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTV
        ES  FDEE+IE+ELE  P L V +NAKILPVN       +D   E  ET+ D K M F       EEE+EEE+F +EHQDLINQLK ELRNSRTGGLPTV
Subjt:  ESLSFDEEFIELELE--PQLQV-SNAKILPVN-------EDGFGESKETDWDSKEMGF-------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTV

Query:  LEE--EGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR-AEKGS
         EE  EGE  S+CPTSVE LKPLK   +FE K+H++EI KVYKTYA+KMRKLDI N QTNYAIG VKLKDP  S+DGKKS +KS+ P KLR GR   K  
Subjt:  LEE--EGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR-AEKGS

Query:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN
         +L RDLKRD+EM+YVGH+CLSWE+L+WQHRKA +LQQN SR  S+++RVV+EFQLF ILIQRFIEDE+F GPRI+NY +NRL +RSLLQVPAIR DC N
Subjt:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN

Query:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISE-
        DKK RGKE ES ISTAALVSIIE++M VFREF++ +K V NS IK A  QL         +AQ M+MEIR+ L+KKE+RLK+I+RSGNCI KK +RI E 
Subjt:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISE-

Query:  EGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        EGRVK ELLIAEVELKL+SRV+SMSRLTE QLIWCHKKLHQ+NFV+RKV++EPSF LFPC
Subjt:  EGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

XP_022946773.1 uncharacterized protein LOC111450741 [Cucurbita moschata]2.6e-18854.8Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET
        M PL  SF+  LS+F+  VSVF AII + F   V +             R   +S      H   G +   + SV+ASNSSKFQ E T QI GFVEESET
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET

Query:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL
           FV ELYCD+S SS  NQ PDD                                   D + +SGK L  F  ET EGF     D+ K  L S D    
Subjt:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL

Query:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD
                         E+L   +E +D + N                                     DG I       F  LPEV T  GDCS   SD
Subjt:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD

Query:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE------DFFEEHQDLINQLKFELRNSRTGGLP
        S+SES SFDEEF  IELELEP+LQVS NA++ PVN+       D  GE  ET+ D K M FEE++EEE      +F +EHQDLI QLK ELRNSRTGGLP
Subjt:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE------DFFEEHQDLINQLKFELRNSRTGGLP

Query:  TVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGS
        TV  EEE  PE + PTSVEALKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI  +KLKDP SS++ KKS +KS++ +KLR+GRA KG 
Subjt:  TVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGS

Query:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN
          L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+LQQN +R TS+Y+RVV+EFQ FCIL+QRFIEDE F GPRI NYVKNRLLVRSLLQVPAIREDC +
Subjt:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN

Query:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRIS--
        DKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDVG++ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR GNCIVKKL+R+S  
Subjt:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRIS--

Query:  EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        EEGR+K ELLIAEVELKL+SRV+SMSRLTE QL+WCHKKLHQ+NFV+RKV+VEPSF  FPC
Subjt:  EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

XP_022999382.1 uncharacterized protein LOC111493771 [Cucurbita maxima]2.0e-18554.52Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPF-ICRSLFS--SRSNPTIHS-------EYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESETA
        M PL  SF+  LS+F+  VSVF AII + F   V +    S+ +       T+ S       E   +   + SV+ASNSSKFQ E T QI GFVEESET 
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPF-ICRSLFS--SRSNPTIHS-------EYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESETA

Query:  TSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALP
          FV ELYCD+SPSS  NQ PDD                                   D E +SGK L  F  ET EGF     D+ K  L S D     
Subjt:  TSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALP

Query:  QVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDS
                        E+L   +E +D + N                                     DG I       F  LPEV T  GDCS   SDS
Subjt:  QVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDS

Query:  ESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE----------DFFEEHQDLINQLKFELRNSRTG
        +SES SFDEEF  IELELEP+L VS NA++ PVN+       D   ES ET+ D K M FEEE+EEE          +F +EHQDLI QLK ELRNSRTG
Subjt:  ESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE----------DFFEEHQDLINQLKFELRNSRTG

Query:  GLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE
        GLPTV  EEE  PE + PTSVE LKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI F+KLKDP SS+D KKS +KS++ HKLR+ RA 
Subjt:  GLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE

Query:  KGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIRED
        KG   L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+LQQN +R TS+Y+RVV+EFQ FCILIQRF+EDE F GPRI NYVKNRLLVRSLLQVPAIRED
Subjt:  KGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIRED

Query:  CANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI
        C NDKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDV ++ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR G+CIVKKL+R+
Subjt:  CANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI

Query:  S--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP
        S  EEGR+K ELLIAEVELKL+SRV+SM RLTE QL+WCHKKLHQ+NFV+RKV++EPSF  FP
Subjt:  S--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP

XP_023546981.1 uncharacterized protein LOC111805919 [Cucurbita pepo subsp. pepo]1.2e-18554.12Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET
        M PL  SF+  LS+F+  VSVF A+I + F   V +             R   +S      H   G +   + SV+ASNSSKFQ E T QI GFVEESET
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET

Query:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL
           FV ELYCD+SPSS  NQ PDD                                   D +  SGK L  F  ET EGF     D+ K  L S D    
Subjt:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL

Query:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD
                         E+L   +E +D + N                                     DG I       F  LPEV T  GDCS   SD
Subjt:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD

Query:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE-----------DFFEEHQDLINQLKFELRNSR
        S+SES SFDEEF  IELELEP+L VS NA++ PVN+       D  GE  ET+ D K M FEEE+EEE           +F +EHQDLI QLK ELRNSR
Subjt:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE-----------DFFEEHQDLINQLKFELRNSR

Query:  TGGLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR
        TGGLPTV  EEE  PE + PTSVEALKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI  +KLKDP SS+D KKS +KS++ HKLR+GR
Subjt:  TGGLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR

Query:  AEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIR
        A KG   L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+L QN +R TS+Y+RVV+EFQ FCIL+QRFIEDE F GPRI NYVKNRLLVRSLLQVPAIR
Subjt:  AEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIR

Query:  EDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQ
        EDC +DKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDVGN+ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR GNCIVKK +
Subjt:  EDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQ

Query:  RIS--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP
        R+S  EEGR+K ELL+AEVE+KL+SRV+SMSRLTE QL+WCHKKLHQ+ FV+RKV+VEPSF  FP
Subjt:  RIS--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP

TrEMBL top hitse value%identityAlignment
A0A0A0LFQ1 Uncharacterized protein7.1e-16851.32Show/hide
Query:  NSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLA-------------SNSSKFQSEPTKQICGFVEESETATSF
        N   +  S +   +S+FGAII  F R                 I   YGT    VDSVL               NSSK+Q EPT QI GF+++SET   F
Subjt:  NSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLA-------------SNSSKFQSEPTKQICGFVEESETATSF

Query:  VRELYCDSSPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQV
        V+E    +SPSS  NQ PD DF+ +  K L E   G     +E F  E+  GLE  D                           + +GLE          
Subjt:  VRELYCDSSPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQV

Query:  LTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSES
                  L + E L             + + D  +   V   G DCS                                EV TL  D S   SDS+ 
Subjt:  LTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSES

Query:  ESLSFDEEFIELELE--PQLQV-SNAKILPVN-------EDGFGESKETDWDSKEMGF-------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTV
        ES  FDEE+IE+ELE  P L V +NAKILPVN       +D   E  ET+ D K M F       EEE+EEE+F +EHQDLINQLK ELRNSRTGGLPTV
Subjt:  ESLSFDEEFIELELE--PQLQV-SNAKILPVN-------EDGFGESKETDWDSKEMGF-------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTV

Query:  LEE--EGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR-AEKGS
         EE  EGE  S+CPTSVE LKPLK   +FE K+H++EI KVYKTYA+KMRKLDI N QTNYAIG VKLKDP  S+DGKKS +KS+ P KLR GR   K  
Subjt:  LEE--EGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR-AEKGS

Query:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN
         +L RDLKRD+EM+YVGH+CLSWE+L+WQHRKA +LQQN SR  S+++RVV+EFQLF ILIQRFIEDE+F GPRI+NY +NRL +RSLLQVPAIR DC N
Subjt:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN

Query:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISE-
        DKK RGKE ES ISTAALVSIIE++M VFREF++ +K V NS IK A  QL         +AQ M+MEIR+ L+KKE+RLK+I+RSGNCI KK +RI E 
Subjt:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISE-

Query:  EGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        EGRVK ELLIAEVELKL+SRV+SMSRLTE QLIWCHKKLHQ+NFV+RKV++EPSF LFPC
Subjt:  EGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

A0A1S3CB94 uncharacterized protein LOC1034987341.6e-16751.66Show/hide
Query:  NFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLAS-------------NSSKFQSEPTKQICGFVEESETATSFVRELYCDS
        NF+  +S+FGAII +F R                 I   YGT    VDSVL +             NSSK+Q EPT QI GF++ESET T FV+E    +
Subjt:  NFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLAS-------------NSSKFQSEPTKQICGFVEESETATSFVRELYCDS

Query:  SPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQVLTLGEDCS
        SPSS + Q PD DF+ +  K L E   G     +E    E+  GLE  D                           + +GL                   
Subjt:  SPSSGNNQFPD-DFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQVLTLGEDCS

Query:  VSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSESESLSFDEE
        VS  D+   + VE+F                      G DC                                PEV TL  D S   SDS+SES SFDEE
Subjt:  VSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSESESLSFDEE

Query:  FIELELE--PQLQVS-NAKILPV-------NEDGFGESKETDWDSKEMGF------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEE--EGEP
        ++E+ELE  P+L VS NAK+LPV       N+D   E  ET+   K M F      +EE+ EE+F +EHQDLINQLK ELRNSRTGGLPTV EE  EGE 
Subjt:  FIELELE--PQLQVS-NAKILPV-------NEDGFGESKETDWDSKEMGF------EEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEE--EGEP

Query:  ESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE-KGSTKLIRDLKR
         S+CPT+VE LKPLK   +FE K+H++EI KVYKTY +KMRKLDI N QTNYAIG VKLKDP  S+DGKKS +KS+ P KLR  R + KG   L RDLKR
Subjt:  ESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE-KGSTKLIRDLKR

Query:  DLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEG
        D+EM+YVGH+CLSWE+L+WQHRKA +LQQN SR  SQ++RV +EFQLF ILIQRFIEDE+F GPRI+NY +NRL +RSLLQVPAIR DC NDKK RGKE 
Subjt:  DLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEG

Query:  ESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI-SEEGRVKKELL
        ES ISTAALVSIIE++M VFREF++ DK V NS IK A VQL         +AQ M+MEIR  L+KKE+RLK+I+RSGNCI KK +RI  EEGRVK ELL
Subjt:  ESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI-SEEGRVKKELL

Query:  IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        IAEVELKL+SRV+SMSRLTE QLIWCHKKLHQ+NFV+RKV++EPSF LFPC
Subjt:  IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

A0A6J1G4S1 uncharacterized protein LOC1114507411.2e-18854.8Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET
        M PL  SF+  LS+F+  VSVF AII + F   V +             R   +S      H   G +   + SV+ASNSSKFQ E T QI GFVEESET
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPFIC-----------RSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESET

Query:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL
           FV ELYCD+S SS  NQ PDD                                   D + +SGK L  F  ET EGF     D+ K  L S D    
Subjt:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL

Query:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD
                         E+L   +E +D + N                                     DG I       F  LPEV T  GDCS   SD
Subjt:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD

Query:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE------DFFEEHQDLINQLKFELRNSRTGGLP
        S+SES SFDEEF  IELELEP+LQVS NA++ PVN+       D  GE  ET+ D K M FEE++EEE      +F +EHQDLI QLK ELRNSRTGGLP
Subjt:  SESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE------DFFEEHQDLINQLKFELRNSRTGGLP

Query:  TVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGS
        TV  EEE  PE + PTSVEALKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI  +KLKDP SS++ KKS +KS++ +KLR+GRA KG 
Subjt:  TVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGS

Query:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN
          L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+LQQN +R TS+Y+RVV+EFQ FCIL+QRFIEDE F GPRI NYVKNRLLVRSLLQVPAIREDC +
Subjt:  TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCAN

Query:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRIS--
        DKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDVG++ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR GNCIVKKL+R+S  
Subjt:  DKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRIS--

Query:  EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        EEGR+K ELLIAEVELKL+SRV+SMSRLTE QL+WCHKKLHQ+NFV+RKV+VEPSF  FPC
Subjt:  EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

A0A6J1KAQ8 uncharacterized protein LOC1114937719.9e-18654.52Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPF-ICRSLFS--SRSNPTIHS-------EYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESETA
        M PL  SF+  LS+F+  VSVF AII + F   V +    S+ +       T+ S       E   +   + SV+ASNSSKFQ E T QI GFVEESET 
Subjt:  MFPLPNSFQICLSNFIASVSVFGAII-IHFSRHVPF-ICRSLFS--SRSNPTIHS-------EYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESETA

Query:  TSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALP
          FV ELYCD+SPSS  NQ PDD                                   D E +SGK L  F  ET EGF     D+ K  L S D     
Subjt:  TSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALP

Query:  QVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDS
                        E+L   +E +D + N                                     DG I       F  LPEV T  GDCS   SDS
Subjt:  QVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDS

Query:  ESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE----------DFFEEHQDLINQLKFELRNSRTG
        +SES SFDEEF  IELELEP+L VS NA++ PVN+       D   ES ET+ D K M FEEE+EEE          +F +EHQDLI QLK ELRNSRTG
Subjt:  ESESLSFDEEF--IELELEPQLQVS-NAKILPVNE-------DGFGESKETDWDSKEMGFEEEKEEE----------DFFEEHQDLINQLKFELRNSRTG

Query:  GLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE
        GLPTV  EEE  PE + PTSVE LKPLKNGG+FE +  +KEI KVYKTYA KMRKLD+ NTQTNYAI F+KLKDP SS+D KKS +KS++ HKLR+ RA 
Subjt:  GLPTVL-EEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAE

Query:  KGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIRED
        KG   L+RDLKRD+EM+YVGH+CLSWEVL+WQHRKAI+LQQN +R TS+Y+RVV+EFQ FCILIQRF+EDE F GPRI NYVKNRLLVRSLLQVPAIRED
Subjt:  KGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIRED

Query:  CANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI
        C NDKKLRGKEGES ISTAALVS+IEE+M VFR+F++ DKDV ++ IK A V++         +AQ M+MEIRTELRKKE+RLK+IVR G+CIVKKL+R+
Subjt:  CANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRI

Query:  S--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP
        S  EEGR+K ELLIAEVELKL+SRV+SM RLTE QL+WCHKKLHQ+NFV+RKV++EPSF  FP
Subjt:  S--EEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFP

A0A6J1KH61 uncharacterized protein LOC111493177 isoform X12.8e-15647.59Show/hide
Query:  MFPLPNSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVL------------ASNSSKFQSEPTKQICGFVEESET
        M P  NSF+  L N I  VSVFGAII++F R+V +I R+LF+S SN     +YG   + V SVL            +SNSSKFQ +PT QI GF+EES+T
Subjt:  MFPLPNSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVL------------ASNSSKFQSEPTKQICGFVEESET

Query:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL
         + FV++L+CD +P                                                                                      
Subjt:  ATSFVRELYCDSSPSSGNNQFPDDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGAL

Query:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD
                                                                                                            
Subjt:  PQVLTLGEDCSVSLSDSESLSFVEEFKDEQINGLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSD

Query:  SESESLSFDEEFIELELEPQLQVSNAKILPVNEDGFGESKETDWDSKEMGFEEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTS
               FD+ FIE E E +  VSNA+ L VN+    E++E+         EE KE E    EHQDLINQLK ELRNSRTG LPTVLEEE EPE   P S
Subjt:  SESESLSFDEEFIELELEPQLQVSNAKILPVNEDGFGESKETDWDSKEMGFEEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTS

Query:  VE--ALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGSTKLIRDLKRDLEMIY
        +   ALKPLKNGG+FEQKEH+KEI  +YKTY DKMRKLDIFNTQTNYAIGFVKLK+P    DG+KSS+KS+    LRSGRA+KG  KLI+D+KRD+EM+Y
Subjt:  VE--ALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEKGSTKLIRDLKRDLEMIY

Query:  VGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNIST
        VGH+CLSWE+L WQHRKA QLQQN     SQ+S+VV+EFQLFCILIQRFIEDE F+GPRIENY KNR LVR+LLQVPAIREDC NDK L      S IST
Subjt:  VGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNIST

Query:  AALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQ--LHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVE
        A LVSIIE+++ VFRE +Q+DKD+GNSGIK    Q  +H  D QNPFDA++M++++RT+L+KKEKRLKD++RSGNCIVKKLQR+SE+G  + ELLIAEVE
Subjt:  AALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQ--LHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVE

Query:  LKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        L+LISRV+SMSRLTE QLIWCHKKLHQV FV  KV +EPSFLLFPC
Subjt:  LKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)3.5e-8240.58Show/hide
Query:  SVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGFGESKETDWDSKE----
        S+ L  SES  F++E +  ++   E    G+L E L        +L   E      D+ FI  EL P LQ+SN          +GE +E ++  +E    
Subjt:  SVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGFGESKETDWDSKE----

Query:  MGFEEEKEEEDFFE----------EHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTSVEALKPLKNGGSFEQ-KEHYKEILKVYKTYADKMRKLD
        MGF+E++E  D F+          EH D+I +LK ELR +RTGGL T+LEE   P       ++ LKPLK     +Q K+   EI KVYK YA KMRKLD
Subjt:  MGFEEEKEEEDFFE----------EHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTSVEALKPLKNGGSFEQ-KEHYKEILKVYKTYADKMRKLD

Query:  IFNTQTNYAIGFVKLKD---PTSSID--GKKSSIKSMLPHKLRSGRAEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSR
        + ++QT ++I  +KLKD   P+ + D   K S  +++ P K  +   +  S +L+++  RD E +YVG +CLSWE+L WQ+ K ++   +S   T QY+ 
Subjt:  IFNTQTNYAIGFVKLKD---PTSSID--GKKSSIKSMLPHKLRSGRAEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSR

Query:  VVSEFQLFCILIQRFIEDERF-SGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSAN
        V  EFQLF +L+QRF+E+E F +  R+E Y+KNR   ++ LQ+P +R+D ++ KK R  EGE  + T  L  II E+M VF EF+  DKD   S +K + 
Subjt:  VVSEFQLFCILIQRFIEDERF-SGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSAN

Query:  VQLHHTDI--QNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGR--VKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFV
           H T +  Q+  D + ++ +IRT L+KKEK+LK+I RS +CIVKKL++   +    VK ELLIA++EL+L+SRVM MS+LT  +L WC +KL +++F 
Subjt:  VQLHHTDI--QNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGR--VKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFV

Query:  DRKVLVEPSFLLFPC
         RK+ +EP F L PC
Subjt:  DRKVLVEPSFLLFPC

AT1G73850.1 Protein of unknown function (DUF1666)4.6e-2629.73Show/hide
Query:  EILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEK------GSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHR
        E   V++ Y ++M  L   + Q  +    +K      SI  +  SI   + HKL S   +K      GS     +   +LE  YV  ICL+WE L+W ++
Subjt:  EILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGRAEK------GSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHR

Query:  KAIQLQQNSSRRTSQY---SRVVSEFQLFCILIQRFIEDERFS-GPRIENYVKNRLLVRSLLQVPAIRE-DCANDKKLRGKEG-ESNISTAALVSIIEEA
           + +  + R  +     + +  +F+ F IL+QR++E+E +  G R E Y + R L   LL VP  ++ +   +K+   +EG  S IS+A+ + I+EE 
Subjt:  KAIQLQQNSSRRTSQY---SRVVSEFQLFCILIQRFIEDERFS-GPRIENYVKNRLLVRSLLQVPAIRE-DCANDKKLRGKEG-ESNISTAALVSIIEEA

Query:  MHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVELKLISRVMSMSR
        +  F  F+Q DK+     I  A         + P D  ++ +  +   +KK K LK++ R G  + KK   I EE     E+L+  ++LK++SRV+ M+ 
Subjt:  MHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVELKLISRVMSMSR

Query:  LTEGQLIWCHKKLHQVNFVD-RKVLVEPSFLLF
        + E  L WC +K+ +V  +   KVL   S  LF
Subjt:  LTEGQLIWCHKKLHQVNFVD-RKVLVEPSFLLF

AT3G20260.1 Protein of unknown function (DUF1666)1.2e-2927.08Show/hide
Query:  KILPVNEDGFGESKETDWDSKEMGFEEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILK--
        KIL   E    E+ E        G E E +++DF      + N++K  L+  R      ++ EE E E       E     ++  S E ++   E L+  
Subjt:  KILPVNEDGFGESKETDWDSKEMGFEEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILK--

Query:  -----VYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR--------AEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNW
             VY+ Y ++M   D  ++Q     G + +    S+   + +S K   P +  S +         E      + D  +DLE  YV  +CL+WE L+ 
Subjt:  -----VYKTYADKMRKLDIFNTQTNYAIGFVKLKDPTSSIDGKKSSIKSMLPHKLRSGR--------AEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNW

Query:  QHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERF-SGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMH
        Q+ +   L        + Y+     FQ F +L+QR+IE+E F  G R E Y + R  +  LLQ P I+    +DKK   K+    +    L+ +IE ++ 
Subjt:  QHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERF-SGPRIENYVKNRLLVRSLLQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMH

Query:  VFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVELKLISRVMSMSRLT
         F  F++ DK   N GI      L      N  ++   ++ +++ + KK  + K++ +    + KK    + EG    +LL A +++KL +RV+ MS+++
Subjt:  VFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNCIVKKLQRISEEGRVKKELLIAEVELKLISRVMSMSRLT

Query:  EGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        + QL+WC +K+ ++NF   K+   PS +LFPC
Subjt:  EGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

AT5G39785.1 Protein of unknown function (DUF1666)1.8e-7536.02Show/hide
Query:  SLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGF-------------GESKETDWDSKEMGFEEEKEEE-DFFE---EHQDLINQLKFELRNSR
        S+S  + + + F  E   LE +     S+      +EDGF             G+++++D        EEE+EE+ + FE   EHQDLI QLK E++  +
Subjt:  SLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGF-------------GESKETDWDSKEMGFEEEKEEE-DFFE---EHQDLINQLKFELRNSR

Query:  T-GGLPTVLEEEGEPESICPTSVEALKP--LKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDP---TSSIDGKKS--SIKSMLP
          GGL T+LEEE E +  CP  +E LKP  ++    F+  +   E+ K +++Y ++MRKLDI + Q +YA+G ++ K P   TS++    S  S  S+  
Subjt:  T-GGLPTVLEEEGEPESICPTSVEALKP--LKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDP---TSSIDGKKS--SIKSMLP

Query:  HKLRSGRAEKGS----TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLL
          +R  +A+K       + +++++ +LE +YVG +CLSWE+L+WQ+ KAI+L ++    + +Y+ V  EFQ F +L+QRF+E+E F  PR+++Y+K R +
Subjt:  HKLRSGRAEKGS----TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLL

Query:  VRSLLQVPAIREDCANDKKL-RGKEGESN----ISTAALVSIIEEAMHVFREFIQTDKDVGN-SGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEK
        +R+LLQ+P IRED   DKK  R ++ E N    I +  LV I+EE + +F  F++ DK   +    KS        D +   +   M  E++++L+ KEK
Subjt:  VRSLLQVPAIREDCANDKKL-RGKEGESN----ISTAALVSIIEEAMHVFREFIQTDKDVGN-SGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEK

Query:  RLKDIVRSGNCIVKKLQRISEEGRVKKELL--IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        RL+D+++S  CI+++ Q+  EE   + ++L   ++V++KL++RV++MS+LT   L+WCH KL ++NFV+R++ ++PSF LFPC
Subjt:  RLKDIVRSGNCIVKKLQRISEEGRVKKELL--IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC

AT5G39785.2 Protein of unknown function (DUF1666)1.7e-7335.74Show/hide
Query:  SLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGF-------------GESKETDWDSKEMGFEEEKEEE-DFFE---EHQDLINQLKFELRNSR
        S+S  + + + F  E   LE +     S+      +EDGF             G+++++D        EEE+EE+ + FE   EHQDLI QLK E++  +
Subjt:  SLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGF-------------GESKETDWDSKEMGFEEEKEEE-DFFE---EHQDLINQLKFELRNSR

Query:  T-GGLPTVLEEEGEPESICPTSVEALKP--LKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDP---TSSIDGKKS--SIKSMLP
          GGL T+LEEE E +  CP  +E LKP  ++    F+  +   E+ K +++Y ++MRKLDI + Q +YA+G ++ K P   TS++    S  S  S+  
Subjt:  T-GGLPTVLEEEGEPESICPTSVEALKP--LKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKLKDP---TSSIDGKKS--SIKSMLP

Query:  HKLRSGRAEKGS----TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLL
          +R  +A+K       + +++++ +LE +YVG +CLSWE+L+WQ+ KAI+L ++    + +Y+ V  EFQ F +L+QRF+E+E F  PR+++Y+K R +
Subjt:  HKLRSGRAEKGS----TKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYVKNRLL

Query:  VRSLLQVPAIREDCANDKKL-RGKEGESN----ISTAALVSIIEEAMHVFREFIQTDKDVGN-SGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRK-KE
        +R+LLQ+P IRED   DKK  R ++ E N    I +  LV I+EE + +F  F++ DK   +    KS        D +   +   M  E++++L+   E
Subjt:  VRSLLQVPAIREDCANDKKL-RGKEGESN----ISTAALVSIIEEAMHVFREFIQTDKDVGN-SGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRK-KE

Query:  KRLKDIVRSGNCIVKKLQRISEEGRVKKELL--IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC
        KRL+D+++S  CI+++ Q+  EE   + ++L   ++V++KL++RV++MS+LT   L+WCH KL ++NFV+R++ ++PSF LFPC
Subjt:  KRLKDIVRSGNCIVKKLQRISEEGRVKKELL--IAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCCTCTCCCCAATTCTTTCCAAATTTGCCTCTCGAACTTCATCGCTTCAGTCTCTGTATTTGGCGCAATAATCATACACTTCTCCAGGCATGTTCCCTTCATTTG
TCGCTCCCTTTTCAGTTCCCGATCAAATCCCACCATTCATTCCGAATATGGAACAAAACCCATTTCTGTCGATTCTGTTTTGGCGTCGAATTCAAGCAAATTTCAATCGG
AACCCACCAAGCAAATTTGTGGGTTCGTGGAAGAATCTGAAACTGCCACCTCTTTTGTCCGAGAATTGTACTGTGATTCTTCACCTTCTTCAGGTAATAATCAATTTCCT
GATGATTTTGATGTTTTCTGTGGAAAAGAATTGTGGGAATTTGGTTTAGGAACGGGAGAAGGTTCTGTTGAGGAATTTAGAGATGAACAGATTATGGGTTTAGAATCTTG
TGATCTTGAAATTTTCAGCGGAAAAAATTTGTGGGGTTTTGGTTTAGAAACGGGAGAAGGTTTTGTTAAGGGATTTGGAGATGAACAGAAGAAGGGCTTAGAATCTTGTG
ATTTTGGTGCGTTACCTCAAGTTTTAACACTTGGGGAAGATTGTTCTGTTTCTTTATCTGATTCTGAATCTTTAAGTTTTGTTGAGGAATTTAAAGATGAACAGATTAAT
GGCCTAGAATCTTGTGATTTTGGTTTGTTACCTGGAGTTTTAACACTTGGGGAAGATTGTTCTGTTTCTTTATCTGATTCTGAATCTTCAAGTTTTGTTGAGGAATTTAT
AGATGGACAGATTAAGGATCTAGAATCTTGTGATTTTGGTCTGTTACCTGAAGTTTTAACACTTGGGGGAGATTGTTCTGTTTCATTATCTGATTCTGAATCCGAATCTC
TAAGTTTTGATGAAGAGTTCATAGAATTAGAGCTAGAACCCCAATTGCAGGTTTCAAATGCCAAAATTTTGCCTGTAAATGAAGATGGATTTGGGGAATCAAAAGAAACA
GATTGGGACAGCAAAGAGATGGGGTTTGAAGAAGAAAAAGAGGAAGAGGATTTTTTTGAAGAGCATCAAGATTTGATAAATCAGCTCAAATTTGAGCTAAGAAACTCAAG
AACAGGAGGGCTTCCAACAGTGTTGGAAGAAGAGGGTGAGCCAGAATCCATTTGTCCCACATCGGTTGAAGCTCTAAAACCTCTAAAAAATGGAGGTAGTTTTGAACAGA
AAGAGCATTACAAAGAGATCCTTAAAGTCTACAAGACTTATGCAGATAAAATGCGGAAACTCGACATCTTCAATACTCAGACAAATTATGCAATTGGTTTTGTTAAGTTG
AAAGATCCAACTAGCTCAATAGATGGAAAGAAATCTAGCATTAAATCTATGCTTCCCCACAAGCTAAGGTCAGGGAGAGCAGAGAAGGGATCTACAAAATTGATTAGAGA
CTTAAAGAGGGACTTGGAAATGATCTATGTTGGACATATTTGCTTGTCTTGGGAAGTTCTTAATTGGCAGCATAGGAAGGCCATACAATTGCAACAAAATAGCTCTCGAC
GAACCTCCCAGTACAGTCGAGTCGTTAGCGAGTTTCAACTCTTCTGCATCCTCATTCAAAGATTCATCGAGGACGAGCGGTTTTCCGGCCCTAGAATCGAAAACTACGTG
AAAAACCGACTTCTCGTTCGTAGTCTTCTTCAAGTTCCTGCCATTCGAGAGGACTGTGCAAATGACAAAAAGCTGAGAGGCAAAGAAGGTGAAAGTAACATTTCAACTGC
AGCTCTAGTATCAATCATTGAAGAAGCAATGCATGTTTTTCGAGAATTTATACAGACAGATAAAGACGTCGGGAATTCGGGTATCAAGAGTGCTAATGTACAACTTCATC
ACACTGATATCCAAAACCCTTTTGATGCACAAGTTATGATTATGGAAATAAGAACTGAACTAAGAAAGAAAGAGAAAAGGCTAAAAGACATAGTGAGAAGTGGGAATTGT
ATAGTAAAGAAGTTGCAGAGGATTAGTGAAGAAGGCAGAGTGAAGAAGGAATTGTTGATAGCTGAGGTTGAATTGAAATTGATATCAAGAGTTATGAGCATGTCAAGATT
AACAGAGGGCCAATTGATTTGGTGCCATAAAAAGCTACACCAAGTCAATTTTGTGGATAGGAAGGTTCTTGTAGAGCCCTCATTTTTGCTTTTCCCTTGTTGA
mRNA sequenceShow/hide mRNA sequence
CGGCCTTTCTTTTCCACCCTCTCTGCAATATTCCCAACGGCTCGATTCCACCAATGGCTACCAAACCCTGTTTTCATTCCCTCAATTATTCGTTTCTTATATCAGTTTTT
GATCCATAAATATCATTAAAAACATAATATGTTTCCTCTCCCCAATTCTTTCCAAATTTGCCTCTCGAACTTCATCGCTTCAGTCTCTGTATTTGGCGCAATAATCATAC
ACTTCTCCAGGCATGTTCCCTTCATTTGTCGCTCCCTTTTCAGTTCCCGATCAAATCCCACCATTCATTCCGAATATGGAACAAAACCCATTTCTGTCGATTCTGTTTTG
GCGTCGAATTCAAGCAAATTTCAATCGGAACCCACCAAGCAAATTTGTGGGTTCGTGGAAGAATCTGAAACTGCCACCTCTTTTGTCCGAGAATTGTACTGTGATTCTTC
ACCTTCTTCAGGTAATAATCAATTTCCTGATGATTTTGATGTTTTCTGTGGAAAAGAATTGTGGGAATTTGGTTTAGGAACGGGAGAAGGTTCTGTTGAGGAATTTAGAG
ATGAACAGATTATGGGTTTAGAATCTTGTGATCTTGAAATTTTCAGCGGAAAAAATTTGTGGGGTTTTGGTTTAGAAACGGGAGAAGGTTTTGTTAAGGGATTTGGAGAT
GAACAGAAGAAGGGCTTAGAATCTTGTGATTTTGGTGCGTTACCTCAAGTTTTAACACTTGGGGAAGATTGTTCTGTTTCTTTATCTGATTCTGAATCTTTAAGTTTTGT
TGAGGAATTTAAAGATGAACAGATTAATGGCCTAGAATCTTGTGATTTTGGTTTGTTACCTGGAGTTTTAACACTTGGGGAAGATTGTTCTGTTTCTTTATCTGATTCTG
AATCTTCAAGTTTTGTTGAGGAATTTATAGATGGACAGATTAAGGATCTAGAATCTTGTGATTTTGGTCTGTTACCTGAAGTTTTAACACTTGGGGGAGATTGTTCTGTT
TCATTATCTGATTCTGAATCCGAATCTCTAAGTTTTGATGAAGAGTTCATAGAATTAGAGCTAGAACCCCAATTGCAGGTTTCAAATGCCAAAATTTTGCCTGTAAATGA
AGATGGATTTGGGGAATCAAAAGAAACAGATTGGGACAGCAAAGAGATGGGGTTTGAAGAAGAAAAAGAGGAAGAGGATTTTTTTGAAGAGCATCAAGATTTGATAAATC
AGCTCAAATTTGAGCTAAGAAACTCAAGAACAGGAGGGCTTCCAACAGTGTTGGAAGAAGAGGGTGAGCCAGAATCCATTTGTCCCACATCGGTTGAAGCTCTAAAACCT
CTAAAAAATGGAGGTAGTTTTGAACAGAAAGAGCATTACAAAGAGATCCTTAAAGTCTACAAGACTTATGCAGATAAAATGCGGAAACTCGACATCTTCAATACTCAGAC
AAATTATGCAATTGGTTTTGTTAAGTTGAAAGATCCAACTAGCTCAATAGATGGAAAGAAATCTAGCATTAAATCTATGCTTCCCCACAAGCTAAGGTCAGGGAGAGCAG
AGAAGGGATCTACAAAATTGATTAGAGACTTAAAGAGGGACTTGGAAATGATCTATGTTGGACATATTTGCTTGTCTTGGGAAGTTCTTAATTGGCAGCATAGGAAGGCC
ATACAATTGCAACAAAATAGCTCTCGACGAACCTCCCAGTACAGTCGAGTCGTTAGCGAGTTTCAACTCTTCTGCATCCTCATTCAAAGATTCATCGAGGACGAGCGGTT
TTCCGGCCCTAGAATCGAAAACTACGTGAAAAACCGACTTCTCGTTCGTAGTCTTCTTCAAGTTCCTGCCATTCGAGAGGACTGTGCAAATGACAAAAAGCTGAGAGGCA
AAGAAGGTGAAAGTAACATTTCAACTGCAGCTCTAGTATCAATCATTGAAGAAGCAATGCATGTTTTTCGAGAATTTATACAGACAGATAAAGACGTCGGGAATTCGGGT
ATCAAGAGTGCTAATGTACAACTTCATCACACTGATATCCAAAACCCTTTTGATGCACAAGTTATGATTATGGAAATAAGAACTGAACTAAGAAAGAAAGAGAAAAGGCT
AAAAGACATAGTGAGAAGTGGGAATTGTATAGTAAAGAAGTTGCAGAGGATTAGTGAAGAAGGCAGAGTGAAGAAGGAATTGTTGATAGCTGAGGTTGAATTGAAATTGA
TATCAAGAGTTATGAGCATGTCAAGATTAACAGAGGGCCAATTGATTTGGTGCCATAAAAAGCTACACCAAGTCAATTTTGTGGATAGGAAGGTTCTTGTAGAGCCCTCA
TTTTTGCTTTTCCCTTGTTGACTTTAGAAATCACAGCAGCTTGTACATATATACAATATACATCAATCAATATAATAAAGGATTCCAAATATTTTGTTTTTTATGAATCA
ATCAATCTAAAGGATTCTAA
Protein sequenceShow/hide protein sequence
MFPLPNSFQICLSNFIASVSVFGAIIIHFSRHVPFICRSLFSSRSNPTIHSEYGTKPISVDSVLASNSSKFQSEPTKQICGFVEESETATSFVRELYCDSSPSSGNNQFP
DDFDVFCGKELWEFGLGTGEGSVEEFRDEQIMGLESCDLEIFSGKNLWGFGLETGEGFVKGFGDEQKKGLESCDFGALPQVLTLGEDCSVSLSDSESLSFVEEFKDEQIN
GLESCDFGLLPGVLTLGEDCSVSLSDSESSSFVEEFIDGQIKDLESCDFGLLPEVLTLGGDCSVSLSDSESESLSFDEEFIELELEPQLQVSNAKILPVNEDGFGESKET
DWDSKEMGFEEEKEEEDFFEEHQDLINQLKFELRNSRTGGLPTVLEEEGEPESICPTSVEALKPLKNGGSFEQKEHYKEILKVYKTYADKMRKLDIFNTQTNYAIGFVKL
KDPTSSIDGKKSSIKSMLPHKLRSGRAEKGSTKLIRDLKRDLEMIYVGHICLSWEVLNWQHRKAIQLQQNSSRRTSQYSRVVSEFQLFCILIQRFIEDERFSGPRIENYV
KNRLLVRSLLQVPAIREDCANDKKLRGKEGESNISTAALVSIIEEAMHVFREFIQTDKDVGNSGIKSANVQLHHTDIQNPFDAQVMIMEIRTELRKKEKRLKDIVRSGNC
IVKKLQRISEEGRVKKELLIAEVELKLISRVMSMSRLTEGQLIWCHKKLHQVNFVDRKVLVEPSFLLFPC