; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G022500 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G022500
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein SIEL isoform X1
Genome locationchr03:33464908..33473401
RNA-Seq ExpressionLsi03G022500
SyntenyLsi03G022500
Gene Ontology termsNA
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017202.1 Protein SIEL [Cucurbita argyrosperma subsp. argyrosperma]1.9e-28467.77Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRVVIT GLMLAAHSPERKQ  SDEIF  +CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ               VIAYIVLAISAPVLD H+LRIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------
        AATLLGRISHAL DIMDQ T+FAYLLQNSK+ GLSDLGFNPEG PCS TP S +NDILAIAS KTPA IHE++      ++     + K  DI+      
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------

Query:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI
             LRT                                                 +G                                   L T  +
Subjt:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI

Query:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
         L N                       LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEK LY
Subjt:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

XP_004147305.1 protein SIEL [Cucumis sativus]8.1e-29167.61Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE DL+L+S INE+DD+SFLSLCFGP VS R WLLNNAE+FQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGN V ED SMIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAA+RVVIT GLMLAAHSPERKQQL DEIF+  CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCST+QLE+LAL+
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM M N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH++M D +IKDV EQ               VIAYIVLAISA   DNHTLRIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------
        AATLLGRISHALGDIMDQ T+FAYLL NSKHIGLSDLGFN EG  CS T  SS+NDI AIASLK PAMIHEQ+      ++     + K  DI+      
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -LRTDVGRLYTQNISLRN-----------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
         LR   G +     +LR            LKEEC EPSTF+CEVQRSL NL TITPK  CSS   +++LKSF LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  -LRTDVGRLYTQNISLRN-----------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDN+TSQF+FLDFLSLG CDE REFMY VPF+RTPKASSFIARICIGLECWFENAEVNERRGGP  DLA
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA

Query:  YICKEKEVYLSMIHK
        YICKEKEVYLSMIHK
Subjt:  YICKEKEVYLSMIHK

XP_008463329.1 PREDICTED: protein SIEL isoform X1 [Cucumis melo]4.6e-28667.16Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RVVIT GLMLAAH+PERKQQL DEIF+  CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ               V+AYIVLAISA  LDNHTLRIPPR+FSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------
        AATLLGRISHALGDIMDQ T+FAYLL NSKHIGLSDLGFN E   CS T  SS+NDI AIASLK PAMIHEQ                            
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------

Query:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------
                 R  + ++G +    +K N        YL+                                                              
Subjt:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------

Query:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                     T + +L T   ++ N LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHK
        AYICKEKEVYLSMI K
Subjt:  AYICKEKEVYLSMIHK

XP_023527352.1 protein SIEL isoform X1 [Cucurbita pepo subsp. pepo]6.7e-28567.9Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRVVIT GLMLAAHSPERKQ  SDEIF  +CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ               VIAY VLAISAPVLD H+LRIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------
        AATLLGRISHALGDIMDQ T+FAYLLQNSK+ GLSDLGFNPEG PCS TP S +NDILAIAS KTPA IHE++      ++     + K  DI+      
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------

Query:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI
             LRT                                                 +G                                   L T  +
Subjt:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI

Query:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
         L N                       LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEKPLY
Subjt:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

XP_038894154.1 protein SIEL [Benincasa hispida]1.3e-29969.57Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        M ERDL+LVSAINELDD+SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGL GLGNTVLED SMIEGCYCR+IELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAV VVIT GLMLAAHSP RKQ LSDEIF+  CS  R+MNMKVRVNAFDA+K LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLEM ALD
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNLT+LSTKF GEALSLLMD+LNDDSV+VRL+AL TLHHM M N LKLQEAHMHMFL ALND DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        K+AK PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ               VIAYIVLAISAPV DNHT RIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------
        AAT+LGRISHALGDIMDQ TVFAYLLQNSKHIGLSDLGFNPEG PCSPTP +S+ND+ AIASLK PAMIHEQR      ++     + K  DI+      
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------

Query:  --------------------LRTD------------------------------------------VGRLYTQNISLRN---------------------
                             RTD                                          +G+L  +   LR+                     
Subjt:  --------------------LRTD------------------------------------------VGRLYTQNISLRN---------------------

Query:  -----------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEEC EPSTF+CEVQRSL NL  ITPK  CSSP F+KLLKSF L HLEISE L+H+ AELV+ DNDYEKP
Subjt:  -----------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA
        LYFVPGLPVGIPCQIILHN  S+R+LWF+ITMDNMTSQFIFLDFLSLG CDE REF YIVPF+RTPKASSFIARICIGLECWFEN EVNER GGP RDLA
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA

Query:  YICKEKEVYLSMIHK
        YICKEKEVY SMIHK
Subjt:  YICKEKEVYLSMIHK

TrEMBL top hitse value%identityAlignment
A0A1S3CIY4 protein SIEL isoform X21.1e-26965.17Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RVVIT GLMLAAH+PERKQQL DEIF+  CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQVIAYIVLAISAPVLDNHTLRIPPRIFSYAATLLGRISHALGDI
        KL K PDLVTFQLSFNGLLESLESYPQ +           +  ++ D         +V+AYIVLAISA  LDNHTLRIPPR+FSYAATLLGRISHALGDI
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQVIAYIVLAISAPVLDNHTLRIPPRIFSYAATLLGRISHALGDI

Query:  MDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ-------------------------------------RVLQGS
        MDQ T+FAYLL NSKHIGLSDLGFN E   CS T  SS+NDI AIASLK PAMIHEQ                                     R  + +
Subjt:  MDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ-------------------------------------RVLQGS

Query:  IGSI----HKKNDI------YLR---------------------------------------------------------------------------TD
        +G +    +K N        YL+                                                                           T 
Subjt:  IGSI----HKKNDI------YLR---------------------------------------------------------------------------TD

Query:  VGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLYFVPGLPVGIPCQI
        + +L T   ++ N LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKPLYFVPGLPVGIPCQI
Subjt:  VGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLYFVPGLPVGIPCQI

Query:  ILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDLAYICKEKEVYLSMIH
        ILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DLAYICKEKEVYLSMI 
Subjt:  ILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDLAYICKEKEVYLSMIH

Query:  K
        K
Subjt:  K

A0A1S3CKJ8 protein SIEL isoform X12.2e-28667.16Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RVVIT GLMLAAH+PERKQQL DEIF+  CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ               V+AYIVLAISA  LDNHTLRIPPR+FSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------
        AATLLGRISHALGDIMDQ T+FAYLL NSKHIGLSDLGFN E   CS T  SS+NDI AIASLK PAMIHEQ                            
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------

Query:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------
                 R  + ++G +    +K N        YL+                                                              
Subjt:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------

Query:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                     T + +L T   ++ N LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHK
        AYICKEKEVYLSMI K
Subjt:  AYICKEKEVYLSMIHK

A0A5A7UEC0 Protein SIEL isoform X12.2e-28667.16Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RVVIT GLMLAAH+PERKQQL DEIF+  CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIFI--CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ               V+AYIVLAISA  LDNHTLRIPPR+FSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------
        AATLLGRISHALGDIMDQ T+FAYLL NSKHIGLSDLGFN E   CS T  SS+NDI AIASLK PAMIHEQ                            
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQ----------------------------

Query:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------
                 R  + ++G +    +K N        YL+                                                              
Subjt:  ---------RVLQGSIGSI----HKKNDI------YLR--------------------------------------------------------------

Query:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                     T + +L T   ++ N LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  -------------TDVGRLYTQNISLRN-LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHK
        AYICKEKEVYLSMI K
Subjt:  AYICKEKEVYLSMIHK

A0A6J1F7A9 protein SIEL8.0e-28467.53Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRVVIT GLMLAAH PERKQ  SDEIF  +CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ               VIAYIVLAISAPVLD H+LRIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------
        AATLLGRISHAL DIMDQ T+FAYLLQNSK+ GLSDLGFNPEG PCS TP S +NDILAIAS KTPA IH+++      ++     + K  DI+      
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIY------

Query:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI
             LRT                                                 +G                                   L T  +
Subjt:  -----LRT------------------------------------------------DVGR----------------------------------LYTQNI

Query:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
         L N                       LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEK LY
Subjt:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

A0A6J1J0A1 protein SIEL2.2e-28166.79Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAE FQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLG TV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRVVIT GLMLAAHSPERKQ  SDEIF  +CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVVITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF+GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY
        KLAK PDL TFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ               V+AYIVLAISAPVLD H+LRIPPRIFSY
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ---------------VIAYIVLAISAPVLDNHTLRIPPRIFSY

Query:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIYLRTDVG
        AATLLGRISHALGDIMDQ T+FAYLLQNSK+ GLSDLGFNPEG PCS TP S +NDILAIAS KTPAMIHE++      ++     + K  DI+     G
Subjt:  AATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDILAIASLKTPAMIHEQR-----VLQGSIGSIHKKNDIYLRTDVG

Query:  RLY---------------------------------------------------------------------------------------------TQNI
         L+                                                                                             T  +
Subjt:  RLY---------------------------------------------------------------------------------------------TQNI

Query:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
         L N                       LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEIS+KLKH+ AELVI DNDYEKPLY
Subjt:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN  SER+LWF+ITMDN TSQFIFLDFL L G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+ EVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

SwissProt top hitse value%identityAlignment
Q54LH5 Integrator complex subunit 4 homolog4.1e-1123.4Show/hide
Query:  VFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRS---MIEGCYCRAIELLNDMEDCVRSAAVRVVITSGLMLAAH---SPERKQQLSDEIF--ICSTMRNM
        + L + KD    VR+A+L  LS     V+  R     +   Y   I LL D  + VR   ++++   G +   H   S   K +L D++F  IC+ + + 
Subjt:  VFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRS---MIEGCYCRAIELLNDMEDCVRSAAVRVVITSGLMLAAH---SPERKQQLSDEIF--ICSTMRNM

Query:  NMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCS-------------------------------------------------------
        ++ VR  A   +     VS + L+Q++SK V+   KGK   +  S                                                       
Subjt:  NMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCS-------------------------------------------------------

Query:  ------TEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRA
              ++ L +L   V GAF+ G+EDEF +V      D++  L+V + +F  + +  L+DI ND+  +VR+ ++ +L  +G  N++ ++E  +H+ L  
Subjt:  ------TEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRA

Query:  LNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH
        L       R +L +LL      +      +   LL +L  YP D   +   L  +GQ +
Subjt:  LNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH

Q68F70 Integrator complex subunit 41.6e-1525.46Show/hide
Query:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-----VITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNA
        D  P VR AA+  +  L    L+   + +  Y +A +LL D  + VRSAAV +      + S  ++   S   + +L D+ F  +C  + + +  VRV A
Subjt:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-----VITSGLMLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNA

Query:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL
           +  +  VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   ++L  L
Subjt:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL

Query:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD
           S  F  + L  L+D+ ND+   VRLQ++ T+  +   +++ L+E  +   L  L D    +R AL +LL           QL+   LL++L  YP D
Subjt:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD

Query:  ESDVLSVLFHMGQNHVDMDDSVIKDV
           +   L  +G  H  +  S++ ++
Subjt:  ESDVLSVLFHMGQNHVDMDDSVIKDV

Q8CIM8 Integrator complex subunit 45.9e-1827.62Show/hide
Query:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRVV-ITSGL----MLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNA
        D  P VR AA+  +  L    L+    I   Y +A +LL+D  + VRSAAV+++ + S L    ++   S   + +L D+ F  IC  + + +  VRV A
Subjt:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRVV-ITSGL----MLAAHSPERKQQLSDEIF--ICSTMRNMNMKVRVNA

Query:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL
           +  +E VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   +AL  L
Subjt:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL

Query:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD
           S  F  + L  L+D+ ND+   VRLQ++ T+  +   N++ L+E  +   L  L D    +R AL +LL            L+   LL++L  YP D
Subjt:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD

Query:  ESDVLSVLFHMGQNH
           +   L  +G  H
Subjt:  ESDVLSVLFHMGQNH

Q8VZA0 Protein SIEL2.1e-10832.16Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND
        ++ER   + +A++++DD  F S+C G  +S R+WLL NA+RF +  S+LFT+FLGF+KDPYPY+RK ALDGL  + N         +EGCY RA+ELL+D
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND

Query:  MEDCVRSAAVRVVITSGLMLAAHSPE--RKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML
         ED VRS+AVR V   G ++ A   E   ++  +D +F  +CS +R+M++ VRV  F A   +   S+ ++LQ++SK+VL   KGK      S    ++ 
Subjt:  MEDCVRSAAVRVVITSGLMLAAHSPE--RKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML

Query:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR
        +   AG ++HG EDEF +V      D+  +L+V S KF  EA+ LLMD+L DD + VRL+AL  LHH+    +LK+QE +M  FL A+ D   ++R   R
Subjt:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR

Query:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFE---------------QVIAYIVLAISAPVLDNHTL-RIPPR
         +LKLAK PDL       +G+L+SLE YPQDE D+LS LFH GQNH +   S++K   E               Q+ A + L ISAP+ +  ++  IPP 
Subjt:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFE---------------QVIAYIVLAISAPVLDNHTL-RIPPR

Query:  IFSYAATLLGRISHALGDIMDQRTVFAYLLQ-------------------NSKHIGLSDLGFNPEGTPCSPTPRSS------------------LNDIL-
         FSY+  +LG+ S  L D+MDQ  + AYL                     ++     +DL  NP   P    P  S                  +N IL 
Subjt:  IFSYAATLLGRISHALGDIMDQRTVFAYLLQ-------------------NSKHIGLSDLGFNPEGTPCSPTPRSS------------------LNDIL-

Query:  ----------------AIASLKT-----PAMIHEQRVLQGSIGSI---------------HKKNDIYLRTDVG---------------------------
                        A+ +L+        +  +  + +G++  I               H     ++ T                              
Subjt:  ----------------AIASLKT-----PAMIHEQRVLQGSIGSI---------------HKKNDIYLRTDVG---------------------------

Query:  --------------RLYTQNISLR----------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA
                      RLY   I  R                + +++C +PS F+ E ++SL    +      C      K+ K F       S  L+ ++A
Subjt:  --------------RLYTQNISLR----------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA

Query:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV
        E+ +  N    P+ FVPGLPV IPC+I L N P +  LW +I+ ++ T QF++LD          + FM+    + TP+A  F  R+ IG+EC FE+   
Subjt:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV

Query:  NERRGGPNRDLAYICKEKEVYLSMIHK
         ++R GP   +AY+CKE+E++LS++ +
Subjt:  NERRGGPNRDLAYICKEKEVYLSMIHK

Q96HW7 Integrator complex subunit 44.5e-1827.36Show/hide
Query:  FTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRVV-ITSGL----MLAAHSPERKQQLSDEIF--ICSTMRNMNMKVR
        +  D  P VR AA+  +  L    L+    I   Y +A +LL+D  + VRSAAV+++ + S L    ++   S   + +L D+ F  IC  + + +  VR
Subjt:  FTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRVV-ITSGL----MLAAHSPERKQQLSDEIF--ICSTMRNMNMKVR

Query:  VNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDAL
        V A   +  +E VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   +AL
Subjt:  VNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDAL

Query:  FNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESY
          L   S  F  + L  L+D+ ND+   VRLQ++ T+  +   N++ L+E  +   L  L D    +R AL +LL            L+   LL++L  Y
Subjt:  FNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESY

Query:  PQDESDVLSVLFHMGQNH
        P D   +   L  +G  H
Subjt:  PQDESDVLSVLFHMGQNH

Arabidopsis top hitse value%identityAlignment
AT3G08800.1 ARM repeat superfamily protein1.5e-10932.16Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND
        ++ER   + +A++++DD  F S+C G  +S R+WLL NA+RF +  S+LFT+FLGF+KDPYPY+RK ALDGL  + N         +EGCY RA+ELL+D
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND

Query:  MEDCVRSAAVRVVITSGLMLAAHSPE--RKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML
         ED VRS+AVR V   G ++ A   E   ++  +D +F  +CS +R+M++ VRV  F A   +   S+ ++LQ++SK+VL   KGK      S    ++ 
Subjt:  MEDCVRSAAVRVVITSGLMLAAHSPE--RKQQLSDEIF--ICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML

Query:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR
        +   AG ++HG EDEF +V      D+  +L+V S KF  EA+ LLMD+L DD + VRL+AL  LHH+    +LK+QE +M  FL A+ D   ++R   R
Subjt:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR

Query:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFE---------------QVIAYIVLAISAPVLDNHTL-RIPPR
         +LKLAK PDL       +G+L+SLE YPQDE D+LS LFH GQNH +   S++K   E               Q+ A + L ISAP+ +  ++  IPP 
Subjt:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFE---------------QVIAYIVLAISAPVLDNHTL-RIPPR

Query:  IFSYAATLLGRISHALGDIMDQRTVFAYLLQ-------------------NSKHIGLSDLGFNPEGTPCSPTPRSS------------------LNDIL-
         FSY+  +LG+ S  L D+MDQ  + AYL                     ++     +DL  NP   P    P  S                  +N IL 
Subjt:  IFSYAATLLGRISHALGDIMDQRTVFAYLLQ-------------------NSKHIGLSDLGFNPEGTPCSPTPRSS------------------LNDIL-

Query:  ----------------AIASLKT-----PAMIHEQRVLQGSIGSI---------------HKKNDIYLRTDVG---------------------------
                        A+ +L+        +  +  + +G++  I               H     ++ T                              
Subjt:  ----------------AIASLKT-----PAMIHEQRVLQGSIGSI---------------HKKNDIYLRTDVG---------------------------

Query:  --------------RLYTQNISLR----------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA
                      RLY   I  R                + +++C +PS F+ E ++SL    +      C      K+ K F       S  L+ ++A
Subjt:  --------------RLYTQNISLR----------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA

Query:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV
        E+ +  N    P+ FVPGLPV IPC+I L N P +  LW +I+ ++ T QF++LD          + FM+    + TP+A  F  R+ IG+EC FE+   
Subjt:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV

Query:  NERRGGPNRDLAYICKEKEVYLSMIHK
         ++R GP   +AY+CKE+E++LS++ +
Subjt:  NERRGGPNRDLAYICKEKEVYLSMIHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCGGGATCTAAAACTTGTTTCTGCCATTAACGAACTCGACGATCGGTCATTCCTCTCGCTTTGCTTTGGTCCTTTTGTGTCCATCAGGATTTGGCTTCTCAA
CAACGCCGAGAGGTTCCAAATAAGGCCATCGCTGTTATTCACTGTTTTCCTAGGGTTTACGAAGGATCCGTATCCATATGTTAGGAAAGCTGCTCTCGATGGCCTATCAG
GTTTGGGGAACACTGTTCTCGAGGACCGCAGCATGATTGAAGGTTGCTATTGCCGCGCTATTGAGCTTCTGAACGACATGGAGGATTGTGTTAGGTCAGCTGCTGTACGC
GTCGTCATCACTTCGGGTCTAATGCTTGCGGCACATAGTCCAGAGAGGAAACAACAGTTGTCTGATGAAATATTCATCTGTTCCACGATGAGAAATATGAACATGAAGGT
CAGGGTTAATGCATTTGATGCAATAAAGTGGCTGGAAATTGTTTCTCAGGATCTTCTTTTACAAAGCGTGTCCAAGAGAGTCTTGAGTATCTTCAAGGGGAAAAACTCTC
TTGTTCAATGCTCTACCGAACAATTGGAAATGTTGGCGTTGGATGTTGCCGGGGCTTTTGTGCATGGCGTAGAAGATGAATTCTGTCAGGTAACTTATGTTGGTACCTGC
GATGCTTTGTTTAATTTGACCGTCCTATCAACCAAGTTCACTGGCGAGGCCTTAAGCTTATTGATGGACATCCTGAATGATGATTCGGTTGCTGTCCGCTTGCAAGCTTT
GGCAACATTACATCATATGGGAATGTTCAATAGTTTGAAATTGCAAGAAGCGCATATGCACATGTTTCTCAGGGCGTTAAATGACATTGATGGTCATGTAAGATCTGCTT
TAAGGAAACTTCTTAAATTAGCGAAGCGGCCAGATTTGGTGACATTTCAATTGTCTTTTAATGGTCTTCTCGAAAGTTTAGAATCATACCCGCAGGATGAGTCTGATGTG
CTCTCCGTGCTGTTTCATATGGGTCAGAATCATGTAGATATGGACGACTCCGTTATCAAGGATGTTTTTGAACAGGTGATTGCATACATTGTTCTAGCTATTTCAGCTCC
CGTTTTGGATAATCATACTCTTAGAATTCCACCAAGAATATTTTCTTATGCAGCTACATTGCTGGGAAGGATCTCTCATGCTTTGGGCGACATTATGGATCAAAGAACAG
TTTTTGCTTACTTGCTGCAAAACAGTAAACACATTGGATTATCTGATCTGGGGTTTAATCCAGAGGGAACCCCATGCTCACCTACACCTAGAAGTTCTCTCAACGATATA
CTTGCCATTGCCTCCCTTAAGACACCTGCAATGATACACGAGCAGCGGGTTCTGCAAGGAAGCATTGGGAGTATTCACAAGAAGAACGACATATATCTTAGAACTGATGT
TGGTCGCTTATACACTCAGAACATCTCCTTAAGGAACCTTAAGGAAGAATGTATTGAGCCCTCAACTTTTATATGTGAAGTTCAAAGATCATTGCCGAACTTGTGCACCA
TTACTCCTAAACCTCCTTGCAGTTCACCTTATTTTAAAAAACTGCTCAAATCTTTCATCCTTGGCCATCTAGAAATTTCAGAAAAACTTAAGCACATCAATGCAGAACTA
GTCATTTCTGATAACGACTATGAGAAACCCCTGTATTTTGTCCCAGGACTACCCGTTGGTATTCCTTGCCAAATTATCCTACACAATGATCCTAGTGAGAGGAGGTTGTG
GTTTAAAATCACTATGGATAACATGACAAGTCAGTTTATCTTTTTGGATTTCCTTTCCTTGGGAAGTTGTGACGAGTTTAGAGAATTTATGTACATTGTTCCATTCCATA
GAACTCCAAAGGCTTCTTCTTTTATAGCGAGGATTTGTATAGGACTTGAATGTTGGTTTGAGAATGCTGAAGTTAATGAACGCCGTGGAGGTCCAAATCGTGACCTGGCC
TACATTTGTAAAGAGAAGGAAGTTTATCTCTCCATGATCCACAAAGTTATAAGCTTAAATACTGTTGCCGTGCGAGTGAATAACTGGTGCAAGATTCGGACAGATTCATT
GATGACTCCAGCTGCTGCATATCCTCGCGTCCTTTTCATGGGTTCCCCAATGATCCATGATTCCATACGTGGATCGTACACTTCAACGCTCGAAACCATGGAATGTCCAT
CGAATCCCCCAAGAGCATAA
mRNA sequenceShow/hide mRNA sequence
GCAGAGACTAGAGTTACTCTACCATCCCACTACACTGGAAGCGCCGTGCGAGAGAAAAAGAAAAGAAAAAGAGAGGGAAAATATATATATAAACCCACTCATGCAATTTT
GCCTCGCAGTCCGCCAGTCCCACATTTGCAGTCACTTTCCCGCCGAACCCCTCCCCGTCGGCTTCTGCAATTCCCTTCTTATCTTTCTCACTTTTCATCGACTGTCGTCA
TAGCAGTCCACAAAAATGGCATATTCTTCGCTTTATTTTGCTTGCCGACGTAACCAAGGACACCGCAATCAAATCTGTTCTCATCGTGCCTGAGTTGAGAGCCAAGCCAA
GATCCATTCCAGCATAAATGGCGGAGCGGGATCTAAAACTTGTTTCTGCCATTAACGAACTCGACGATCGGTCATTCCTCTCGCTTTGCTTTGGTCCTTTTGTGTCCATC
AGGATTTGGCTTCTCAACAACGCCGAGAGGTTCCAAATAAGGCCATCGCTGTTATTCACTGTTTTCCTAGGGTTTACGAAGGATCCGTATCCATATGTTAGGAAAGCTGC
TCTCGATGGCCTATCAGGTTTGGGGAACACTGTTCTCGAGGACCGCAGCATGATTGAAGGTTGCTATTGCCGCGCTATTGAGCTTCTGAACGACATGGAGGATTGTGTTA
GGTCAGCTGCTGTACGCGTCGTCATCACTTCGGGTCTAATGCTTGCGGCACATAGTCCAGAGAGGAAACAACAGTTGTCTGATGAAATATTCATCTGTTCCACGATGAGA
AATATGAACATGAAGGTCAGGGTTAATGCATTTGATGCAATAAAGTGGCTGGAAATTGTTTCTCAGGATCTTCTTTTACAAAGCGTGTCCAAGAGAGTCTTGAGTATCTT
CAAGGGGAAAAACTCTCTTGTTCAATGCTCTACCGAACAATTGGAAATGTTGGCGTTGGATGTTGCCGGGGCTTTTGTGCATGGCGTAGAAGATGAATTCTGTCAGGTAA
CTTATGTTGGTACCTGCGATGCTTTGTTTAATTTGACCGTCCTATCAACCAAGTTCACTGGCGAGGCCTTAAGCTTATTGATGGACATCCTGAATGATGATTCGGTTGCT
GTCCGCTTGCAAGCTTTGGCAACATTACATCATATGGGAATGTTCAATAGTTTGAAATTGCAAGAAGCGCATATGCACATGTTTCTCAGGGCGTTAAATGACATTGATGG
TCATGTAAGATCTGCTTTAAGGAAACTTCTTAAATTAGCGAAGCGGCCAGATTTGGTGACATTTCAATTGTCTTTTAATGGTCTTCTCGAAAGTTTAGAATCATACCCGC
AGGATGAGTCTGATGTGCTCTCCGTGCTGTTTCATATGGGTCAGAATCATGTAGATATGGACGACTCCGTTATCAAGGATGTTTTTGAACAGGTGATTGCATACATTGTT
CTAGCTATTTCAGCTCCCGTTTTGGATAATCATACTCTTAGAATTCCACCAAGAATATTTTCTTATGCAGCTACATTGCTGGGAAGGATCTCTCATGCTTTGGGCGACAT
TATGGATCAAAGAACAGTTTTTGCTTACTTGCTGCAAAACAGTAAACACATTGGATTATCTGATCTGGGGTTTAATCCAGAGGGAACCCCATGCTCACCTACACCTAGAA
GTTCTCTCAACGATATACTTGCCATTGCCTCCCTTAAGACACCTGCAATGATACACGAGCAGCGGGTTCTGCAAGGAAGCATTGGGAGTATTCACAAGAAGAACGACATA
TATCTTAGAACTGATGTTGGTCGCTTATACACTCAGAACATCTCCTTAAGGAACCTTAAGGAAGAATGTATTGAGCCCTCAACTTTTATATGTGAAGTTCAAAGATCATT
GCCGAACTTGTGCACCATTACTCCTAAACCTCCTTGCAGTTCACCTTATTTTAAAAAACTGCTCAAATCTTTCATCCTTGGCCATCTAGAAATTTCAGAAAAACTTAAGC
ACATCAATGCAGAACTAGTCATTTCTGATAACGACTATGAGAAACCCCTGTATTTTGTCCCAGGACTACCCGTTGGTATTCCTTGCCAAATTATCCTACACAATGATCCT
AGTGAGAGGAGGTTGTGGTTTAAAATCACTATGGATAACATGACAAGTCAGTTTATCTTTTTGGATTTCCTTTCCTTGGGAAGTTGTGACGAGTTTAGAGAATTTATGTA
CATTGTTCCATTCCATAGAACTCCAAAGGCTTCTTCTTTTATAGCGAGGATTTGTATAGGACTTGAATGTTGGTTTGAGAATGCTGAAGTTAATGAACGCCGTGGAGGTC
CAAATCGTGACCTGGCCTACATTTGTAAAGAGAAGGAAGTTTATCTCTCCATGATCCACAAAGTTATAAGCTTAAATACTGTTGCCGTGCGAGTGAATAACTGGTGCAAG
ATTCGGACAGATTCATTGATGACTCCAGCTGCTGCATATCCTCGCGTCCTTTTCATGGGTTCCCCAATGATCCATGATTCCATACGTGGATCGTACACTTCAACGCTCGA
AACCATGGAATGTCCATCGAATCCCCCAAGAGCATAA
Protein sequenceShow/hide protein sequence
MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVR
VVITSGLMLAAHSPERKQQLSDEIFICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTC
DALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDV
LSVLFHMGQNHVDMDDSVIKDVFEQVIAYIVLAISAPVLDNHTLRIPPRIFSYAATLLGRISHALGDIMDQRTVFAYLLQNSKHIGLSDLGFNPEGTPCSPTPRSSLNDI
LAIASLKTPAMIHEQRVLQGSIGSIHKKNDIYLRTDVGRLYTQNISLRNLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAEL
VISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA
YICKEKEVYLSMIHKVISLNTVAVRVNNWCKIRTDSLMTPAAAYPRVLFMGSPMIHDSIRGSYTSTLETMECPSNPPRA