; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020438 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020438
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein SIEL
Genome locationChr04:31858954..31864556
RNA-Seq ExpressionHG10020438
SyntenyHG10020438
Gene Ontology termsNA
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017202.1 Protein SIEL [Cucurbita argyrosperma subsp. argyrosperma]9.8e-20653.63Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRV+                          CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ                                           
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------

Query:  ------------------------------NISLRN----------------------------------------------------------------
                                      N  L +                                                                
Subjt:  ------------------------------NISLRN----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
                                   LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEK LY
Subjt:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

XP_004147305.1 protein SIEL [Cucumis sativus]4.4e-21453.92Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE DL+L+S INE+DD+SFLSLCFGP VS R WLLNNAE+FQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGN V ED SMIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAA+RV+                          CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCST+QLE+LAL+
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM M N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH++M D +IKDV EQ                              N +LR        
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEEC EPSTF+CEVQRSL NL TITPK  CSS   +++LKSF LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDN+TSQF+FLDFLSLG CDE REFMY VPF+RTPKASSFIARICIGLECWFENAEVNERRGGP  DLA
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA

Query:  YICKEKEVYLSMIHKG
        YICKEKEVYLSMIHKG
Subjt:  YICKEKEVYLSMIHKG

XP_008463329.1 PREDICTED: protein SIEL isoform X1 [Cucumis melo]2.2e-21053.61Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RV+                          CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ                              N +LR        
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHKG
        AYICKEKEVYLSMI KG
Subjt:  AYICKEKEVYLSMIHKG

XP_023527352.1 protein SIEL isoform X1 [Cucurbita pepo subsp. pepo]2.6e-20653.75Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRV+                          CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ                                           
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------------NI
                                                                                                           +
Subjt:  --------------------------------------------------------------------------------------------------NI

Query:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
         L N                       LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEKPLY
Subjt:  SLRN-----------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

XP_038894154.1 protein SIEL [Benincasa hispida]5.0e-21855.02Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        M ERDL+LVSAINELDD+SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGL GLGNTVLED SMIEGCYCR+IELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAV V+                          CS  R+MNMKVRVNAFDA+K LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLEM ALD
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNLT+LSTKF GEALSLLMD+LNDDSV+VRL+AL TLHHM M N LKLQEAHMHMFL ALND DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------
        K+AK PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ                                           
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------

Query:  ------------------------------NISLRN----------------------------------------------------------------
                                      +I L +                                                                
Subjt:  ------------------------------NISLRN----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEEC EPSTF+CEVQRSL NL  ITPK  CSSP F+KLLKSF L HLEISE L+H+ AELV+ DNDYEKP
Subjt:  -----------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA
        LYFVPGLPVGIPCQIILHN  S+R+LWF+ITMDNMTSQFIFLDFLSLG CDE REF YIVPF+RTPKASSFIARICIGLECWFEN EVNER GGP RDLA
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLA

Query:  YICKEKEVYLSMIHKG
        YICKEKEVY SMIHKG
Subjt:  YICKEKEVYLSMIHKG

TrEMBL top hitse value%identityAlignment
A0A1S3CIY4 protein SIEL isoform X21.2e-20153.75Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RV+                          CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQ-----------------------------------------------------------DESDVLSVLFH---
        KL K PDLVTFQLSFNGLLESLESYPQ                                                           D+S + + L H   
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQ-----------------------------------------------------------DESDVLSVLFH---

Query:  -----------------------------------------MGQNHVDMDDSV------IKDVF------------------------------------
                                                  GQ   D  +S+      ++D++                                    
Subjt:  -----------------------------------------MGQNHVDMDDSV------IKDVF------------------------------------

Query:  ----------------------------------------------------------EQNI----------------------SLRN-----------L
                                                                  EQ+I                      SLR            L
Subjt:  ----------------------------------------------------------EQNI----------------------SLRN-----------L

Query:  KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKI
        KEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKPLYFVPGLPVGIPCQIILHN PSER+LWF+I
Subjt:  KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKI

Query:  TMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDLAYICKEKEVYLSMIHKG
        TMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DLAYICKEKEVYLSMI KG
Subjt:  TMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDLAYICKEKEVYLSMIHKG

A0A1S3CKJ8 protein SIEL isoform X11.1e-21053.61Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RV+                          CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ                              N +LR        
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHKG
        AYICKEKEVYLSMI KG
Subjt:  AYICKEKEVYLSMIHKG

A0A5A7UEC0 Protein SIEL isoform X11.1e-21053.61Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAE+DL+L+S +NE+D++SFLSLCFGP VSIR WLLNNAERFQ+RPSLLFTVFLGFTKDPYPYVRKAALDGLS LGNTV ED  MIEGCYCRAIELLNDM
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        ED VRSAA+RV+                          CS  R+MNMKVRVNAFDAI+ LEIVS+DLLLQSVSKRVLSIFKGK SLVQCSTEQLE+LAL+
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHG+EDEF QV     CDA+FNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM   N LKLQEAHMHMFL AL D DGHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------
        KL K PDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHV+M DS+IKDVFEQ                              N +LR        
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ------------------------------NISLR--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP
                                     LKEE  EPSTF+CEVQRSL NL TITPK  C+S   +++LK F LGHLEISE+LKHI AELVISDN+YEKP
Subjt:  ----------------------------NLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKP

Query:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL
        LYFVPGLPVGIPCQIILHN PSER+LWF+ITMDNMTSQFIFLDFLSLG CDE REFMY VPF+RTPKASSFIA+ICIGLECWFENAEVN ERRGGP  DL
Subjt:  LYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVN-ERRGGPNRDL

Query:  AYICKEKEVYLSMIHKG
        AYICKEKEVYLSMI KG
Subjt:  AYICKEKEVYLSMIHKG

A0A6J1F7A9 protein SIEL4.7e-20653.63Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAERFQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLGNTV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRV+                          CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------
        KLAK PDLVTFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ                                           
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------

Query:  ------------------------------NISLRN----------------------------------------------------------------
                                      N  L +                                                                
Subjt:  ------------------------------NISLRN----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
                                   LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEISEKLKH+ AELVI DNDYEK LY
Subjt:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN PSER+LWF+ITMDN TSQFIFLDFLSL G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+AEVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

A0A6J1J0A1 protein SIEL4.1e-20253.26Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM
        MAERD +LVSAINELDDRSFLSLCFGP VSIRIWLL NAE FQIRPSLL TVFLGFTKDPYPYVRKAALDGL+GLG TV+ED SMIE CY RAIELLND+
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDM

Query:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD
        EDCVRSAAVRV+                          CS  R+M+M+VR NAF AIK LEIVS+DLLLQS+SKRVLSIFKGK SLVQC TEQLEMLALD
Subjt:  EDCVRSAAVRVI--------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALD

Query:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL
        VAGAFVHGVEDEF QV     CDALFNL +LSTKF+GEALSLLMD+LNDDSV+VRLQAL TLHHM + N L+LQEAHMHMFL AL+D +GHVRSALRKLL
Subjt:  VAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLL

Query:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------
        KLAK PDL TFQLSFNGL+ESLESYPQDESDVLSVLFHMGQNHV+M  S+I DVFEQ                                           
Subjt:  KLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFEQ-------------------------------------------

Query:  ------NIS-------------------------------------------------------------------------------------------
               IS                                                                                           
Subjt:  ------NIS-------------------------------------------------------------------------------------------

Query:  -----LRN--------------------------------------------------------------------------------------------
             LRN                                                                                            
Subjt:  -----LRN--------------------------------------------------------------------------------------------

Query:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY
                                   LKEECIEPSTF+CEVQRSL  L  ITPK  C S  F+KLLK+F L HLEIS+KLKH+ AELVI DNDYEKPLY
Subjt:  ---------------------------LKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLY

Query:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY
        FVPGLPVGI CQIILHN  SER+LWF+ITMDN TSQFIFLDFL L G CDE REF Y VPF+RTPKASSFIARICIGLECWFE+ EVNERRGGP RDLAY
Subjt:  FVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSL-GSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAY

Query:  ICKEKEVYLSMIH
        ICKEKEVYLSMIH
Subjt:  ICKEKEVYLSMIH

SwissProt top hitse value%identityAlignment
Q54LH5 Integrator complex subunit 4 homolog3.2e-1028.1Show/hide
Query:  TEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDG
        ++ L +L   V GAF+ G+EDEF +V      D++  L+V + +F  + +  L+DI ND+  +VR+ ++ +L  +G  N++ ++E  +H+ L  L     
Subjt:  TEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDG

Query:  HVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH
          R +L +LL      +      +   LL +L  YP D   +   L  +GQ +
Subjt:  HVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNH

Q68F70 Integrator complex subunit 42.6e-1223.93Show/hide
Query:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVRVNA
        D  P VR AA+  +  L    L+   + +  Y +A +LL D  + VRSAAV +                               +C  + + +  VRV A
Subjt:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVRVNA

Query:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL
           +  +  VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   ++L  L
Subjt:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL

Query:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD
           S  F  + L  L+D+ ND+   VRLQ++ T+  +   +++ L+E  +   L  L D    +R AL +LL           QL+   LL++L  YP D
Subjt:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD

Query:  ESDVLSVLFHMGQNHVDMDDSVIKDV
           +   L  +G  H  +  S++ ++
Subjt:  ESDVLSVLFHMGQNHVDMDDSVIKDV

Q8CIM8 Integrator complex subunit 42.8e-1425.71Show/hide
Query:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVRVNA
        D  P VR AA+  +  L    L+    I   Y +A +LL+D  + VRSAAV++                               IC  + + +  VRV A
Subjt:  DPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVRVNA

Query:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL
           +  +E VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   +AL  L
Subjt:  FDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNL

Query:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD
           S  F  + L  L+D+ ND+   VRLQ++ T+  +   N++ L+E  +   L  L D    +R AL +LL            L+   LL++L  YP D
Subjt:  TVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQD

Query:  ESDVLSVLFHMGQNH
           +   L  +G  H
Subjt:  ESDVLSVLFHMGQNH

Q8VZA0 Protein SIEL1.2e-7626.36Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND
        ++ER   + +A++++DD  F S+C G  +S R+WLL NA+RF +  S+LFT+FLGF+KDPYPY+RK ALDGL  + N         +EGCY RA+ELL+D
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND

Query:  MEDCVRSAAVRVI----------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML
         ED VRS+AVR +                            CS +R+M++ VRV  F A   +   S+ ++LQ++SK+VL   KGK      S    ++ 
Subjt:  MEDCVRSAAVRVI----------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML

Query:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR
        +   AG ++HG EDEF +V      D+  +L+V S KF  EA+ LLMD+L DD + VRL+AL  LHH+    +LK+QE +M  FL A+ D   ++R   R
Subjt:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR

Query:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIK---------------------------------------------
         +LKLAK PDL       +G+L+SLE YPQDE D+LS LFH GQNH +   S++K                                             
Subjt:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIK---------------------------------------------

Query:  -----------------DVFEQNISLRNL-----------------------------------------------------------------------
                         D+ +Q++ L  L                                                                       
Subjt:  -----------------DVFEQNISLRNL-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA
                                                  +++C +PS F+ E ++SL    +      C      K+ K F       S  L+ ++A
Subjt:  ------------------------------------------KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA

Query:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV
        E+ +  N    P+ FVPGLPV IPC+I L N P +  LW +I+ ++ T QF++LD          + FM+    + TP+A  F  R+ IG+EC FE+   
Subjt:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV

Query:  NERRGGPNRDLAYICKEKEVYLSMIHK
         ++R GP   +AY+CKE+E++LS++ +
Subjt:  NERRGGPNRDLAYICKEKEVYLSMIHK

Q96HW7 Integrator complex subunit 41.7e-1425.47Show/hide
Query:  FTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVR
        +  D  P VR AA+  +  L    L+    I   Y +A +LL+D  + VRSAAV++                               IC  + + +  VR
Subjt:  FTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVRV-------------------------------ICSTMRNMNMKVR

Query:  VNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDAL
        V A   +  +E VS   L Q++ K+++S  + K +                           +  T  + ++     GAFVHG+EDE  +V  +   +AL
Subjt:  VNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSL-------------------------VQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDAL

Query:  FNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESY
          L   S  F  + L  L+D+ ND+   VRLQ++ T+  +   N++ L+E  +   L  L D    +R AL +LL            L+   LL++L  Y
Subjt:  FNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESY

Query:  PQDESDVLSVLFHMGQNH
        P D   +   L  +G  H
Subjt:  PQDESDVLSVLFHMGQNH

Arabidopsis top hitse value%identityAlignment
AT3G08800.1 ARM repeat superfamily protein8.3e-7826.36Show/hide
Query:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND
        ++ER   + +A++++DD  F S+C G  +S R+WLL NA+RF +  S+LFT+FLGF+KDPYPY+RK ALDGL  + N         +EGCY RA+ELL+D
Subjt:  MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNT-VLEDRSMIEGCYCRAIELLND

Query:  MEDCVRSAAVRVI----------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML
         ED VRS+AVR +                            CS +R+M++ VRV  F A   +   S+ ++LQ++SK+VL   KGK      S    ++ 
Subjt:  MEDCVRSAAVRVI----------------------------CSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEML

Query:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR
        +   AG ++HG EDEF +V      D+  +L+V S KF  EA+ LLMD+L DD + VRL+AL  LHH+    +LK+QE +M  FL A+ D   ++R   R
Subjt:  ALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDILNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALR

Query:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIK---------------------------------------------
         +LKLAK PDL       +G+L+SLE YPQDE D+LS LFH GQNH +   S++K                                             
Subjt:  KLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIK---------------------------------------------

Query:  -----------------DVFEQNISLRNL-----------------------------------------------------------------------
                         D+ +Q++ L  L                                                                       
Subjt:  -----------------DVFEQNISLRNL-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA
                                                  +++C +PS F+ E ++SL    +      C      K+ K F       S  L+ ++A
Subjt:  ------------------------------------------KEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINA

Query:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV
        E+ +  N    P+ FVPGLPV IPC+I L N P +  LW +I+ ++ T QF++LD          + FM+    + TP+A  F  R+ IG+EC FE+   
Subjt:  ELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITMDNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEV

Query:  NERRGGPNRDLAYICKEKEVYLSMIHK
         ++R GP   +AY+CKE+E++LS++ +
Subjt:  NERRGGPNRDLAYICKEKEVYLSMIHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCGGGATCTAAAACTTGTTTCTGCCATTAACGAACTCGACGATCGGTCATTCCTCTCGCTTTGCTTTGGTCCTTTTGTGTCCATCAGGATTTGGCTTCTCAA
CAACGCCGAGAGGTTCCAAATAAGGCCATCGCTGTTATTCACTGTTTTCCTAGGGTTTACGAAGGATCCGTATCCATATGTTAGGAAAGCTGCTCTCGATGGCCTATCAG
GTTTGGGGAACACTGTTCTCGAGGACCGCAGCATGATTGAAGGTTGCTATTGCCGCGCTATTGAGCTTCTGAACGACATGGAGGATTGTGTTAGGTCAGCTGCTGTACGC
GTCATCTGTTCCACGATGAGAAATATGAACATGAAGGTCAGGGTTAATGCATTTGATGCAATAAAGTGGCTGGAAATTGTTTCTCAGGATCTTCTTTTACAAAGCGTGTC
CAAGAGAGTCTTGAGTATCTTCAAGGGGAAAAACTCTCTTGTTCAATGCTCTACCGAACAATTGGAAATGTTGGCGTTGGATGTTGCCGGGGCTTTTGTGCATGGCGTAG
AAGATGAATTCTGTCAGGTAACTTATGTTGGTACCTGCGATGCTTTGTTTAATTTGACCGTCCTATCAACCAAGTTCACTGGCGAGGCCTTAAGCTTATTGATGGACATC
CTGAATGATGATTCGGTTGCTGTCCGCTTGCAAGCTTTGGCAACATTACATCATATGGGAATGTTCAATAGTTTGAAATTGCAAGAAGCGCATATGCACATGTTTCTCAG
GGCGTTAAATGACATTGATGGTCATGTAAGATCTGCTTTAAGGAAACTTCTTAAATTAGCGAAGCGGCCAGATTTGGTGACATTTCAATTGTCTTTTAATGGTCTTCTCG
AAAGTTTAGAATCATACCCGCAGGATGAGTCTGATGTGCTCTCCGTGCTGTTTCATATGGGTCAGAATCATGTAGATATGGACGACTCCGTTATCAAGGATGTTTTTGAA
CAGAACATCTCCTTAAGGAACCTTAAGGAAGAATGTATTGAGCCCTCAACTTTTATATGTGAAGTTCAAAGATCATTGCCGAACTTGTGCACCATTACTCCTAAACCTCC
TTGCAGTTCACCTTATTTTAAAAAACTGCTCAAATCTTTCATCCTTGGCCATCTAGAAATTTCAGAAAAACTTAAGCACATCAATGCAGAACTAGTCATTTCTGATAACG
ACTATGAGAAACCCCTGTATTTTGTCCCAGGACTACCCGTTGGTATTCCTTGCCAAATTATCCTACACAATGATCCTAGTGAGAGGAGGTTGTGGTTTAAAATCACTATG
GATAACATGACAAGTCAGTTTATCTTTTTGGATTTCCTTTCCTTGGGAAGTTGTGACGAGTTTAGAGAATTTATGTACATTGTTCCATTCCATAGAACTCCAAAGGCTTC
TTCTTTTATAGCGAGGATTTGTATAGGACTTGAATGTTGGTTTGAGAATGCTGAAGTTAATGAACGCCGTGGAGGTCCAAATCGTGACCTGGCCTACATTTGTAAAGAGA
AGGAAGTTTATCTCTCCATGATCCACAAAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGCGGGATCTAAAACTTGTTTCTGCCATTAACGAACTCGACGATCGGTCATTCCTCTCGCTTTGCTTTGGTCCTTTTGTGTCCATCAGGATTTGGCTTCTCAA
CAACGCCGAGAGGTTCCAAATAAGGCCATCGCTGTTATTCACTGTTTTCCTAGGGTTTACGAAGGATCCGTATCCATATGTTAGGAAAGCTGCTCTCGATGGCCTATCAG
GTTTGGGGAACACTGTTCTCGAGGACCGCAGCATGATTGAAGGTTGCTATTGCCGCGCTATTGAGCTTCTGAACGACATGGAGGATTGTGTTAGGTCAGCTGCTGTACGC
GTCATCTGTTCCACGATGAGAAATATGAACATGAAGGTCAGGGTTAATGCATTTGATGCAATAAAGTGGCTGGAAATTGTTTCTCAGGATCTTCTTTTACAAAGCGTGTC
CAAGAGAGTCTTGAGTATCTTCAAGGGGAAAAACTCTCTTGTTCAATGCTCTACCGAACAATTGGAAATGTTGGCGTTGGATGTTGCCGGGGCTTTTGTGCATGGCGTAG
AAGATGAATTCTGTCAGGTAACTTATGTTGGTACCTGCGATGCTTTGTTTAATTTGACCGTCCTATCAACCAAGTTCACTGGCGAGGCCTTAAGCTTATTGATGGACATC
CTGAATGATGATTCGGTTGCTGTCCGCTTGCAAGCTTTGGCAACATTACATCATATGGGAATGTTCAATAGTTTGAAATTGCAAGAAGCGCATATGCACATGTTTCTCAG
GGCGTTAAATGACATTGATGGTCATGTAAGATCTGCTTTAAGGAAACTTCTTAAATTAGCGAAGCGGCCAGATTTGGTGACATTTCAATTGTCTTTTAATGGTCTTCTCG
AAAGTTTAGAATCATACCCGCAGGATGAGTCTGATGTGCTCTCCGTGCTGTTTCATATGGGTCAGAATCATGTAGATATGGACGACTCCGTTATCAAGGATGTTTTTGAA
CAGAACATCTCCTTAAGGAACCTTAAGGAAGAATGTATTGAGCCCTCAACTTTTATATGTGAAGTTCAAAGATCATTGCCGAACTTGTGCACCATTACTCCTAAACCTCC
TTGCAGTTCACCTTATTTTAAAAAACTGCTCAAATCTTTCATCCTTGGCCATCTAGAAATTTCAGAAAAACTTAAGCACATCAATGCAGAACTAGTCATTTCTGATAACG
ACTATGAGAAACCCCTGTATTTTGTCCCAGGACTACCCGTTGGTATTCCTTGCCAAATTATCCTACACAATGATCCTAGTGAGAGGAGGTTGTGGTTTAAAATCACTATG
GATAACATGACAAGTCAGTTTATCTTTTTGGATTTCCTTTCCTTGGGAAGTTGTGACGAGTTTAGAGAATTTATGTACATTGTTCCATTCCATAGAACTCCAAAGGCTTC
TTCTTTTATAGCGAGGATTTGTATAGGACTTGAATGTTGGTTTGAGAATGCTGAAGTTAATGAACGCCGTGGAGGTCCAAATCGTGACCTGGCCTACATTTGTAAAGAGA
AGGAAGTTTATCTCTCCATGATCCACAAAGGTTGA
Protein sequenceShow/hide protein sequence
MAERDLKLVSAINELDDRSFLSLCFGPFVSIRIWLLNNAERFQIRPSLLFTVFLGFTKDPYPYVRKAALDGLSGLGNTVLEDRSMIEGCYCRAIELLNDMEDCVRSAAVR
VICSTMRNMNMKVRVNAFDAIKWLEIVSQDLLLQSVSKRVLSIFKGKNSLVQCSTEQLEMLALDVAGAFVHGVEDEFCQVTYVGTCDALFNLTVLSTKFTGEALSLLMDI
LNDDSVAVRLQALATLHHMGMFNSLKLQEAHMHMFLRALNDIDGHVRSALRKLLKLAKRPDLVTFQLSFNGLLESLESYPQDESDVLSVLFHMGQNHVDMDDSVIKDVFE
QNISLRNLKEECIEPSTFICEVQRSLPNLCTITPKPPCSSPYFKKLLKSFILGHLEISEKLKHINAELVISDNDYEKPLYFVPGLPVGIPCQIILHNDPSERRLWFKITM
DNMTSQFIFLDFLSLGSCDEFREFMYIVPFHRTPKASSFIARICIGLECWFENAEVNERRGGPNRDLAYICKEKEVYLSMIHKG