; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003921 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003921
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTol-Pal system protein TolB
Genome locationChr08:11644029..11647950
RNA-Seq ExpressionHG10003921
SyntenyHG10003921
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011042 - Six-bladed beta-propeller, TolB-like
IPR011659 - WD40-like Beta Propeller


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055490.1 DPP6 N-terminal domain-like protein [Cucumis melo var. makuwa]0.0e+0087.89Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFDI+TLP D N+ PS+ DE  ITDGQSVNFNGYFPSSSS  SLISLLTNQS SF PDLELVYVTER+GIS+IFYDA+FGG G+S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS+LEIPHRLQIPLLD+EQ+ EVRVS KDRPSLSGDYL+YVSTHEDP E RTSWAAVYSR+LKSG+TRRLTPYGIADFSPS+SPSGIWTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTDLYIFLTRDGS RVKVVEHGGWPCWAD+STLYFHRRGDDQWLSIY+AILPSHGEI  DSV IERLTPPGLHVFTPATSPANKNLIAVATR
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFN+VTGEFK+LT+ VSPN+HH NPF+SADGT IGYHKCRGD NGRKSN LFFENVRSPVSNLSLFRIA SFPSFSP GDR+AY +FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVINRDGSN+REVFSGAAFSTAWDPVR+GVVYTSAGPDFA VS+ VDIISVNVD++E+N KKLTTNG NNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGESKGLR LTEG WTDTMCSWSPDGDWIAFSSDR+NPG+GSFDLFLIHPNGTGLRKLFQSG  GRANHPNW PDGK LVFTTDNAGISAEP+SNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEIY IKLDGS+LQRLTHNSYEDGTPTWSPRYI+P+NVE LYDVEPCGFEDCHWLNQ  +A NDIRP+LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

XP_008466749.1 PREDICTED: uncharacterized protein LOC103504088 [Cucumis melo]0.0e+0087.89Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFDI+TLP D N+ PS+ DE  ITDGQSVNFNGYFPSSSS  SLISLLTNQS SF PDLELVYVTER+GIS+IFYDA+FGG G+S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS+LEIPHRLQIPLLD+EQ+ EVRVS KDRPSLSGDYL+YVSTHEDP E RTSWAAVYSR+LKSG+TRRLTPYGIADFSPS+SPSGIWTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTDLYIFLTRDGS RVKVVEHGGWPCWAD+STLYFHRRGDDQWLSIY+AILPSHGEI  DSV IERLTPPGLHVFTPATSPANKNLIAVATR
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFN+VTGEFK+LT+ VSPN+HH NPF+SADGT IGYHKCRGD NGRKSN LFFENVRSPVSNLSLFRIA SFPSFSP GDR+AY +FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVINRDGSN+REVFSGAAFSTAWDPVR+GVVYTSAGPDFA VS+ VDIISVNVD++E+N KKLTTNG NNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGESKGLR LTEG WTDTMCSWSPDGDWIAFSSDR+NPG+GSFDLFLIHPNGTGLRKLFQSG  GRANHPNW PDGK LVFTTDNAGISAEP+SNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEIY IKLDGS+LQRLTHNSYEDGTPTWSPRYI+P+NVE LYDVEPCGFEDCHWLNQ  +A NDIRP+LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

XP_022993858.1 uncharacterized protein LOC111489739 [Cucurbita maxima]0.0e+0087.3Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFD+FTLP D N  PS  DET ITDG+SVNFNGYFPSSSSS S+ISLLTNQSQS RPDLELVYVTER+GIS+IFYDAI+GG G S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS LEIPHRLQIPLLDDEQ+ EVRVS KDRPSLSGDYL+YVSTHEDPGE RTSWAAVYSRSL+SG TRRLTPYGIADFSPS+SPSG+WTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTD+YIFLTRDG+ RVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIY+AILPS GEIS DSV IERLTPPGLHVFTPATSPANKNLIAVA+R
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFNLVTGEFKELTK VSPNSHH NPFISADGT IGYHKCRGD NGRKSN LF ENVRSPVSNLSLFRI GSFPSFSP GDR+A+V+FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVI+RDGSN+++V+ GAAFSTAWDPVREGVVYTSAGPDFA +S+ VDIISVNVDDDE N KKLT NG NNAFPSPSPDGKWIVFRSG++GYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGE+K L  LTEGQWTDTMCSWSPDGDWIAF+SDR NPG+GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK LVFTTDNAGISAEPISNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEI+ IK+DGS+LQRLTHNSYEDGTPTWSPRYISP+NV+  YDVEPCGFEDCHWLNQK KA N+I+P LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

XP_023550097.1 uncharacterized protein LOC111808393 [Cucurbita pepo subsp. pepo]0.0e+0087.72Show/hide
Query:  NSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVR
        NSIVFTTLGRS YAFD+FTLP D N  PS  DET ITDG+SVNFNGYFPSSSSS S++SLLTNQSQS RPDLELVYVTER+GIS+IFYDAI+GG G S R
Subjt:  NSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVR

Query:  RRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKGW
        RRS LEIPHRLQIPLLDDEQ+ EVRVS KDRPSLSGDYL+YVSTHEDPGE RTSWAAVYSRSL+SG TRRLTPYGIADFSPS+SPSG+WTAVASYGEKGW
Subjt:  RRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKGW

Query:  AGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATRR
        AGEVEELSTD+YIFLTRDG+ RVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIY+AILPS GEIS DSV IERLTPPGLHVFTPATSPANKNLIAVATRR
Subjt:  AGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATRR

Query:  PDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGLY
        PDSSFRHIELFNLVTGEFKELTKAVSPNSHH NPFISADGT IGYHKCRGD NGRKSN LF ENVRSPVSNLSLFRI GSFPSFSP GDR+A+V+FPGLY
Subjt:  PDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGLY

Query:  VINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVD
        VINRDGSN+++V+ GAAFSTAWDPVREGVVYTSAGPDFA +S+ VDIISVNVDDDE + KKLT NG NNAFPSPSPDGKWIVFRSG++GYKNLYIMDAV+
Subjt:  VINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVD

Query:  GESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQP
        GESK L  LTEGQWTDTMCSWSPDGDWIAF+SDR NPG+GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK LVFTTDNAGISAEPISNPHHYQP
Subjt:  GESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQP

Query:  YGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        YGEIY IK+DGS+LQRLTHNSYEDGTPTWSPRYISP+NV+  YDVEPCGFEDCHWLNQK KA N+I+P LTGPRCS
Subjt:  YGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

XP_038884611.1 uncharacterized protein LOC120075362 [Benincasa hispida]0.0e+0089.17Show/hide
Query:  GGDIISGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFG
        GGD  SG+SIVFTTLGRS Y FDIFTLP DGNSYPST  E Q+TDGQSVNFNG+FPS+S SPSLISLLTNQSQ  RPDLE+VYVTER+GISKIFYD IFG
Subjt:  GGDIISGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFG

Query:  GIGISVRRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVA
        G GIS RRRS+LEIPHRLQI LLDDEQE EVRVS KDRPSLSGDYL+YVSTH+DPG+ RTSWAAVYS++LKSGITRRLTPYG+ADFSPS+SPSGIWTAVA
Subjt:  GIGISVRRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVA

Query:  SYGEKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNL
        SYGE GWAGEVEELSTDLYIFLTRDGS RVKVVEHGGWPCW DDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSP NKNL
Subjt:  SYGEKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNL

Query:  IAVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAY
        IAVATRRPDSSFRHIELFNLVTGEFKELTK VSPNSHH NPFISADGT IGYHKCRG+ +GRKSNPLFFENVRSPVSNLSLFR  GSFPSFSP GDR+AY
Subjt:  IAVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAY

Query:  VDFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNL
         +FPGLYVIN DGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFA VS+ VDIIS+NVDDDE+N KKLTT G NNAFPSPSPDGKWIVFRSG+TGYKNL
Subjt:  VDFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNL

Query:  YIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPIS
        YIMDA+DGESKGL  LTEGQWTDTMCSWSPDGDWIAFSSDRDNPG GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSP GK LVFTTDNAGISAEPIS
Subjt:  YIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPIS

Query:  NPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        NPHHYQPYGEIY IK+DGS+L RLTHNSYEDGTPTWSPRYISP+NVE LYD EPCGFEDCHWL QKKKA NDIRP+LTGPRCS
Subjt:  NPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

TrEMBL top hitse value%identityAlignment
A0A0A0KH20 Uncharacterized protein0.0e+0086.47Show/hide
Query:  IISGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIG
        +  G+SIVFTTLGRS YAFDI+TLP D N  PS  DE  ITDGQ VNFNGYFPSS+SS SLISLLTNQS SF PD ELVYVTER+GIS IFYDA+FGGIG
Subjt:  IISGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIG

Query:  ISVRRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYG
        +S RRRS+LEIPHRLQIPLLD+EQ+ E RVS KDRPSLSGDYL+YVSTHEDP E RTSWAAVYSR+LKSG+TRRLTPYGIADFSPS+SPSGIWTAVASYG
Subjt:  ISVRRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYG

Query:  EKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAV
        EKGWAG+VEELSTDLYIFLTRDGS RVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIY+AILPSHGEIS DSV IERLTPPGLHVFTPATS ANKNLIAV
Subjt:  EKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAV

Query:  ATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDF
        ATRRPDSSFRHIELFN+VTGEFKELTK VSPNSHH NPF+SADGT IGYHKCRGD N RKSN L FE VRSPVSNLSLFRIA SFPSFSP GDR+AY +F
Subjt:  ATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDF

Query:  PGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIM
        PGLYVI RDGSN+REVFSGAAFSTAWDPVR+GVVYTSAGPDFA VS+ VDIISVNVD++E+N KKLTTNG NNAFPSPSPDGKWIVFRSGQTGYKNLYIM
Subjt:  PGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIM

Query:  DAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPH
        DAV+GESKGLR LTEGQWTDTMCSWSP+GDWIAFSSDR+NPG GSFDLFLIHPNGTGLRKLFQSG  GRANHPNW PDGK LVFTTDNAGIS EP+SNPH
Subjt:  DAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPH

Query:  HYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        HYQPYGEIY IKLDGS+LQRLTHNSYEDGTPTWSPRYI+P+NVE LYDVEPCGFEDCHWLNQ  KA N + P+LTGPRCS
Subjt:  HYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

A0A1S3CT97 uncharacterized protein LOC1035040880.0e+0087.89Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFDI+TLP D N+ PS+ DE  ITDGQSVNFNGYFPSSSS  SLISLLTNQS SF PDLELVYVTER+GIS+IFYDA+FGG G+S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS+LEIPHRLQIPLLD+EQ+ EVRVS KDRPSLSGDYL+YVSTHEDP E RTSWAAVYSR+LKSG+TRRLTPYGIADFSPS+SPSGIWTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTDLYIFLTRDGS RVKVVEHGGWPCWAD+STLYFHRRGDDQWLSIY+AILPSHGEI  DSV IERLTPPGLHVFTPATSPANKNLIAVATR
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFN+VTGEFK+LT+ VSPN+HH NPF+SADGT IGYHKCRGD NGRKSN LFFENVRSPVSNLSLFRIA SFPSFSP GDR+AY +FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVINRDGSN+REVFSGAAFSTAWDPVR+GVVYTSAGPDFA VS+ VDIISVNVD++E+N KKLTTNG NNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGESKGLR LTEG WTDTMCSWSPDGDWIAFSSDR+NPG+GSFDLFLIHPNGTGLRKLFQSG  GRANHPNW PDGK LVFTTDNAGISAEP+SNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEIY IKLDGS+LQRLTHNSYEDGTPTWSPRYI+P+NVE LYDVEPCGFEDCHWLNQ  +A NDIRP+LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

A0A5D3CCB6 DPP6 N-terminal domain-like protein0.0e+0087.89Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFDI+TLP D N+ PS+ DE  ITDGQSVNFNGYFPSSSS  SLISLLTNQS SF PDLELVYVTER+GIS+IFYDA+FGG G+S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS+LEIPHRLQIPLLD+EQ+ EVRVS KDRPSLSGDYL+YVSTHEDP E RTSWAAVYSR+LKSG+TRRLTPYGIADFSPS+SPSGIWTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTDLYIFLTRDGS RVKVVEHGGWPCWAD+STLYFHRRGDDQWLSIY+AILPSHGEI  DSV IERLTPPGLHVFTPATSPANKNLIAVATR
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFN+VTGEFK+LT+ VSPN+HH NPF+SADGT IGYHKCRGD NGRKSN LFFENVRSPVSNLSLFRIA SFPSFSP GDR+AY +FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVINRDGSN+REVFSGAAFSTAWDPVR+GVVYTSAGPDFA VS+ VDIISVNVD++E+N KKLTTNG NNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGESKGLR LTEG WTDTMCSWSPDGDWIAFSSDR+NPG+GSFDLFLIHPNGTGLRKLFQSG  GRANHPNW PDGK LVFTTDNAGISAEP+SNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEIY IKLDGS+LQRLTHNSYEDGTPTWSPRYI+P+NVE LYDVEPCGFEDCHWLNQ  +A NDIRP+LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

A0A6J1FF50 uncharacterized protein LOC1114449130.0e+0087.41Show/hide
Query:  SIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVRR
        SIVFTTLGRS YAFD+FTLP D N  PS  DET ITDG+SVNFNGYFPSSSSS S++SLLTNQSQS RPDLELVYVTER+GIS+IFYDAIFGG G S RR
Subjt:  SIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVRR

Query:  RSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKGWA
        RS LEIPHRLQIPL+DDEQ+ EVRVS KDRP+LSGDYL+YVSTHEDPGE RTSWAAVYSRSL+SG TRRLTPYGIADFSPS+SPSG+WTAVASYGEKGWA
Subjt:  RSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKGWA

Query:  GEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATRRP
        GEVEELSTD+YIFLTRDG+ RVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIY+AILPS GEIS DSV IERLTPPGLHVFTPATSPANKNLIAVATRRP
Subjt:  GEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATRRP

Query:  DSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGLYV
        DSSFRHIELFNLVTGEFKELTKAVSPNSHH NPFISADGT IGYHKCRGD NGRKSN LF ENVRSPVSNLSLFRI GSFPSFSP GDR+A+V+FPGLYV
Subjt:  DSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGLYV

Query:  INRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDG
        INRDGSN+++V+ GAAFSTAWDPVREGVVYTSAGPDFA +S+ VDIISVNVDDDE+  K+LT NG NNAFPSPSPDGKWIVFRSG++GYKNLYIMDAVDG
Subjt:  INRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDG

Query:  ESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPY
        E+K L  LTEGQWTDTMCSWSPDGDWIAF+SDR NPG+GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK LVFTTDNAGISAEPISNPHHYQPY
Subjt:  ESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPY

Query:  GEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        GEIY IK+DGS+LQRLTHNSYEDGTPTWSPRYISP+NV+  YDVEPCGFEDCHWLNQK KA N+I+P LTGPRCS
Subjt:  GEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

A0A6J1K3I0 uncharacterized protein LOC1114897390.0e+0087.3Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
        G+SIVFTTLGRS YAFD+FTLP D N  PS  DET ITDG+SVNFNGYFPSSSSS S+ISLLTNQSQS RPDLELVYVTER+GIS+IFYDAI+GG G S 
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV

Query:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG
        RRRS LEIPHRLQIPLLDDEQ+ EVRVS KDRPSLSGDYL+YVSTHEDPGE RTSWAAVYSRSL+SG TRRLTPYGIADFSPS+SPSG+WTAVASYGEKG
Subjt:  RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKG

Query:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR
        WAGEVEELSTD+YIFLTRDG+ RVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIY+AILPS GEIS DSV IERLTPPGLHVFTPATSPANKNLIAVA+R
Subjt:  WAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATR

Query:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL
        RPDSSFRHIELFNLVTGEFKELTK VSPNSHH NPFISADGT IGYHKCRGD NGRKSN LF ENVRSPVSNLSLFRI GSFPSFSP GDR+A+V+FPGL
Subjt:  RPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGL

Query:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV
        YVI+RDGSN+++V+ GAAFSTAWDPVREGVVYTSAGPDFA +S+ VDIISVNVDDDE N KKLT NG NNAFPSPSPDGKWIVFRSG++GYKNLYIMDAV
Subjt:  YVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAV

Query:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ
        DGE+K L  LTEGQWTDTMCSWSPDGDWIAF+SDR NPG+GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK LVFTTDNAGISAEPISNPHHYQ
Subjt:  DGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQ

Query:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS
        PYGEI+ IK+DGS+LQRLTHNSYEDGTPTWSPRYISP+NV+  YDVEPCGFEDCHWLNQK KA N+I+P LTGPRCS
Subjt:  PYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCGFEDCHWLNQKKKADNDIRPILTGPRCS

SwissProt top hitse value%identityAlignment
Q1GE19 Tol-Pal system protein TolB3.9e-1930.12Show/hide
Query:  PSFSPVGDRVAYVD----FPGLYVINRDGSNKREVFSG---AAFSTAWDPVREGVVYT-SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPS
        P FSP GDRV Y      FP ++V++     +R + +G    +F+  + P  + +VY+ S G +        D+ S++++    N ++LT+       PS
Subjt:  PSFSPVGDRVAYVD----FPGLYVINRDGSNKREVFSG---AAFSTAWDPVREGVVYT-SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPS

Query:  PSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSP
         SPDG  IVF S ++G   LY+M A  GE+K    ++ GQ       WSP GD+IAF+       +G F + ++  +G+  R L  S        P WSP
Subjt:  PSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSP

Query:  DGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRL-THNSYEDGTPTWSP
        +G+ ++FT +  G S +             +Y + + G NL+ + T +   D  P+WSP
Subjt:  DGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRL-THNSYEDGTPTWSP

Q28TS0 Tol-Pal system protein TolB4.3e-1827.09Show/hide
Query:  THIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVD----FPGLYVINRDGSNKREVFSGAA----FSTAWDPVREGVVYT
        T + +    G  N R       +   + V  L+  R     P FSP GDR+ Y      FP +Y+++     +R + +  A    FS  + P  + VV++
Subjt:  THIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVD----FPGLYVINRDGSNKREVFSGAA----FSTAWDPVREGVVYT

Query:  SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSS
                  +  DI S+++    +  ++LT        PS SPDG  IVF S ++G + +YIM A  GE++ +   T    T     WSP GD+IAF+ 
Subjt:  SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSS

Query:  DRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSP
              +G F + ++  +G+  R L    S+     P W+P+G+ ++FT +  G    P            +Y + + G NLQR+         P WSP
Subjt:  DRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSP

Q2LRP7 Tol-Pal system protein TolB6.6e-1928.47Show/hide
Query:  KSNPLFFENVRSPVSNLSLFRIAGS-----FPSFSPVGDRVAYVDF----PGLYVINRDGSNKREV--FSGAAFSTAWDPVREGVVYTSAGPDFAAVSTI
        +S+ L      S +    L R+AGS      P +SP G  +A+  +    P ++V++  GS  +++  F G     AW P    ++ T    D      +
Subjt:  KSNPLFFENVRSPVSNLSLFRIAGS-----FPSFSPVGDRVAYVDF----PGLYVINRDGSNKREV--FSGAAFSTAWDPVREGVVYTSAGPDFAAVSTI

Query:  VDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDL
        +D+ S  V       ++LT N   +  P  SPDG+ I F S  +G   +Y+M+A  G+   +R LT     +T  +WSP G  IA+       GSG + +
Subjt:  VDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDL

Query:  FLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPR
        F I  +G  +R+L  +  AG    P+WSPDG+FL F+  + G S               I I+  +   ++ L  ++     P WSPR
Subjt:  FLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPR

Q3A097 Tol-Pal system protein TolB5.6e-1827.91Show/hide
Query:  PSFSPVGDRVAYVDF----PGLYVIN-RDGSNKREVF-SGAAFSTAWDPVREGVVYT---SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFP
        P FSP G  V Y  +    P LY      G   R  F  G   +  + P    +  T   +  P+   + T           D    ++LT++   +  P
Subjt:  PSFSPVGDRVAYVDF----PGLYVIN-RDGSNKREVF-SGAAFSTAWDPVREGVVYT---SAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFP

Query:  SPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWS
        S SP G  + F S + G  ++++MD + G++   R    G++  T  +WSPDG  IAF+        G FD++ + P+GT  R+L  +   G   HP WS
Subjt:  SPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWS

Query:  PDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWS
        PD +FLV+++D  G                 IYI++ DG+ ++R++    +   P WS
Subjt:  PDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWS

Q8KEQ0 Protein TolB homolog9.2e-2130.31Show/hide
Query:  PSFSPVGDRVAYVDF----PGLYVINRDGSNKREVFS-GAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSP
        P+ SP G  +A+ D+    P LY+ N     K  V   G   S AW P    +V T       +     D+  +  D   +  ++LT  G  +  P+ SP
Subjt:  PSFSPVGDRVAYVDF----PGLYVINRDGSNKREVFS-GAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSP

Query:  DGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK
        DG  + F S + G   ++I D   G+   +R LT     +T  SWSP+GD I +SS +    SG  ++F I+ +G+GL +L  +  +G   HP+WSPDG 
Subjt:  DGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGK

Query:  FLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWS
         +VF++   G                 +Y++  DGSN + L +   E   P+WS
Subjt:  FLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWS

Arabidopsis top hitse value%identityAlignment
AT1G21670.1 LOCATED IN: cell wall, plant-type cell wall1.6e-20654.69Show/hide
Query:  SGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGIS
        +G++I+FTT+GR ++ FDIFTLP   +  PS  DE ++TDG+S+NFNGYF  +S S +LISLL  ++Q    D+ L+YVTER G   + YD +       
Subjt:  SGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGIS

Query:  VRRRSKLEIPHRLQIPLLD-DEQEKEVRV-SLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYG
                +  R+Q+PL   +EQ+  + V S+KD P L+  YL++VSTHE+PG+P  SWAAVYS  L++  TRRLTP GIADFSP++SPSG WTAVAS+G
Subjt:  VRRRSKLEIPHRLQIPLLD-DEQEKEVRV-SLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYG

Query:  EKGWAGEV--EELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLI
        EKGW   +  +E+S+D+Y+FLT+DG+ RVKVVE GGWP W DDSTLYFHR+ DD W+S+Y+AILP  G ++  SV I+R+TPPGLH FTPATSP N N I
Subjt:  EKGWAGEV--EELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLI

Query:  AVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYV
        AVATRRP S  RH+ELF+L   EF ELT+ VSP SHH NPF+S D + +GYH CRGD  GRK+     +++++  ++LSLFR  G+FPS SP GDR A+V
Subjt:  AVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYV

Query:  DFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVD--DDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKN
         F G++V+N DGS  R++     F T WDP+R G+VYTS+GP  A   + +DI+++NVD        KKLTT G NNAFP PSPDGK IVFRS ++G KN
Subjt:  DFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVD--DDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKN

Query:  LYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPI
        LYIMDA  GES GL  LT G W DT+ +WSPDG+WI F+S+R+ PG+   +++++HP+GTGLRKL Q+ +   + HP +SPD K +VFTT  AGISAE I
Subjt:  LYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPI

Query:  SNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPR
         NPH   P  EI+ + LDGS L RLTHNS EDG P W P+
Subjt:  SNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPR

AT1G21680.1 DPP6 N-terminal domain-like protein1.3e-26464.72Show/hide
Query:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPD---LELVYVTERDGISKIFYDAIFGG-I
        G++I+FTTLGRS Y FDIF L       PS   E +ITDG+SVNFNGYFP  S SP+L+SLL +++     D   L L+YVTER+G S ++YD ++GG  
Subjt:  GNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPD---LELVYVTERDGISKIFYDAIFGG-I

Query:  GISVRRRSKLEIPHRLQIPLLD--DEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVA
            +RRS LE P R+Q+PLL   D        S KD+PSLSG++++YVSTHE  GEPR SW AVYS  LK+G+TRRLTP G+ADFSP++SPSG  TAVA
Subjt:  GISVRRRSKLEIPHRLQIPLLD--DEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVA

Query:  SYGEKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRG-DDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKN
        SYGE+GW GEVEEL TD+Y+FLTRDGSHRVKVVEHGGWPCW D+STLYFHRR  +D W+S+Y+AILP +G ++ +SV I+R+TPPG+H FTPATSP N  
Subjt:  SYGEKGWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRG-DDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKN

Query:  LIAVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVA
         +AVATRRP S +RH+ELF+L   EF ELT+ V+P SHHLNPF+S D + +GYH CRGD NGR+S  LF EN+++   +LSLFRI GSFPSFSP GDR+A
Subjt:  LIAVATRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVA

Query:  YVDFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVD--DDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGY
        YV  PG++V+  DGS +REV+ G AFSTAWDPVR G+VY+S+GP FA   T VD+IS++VD  D   + ++LTTNG NNAFP PSPDGK IVFRSG+TG+
Subjt:  YVDFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVD--DDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGY

Query:  KNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAE
        KNLYIMDA  GES GL  LTEG WTDTMC+WSPDG+WIAF+SDR++PGSGSF+LFLIHPNGTGLRKL QSG+ GR NHP +SPD K LVFT+D AGISAE
Subjt:  KNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGSGSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAE

Query:  PISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVE-GLYDVEPCGFEDCHWLNQ
        PISNPHHYQPYG+I+ +KLDGSN++RLTHNSYEDGTP W+PR+I P NVE    +   C FEDCHWLN+
Subjt:  PISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVE-GLYDVEPCGFEDCHWLNQ

AT4G01870.1 tolB protein-related2.7e-13240.49Show/hide
Query:  SIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVRR
        +I+FTT+GR+ Y FD+F+L +      +T  E ++TDG SVNFN  F +  S                   ++V+V+ER+G ++I+            + 
Subjt:  SIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISVRR

Query:  RSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLY-VSTHEDPGEPRTSWAAVYSRSLKSG--ITRRLTPYGIADFSPSISPSGIWTAVASYGEK
        RS +  P   QIP   +           DRP ++ +  LY +S HE P     +W+A+Y+  L S      R+TP   ADFSP++S SG + AVASYG +
Subjt:  RSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLY-VSTHEDPGEPRTSWAAVYSRSLKSG--ITRRLTPYGIADFSPSISPSGIWTAVASYGEK

Query:  GWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSH-GEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVA
         W GE  E++TD+ +F       RV + E GGWP W+ DST++FH + DD W SI++  +P +  E +   +   R+TP GLH FTPA     K  IA+A
Subjt:  GWAGEVEELSTDLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSH-GEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVA

Query:  TRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAY-VDF
        TRR   + RHIE+++L    F+ +T++++P+ HH NPF+S D   +GYH+ RG+    +S     E++ SP+  L L RI GSFPS SP GD +A   DF
Subjt:  TRRPDSSFRHIELFNLVTGEFKELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAY-VDF

Query:  P---GLYVINRDGSNKREVFSG-AAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKK----------LTTNGVNNAFPSPSPDGKWIV
            G+ V   DGS +  +     AF  +W P    V+YTS GP F+     V I  +  D  +    K             N  NNAFPS SPDGK IV
Subjt:  P---GLYVINRDGSNKREVFSG-AAFSTAWDPVREGVVYTSAGPDFAAVSTIVDIISVNVDDDEKNFKK----------LTTNGVNNAFPSPSPDGKWIV

Query:  FRSGQTGYKNLYIMDAVDGESK--GLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNP-GSGSFDLFLIHPNGTGLRKLFQSGSAG-------RANHPNWS
        FRSG++G+KNLYI+DAV+GES   G+R LT+G W DTM  WSP GD I FSS+R NP  +  F  +++ P+GTGLR++  SG  G       R NH +++
Subjt:  FRSGQTGYKNLYIMDAVDGESK--GLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNP-GSGSFDLFLIHPNGTGLRKLFQSGSAG-------RANHPNWS

Query:  PDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTW
         DG +LVF  + +G++AEP++ P+ +QPYG++Y++KLDG+ L+RLT N YEDGTPTW
Subjt:  PDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAAAAGGGCGGCGATATCATCAGTGGCAACAGCATAGTCTTCACCACCCTCGGCAGGTCCTCCTACGCCTTCGATATCTTCACCCTACCGGTTGATGGCAACAG
TTACCCTTCGACAGAGGATGAAACCCAAATCACCGACGGCCAATCCGTCAATTTCAATGGCTATTTCCCTTCCTCCTCATCATCTCCTTCGCTTATATCTCTTTTAACAA
ACCAATCACAGTCCTTTAGGCCAGATTTGGAACTCGTCTATGTCACAGAAAGAGACGGAATCTCCAAAATCTTCTACGATGCTATTTTCGGTGGCATCGGAATTAGTGTC
AGACGGCGGTCGAAGCTCGAGATTCCTCATCGACTACAAATTCCCCTTTTGGATGATGAGCAGGAAAAGGAAGTTAGAGTTTCTCTCAAAGATCGGCCGAGTTTAAGTGG
CGATTATTTGCTGTACGTGTCGACTCACGAGGATCCTGGTGAGCCCAGAACGAGTTGGGCAGCAGTGTACTCAAGGAGTTTGAAATCGGGTATAACTCGGCGATTAACAC
CTTATGGAATCGCCGATTTTAGCCCCTCTATTTCGCCGTCTGGAATTTGGACAGCTGTGGCTTCTTACGGCGAGAAGGGTTGGGCCGGCGAGGTTGAGGAACTAAGTACC
GATTTATACATATTCCTAACTCGCGACGGAAGTCACCGAGTTAAGGTCGTTGAACATGGTGGCTGGCCGTGTTGGGCAGACGATTCAACACTATATTTTCACCGGAGAGG
CGATGACCAGTGGCTGAGTATATACAAAGCAATTCTACCAAGTCATGGAGAAATTTCTCCTGATTCAGTGAAAATCGAACGACTCACTCCACCTGGGTTGCACGTGTTCA
CTCCAGCCACTTCACCGGCGAACAAAAACCTCATTGCCGTCGCAACAAGAAGACCTGATTCATCTTTTCGCCATATAGAGTTGTTCAACCTCGTCACTGGAGAGTTCAAG
GAGCTAACAAAAGCGGTTTCACCTAATTCTCACCATCTCAATCCGTTCATCTCTGCCGACGGGACTCATATCGGTTACCACAAGTGCAGAGGCGACGACAATGGAAGAAA
AAGCAACCCCCTATTCTTTGAGAACGTTCGTAGCCCTGTTTCTAATTTATCCTTATTTCGAATTGCCGGTTCTTTTCCCTCTTTTTCGCCTGTTGGTGACCGTGTAGCCT
ACGTGGACTTCCCAGGATTATACGTCATCAATCGGGACGGCTCAAACAAACGGGAGGTTTTCTCTGGCGCAGCATTCTCCACAGCGTGGGATCCAGTGCGAGAAGGTGTA
GTGTACACCAGTGCCGGTCCCGATTTTGCAGCGGTGAGTACTATAGTCGATATCATTTCAGTCAACGTCGATGACGACGAGAAAAATTTCAAGAAGCTGACGACGAACGG
TGTAAACAATGCCTTCCCATCGCCGTCCCCCGACGGGAAATGGATCGTTTTCCGGTCGGGTCAAACCGGGTACAAGAACCTGTACATAATGGACGCTGTCGACGGTGAGA
GCAAAGGGCTTCGCCCGCTTACAGAGGGTCAGTGGACGGATACGATGTGCAGTTGGTCGCCGGACGGCGATTGGATCGCATTTTCATCGGACCGAGACAATCCTGGTTCC
GGGAGCTTCGATTTGTTCTTGATCCACCCTAATGGAACCGGGTTGAGGAAGCTGTTTCAAAGCGGTTCAGCAGGTCGGGCGAACCACCCGAATTGGAGTCCAGACGGGAA
GTTTCTAGTATTCACTACGGATAATGCAGGAATATCTGCGGAGCCAATATCCAACCCACATCACTATCAACCCTACGGTGAAATCTACATTATCAAATTGGATGGCTCTA
ATCTTCAGAGGCTGACTCATAACTCCTACGAGGATGGGACCCCCACGTGGAGCCCACGTTACATCAGCCCCATTAACGTGGAGGGCTTATACGACGTGGAGCCCTGTGGG
TTTGAAGATTGTCACTGGCTTAACCAAAAGAAGAAGGCCGATAATGATATTAGGCCCATCTTAACTGGGCCTCGATGCAGCCATAATCCATCAACTCGCCCTCCAAACCC
ATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAAAAGGGCGGCGATATCATCAGTGGCAACAGCATAGTCTTCACCACCCTCGGCAGGTCCTCCTACGCCTTCGATATCTTCACCCTACCGGTTGATGGCAACAG
TTACCCTTCGACAGAGGATGAAACCCAAATCACCGACGGCCAATCCGTCAATTTCAATGGCTATTTCCCTTCCTCCTCATCATCTCCTTCGCTTATATCTCTTTTAACAA
ACCAATCACAGTCCTTTAGGCCAGATTTGGAACTCGTCTATGTCACAGAAAGAGACGGAATCTCCAAAATCTTCTACGATGCTATTTTCGGTGGCATCGGAATTAGTGTC
AGACGGCGGTCGAAGCTCGAGATTCCTCATCGACTACAAATTCCCCTTTTGGATGATGAGCAGGAAAAGGAAGTTAGAGTTTCTCTCAAAGATCGGCCGAGTTTAAGTGG
CGATTATTTGCTGTACGTGTCGACTCACGAGGATCCTGGTGAGCCCAGAACGAGTTGGGCAGCAGTGTACTCAAGGAGTTTGAAATCGGGTATAACTCGGCGATTAACAC
CTTATGGAATCGCCGATTTTAGCCCCTCTATTTCGCCGTCTGGAATTTGGACAGCTGTGGCTTCTTACGGCGAGAAGGGTTGGGCCGGCGAGGTTGAGGAACTAAGTACC
GATTTATACATATTCCTAACTCGCGACGGAAGTCACCGAGTTAAGGTCGTTGAACATGGTGGCTGGCCGTGTTGGGCAGACGATTCAACACTATATTTTCACCGGAGAGG
CGATGACCAGTGGCTGAGTATATACAAAGCAATTCTACCAAGTCATGGAGAAATTTCTCCTGATTCAGTGAAAATCGAACGACTCACTCCACCTGGGTTGCACGTGTTCA
CTCCAGCCACTTCACCGGCGAACAAAAACCTCATTGCCGTCGCAACAAGAAGACCTGATTCATCTTTTCGCCATATAGAGTTGTTCAACCTCGTCACTGGAGAGTTCAAG
GAGCTAACAAAAGCGGTTTCACCTAATTCTCACCATCTCAATCCGTTCATCTCTGCCGACGGGACTCATATCGGTTACCACAAGTGCAGAGGCGACGACAATGGAAGAAA
AAGCAACCCCCTATTCTTTGAGAACGTTCGTAGCCCTGTTTCTAATTTATCCTTATTTCGAATTGCCGGTTCTTTTCCCTCTTTTTCGCCTGTTGGTGACCGTGTAGCCT
ACGTGGACTTCCCAGGATTATACGTCATCAATCGGGACGGCTCAAACAAACGGGAGGTTTTCTCTGGCGCAGCATTCTCCACAGCGTGGGATCCAGTGCGAGAAGGTGTA
GTGTACACCAGTGCCGGTCCCGATTTTGCAGCGGTGAGTACTATAGTCGATATCATTTCAGTCAACGTCGATGACGACGAGAAAAATTTCAAGAAGCTGACGACGAACGG
TGTAAACAATGCCTTCCCATCGCCGTCCCCCGACGGGAAATGGATCGTTTTCCGGTCGGGTCAAACCGGGTACAAGAACCTGTACATAATGGACGCTGTCGACGGTGAGA
GCAAAGGGCTTCGCCCGCTTACAGAGGGTCAGTGGACGGATACGATGTGCAGTTGGTCGCCGGACGGCGATTGGATCGCATTTTCATCGGACCGAGACAATCCTGGTTCC
GGGAGCTTCGATTTGTTCTTGATCCACCCTAATGGAACCGGGTTGAGGAAGCTGTTTCAAAGCGGTTCAGCAGGTCGGGCGAACCACCCGAATTGGAGTCCAGACGGGAA
GTTTCTAGTATTCACTACGGATAATGCAGGAATATCTGCGGAGCCAATATCCAACCCACATCACTATCAACCCTACGGTGAAATCTACATTATCAAATTGGATGGCTCTA
ATCTTCAGAGGCTGACTCATAACTCCTACGAGGATGGGACCCCCACGTGGAGCCCACGTTACATCAGCCCCATTAACGTGGAGGGCTTATACGACGTGGAGCCCTGTGGG
TTTGAAGATTGTCACTGGCTTAACCAAAAGAAGAAGGCCGATAATGATATTAGGCCCATCTTAACTGGGCCTCGATGCAGCCATAATCCATCAACTCGCCCTCCAAACCC
ATAA
Protein sequenceShow/hide protein sequence
MLEKGGDIISGNSIVFTTLGRSSYAFDIFTLPVDGNSYPSTEDETQITDGQSVNFNGYFPSSSSSPSLISLLTNQSQSFRPDLELVYVTERDGISKIFYDAIFGGIGISV
RRRSKLEIPHRLQIPLLDDEQEKEVRVSLKDRPSLSGDYLLYVSTHEDPGEPRTSWAAVYSRSLKSGITRRLTPYGIADFSPSISPSGIWTAVASYGEKGWAGEVEELST
DLYIFLTRDGSHRVKVVEHGGWPCWADDSTLYFHRRGDDQWLSIYKAILPSHGEISPDSVKIERLTPPGLHVFTPATSPANKNLIAVATRRPDSSFRHIELFNLVTGEFK
ELTKAVSPNSHHLNPFISADGTHIGYHKCRGDDNGRKSNPLFFENVRSPVSNLSLFRIAGSFPSFSPVGDRVAYVDFPGLYVINRDGSNKREVFSGAAFSTAWDPVREGV
VYTSAGPDFAAVSTIVDIISVNVDDDEKNFKKLTTNGVNNAFPSPSPDGKWIVFRSGQTGYKNLYIMDAVDGESKGLRPLTEGQWTDTMCSWSPDGDWIAFSSDRDNPGS
GSFDLFLIHPNGTGLRKLFQSGSAGRANHPNWSPDGKFLVFTTDNAGISAEPISNPHHYQPYGEIYIIKLDGSNLQRLTHNSYEDGTPTWSPRYISPINVEGLYDVEPCG
FEDCHWLNQKKKADNDIRPILTGPRCSHNPSTRPPNP