; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003449 (gene) of Snake gourd v1 genome

Gene IDTan0003449
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPB1 domain-containing protein
Genome locationLG09:69966092..69968961
RNA-Seq ExpressionTan0003449
SyntenyTan0003449
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000270 - PB1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587664.1 hypothetical protein SDJN03_16229, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.07Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATATTA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVGLEE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +P AAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPI Q QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P + QKTVPAAMPEMAAS+YRTAVTTTPPPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

KAG7021629.1 hypothetical protein SDJN02_15355, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.2Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATATTA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVGLEE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +P AAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTT PPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRN
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RN
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRN

XP_022925527.1 uncharacterized protein LOC111432842 [Cucurbita moschata]0.0e+0089.22Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATATTA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVGLEE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +P AAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTT PPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

XP_022973326.1 uncharacterized protein LOC111471889 [Cucurbita maxima]0.0e+0089.22Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATA TA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVG EE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +PAAAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTTPPPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

XP_023529543.1 uncharacterized protein LOC111792366 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0088.77Show/hide
Query:  MDPPPPPPATATT---AAAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MD PPPP ATATT    AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATT---AAAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT++SPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVGLEE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHEN+NRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +P AAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTTPPPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

TrEMBL top hitse value%identityAlignment
A0A0A0LPU6 PB1 domain-containing protein0.0e+0085.76Show/hide
Query:  PPPPATATTAAAANFAATSI--HHQ-NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRL
        PPPPA+ATTAAAAN+A  S+  HHQ NYQ+SIESSPRS+NAD WDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDLCMRL
Subjt:  PPPPATATTAAAANFAATSI--HHQ-NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRL

Query:  SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS
        SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLFLFF KPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS
Subjt:  SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS

Query:  ATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGN
        ATMECMVNLDAV  SDSCNDLE+QGHDSLGHIDKQVVKNS+SAQD+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+E EGRVQDPKVGLEEQFG 
Subjt:  ATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGN

Query:  MNFGVPPIPTAL-AAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD
        MNFGVPPIPTAL AAAATI L  +SNHENANRV  DDER EQ G VAFRKPPLPLQTLQNR VA P VS GF LPSPDSVASDSSI SANSQSKP+YF D
Subjt:  MNFGVPPIPTAL-AAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD

Query:  QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHP-AAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQY
        QI+R+N +Q+ PI+ESDGF+TSQQV IQ L DPAY+LTSQLDQ    +QPQQF+H TTHYIHHHHP AAAGHVPVQ YYHPIYT TPSQQQLHHPIDQQY
Subjt:  QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHP-AAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQY

Query:  PVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLH
        PVYLMPITQ QPTYNM VQSSPAETP+A PNRQASASP +V+S + Y+DSNQP L+PQK V AAMPEMAA+VYRTAVT+ PPPLLQVPHNQFQQPYMGL 
Subjt:  PVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLH

Query:  QMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        QMNYPSQS+AVAP PSGTANYGFDY  AP Q+I   PMASQYQTMTQ AAAAALSDASRQLP DG+ QQQ+RNS
Subjt:  QMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

A0A5A7USG9 Ataxin-2-like protein0.0e+0084.76Show/hide
Query:  PPPPATATTAAAANFAATSIHHQ---NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRL
        P PPA ATTAAAANFA  S+HHQ   NYQDSIESSPRS+NAD WDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH++LSDLCMRL
Subjt:  PPPPATATTAAAANFAATSIHHQ---NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRL

Query:  SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS
        SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASP KPLRIRLFLFF KPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS
Subjt:  SRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS

Query:  ATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGN
        ATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQVVK S+S QD+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+E EGRVQDPKVGLEEQFG 
Subjt:  ATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGN

Query:  MNFGVPPIPTAL-AAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD
        MNFGVPPIPTAL AAAAT+ L  +SN ENANRV  DDER EQ G VAFRKPPLPLQTLQNR VA P VS GF LPSPDSVASDSSI SANSQSKP+YF D
Subjt:  MNFGVPPIPTAL-AAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD

Query:  QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHP-AAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQY
        QI+RDN + + PI+ESDGF+TSQQV IQ LHDP Y+LTSQLDQ    +QPQQF+HT THYIHHHHP AAAGHVPVQ YYHPIYT TPSQQQLHHPIDQQY
Subjt:  QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHP-AAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQY

Query:  PVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLH
        PVYLMPITQ QPTYNM VQSSPAETP+A PNRQASASP +V+S + Y+DS+QP L+PQK V AAMPEMAA+VYRTAVT+ PPPLLQVPHNQFQQPY+GL 
Subjt:  PVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLH

Query:  QMNYPSQSIAV--APTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        QMNYPSQS+AV  APTPSGTANYGFDY  AP Q+I   PMASQYQTMTQ AAAAALSDASRQLP DG+ QQQ+RNS
Subjt:  QMNYPSQSIAV--APTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

A0A6J1C2E1 uncharacterized protein LOC1110068150.0e+0084.43Show/hide
Query:  PPPPATATTAAAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRT
        PPPPA A TAAAA          NYQDS+ESSPRS+NADSWDDP+ PVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SL+DLCMRLSRT
Subjt:  PPPPATATTAAAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRT

Query:  LLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDSATM
        LLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDSATM
Subjt:  LLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDSATM

Query:  ECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGNMNF
        ECMV+LDAVR SDSCNDLE+QG DSLGH DKQV  NSNS  D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+E EGRVQDPKVGLEEQFG MNF
Subjt:  ECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGNMNF

Query:  GVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQDQIIR
        GVPP+PTALA   T+   A+ NHENANR  +DDER E GGPVAFRKPPLPLQ LQNRAVAGPAVS GFSLPSPDSVASDSSI SANSQSKPMY+QDQI+R
Subjt:  GVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQDQIIR

Query:  DNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVYLM
        DNT+  IPI+ESDG+MTSQQV IQQLHDP YVLTSQLD Q+QPQQPQQF+H+TTHYIHHHHPAA GHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVY+M
Subjt:  DNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVYLM

Query:  PITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQK-TVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLHQMNY
        PI Q Q  YN+PVQSS AETPMAAPNRQAS SP MV+SPV Y+DSN PPL+ QK   PAAM EMAA++YRTAVT+TPPPLLQVPHNQ+QQPYMGL QMNY
Subjt:  PITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPLFPQK-TVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLHQMNY

Query:  PSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLR
        PSQSIAVA  PSG ANYGFDY  AP Q+I   PMASQYQTM QAAAAAALSDASRQLPADG+ QQ +R
Subjt:  PSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLR

A0A6J1EBX8 uncharacterized protein LOC1114328420.0e+0089.22Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATATTA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVGLEE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +P AAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTT PPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

A0A6J1I8C4 uncharacterized protein LOC1114718890.0e+0089.22Show/hide
Query:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL
        MDPPPPP ATA TA   AAANFAAT+I H NYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGG+TRIVVVDRH+SLSDL
Subjt:  MDPPPPPPATATTA---AAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDL

Query:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
        CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRIT+ASPLKPLRIRLF+FFTKPET ASMGSLLDDAKHETWFVDALN+SAGMIPRG
Subjt:  CMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE
        LSDSATMECMVNLDAVR SDSCNDLE+QGHDSLGHIDKQV KNSNSA D+QSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRV+EAE RVQDPKVG EE
Subjt:  LSDSATMECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEE

Query:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY
        QFG M+FGVPPIPTALAAAA +SLAAVSNHENANRV NDDER EQGGPVAFRKPPLPLQTLQNR V  PAVS GFSLPSPDSVASDSSITSANSQSKPMY
Subjt:  QFGNMNFGVPPIPTALAAAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMY

Query:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ
        FQDQI+RD+++ +IPI+ESDGFMTSQQVQIQQLHDPAYVLTSQLD Q+QPQQPQQF+H+TTHYIHH +PAAAGHVPV SYYHP+YTQTPSQQQLHHPIDQ
Subjt:  FQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQ

Query:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM
        QYPVYLMPITQ QPTYNMPVQSSPAETPMAAPNRQASASPT+VASPVAY+DS NQ P +PQKTVPAAMPEMAAS+YRTAVTTTPPPLLQV HNQFQQPYM
Subjt:  QYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDS-NQPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYM

Query:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS
        GL QMNYP QSIA+AP PSG ANYGFDY  AP Q+I   PM+SQYQTMT AAAAAALSDASRQLP+DGS QQQ RNS
Subjt:  GLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSI---PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQLRNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein5.9e-10540.62Show/hide
Query:  MDPPPPPPATATTAAAANFAATS--------IHHQNYQDSIESSPRSNNADSWDD-PLPPVPG---------AKLRLMCSYGGHIIPRPHDKSLCYVGGE
        M+PPP   ++   A+ +  AAT+            +Y +S++SSPRS   D WDD P P   G         +KLR MCSYGGHI+PRPHDKSLCY+GG+
Subjt:  MDPPPPPPATATTAAAANFAATS--------IHHQNYQDSIESSPRSNNADSWDD-PLPPVPG---------AKLRLMCSYGGHIIPRPHDKSLCYVGGE

Query:  TRIVVVDRHASLSDLCMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDR-ITLASPLKPLRIRLFLFFTKPETAASMGSLLD-DAKHE
        TRIVVVDR++SL  L  RLS TLL+GR FTLKYQLP EDLDSLIS+ TDEDL+NMIEEYDR I+ ++  KP R+RLFLF +KPE   SMG +L+  AK +
Subjt:  TRIVVVDRHASLSDLCMRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDR-ITLASPLKPLRIRLFLFFTKPETAASMGSLLD-DAKHE

Query:  TWFVDALNNSAGMIPRGLSDSAT-MECMVNLD---AVRASDSCNDLEDQGHDSLGHIDK------QVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPS
         WF++AL NSAG++ RG SDS T +  ++ LD   A+R++   N+  D    S+    +      Q  +     QD+  +PDSP ++  SSFGS+SS PS
Subjt:  TWFVDALNNSAGMIPRGLSDSAT-MECMVNLD---AVRASDSCNDLEDQGHDSLGHIDK------QVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPS

Query:  MSNLPPIRVRVDEAEGRVQDP---KVGLEEQFGNMNFG---------------VPPIPTALA-AAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKP
        ++NLPPIRV V+E  G    P    +G+EEQF   N G                PP+P  +A  AA ++ A VSN   A RV +DDER + G    +RKP
Subjt:  MSNLPPIRVRVDEAEGRVQDP---KVGLEEQFGNMNFG---------------VPPIPTALA-AAATISLAAVSNHENANRVLNDDERMEQGGPVAFRKP

Query:  PLPLQTLQN---------RAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD---QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTS
        P P    QN         ++ +G     G  LPSP+SV+SDSS+++     +P  +Q+   QI   +T+ +  I+ SD      Q Q Q   DPAY+L  
Subjt:  PLPLQTLQN---------RAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQD---QIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTS

Query:  QLDQQQQPQQP-QQFLHTTT--HYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHP--IDQQ-YPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQA
        Q +QQ    QP QQF+HT     YIHHH    +  +PV +Y     +Q P Q    H   +DQQ YPVY +        Y+MPV  SP+ +  A      
Subjt:  QLDQQQQPQQP-QQFLHTTT--HYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHP--IDQQ-YPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQA

Query:  SASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMA-ASVYRTAVTTTPPPLL-QVPHNQFQQPYMGLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQS
          + TM+           PP    ++V +  PEM  A VY TA       ++ Q+P N  QQ +MG  Q+ +P QS       +G  NYG++Y       
Subjt:  SASPTMVASPVAYSDSNQPPLFPQKTVPAAMPEMA-ASVYRTAVTTTPPPLL-QVPHNQFQQPYMGLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQS

Query:  I----PMA-SQYQTMTQAAAAAALSD--ASRQLPADGSHQQQLRNS
        I    PM  +QYQTMT    A  + D  A+ +LPA+ +  QQ+R+S
Subjt:  I----PMA-SQYQTMTQAAAAAALSD--ASRQLPADGSHQQQLRNS

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein2.2e-9941.69Show/hide
Query:  MDPPPPPPATATTAAAANFAATSIHHQNYQDSIESSPRSNNADSW--DDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLC
        + P P P A  ++A  A   + ++     Q  I+ SPR+   D+    +PL  VPGAKLRLMCS+GGHI+PRPHDKSL Y GGETRIVVVDR ASLS L 
Subjt:  MDPPPPPPATATTAAAANFAATSIHHQNYQDSIESSPRSNNADSW--DDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLC

Query:  MRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDR-ITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG
         RLS  LLNGR FTLKYQLP EDLDSL++I TDEDLENMIEEYDR  + A+     R+RLFLF  K ETAA+MGSLLD  K +TWFVDALN S G++PRG
Subjt:  MRLSRTLLNGRPFTLKYQLPHEDLDSLISIATDEDLENMIEEYDR-ITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRG

Query:  LSDSATM-ECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQ--VVKN---SNSAQDIQSIPDSPAVE-NDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQD
        LSDSA +   +VNLD   AS    ++++   +  G  +K+  +V N   S+    + S+PDSP +E   SS GSSSS PS SNLPPIRVRV E +     
Subjt:  LSDSATM-ECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQ--VVKN---SNSAQDIQSIPDSPAVE-NDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQD

Query:  PKVGLEEQFGNMNFG---------------------VPPIPTALAAAATISLAAVSNHENAN-RVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAV
            +EEQ   M F                      +PP     AA A  +         +N  V  +D+R E G    +RKPPLP+Q +       P  
Subjt:  PKVGLEEQFGNMNFG---------------------VPPIPTALAAAATISLAAVSNHENAN-RVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAV

Query:  SPGFSLPSPDSVASDSSITSANSQSKPMYFQDQ---------IIRDNTI-----QSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFL
          G+ L SPDSVASD+SI+SA S SKPMY+QDQ         + +  T      Q +P +E+    TS Q     L  P    T+   QQQQP   Q FL
Subjt:  SPGFSLPSPDSVASDSSITSANSQSKPMYFQDQ---------IIRDNTI-----QSIPISESDGFMTSQQVQIQQLHDPAYVLTSQLDQQQQPQQPQQFL

Query:  HTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPL
        H    YI H            S Y PIY    S QQ      Q YPVY+M + Q+Q       Q  PA TP   PN +   +                  
Subjt:  HTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSNQPPL

Query:  FPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGL-----HQMNYPSQSIAVAP--TPSGTANYG--FDYAIAPVQ-------------SI
                + PE A +VYR A    PP ++Q+   Q Q  YMG      H  N  +     AP  + +  ANYG  F+Y  +P +             +I
Subjt:  FPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGL-----HQMNYPSQSIAVAP--TPSGTANYG--FDYAIAPVQ-------------SI

Query:  PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQ
        P+AS YQ+MT AAAAAAL+D S+Q+  DG  QQQ
Subjt:  PMASQYQTMTQAAAAAALSDASRQLPADGSHQQQ

AT4G05150.1 Octicosapeptide/Phox/Bem1p family protein3.0e-3235.15Show/hide
Query:  DSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTLLNGRP-FTLKYQLPHEDLDSLISIA
        DS+ SSPRS       D  P     ++R MC++GG I+PRP D  LCYVGG+ R+V V RH + + L  +L++  L+G+   ++KYQLP+EDLD+LIS++
Subjt:  DSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTLLNGRP-FTLKYQLPHEDLDSLISIA

Query:  TDEDLENMIEEYDRITLASPLKPLRIRLFLFF-------TKPETAASMGSLLDDA-KHETWFVDALNNSAGMIPRGLSD------------------SAT
        TDED+ENM++EYDR+      +  R+RLFLF             A+S+ SLLD +   E WF+DALN  +      +S+                  S  
Subjt:  TDEDLENMIEEYDRITLASPLKPLRIRLFLFF-------TKPETAASMGSLLDDA-KHETWFVDALNNSAGMIPRGLSD------------------SAT

Query:  MECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVP----SMSNLPPIRVRVDEAEGRVQDPK
         + +  LD    +   ++L D+  D    I ++V   S+        P SP  +  S +GS+SS P    S   LPP      E+   V  PK
Subjt:  MECMVNLDAVRASDSCNDLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVP----SMSNLPPIRVRVDEAEGRVQDPK

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein7.7e-2848.3Show/hide
Query:  HQNYQDSIESSPRS-----NNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTLL---NGRPFTLKYQL
        + +Y DS ESSPRS      N   W+D        K++LMCSYGG I PRPHD  L YV G+T+I+ VDR      L  +LS       +G   + KYQL
Subjt:  HQNYQDSIESSPRS-----NNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTLL---NGRPFTLKYQL

Query:  PHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKP
        P EDLD+LIS+  DEDLE+M+ EYDR+ L    KP R+RLFLF + P
Subjt:  PHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKP

AT5G64430.1 Octicosapeptide/Phox/Bem1p family protein1.5e-2843.39Show/hide
Query:  HQNYQDSIESSPRSNNADSWDDPLPP---------VPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTL----LNGRPFT
        + +Y DS +SSPRS   + +D+P PP             K++ MCSYGG I PRPHD  L YV GET+I+ VDR      L  +LS         G   T
Subjt:  HQNYQDSIESSPRSNNADSWDDPLPP---------VPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTL----LNGRPFT

Query:  LKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS
         KYQLP EDLD+LIS+  D+DLE+M+ EYDR+   S  KP R+RLFLF      ++  GS          FV+ALN     +PR LS+S
Subjt:  LKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCCACCACCACCTCCCGCCACTGCAACAACGGCAGCCGCCGCTAACTTTGCAGCGACCTCCATTCACCACCAGAACTATCAGGATTCCATAGAATCGTCACC
TCGCTCCAACAATGCCGACTCTTGGGACGACCCTCTCCCGCCAGTTCCAGGCGCCAAGCTCCGCCTGATGTGCAGCTATGGCGGCCACATTATCCCTCGTCCTCATGATA
AGTCTCTCTGCTACGTCGGCGGCGAAACCCGCATTGTCGTCGTCGACCGCCATGCCTCTCTTTCTGACCTCTGCATGCGCCTCTCACGAACTCTCCTTAATGGCCGCCCA
TTCACCCTTAAGTACCAGCTCCCTCACGAAGATCTCGATTCCCTAATTTCTATCGCGACGGATGAAGATCTCGAGAACATGATCGAGGAATACGACCGAATCACCTTGGC
TTCGCCGTTGAAACCTCTGCGAATCCGTCTGTTCCTCTTCTTCACCAAGCCTGAAACCGCGGCTTCTATGGGTTCGTTGCTCGACGATGCCAAACACGAAACGTGGTTCG
TCGATGCCCTAAATAACTCGGCCGGGATGATTCCGAGGGGGCTTTCCGATTCGGCTACCATGGAGTGTATGGTGAATCTCGACGCGGTTCGTGCGAGTGATTCGTGTAAT
GATTTGGAAGATCAAGGTCACGATTCTTTAGGGCATATTGACAAACAGGTTGTGAAGAATAGTAATTCTGCCCAAGATATCCAATCCATTCCTGATTCTCCCGCGGTGGA
AAATGATTCTTCGTTTGGTTCGTCTTCTTCGGTGCCGTCGATGTCCAACTTGCCGCCGATACGTGTTCGTGTGGACGAGGCGGAGGGCAGAGTTCAAGATCCGAAGGTGG
GATTAGAGGAGCAGTTTGGGAATATGAATTTTGGTGTACCTCCAATTCCTACTGCCTTGGCTGCGGCAGCAACTATTTCATTGGCTGCTGTTTCTAATCACGAAAATGCT
AATCGGGTCCTTAACGACGACGAGAGAATGGAGCAGGGGGGTCCGGTGGCATTCCGGAAACCTCCATTGCCATTGCAGACCTTGCAAAACAGGGCTGTTGCAGGTCCTGC
TGTTAGCCCTGGTTTTAGTCTACCGTCGCCGGATTCTGTTGCCAGTGATAGTAGCATAACATCTGCAAATTCTCAGTCCAAACCCATGTATTTCCAAGACCAAATTATAC
GAGACAACACAATCCAATCCATTCCCATTTCAGAAAGCGATGGTTTCATGACAAGTCAACAAGTACAAATTCAACAGCTCCATGACCCTGCCTATGTATTAACTTCACAA
TTAGATCAGCAGCAGCAGCCGCAACAGCCGCAACAATTCCTCCACACCACCACCCATTACATCCATCACCACCACCCTGCAGCCGCGGGGCATGTCCCCGTTCAATCGTA
TTATCATCCCATTTACACACAAACACCATCTCAACAGCAACTTCATCATCCTATTGATCAGCAATATCCAGTCTATTTAATGCCCATTACGCAGGCTCAACCGACCTACA
ATATGCCTGTTCAATCTAGTCCTGCTGAGACTCCAATGGCTGCACCAAACCGGCAAGCATCAGCTAGTCCTACCATGGTCGCTTCTCCAGTTGCTTACAGTGATAGTAAT
CAACCGCCCCTCTTCCCTCAGAAGACAGTTCCTGCAGCCATGCCAGAAATGGCAGCAAGCGTGTATAGAACTGCAGTAACCACGACTCCTCCTCCTTTGTTACAGGTTCC
TCATAATCAGTTCCAGCAACCATACATGGGTCTTCACCAAATGAACTATCCATCTCAGTCCATAGCAGTCGCTCCCACTCCCTCGGGCACTGCTAATTATGGTTTTGACT
ACGCTATTGCTCCAGTGCAGAGCATCCCGATGGCTTCTCAGTACCAAACTATGACTCAGGCAGCGGCAGCTGCAGCTCTATCAGATGCTTCTAGACAGCTTCCGGCAGAT
GGCTCACACCAGCAGCAGCTCAGGAATTCATAG
mRNA sequenceShow/hide mRNA sequence
ATTTTTTACAAACAAAAATATATAAAAAGAAATGATTAAATAACACCCATGTTTCAATTAAAGACTCCAGTTGTGTATTCCGTTCGAATCAACCATTAAAACCCAATTTT
TCCCATCTGAAAAACTGAATTCATTAGTACTGATTCACTGTAATGGATCCTCCACCACCACCTCCCGCCACTGCAACAACGGCAGCCGCCGCTAACTTTGCAGCGACCTC
CATTCACCACCAGAACTATCAGGATTCCATAGAATCGTCACCTCGCTCCAACAATGCCGACTCTTGGGACGACCCTCTCCCGCCAGTTCCAGGCGCCAAGCTCCGCCTGA
TGTGCAGCTATGGCGGCCACATTATCCCTCGTCCTCATGATAAGTCTCTCTGCTACGTCGGCGGCGAAACCCGCATTGTCGTCGTCGACCGCCATGCCTCTCTTTCTGAC
CTCTGCATGCGCCTCTCACGAACTCTCCTTAATGGCCGCCCATTCACCCTTAAGTACCAGCTCCCTCACGAAGATCTCGATTCCCTAATTTCTATCGCGACGGATGAAGA
TCTCGAGAACATGATCGAGGAATACGACCGAATCACCTTGGCTTCGCCGTTGAAACCTCTGCGAATCCGTCTGTTCCTCTTCTTCACCAAGCCTGAAACCGCGGCTTCTA
TGGGTTCGTTGCTCGACGATGCCAAACACGAAACGTGGTTCGTCGATGCCCTAAATAACTCGGCCGGGATGATTCCGAGGGGGCTTTCCGATTCGGCTACCATGGAGTGT
ATGGTGAATCTCGACGCGGTTCGTGCGAGTGATTCGTGTAATGATTTGGAAGATCAAGGTCACGATTCTTTAGGGCATATTGACAAACAGGTTGTGAAGAATAGTAATTC
TGCCCAAGATATCCAATCCATTCCTGATTCTCCCGCGGTGGAAAATGATTCTTCGTTTGGTTCGTCTTCTTCGGTGCCGTCGATGTCCAACTTGCCGCCGATACGTGTTC
GTGTGGACGAGGCGGAGGGCAGAGTTCAAGATCCGAAGGTGGGATTAGAGGAGCAGTTTGGGAATATGAATTTTGGTGTACCTCCAATTCCTACTGCCTTGGCTGCGGCA
GCAACTATTTCATTGGCTGCTGTTTCTAATCACGAAAATGCTAATCGGGTCCTTAACGACGACGAGAGAATGGAGCAGGGGGGTCCGGTGGCATTCCGGAAACCTCCATT
GCCATTGCAGACCTTGCAAAACAGGGCTGTTGCAGGTCCTGCTGTTAGCCCTGGTTTTAGTCTACCGTCGCCGGATTCTGTTGCCAGTGATAGTAGCATAACATCTGCAA
ATTCTCAGTCCAAACCCATGTATTTCCAAGACCAAATTATACGAGACAACACAATCCAATCCATTCCCATTTCAGAAAGCGATGGTTTCATGACAAGTCAACAAGTACAA
ATTCAACAGCTCCATGACCCTGCCTATGTATTAACTTCACAATTAGATCAGCAGCAGCAGCCGCAACAGCCGCAACAATTCCTCCACACCACCACCCATTACATCCATCA
CCACCACCCTGCAGCCGCGGGGCATGTCCCCGTTCAATCGTATTATCATCCCATTTACACACAAACACCATCTCAACAGCAACTTCATCATCCTATTGATCAGCAATATC
CAGTCTATTTAATGCCCATTACGCAGGCTCAACCGACCTACAATATGCCTGTTCAATCTAGTCCTGCTGAGACTCCAATGGCTGCACCAAACCGGCAAGCATCAGCTAGT
CCTACCATGGTCGCTTCTCCAGTTGCTTACAGTGATAGTAATCAACCGCCCCTCTTCCCTCAGAAGACAGTTCCTGCAGCCATGCCAGAAATGGCAGCAAGCGTGTATAG
AACTGCAGTAACCACGACTCCTCCTCCTTTGTTACAGGTTCCTCATAATCAGTTCCAGCAACCATACATGGGTCTTCACCAAATGAACTATCCATCTCAGTCCATAGCAG
TCGCTCCCACTCCCTCGGGCACTGCTAATTATGGTTTTGACTACGCTATTGCTCCAGTGCAGAGCATCCCGATGGCTTCTCAGTACCAAACTATGACTCAGGCAGCGGCA
GCTGCAGCTCTATCAGATGCTTCTAGACAGCTTCCGGCAGATGGCTCACACCAGCAGCAGCTCAGGAATTCATAGGCAGTACACGCTAATCAGGTAAGACAAGAAAAATA
TGTTGTGTTTTGTTCCTTCTTTCTCAAATCATTGTTACTAAAACTTATTCTCATTGCCGTATCCTGGTTGAAAATTTTATGCTCTACTCTATTACTCTGCTTTTAGACCA
AATTGTAGACAAAACGTTGTAATGCTTTATACTATTTATCATAGATTAATG
Protein sequenceShow/hide protein sequence
MDPPPPPPATATTAAAANFAATSIHHQNYQDSIESSPRSNNADSWDDPLPPVPGAKLRLMCSYGGHIIPRPHDKSLCYVGGETRIVVVDRHASLSDLCMRLSRTLLNGRP
FTLKYQLPHEDLDSLISIATDEDLENMIEEYDRITLASPLKPLRIRLFLFFTKPETAASMGSLLDDAKHETWFVDALNNSAGMIPRGLSDSATMECMVNLDAVRASDSCN
DLEDQGHDSLGHIDKQVVKNSNSAQDIQSIPDSPAVENDSSFGSSSSVPSMSNLPPIRVRVDEAEGRVQDPKVGLEEQFGNMNFGVPPIPTALAAAATISLAAVSNHENA
NRVLNDDERMEQGGPVAFRKPPLPLQTLQNRAVAGPAVSPGFSLPSPDSVASDSSITSANSQSKPMYFQDQIIRDNTIQSIPISESDGFMTSQQVQIQQLHDPAYVLTSQ
LDQQQQPQQPQQFLHTTTHYIHHHHPAAAGHVPVQSYYHPIYTQTPSQQQLHHPIDQQYPVYLMPITQAQPTYNMPVQSSPAETPMAAPNRQASASPTMVASPVAYSDSN
QPPLFPQKTVPAAMPEMAASVYRTAVTTTPPPLLQVPHNQFQQPYMGLHQMNYPSQSIAVAPTPSGTANYGFDYAIAPVQSIPMASQYQTMTQAAAAAALSDASRQLPAD
GSHQQQLRNS