; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003662 (gene) of Snake gourd v1 genome

Gene IDTan0003662
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110415317
Genome locationLG09:32236130..32241219
RNA-Seq ExpressionTan0003662
SyntenyTan0003662
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022299.1 hypothetical protein SDJN02_16030 [Cucurbita argyrosperma subsp. argyrosperma]1.8e-15787.46Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYAGLC+RP   AI I NCSSEVG++RRQVLEQLDGELAKGDDR+ALSLLKE QGKPGGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLG+
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        +GSLNLQAGT FVDFEFLEEVNAGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

XP_022154618.1 uncharacterized protein LOC111021839 [Momordica charantia]3.8e-16089.39Show/hide
Query:  MQYAGLCYRP------AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDAT
        MQYAGLC RP      AIGI NCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKES+GKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLD T
Subjt:  MQYAGLCYRP------AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDAT

Query:  LGSIERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEA
        LGSIERN+QL AALLGVSAWNVFD SPQ IFY S+GFLFLWTLDSVA NGGVGSLVLD IGH FSQKYHNRVIQHEAGHFLIAYLLGILPK YTLSSLEA
Subjt:  LGSIERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEA

Query:  LKREGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIA
         ++EGSLN QAGT FVD EFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRR+E A
Subjt:  LKREGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIA

Query:  RAKLADSMSSGKSVGNCIDIIENNIDFSDI
        RAKLAD+MSS KSVG+CIDIIENN+D SD+
Subjt:  RAKLADSMSSGKSVGNCIDIIENNIDFSDI

XP_022925962.1 uncharacterized protein LOC111433221 isoform X1 [Cucurbita moschata]8.0e-15887.77Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        +QYAGLC+RP   AI I NCSSEVG++RRQVLEQLDGELAKGDDRAALSLLKE QGKPGGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLG+
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEVNAGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

XP_022971098.1 uncharacterized protein LOC111469866 isoform X1 [Cucurbita maxima]3.4e-15687.77Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYAGLC+RP   AI I NCSSEV LRRRQVLEQLDGELAKGDDRAALS+LKE QGK GGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLGS
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEV AGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

XP_023530264.1 uncharacterized protein LOC111792881 isoform X1 [Cucurbita pepo subsp. pepo]4.7e-15888.07Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYAGLC+RP   AI I NCS EVG+RRRQVL+QLDGELAKGDDRAALSLLKE QGKPGGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLGS
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEVNAGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

TrEMBL top hitse value%identityAlignment
A0A1S3BQI6 uncharacterized protein LOC103492217 isoform X19.9e-15485.93Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYA L YRP   A GIWNCSSEVGLRRRQVLEQ+D ELAKGDDRAAL LLKESQGK  G+RCFGAARQIPQRLYTL+ELKLNGIETSSLLSPLD+TLGS
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IER +QL A LL VSAWN+F+ +PQ IFYLSLGFLFLWTLDSVA NGGVGSLVLDTIGH FS+KYHNRVIQHEAGHFLIAYLLG+LPK YT SS EA ++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLD+LLKGLGFTQKK DSQVRWAVLNTVLILRRHE AR K
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCID+IEN+I   D+
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

A0A5D3CDZ5 Uncharacterized protein9.9e-15485.93Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYA L YRP   A GIWNCSSEVGLRRRQVLEQ+D ELAKGDDRAAL LLKESQGK  G+RCFGAARQIPQRLYTL+ELKLNGIETSSLLSPLD+TLGS
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IER +QL A LL VSAWN+F+ +PQ IFYLSLGFLFLWTLDSVA NGGVGSLVLDTIGH FS+KYHNRVIQHEAGHFLIAYLLG+LPK YT SS EA ++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLD+LLKGLGFTQKK DSQVRWAVLNTVLILRRHE AR K
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCID+IEN+I   D+
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

A0A6J1DKS7 uncharacterized protein LOC1110218391.9e-16089.39Show/hide
Query:  MQYAGLCYRP------AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDAT
        MQYAGLC RP      AIGI NCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKES+GKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLD T
Subjt:  MQYAGLCYRP------AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDAT

Query:  LGSIERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEA
        LGSIERN+QL AALLGVSAWNVFD SPQ IFY S+GFLFLWTLDSVA NGGVGSLVLD IGH FSQKYHNRVIQHEAGHFLIAYLLGILPK YTLSSLEA
Subjt:  LGSIERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEA

Query:  LKREGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIA
         ++EGSLN QAGT FVD EFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRR+E A
Subjt:  LKREGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIA

Query:  RAKLADSMSSGKSVGNCIDIIENNIDFSDI
        RAKLAD+MSS KSVG+CIDIIENN+D SD+
Subjt:  RAKLADSMSSGKSVGNCIDIIENNIDFSDI

A0A6J1ED20 uncharacterized protein LOC111433221 isoform X13.9e-15887.77Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        +QYAGLC+RP   AI I NCSSEVG++RRQVLEQLDGELAKGDDRAALSLLKE QGKPGGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLG+
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEVNAGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

A0A6J1I5V2 uncharacterized protein LOC111469866 isoform X11.6e-15687.77Show/hide
Query:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS
        MQYAGLC+RP   AI I NCSSEV LRRRQVLEQLDGELAKGDDRAALS+LKE QGK GGLRCFGAARQIP+R+YTLDELKLNGIE SSLLSPLDATLGS
Subjt:  MQYAGLCYRP---AIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGS

Query:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR
        IERN+Q  AAL+GV AWNVF I+PQ +FYLS+GFLFLWTLDSV  NGGVGSLVLDTIGH FSQKYHNRVIQHEAGHFLIAYLLG++PK YTLSSLEAL++
Subjt:  IERNVQLVAALLGVSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKR

Query:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK
        EGSLNLQAGT FVDFEFLEEV AGKVS TMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKK DSQVRWAVLNTVLILRRHE ARAK
Subjt:  EGSLNLQAGTTFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAK

Query:  LADSMSSGKSVGNCIDIIENNIDFSDI
        LAD+MSSGKSVGNCIDIIE +IDF DI
Subjt:  LADSMSSGKSVGNCIDIIENNIDFSDI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54680.1 unknown protein1.0e-3341.38Show/hide
Query:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLEEV---------------NAGKVSATMLNRFSCIALAGVATEYLLYGCAE
        V+QHE+GHFL+ YLLG+LP+ Y + +LEA+++  S N+     FV FEFL++V               N G +S+  LN FSC+ L G+ TE++L+G +E
Subjt:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLEEV---------------NAGKVSATMLNRFSCIALAGVATEYLLYGCAE

Query:  GGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI
        G  +DI KL+ +L+ LGFT+ + ++ ++WAV NTV +L  H+ AR  LA++M+  K +  CI+ IE+ I    I
Subjt:  GGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI

AT1G54680.2 unknown protein1.0e-3341.38Show/hide
Query:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLEEV---------------NAGKVSATMLNRFSCIALAGVATEYLLYGCAE
        V+QHE+GHFL+ YLLG+LP+ Y + +LEA+++  S N+     FV FEFL++V               N G +S+  LN FSC+ L G+ TE++L+G +E
Subjt:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLEEV---------------NAGKVSATMLNRFSCIALAGVATEYLLYGCAE

Query:  GGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI
        G  +DI KL+ +L+ LGFT+ + ++ ++WAV NTV +L  H+ AR  LA++M+  K +  CI+ IE+ I    I
Subjt:  GGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI

AT1G54680.3 unknown protein4.5e-3442.26Show/hide
Query:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLE---------EVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADI
        V+QHE+GHFL+ YLLG+LP+ Y + +LEA+++  S N+     FV FEFL+         ++N G +S+  LN FSC+ L G+ TE++L+G +EG  +DI
Subjt:  VIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLE---------EVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADI

Query:  NKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI
         KL+ +L+ LGFT+ + ++ ++WAV NTV +L  H+ AR  LA++M+  K +  CI+ IE+ I    I
Subjt:  NKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI

AT5G27290.1 unknown protein1.9e-13375.4Show/hide
Query:  CSSEVGLR-RRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGSIERNVQLVAALLGVSAWN
        CSSE GL  RRQ LEQ+D +L+ GD+RAALSL+K+ QGKP GLRCFGAARQ+PQRLYTL+ELKLNGI  +SLLSP D TLGSIERN+Q+ A   G+ AW 
Subjt:  CSSEVGLR-RRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGSIERNVQLVAALLGVSAWN

Query:  VFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFL
         FD+S Q +F+L+LGF+FLWTLD V+FNGG+GSLVLDT GH FSQ+YHNRV+QHEAGHFL+AYL+GILP+ YTLSSLEAL++EGSLN+QAG+ FVD+EFL
Subjt:  VFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFL

Query:  EEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDII
        EEVN+GKVSATMLNRFSCIALAGVATEYLLYG AEGGL DI+KLD L+K LGFTQKK DSQVRW+VLNT+L+LRRHEIAR+KLA +MS G+SVG+CI II
Subjt:  EEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDII

Query:  ENNIDFSDI
        E++ID SDI
Subjt:  ENNIDFSDI

AT5G27290.2 unknown protein6.9e-9975.65Show/hide
Query:  CSSEVGLR-RRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGSIERNVQLVAALLGVSAWN
        CSSE GL  RRQ LEQ+D +L+ GD+RAALSL+K+ QGKP GLRCFGAARQ+PQRLYTL+ELKLNGI  +SLLSP D TLGSIERN+Q+ A   G+ AW 
Subjt:  CSSEVGLR-RRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGSIERNVQLVAALLGVSAWN

Query:  VFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFL
         FD+S Q +F+L+LGF+FLWTLD V+FNGG+GSLVLDT GH FSQ+YHNRV+QHEAGHFL+AYL+GILP+ YTLSSLEAL++EGSLN+QAG+ FVD+EFL
Subjt:  VFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFL

Query:  EEVNAGKVSATMLNRFSCIALAGVATEYLL
        EEVN+GKVSATMLNRFSCIALAGVATEYLL
Subjt:  EEVNAGKVSATMLNRFSCIALAGVATEYLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTATGCAGGTTTATGTTACCGTCCAGCGATCGGAATCTGGAACTGTTCATCGGAGGTCGGTCTCCGGCGGCGGCAAGTGCTTGAACAACTGGACGGAGAGTTAGC
CAAAGGTGATGATCGAGCCGCCCTCTCTCTCCTTAAGGAATCGCAGGGAAAACCTGGTGGCCTTCGATGCTTCGGCGCTGCTCGTCAGATTCCACAAAGACTTTACACAT
TGGATGAGCTAAAGCTGAATGGAATTGAAACTTCATCTCTTTTATCACCATTGGATGCAACCCTTGGTTCCATTGAAAGAAATGTTCAACTTGTTGCTGCTTTGCTTGGT
GTTTCTGCTTGGAATGTGTTTGACATTAGTCCCCAACTCATCTTCTACCTTTCACTCGGGTTCCTGTTTCTTTGGACCTTGGATTCGGTAGCTTTCAATGGAGGAGTTGG
AAGTTTAGTTCTCGATACCATTGGTCACAATTTTAGTCAAAAGTACCACAACAGAGTTATTCAACATGAAGCGGGTCATTTCTTGATTGCGTACTTGCTCGGAATTCTTC
CGAAGAGCTATACGTTGTCGAGTTTGGAAGCGTTGAAGAGGGAAGGATCTCTCAATCTTCAAGCAGGCACAACTTTTGTTGATTTTGAATTCCTTGAAGAAGTCAATGCA
GGAAAAGTTTCAGCAACGATGTTGAACAGATTCTCATGCATAGCACTTGCTGGTGTGGCTACTGAGTATCTTCTTTATGGATGTGCAGAGGGAGGCCTTGCTGATATTAA
CAAGTTGGATTTGTTGCTGAAAGGGTTAGGGTTTACACAGAAGAAGACAGATTCTCAAGTAAGATGGGCAGTTCTAAACACTGTTCTCATATTGCGCCGCCATGAAATTG
CAAGAGCTAAGCTTGCAGATTCTATGTCTTCTGGAAAATCTGTTGGAAATTGTATTGACATTATAGAAAATAATATTGATTTTTCTGATATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTATGCAGGTTTATGTTACCGTCCAGCGATCGGAATCTGGAACTGTTCATCGGAGGTCGGTCTCCGGCGGCGGCAAGTGCTTGAACAACTGGACGGAGAGTTAGC
CAAAGGTGATGATCGAGCCGCCCTCTCTCTCCTTAAGGAATCGCAGGGAAAACCTGGTGGCCTTCGATGCTTCGGCGCTGCTCGTCAGATTCCACAAAGACTTTACACAT
TGGATGAGCTAAAGCTGAATGGAATTGAAACTTCATCTCTTTTATCACCATTGGATGCAACCCTTGGTTCCATTGAAAGAAATGTTCAACTTGTTGCTGCTTTGCTTGGT
GTTTCTGCTTGGAATGTGTTTGACATTAGTCCCCAACTCATCTTCTACCTTTCACTCGGGTTCCTGTTTCTTTGGACCTTGGATTCGGTAGCTTTCAATGGAGGAGTTGG
AAGTTTAGTTCTCGATACCATTGGTCACAATTTTAGTCAAAAGTACCACAACAGAGTTATTCAACATGAAGCGGGTCATTTCTTGATTGCGTACTTGCTCGGAATTCTTC
CGAAGAGCTATACGTTGTCGAGTTTGGAAGCGTTGAAGAGGGAAGGATCTCTCAATCTTCAAGCAGGCACAACTTTTGTTGATTTTGAATTCCTTGAAGAAGTCAATGCA
GGAAAAGTTTCAGCAACGATGTTGAACAGATTCTCATGCATAGCACTTGCTGGTGTGGCTACTGAGTATCTTCTTTATGGATGTGCAGAGGGAGGCCTTGCTGATATTAA
CAAGTTGGATTTGTTGCTGAAAGGGTTAGGGTTTACACAGAAGAAGACAGATTCTCAAGTAAGATGGGCAGTTCTAAACACTGTTCTCATATTGCGCCGCCATGAAATTG
CAAGAGCTAAGCTTGCAGATTCTATGTCTTCTGGAAAATCTGTTGGAAATTGTATTGACATTATAGAAAATAATATTGATTTTTCTGATATCTAA
Protein sequenceShow/hide protein sequence
MQYAGLCYRPAIGIWNCSSEVGLRRRQVLEQLDGELAKGDDRAALSLLKESQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETSSLLSPLDATLGSIERNVQLVAALLG
VSAWNVFDISPQLIFYLSLGFLFLWTLDSVAFNGGVGSLVLDTIGHNFSQKYHNRVIQHEAGHFLIAYLLGILPKSYTLSSLEALKREGSLNLQAGTTFVDFEFLEEVNA
GKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADINKLDLLLKGLGFTQKKTDSQVRWAVLNTVLILRRHEIARAKLADSMSSGKSVGNCIDIIENNIDFSDI