CuGenDBv2

CmoCh12G002440 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh12G002440
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: SAGA-Tad1 domain-containing protein
Genome location: Cmo_Chr12:1600883..1602100
RNA-Seq Expression: CmoCh12G002440
Synteny: CmoCh12G002440
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0000124 - SAGA complex (cellular component)
  GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: IPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1
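
The genome location above uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span length can be computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; hypothetical helper type for this sketch."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("Cmo_Chr12:1600883..1602100")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the span is 1218 bp, which is a multiple of three (406 codons).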


Homology
GenBank top hits
KAG6585423.1 hypothetical protein SDJN03_18156, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.2e-206; % identity: 98.11)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRIDLGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNG FPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPA-GNHVFPGQSNHLSLLRSRLLAP
        SPRKSRSGI DRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL ENDIEASVQQPA GNHVFPGQSNHLSLLRSRLLAP
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPA-GNHVFPGQSNHLSLLRSRLLAP

Query:  LGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQ
        LGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLD YLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQ
Subjt:  LGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQ

Query:  IQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
        IQGRVINGLLPNNQLH RHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
Subjt:  IQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

XP_022951516.1 uncharacterized protein LOC111454310 isoform X1 [Cucurbita moschata] (e value: 7.0e-236; % identity: 100)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG
        SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG

Query:  IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ
        IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ
Subjt:  IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ

Query:  GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ
        GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ
Subjt:  GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ

Query:  LCPSM
        LCPSM
Subjt:  LCPSM

XP_022951518.1 uncharacterized protein LOC111454310 isoform X2 [Cucurbita moschata] (e value: 9.5e-225; % identity: 100)
Query:  MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
        MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Subjt:  MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG

Query:  MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG
        MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG
Subjt:  MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG

Query:  GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV
        GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV
Subjt:  GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV

Query:  NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM
        NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM
Subjt:  NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM

XP_023002392.1 uncharacterized protein LOC111496247 [Cucurbita maxima] (e value: 4.1e-204; % identity: 97.02)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRI LGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNA QAKAAPPIPTSAQSIPIWSNGGFPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL
        SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+HVFPGQSNHLSLLRSRLLAPL
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL

Query:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI
        GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQI
Subjt:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI

Query:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
        QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRAS +
Subjt:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

XP_023537842.1 uncharacterized protein LOC111798750 [Cucurbita pepo subsp. pepo] (e value: 6.2e-208; % identity: 98.1)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRIDLGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIW NGGFPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL
        SPRKSRSGIRDRKLKDRPNGMVECISHQSAGK+DGSCKITMDNDVATLCDYQRPVQHLQGVAEL ENDIEASVQQPAGNH+FPGQSN LSLLRSRLLAPL
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL

Query:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI
        GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLA+KQQI
Subjt:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI

Query:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
        QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
Subjt:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

TrEMBL top hits
A0A6J1GHT2 uncharacterized protein LOC111454310 isoform X1 (e value: 3.4e-236; % identity: 100)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG
        SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG

Query:  IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ
        IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ
Subjt:  IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQ

Query:  GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ
        GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ
Subjt:  GRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQ

Query:  LCPSM
        LCPSM
Subjt:  LCPSM

A0A6J1GHW6 uncharacterized protein LOC111454310 isoform X2 (e value: 4.6e-225; % identity: 100)
Query:  MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
        MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Subjt:  MLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG

Query:  MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG
        MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG
Subjt:  MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG

Query:  GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV
        GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV
Subjt:  GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV

Query:  NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM
        NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM
Subjt:  NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM

A0A6J1HF85 uncharacterized protein LOC111463000 (e value: 1.8e-160; % identity: 73.85)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI-----------------
        M+PQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLI SILKNACQAKAAPPI                 
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI-----------------

Query:  -------------PTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-
                     PTS Q IPIWSN GFP+SPRK RSGIRDRKLKDR     PN  VECIS QSA K+DGSC+I MDN  AT CDYQRPVQHLQGV EL 
Subjt:  -------------PTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-

Query:  ENDIEASVQQPAGNHVF---------PGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSG
        EN+IEA VQ+P+G  V            QSN  SLLRSRLLAPLGIPFCSASIGGA K RPVDCGG+FS SD+G LLDTESL RRMEQIAA QGLGSVS 
Subjt:  ENDIEASVQQPAGNHVF---------PGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSG

Query:  DCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWP
        DCA+ILNKVLDVYLKQLIRSCVDLVG +WPA+EPEKPLA+ QQIQG+VING+LPNNQLH  HSN NGE  ++ RL CSISLLDFK+AMELNPKQLGEDWP
Subjt:  DCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWP

Query:  LLMEKICLRASEE
        LL+EKI +RA  E
Subjt:  LLMEKICLRASEE

A0A6J1K7Q1 uncharacterized protein LOC111492414 (e value: 1.8e-157; % identity: 72.25)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIP----------------
        M+PQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLI SILKNACQAKAAPPIP                
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIP----------------

Query:  --------------TSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-
                      TS Q IPIWSN GF +SPRK RSGIRDRKLKDR     PN  VECIS QSA K+DGSC+I MDN  AT CDYQRPVQHLQGV EL 
Subjt:  --------------TSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-

Query:  ENDIEASVQQPAGNHVF--------------PGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGL
        EN+IEA VQ+P+G  V                 QSN  SLLRSRLLAPLGIPFCSASIGGA K RPVDCGG+FS SD+G LLDTESL RRMEQIAA QGL
Subjt:  ENDIEASVQQPAGNHVF--------------PGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGL

Query:  GSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQL
        GSVS DCA+ILNKVLDVYLKQLIRSCVDLVG  WP +EPEKPLA+ QQIQG+VING+LPNNQLH  HSN N E  ++ RL CSISLLDFK+AMELNPKQL
Subjt:  GSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQL

Query:  GEDWPLLMEKICLRASEE
        GEDWPLL+EKI +RA  E
Subjt:  GEDWPLLMEKICLRASEE

A0A6J1KJD6 uncharacterized protein LOC111496247 (e value: 2.0e-204; % identity: 97.02)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL
        MRPQQSLRI LGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNA QAKAAPPIPTSAQSIPIWSNGGFPL
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPL

Query:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL
        SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+HVFPGQSNHLSLLRSRLLAPL
Subjt:  SPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPL

Query:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI
        GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQI
Subjt:  GIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI

Query:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
        QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRAS +
Subjt:  QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

SwissProt top hits
No hits found
Arabidopsis top hits
AT2G14850.1 unknown protein (e value: 2.7e-36; % identity: 32.97)
Query:  RIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPP-IPTSAQSIPIWSNGGFPLSPRKSR
        R++  E+K+ I + +G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L+ SILKNA  AK+ PP  P  +    ++ +  FP SPRK R
Subjt:  RIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPP-IPTSAQSIPIWSNGGFPLSPRKSR

Query:  SGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSA
        S    RK +DRP+ +      QS         +T  ND +     + P++    V  +E+  E  V+Q  G+   P   +     RS L APLG+ F   
Subjt:  SGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSA

Query:  SIGGARKARPVDCGG--DFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVI
        S     KAR     G    +    G L D  +L  R+E+    +G+  +S D A++LN+ L+ Y+++LI  C+ L                         
Subjt:  SIGGARKARPVDCGG--DFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVI

Query:  NGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
                           A+ + R   ++S+LDF  AME+NP+ LGE+WP+ +EKIC RASEE
Subjt:  NGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

AT2G24530.1 unknown protein (e value: 2.3e-83; % identity: 44.44)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPP------------------
        M+  Q  RI L ELK  IVK  G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI SIL+NA  AK+ PP                  
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPP------------------

Query:  --------IPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRP-----NGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIE
                IP  +Q  P+WSNG  P+SPRK RSG+++RK +DRP     NG VE + HQ   ++D    + M+N      DYQR  +++    E + +  
Subjt:  --------IPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRP-----NGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIE

Query:  ASVQQP--------AGNHVFPGQSN----HLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGD-FSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGD
          V++P        A   +   Q+      ++L  S L+APLGIPFCSAS+GG+ +  PV    +  S  D G L D E L +RME IA  QGL  VS +
Subjt:  ASVQQP--------AGNHVFPGQSN----HLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGD-FSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGD

Query:  CASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPL
        CA  LN +LDVYLK+LI SC DLVG+     +P K    KQQ Q +++NG+ P N L  +  N + +    H    S+S+LDF+ AMELNP+QLGEDWP 
Subjt:  CASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPL

Query:  LMEKICLRASEERN
        L E+I LR+ EE++
Subjt:  LMEKICLRASEERN

AT4G31440.1 unknown protein (e value: 1.6e-68; % identity: 43.08)
Query:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTS---AQSIPIWSNGG
        M+  Q  RIDL ELK  IVK +G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI SIL+NA  AK+ P +  S    +S+ +    G
Subjt:  MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTS---AQSIPIWSNGG

Query:  FPLSPRKSRSGIRDRKLKD--RPNGMVECISHQSAGK---DDGSCKITMDNDVATLCDYQRPVQHLQG-----VAELENDIEASVQQPAGNHVFPGQSNH
            P +SRS   D    D    NG++  +   +       D  C +  +  V     Y RP ++        +   E    +   Q A       ++  
Subjt:  FPLSPRKSRSGIRDRKLKD--RPNGMVECISHQSAGK---DDGSCKITMDNDVATLCDYQRPVQHLQG-----VAELENDIEASVQQPAGNHVFPGQSNH

Query:  LSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPA
          L    ++APLGIPFCSAS+GG R+  PV       S  D G L DTE L +RME IA  QGLG VS +C+ +LN +LD+YLK+L++SCVDL G+    
Subjt:  LSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPA

Query:  YEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEER
          P K    KQQ +  ++NG+  NN  H + SN   + T   R Q S+SLLDF++AMELNP QLGEDWPLL E+I +   EER
Subjt:  YEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEER

AT4G33890.1 unknown protein (e value: 4.2e-37; % identity: 32.29)
Query:  QQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI--------------PTSAQS
        Q S R+D  E+K+ I + +G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI SI+KNAC AK+ P I                ++Q 
Subjt:  QQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI--------------PTSAQS

Query:  IPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLS
         P+  +  F  S RK RS    RKL+DRP+ +       S    +        +    L    RP   +  V E E      V+Q AG     G  +  S
Subjt:  IPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLS

Query:  LLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF---SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSW
          R  L APLG+   S   G  RK  +    C   F   +  + G L DT +L  R+E+    +GL  ++ D  S+LN  LDV++++LI  C+ L  +  
Subjt:  LLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF---SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSW

Query:  PAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
                                       R   +N + T + R    +S+ DF+  MELN + LGEDWP+ MEKIC RAS++
Subjt:  PAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE

AT4G33890.2 unknown protein (e value: 4.2e-37; % identity: 32.29)
Query:  QQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI--------------PTSAQS
        Q S R+D  E+K+ I + +G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI SI+KNAC AK+ P I                ++Q 
Subjt:  QQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPI--------------PTSAQS

Query:  IPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLS
         P+  +  F  S RK RS    RKL+DRP+ +       S    +        +    L    RP   +  V E E      V+Q AG     G  +  S
Subjt:  IPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLS

Query:  LLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF---SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSW
          R  L APLG+   S   G  RK  +    C   F   +  + G L DT +L  R+E+    +GL  ++ D  S+LN  LDV++++LI  C+ L  +  
Subjt:  LLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF---SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSW

Query:  PAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE
                                       R   +N + T + R    +S+ DF+  MELN + LGEDWP+ MEKIC RAS++
Subjt:  PAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE


Sequences
CDS sequence
ATGCGACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGATGCTCGGAACCGATCGATCGAAACGTTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGCTATGTTGTCGTGTTCTTGGGAGGGAAAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTACTTCAGCTCAAAGTATTCCCATTTGGTCTAATGGAGGTTTTCCATTGTCTCCAAGAAAGAGCCGGTCCGGGATTCGT
GACCGTAAACTCAAGGACAGACCGAACGGGATGGTTGAATGCATCTCGCATCAATCAGCAGGCAAGGACGATGGAAGCTGTAAAATCACGATGGATAATGATGTTGCAAC
TCTGTGTGACTATCAGAGACCAGTGCAGCATTTGCAGGGAGTTGCAGAATTGGAAAACGATATCGAGGCTAGTGTTCAGCAACCAGCTGGAAATCACGTCTTCCCGGGAC
AGTCGAATCACTTGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATAGGCGGGGCTCGCAAAGCGAGACCTGTGGATTGTGGG
GGCGATTTTAGCATTAGTGATATTGGTCGTTTGTTGGATACCGAGTCGCTGGGACGACGTATGGAACAAATAGCTGCAGGACAGGGCTTAGGCAGTGTTTCTGGAGATTG
TGCTAGTATTTTGAATAAGGTATTGGATGTATATTTGAAGCAGTTAATTCGTTCTTGTGTTGACTTGGTTGGATCATCATGGCCTGCATATGAGCCTGAGAAACCTCTTG
CCTATAAGCAGCAGATTCAGGGGAGGGTTATCAATGGCCTGTTGCCTAATAATCAATTACATGGACGACATAGCAATGTCAATGGCGAAGCTACGTACAAGCACAGATTA
CAATGCTCGATATCGTTGCTCGACTTCAAACTAGCAATGGAGCTTAACCCGAAACAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTCTGCGTGCATCCGA
AGAAAGAAACGACTCTAATCGATCTATTTATCCCATTCCACAGTTTGATTGTCCCATCACGGTTCAAAGGGATCGAAAAACGATCTCAACAACCTCTCAGCTATGCCCAT
CTATGTGA
mRNA sequence
ATGCGACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGATGCTCGGAACCGATCGATCGAAACGTTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGCTATGTTGTCGTGTTCTTGGGAGGGAAAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTACTTCAGCTCAAAGTATTCCCATTTGGTCTAATGGAGGTTTTCCATTGTCTCCAAGAAAGAGCCGGTCCGGGATTCGT
GACCGTAAACTCAAGGACAGACCGAACGGGATGGTTGAATGCATCTCGCATCAATCAGCAGGCAAGGACGATGGAAGCTGTAAAATCACGATGGATAATGATGTTGCAAC
TCTGTGTGACTATCAGAGACCAGTGCAGCATTTGCAGGGAGTTGCAGAATTGGAAAACGATATCGAGGCTAGTGTTCAGCAACCAGCTGGAAATCACGTCTTCCCGGGAC
AGTCGAATCACTTGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATAGGCGGGGCTCGCAAAGCGAGACCTGTGGATTGTGGG
GGCGATTTTAGCATTAGTGATATTGGTCGTTTGTTGGATACCGAGTCGCTGGGACGACGTATGGAACAAATAGCTGCAGGACAGGGCTTAGGCAGTGTTTCTGGAGATTG
TGCTAGTATTTTGAATAAGGTATTGGATGTATATTTGAAGCAGTTAATTCGTTCTTGTGTTGACTTGGTTGGATCATCATGGCCTGCATATGAGCCTGAGAAACCTCTTG
CCTATAAGCAGCAGATTCAGGGGAGGGTTATCAATGGCCTGTTGCCTAATAATCAATTACATGGACGACATAGCAATGTCAATGGCGAAGCTACGTACAAGCACAGATTA
CAATGCTCGATATCGTTGCTCGACTTCAAACTAGCAATGGAGCTTAACCCGAAACAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTCTGCGTGCATCCGA
AGAAAGAAACGACTCTAATCGATCTATTTATCCCATTCCACAGTTTGATTGTCCCATCACGGTTCAAAGGGATCGAAAAACGATCTCAACAACCTCTCAGCTATGCCCAT
CTATGTGA
Protein sequence
MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIR
DRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCG
GDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRL
QCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEERNDSNRSIYPIPQFDCPITVQRDRKTISTTSQLCPSM