; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000754 (gene) of Chayote v1 genome

Gene IDSed0000754
OrganismSechium edule (Chayote v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationLG03:9769889..9772514
RNA-Seq ExpressionSed0000754
SyntenySed0000754
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-18782.61Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK CCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNEDGGA FPTS Q IP+WS+ GFP+SPRKCRSGI   K KD PS L PN KV+C S QSA KEDGSC+I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV
        ENNIEARVQ+P GKQVL  ++QVEDREEARQSN+SSLLRSRLLAPLG+PFCSASIGGA K RPVDCGG+FSFSD+GHL+DTESLRRRMEQIAAVQGLGSV
Subjt:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV

Query:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW
        SAD ANI+NKVLD+YLKQLIRSCVDLVGA PA+EPEKPLA  QQIQGKVINGMLPNNQL     NG GE +HE RL CSISL DFKVAMELNPKQLGEDW
Subjt:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW

Query:  PLLMEKISMRASEE
        PLL+EKISMRA  E
Subjt:  PLLMEKISMRASEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]1.1e-18381.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDLGELKSQIVK LG DRSKRYFFYLNRFLSQKLSKNEFDKSCCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        ++ DGNEDGGA FPTS Q+IP WS+G   +SPRKCRSGI   K KD PS LGPNGKV+C SH SA         +MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ
        ENNIE RV QP GKQVLHNKIQ     VEDREEA QSN SSLLRSRLLAPLG+PFCSAS GG  K RPVDCGGDFSF D+GHL+DTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ

Query:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ
        GLGSVSAD ANI+NKVLD+YLKQLIRSCVDLVGA PAYEPEKPLA KQQIQGKVINGMLPNNQL GR  NG  E +HEHRLQCSISL DFKVAMELNP Q
Subjt:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ

Query:  LGEDWPLLMEKISMRASEE
        LGEDWPLL+EKI MRA  E
Subjt:  LGEDWPLLMEKISMRASEE

XP_022132327.1 uncharacterized protein LOC111005206 [Momordica charantia]1.4e-18380.48Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDLGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK C R+ GR+NLW+HNQLIQSILKNACQAKAAPP+P AG PKT  QSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNED GA +PTS QSIP+WS+GGFP SPRK RSGI   K KD PSPLGPNGKV+C SHQSAGK+DGSCK+ M NGDA LCDYQRPVQHLQGVAELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQPGKQVLHNKI-----QVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDF-SFSDIGHLMDTESLRRRMEQIAAVQ
        ENNIEAR++  GKQVL+NKI     +V DREEA  S  S LL+SRLLAPLG+PFCSASIGGA KARP D GGDF SFSDIGHL DTESLRRRMEQIAAV 
Subjt:  ENNIEARVQQPGKQVLHNKI-----QVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDF-SFSDIGHLMDTESLRRRMEQIAAVQ

Query:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACP-AYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPK
        GLGSVSADSANI+NKVLD+YLKQLIRSCV LVG CP   EPEKPL  K Q+QGKVINGMLPNNQL GR  NG  E MHEHRL+CS+SL DFKVAMELNPK
Subjt:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACP-AYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPK

Query:  QLGEDWPLLMEKISMRASEE
        QLGEDWPLL+EKI MRA EE
Subjt:  QLGEDWPLLMEKISMRASEE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]3.6e-18782.61Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK CCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNEDGGA FPTS Q IP+WS+ GFP+SPRKCRSGI   K KD PS L PN KV+C S QSA KEDGSC+I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV
        ENNIEARVQ+P GKQVL  ++QVEDREEARQSN+SSLLRSRLLAPLG+PFCSASIGGA K RPVDCGG+FSFSD+GHL+DTESLRRRMEQIAAVQGLGSV
Subjt:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV

Query:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW
        SAD ANI+NKVLD+YLKQLIRSCVDLVGA PA+EPEKPLA  QQIQGKVINGMLPNNQL     NG GE +HE RL CSISL DFKVAMELNPKQLGEDW
Subjt:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW

Query:  PLLMEKISMRASEE
        PLL+EKISMRA  E
Subjt:  PLLMEKISMRASEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]3.0e-18681.53Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK CCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNEDGGA FPTS Q IP+WS+ GFP+SPRKCRSGI   K KD PS L PN KV+C S QSA KEDGSC+I +DNG+A  CDYQRPVQHLQGV ELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQ---VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGL
        ENNIEARVQ+P GKQVL  +++   VEDREEARQSN+SSLLRSRLLAPLG+PFCSASIGGA K RPVDCGG+FSFSD+GHL+DTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQQP-GKQVLHNKIQ---VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGL

Query:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG
        GSVSAD ANI+NKVLD+YLKQLIRSCVDLVGA PA+EPEKPLA  QQIQGKVINGMLPNNQL     NG GE +HE RL CSISL DFKVAMELNPKQLG
Subjt:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG

Query:  EDWPLLMEKISMRASEE
        EDWPLL+EKISMRA  E
Subjt:  EDWPLLMEKISMRASEE

TrEMBL top hitse value%identityAlignment
A0A1S3BCQ5 uncharacterized protein LOC1034882315.2e-18481.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDLGELKSQIVK LG DRSKRYFFYLNRFLSQKLSKNEFDKSCCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        ++ DGNEDGGA FPTS Q+IP WS+G   +SPRKCRSGI   K KD PS LGPNGKV+C SH SA         +MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ
        ENNIE RV QP GKQVLHNKIQ     VEDREEA QSN SSLLRSRLLAPLG+PFCSAS GG  K RPVDCGGDFSF D+GHL+DTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ

Query:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ
        GLGSVSAD ANI+NKVLD+YLKQLIRSCVDLVGA PAYEPEKPLA KQQIQGKVINGMLPNNQL GR  NG  E +HEHRLQCSISL DFKVAMELNP Q
Subjt:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ

Query:  LGEDWPLLMEKISMRASEE
        LGEDWPLL+EKI MRA  E
Subjt:  LGEDWPLLMEKISMRASEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein5.2e-18481.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDLGELKSQIVK LG DRSKRYFFYLNRFLSQKLSKNEFDKSCCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        ++ DGNEDGGA FPTS Q+IP WS+G   +SPRKCRSGI   K KD PS LGPNGKV+C SH SA         +MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ
        ENNIE RV QP GKQVLHNKIQ     VEDREEA QSN SSLLRSRLLAPLG+PFCSAS GG  K RPVDCGGDFSF D+GHL+DTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQP-GKQVLHNKIQ-----VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQ

Query:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ
        GLGSVSAD ANI+NKVLD+YLKQLIRSCVDLVGA PAYEPEKPLA KQQIQGKVINGMLPNNQL GR  NG  E +HEHRLQCSISL DFKVAMELNP Q
Subjt:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ

Query:  LGEDWPLLMEKISMRASEE
        LGEDWPLL+EKI MRA  E
Subjt:  LGEDWPLLMEKISMRASEE

A0A6J1BTJ5 uncharacterized protein LOC1110052066.8e-18480.48Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDLGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK C R+ GR+NLW+HNQLIQSILKNACQAKAAPP+P AG PKT  QSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNED GA +PTS QSIP+WS+GGFP SPRK RSGI   K KD PSPLGPNGKV+C SHQSAGK+DGSCK+ M NGDA LCDYQRPVQHLQGVAELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQPGKQVLHNKI-----QVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDF-SFSDIGHLMDTESLRRRMEQIAAVQ
        ENNIEAR++  GKQVL+NKI     +V DREEA  S  S LL+SRLLAPLG+PFCSASIGGA KARP D GGDF SFSDIGHL DTESLRRRMEQIAAV 
Subjt:  ENNIEARVQQPGKQVLHNKI-----QVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDF-SFSDIGHLMDTESLRRRMEQIAAVQ

Query:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACP-AYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPK
        GLGSVSADSANI+NKVLD+YLKQLIRSCV LVG CP   EPEKPL  K Q+QGKVINGMLPNNQL GR  NG  E MHEHRL+CS+SL DFKVAMELNPK
Subjt:  GLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACP-AYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPK

Query:  QLGEDWPLLMEKISMRASEE
        QLGEDWPLL+EKI MRA EE
Subjt:  QLGEDWPLLMEKISMRASEE

A0A6J1HF85 uncharacterized protein LOC1114630001.7e-18782.61Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK CCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNEDGGA FPTS Q IP+WS+ GFP+SPRKCRSGI   K KD PS L PN KV+C S QSA KEDGSC+I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV
        ENNIEARVQ+P GKQVL  ++QVEDREEARQSN+SSLLRSRLLAPLG+PFCSASIGGA K RPVDCGG+FSFSD+GHL+DTESLRRRMEQIAAVQGLGSV
Subjt:  ENNIEARVQQP-GKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV

Query:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW
        SAD ANI+NKVLD+YLKQLIRSCVDLVGA PA+EPEKPLA  QQIQGKVINGMLPNNQL     NG GE +HE RL CSISL DFKVAMELNPKQLGEDW
Subjt:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDW

Query:  PLLMEKISMRASEE
        PLL+EKISMRA  E
Subjt:  PLLMEKISMRASEE

A0A6J1K7Q1 uncharacterized protein LOC1114924141.5e-18380.82Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQPQQSLRIDL ELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDK CCR+ GRENLW+HNQLIQSILKNACQAKAAPP+P AG PKT  Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
        VI DGNEDGGA F TS Q IP+WS+ GF MSPRKCRSGI   K KD PS L PN KV+C S QSA KEDGSC+I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VI-DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQP-GKQVLHNKIQ---VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGL
        ENNIEARVQ+P GKQVL  +++   VEDREEARQSN+SSLLRSRLLAPLG+PFCSASIGGA K RPVDCGG+FSFSD+GHL+DTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQQP-GKQVLHNKIQ---VEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGL

Query:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG
        GSVSAD ANI+NKVLD+YLKQLIRSCVDLVG  P +EPEKPLA  QQIQGKVINGMLPNNQL     NG  E +HE RL CSISL DFKVAMELNPKQLG
Subjt:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG

Query:  EDWPLLMEKISMRASEE
        EDWPLL+EKISMRA  E
Subjt:  EDWPLLMEKISMRASEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.0e-3530.71Show/hide
Query:  RIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVIDGNED
        R++  E+K+ I + +G  R+  YF  L +FL+ ++SK+EFDK C +  GREN+ +HN+L++SILKNA  AK+ PP                         
Subjt:  RIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVIDGNED

Query:  GGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEARVQQ
            +P  +    ++    FP SPRKCRS   KF+D PSPLGP GK    +                  D ++   QR          LP          
Subjt:  GGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEARVQQ

Query:  PGKQVLHNKIQVEDREEARQSNQSSLLRSR--LLAPLGVPFCSASIGGACKARPVDCGG--DFSFSDIGHLMDTESLRRRMEQIAAVQGLGSVSADSANI
                 + VED EE  Q   S  ++SR  L APLGV F   S     KAR     G    +    G L D  +LR R+E+   ++G+  +S DSAN+
Subjt:  PGKQVLHNKIQVEDREEARQSNQSSLLRSR--LLAPLGVPFCSASIGGACKARPVDCGG--DFSFSDIGHLMDTESLRRRMEQIAAVQGLGSVSADSANI

Query:  MNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDWPLLMEKI
        +N+ L+ Y+++LI  C+ L                                           A  + R   ++S+ DF  AME+NP+ LGE+WP+ +EKI
Subjt:  MNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDWPLLMEKI

Query:  SMRASEE
          RASEE
Subjt:  SMRASEE

AT2G24530.1 unknown protein1.6e-8946.78Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP
        MQ  Q  RI L ELK  IVK  G +RS+RYF+YL RFLSQKL+K+EFDK+C R+ GRENL +HNQLI+SIL+NA  AK+ PP  +AG       S K + 
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISP

Query:  VI---DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAE
             DG E  G   P  +Q  PVWS+G  P+SPRK RSG+   K +D PSPLG NGKV+   HQ   +ED    + M+NG     DYQR  ++   VA+
Subjt:  VI---DGNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGI--HKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAE

Query:  LPENNIEARVQQPGKQVLHNKIQVEDREEARQSNQS--SLLRSRLLAPLGVPFCSASIGGACKARPVDCGGD-FSFSDIGHLMDTESLRRRMEQIAAVQG
          +      V++P          V  R++  Q  Q+  +L  S L+APLG+PFCSAS+GG+ +  PV    +  S  D G L D E LR+RME IA  QG
Subjt:  LPENNIEARVQQPGKQVLHNKIQVEDREEARQSNQS--SLLRSRLLAPLGVPFCSASIGGACKARPVDCGGD-FSFSDIGHLMDTESLRRRMEQIAAVQG

Query:  LGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAY-EPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ
        L  VS + A  +N +LD+YLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L+ +  NG  +   +H    S+S+ DF+ AMELNP+Q
Subjt:  LGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAY-EPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQ

Query:  LGEDWPLLMEKISMRASEE
        LGEDWP L E+IS+R+ EE
Subjt:  LGEDWPLLMEKISMRASEE

AT4G31440.1 unknown protein1.8e-6741.69Show/hide
Query:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCP-KTLIQSAKIS
        MQ  Q  RIDL ELK  IVK +G +RS RYF+YL RFLSQKL+K+EFDKSC R+ GRENL +HN+LI+SIL+NA  AK+ P V ++G P K+L+   +  
Subjt:  MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCP-KTLIQSAKIS

Query:  PVIDGNEDGGAAFPTSNQSIPVWSSGGF-PMSPRKCRSGIHKFKDGPSPLGPNGKV-DCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP
           DG E+  +  P   ++    S+G    + P  C       +D P PLG NGKV   F++   G+         +   A LC  ++            
Subjt:  PVIDGNEDGGAAFPTSNQSIPVWSSGGF-PMSPRKCRSGIHKFKDGPSPLGPNGKV-DCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNIEARVQQPGKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD-CGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV
                   GK  +   I  +D  + R      L    ++APLG+PFCSAS+GG  +  PV       S  D G L DTE LR+RME IA  QGLG V
Subjt:  ENNIEARVQQPGKQVLHNKIQVEDREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD-CGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSV

Query:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAY-EPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGED
        SA+ + ++N +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN    +  N   +   E   Q S+SL DF+VAMELNP QLGED
Subjt:  SADSANIMNKVLDIYLKQLIRSCVDLVGACPAY-EPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGED

Query:  WPLLMEKISMRASEE
        WPLL E+IS+   EE
Subjt:  WPLLMEKISMRASEE

AT4G33890.1 unknown protein1.9e-3730.46Show/hide
Query:  QQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVID
        Q S R+D  E+K+ I + +G  R++ YF  L RF + K++K+EFDK C +  GR+N+ +HN+LI+SI+KNAC AK+ P + K G           S V  
Subjt:  QQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVID

Query:  GNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEA
        GN D       ++Q  P+     F  S RKCRS   K +D PSPLGP GK     H      + S   +    +      + PV+               
Subjt:  GNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEA

Query:  RVQQPGKQVLHNKIQVEDREEARQ---SNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD----CGGDF---SFSDIGHLMDTESLRRRMEQIAAVQGL
                     + VE+ EE  Q    + S   R  L APLGV   S S+      + V     C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  RVQQPGKQVLHNKIQVEDREEARQ---SNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD----CGGDF---SFSDIGHLMDTESLRRRMEQIAAVQGL

Query:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG
          ++ DS +++N  LD+++++LI  C+ L       +                           R R    +   + R    +S+ DF+  MELN + LG
Subjt:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG

Query:  EDWPLLMEKISMRASEE
        EDWP+ MEKI  RAS++
Subjt:  EDWPLLMEKISMRASEE

AT4G33890.2 unknown protein1.9e-3730.46Show/hide
Query:  QQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVID
        Q S R+D  E+K+ I + +G  R++ YF  L RF + K++K+EFDK C +  GR+N+ +HN+LI+SI+KNAC AK+ P + K G           S V  
Subjt:  QQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVID

Query:  GNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEA
        GN D       ++Q  P+     F  S RKCRS   K +D PSPLGP GK     H      + S   +    +      + PV+               
Subjt:  GNEDGGAAFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEA

Query:  RVQQPGKQVLHNKIQVEDREEARQ---SNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD----CGGDF---SFSDIGHLMDTESLRRRMEQIAAVQGL
                     + VE+ EE  Q    + S   R  L APLGV   S S+      + V     C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  RVQQPGKQVLHNKIQVEDREEARQ---SNQSSLLRSRLLAPLGVPFCSASIGGACKARPVD----CGGDF---SFSDIGHLMDTESLRRRMEQIAAVQGL

Query:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG
          ++ DS +++N  LD+++++LI  C+ L       +                           R R    +   + R    +S+ DF+  MELN + LG
Subjt:  GSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYEPEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLG

Query:  EDWPLLMEKISMRASEE
        EDWP+ MEKI  RAS++
Subjt:  EDWPLLMEKISMRASEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGACGCTCGGAACCGACCGGTCGAAACGATACTTCTTTTACTTGAATAGGTT
CTTGAGCCAAAAGCTGAGTAAGAATGAGTTTGATAAGTCATGTTGCCGTATATTTGGGAGGGAGAATCTTTGGATGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAGTACCTAAGGCAGGCTGTCCGAAAACTTTGATTCAATCTGCAAAAATTTCCCCTGTTATAGACGGGAATGAGGATGGTGGAGCT
GCTTTTCCTACTTCAAATCAAAGTATTCCTGTTTGGTCCAGTGGAGGTTTTCCAATGTCTCCGAGAAAGTGCAGGTCCGGGATACACAAGTTCAAGGACGGCCCCAGTCC
ACTCGGGCCGAATGGGAAGGTTGATTGTTTCTCACATCAATCAGCAGGCAAGGAAGATGGCAGTTGCAAAATCTCCATGGATAATGGTGATGCAGCTTTGTGTGACTATC
AAAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGCTAGAGTTCAGCAACCTGGAAAGCAAGTCCTACACAATAAGATCCAGGTTGAA
GATAGAGAAGAAGCTCGACAGTCAAATCAGTCAAGTTTACTTCGGAGTCGATTACTAGCACCTCTCGGGGTTCCTTTTTGCTCAGCTAGTATTGGTGGGGCCTGCAAAGC
GAGGCCTGTGGATTGTGGGGGTGACTTTAGCTTTAGTGACATTGGTCATTTGATGGACACCGAGTCATTGAGACGACGCATGGAACAAATTGCTGCAGTACAGGGCCTAG
GCAGTGTTTCTGCAGACTCTGCTAATATTATGAATAAGGTGTTGGATATATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGCCCTGCATATGAG
CCTGAGAAACCTCTTGCCCTTAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCCAATAATCAATTACGTGGACGACAAAGAAATGGAAAAGGAGAAGCTAT
GCACGAGCACAGATTACAGTGCTCGATATCATTGCGCGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTT
CTATGCGTGCATCTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
GACAGAGAAAATCGAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCAAGCCATTTCAAGCTTCAATTCATCTTCAATTCGCCAAGTTCTTCAAAGCTTGTGAATTC
TGCTGAAAGAATGATGGGTTCGTATTGGTTTGTCGACCCAGTACGCGTTTTTGTTTGTTGTTGATTTTTTTTGTTCTGAACTTGGGTTTGTTTGTTTTAAGTTAGGGTTT
TGATTTGGGATTTATTTGCAATAGGTTTGGTATGGTTTTGAATTTTGGCGGGATTTAGTGAGCTATGTAGTGGGTTTCACTTCTAGGGTTGTGGTTTTGTGAAGTTGCGA
TTTGGAGTTGCATTTTGTGGTTTGTAGGGTAAAAGGAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGTTGGTATCCATTTTGGGGTTTCCTTGATCTGGTGGAGTG
AATTTTAGGAACTTGATCAAATGATGCTGACCTTGAAAATTAATCTGCATTGATTGATATATGTGGTGGTGAGTATTTTGTTTTTTTTTTGTTTGGTGGACTTGAATAGT
GATTTTGTTCCTCTTACAGTCTGGAATTGAGTAATTTTGTGAAGGGTTTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATT
GAAATCTCAGATAGTGAAGACGCTCGGAACCGACCGGTCGAAACGATACTTCTTTTACTTGAATAGGTTCTTGAGCCAAAAGCTGAGTAAGAATGAGTTTGATAAGTCAT
GTTGCCGTATATTTGGGAGGGAGAATCTTTGGATGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCACCAGTACCTAAGGCAGGC
TGTCCGAAAACTTTGATTCAATCTGCAAAAATTTCCCCTGTTATAGACGGGAATGAGGATGGTGGAGCTGCTTTTCCTACTTCAAATCAAAGTATTCCTGTTTGGTCCAG
TGGAGGTTTTCCAATGTCTCCGAGAAAGTGCAGGTCCGGGATACACAAGTTCAAGGACGGCCCCAGTCCACTCGGGCCGAATGGGAAGGTTGATTGTTTCTCACATCAAT
CAGCAGGCAAGGAAGATGGCAGTTGCAAAATCTCCATGGATAATGGTGATGCAGCTTTGTGTGACTATCAAAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCT
GAAAACAATATTGAGGCTAGAGTTCAGCAACCTGGAAAGCAAGTCCTACACAATAAGATCCAGGTTGAAGATAGAGAAGAAGCTCGACAGTCAAATCAGTCAAGTTTACT
TCGGAGTCGATTACTAGCACCTCTCGGGGTTCCTTTTTGCTCAGCTAGTATTGGTGGGGCCTGCAAAGCGAGGCCTGTGGATTGTGGGGGTGACTTTAGCTTTAGTGACA
TTGGTCATTTGATGGACACCGAGTCATTGAGACGACGCATGGAACAAATTGCTGCAGTACAGGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTATGAATAAGGTG
TTGGATATATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGCCCTGCATATGAGCCTGAGAAACCTCTTGCCCTTAAGCAGCAGATTCAGGGGAA
GGTTATCAATGGCATGTTGCCCAATAATCAATTACGTGGACGACAAAGAAATGGAAAAGGAGAAGCTATGCACGAGCACAGATTACAGTGCTCGATATCATTGCGCGACT
TCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTCTATGCGTGCATCTGAGGAATAATACAACTTTGATATATCT
ATTTATCCCATTCCACAGTTCGATTACCCTGTTATGGTTCATAGGCGTCGAGAATGGATCTCAAAGACTTCTCTGCTATGTTCATTTGTGTGAGTAAACAAGATTGCTCA
GAACTTGATAAAACCTCAGTCTTGCAAAGATGACAAGCACGTCAAACATCAAAATTGACGTTCATTCTCGGGAATCATTTAACCGTGACTAAGCTCGGGCGCTGCCAGTC
AGAATGCCCACAGTGATTGATCTATGTTGAGTTAAGGATCTCAAGCTTTTCGTGTAAATTTAGATTACTGAGGAAATATACAAAATTTTGCAGAGATATAGTTTGTGTAG
CTTGGTGAGAGGTAATTACTGGTGCAAAAGCTTAGTGTTGCTCTTTGTATTTAAATTTGAGAATTGATTTGAATTTCTTATGAATAACTCATCAACAGATAACAATCTGC
AGATAGTATCCTCTGTATTTACTGCTTACATTTTTTGTTTTTGGGTTAGACAAGACGATGCAGATGCTATAAAATGATATATCATTTCTATTCAGAAAGATCTGTGTTCA
AAATTTTACACATTTATAATGCAATAGTCAGC
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKTLGTDRSKRYFFYLNRFLSQKLSKNEFDKSCCRIFGRENLWMHNQLIQSILKNACQAKAAPPVPKAGCPKTLIQSAKISPVIDGNEDGGA
AFPTSNQSIPVWSSGGFPMSPRKCRSGIHKFKDGPSPLGPNGKVDCFSHQSAGKEDGSCKISMDNGDAALCDYQRPVQHLQGVAELPENNIEARVQQPGKQVLHNKIQVE
DREEARQSNQSSLLRSRLLAPLGVPFCSASIGGACKARPVDCGGDFSFSDIGHLMDTESLRRRMEQIAAVQGLGSVSADSANIMNKVLDIYLKQLIRSCVDLVGACPAYE
PEKPLALKQQIQGKVINGMLPNNQLRGRQRNGKGEAMHEHRLQCSISLRDFKVAMELNPKQLGEDWPLLMEKISMRASEE