; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08314 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08314
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCarg_Chr05:3340327..3341565
RNA-Seq ExpressionCarg08314
SyntenyCarg08314
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-238100Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
        ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
Subjt:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA

Query:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
        DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
Subjt:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL

Query:  LLEKISMRAFTE
        LLEKISMRAFTE
Subjt:  LLEKISMRAFTE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]9.4e-19684.96Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +P+GKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFTE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKISMRAFTE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]4.5e-23899.76Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
        ENNIEARVQRP+GKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
Subjt:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA

Query:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
        DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
Subjt:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL

Query:  LLEKISMRAFTE
        LLEKISMRAFTE
Subjt:  LLEKISMRAFTE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]4.0e-23196.88Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVF TSTQGIPIWSNEGF +SPRKCRSGIRDRKLKDRPSLLAPNLKVECIS QSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQRP+GKQVLQMQ     VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG
        GSVSADCANILNKVLDVYLKQLIRSCVDLVG WP FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGN EVVHERRLHCSISLLDFKVAMELNPKQLG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFTE
        EDWPLLLEKISMRAFTE
Subjt:  EDWPLLLEKISMRAFTE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]1.6e-23598.08Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIM+DNGNATSCDYQRPVQHLQGV+ELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQRP+GKQVLQMQ     VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG
        GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFTE
        EDWPLLLEKISMRAFTE
Subjt:  EDWPLLLEKISMRAFTE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein1.0e-19283.53Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +P+GKQ LQ        +VEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFTE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKISMRAFTE

A0A1S3BCQ5 uncharacterized protein LOC1034882314.5e-19684.96Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +P+GKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFTE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKISMRAFTE

A0A5A7VF96 SAGA-Tad1 domain-containing protein4.5e-19684.96Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +P+GKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFTE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKISMRAFTE

A0A6J1HF85 uncharacterized protein LOC1114630002.2e-23899.76Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
        ENNIEARVQRP+GKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA
Subjt:  ENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA

Query:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
        DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL
Subjt:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPL

Query:  LLEKISMRAFTE
        LLEKISMRAFTE
Subjt:  LLEKISMRAFTE

A0A6J1K7Q1 uncharacterized protein LOC1114924142.0e-23196.88Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
        VIEDGNEDGGAVF TSTQGIPIWSNEGF +SPRKCRSGIRDRKLKDRPSLLAPNLKVECIS QSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELP

Query:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQRP+GKQVLQMQ     VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQRPAGKQVLQMQ-----VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG
        GSVSADCANILNKVLDVYLKQLIRSCVDLVG WP FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGN EVVHERRLHCSISLLDFKVAMELNPKQLG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFTE
        EDWPLLLEKISMRAFTE
Subjt:  EDWPLLLEKISMRAFTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein3.3e-3731.05Show/hide
Query:  RIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNE

Query:  DGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENNIEAR
                      ++ +  FP SPRKCRS    RK +DRPS L P  K     PQS         +   N  + S   + P++                
Subjt:  DGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENNIEAR

Query:  VQRPAGKQVLQMQVEDREEARQSNRSSLLRSR--LLAPLGIPFCSASIGGAHKTRPVDCGG--NFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSADCA
                   + VED EE  Q   S  ++SR  L APLG+ F   S     K R     G    +    G L D  +LR R+E+   ++G+  +S D A
Subjt:  VQRPAGKQVLQMQVEDREEARQSNRSSLLRSR--LLAPLGIPFCSASIGGAHKTRPVDCGG--NFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSADCA

Query:  NILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLE
        N+LN+ L+ Y+++LI  C+ L                                              ++R   ++S+LDF  AME+NP+ LGE+WP+ LE
Subjt:  NILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLE

Query:  KISMRAFTE
        KI  RA  E
Subjt:  KISMRAFTE

AT2G24530.1 unknown protein6.2e-9748.8Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQ  Q  RI LCELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+   ST+A     
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHL----QGV
           DG E  G + P  +Q  P+WSN   P+SPRK RSG+++RK +DRPS L  N KVE +  Q  C+ED    + M+NG     DYQR  +++     G 
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHL----QGV

Query:  FELPENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGN-FSFSDMGHLLDTESLRRRMEQIAAVQGL
        F  P      + + P  +++  + + D +   +  R +L  S L+APLGIPFCSAS+GG+ +T PV       S  D G L D E LR+RME IA  QGL
Subjt:  FELPENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGN-FSFSDMGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQL
          VS +CA  LN +LDVYLK+LI SC DLVGA     +P K     QQ Q K++NG+ P N L     NG+ ++   R+ H S+S+LDF+ AMELNP+QL
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFTE
        GEDWP L E+IS+R+F E
Subjt:  GEDWPLLLEKISMRAFTE

AT4G31440.1 unknown protein1.7e-7343.3Show/hide
Query:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISP

Query:  VIEDGNEDGGAVFPTSTQGIPIWSNEGF-PVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQH---LQGV
          EDG E+  ++ P   +     SN     V P  C     DR ++D+P  L  N KV  + P +                     Y RP ++       
Subjt:  VIEDGNEDGGAVFPTSTQGIPIWSNEGF-PVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQH---LQGV

Query:  FELPENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD-CGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL
        F  P     A  +  +GK  +   +   +EA+    S+     ++APLGIPFCSAS+GG  +T PV       S  D G L DTE LR+RME IA  QGL
Subjt:  FELPENNIEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD-CGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQL
        G VSA+C+ +LN +LD+YLK+L++SCVDL GA      P K     QQ + +++NG+  NN  H   SN   ++  E+    S+SLLDF+VAMELNP QL
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFTE
        GEDWPLL E+IS+  F E
Subjt:  GEDWPLLLEKISMRAFTE

AT4G33890.1 unknown protein1.2e-3629.78Show/hide
Query:  QQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIE

Query:  DGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENN
         GN D       ++Q  P+  +  F  S RKCRS    RKL+DRPS L P                      +   ++ +   +  +   Q   EL    
Subjt:  DGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENN

Query:  IEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD----CGGNF---SFSDMGHLLDTESLRRRMEQIAAVQGLG
              RP  + V   + E+ E+    + S   R  L APLG+   S S+      + V     C  +F   +  + G L DT +LR R+E+   ++GL 
Subjt:  IEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD----CGGNF---SFSDMGHLLDTESLRRRMEQIAAVQGLG

Query:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGE
         ++ D  ++LN  LDV++++LI  C+ L                        N     +++  +    N +   + R    +S+ DF+  MELN + LGE
Subjt:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGE

Query:  DWPLLLEKISMRA
        DWP+ +EKI  RA
Subjt:  DWPLLLEKISMRA

AT4G33890.2 unknown protein1.2e-3629.78Show/hide
Query:  QQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIE

Query:  DGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENN
         GN D       ++Q  P+  +  F  S RKCRS    RKL+DRPS L P                      +   ++ +   +  +   Q   EL    
Subjt:  DGNEDGGAVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENN

Query:  IEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD----CGGNF---SFSDMGHLLDTESLRRRMEQIAAVQGLG
              RP  + V   + E+ E+    + S   R  L APLG+   S S+      + V     C  +F   +  + G L DT +LR R+E+   ++GL 
Subjt:  IEARVQRPAGKQVLQMQVEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVD----CGGNF---SFSDMGHLLDTESLRRRMEQIAAVQGLG

Query:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGE
         ++ D  ++LN  LDV++++LI  C+ L                        N     +++  +    N +   + R    +S+ DF+  MELN + LGE
Subjt:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGE

Query:  DWPLLLEKISMRA
        DWP+ +EKI  RA
Subjt:  DWPLLLEKISMRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGTGTGAATTGAAATCTCAAATAGTGAAGAAGCTCGGAACTGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCGATACCTGCAGCAGGCTATCCGAAAACTTCGACGCAAGCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGA
GCTGTTTTTCCTACTTCCACTCAAGGTATTCCCATTTGGTCCAATGAAGGTTTTCCTGTGTCCCCAAGAAAGTGCAGGTCTGGGATACGCGACCGCAAACTCAAAGATAG
GCCGAGTCTACTTGCGCCAAACTTGAAGGTTGAATGTATCTCACCTCAATCAGCATGCAAGGAAGATGGCAGCTGTAGGATCATGATGGATAATGGCAATGCAACTTCGT
GTGACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTTTTGAACTACCTGAAAACAACATTGAGGCTAGAGTTCAGCGACCAGCAGGGAAGCAAGTTCTACAGATGCAG
GTTGAAGACAGGGAAGAAGCGAGACAGTCAAACCGATCGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCTCA
CAAAACAAGGCCTGTGGATTGTGGGGGCAATTTTAGCTTCAGTGATATGGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCAGTACAGG
GCCTAGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAACAAGGTGTTGGATGTATATTTGAAGCAGTTAATCAGGTCTTGCGTTGACTTGGTTGGAGCATGGCCTGCA
TTTGAGCCTGAGAAGCCTCTTGCCCATAACCAGCAAATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAATTACACAGGCTACATAGCAATGGAAATGGAGA
AGTTGTGCACGAGCGCAGATTACACTGCTCGATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTCTGCTACTGGAGA
AAATTAGTATGCGTGCATTCACGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGTGTGAATTGAAATCTCAAATAGTGAAGAAGCTCGGAACTGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCGATACCTGCAGCAGGCTATCCGAAAACTTCGACGCAAGCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGA
GCTGTTTTTCCTACTTCCACTCAAGGTATTCCCATTTGGTCCAATGAAGGTTTTCCTGTGTCCCCAAGAAAGTGCAGGTCTGGGATACGCGACCGCAAACTCAAAGATAG
GCCGAGTCTACTTGCGCCAAACTTGAAGGTTGAATGTATCTCACCTCAATCAGCATGCAAGGAAGATGGCAGCTGTAGGATCATGATGGATAATGGCAATGCAACTTCGT
GTGACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTTTTGAACTACCTGAAAACAACATTGAGGCTAGAGTTCAGCGACCAGCAGGGAAGCAAGTTCTACAGATGCAG
GTTGAAGACAGGGAAGAAGCGAGACAGTCAAACCGATCGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCTCA
CAAAACAAGGCCTGTGGATTGTGGGGGCAATTTTAGCTTCAGTGATATGGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCAGTACAGG
GCCTAGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAACAAGGTGTTGGATGTATATTTGAAGCAGTTAATCAGGTCTTGCGTTGACTTGGTTGGAGCATGGCCTGCA
TTTGAGCCTGAGAAGCCTCTTGCCCATAACCAGCAAATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAATTACACAGGCTACATAGCAATGGAAATGGAGA
AGTTGTGCACGAGCGCAGATTACACTGCTCGATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTCTGCTACTGGAGA
AAATTAGTATGCGTGCATTCACGGAATGA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGG
AVFPTSTQGIPIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGNATSCDYQRPVQHLQGVFELPENNIEARVQRPAGKQVLQMQ
VEDREEARQSNRSSLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA
FEPEKPLAHNQQIQGKVINGMLPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE