; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019604 (gene) of Snake gourd v1 genome

Gene IDTan0019604
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationLG02:94755700..94758503
RNA-Seq ExpressionTan0019604
SyntenyTan0019604
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-19885.2Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        VIEDGNEDGGA FPT TQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+EARVQ+PAGKQVLQ        +VEDREE  QSN+S LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKI M A  E
Subjt:  LGEDWPLQMEKICMCASEE

XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus]2.9e-20085.99Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPI VAGYPKTS QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        ++EDGNEDGGA FPT TQNIP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGDA LCDY+RPVQ+LQG+AELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+E RV QP+GKQ LQNKIQVE TKVEDREE GQSN S LLRSRLLAPLGIPFCSASIGGARK RPVD GGDFS SD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGR+SNG+ E +HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICM
        LGEDWPL +EKICM
Subjt:  LGEDWPLQMEKICM

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]3.1e-20286.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        ++EDGNEDGGA FPT TQNIP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKICM A  E
Subjt:  LGEDWPLQMEKICMCASEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]1.6e-19885.44Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        VIEDGNEDGGA F T TQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  +SNGN E +HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKI M A  E
Subjt:  LGEDWPLQMEKICMCASEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]1.5e-20186.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        VIEDGNEDGGA FPT TQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I +DNG+A  CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKI M A  E
Subjt:  LGEDWPLQMEKICMCASEE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein1.4e-20085.99Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPI VAGYPKTS QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        ++EDGNEDGGA FPT TQNIP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGDA LCDY+RPVQ+LQG+AELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+E RV QP+GKQ LQNKIQVE TKVEDREE GQSN S LLRSRLLAPLGIPFCSASIGGARK RPVD GGDFS SD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGR+SNG+ E +HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICM
        LGEDWPL +EKICM
Subjt:  LGEDWPLQMEKICM

A0A1S3BCQ5 uncharacterized protein LOC1034882311.5e-20286.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        ++EDGNEDGGA FPT TQNIP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKICM A  E
Subjt:  LGEDWPLQMEKICMCASEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein1.5e-20286.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        ++EDGNEDGGA FPT TQNIP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGDA LCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKICM A  E
Subjt:  LGEDWPLQMEKICMCASEE

A0A6J1HF85 uncharacterized protein LOC1114630003.8e-19884.96Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        VIEDGNEDGGA FPT TQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+EARVQ+P+GKQVLQ        +VEDREE  QSN+S LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKI M A  E
Subjt:  LGEDWPLQMEKICMCASEE

A0A6J1K7Q1 uncharacterized protein LOC1114924147.6e-19985.44Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP
        VIEDGNEDGGA F T TQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+A  CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELP

Query:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ
        ENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Subjt:  ENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ

Query:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  +SNGN E +HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLQMEKICMCASEE
        LGEDWPL +EKI M A  E
Subjt:  LGEDWPLQMEKICMCASEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein2.1e-3931.25Show/hide
Query:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNE

Query:  DGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENNVEAR
                      ++ +  FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   QR                   
Subjt:  DGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENNVEAR

Query:  VQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSR--LLAPLGIPFCSASIGGARKAR--PVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQGLG
                     + +E   VED EE  Q   S  ++SR  L APLG+ F   S     KAR    +G    +    G L D  +LR R+E+   ++G+ 
Subjt:  VQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSR--LLAPLGIPFCSASIGGARKAR--PVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQGLG

Query:  SVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGE
         +S D A++LN+ L+ Y+++LI  C+ L                                           A  + R   ++S+LDF  AME+NP+ LGE
Subjt:  SVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGE

Query:  DWPLQMEKICMCASEE
        +WP+Q+EKIC  ASEE
Subjt:  DWPLQMEKICMCASEE

AT2G24530.1 unknown protein7.0e-9647.63Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGY-PKTSAQSAKIS
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLS+KL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+  K +A  ++  
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGY-PKTSAQSAKIS

Query:  PVIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAEL
            DG E  G   P  +Q+ P+WSNG  P+SPRK RSG+++RK +DRPSPLG NGKVE + HQ V +ED    + M+NG     DYQR  +++      
Subjt:  PVIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAEL

Query:  PENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGD-FSFSDIGHLLDTESLRRRMEQIAT
          +  +    +P  K  + NK ++    + D +   +  +  L  S L+APLGIPFCSAS+GG+ +  PV    +  S  D G L D E LR+RME IA 
Subjt:  PENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGD-FSFSDIGHLLDTESLRRRMEQIAT

Query:  VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELN
         QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ +   +H    S+S+LDF+ AMELN
Subjt:  VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELN

Query:  PKQLGEDWPLQMEKICMCASEE
        P+QLGEDWP   E+I + + EE
Subjt:  PKQLGEDWPLQMEKICMCASEE

AT4G31440.1 unknown protein1.5e-7444.31Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLS+KL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISP

Query:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAEL
          EDG E+  +  P   +N    SNG       K R G   DR ++D+P PLG NGKV                     G  A   Y RP ++       
Subjt:  VIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAEL

Query:  PENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVD-GGGDFSFSDIGHLLDTESLRRRMEQIAT
        P+   ++    PA ++ +  K QV      D E    +    L    ++APLGIPFCSAS+GG R+  PV       S  D G L DTE LR+RME IA 
Subjt:  PENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVD-GGGDFSFSDIGHLLDTESLRRRMEQIAT

Query:  VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELN
         QGLG VSA+C+ +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   +   E   Q S+SLLDF+VAMELN
Subjt:  VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELN

Query:  PKQLGEDWPLQMEKICMCASEE
        P QLGEDWPL  E+I +   EE
Subjt:  PKQLGEDWPLQMEKICMCASEE

AT4G33890.1 unknown protein1.2e-3931.84Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIE

Query:  DGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENN
         GN D        +Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK               + +T  N        +  +   Q   EL    
Subjt:  DGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENN

Query:  VEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ
                     L ++  VE   VE+ EE  Q    + S   R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+
Subjt:  VEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ

Query:  IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAME
           ++GL  ++ D  S+LN  LDV++++LI  C+ L       +                           R    N +   + R    +S+ DF+  ME
Subjt:  IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAME

Query:  LNPKQLGEDWPLQMEKICMCASEE
        LN + LGEDWP+ MEKIC  AS++
Subjt:  LNPKQLGEDWPLQMEKICMCASEE

AT4G33890.2 unknown protein1.2e-3931.84Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIE

Query:  DGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENN
         GN D        +Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK               + +T  N        +  +   Q   EL    
Subjt:  DGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENN

Query:  VEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ
                     L ++  VE   VE+ EE  Q    + S   R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+
Subjt:  VEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ

Query:  IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAME
           ++GL  ++ D  S+LN  LDV++++LI  C+ L       +                           R    N +   + R    +S+ DF+  ME
Subjt:  IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAME

Query:  LNPKQLGEDWPLQMEKICMCASEE
        LN + LGEDWP+ MEKIC  AS++
Subjt:  LNPKQLGEDWPLQMEKICMCASEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGATTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCGAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATAGCTGTAGCAGGCTATCCGAAAACTTCGGCGCAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGTGGA
GCTGCTTTTCCTACTCCCACTCAAAATATTCCTATATGGTCTAATGGAGGTTTTCCAATGTCTCCGAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAGGACAG
GCCGAGTCCACTCGGACCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGTAGGCAAGGAAGATGGCAGCTACAAAATCACGATGGACAATGGCGATGCAGCTCTTT
GTGACTATCAAAGGCCAGTGCAGCATCTGCAAGGAGTTGCTGAACTGCCTGAAAACAATGTTGAGGCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTCCTACAGAATAAG
ATCCAGGTTGAGGGAACCAAGGTTGAAGACAGGGAAGAACCGGGACAGTCAAATCAGTCGTGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTC
AGCTAGTATCGGTGGGGCCCGCAAAGCGAGGCCTGTGGATGGTGGGGGTGATTTTAGCTTTAGTGATATTGGTCATTTGTTGGATACTGAGTCATTGAGACGACGCATGG
AACAAATTGCTACAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGAC
TTGGTTGGAGCATGGCCTGCATATGAGCCCGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAAGTTATCAATGGCATGTTGCCAAATAATCAATTACATGGGCG
ATATAGCAATGGAAATGGAGAAGCTATGCACGAACACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCTATGGAGCTTAACCCAAAGCAACTTGGGGAAG
ACTGGCCTTTGCAGATGGAGAAAATTTGTATGTGTGCATCCGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
CGCAGAACTATCAAAACTCAAATAATAATCAGCAAAAACCCAACAAATAAAATCTAAAATTATAAGAGAGAGAGAGAGAAAAAAAAAAAAAAGAGAGAGAGAAAGAGAGA
CAGAGGTTGAAAGACAGAGAAAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCGAGCCATTCAGCTTCAATTTCTCTTCAATTCGTCAAGTTCGTCAAAGGT
CAGAGCTCAAGTAATCCATATACCCTTGTTGTTCAAATGCTATTGAGCTTAGTTGGCGAGATTCAATGTGCTTCATATCTCTGCAATTTGTGCTCTGTTCCCTCTACTTA
ACTTCGCGCAACTCACTGATCTCTTTCTTCGACTACGAAATTTCGTGTTTCGAACTTCTGAGCTTGTGAATTCTGCTGAAAGAATGAGGGGTTTGTATTGGTTCGTCGAC
CCAATACGCGTTTTCGTTAGTTGTTGAAAAATTTTGCGTGTTCTGAACTCGGGTTTCGTTGTTCGAAGTTAGGGTTTTGATTTGCGATTGATTTGCAATAGGTTTGATTT
GGTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTCTAGGGTTGTGGTTTTGAGAAGTTGCGATTTGGAGTTGCATTTTGTGCTTTGTAGTTGGG
TAAAAGAAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGTTGAAATTCATTTCGGGATTTCCTTGATCTGGTGGGGTGGATTTTAGGAACTTGCTTAAATGATGCTG
ACCTTGAAAATTAATCTGCATTGATTGATATATGTGGGAGTGAGAACTTTGTGGTTATTTTTTTCTGTTTTTTTTGGTGAAGTTGAATAGTGATTGTGTTCCTCTTCCTG
TTTGGAATTGAGTGATTTTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGAGCTTGAGAATTGATTTGGGTGAATTGAAATCTCAGATAGTGAAG
AAGCTTGGAACCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCGAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAG
GGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCACCAATAGCTGTAGCAGGCTATCCGAAAACTTCGGCGC
AATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGTGGAGCTGCTTTTCCTACTCCCACTCAAAATATTCCTATATGGTCTAATGGAGGTTTTCCAATG
TCTCCGAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAGGACAGGCCGAGTCCACTCGGACCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGTAGGCAA
GGAAGATGGCAGCTACAAAATCACGATGGACAATGGCGATGCAGCTCTTTGTGACTATCAAAGGCCAGTGCAGCATCTGCAAGGAGTTGCTGAACTGCCTGAAAACAATG
TTGAGGCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTCCTACAGAATAAGATCCAGGTTGAGGGAACCAAGGTTGAAGACAGGGAAGAACCGGGACAGTCAAATCAGTCG
TGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCCCGCAAAGCGAGGCCTGTGGATGGTGGGGGTGATTTTAGCTT
TAGTGATATTGGTCATTTGTTGGATACTGAGTCATTGAGACGACGCATGGAACAAATTGCTACAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAGTATTTTGA
ATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATATGAGCCCGAGAAACCTCTTGCCCACAAGCAGCAGATT
CAGGGGAAAGTTATCAATGGCATGTTGCCAAATAATCAATTACATGGGCGATATAGCAATGGAAATGGAGAAGCTATGCACGAACACAGATTACAGTGCTCGATATCGTT
GCTTGACTTCAAGGTAGCTATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCAGATGGAGAAAATTTGTATGTGTGCATCCGAGGAATGAAACAACTCTG
ATATATCTATTTATCCCATTCCACAGTTTGATTGCCCCATCATAGTTCAAAGGGGTCGAGAAGGGATCTCAAAGACCTCTCAGCTATGCTCATTTGTGTGAGTAAACAGA
TTGCTCAAAACTTGATAAAAGCTCTGGCTTGCAAAGATGACAAGCAGGTCAAATATCAAAATCGCATCATTTAACTGTGACTAAGCTTGGGCGCTGTCGGTCAGAATGCC
CATCGGTGATCGATCTACATTTCGCATCTAAACTGGCTGTAGCCGCGATTCAAGGATCTCAAGCTTTTCATGTAAATTTAGATCACTGAGGAAATATACAAAATTTTGCA
GAGATACAGTTTGTGTAGCTTGGTGGGAGGTAATTACTGGTGCAGGAGCGTAGTGTTGCTCTTTGTGTATTTAAATCTGAGAATTGATTTGAATTTCATATGAATAACTC
ATCAACAAACAACAATCTGCAGATAGTATCCTCTGTATTTCATGCTTACTCTTTTTTTTTTTTTTTTAGATAGAAAGATGCAGATGCTATAAAATGATATAGCATTTCTA
TTTAGAAAGATCTGTGTTCAAAATGTTCCACATTTGTAATGCAATAGTCAGCCA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGG
AAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNK
IQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQGLGSVSADCASILNKVLDVYLKQLIRSCVD
LVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE